Mortdecai

Author	SHA1	Message	Date
Mortdecai	d9acb653fe	Fix chart labels, add version history table to README Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-21 15:48:35 -04:00
Mortdecai	f5118505b1	0.5.0 bake-off results, knowledge lookup tools, training progress chart Bake-off (0.5.0 vs 0.4.0): - Overall: 46.8% vs 45.2% (+1.6%), 0 errors vs 2 - Enchantments: +47% (20% → 67%) - EssentialsX: +60% (0% → 60%) - Effects: +25% (0% → 25%) - Regressions: fill_build -67%, world -20% Knowledge Lookup Tools (4 new): - plugin.docs_lookup: WorldGuard, WorldEdit, CoreProtect, EssentialsX, LuckPerms docs - minecraft.changelog_lookup: version history from Minecraft Wiki - paper.docs_lookup: Paper server-specific documentation - Wired into gateway model-driven tool loop and exploration self-play Exploration Self-Play: - General (vanilla MC) and plugins focus modes - Wiki-grounded: model researches before acting, validates through RCON - 2,243 exploration examples generated, 150 kept after quality filtering Training Progress Chart: - SVG chart showing training examples and inverse loss across versions - Added to MODEL_CARD.md for Gitea display Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-21 15:28:09 -04:00
Mortdecai	da8f557219	GPU scheduler, 14-tool architecture, plugin deployment, event dispatcher GPU Scheduler (gpu.sethpc.xyz): - Live dashboard with 4 GPUs, training monitor, loss sparklines - Preset-based job scheduler with 3 triggers (time, finish_training, cost) - Model selection per GPU, pipeline configuration - Tool self-play and training pipeline types - Behind Google OAuth, live-refresh without page reload Tool Architecture (14 tools): - 3 new tools: world.nearby_entities, memory.read, memory.write - 7 script.* tools: write, validate, execute, read, list, delete, schedule - ScriptManager: full mcfunction datapack CRUD with RCON validation - Training data: 1,430 tool examples (up from 1,159) Plugin Deployment (paper-ai-25567): - WorldGuard 7.0.12, CoreProtect CE 23.1, EssentialsX 2.21.2, Vault 1.7.3 - Fresh greenfield world reset - 104 RCON-validated plugin training examples Event Dispatcher: - Watches server log for deaths, joins, advancements, PvP kills - Configurable trigger probability and cooldowns per event type - Deployed to dev server, fires god_system prompts on events - 21 event-response training examples Training Infrastructure: - train_lora.py: --save-steps 50, --resume from checkpoint - run_training.sh: stops Ollama, activates conda, restarts after - Passwordless sudo for ollama services on steel141 - Dev server added to MCSManager with autoStart Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-21 03:14:45 -04:00
Seth	f39809eaca	Semver rename: v1-v5 → 0.1.0-0.5.0 across all files Versioning scheme: semantic versioning (MAJOR.MINOR.PATCH) - 0.x.0 = pre-release development - 1.0.0 = first public/monetized release Renamed everywhere: PLAN.md, training scripts, self-play, overnight script, status printer, whitelist app, discord bot, all training data references. Ollama models retagged: mortdecai-v4 → mortdecai:0.4.0 Server configs updated on all three servers. Self-play restarted with new model name. Entity targeting + radius-aware kill + distance scale training added. Seed dataset: 2,503 + tool: 1,159 + self-play: 5,059 = 8,721 total examples Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-20 21:37:14 -04:00
Seth	027b835286	Session final: bakeoff fix, branding fonts, 3-GPU parallel self-play Current running state: - Prod: mortdecai-v4 on RTX 4000, single-call, error correction, fall protection - Dev: Gemini 3.1 Flash Lite (preview) + 5 bots generating training data - Bake-off: v4 running on steel141 (3090 Ti) - Self-play: ready for overnight — 3 GPUs parallel (3090 Ti + 2080 Ti + RTX 4000) Changes: - Bakeoff parser: strips think blocks, handles dict/list types - Branding fonts: Rajdhani-Bold (official), Exo2, Orbitron, Oxanium, SpaceGrotesk - Gemini 3.1 pricing added to cost tracker Active data collection: - Gemini 3.1 Flash Lite bots on dev ($20 budget, ~$4/hr with 5 bots) - Self-play overnight: 3 tiers × 3 GPUs = ~9x throughput - Training audit logging on all servers Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-20 00:56:45 -04:00
Seth	7a31e500e4	Qwen3.5-9B on prod, Gemini 2.5 Flash for dev, error correction, branding Prod deployment: - paper-ai and shrink-world switched from gemma3n:e4b to qwen3.5:9b - Error correction: detects RCON errors (<--[HERE]), asks model to fix, retries - Broadened error patterns: Unknown game mode, Unknown enchantment, etc. - Fixed fire fallback matching "firework" as fire intent - Fixed command format examples (WRONG vs RIGHT in prompt) - max_tokens bumped to 600 for command calls - Removed template workflow commands from sudo prompt Dev server: - Gemini 2.5 Flash ($0.15/$0.60 per M tokens) replaces Flash Lite - 10 bots for ~$1-1.5/hr training data generation - Dynamic pricing by model name in cost tracker Branding: - Rajdhani Bold as official Mortdecai font - Logo variants: mortdecai + mortdec.ai in 6 fonts - Whitelist page updated with Mortdecai branding + mortdec.ai domain Whitelist UUID fix: - Looks up real Mojang UUID via api.mojang.com - Patches all whitelist.json files directly - No more offline-mode UUID mismatches WorldEdit schematics: - 77 schematics installed (villages, bridges, lighthouses, parks, etc.) Mortdecai v4 training in progress: 63% complete on steel141 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-19 23:09:27 -04:00

6 Commits