T

Seth 5b28002001 0.6.0 training session: Oracle Bot, RL combat, Mind's Eye, multilingual pipeline

Major changes from this session:

Training:
- 0.6.0 training running: 9B on steel141 3090 Ti, 27B on rented H100 NVL
- 7,256 merged training examples (up from 3,183)
- New training data: failure modes (85), midloop messaging (27),
  prompt injection defense (29), personality (32), gold from quarantine
  bank (232), new tool examples (30), claude's own experience (10)
- All training data RCON-validated at 100% pass rate
- Bake-off: gemma3:27b 66%, qwen3.5:27b 61%, translategemma:27b 56%

Oracle Bot (Mind's Eye):
- Invisible spectator bot (mineflayer) streams world state via WebSocket
- HTML5 Canvas frontend at mind.mortdec.ai
- Real-time tool trace visualization with expandable entries
- Streaming model tokens during inference
- Gateway integration: fire-and-forget POST /trace on every tool call

Reinforcement Learning:
- Gymnasium environment wrapping mineflayer bot (minecraft_env.py)
- PPO training via Stable Baselines3 (10K param policy network)
- Behavioral cloning pretraining (97.5% accuracy on expert policy)
- Infinite training loop with auto-restart and checkpoint resume
- Bot learns combat, survival, navigation from raw experience

Bot Army:
- 8-soldier marching formation with autonomous combat
- Combat bots using mineflayer-pvp, pathfinder, armor-manager
- Multilingual prayer bots via translategemma:27b (18 languages)
- Frame-based AI architecture: LLM planner + reactive micro-scripts

Infrastructure:
- Fixed mattpc.sethpc.xyz billing gateway (API key + player list parser)
- Billing gateway now tracks all LAN traffic (LAN auto-auth)
- Gateway fallback for empty god-mode responses
- Updated mortdec.ai landing page

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

2026-03-22 20:22:50 -04:00

.claude/good-times

GPU scheduler, 14-tool architecture, plugin deployment, event dispatcher

2026-03-21 03:14:45 -04:00

agent

22-tool architecture: log.query, user.ask, journal system deployed

2026-03-21 21:04:01 -04:00

branding

Fix chart labels, add version history table to README

2026-03-21 15:48:35 -04:00

data

0.6.0 training session: Oracle Bot, RL combat, Mind's Eye, multilingual pipeline

2026-03-22 20:22:50 -04:00

docs/superpowers

0.6.0 training session: Oracle Bot, RL combat, Mind's Eye, multilingual pipeline

2026-03-22 20:22:50 -04:00

eval

0.6.0 training session: Oracle Bot, RL combat, Mind's Eye, multilingual pipeline

2026-03-22 20:22:50 -04:00

good-times

Qwen3.5-9B on prod, Gemini 2.5 Flash for dev, error correction, branding

2026-03-19 23:09:27 -04:00

ingame

0.6.0 training session: Oracle Bot, RL combat, Mind's Eye, multilingual pipeline

2026-03-22 20:22:50 -04:00

knowledge

Minecraft knowledge corpus, recipe trees, GitHub scraper, 644 examples

2026-03-18 20:33:09 -04:00

oracle-bot

0.6.0 training session: Oracle Bot, RL combat, Mind's Eye, multilingual pipeline

2026-03-22 20:22:50 -04:00

scripts

1200+ distilled gold examples, journal system, redstone mastery, safety awareness

2026-03-21 20:50:52 -04:00

training

0.6.0 training session: Oracle Bot, RL combat, Mind's Eye, multilingual pipeline

2026-03-22 20:22:50 -04:00

USER_NOTES_IGNORE_ME

22-tool architecture: log.query, user.ask, journal system deployed

2026-03-21 21:04:01 -04:00

web

1200+ distilled gold examples, journal system, redstone mastery, safety awareness

2026-03-21 20:50:52 -04:00

.gitignore

GPU scheduler, 14-tool architecture, plugin deployment, event dispatcher

2026-03-21 03:14:45 -04:00

CHAT-IDEA.md

1200+ distilled gold examples, journal system, redstone mastery, safety awareness

2026-03-21 20:50:52 -04:00

CLAUDE.md

0.6.0 training session: Oracle Bot, RL combat, Mind's Eye, multilingual pipeline

2026-03-22 20:22:50 -04:00

CONTRIBUTING.md

Add LICENSE, MODEL_CARD, requirements, CONTRIBUTING

2026-03-20 21:43:21 -04:00

create_form.py

GPU scheduler, 14-tool architecture, plugin deployment, event dispatcher

2026-03-21 03:14:45 -04:00

FRIENDS_INVITE_DISCORD.md

Three-tier constraint model, mode-aware eval, boundary examples, playtest tooling

2026-03-18 15:57:01 -04:00

FRIENDS_INVITE.md

Three-tier constraint model, mode-aware eval, boundary examples, playtest tooling

2026-03-18 15:57:01 -04:00

good times rg.ttf

Qwen3.5-9B on prod, Gemini 2.5 Flash for dev, error correction, branding

2026-03-19 23:09:27 -04:00

IDEA.md

Initial project scaffold: dataset schema, 31 seed training examples, Mineflayer bot framework, and 7-phase roadmap

2026-03-18 01:51:28 -04:00

LICENSE

Add LICENSE, MODEL_CARD, requirements, CONTRIBUTING

2026-03-20 21:43:21 -04:00

MODEL_CARD.md

0.5.0 bake-off results, knowledge lookup tools, training progress chart

2026-03-21 15:28:09 -04:00

PLAN.md

Semver rename: v1-v5 → 0.1.0-0.5.0 across all files

2026-03-20 21:37:14 -04:00

POS_PRINT.md

GPU scheduler, 14-tool architecture, plugin deployment, event dispatcher

2026-03-21 03:14:45 -04:00

README.md

Fix chart labels, add version history table to README

2026-03-21 15:48:35 -04:00

REDDIT_EVAL_INVITE.md

Phase 2: eval harness, 182 examples, live bake-off, playtest infrastructure

2026-03-18 13:38:12 -04:00

REDDIT_MODMAIL.md

Phase 2: eval harness, 182 examples, live bake-off, playtest infrastructure

2026-03-18 13:38:12 -04:00

requirements-training.txt

Add LICENSE, MODEL_CARD, requirements, CONTRIBUTING

2026-03-20 21:43:21 -04:00

requirements.txt

Add LICENSE, MODEL_CARD, requirements, CONTRIBUTING

2026-03-20 21:43:21 -04:00

runpod.io-api-key.md

1200+ distilled gold examples, journal system, redstone mastery, safety awareness

2026-03-21 20:50:52 -04:00

SESSION.default.md

Initial project scaffold: dataset schema, 31 seed training examples, Mineflayer bot framework, and 7-phase roadmap

2026-03-18 01:51:28 -04:00

vast.ai-api-key.md

1200+ distilled gold examples, journal system, redstone mastery, safety awareness

2026-03-21 20:50:52 -04:00

whitelist.sh

Three-tier constraint model, mode-aware eval, boundary examples, playtest tooling

2026-03-18 15:57:01 -04:00

README.md

Mortdecai

A 9B parameter language model fine-tuned for Minecraft server operations. Translates natural language to commands, controls an AI God character, manages plugins, writes mcfunction scripts, and learns from its mistakes.

Base model: Qwen3.5-9B | Current version: 0.5.0 | Quantization: Q4_K_M (5.6GB)

Training Progress

Version	Base Model	Training Examples	Loss	Key Addition
0.1.0	Qwen3-8B	500	2.10	Seed data only
0.2.0	Qwen3-8B	1,200	1.45	+entities, +mobs
0.3.0	Qwen3-8B	2,100	0.82	+error correction
0.4.0	Qwen3.5-9B	3,175	0.35	+tool-calling, base model upgrade
0.5.0	Qwen3.5-9B	4,358	0.16	+plugins, +memory, +scripts

Bake-off: 0.5.0 vs 0.4.0

Category	0.4.0	0.5.0	Change
Enchantments	20%	67%	+47%
EssentialsX	0%	60%	+60%
Effects	0%	25%	+25%
Basic commands	75%	75%	—
Teleport	100%	100%	—
Overall	45.2%	46.8%	+1.6%

Architecture

17 tools across 5 categories:

Category	Tools
Execution	`rcon.execute`
Knowledge	`minecraft.wiki_lookup`, `plugin.docs_lookup`, `minecraft.changelog_lookup`, `paper.docs_lookup`
World	`world.player_info`, `world.server_state`, `world.nearby_entities`
Memory	`memory.read`, `memory.write`
Scripts	`script.write`, `script.validate`, `script.execute`, `script.read`, `script.list`, `script.delete`, `script.schedule`

Plugins: FastAsyncWorldEdit, WorldGuard, CoreProtect, EssentialsX, Vault, LuckPerms

Training Data

~20,000+ examples from:

Hand-curated seed data (3,196)
Tool-calling sequences with 17 tools (1,430)
IGLU build dataset — Microsoft Research (4,656)
RCON-validated plugin examples (104)
Exploration self-play with wiki grounding (150)
Self-play across 3 GPUs (2,900+)
Live server audit from wolf bots + real players (8,000+)

Infrastructure

GPU	Role
RTX 3090 Ti (24GB)	Training + self-play
RTX 2080 Ti (11GB)	Exploration self-play
Quadro RTX 4000 (8GB)	Production inference — 3 MC servers
GTX 1660 Super (6GB)	Prompt generation

GPU Scheduler: gpu.sethpc.xyz — preset-based job scheduler with live monitoring

README.md

Mortdecai

Training Progress

Bake-off: 0.5.0 vs 0.4.0

Architecture

Training Data

Infrastructure

Links