Add README with training progress chart and bake-off results
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
# Mortdecai

A 9B-parameter language model fine-tuned for Minecraft server operations. It translates natural language into commands, controls an AI God character, manages plugins, writes mcfunction scripts, and learns from its mistakes.

**Base model:** Qwen3.5-9B | **Current version:** 0.5.0 | **Quantization:** Q4_K_M (5.6GB)

## Training Progress

## Bake-off: 0.5.0 vs 0.4.0

| Category | 0.4.0 | 0.5.0 | Change (percentage points) |
|----------|-------|-------|----------------------------|
| Enchantments | 20% | 67% | **+47** |
| EssentialsX | 0% | 60% | **+60** |
| Effects | 0% | 25% | **+25** |
| Basic commands | 75% | 75% | 0 |
| Teleport | 100% | 100% | 0 |
| Overall | 45.2% | 46.8% | +1.6 |

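
The change figures in the bake-off table are differences between two percentage scores, so they read most naturally as percentage points. A minimal Python sketch that recomputes them from the table values follows; the `BAKE_OFF` layout and the `change_in_points` helper are illustrative names, not part of the project.

```python
# Scores copied from the bake-off table, in percent: (0.4.0, 0.5.0).
# The dictionary layout is an assumption made for this example.
BAKE_OFF = {
    "Enchantments": (20.0, 67.0),
    "EssentialsX": (0.0, 60.0),
    "Effects": (0.0, 25.0),
    "Basic commands": (75.0, 75.0),
    "Teleport": (100.0, 100.0),
    "Overall": (45.2, 46.8),
}


def change_in_points(old: float, new: float) -> float:
    """Return new - old in percentage points, rounded to one decimal."""
    return round(new - old, 1)


# Per-category change, keyed by the category name used in the table.
CHANGES = {name: change_in_points(old, new) for name, (old, new) in BAKE_OFF.items()}

if __name__ == "__main__":
    for name, delta in CHANGES.items():
        print(f"{name:<15} {delta:+.1f} pp")
```

The Overall row is not the unweighted mean of the five category rows shown (those average 39.0% and 65.4%), so it presumably covers additional categories or weights categories by test count.
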
## Architecture

**17 tools** across 5 categories:

| Category | Tools |
|----------|-------|
| Execution | `rcon.execute` |
| Knowledge | `minecraft.wiki_lookup`, `plugin.docs_lookup`, `minecraft.changelog_lookup`, `paper.docs_lookup` |
| World | `world.player_info`, `world.server_state`, `world.nearby_entities` |
| Memory | `memory.read`, `memory.write` |
| Scripts | `script.write`, `script.validate`, `script.execute`, `script.read`, `script.list`, `script.delete`, `script.schedule` |

**Plugins:** FastAsyncWorldEdit, WorldGuard, CoreProtect, EssentialsX, Vault, LuckPerms

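
To make the grouping concrete, here is a hedged sketch of the tool list as a Python mapping. The tool names and categories come from the Architecture table; the `TOOLS` structure and the `category_of` helper are hypothetical and do not describe how Mortdecai actually registers or dispatches tools.

```python
# Tool names and categories as listed in the Architecture table.
# The mapping itself is an illustrative assumption, not the project's code.
TOOLS = {
    "Execution": ["rcon.execute"],
    "Knowledge": [
        "minecraft.wiki_lookup",
        "plugin.docs_lookup",
        "minecraft.changelog_lookup",
        "paper.docs_lookup",
    ],
    "World": [
        "world.player_info",
        "world.server_state",
        "world.nearby_entities",
    ],
    "Memory": ["memory.read", "memory.write"],
    "Scripts": [
        "script.write",
        "script.validate",
        "script.execute",
        "script.read",
        "script.list",
        "script.delete",
        "script.schedule",
    ],
}


def category_of(tool_name: str) -> str:
    """Look up the category a tool belongs to (hypothetical helper)."""
    for category, names in TOOLS.items():
        if tool_name in names:
            return category
    raise KeyError(f"unknown tool: {tool_name}")


# Sanity check against the headline figure: 17 tools in 5 categories.
TOTAL_TOOLS = sum(len(names) for names in TOOLS.values())
```

The namespace prefix of a tool (`script.`, `world.`, `memory.`) matches its category except in the Knowledge group, which mixes the `minecraft.`, `plugin.`, and `paper.` prefixes, so an explicit mapping is safer than splitting on the dot.
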
## Training Data

More than 20,000 examples from:

- Hand-curated seed data (3,196)
- Tool-calling sequences with 17 tools (1,430)
- IGLU build dataset from Microsoft Research (4,656)
- RCON-validated plugin examples (104)
- Exploration self-play with wiki grounding (150)
- Self-play across 3 GPUs (2,900+)
- Live server audit from wolf bots + real players (8,000+)

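
The headline figure can be checked by adding up the per-source counts. The sketch below treats the two open-ended entries (2,900+ and 8,000+) as lower bounds, which is an assumption; the `SOURCES` mapping and `share` helper are illustrative names.

```python
# Per-source example counts copied from the Training Data list.
# "2,900+" and "8,000+" are treated as minimums (an assumption).
SOURCES = {
    "Hand-curated seed data": 3196,
    "Tool-calling sequences": 1430,
    "IGLU build dataset": 4656,
    "RCON-validated plugin examples": 104,
    "Exploration self-play": 150,
    "Self-play across 3 GPUs": 2900,
    "Live server audit": 8000,
}

# Lower bound on the dataset size implied by the list.
MINIMUM_TOTAL = sum(SOURCES.values())


def share(source: str) -> float:
    """Fraction of the lower-bound total contributed by one source."""
    return SOURCES[source] / MINIMUM_TOTAL


if __name__ == "__main__":
    print(f"at least {MINIMUM_TOTAL:,} examples")
    for name in SOURCES:
        print(f"{name:<32} {share(name):6.1%}")
```

Under these lower bounds the total is 20,436, and the live server audit is the largest single source at roughly 39% of the data.
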
## Infrastructure

| GPU | Role |
|-----|------|
| RTX 3090 Ti (24GB) | Training + self-play |
| RTX 2080 Ti (11GB) | Exploration self-play |
| Quadro RTX 4000 (8GB) | Production inference for 3 Minecraft servers |
| GTX 1660 Super (6GB) | Prompt generation |

**GPU Scheduler:** [gpu.sethpc.xyz](https://gpu.sethpc.xyz), a preset-based job scheduler with live monitoring

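
As a rough illustration of why the quantized model can be served from the 8GB card, the sketch below subtracts the 5.6GB Q4_K_M size quoted at the top of this README from each GPU's memory. It is a back-of-the-envelope estimate under stated assumptions: it ignores the KV cache, activations, and driver overhead, and it treats the quoted sizes as directly comparable. The names `GPUS_GB` and `headroom_gb` are hypothetical.

```python
# Q4_K_M size quoted in the README header, in GB.
MODEL_SIZE_GB = 5.6

# Memory per GPU from the Infrastructure table, in GB.
GPUS_GB = {
    "RTX 3090 Ti": 24,
    "RTX 2080 Ti": 11,
    "Quadro RTX 4000": 8,
    "GTX 1660 Super": 6,
}


def headroom_gb(vram_gb: float, model_gb: float = MODEL_SIZE_GB) -> float:
    """Memory left after loading the weights, ignoring all runtime overhead."""
    return round(vram_gb - model_gb, 1)


# Remaining memory per GPU under the simplifying assumptions above.
HEADROOM = {name: headroom_gb(vram) for name, vram in GPUS_GB.items()}

if __name__ == "__main__":
    for name, free in HEADROOM.items():
        print(f"{name:<16} {free:5.1f} GB left after weights")
```

By this estimate the Quadro RTX 4000 keeps about 2.4GB free for context and runtime buffers, while the GTX 1660 Super would have only about 0.4GB, which is consistent with it being assigned prompt generation instead of serving this model.
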
## Links

- **Play:** `minecraft.mortdec.ai`
- **Model card:** [MODEL_CARD.md](MODEL_CARD.md)
- **Domain:** [mortdec.ai](https://mortdec.ai)