GPU scheduler, 14-tool architecture, plugin deployment, event dispatcher

GPU Scheduler (gpu.sethpc.xyz):
- Live dashboard with 4 GPUs, training monitor, loss sparklines
- Preset-based job scheduler with 3 triggers (time, finish_training, cost)
- Model selection per GPU, pipeline configuration
- Tool self-play and training pipeline types
- Behind Google OAuth, live-refresh without page reload

Tool Architecture (14 tools):
- 3 new tools: world.nearby_entities, memory.read, memory.write
- 7 script.* tools: write, validate, execute, read, list, delete, schedule
- ScriptManager: full mcfunction datapack CRUD with RCON validation
- Training data: 1,430 tool examples (up from 1,159)

Plugin Deployment (paper-ai-25567):
- WorldGuard 7.0.12, CoreProtect CE 23.1, EssentialsX 2.21.2, Vault 1.7.3
- Fresh greenfield world reset
- 104 RCON-validated plugin training examples

Event Dispatcher:
- Watches server log for deaths, joins, advancements, PvP kills
- Configurable trigger probability and cooldowns per event type
- Deployed to dev server, fires god_system prompts on events
- 21 event-response training examples

Training Infrastructure:
- train_lora.py: --save-steps 50, --resume from checkpoint
- run_training.sh: stops Ollama, activates conda, restarts after
- Passwordless sudo for ollama services on steel141
- Dev server added to MCSManager with autoStart

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
This commit is contained in:
Mortdecai
2026-03-21 03:14:45 -04:00
parent 434589d098
commit da8f557219
34 changed files with 7822 additions and 2 deletions
+59
View File
@@ -0,0 +1,59 @@
{
"a78c48c1": {
"id": "a78c48c1",
"name": "Everyone Test",
"pipeline": "self_play",
"gpus": [
"3090ti",
"2080ti",
"rtx4000"
],
"params": {
"model": "mortdecai:0.4.0",
"tiers": "1,2,3",
"rounds_per_tier": "50",
"rcon_host": "192.168.0.244",
"rcon_port": "25578",
"rcon_pass": "REDACTED_RCON"
},
"created_at": "2026-03-21T01:18:14Z"
},
"06356764": {
"id": "06356764",
"name": "Infer during training",
"pipeline": "self_play",
"gpus": [
"rtx4000"
],
"params": {
"model": "mortdecai:0.4.0",
"tiers": "1,2,3",
"rounds_per_tier": "50",
"rcon_host": "192.168.0.244",
"rcon_port": "25578",
"rcon_pass": "REDACTED_RCON"
},
"created_at": "2026-03-21T01:37:31Z"
},
"9cc95c0a": {
"id": "9cc95c0a",
"name": "Train",
"pipeline": "training",
"gpus": [
"3090ti",
"2080ti"
],
"params": {
"base_model": "Qwen/Qwen3.5-9B",
"dataset": "auto",
"output_name": "mortdecai-0.5.0",
"epochs": "1",
"lr": "0.0001",
"batch_size": "2",
"grad_accum": "4",
"max_seq_len": "2048",
"save_steps": "50"
},
"created_at": "2026-03-21T01:37:57Z"
}
}