Switch Ollama to gemma3n:e4b on node-197 GPU

Bake-off results: gemma3n:e4b (80.6% cmd match, 100% safety, 5.9s) outperforms qwen3-coder:30b (67.7%, 93.5%, 14.7s) on all metrics. Moved from steel141 CPU inference to node-197 RTX 4000 GPU. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-03-18 10:23:55 -04:00
parent 0ed3a512a2
commit 31be504f69
2 changed files with 12 additions and 3 deletions
@@ -5,9 +5,9 @@
  "rcon_host": "127.0.0.1",
  "rcon_port": 25576,
  "rcon_password": "REDACTED_RCON",
-  "ollama_url": "http://192.168.0.141:11434",
-  "model": "gemma3:12b",
-  "command_model": "qwen3-coder:30b",
+  "ollama_url": "http://192.168.0.179:11434",
+  "model": "gemma3n:e4b",
+  "command_model": "gemma3n:e4b",
  "temperature": 0.85,
  "max_tokens": 600,
  "cooldown_seconds": 20,