v3 model trained (1,308 examples, loss 0.55), API cascade, context update
v3 training: - 1,308 examples: curated + Claude-distilled + bot audit + recipes + command ref - 1 epoch, rank 16, LR 1e-4, loss 0.55 (sweet spot) - GGUF Q4_K_M exported, loaded in Ollama as qwen3-8b-mc-lora-v3 - Correct commands, no Chinese, proper safety refusals, dramatic God persona API cascade for dev server: - Stage 1: Claude Haiku ($20 budget, ~$11 spent) - Stage 2: Gemini 2.5 Flash Lite ($20 budget) - Stage 3: qwen3-8b-mc-lora-v3 (free, local) - Gemini call function with persistent cost tracking - Full status report printed at each $1 milestone Data collection: 2,677 dev audit entries and growing Bot status printer budget display fix Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
This commit is contained in:
Reference in New Issue
Block a user