v3 model trained (1,308 examples, loss 0.55), API cascade, context update

v3 training:
- 1,308 examples: curated + Claude-distilled + bot audit + recipes + command ref
- 1 epoch, rank 16, LR 1e-4, loss 0.55 (sweet spot)
- GGUF Q4_K_M exported, loaded in Ollama as qwen3-8b-mc-lora-v3
- Correct commands, no Chinese, proper safety refusals, dramatic God persona

API cascade for dev server:
- Stage 1: Claude Haiku ($20 budget, ~$11 spent)
- Stage 2: Gemini 2.5 Flash Lite ($20 budget)
- Stage 3: qwen3-8b-mc-lora-v3 (free, local)
- Gemini call function with persistent cost tracking
- Full status report printed at each $1 milestone

Data collection: 2,677 dev audit entries and growing
Bot status printer budget display fix

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

This commit is contained in:

Seth Freiberg

2026-03-19 04:52:04 -04:00

parent 234f2722db

commit e780aef8c6

5 changed files with 1130 additions and 0 deletions

data/raw/scraped_github.jsonl

View File