Premium/ultra users with a custom_writing_model in their session are routed to that model as the first cf-orch candidate; all other tiers use the shared Qwen2.5-3B-Instruct base. complete_json() is unchanged since fine-tuned writing models aren't trained for structured output. Adds _request_tier and _request_writing_model ContextVars. Resolution order: USER_WRITING_MODELS env var (Monday path) then Heimdall meta (future path via peregrine#110). |
||
|---|---|---|
| .. | ||
| apps/backend/app | ||