snipe/app/llm
pyr0ball 1bf95bba2a feat(llm): migrate query_translator to cf-orch task endpoint for cloud, keep LLMRouter for local (#54)
QueryTranslator now supports two backends chosen at startup:
- CF_ORCH_URL set: allocate via POST /api/inference/task (product=snipe,
  task=query_translation), call the allocated cf-text service, release the
  slot in a finally block to guarantee the VRAM lease is freed.
- CF_ORCH_URL absent: existing LLMRouter path unchanged (ollama/vllm/api keys).

Also moves httpx from dev-only to main dependencies (already used by mcp/server.py).
2026-05-13 15:22:09 -07:00
..
__init__.py feat: SearchParamsResponse dataclass and JSON parser for LLM query builder 2026-04-14 11:40:44 -07:00
query_translator.py feat(llm): migrate query_translator to cf-orch task endpoint for cloud, keep LLMRouter for local (#54) 2026-05-13 15:22:09 -07:00
router.py feat: wire Search with AI to cf-orch → Ollama (llama3.1:8b) 2026-04-14 13:23:44 -07:00