Replace two-step docuvision OCR + LLM structuring pipeline with a single multimodal VLM call. The bartowski Qwen2-VL-7B-Instruct Q5_K_M GGUF is served by cf-text (llama.cpp) and accepts image_url content blocks identical to the OpenAI vision API format. Removes docuvision dependency for recipe scanning; the addict-missing / DeepseekVLV2Processor-missing cf-docuvision error no longer blocks scans. Receipt OCR (kiwi.ocr task) still routes to cf-docuvision separately. |
||
|---|---|---|
| .. | ||
| api | ||
| core | ||
| db | ||
| mcp | ||
| models | ||
| services | ||
| staples | ||
| static | ||
| styles | ||
| tasks | ||
| utils | ||
| __init__.py | ||
| cloud_session.py | ||
| main.py | ||
| tiers.py | ||