Circuit-Forge/kiwi - Forgejo: Beyond coding. We Forge.

Circuit-Forge/kiwi

Fork 0

Commit graph

Author	SHA1	Message	Date
pyr0ball	4ac24e7920	fix(recipe-scan): wire cf-docuvision OCR + LLMRouter for cloud recipe scanning (kiwi#136) Some checks are pending CI / Backend (Python) (push) Waiting to run Details CI / Frontend (Vue) (push) Waiting to run Details Mirror / mirror (push) Waiting to run Details Two-step pipeline: task_allocate("kiwi", "recipe_scan", service_hint="cf-docuvision") acquires a docuvision allocation, calls /extract per image to get OCR text, then LLMRouter structures the combined OCR output into recipe JSON via the text extraction prompt. Also fixes DocuvisionClient bugs: - POST field was "image" (ignored by Pydantic) — should be "image_b64" - Response read "text" key — docuvision returns "raw_text" - Add hint parameter (use "text" for recipe cards, dense prose) - Configurable timeout (default 120s; docuvision lazy-loads model on first request)	2026-05-16 14:21:15 -07:00
pyr0ball	22e57118df	feat: add DocuvisionClient + cf-docuvision fast-path for OCR Introduces a thin HTTP client for the cf-docuvision service and wires it as a fast path in VisionLanguageOCR.extract_receipt_data(). When CF_ORCH_URL is set, the pipeline attempts docuvision allocation via CFOrchClient before loading the heavy local VLM; falls back gracefully if unavailable.	2026-04-02 12:33:05 -07:00

Author

SHA1

Message

Date

pyr0ball

4ac24e7920

fix(recipe-scan): wire cf-docuvision OCR + LLMRouter for cloud recipe scanning (kiwi#136)

CI / Backend (Python) (push) Waiting to run

Details

CI / Frontend (Vue) (push) Waiting to run

Details

Mirror / mirror (push) Waiting to run

Details

Two-step pipeline: task_allocate("kiwi", "recipe_scan", service_hint="cf-docuvision")
acquires a docuvision allocation, calls /extract per image to get OCR text, then
LLMRouter structures the combined OCR output into recipe JSON via the text
extraction prompt.

Also fixes DocuvisionClient bugs:
- POST field was "image" (ignored by Pydantic) — should be "image_b64"
- Response read "text" key — docuvision returns "raw_text"
- Add hint parameter (use "text" for recipe cards, dense prose)
- Configurable timeout (default 120s; docuvision lazy-loads model on first request)

2026-05-16 14:21:15 -07:00

pyr0ball

22e57118df

feat: add DocuvisionClient + cf-docuvision fast-path for OCR

Introduces a thin HTTP client for the cf-docuvision service and wires it
as a fast path in VisionLanguageOCR.extract_receipt_data(). When CF_ORCH_URL
is set, the pipeline attempts docuvision allocation via CFOrchClient before
loading the heavy local VLM; falls back gracefully if unavailable.

2026-04-02 12:33:05 -07:00

2 commits