Implement cf-docuvision managed HTTP service (ByteDance/Dolphin-v2) #8
Reference: Circuit-Forge/circuitforge-core#8
Purpose
cf-docuvision is the document understanding inference service for the CF menagerie. It wraps ByteDance/Dolphin-v2 (Qwen2.5-VL-3B backbone, purpose-built document parser) behind an OpenAI-compatible or custom HTTP API so product apps (Kiwi, Falcon, Harrier, Ibis, Godwit) can call it without loading the model in-process.
Why Dolphin-v2
cf-docuvision VRAM slots in all GPU profiles
VRAM slots (already in profiles)
Tasks
cf-docuvision Docker image or process spec
transformers Dolphin-v2 inference
ByteDance/Dolphin-v2 (if vLLM supports Qwen2.5-VL chat template)
POST /extract → accepts {image_path|image_b64} → returns structured receipt/document JSON
managed spec to cf-docuvision in all GPU profiles
ServiceManager (DockerSpec or ProcessSpec)
receipt_data table)
Downstream consumers
Notes
cf-vision is intentionally separate: it is reserved for general multimodal tasks (classification, tracking, visual search)
/Library/Assets/LLM/ (path TBD; suggest docuvision/models/dolphin-v2)
Qwen2.5-VL-7B-Instruct as interim until this ships
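To make the POST /extract task above concrete, here is a minimal client-side sketch of how a product app might call it. It is not the agreed API. The issue only names the route and the two input alternatives (image_path or image_b64), so the host and port, the helper names build_extract_payload and extract, the rule that exactly one input is supplied, and the JSON response handling are all assumptions. The response schema is left untyped because the issue does not define it.

```python
import base64
import json
import urllib.request
from typing import Optional

# Assumption: placeholder host/port; the issue does not specify where the
# managed service listens.
DOCUVISION_URL = "http://localhost:8000/extract"


def build_extract_payload(image_path: Optional[str] = None,
                          image_bytes: Optional[bytes] = None) -> dict:
    """Build a JSON body for POST /extract (hypothetical helper).

    Exactly one of image_path or image_bytes must be given, mirroring the
    {image_path|image_b64} alternatives named in the task list.
    """
    if (image_path is None) == (image_bytes is None):
        raise ValueError("provide exactly one of image_path or image_bytes")
    if image_path is not None:
        # Assumption: the path is resolved on the service side.
        return {"image_path": image_path}
    return {"image_b64": base64.b64encode(image_bytes).decode("ascii")}


def extract(payload: dict, url: str = DOCUVISION_URL,
            timeout: float = 60.0) -> dict:
    """POST the payload and return the parsed JSON response (schema TBD)."""
    request = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

A caller would do something like extract(build_extract_payload(image_path="receipts/0001.png")). Keeping payload construction separate from the network call means the request shape can be unit-tested without a running service.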