feat: cf-text — direct text generation service module #41
Reference: Circuit-Forge/circuitforge-core#41
Summary
Scaffold cf-text as a shared circuitforge-core service module that provides direct access to text generation models without routing through ollama or vllm.
Motivation
Products like Peregrine (interview prep, cover letters, AI suggestions) and Kiwi (recipe suggestions, expiry advice) need text generation that:
cf-text service type with its own VRAM budget and max_concurrent limit
Proposed interface
Service profile fields (cf-orch)
max_mb: per-model (3B Q4 ≈ 2048, 7B Q4 ≈ 4096)
preferred_compute_cap: 7.5 minimum (INT8 tensor cores)
max_concurrent: 2–3 depending on node
shared: true (multiple products can share a running instance)
Consumers
peregrine: interview prep responses, cover letter generation, job match suggestions
kiwi: recipe suggestions, ingredient substitutions, expiry advice
Notes
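As a rough illustration of the service profile fields listed under "Service profile fields (cf-orch)", the sketch below models them as a Python dataclass and checks whether a node could host the service. The real cf-orch profile format is not shown in this issue, so the class name `TextServiceProfile`, the helper `fits_node`, and the default values are assumptions made only for illustration. Only the field names and the example numbers come from the issue.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TextServiceProfile:
    """Hypothetical container for the cf-text profile fields named in the issue."""
    max_mb: int                         # VRAM budget for the loaded model, in MB
    preferred_compute_cap: float = 7.5  # treated here as the minimum compute capability
    max_concurrent: int = 2             # simultaneous requests allowed on one instance
    shared: bool = True                 # multiple products may share a running instance


def fits_node(profile: TextServiceProfile, free_vram_mb: int, compute_cap: float) -> bool:
    """Return True if the node has enough free VRAM and compute capability.

    This is an assumed placement rule, not cf-orch's actual scheduling logic.
    """
    return (
        free_vram_mb >= profile.max_mb
        and compute_cap >= profile.preferred_compute_cap
    )


# Example budgets taken from the issue: 3B Q4 ≈ 2048 MB, 7B Q4 ≈ 4096 MB.
profile_3b = TextServiceProfile(max_mb=2048)
profile_7b = TextServiceProfile(max_mb=4096, max_concurrent=3)
```

With these assumptions, `fits_node(profile_3b, 3000, 8.6)` accepts a node with 3000 MB free, while the same node is rejected for `profile_7b` because its 4096 MB budget does not fit.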