feat: evaluate Agent-ModernColBERT as semantic upgrade to FTS5 log search #18
Labels
No labels
compliance
demo
deployment
docs
enhancement
parser
patterns
performance
security
ux
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference: Circuit-Forge/turnstone#18
Loading…
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Background
Turnstone currently uses SQLite FTS5 with Porter stemming for log retrieval (
app/services/search.py). This is fast and zero-dependency, but purely keyword-based — it cannot match:These semantic gaps are exactly what makes homelab diagnosis hard.
Proposed upgrade
lightonai/Agent-ModernColBERT— late-interaction token-level retriever designed for agentic/multi-hop queries. Would sit alongside (not replace) FTS5:llm.pyModel registered in cf-orch model registry as
agent-moderncolbert(~800MB VRAM).What to evaluate
/devl/storage budget?Implementation sketch (if evaluation passes)
pylate(LightOn ColBERT library) to dependenciesapp/services/search.py: addcolbert_search(query, db_path)alongsidefts_search()app/services/diagnose.py(or wherever search is called): merge FTS5 + ColBERT results, deduplicate, pass merged list tollm.summarize()turnstone.log_retrievetoassignments.yamlin cf-orchFTS5 stays
Do not remove FTS5 — it handles exact log-level/source filters efficiently and is the right tool for structured field queries. ColBERT is additive.
Related
agent-moderncolbert(already registered)app/services/search.py(FTS5 implementation)app/services/diagnose.py,app/services/llm.py