Embedding model comparison harness #59
Loading…
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Goal
Add a way to benchmark and compare embedding model results within Avocet, so we can make informed decisions about which model to use across the menagerie (pagepiper, peregrine, etc).
Context
Pagepiper embeds page chunks with
nomic-embed-textat 1024 dimensions via Ollama. When evaluating a new model (e.g.mxbai-embed-large,all-minilm, etc.), there is currently no tooling to compare retrieval quality side-by-side. Avocet already has the label/benchmark infrastructure that makes it the natural home for this.Proposed scope
Acceptance criteria
Labels
enhancement,backlog