Circuit-Forge/avocet

Avocet by Circuit Forge LLC — email classifier training tool: multi-account IMAP fetch, card-stack labeling UI, benchmark harness

pyr0ball 95afddb772 feat: add nodes.py scaffold with set_config_dir and router mount - Create app/nodes.py with _CONFIG_DIR testability seam, _load_config, _profiles_dir, _profile_path, _load_profile, _get_ollama_url helpers, and stub list_nodes endpoint returning [] when no coordinator_url is set - Mount nodes router at /api/nodes-mgmt in app/api.py - Add profiles_dir comment to config/label_tool.yaml.example cforch section - Create tests/test_nodes.py with autouse fixture and two passing tests		2026-05-05 19:35:28 -07:00
app	feat: add nodes.py scaffold with set_config_dir and router mount	2026-05-05 19:35:28 -07:00
config	feat: add nodes.py scaffold with set_config_dir and router mount	2026-05-05 19:35:28 -07:00
data	feat: initial avocet repo — email classifier training tool	2026-02-27 14:07:38 -08:00
scripts	feat(benchmark): wire EmbeddingKNNAdapter into MODEL_REGISTRY as embed-knn-nomic	2026-05-05 12:43:48 -07:00
tests	feat: add nodes.py scaffold with set_config_dir and router mount	2026-05-05 19:35:28 -07:00
web	feat: plans benchmark harness — model scoring for CF planning prompts	2026-05-02 23:36:04 -07:00
.env.example	feat: plans benchmark harness — model scoring for CF planning prompts	2026-05-02 23:36:04 -07:00
.gitignore	chore: gitignore .worktrees/ directory	2026-05-01 12:25:23 -07:00
environment.yml	feat(#10): env var LLM config + cf-orch coordinator auth	2026-04-09 12:26:44 -07:00
manage.sh	feat: plans benchmark harness — model scoring for CF planning prompts	2026-05-02 23:36:04 -07:00
PRIVACY.md	docs: add privacy policy reference	2026-03-05 20:59:37 -08:00
pytest.ini	feat: initial avocet repo — email classifier training tool	2026-02-27 14:07:38 -08:00
README.md	chore: add README + gather_corpus.py script	2026-04-24 15:29:26 -07:00
requirements.txt	feat(benchmark): wire EmbeddingKNNAdapter into MODEL_REGISTRY; add embed_model config	2026-05-05 14:05:45 -07:00

README.md

Avocet — Email Classifier Training Tool

Part of the CircuitForge LLC internal infrastructure suite.

Status: Internal beta — label tool and benchmark harness complete. Used to build training data for Peregrine's email classifier.

What it does

Avocet is the data pipeline for building and benchmarking email classifiers. It has two layers, a label tool and a benchmark harness, both described below.

No LLM required. Avocet uses zero-shot HuggingFace classification models — no API key, no cloud inference, no GPU required for the label tool. The benchmark harness can optionally export LLM-labeled emails from a Peregrine staging DB, but human labeling via the card-stack UI is the primary workflow.

Layer 1: Label tool. Card-stack UI for building ground-truth classifier benchmark data. Fetch emails from one or more IMAP accounts (with targeted date-range and sender/subject filters), review them card-by-card, and label each with a job-search category. Labeled output feeds the benchmark harness.

Layer 2: Benchmark harness. Scores HuggingFace zero-shot classification models against the labeled dataset. Supports slow/large model inclusion, visual side-by-side comparison on live emails, and export of LLM-labeled emails from a Peregrine staging DB.
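
The scoring step in Layer 2 can be pictured with a small, model-agnostic sketch. The README does not say which metrics the harness reports, so accuracy and macro-F1 here are assumptions, and `score_model` and `toy_classifier` are hypothetical names, not part of Avocet. Any callable that maps email text to a label (for example a wrapper around a zero-shot model) could be passed in.

```python
from collections import Counter
from typing import Callable, Dict, Iterable, Tuple


def score_model(
    classify: Callable[[str], str],
    examples: Iterable[Tuple[str, str]],
) -> Dict[str, float]:
    """Return accuracy and macro-F1 of `classify` over (text, gold_label) pairs."""
    tp, fp, fn = Counter(), Counter(), Counter()
    total = correct = 0
    for text, gold in examples:
        pred = classify(text)
        total += 1
        if pred == gold:
            correct += 1
            tp[gold] += 1
        else:
            fp[pred] += 1  # predicted label gets a false positive
            fn[gold] += 1  # gold label gets a false negative
    # Macro-F1 averages per-label F1 over every label seen as gold or predicted.
    labels = set(tp) | set(fp) | set(fn)
    f1_scores = []
    for label in labels:
        denom = 2 * tp[label] + fp[label] + fn[label]
        f1_scores.append(2 * tp[label] / denom if denom else 0.0)
    return {
        "accuracy": correct / total if total else 0.0,
        "macro_f1": sum(f1_scores) / len(f1_scores) if f1_scores else 0.0,
    }


def toy_classifier(text: str) -> str:
    """Keyword stand-in for a real model, used only to exercise the scorer."""
    lowered = text.lower()
    if "interview" in lowered:
        return "interview_scheduled"
    if "sorry" in lowered:
        return "rejected"
    return "unrelated"


labeled = [
    ("Your interview is on Monday", "interview_scheduled"),
    ("Sorry, we went with someone else", "rejected"),
    ("Thanks for reaching out", "neutral"),
    ("Sorry, the role is closed", "rejected"),
]
result = score_model(toy_classifier, labeled)
```

Ranking several models then amounts to calling the scorer once per model and sorting by the chosen metric.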

Labels

Label	Key
`interview_scheduled`	1
`offer_received`	2
`rejected`	3
`positive_response`	4
`survey_received`	5
`neutral`	6
`event_rescheduled`	7
`unrelated`	8
`digest`	9
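
If the Key column is read as the keyboard shortcut used in the card-stack UI (an assumption; the README does not say so explicitly), the table above maps naturally onto a lookup dictionary. `LABEL_KEYS` and `label_for_key` are hypothetical names used only for illustration.

```python
from typing import Optional

# Mirrors the Labels table: key pressed -> label written to the score file.
LABEL_KEYS = {
    "1": "interview_scheduled",
    "2": "offer_received",
    "3": "rejected",
    "4": "positive_response",
    "5": "survey_received",
    "6": "neutral",
    "7": "event_rescheduled",
    "8": "unrelated",
    "9": "digest",
}


def label_for_key(key: str) -> Optional[str]:
    """Return the label bound to `key`, or None for an unbound key."""
    return LABEL_KEYS.get(key)
```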

Stack

Layer	Tech
Label UI	Streamlit (port 8503, auto-increments on collision)
Benchmark	Python + HuggingFace Transformers
Email fetch	IMAP (multi-account, targeted date/sender/subject filter)
Data	JSONL (`data/email_label_queue.jsonl`, `data/email_score.jsonl`)
Config	`config/label_tool.yaml` (gitignored — see `.example`)
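
The "auto-increments on collision" behavior noted for the label UI port can be sketched as a simple probe loop. This is only an illustration of the idea in Python; how `manage.sh` actually picks the port is not shown in this README, and `first_free_port` is a hypothetical helper.

```python
import socket


def first_free_port(start: int = 8503, max_tries: int = 50, host: str = "127.0.0.1") -> int:
    """Return the first port at or above `start` that can be bound on `host`."""
    for port in range(start, min(start + max_tries, 65536)):
        with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
            try:
                sock.bind((host, port))
            except OSError:
                continue  # port is taken (or not bindable); try the next one
            return port
    raise RuntimeError(f"no free port found in {max_tries} tries from {start}")
```

A probe like this is inherently racy: another process can grab the port between the check and the real server start, so a launcher would normally retry on failure as well.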

Conda environments:

job-seeker — label tool UI
job-seeker-classifiers — benchmark harness (separate env for heavy deps)

Running

./manage.sh start              # start label tool UI (port collision-safe from 8503)
./manage.sh stop               # stop
./manage.sh restart            # restart
./manage.sh status             # show running state and port
./manage.sh logs               # tail label tool log
./manage.sh open               # open in browser

Benchmark:

./manage.sh benchmark --list-models    # list available zero-shot models
./manage.sh score                      # score models against labeled JSONL
./manage.sh score --include-slow       # include large/slow models
./manage.sh compare --limit 30         # visual comparison on live IMAP emails

Dev:

./manage.sh test               # run pytest suite

Data flow

IMAP accounts → fetch (targeted or wide) → email_label_queue.jsonl
→ label tool card UI → email_score.jsonl
→ benchmark harness → model rankings
→ best model → Peregrine classifier adapter

Targeted fetch: date range + sender/subject filter for pulling historical emails from specific senders or on specific topics without flooding the queue.
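
A targeted fetch of this kind corresponds to an IMAP SEARCH with date, sender, and subject criteria. The sketch below builds such criteria for the standard-library `imaplib` client; the function names and parameters are hypothetical, and Avocet's own fetch code may be structured differently.

```python
from datetime import date
from typing import List, Optional

# IMAP dates use English month abbreviations regardless of locale.
_MONTHS = ("Jan", "Feb", "Mar", "Apr", "May", "Jun",
           "Jul", "Aug", "Sep", "Oct", "Nov", "Dec")


def imap_date(d: date) -> str:
    """Format a date as IMAP expects, e.g. 05-Jan-2026."""
    return f"{d.day:02d}-{_MONTHS[d.month - 1]}-{d.year}"


def _quote(value: str) -> str:
    """Wrap a search string in double quotes, escaping backslashes and quotes."""
    escaped = value.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


def build_search_criteria(
    since: Optional[date] = None,
    before: Optional[date] = None,
    sender: Optional[str] = None,
    subject: Optional[str] = None,
) -> List[str]:
    """Build IMAP SEARCH criteria; with no filters, fall back to ALL (wide fetch)."""
    criteria: List[str] = []
    if since:
        criteria += ["SINCE", imap_date(since)]
    if before:
        criteria += ["BEFORE", imap_date(before)]
    if sender:
        criteria += ["FROM", _quote(sender)]
    if subject:
        criteria += ["SUBJECT", _quote(subject)]
    return criteria or ["ALL"]


# Usage with a connected imaplib.IMAP4_SSL instance `conn` (not run here):
#   conn.select("INBOX", readonly=True)
#   status, data = conn.search(None, *build_search_criteria(sender="jobs@example.com"))
```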

Discard: removes an email from the queue without writing to the score file — for emails that don't belong in the training set.
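
The difference between labeling and discarding can be sketched against two JSONL files. This is a minimal illustration, not Avocet's implementation: the helper names and the `label` field are assumptions, since the README does not document the record schema.

```python
import json
from pathlib import Path
from typing import Dict, List, Optional


def read_jsonl(path: Path) -> List[Dict]:
    """Read one JSON object per non-empty line; a missing file is an empty list."""
    if not path.exists():
        return []
    with path.open(encoding="utf-8") as fh:
        return [json.loads(line) for line in fh if line.strip()]


def write_jsonl(path: Path, records: List[Dict]) -> None:
    """Overwrite `path` with one JSON object per line."""
    with path.open("w", encoding="utf-8") as fh:
        for record in records:
            fh.write(json.dumps(record) + "\n")


def resolve_next(queue_path: Path, score_path: Path, label: Optional[str]) -> Optional[Dict]:
    """Take the first queued email; label it, or discard it when `label` is None.

    Labeling appends the email (plus a hypothetical "label" field) to the score
    file. Discarding only removes it from the queue and writes nothing else.
    """
    queue = read_jsonl(queue_path)
    if not queue:
        return None
    email, rest = queue[0], queue[1:]
    if label is not None:
        with score_path.open("a", encoding="utf-8") as fh:
            fh.write(json.dumps({**email, "label": label}) + "\n")
    write_jsonl(queue_path, rest)
    return email
```

Rewriting the whole queue on every action is simple but not safe against concurrent writers; it is used here only to keep the sketch short.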

Classifier adapters

app/classifier_adapters.py provides a common interface for swapping classifier backends. RerankerAdapter falls back to the label name when no LABEL_DESCRIPTIONS entry is configured for a label.
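
The adapter idea and the label-name fallback can be sketched as follows. Only the names `LABEL_DESCRIPTIONS` and `RerankerAdapter` come from this README; the base class, the `classify` signature, the description strings, and `ToyOverlapAdapter` are all hypothetical and stand in for whatever the real module defines.

```python
from abc import ABC, abstractmethod
from typing import Dict, List

# Hypothetical contents; the real mapping in Avocet is not shown in the README.
LABEL_DESCRIPTIONS: Dict[str, str] = {
    "rejected": "the application was rejected by the employer",
    "interview_scheduled": "an interview has been scheduled",
}


def describe(label: str) -> str:
    """Return the configured description, falling back to the label name itself."""
    return LABEL_DESCRIPTIONS.get(label, label)


class ClassifierAdapter(ABC):
    """Common interface so backends can be swapped without changing callers."""

    @abstractmethod
    def classify(self, text: str, labels: List[str]) -> str:
        """Return the best-matching label for `text`."""


class ToyOverlapAdapter(ClassifierAdapter):
    """Toy backend: picks the label whose description shares the most words with the text."""

    def classify(self, text: str, labels: List[str]) -> str:
        words = set(text.lower().split())
        # Ties resolve to the earliest label in `labels`.
        return max(labels, key=lambda lb: len(words & set(describe(lb).lower().split())))
```

Because callers depend only on `classify`, a zero-shot model, a reranker, or an embedding-based backend could each sit behind the same interface.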

License

BSL 1.1 — internal tool, not user-facing.