Commit graph

15 commits

Author SHA1 Message Date
682a958c28 fix(avocet): strip HTML from email bodies — stdlib HTMLParser, no deps 2026-03-03 16:28:18 -08:00
4a76f6ba41 fix(avocet): undo — commit-then-clear order, empty-records guard, skip dedup, stronger test 2026-03-03 15:41:58 -08:00
80a8195899 feat(avocet): discard, undo, labels config, static serving — backend complete 2026-03-03 15:35:01 -08:00
f0e9886ab2 feat(avocet): POST /api/skip endpoint 2026-03-03 15:21:32 -08:00
816c0e3bb2 fix(avocet): store original item in _last_action; add requirements.txt 2026-03-03 15:16:54 -08:00
ff27053aa9 feat(avocet): POST /api/label endpoint 2026-03-03 15:14:04 -08:00
9abae0478c feat(avocet): GET /api/queue endpoint 2026-03-03 15:00:59 -08:00
c8aea3c39f fix(avocet): _write_jsonl empty-list writes empty file; add reset_last_action helper 2026-03-03 14:36:18 -08:00
ffd1450f62 feat(avocet): FastAPI skeleton + JSONL helpers 2026-03-03 13:30:28 -08:00
9133cadd66 feat: discard button — removes email from queue without writing to score file 2026-02-27 15:48:40 -08:00
a26a21e71d feat: targeted fetch — date range + sender/subject filter for historical email pulls 2026-02-27 15:15:49 -08:00
fd476e4199 feat: 9 labels (add event_rescheduled/unrelated/digest), wildcard Other label, InvalidCharacterError fix 2026-02-27 14:34:15 -08:00
66f69ff9a2 fix: fetch log — overwrite per-email progress instead of appending
status.write() per email grows the log unboundedly on big pulls.
Now uses status.empty() to create one updatable slot; per-email
progress overwrites it, cleared after each account completes.
Per-account summaries still use status.write() (one line each).
2026-02-27 14:20:57 -08:00
2824fca70c feat: add Settings tab with IMAP account GUI + connection test
- ⚙️ Settings tab: add/edit/remove accounts without touching YAML
- Per-account: name, host, port, SSL, username, password (masked), folder, days back
- Test connection button: connect → login → select folder → report message count
- Save writes config/label_tool.yaml; Reload discards unsaved changes
- _sync_settings_to_state() prevents index-key drift on add/remove
- _test_imap_connection() helper shared with fetch tab indirectly
- CLAUDE.md: document new tab, Settings UI design notes
2026-02-27 14:18:51 -08:00
0e238a9e37 feat: initial avocet repo — email classifier training tool
Scrape → Store → Process pipeline for building email classifier
benchmark data across the CircuitForge menagerie.

- app/label_tool.py — Streamlit card-stack UI, multi-account IMAP fetch,
  6-bucket labeling, undo/skip, keyboard shortcuts (1-6/S/U)
- scripts/classifier_adapters.py — ZeroShotAdapter (+ two_pass),
  GLiClassAdapter, RerankerAdapter; ABC with lazy model loading
- scripts/benchmark_classifier.py — 13-model registry, --score,
  --compare, --list-models, --export-db; uses label_tool.yaml for IMAP
- tests/ — 20 tests, all passing, zero model downloads required
- config/label_tool.yaml.example — multi-account IMAP template
- data/email_score.jsonl.example — sample labeled data for CI

Labels: interview_scheduled, offer_received, rejected,
        positive_response, survey_received, neutral
2026-02-27 14:07:38 -08:00