5dee23f53c
fix(avocet): reduce deberta-small VRAM + auto-select freest GPU for training
...
- deberta-small: batch_size 16→8 + grad_accum 1→2 (same effective batch),
gradient_checkpointing=True (fp16 stays off: DeBERTa v3 disentangled
attention overflows fp16 at the gather step)
- api: _best_cuda_device() picks highest free-VRAM GPU via nvidia-smi;
sets CUDA_VISIBLE_DEVICES in subprocess env to prevent DataParallel
replication across both GPUs; adds PYTORCH_ALLOC_CONF=expandable_segments
- SSE log now reports which GPU was selected
2026-03-15 17:09:06 -07:00
60fe1231ce
fix(avocet): _MODELS_DIR overridable in tests; sanitize score paths against path traversal
2026-03-15 16:07:27 -07:00
ef8adfb035
feat(avocet): add /api/finetune/status and /api/finetune/run endpoints
2026-03-15 16:04:34 -07:00
8c22dd62de
feat(avocet): benchmark UI, label fixes, BenchmarkView with charts and SSE run
2026-03-15 09:39:37 -07:00
f38c73db97
feat: add GET /api/fetch/stream SSE endpoint for real-time IMAP progress
2026-03-04 12:05:23 -08:00
965362f5e3
feat: add POST /api/accounts/test endpoint
2026-03-04 12:04:42 -08:00
f64be8bbe0
feat: add GET /api/stats and GET /api/stats/download endpoints
2026-03-04 12:04:11 -08:00
c5a74d3821
feat: add GET/POST /api/config endpoints for IMAP account management
2026-03-04 12:03:40 -08:00
1d1f25641b
feat: extract IMAP logic to app/imap_fetch.py for reuse by API
2026-03-04 11:42:22 -08:00
82eeb4defc
fix: prevent blank page on rebuild and queue drain on skip/discard
...
Two bugs fixed:
1. Blank white page after vue SPA rebuild: browsers cached old index.html
referencing old asset hashes. Assets are deleted on rebuild, causing
404s for JS/CSS -> blank page. Fix: serve index.html with
Cache-Control: no-cache so browsers always fetch fresh HTML.
Hashed assets (/assets/chunk-abc123.js) remain cacheable forever.
2. Queue draining to empty on skip/discard: handleSkip and handleDiscard
never refilled the local queue buffer. After enough skips, store.current
went null and the empty state showed (blank-looking). Fix: both handlers
now call fetchBatch() when queue drops below 3, matching handleLabel.
Also: sync classifier_adapters LABELS to match current 10-label schema
(new_lead + hired, remove unrelated).
48 Python tests pass, 48 frontend tests pass.
2026-03-03 19:26:34 -08:00
b54b2a711e
fix(avocet): normalize queue schema + bind to 0.0.0.0 for LAN access
...
- Add _item_id() (content hash) + _normalize() to map legacy JSONL fields
(from_addr/account/no-id) to Vue schema (from/source/id)
- All mutating endpoints now look up by _normalize(x)[id] — handles both
stored-id (test fixtures) and content-hash (real data) transparently
- Change uvicorn bind from 127.0.0.1 to 0.0.0.0 so LAN clients can connect
2026-03-03 18:43:00 -08:00
682a958c28
fix(avocet): strip HTML from email bodies — stdlib HTMLParser, no deps
2026-03-03 16:28:18 -08:00
4a76f6ba41
fix(avocet): undo — commit-then-clear order, empty-records guard, skip dedup, stronger test
2026-03-03 15:41:58 -08:00
80a8195899
feat(avocet): discard, undo, labels config, static serving — backend complete
2026-03-03 15:35:01 -08:00
f0e9886ab2
feat(avocet): POST /api/skip endpoint
2026-03-03 15:21:32 -08:00
816c0e3bb2
fix(avocet): store original item in _last_action; add requirements.txt
2026-03-03 15:16:54 -08:00
ff27053aa9
feat(avocet): POST /api/label endpoint
2026-03-03 15:14:04 -08:00
9abae0478c
feat(avocet): GET /api/queue endpoint
2026-03-03 15:00:59 -08:00
c8aea3c39f
fix(avocet): _write_jsonl empty-list writes empty file; add reset_last_action helper
2026-03-03 14:36:18 -08:00
ffd1450f62
feat(avocet): FastAPI skeleton + JSONL helpers
2026-03-03 13:30:28 -08:00
9133cadd66
feat: discard button — removes email from queue without writing to score file
2026-02-27 15:48:40 -08:00
a26a21e71d
feat: targeted fetch — date range + sender/subject filter for historical email pulls
2026-02-27 15:15:49 -08:00
fd476e4199
feat: 9 labels (add event_rescheduled/unrelated/digest), wildcard Other label, InvalidCharacterError fix
2026-02-27 14:34:15 -08:00
66f69ff9a2
fix: fetch log — overwrite per-email progress instead of appending
...
status.write() per email grows the log unboundedly on big pulls.
Now uses status.empty() to create one updatable slot; per-email
progress overwrites it, cleared after each account completes.
Per-account summaries still use status.write() (one line each).
2026-02-27 14:20:57 -08:00
2824fca70c
feat: add Settings tab with IMAP account GUI + connection test
...
- ⚙️ Settings tab: add/edit/remove accounts without touching YAML
- Per-account: name, host, port, SSL, username, password (masked), folder, days back
- Test connection button: connect → login → select folder → report message count
- Save writes config/label_tool.yaml; Reload discards unsaved changes
- _sync_settings_to_state() prevents index-key drift on add/remove
- _test_imap_connection() helper shared with fetch tab indirectly
- CLAUDE.md: document new tab, Settings UI design notes
2026-02-27 14:18:51 -08:00
0e238a9e37
feat: initial avocet repo — email classifier training tool
...
Scrape → Store → Process pipeline for building email classifier
benchmark data across the CircuitForge menagerie.
- app/label_tool.py — Streamlit card-stack UI, multi-account IMAP fetch,
6-bucket labeling, undo/skip, keyboard shortcuts (1-6/S/U)
- scripts/classifier_adapters.py — ZeroShotAdapter (+ two_pass),
GLiClassAdapter, RerankerAdapter; ABC with lazy model loading
- scripts/benchmark_classifier.py — 13-model registry, --score,
--compare, --list-models, --export-db; uses label_tool.yaml for IMAP
- tests/ — 20 tests, all passing, zero model downloads required
- config/label_tool.yaml.example — multi-account IMAP template
- data/email_score.jsonl.example — sample labeled data for CI
Labels: interview_scheduled, offer_received, rejected,
positive_response, survey_received, neutral
2026-02-27 14:07:38 -08:00