avocet

pyr0ball bce932461a	2026-05-02 23:36:04 -07:00
feat: plans benchmark harness — model scoring for CF planning prompts

Adds benchmark_plans.py script, plans_bench API router, PlansBenchTab Vue component, and registers /api/plans-bench in api.py. Also extends models registry (cf-text catalog integration), cforch client, LlmEvalTab, and ModelsView with cf-orch fleet support. Wires Planning mode into BenchmarkView.
__init__.py	feat: initial avocet repo — email classifier training tool	2026-02-27 14:07:38 -08:00
benchmark_classifier.py	fix(avocet): guard discover_finetuned_models against malformed/incomplete training_info.json	2026-03-15 15:18:13 -07:00
benchmark_plans.py	feat: plans benchmark harness — model scoring for CF planning prompts	2026-05-02 23:36:04 -07:00
benchmark_style.py	refactor(bench): extract benchmark tabs — classifier, compare, llm-eval, style, voice	2026-04-24 14:56:17 -07:00
benchmark_voice.py	refactor(bench): extract benchmark tabs — classifier, compare, llm-eval, style, voice	2026-04-24 14:56:17 -07:00
classifier_adapters.py	fix(avocet): FineTunedAdapter GPU device routing + precise body truncation test	2026-03-15 10:56:47 -07:00
export_plans.py	feat: plans benchmark harness — model scoring for CF planning prompts	2026-05-02 23:36:04 -07:00
finetune_classifier.py	fix(avocet): use_reentrant=False for gradient checkpointing	2026-03-15 17:23:40 -07:00
gather_corpus.py	chore: add README + gather_corpus.py script	2026-04-24 15:29:26 -07:00
sft_import.py	fix: log on malformed json in _read_jsonl, use streaming id dedup	2026-04-08 07:37:22 -07:00