avocet/scripts
pyr0ball bce932461a feat: plans benchmark harness — model scoring for CF planning prompts
Adds benchmark_plans.py script, plans_bench API router, PlansBenchTab Vue
component, and registers /api/plans-bench in api.py. Also extends models
registry (cf-text catalog integration), cforch client, LlmEvalTab, and
ModelsView with cf-orch fleet support. Wires Planning mode into BenchmarkView.
2026-05-02 23:36:04 -07:00
__init__.py feat: initial avocet repo — email classifier training tool 2026-02-27 14:07:38 -08:00
benchmark_classifier.py fix(avocet): guard discover_finetuned_models against malformed/incomplete training_info.json 2026-03-15 15:18:13 -07:00
benchmark_plans.py feat: plans benchmark harness — model scoring for CF planning prompts 2026-05-02 23:36:04 -07:00
benchmark_style.py refactor(bench): extract benchmark tabs — classifier, compare, llm-eval, style, voice 2026-04-24 14:56:17 -07:00
benchmark_voice.py refactor(bench): extract benchmark tabs — classifier, compare, llm-eval, style, voice 2026-04-24 14:56:17 -07:00
classifier_adapters.py fix(avocet): FineTunedAdapter GPU device routing + precise body truncation test 2026-03-15 10:56:47 -07:00
export_plans.py feat: plans benchmark harness — model scoring for CF planning prompts 2026-05-02 23:36:04 -07:00
finetune_classifier.py fix(avocet): use_reentrant=False for gradient checkpointing 2026-03-15 17:23:40 -07:00
gather_corpus.py chore: add README + gather_corpus.py script 2026-04-24 15:29:26 -07:00
sft_import.py fix: log on malformed json in _read_jsonl, use streaming id dedup 2026-04-08 07:37:22 -07:00