avocet

Author	SHA1	Message	Date
pyr0ball	391ebb3cd1	feat(recipe-scan): labeling UI for Kiwi vision training pipeline (closes #65 ) - POST /api/recipe-scan/import — bulk ingest from Kiwi scanner pipeline, idempotent by item id - GET /api/recipe-scan/next — oldest-first pending item for review - POST /api/recipe-scan/items/{id}/approve\|edit\|reject — label actions - GET /api/recipe-scan/stats — counts by status and modality - GET /api/recipe-scan/export — JSONL training pairs (messages chat format, Option B: correction prompt + extracted draft → corrected ground truth) - GET /api/recipe-scan/image — path-traversal-safe image serving from /Library/Assets/kiwi/ - SQLite at data/recipe_scan.db with WAL mode; separate from corpus.db lifecycle - set_db_path() testability seam; 18 tests, all passing - RecipeScanView.vue: two-column review UI (image left, JSON diff right), keyboard shortcuts A/E/R, toast feedback, stats header, export download - Route /data/recipe-scan and sidebar nav entry added	2026-05-17 12:22:15 -07:00
pyr0ball	9bb88b168f	feat(corpus): pipeline log ingest from shared dir (closes #67 ) Pull-side companion to kiwi#141. Ingests structured JSONL pipeline logs from /Library/Assets/logs/pipeline/ into the log corpus for Turnstone logreading model training. - app/data/log_corpus.py: add ingested_pipeline_files tracking table, _pipeline_ingest_dir() config helper, _ingest_one_file() parser, and POST /api/corpus/pipeline-ingest endpoint - source_host = "pipeline_scrape"; source_id from logger field; extra dict stored as matched_patterns; batch_type = "pipeline_log" - Idempotent by filename: skips files already in ingested_pipeline_files - config/label_tool.yaml.example: add corpus section with pipeline_ingest_dir and push sources comment block - tests/test_log_corpus.py: 8 new tests covering ingest, idempotency, non-JSONL filtering, malformed line resilience, incremental runs	2026-05-17 11:28:33 -07:00
pyr0ball	d416ef8aa4	feat(imitate): task-model assignment routing via cf-orch Add _resolve_task_model() helper that looks up a product.task assignment from the coordinator and resolves its service_type from the model registry. Add task_ids param to run_imitate() (comma-separated "product/task" strings) so the imitate harness can dispatch to models chosen by the assignment layer rather than requiring explicit model IDs.	2026-05-17 11:23:55 -07:00
pyr0ball	2b990a603a	feat: log corpus receiver — accept Turnstone push batches and label for logreading fine-tune Adds corpus.db (corpus_sources, corpus_batches, corpus_entries), a FastAPI router at /api/corpus with receive/label/skip/stats/export endpoints, and seeds consent tokens for xanderland + orchard nodes from label_tool.yaml. PII flag excludes entries from JSONL export. Closes avocet#61.	2026-05-11 17:07:54 -07:00
pyr0ball	e11db5ccd9	fix: align train job/results API envelope, config_json key, progress SSE, dashboard model_key - GET /api/train/jobs now returns {"jobs":[...]} instead of bare array - GET /api/train/results now returns {"results":[...]} instead of bare array - POST /api/train/jobs body key renamed config -> config_json to match Pydantic model - SSE log handler now handles 'progress' event type (backend never emits 'log') - Dashboard _get_active_jobs() adds model_key to SELECT and return dict - corrections.py docstring updated: both /api/corrections and /api/sft prefixes noted - test_train.py assertions updated to unwrap new envelope shapes	2026-05-02 21:22:18 -07:00
pyr0ball	8fda821e15	feat: add POST /ingest endpoint to corrections API with Bearer auth Adds IngestRequest model and POST /api/sft/ingest route to app/data/corrections.py. Sibling CF products (Peregrine, Kiwi, etc.) can push pre-approved corrections via Bearer token auth (AVOCET_INGESTION_SECRET). Records land as status=approved in both sft_candidates.jsonl and sft_approved.jsonl immediately. 7 tests in tests/test_data_corrections.py cover 503 (secret unset), 401 (missing/malformed header), 403 (wrong secret), happy-path writes to both files, and optional label field.	2026-05-02 09:07:10 -07:00
pyr0ball	d74ad3f972	feat: move imitate API into app/data/imitate.py	2026-05-01 22:12:19 -07:00
pyr0ball	99ea39fe38	feat: move SFT corrections API into app/data/corrections.py	2026-05-01 22:02:22 -07:00
pyr0ball	2054866ff1	feat: extract fetch routes and IMAP helpers into app/data/fetch.py	2026-05-01 21:57:31 -07:00
pyr0ball	167d7351e3	feat: extract label queue API into app/data/label.py	2026-05-01 18:48:14 -07:00

10 commits