feat: writing style finetuning backend (Phase 2) #52

Open
opened 2026-05-01 12:13:45 -07:00 by pyr0ball · 0 comments

Context

Avocet already benchmarks writing style models. This adds the finetuning backend to close the loop: corpus → train → personal style LoRA adapter.

Work

  • scripts/finetune_style.py: LoRA finetuning of an LLM on the personal writing corpus gathered by gather_corpus.py
  • Register a style job type in the train queue
  • Corpus format: .txt files in data/style_corpus/ (already gathered by gather_corpus.py)
  • Hyperparams: base model, LoRA rank, epochs, sample length
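
A minimal sketch of how the hyperparameters and corpus loading listed above might fit together, before any PEFT code is involved. The names `StyleFinetuneConfig` and `load_style_samples`, every default value, and the choice to measure sample length in characters are assumptions made for illustration; nothing here is taken from the existing codebase.

```python
from dataclasses import dataclass
from pathlib import Path


@dataclass
class StyleFinetuneConfig:
    # Hypothetical config: field names and defaults are illustrative assumptions.
    base_model: str = "base-model-placeholder"
    lora_rank: int = 8
    epochs: int = 3
    sample_length: int = 512  # counted in characters here; tokens may suit better
    corpus_dir: Path = Path("data/style_corpus")


def load_style_samples(config: StyleFinetuneConfig) -> list[str]:
    """Read every .txt file in the corpus directory and cut it into fixed-length samples."""
    samples: list[str] = []
    # Sort for a deterministic sample order across runs.
    for path in sorted(config.corpus_dir.glob("*.txt")):
        text = path.read_text(encoding="utf-8").strip()
        for start in range(0, len(text), config.sample_length):
            chunk = text[start:start + config.sample_length].strip()
            if chunk:  # skip whitespace-only chunks
                samples.append(chunk)
    return samples
```

The returned list of strings could then be tokenized and passed to whatever training loop #46 provides. Splitting on a fixed character count is the simplest option; splitting on paragraph or token boundaries may preserve style better.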

Depends on

  • #43 (train job queue)
  • #46 (LLM SFT backend — reuse PEFT infrastructure)
pyr0ball added this to the v2 — Pipeline Architecture milestone 2026-05-01 12:13:45 -07:00
pyr0ball added the ml, phase-2, backend labels 2026-05-01 12:13:45 -07:00
Reference: Circuit-Forge/avocet#52