Training data: SWE-ZERO-12M-trajectories as mid-training corpus for local LLM agentic tool-use #41
Reference: Circuit-Forge/roadmap#41
Dataset
HuggingFace: AlienKevin/SWE-ZERO-12M-trajectories
License: Apache 2.0
Size: 12.3M rollouts, 112B tokens, 122,908 unique PRs, 3,222 repos, 16 languages
Schema: instance_id, repo, messages (multi-turn, {role, content}), trajectory_format, exit_status, duration_sec
What it is
The largest agentic coding trace dataset to date. Each row is a multi-turn agent-environment trajectory in which an LLM navigates a real repo (bash, file reads, edits) while attempting to resolve a GitHub PR. The stated purpose is mid-training to instill agentic tool-use priors in code models, not SFT.
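For orientation, a rough sketch of one row as Python types. The field names come from the schema above; the Python types (str, float) and the helper `missing_fields` are assumptions for illustration and have not been checked against the dataset card.

```python
from typing import Any, Mapping, TypedDict


class Message(TypedDict):
    role: str      # assumed to be a plain string such as "user" or "assistant"
    content: str


class Trajectory(TypedDict):
    instance_id: str
    repo: str
    messages: list[Message]   # multi-turn agent-environment exchange
    trajectory_format: str
    exit_status: str          # e.g. "Submitted"
    duration_sec: float       # numeric type is an assumption


REQUIRED_FIELDS = (
    "instance_id",
    "repo",
    "messages",
    "trajectory_format",
    "exit_status",
    "duration_sec",
)


def missing_fields(row: Mapping[str, Any]) -> list[str]:
    """Return the schema fields absent from a row (hypothetical sanity check)."""
    return [name for name in REQUIRED_FIELDS if name not in row]
```

A check like this could be run on the first few sampled rows before committing to a full download, to confirm the schema matches what is listed here.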
Submission rate findings
Sampled across 6 offsets spanning the full dataset. Observed roughly 2-5% exit_status=Submitted. Submitted trajectories tend to be short (4-8 messages) and simple (version bumps, single-file edits). Complex multi-file refactors rarely reach submission. Patches are unverified against tests.
Value for CF
Submitted + 4-8 message trajectories
Recommended pipeline: mid-train on the full corpus, SFT on the filtered Submitted/simple-PR slice, then SFT again on CF-specific task data. This produces a local model with strong tool-use priors before it sees any CF product data.
Relevant products: all CF products using local LLM agentic execution (Peregrine research loop, Turnstone log navigation, cf-orch task agents, Magpie scraping pipelines, future home assistant product).
Next steps
exit_status=Submitted, len(messages) <= 10, Python/JS repos only
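A minimal sketch of that filter as a row predicate, plus a helper for the submission rate of a sample. Assumptions: rows are dicts with the schema fields above; the schema has no language column, so `repo_languages` is a hypothetical repo-to-primary-language mapping that would have to be built separately; the language labels "python" and "javascript" are placeholders.

```python
from typing import Any, Iterable, Mapping

MAX_MESSAGES = 10
ALLOWED_LANGUAGES = {"python", "javascript"}  # assumed labels for "Python/JS"


def keep_trajectory(
    row: Mapping[str, Any],
    repo_languages: Mapping[str, str],  # hypothetical lookup, not in the dataset
) -> bool:
    """True if the row is Submitted, short, and from a Python/JS repo."""
    language = repo_languages.get(row["repo"], "").lower()
    return (
        row["exit_status"] == "Submitted"
        and len(row["messages"]) <= MAX_MESSAGES
        and language in ALLOWED_LANGUAGES
    )


def submission_rate(rows: Iterable[Mapping[str, Any]]) -> float:
    """Fraction of rows with exit_status == "Submitted" (0.0 for an empty sample)."""
    materialized = list(rows)
    if not materialized:
        return 0.0
    submitted = sum(1 for r in materialized if r["exit_status"] == "Submitted")
    return submitted / len(materialized)
```

Both functions work on any iterable of dict-like rows, so they could be applied to a streamed sample without loading the full corpus. Repos missing from `repo_languages` are dropped rather than guessed.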