Tutorials
Build unlimited training data and train state-of-the-art SWE-agents. This guide covers the complete workflow: from environment setup to model training and evaluation.
๐ Prerequisites
System Requirements
Required: Docker Tested on: Ubuntu 22.04.4 LTS Not supported: Windows, macOS
New to SWE-smith? Start with Installation and Quickstart.
๐ฏ Quick Navigation
-
Build Environments
Create reproducible Docker images for any repository. Capture dependencies, build containers, and validate with automated testing.
-
Create Instances
Generate task instances using LM prompts, procedural modifications, PR mirroring, or combined techniques. Scale to thousands of bugs.
-
Validate & Evaluate
Filter candidates that break tests and verify proposed solutions. Built-in harnesses for validation and evaluation workflows.
-
Generate Issue Text
Add natural language problem statements to task instances using LM generation or alternative methods.
-
Rate Difficulty ยท Optional
Classify tasks as easy/medium/hard using a fine-tuned Qwen 2.5 Coder model. Compare against SWE-bench benchmarks.
-
Train SWE-agents
Complete RSFT pipeline: generate trajectories, filter successful solutions, fine-tune models, and evaluate on SWE-bench.
Recommended Workflow
graph LR
A[Build Environments] --> B[Create Instances]
B --> C[Validate & Evaluate]
C --> D[Generate Issue Text]
D --> E[Rate Difficulty]
D --> F[Train SWE-agents]
E --> F
- Build Environments โ Set up Docker images
- Create Instances โ Generate synthetic bugs
- Validate & Evaluate โ Filter valid task instances
- Generate Issue Text โ Add problem descriptions
- Rate Difficulty (optional) โ Classify task complexity
- Train SWE-agents โ Fine-tune models with RSFT