Scale-SWE
| Field | Description |
|---|---|
| `instance_id` | A unique identifier formatted as `{user}_{repo}_pr{id}`. |
| `user` | The owner of the GitHub repository. |
| `repo` | The name of the GitHub repository. |
| `language` | The programming language of the codebase (currently Python). |
| `workdir` | The working directory path within the environment. |
| `image_url` | The URL of the pre-built Docker image for the task. |
| `patch` | The ground-truth patch (Golden Patch) from the corresponding pull request. |
| `pr_commit` | The commit hash of the pull request. |
| `parent_commit` | The commit hash of the parent commit (base state). |
| `problem_statement` | The issue description conveying the bug, provided to the model as input. |
| `f2p_patch` | The developer-written test patch containing tests that fail before the fix (if available). Apply this patch for evaluation. See this script. |
| `f2p_script` | The synthetic reproduction script generated by our unit-test creator agent. Because many high-quality pull requests lack author-written F2P tests, we synthesize them instead. Save this script as `test_fail_to_pass.py` directly under the repository directory just before evaluation. See this script. |
| `FAIL_TO_PASS` | Unit tests that fail on the buggy version but pass after the fix. |
| `PASS_TO_PASS` | Unit tests that pass in both versions (regression tests). |
| `github_url` | The URL of the original GitHub repository. |
| `pre_commands` | Commands that must be executed immediately upon entering the container to check out the correct commit. |
Please use AweAgent to run inference with Scale-SWE-Agent. The Scale-SWE-Agent model weights are available on Hugging Face. Key inference parameters are listed below:
| Parameter | Value |
|---|---|
| Max turns | 200 |
| Max sequence length | 256k |
| Temperature | 1 |
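As a rough sketch, the parameters in the table could be captured in a small configuration object. The class and function names here are hypothetical and are not part of AweAgent's API; the code also assumes that "256k" means 256 × 1024 tokens, which the table does not state.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class InferenceConfig:
    """Hypothetical container for the inference parameters listed above."""

    max_turns: int = 200
    # Assumption: "256k" is read as 256 * 1024 tokens.
    max_sequence_length: int = 256 * 1024
    temperature: float = 1.0


def within_budget(cfg: InferenceConfig, turns_used: int, tokens_used: int) -> bool:
    """Return True while both the turn limit and the token limit allow another step."""
    return turns_used < cfg.max_turns and tokens_used < cfg.max_sequence_length


if __name__ == "__main__":
    cfg = InferenceConfig()
    print(within_budget(cfg, turns_used=10, tokens_used=5_000))
```

A frozen dataclass keeps the settings immutable during a run, so the same values apply to every turn of a rollout.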
If you find this project useful for your research, please consider citing our paper:
@misc{zhao2026immersiongithubuniversescaling,
title={Immersion in the GitHub Universe: Scaling Coding Agents to Mastery},
author={Jiale Zhao and Guoxin Chen and Fanzhe Meng and Minghao Li and Jie Chen and Hui Xu and Yongshuai Sun and Xin Zhao and Ruihua Song and Yuan Zhang and Peng Wang and Cheng Chen and Jirong Wen and Kai Jia},
year={2026},
eprint={2602.09892},
archivePrefix={arXiv},
primaryClass={cs.SE},
url={https://arxiv.org/abs/2602.09892},
}