Next-generation software engineering (SWE) benchmark from Scale AI, with harder tasks
Task Count: 300
Scoring Method: Percentage of issues resolved
Released: 2025-12-01
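As a rough illustration of the scoring method, the score can be read as resolved issues divided by total tasks, times 100. The sketch below assumes that simple reading; the function name `resolved_percentage` and the example count of 75 are hypothetical and not taken from the benchmark itself.

```python
def resolved_percentage(resolved: int, total: int = 300) -> float:
    """Return the share of issues resolved, as a percentage.

    `total` defaults to the listed task count of 300; the exact
    scoring rules of the benchmark are an assumption here.
    """
    if total <= 0:
        raise ValueError("total must be positive")
    if not 0 <= resolved <= total:
        raise ValueError("resolved must be between 0 and total")
    return 100.0 * resolved / total


# Hypothetical run: 75 of the 300 tasks resolved.
print(resolved_percentage(75))  # 25.0
```

Under this reading, each resolved task moves the score by one third of a percentage point.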