
What will be the highest score achieved on SWE-Bench Verified in 2025?
<70: 6%
70-85 inclusive: 32%
>85: 62%
https://openai.com/index/introducing-swe-bench-verified/
https://www.swebench.com/
Highest performance reported before 2026. Any run on https://www.swebench.com/ counts. Numbers reported by large AI companies count whether or not they are listed on swebench.com. Other claimed scores will generally not be counted unless verified by a third party.
This question is managed and resolved by Manifold.
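The answer buckets can be read as a simple mapping from a reported score to an option. The sketch below is only an illustration of that reading, assuming scores are given as percentages and that both 70 and 85 fall in the middle bucket, as "inclusive" suggests. The function name `resolution_bucket` is hypothetical and is not part of Manifold or SWE-bench.

```python
def resolution_bucket(score_percent: float) -> str:
    """Map a SWE-Bench Verified score (in percent) to an answer option.

    Assumption: the boundaries 70 and 85 both belong to the middle
    bucket, following the "70-85 inclusive" wording of the question.
    """
    if not 0.0 <= score_percent <= 100.0:
        raise ValueError("score_percent must be between 0 and 100")
    if score_percent < 70:
        return "<70"
    if score_percent <= 85:
        return "70-85 inclusive"
    return ">85"


if __name__ == "__main__":
    # Illustrative values only, not reported results.
    for score in (69.9, 70.0, 85.0, 85.1):
        print(score, "->", resolution_bucket(score))
```

Under this reading, only a score strictly above 85% resolves the top option, and any score of at least 70% rules out the bottom one.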
@JacobPfau Does the "Introducing Codex" announcement resolve <70 to NO? Annoyingly, they don't give a number, but in the plot codex-1 pass@1 is clearly above 70%.
@SanghyeonSeo I don't see a way to resolve individual options. IIRC there are two types of multiple-choice questions.
Related questions
Top SWE-Bench Verified score in 2025?
What will be the best performance on SWE-bench Verified by December 31st 2025?
Top Multi-SWE-bench score in 2025?
Will an autonomous agent resolve 90% of tasks on SWE-bench by 2026?
50% chance
Will SotA on PaperBench (Code-Dev) surpass 75% in 2025?
40% chance
Will an autonomous agent resolve 90% of tasks on SWE-bench by 2027?
69% chance
When will SWE-bench be solved?
AI resolves at least X% on SWE-bench WITH assistance, by 2028?
AI resolves at least X% on SWE-bench without any assistance, by 2028?
What will be the best score (5/5 reliability) on ZeroBench by December 31st 2025?