How long until one of Gemini, Claude, etc... match the capabilities of O1? | Manifold

Oct 12th 2024: 2%
Dec 12th 2024: 5%
April 12th 2025: 45%
September 12th 2025: 36%
April 12th 2026: 5%
Other: 5%

OpenAI's O1 model represents a new paradigm of LLMs. How long until a competitor catches up?

"Catches up" / "matches capabilities" is defined as matching or exceeding the O1 pass@1 benchmarks on AIME, Codeforces, and GPQA at the time of publication:

74.4% accuracy on AIME
89th percentile on Codeforces
78% accuracy on GPQA
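
As a rough illustration of how these criteria could be checked, here is a minimal sketch. It assumes the natural reading that a competitor must meet or exceed all three figures at once; the function name, dictionary keys, and example scores are hypothetical and not part of the market.

```python
# Thresholds copied from the resolution criteria above (o1 pass@1 figures).
O1_THRESHOLDS = {
    "aime": 74.4,        # AIME figure as listed in the criteria
    "codeforces": 89.0,  # Codeforces percentile
    "gpqa": 78.0,        # GPQA accuracy, in percent
}


def matches_o1(scores: dict[str, float]) -> bool:
    """Return True only if every benchmark score meets or exceeds the o1 figure.

    A benchmark missing from `scores` counts as not met.
    """
    return all(
        scores.get(name, float("-inf")) >= threshold
        for name, threshold in O1_THRESHOLDS.items()
    )


# Hypothetical competitor scores, invented purely for illustration.
example_scores = {"aime": 76.0, "codeforces": 85.0, "gpqa": 80.0}
print(matches_o1(example_scores))  # False: the Codeforces figure falls short
```

Under this reading, a model that beats o1 on two benchmarks but trails on the third would not count as having caught up.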

This question is managed and resolved by Manifold.

It's soooooo slow though. And in real-world, day-to-day usage its results are rarely better than those from Sonnet 3.5 (new), which are nearly instantaneous. (I ask both the same questions, and use them on a nearly daily basis.)

The Oct 12th 2024 option should be resolved.

@Adamacki I can’t seem to find a way to partially resolve the market

Apparently o1's AIME score was pass@10000, not pass@1. The criteria should be updated accordingly.

@JaundicedBaboon I updated the benchmark to the pass@1 score
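
For readers unfamiliar with the distinction raised above: pass@k is the chance that at least one of k sampled attempts is correct, so it can sit far above pass@1 for the same model. The sketch below uses the unbiased estimator commonly applied to code-generation benchmarks; it is only an illustration, and nothing in this thread says how the o1 figures were actually computed. The sample counts are made up.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate pass@k from n sampled attempts, c of which were correct.

    Computes 1 - C(n - c, k) / C(n, k): one minus the probability that
    a random subset of k attempts contains no correct one.
    """
    if not 0 <= c <= n:
        raise ValueError("c must be between 0 and n")
    if not 1 <= k <= n:
        raise ValueError("k must be between 1 and n")
    return 1.0 - comb(n - c, k) / comb(n, k)


# Made-up example: 10 attempts on one problem, 2 of them correct.
print(pass_at_k(n=10, c=2, k=1))  # roughly 0.2
print(pass_at_k(n=10, c=2, k=5))  # roughly 0.78
```

With the same 2-in-10 success rate, allowing five attempts instead of one raises the estimate from about 20% to about 78%, which is why the two kinds of score should not be compared directly.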

© Manifold Markets, Inc.