Gemini 3's execution time-horizon?
22
Ṁ1796 · Dec 31
On the task described in https://arxiv.org/abs/2509.09677, what will be the length of tasks that Gemini 3 will be able to complete in one go?
I'm an author of the paper, and I will run the same setup described there to resolve this.
Currently:
GPT-5 Thinking is 1024
Claude 4 Sonnet is 432
Grok-4 is 384
This question is managed and resolved by Manifold.
Related questions
Gemini 3's METR 50% time horizon
Opus 4.5's METR time horizon beats Gemini 3.0 Pro's?
84% chance
Gemini 3.0 Pro outperforms GPT-5 on METR 50% time horizon?
81% chance
Before 2026, will Gemini 3.0 exceed GPT-5 in Metr estimated time horizon?
77% chance
Will GPT-5.1 have a longer METR time horizon than Gemini 3?
21% chance
Gemini 3 Deep Think available on API in 2025?
16% chance
Gemini 3 Pro exits preview by EOY?
22% chance
How many will Gemini 3.0 achieve? [Read description]