Will an LLM break 1400 ELO on LMSys before February?
Ṁ26k Feb 2
44% chance
Google currently leads with Gemini, which has two models at around 1370 ELO.
But OpenAI just announced o3, which is getting great marks on benchmarks such as hard science questions.
https://deepnewz.com/ai-modeling/openai-unveils-o3-o3-mini-models-exceeding-human-performance-on-arc-agi-4f05e4f7
The resolution is simple: will an LMSys update contain a model with 1400 ELO? The cutoff is the last day of January (Eastern Time).
This question is managed and resolved by Manifold.
Worth noting: this market is essentially https://manifold.markets/bobbill/will-any-llm-outrank-gpt4-by-150-el but with a close date one month later.
Related questions
Which organization will have the top LLM on LMSys on March 1st?
What organization will top the LLM leaderboards on LMArena at end of 2025?
EOY 2025: Will open LLMs match closed-source LLMs on coding to within 50 ELO points?
34% chance
Will LLMs mostly overcome the Reversal Curse by the end of 2025?
67% chance
Will an LLM improve its own ability along some important metric well beyond the best trained LLMs before 2026?
58% chance
400-point pwn solved by an LLM by 2025
69% chance
Llama 3 405B ELO on Lmsys Arena Leaderboard 2 weeks after first appearance?
Will any LLM outrank GPT-4 by 150 Elo in LMSYS chatbot arena before 2025?
1% chance
Will an LLM be able to solve a Rubik's Cube by 2025?
9% chance
Will LLM progress stall in 2024?
3% chance