Trader consensus on Polymarket heavily favors Anthropic's Claude Opus 4.6 (Thinking) at 46% implied probability for topping AI leaderboards by April 17, driven by its unchallenged dominance on the LMSYS Chatbot Arena since its February 2026 release, where it holds the top Elo score of around 1502 in text and code categories as of late March. This variant's enhanced chain-of-thought reasoning delivers superior performance in agentic tasks like SWE-Bench and Terminal-Bench compared to rivals, sustaining its edge despite earlier surges from Google's Gemini 3.1 Pro Preview and OpenAI's GPT-5.4 High. Recent previews of Anthropic's Claude Mythos show massive gains (e.g., 77.8% on SWE-Bench Pro vs. Opus 4.6's 53.4%), but full rollout uncertainty keeps traders anchored to the proven leader amid leaderboard volatility in the final week.
Resumo experimental gerado por IA com dados do Polymarket. Isto não é aconselhamento de trading e não tem qualquer papel na resolução deste mercado. · AtualizadoTop AI model on April 17? (Style Control On)
Top AI model on April 17? (Style Control On)
claude-opus-4-6-thinking 46%
claude-opus-4-6 6%
gemini-3.1-pro-preview 2.6%
gpt-5.4-high 2.5%
claude-opus-4-6-thinking
46%
claude-opus-4-6
6%
gemini-3.1-pro-preview
3%
gpt-5.4-high
3%
grok-4.20-beta1
2%
claude-opus-4-5-20251101-thinking-32k
2%
qwen3.5-max-preview
2%
dola-seed-2.0-preview
2%
gemini-3-flash
2%
kimi-k2.5-thinking
2%
gemini-3-pro
2%
gemini-2.5-pro
2%
grok-4.20-beta-0309-reasoning
2%
gpt-5.2-chat-latest-20260210
1%
claude-opus-4-6-thinking 46%
claude-opus-4-6 6%
gemini-3.1-pro-preview 2.6%
gpt-5.4-high 2.5%
claude-opus-4-6-thinking
46%
claude-opus-4-6
6%
gemini-3.1-pro-preview
3%
gpt-5.4-high
3%
grok-4.20-beta1
2%
claude-opus-4-5-20251101-thinking-32k
2%
qwen3.5-max-preview
2%
dola-seed-2.0-preview
2%
gemini-3-flash
2%
kimi-k2.5-thinking
2%
gemini-3-pro
2%
gemini-2.5-pro
2%
grok-4.20-beta-0309-reasoning
2%
gpt-5.2-chat-latest-20260210
1%
Results from the "Score" column under the "Text Arena | Overall" Leaderboard tab at https://lmarena.ai/leaderboard/text with style control on will be used to resolve this market.
Models will be ranked primarily by their arena score at this market’s check time, with alphabetical order of model names as listed in this market group (full string, including suffixes such as “-thinking”) used as a tiebreaker (e.g., if the two models are tied by arena score, “claude-opus-4-6” would be ranked ahead of “claude-opus-4-6-thinking”). This market will resolve based on the model that occupies first place under this ranking.
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and will resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
Mercado Aberto: Apr 9, 2026, 5:20 PM ET
Resolver
0x69c47De9D...Results from the "Score" column under the "Text Arena | Overall" Leaderboard tab at https://lmarena.ai/leaderboard/text with style control on will be used to resolve this market.
Models will be ranked primarily by their arena score at this market’s check time, with alphabetical order of model names as listed in this market group (full string, including suffixes such as “-thinking”) used as a tiebreaker (e.g., if the two models are tied by arena score, “claude-opus-4-6” would be ranked ahead of “claude-opus-4-6-thinking”). This market will resolve based on the model that occupies first place under this ranking.
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and will resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
Resolver
0x69c47De9D...Trader consensus on Polymarket heavily favors Anthropic's Claude Opus 4.6 (Thinking) at 46% implied probability for topping AI leaderboards by April 17, driven by its unchallenged dominance on the LMSYS Chatbot Arena since its February 2026 release, where it holds the top Elo score of around 1502 in text and code categories as of late March. This variant's enhanced chain-of-thought reasoning delivers superior performance in agentic tasks like SWE-Bench and Terminal-Bench compared to rivals, sustaining its edge despite earlier surges from Google's Gemini 3.1 Pro Preview and OpenAI's GPT-5.4 High. Recent previews of Anthropic's Claude Mythos show massive gains (e.g., 77.8% on SWE-Bench Pro vs. Opus 4.6's 53.4%), but full rollout uncertainty keeps traders anchored to the proven leader amid leaderboard volatility in the final week.
Resumo experimental gerado por IA com dados do Polymarket. Isto não é aconselhamento de trading e não tem qualquer papel na resolução deste mercado. · Atualizado
Cuidado com os links externos.
Cuidado com os links externos.
Frequently Asked Questions