Trader consensus on Polymarket assigns an 83.6% implied probability to claude-opus-4-6-thinking topping the LM Arena text leaderboard (Style Control Off) on April 17, driven by its current lead at 1502 Arena Score as of April 14—six points ahead of claude-opus-4-6 and nine over gemini-3.1-pro-preview. Anthropic's February flagship, featuring adaptive thinking and 1M-token context, continues dominating agentic coding benchmarks like SWE-Bench (93.2%) despite early-April user reports of "shrinkflation" regressions in reasoning depth and hallucinations. Challengers like gpt-5.4-high (1.1%) and gemini variants trail with lower scores and fewer recent capability demonstrations. Resolution hinges on tomorrow's leaderboard refresh, with no confirmed rival releases imminent.
Polymarket verilerine atıfta bulunan deneysel AI tarafından oluşturulmuş özet. Bu bir işlem tavsiyesi değildir ve bu piyasanın nasıl çözümlendiğinde hiçbir rolü yoktur. · GüncellendiBest AI model on April 17? (Style Control Off)
Best AI model on April 17? (Style Control Off)
claude-opus-4-6-thinking 83.9%
gpt-5.4-high 1.1%
gemini-3.1-pro-preview 1.0%
claude-opus-4-6 <1%
$17,668 Hac.
$17,668 Hac.
claude-opus-4-6-thinking
84%
gpt-5.4-high
1%
gemini-3.1-pro-preview
1%
claude-opus-4-6
1%
grok-4.20-beta1
1%
muse-spark
1%
glm-5.1
1%
gemini-3-flash
<1%
dola-seed-2.0-preview
<1%
gemini-3-pro
<1%
gemini-2.5-pro
<1%
kimi-k2.5-thinking
<1%
qwen3.5-max-preview
<1%
claude-opus-4-6-thinking 83.9%
gpt-5.4-high 1.1%
gemini-3.1-pro-preview 1.0%
claude-opus-4-6 <1%
$17,668 Hac.
$17,668 Hac.
claude-opus-4-6-thinking
84%
gpt-5.4-high
1%
gemini-3.1-pro-preview
1%
claude-opus-4-6
1%
grok-4.20-beta1
1%
muse-spark
1%
glm-5.1
1%
gemini-3-flash
<1%
dola-seed-2.0-preview
<1%
gemini-3-pro
<1%
gemini-2.5-pro
<1%
kimi-k2.5-thinking
<1%
qwen3.5-max-preview
<1%
Results from the "Score" column under the "Text Arena | Overall" Leaderboard tab at https://lmarena.ai/leaderboard/text with style control off will be used to resolve this market.
Models will be ranked primarily by their arena score at this market’s check time, with alphabetical order of model names as listed in this market group (full string, including suffixes such as “-thinking”) used as a tiebreaker (e.g., if the two models are tied by arena score, “claude-opus-4-6” would be ranked ahead of “claude-opus-4-6-thinking”). This market will resolve based on the model that occupies first place under this ranking.
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and will resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
Piyasa Açıldı: Apr 9, 2026, 5:18 PM ET
Resolver
0x69c47De9D...Results from the "Score" column under the "Text Arena | Overall" Leaderboard tab at https://lmarena.ai/leaderboard/text with style control off will be used to resolve this market.
Models will be ranked primarily by their arena score at this market’s check time, with alphabetical order of model names as listed in this market group (full string, including suffixes such as “-thinking”) used as a tiebreaker (e.g., if the two models are tied by arena score, “claude-opus-4-6” would be ranked ahead of “claude-opus-4-6-thinking”). This market will resolve based on the model that occupies first place under this ranking.
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and will resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
Resolver
0x69c47De9D...Trader consensus on Polymarket assigns an 83.6% implied probability to claude-opus-4-6-thinking topping the LM Arena text leaderboard (Style Control Off) on April 17, driven by its current lead at 1502 Arena Score as of April 14—six points ahead of claude-opus-4-6 and nine over gemini-3.1-pro-preview. Anthropic's February flagship, featuring adaptive thinking and 1M-token context, continues dominating agentic coding benchmarks like SWE-Bench (93.2%) despite early-April user reports of "shrinkflation" regressions in reasoning depth and hallucinations. Challengers like gpt-5.4-high (1.1%) and gemini variants trail with lower scores and fewer recent capability demonstrations. Resolution hinges on tomorrow's leaderboard refresh, with no confirmed rival releases imminent.
Polymarket verilerine atıfta bulunan deneysel AI tarafından oluşturulmuş özet. Bu bir işlem tavsiyesi değildir ve bu piyasanın nasıl çözümlendiğinde hiçbir rolü yoktur. · Güncellendi
Harici bağlantılara dikkat edin.
Harici bağlantılara dikkat edin.
Sıkça Sorulan Sorular