Arena.ai's Code Arena leaderboard, powered by community-voted Elo ratings on agentic coding tasks like multi-file web development and tool use, currently crowns Anthropic's Claude Opus 4.6 at the top with a score near 2000, ahead of Google's Gemini 3.1 Pro around 1940. Recent surges from open models like Z.AI's GLM-5.1—now the leading open contender and within 20 Elo points of the frontier—along with Alibaba's Qwen 3.6 and xAI's Grok 4.20 beta, reflect intensifying competition from scaled reasoning chains and specialized training. OpenAI's GPT-5.4 trails but shows gains in structured code gen. With eight months until year-end resolution on arena.ai/leaderboard/code, traders eye imminent releases like potential Claude 5 or GPT-6 previews at summer conferences, amid benchmark saturation pushing reliance on real-world evals.
Riepilogo sperimentale generato dall'AI con riferimento ai dati di Polymarket. Questo non è un consiglio di trading e non ha alcun ruolo nella risoluzione di questo mercato. · AggiornatoWill any AI model reach ___ Coding Arena Score by December 31?
Will any AI model reach ___ Coding Arena Score by December 31?
1560
93%
1580
53%
1600
51%
$2,083 Vol.
1560
93%
1580
53%
1600
51%
Results from the "Score" column under the "Text Arena | Coding" Leaderboard tab at https://arena.ai/leaderboard/text/coding-no-style-control with style control off will be used to resolve this market.
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at arena.ai/leaderboard/text. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and will resolve based on the first check after it becomes available. If permanently unavailable, this market will resolve to "No".
Mercato aperto: Apr 2, 2026, 6:09 PM ET
Resolver
0x65070BE91...Results from the "Score" column under the "Text Arena | Coding" Leaderboard tab at https://arena.ai/leaderboard/text/coding-no-style-control with style control off will be used to resolve this market.
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at arena.ai/leaderboard/text. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and will resolve based on the first check after it becomes available. If permanently unavailable, this market will resolve to "No".
Resolver
0x65070BE91...Arena.ai's Code Arena leaderboard, powered by community-voted Elo ratings on agentic coding tasks like multi-file web development and tool use, currently crowns Anthropic's Claude Opus 4.6 at the top with a score near 2000, ahead of Google's Gemini 3.1 Pro around 1940. Recent surges from open models like Z.AI's GLM-5.1—now the leading open contender and within 20 Elo points of the frontier—along with Alibaba's Qwen 3.6 and xAI's Grok 4.20 beta, reflect intensifying competition from scaled reasoning chains and specialized training. OpenAI's GPT-5.4 trails but shows gains in structured code gen. With eight months until year-end resolution on arena.ai/leaderboard/code, traders eye imminent releases like potential Claude 5 or GPT-6 previews at summer conferences, amid benchmark saturation pushing reliance on real-world evals.
Riepilogo sperimentale generato dall'AI con riferimento ai dati di Polymarket. Questo non è un consiglio di trading e non ha alcun ruolo nella risoluzione di questo mercato. · Aggiornato
Fai attenzione ai link esterni.
Fai attenzione ai link esterni.
Domande frequenti