Anthropic’s February 2026 release of Claude Opus 4.6 established clear leadership on frontier benchmarks for agentic coding, complex reasoning, and knowledge-work tasks, with the “thinking” variant delivering the highest scores on evaluations such as Terminal-Bench 2.0 and Humanity’s Last Exam. Traders have priced claude-opus-4-6-thinking at 94 percent because no subsequent model, including later Anthropic iterations or competing releases from OpenAI and Google, has displaced it in the specific criteria the market appears to weigh. The market resolves in six days, so any unannounced capability jump or new third-party leaderboard could still shift sentiment, though current data show no credible challenger on the immediate horizon.
Experimentelle KI-generierte Zusammenfassung mit Polymarket-Daten. Dies ist keine Handelsberatung und spielt keine Rolle bei der Auflösung dieses Marktes. · Aktualisiertclaude-opus-4-6-thinking 93.3%
claude-opus-4-6 4.3%
Andere 1.6%
gemini-3.5-flash <1%
claude-opus-4-6-thinking
93%
claude-opus-4-6
4%
Andere
2%
gemini-3.5-flash
1%
claude-opus-4-7-thinking
1%
gemini-3.1-pro-preview
1%
claude-opus-4-6-thinking 93.3%
claude-opus-4-6 4.3%
Andere 1.6%
gemini-3.5-flash <1%
claude-opus-4-6-thinking
93%
claude-opus-4-6
4%
Andere
2%
gemini-3.5-flash
1%
claude-opus-4-7-thinking
1%
gemini-3.1-pro-preview
1%
Results from the "Rank" column under the "Text Arena | Overall" Leaderboard tab at https://lmarena.ai/leaderboard/text with style control off will be used to resolve this market.
No new model will be added to this market after market creation. Any model not explicitly listed in this market will be encompassed under the "Other" option.
Models will be ordered primarily by their leaderboard rank at the market’s check time. If two or more models are tied on rank, they will be ordered by their Arena score, including any underlying, unrounded, granular values reflected in the data below the leaderboard. If a tie still remains, alphabetical order of model names as listed in this market group (full string, including suffixes such as “-thinking”) will be used as a final tiebreaker (e.g., if two models remain tied, “claude-opus-4-6” would be ranked ahead of “claude-opus-4-6-thinking”). This market will resolve to the model that comes first according to this order.
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and will resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
Markt eröffnet: Jun 5, 2026, 4:45 PM ET
Resolver
0x69c47De9D...Results from the "Rank" column under the "Text Arena | Overall" Leaderboard tab at https://lmarena.ai/leaderboard/text with style control off will be used to resolve this market.
No new model will be added to this market after market creation. Any model not explicitly listed in this market will be encompassed under the "Other" option.
Models will be ordered primarily by their leaderboard rank at the market’s check time. If two or more models are tied on rank, they will be ordered by their Arena score, including any underlying, unrounded, granular values reflected in the data below the leaderboard. If a tie still remains, alphabetical order of model names as listed in this market group (full string, including suffixes such as “-thinking”) will be used as a final tiebreaker (e.g., if two models remain tied, “claude-opus-4-6” would be ranked ahead of “claude-opus-4-6-thinking”). This market will resolve to the model that comes first according to this order.
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and will resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
Resolver
0x69c47De9D...Anthropic’s February 2026 release of Claude Opus 4.6 established clear leadership on frontier benchmarks for agentic coding, complex reasoning, and knowledge-work tasks, with the “thinking” variant delivering the highest scores on evaluations such as Terminal-Bench 2.0 and Humanity’s Last Exam. Traders have priced claude-opus-4-6-thinking at 94 percent because no subsequent model, including later Anthropic iterations or competing releases from OpenAI and Google, has displaced it in the specific criteria the market appears to weigh. The market resolves in six days, so any unannounced capability jump or new third-party leaderboard could still shift sentiment, though current data show no credible challenger on the immediate horizon.
Experimentelle KI-generierte Zusammenfassung mit Polymarket-Daten. Dies ist keine Handelsberatung und spielt keine Rolle bei der Auflösung dieses Marktes. · Aktualisiert
Vorsicht bei externen Links.
Vorsicht bei externen Links.
Häufig gestellte Fragen