Anthropic's Claude Mythos Preview, released April 7, 2026, has lifted trader sentiment by posting a leading 56.8% without tools and 64.7% with tools on Humanity's Last Exam, a 2,500-question frontier benchmark that tests AI reasoning across math, the sciences, and the humanities. That surpasses prior Claude Opus 4.6 results (around 40-53%) and edges out competitors such as Google's Gemini 3.1 Pro (44-46%) and OpenAI's GPT-5.4 (41-44%) on select leaderboards, though evaluation methods vary. Anthropic's bi-weekly model iterations throughout 2026 signal potential for further gains before June 30, and a rumored Q2 release of Claude 5 Opus could boost capabilities further. Traders are watching leaderboard updates and official evaluations, since benchmark discrepancies and alignment constraints could temper progress.
Experimental AI-generated summary referencing Polymarket data. This is not trading advice and plays no role in how this market resolves.
Anthropic Claude score on Humanity's Last Exam by June 30?
$208,598 Vol.
35%+: 98%
45%+: 69%
The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
Market opened: Jan 30, 2026, 12:00 AM ET
Resolver
0x65070BE91...
Frequently asked questions