Anthropic's Claude Mythos Preview, released in early April 2026, has surged to the top of select Humanity's Last Exam leaderboards with scores reaching 64.7% using tools and 56.8% without, outpacing prior Opus 4.6 results of around 53% and 34% on independent evals like Scale AI's. This reflects Anthropic's blistering 2026 iteration pace—major updates every two weeks—bolstered by massive scaling in parameters and agentic capabilities amid fierce rivalry from Google's Gemini 3.1 and OpenAI's GPT-5 variants. Traders should monitor the official Scale or Artificial Analysis leaderboards for resolution criteria, with Claude 5 Opus eyed for Q2 release potentially pushing scores higher before June 30, though benchmark methodologies vary on tool use and calibration.
Polymarket verilerine atıfta bulunan deneysel AI tarafından oluşturulmuş özet. Bu bir işlem tavsiyesi değildir ve bu piyasanın nasıl çözümlendiğinde hiçbir rolü yoktur. · Güncellendi$208,598 Hac.
%35+
98%
%45+
69%
$208,598 Hac.
%35+
98%
%45+
69%
The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
Piyasa Açıldı: Jan 30, 2026, 12:00 AM ET
Resolver
0x65070BE91...The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
Resolver
0x65070BE91...Anthropic's Claude Mythos Preview, released in early April 2026, has surged to the top of select Humanity's Last Exam leaderboards with scores reaching 64.7% using tools and 56.8% without, outpacing prior Opus 4.6 results of around 53% and 34% on independent evals like Scale AI's. This reflects Anthropic's blistering 2026 iteration pace—major updates every two weeks—bolstered by massive scaling in parameters and agentic capabilities amid fierce rivalry from Google's Gemini 3.1 and OpenAI's GPT-5 variants. Traders should monitor the official Scale or Artificial Analysis leaderboards for resolution criteria, with Claude 5 Opus eyed for Q2 release potentially pushing scores higher before June 30, though benchmark methodologies vary on tool use and calibration.
Polymarket verilerine atıfta bulunan deneysel AI tarafından oluşturulmuş özet. Bu bir işlem tavsiyesi değildir ve bu piyasanın nasıl çözümlendiğinde hiçbir rolü yoktur. · Güncellendi
Harici bağlantılara dikkat edin.
Harici bağlantılara dikkat edin.
Sıkça Sorulan Sorular