OpenAI's GPT-5.4 currently scores around 41-44% on Humanity's Last Exam—a rigorous 2,500-question benchmark probing frontier AI capabilities across math, science, and humanities—trailing Google's Gemini 3.1 Pro Preview at 46% per recent leaderboards, reflecting trader consensus on steady but unsaturated progress. Scores have surged from under 10% in early 2025 via enhanced reasoning chains and larger-scale training, yet no model exceeds 50%, underscoring HLE's resilience against scaling alone. With 2.5 months to June 30, 2026, anticipation centers on OpenAI's next frontier model release amid competitive pressure from Anthropic's Claude Opus variants, though technical hurdles like novel reasoning could delay breakthroughs.
Polymarket डेटा का संदर्भ देने वाला प्रयोगात्मक AI-जनरेटेड सारांश। यह ट्रेडिंग सलाह नहीं है और इस बाज़ार के समाधान में कोई भूमिका नहीं निभाता। · अपडेट किया गया$14,898 वॉल्यूम
50%+
54%
$14,898 वॉल्यूम
50%+
54%
The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
बाज़ार खुला: Jan 30, 2026, 12:00 AM ET
Resolver
0x65070BE91...The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
Resolver
0x65070BE91...OpenAI's GPT-5.4 currently scores around 41-44% on Humanity's Last Exam—a rigorous 2,500-question benchmark probing frontier AI capabilities across math, science, and humanities—trailing Google's Gemini 3.1 Pro Preview at 46% per recent leaderboards, reflecting trader consensus on steady but unsaturated progress. Scores have surged from under 10% in early 2025 via enhanced reasoning chains and larger-scale training, yet no model exceeds 50%, underscoring HLE's resilience against scaling alone. With 2.5 months to June 30, 2026, anticipation centers on OpenAI's next frontier model release amid competitive pressure from Anthropic's Claude Opus variants, though technical hurdles like novel reasoning could delay breakthroughs.
Polymarket डेटा का संदर्भ देने वाला प्रयोगात्मक AI-जनरेटेड सारांश। यह ट्रेडिंग सलाह नहीं है और इस बाज़ार के समाधान में कोई भूमिका नहीं निभाता। · अपडेट किया गया
बाहरी लिंक से सावधान रहें।
बाहरी लिंक से सावधान रहें।
अक्सर पूछे जाने वाले प्रश्न