Google's Gemini 3.1 Pro Preview has surged to the top of the Humanity's Last Exam leaderboard with 44.7% accuracy on its 2,500 expert-vetted questions spanning mathematics, sciences, and humanities, outpacing OpenAI's GPT-5.4 at 41.6% and signaling stronger multi-step reasoning in recent model releases from March-April 2026. This positions Gemini favorably in the competitive AI landscape against Anthropic's Claude variants and Meta's new Muse Spark, which trails slightly despite strong showings. Traders note persistent high calibration errors across leaders, indicating overconfidence gaps versus expert human scores near 90%. Key catalysts ahead include Google I/O in May for potential Gemini 3.1 updates or previews, which could elevate scores before the June 30 resolution amid rapid benchmark iteration.
Polymarket डेटा का संदर्भ देने वाला प्रयोगात्मक AI-जनरेटेड सारांश। यह ट्रेडिंग सलाह नहीं है और इस बाज़ार के समाधान में कोई भूमिका नहीं निभाता। · अपडेट किया गया$305,941 वॉल्यूम
50%+
40%
55%+
20%
60%+
10%
$305,941 वॉल्यूम
50%+
40%
55%+
20%
60%+
10%
The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
बाज़ार खुला: Jan 29, 2026, 12:50 PM ET
Resolver
0x65070BE91...The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
Resolver
0x65070BE91...Google's Gemini 3.1 Pro Preview has surged to the top of the Humanity's Last Exam leaderboard with 44.7% accuracy on its 2,500 expert-vetted questions spanning mathematics, sciences, and humanities, outpacing OpenAI's GPT-5.4 at 41.6% and signaling stronger multi-step reasoning in recent model releases from March-April 2026. This positions Gemini favorably in the competitive AI landscape against Anthropic's Claude variants and Meta's new Muse Spark, which trails slightly despite strong showings. Traders note persistent high calibration errors across leaders, indicating overconfidence gaps versus expert human scores near 90%. Key catalysts ahead include Google I/O in May for potential Gemini 3.1 updates or previews, which could elevate scores before the June 30 resolution amid rapid benchmark iteration.
Polymarket डेटा का संदर्भ देने वाला प्रयोगात्मक AI-जनरेटेड सारांश। यह ट्रेडिंग सलाह नहीं है और इस बाज़ार के समाधान में कोई भूमिका नहीं निभाता। · अपडेट किया गया
बाहरी लिंक से सावधान रहें।
बाहरी लिंक से सावधान रहें।
अक्सर पूछे जाने वाले प्रश्न