Google's Gemini 3.1 Pro holds a 37% score on FrontierMath Tiers 1–3—the benchmark's challenging research-level math problems—as of mid-April 2026, trailing OpenAI's GPT-5.4 at over 50% and matching Meta's recent Muse Spark model at 39%. This reflects no meaningful gains since Gemini 3 Pro's late-2025 record of 38%, amid trader concerns over stalled progress in large language model mathematical reasoning despite rapid frontier AI advances elsewhere. Competitive pressures from OpenAI and Anthropic intensify scrutiny, with Polymarket odds implying skepticism for breakthroughs. Watch Google I/O on May 19–20 for potential Gemini 4 previews or optimizations that could shift capabilities before the June 30 cutoff.
Polymarket 데이터를 참조하는 실험적 AI 생성 요약입니다. 이것은 거래 조언이 아니며 이 마켓의 정산에 영향을 미치지 않습니다. · 업데이트$127,679 거래량
40%+
92%
45%+
38%
50% 이상
33%
60% 이상
16%
$127,679 거래량
40%+
92%
45%+
38%
50% 이상
33%
60% 이상
16%
This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered.
The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.
마켓 개설일: Feb 6, 2026, 6:03 PM ET
Resolver
0x65070BE91...This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered.
The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.
Resolver
0x65070BE91...Google's Gemini 3.1 Pro holds a 37% score on FrontierMath Tiers 1–3—the benchmark's challenging research-level math problems—as of mid-April 2026, trailing OpenAI's GPT-5.4 at over 50% and matching Meta's recent Muse Spark model at 39%. This reflects no meaningful gains since Gemini 3 Pro's late-2025 record of 38%, amid trader concerns over stalled progress in large language model mathematical reasoning despite rapid frontier AI advances elsewhere. Competitive pressures from OpenAI and Anthropic intensify scrutiny, with Polymarket odds implying skepticism for breakthroughs. Watch Google I/O on May 19–20 for potential Gemini 4 previews or optimizations that could shift capabilities before the June 30 cutoff.
Polymarket 데이터를 참조하는 실험적 AI 생성 요약입니다. 이것은 거래 조언이 아니며 이 마켓의 정산에 영향을 미치지 않습니다. · 업데이트
외부 링크에 주의하세요.
외부 링크에 주의하세요.
자주 묻는 질문