xAI's Grok models currently trail OpenAI's GPT-5 series on the FrontierMath benchmark, where GPT-5.4 leads at 47.6% accuracy on expert-level math problems including unsolved research challenges, while Grok-4 scored around 14-20% on tiers 1-3 per Epoch AI evaluations. Recent Grok 4.20 release in early 2026 has topped instruction-following (IFBench 82%) and Arena leaderboards with multi-agent reasoning and a 2M-token context window, fueling optimism for math gains amid xAI's rapid iteration and 1GW Colossus training cluster. Competitive pressure intensifies as OpenAI pushes records like 31% on Tier 4; traders eye a potential Grok 5 rollout—rumored at 7 trillion parameters—before the June 30 deadline as the key catalyst for closing the gap.
Polymarket 데이터를 참조하는 실험적 AI 생성 요약입니다. 이것은 거래 조언이 아니며 이 마켓의 정산에 영향을 미치지 않습니다. · 업데이트$19,331 거래량
25%+
59%
30%+
53%
40%+
62%
50%+
23%
$19,331 거래량
25%+
59%
30%+
53%
40%+
62%
50%+
23%
This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered.
The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.
마켓 개설일: Jan 30, 2026, 12:01 AM ET
Resolver
0x65070BE91...This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered.
The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.
Resolver
0x65070BE91...xAI's Grok models currently trail OpenAI's GPT-5 series on the FrontierMath benchmark, where GPT-5.4 leads at 47.6% accuracy on expert-level math problems including unsolved research challenges, while Grok-4 scored around 14-20% on tiers 1-3 per Epoch AI evaluations. Recent Grok 4.20 release in early 2026 has topped instruction-following (IFBench 82%) and Arena leaderboards with multi-agent reasoning and a 2M-token context window, fueling optimism for math gains amid xAI's rapid iteration and 1GW Colossus training cluster. Competitive pressure intensifies as OpenAI pushes records like 31% on Tier 4; traders eye a potential Grok 5 rollout—rumored at 7 trillion parameters—before the June 30 deadline as the key catalyst for closing the gap.
Polymarket 데이터를 참조하는 실험적 AI 생성 요약입니다. 이것은 거래 조언이 아니며 이 마켓의 정산에 영향을 미치지 않습니다. · 업데이트
외부 링크에 주의하세요.
외부 링크에 주의하세요.
자주 묻는 질문