Google’s Gemini models hold the current lead on Epoch AI’s FrontierMath benchmark, with Gemini 3 Pro and 3.1 Pro variants posting roughly 38% on Tiers 1–3 and isolated Deep Think or preview runs exceeding 40%, ahead of GPT-5.2 and Claude Opus 4.6 equivalents. With the June 30 resolution date only weeks away, traders price a high probability that at least one Gemini version clears the 40% threshold, driven by the narrow gap and Google DeepMind’s ongoing inference optimizations and internal scaling experiments. Limited runway remains for a new flagship release or training run, while the benchmark’s focus on unpublished, expert-vetted problems makes rapid gains difficult; any official confirmation of a higher score before deadline would serve as the decisive catalyst.
基于Polymarket数据的AI实验性摘要。这不是交易建议,也不影响该市场的结算方式。 · 更新于$146,439 交易量
40%+
是
45%+
是
50%+
是
60%+
是
$146,439 交易量
40%+
是
45%+
是
50%+
是
60%+
是
This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered.
The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.
市场开放时间: Feb 6, 2026, 6:03 PM ET
Resolver
0x65070BE91...已提议结果: 是
无争议
最终结果: 是
This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered.
The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.
Resolver
0x65070BE91...已提议结果: 是
无争议
最终结果: 是
Google’s Gemini models hold the current lead on Epoch AI’s FrontierMath benchmark, with Gemini 3 Pro and 3.1 Pro variants posting roughly 38% on Tiers 1–3 and isolated Deep Think or preview runs exceeding 40%, ahead of GPT-5.2 and Claude Opus 4.6 equivalents. With the June 30 resolution date only weeks away, traders price a high probability that at least one Gemini version clears the 40% threshold, driven by the narrow gap and Google DeepMind’s ongoing inference optimizations and internal scaling experiments. Limited runway remains for a new flagship release or training run, while the benchmark’s focus on unpublished, expert-vetted problems makes rapid gains difficult; any official confirmation of a higher score before deadline would serve as the decisive catalyst.
基于Polymarket数据的AI实验性摘要。这不是交易建议,也不影响该市场的结算方式。 · 更新于
警惕外部链接哦。
警惕外部链接哦。
常见问题