OpenAI’s GPT-5.5 series currently leads FrontierMath leaderboards with scores of 51.7–52.4% on the main Tiers 1–4 benchmark of unpublished, research-level math problems, following incremental gains from GPT-5.4. Epoch AI released an updated v2 of the benchmark on June 12 that corrected errors in 42% of problems, potentially altering measured performance for all models. OpenAI funded the benchmark’s creation and retains exclusive access to a subset of problems and solutions, which has supported rapid iteration on its reasoning models. With the June 30 resolution deadline only weeks away, any new model release, API update, or refined evaluation scaffold from OpenAI could shift the highest reported GPT score before markets close.
Resumo experimental gerado por IA com dados do Polymarket. Isto não é aconselhamento de trading e não tem qualquer papel na resolução deste mercado. · Atualizado$43,059 Vol.
45%+
Yes
50%+
Yes
60%+
Yes
70%+
Yes
$43,059 Vol.
45%+
Yes
50%+
Yes
60%+
Yes
70%+
Yes
This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered.
The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.
Mercado Aberto: Jan 29, 2026, 12:47 PM ET
Resolver
0x65070BE91...Resultado proposto: Yes
Sem contestação
Resultado final: Yes
This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered.
The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.
Resolver
0x65070BE91...Resultado proposto: Yes
Sem contestação
Resultado final: Yes
OpenAI’s GPT-5.5 series currently leads FrontierMath leaderboards with scores of 51.7–52.4% on the main Tiers 1–4 benchmark of unpublished, research-level math problems, following incremental gains from GPT-5.4. Epoch AI released an updated v2 of the benchmark on June 12 that corrected errors in 42% of problems, potentially altering measured performance for all models. OpenAI funded the benchmark’s creation and retains exclusive access to a subset of problems and solutions, which has supported rapid iteration on its reasoning models. With the June 30 resolution deadline only weeks away, any new model release, API update, or refined evaluation scaffold from OpenAI could shift the highest reported GPT score before markets close.
Resumo experimental gerado por IA com dados do Polymarket. Isto não é aconselhamento de trading e não tem qualquer papel na resolução deste mercado. · Atualizado
Cuidado com os links externos.
Cuidado com os links externos.
Frequently Asked Questions