Putnam Top 100 #2: Will any AI score in the top 100 Putnam scorers by start of 2026?
Jan 1 · 88% chance

Note that the Putnam exam happens every December.

  • The AI must score in the top 100 for a particular year.

  • By "top 100" I mean that its score must be >= the score of the 100th place scorer. (If 100th place is a tie, I'll use the tying score.)

  • If we know the details of the training data, then all of it must have been released prior to the release of the Putnam questions for that year. E.g., if ModelNet is run on the 2026 Putnam, it must be trained on data from before the date of the 2026 Putnam exam.

  • The AI does not have to be trained before the relevant exam, as long as the data predates the exam.

  • The scoring for the AI's exam must be done by the actual Putnam scorers, mathematicians who have been Putnam scorers, or mathematicians who are actively involved in competitive mathematics in some way. (E.g., a professor who runs a university's competitive team counts; a software engineer who did well in the Putnam 5 years ago does not.)

  • I may accept scoring that isn't blinded, but I reserve the right to ignore any scoring that's vaguely suspect/biased/etc.

  • Update 2025-12-10 (PST) (AI summary of creator comment): "Released" data means data that existed prior to the exam, not necessarily publicly available open-source data. The AI model does not need to be trained on open data, but all training data must have existed before the Putnam exam date to prevent test set contamination.
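To make the "top 100" bullet concrete, here is a minimal sketch of the threshold check as I read it. The function name `meets_top_100` and the sample scores are hypothetical, made up for illustration; they are not real Putnam results or anything the market creator specified.

```python
def meets_top_100(ai_score: int, human_scores: list[int], rank: int = 100) -> bool:
    """Return True if ai_score >= the score of the rank-th place human scorer.

    Hypothetical helper: it only encodes the ">= 100th place score" rule
    from the market description, nothing about grading or data cutoffs.
    """
    if len(human_scores) < rank:
        raise ValueError("need at least `rank` human scores")
    # Sort descending; the rank-th place score sits at index rank - 1.
    # If several contestants tie at that place, they all share this score,
    # so the "tying score" is used automatically.
    cutoff = sorted(human_scores, reverse=True)[rank - 1]
    return ai_score >= cutoff


# Made-up score list: 99 people at 50, five tied at 40, the rest at 10.
example_scores = [50] * 99 + [40] * 5 + [10] * 20
print(meets_top_100(40, example_scores))  # matches the tying score at 100th place
print(meets_top_100(39, example_scores))  # one point below the cutoff
```

Under this reading, an AI that exactly matches the 100th place score counts, even when several humans are tied there.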


Do the claimants need to provide evidence of scoring by sufficient specialists by the end of the year, or will they have indefinite time to provide it later?

bought Ṁ3,500 YES

Nous Research claims to have scored 87/120! https://x.com/NousResearch/status/1998536543565127968

Your requirements on training data seem weird.

The best open-data model we have is Olmo3, which is trash

@clementdupOz "Released" in the sense of "exists". It does not need to be an open data model, but it must be trained on data that existed prior to the 2025 Putnam exam. This requirement is to prevent any possibility of the test set getting into the training set.


Claim: DeepSeekMath-V2 hits gold-medal performance on Putnam. https://x.com/theturingpost/status/1994926897248288813?s=46

@SG that's not this year's putnam though right? whereas this is: https://x.com/axiommathai/status/1997767850279440715?s=20

(but we don't know rankings yet)

bought Ṁ250 NO

@Bayesian Any news on the scoring/grading? We would need bullet five to be verified.
