
Putnam exam link. Note that it happens every December.
The AI must score in the top 100 for a particular year.
By "top 100" I mean that its score must be >= the score of the 100th place scorer. (If 100th place is a tie I'll use the tying score).
If we know the details of the training date, then the training data must all have been released prior to the release of the Putnam questions for that year. i.e. if ModelNet is run on the 2026 Putnam, it must be trained on data from before the date of the 2026 Putnam exam.
The AI does not have to be trained before the relevant exam, as long as the data predates the exam.
The scoring for the AI's exam must either be done by the actual Putnam scorers, mathematicians who have been Putnam scorers, or mathematicians who are actively involved in competitive mathematics in some way. (i.e. a professor who runs a university's competitive team counts, a software engineer who did well in the Putnam 5 years ago does not).
I may accept scoring that isn't blinded, but I reserve the right to ignore any scoring that's vaguely suspect/biased/etc.
Update 2025-12-10 (PST) (AI summary of creator comment): "Released" data means data that existed prior to the exam, not necessarily publicly available open-source data. The AI model does not need to be trained on open data, but all training data must have existed before the Putnam exam date to prevent test set contamination.
People are also trading
Nous Research claims to have scored 87/120! https://x.com/NousResearch/status/1998536543565127968
@clementdupOz "Released" in the sense of "exists". It does not need to be an open data model, but it must be trained on data that existed prior to the 2025 Putnam exam. This requirement is to prevent any possibility of the test set getting into the training set.
Claim: DeepSeekMath-V2 hits gold-medal performance on Putnam. https://x.com/theturingpost/status/1994926897248288813?s=46
@SG that's not this year's putnam though right? whereas this is: https://x.com/axiommathai/status/1997767850279440715?s=20
(but we don't know rankings yet)