Created in February 10, 2025
2025
A real-time benchmark has been released for math reasoning models on AIME 2025!