Created on February 10, 2025
2025
We released a real-time benchmark for math reasoning models on AIME 2025!