Anthropic/claude-3-5-sonnet | 74.23 | 82.3 | Claude 3.5 Sonnet Model Card Addendum | - |
OpenAI/o1-2024-12-17-high | 81.44 | 88.7 | 0/1 Deep Neural Networks via Block Coordinate Descent | - |
OpenAI/o3-mini-2025-01-31-high | 96.52 | 92.13 | o3-mini vs DeepSeek-R1: Which One is Safer? | - |