AI SOTA Benchmarks
Latest AI model performance metrics, GPU benchmarks, and cutting-edge papers
Categories
Browse tasks by category
AI Model Performance Benchmarks
Performance metrics of mainstream AI models across various tasks, showcasing the state-of-the-art technology
Image Classification
45 papers | 166 benchmarks
Semantic Segmentation
8 papers | 149 benchmarks
Object Detection
15 papers | 123 benchmarks
Few-Shot Image Classification
47 papers | 90 benchmarks
Image Generation
37 papers | 90 benchmarks
Question Answering
15 papers | 149 benchmarks
Text Classification
13 papers | 85 benchmarks
Machine Translation
15 papers | 83 benchmarks
Named Entity Recognition (NER)
41 papers | 77 benchmarks
Text Generation
14 papers | 71 benchmarks
Medical Image Segmentation
36 papers | 48 benchmarks
Drug Discovery
28 papers | 28 benchmarks
Sleep Stage Detection
34 papers | 16 benchmarks
Within-Session ERP
1 papers | 15 benchmarks
Text-to-Image Generation
23 papers | 13 benchmarks
Image Super-Resolution
10 papers | 68 benchmarks
Recommendation Systems
7 papers | 55 benchmarks
Click-Through Rate Prediction
23 papers | 20 benchmarks
Molecular Property Prediction
29 papers | 18 benchmarks
Age Estimation
42 papers | 16 benchmarks
Object Detection
15 papers | 123 benchmarks
Anomaly Detection
29 papers | 76 benchmarks
Classification
49 papers | 71 benchmarks
Domain Adaptation
37 papers | 58 benchmarks
Unsupervised Domain Adaptation
27 papers | 41 benchmarks
Time Series Forecasting
49 papers | 86 benchmarks
Time Series Classification
13 papers | 52 benchmarks
Traffic Prediction
11 papers | 33 benchmarks
Pose Estimation
39 papers | 31 benchmarks
GLinear
1 papers | 19 benchmarks
Node Classification
42 papers | 127 benchmarks
Link Prediction
36 papers | 80 benchmarks
Graph Classification
7 papers | 72 benchmarks
Image Super-Resolution
10 papers | 68 benchmarks
Graph Regression
40 papers | 17 benchmarks
Audio Classification
44 papers | 26 benchmarks
Beat Tracking
18 papers | 15 benchmarks
Downbeat Tracking
11 papers | 13 benchmarks
Few-Shot Audio Classification
8 papers | 10 benchmarks
Bandwidth Extension
45 papers | 6 benchmarks
Speech Recognition
23 papers | 148 benchmarks
Speech Separation
49 papers | 19 benchmarks
Speaker Diarization
12 papers | 15 benchmarks
Speech Emotion Recognition
4 papers | 15 benchmarks
Speech Enhancement
39 papers | 14 benchmarks
Common Sense Reasoning
45 papers | 24 benchmarks
Zero-Shot Video Question Answer
25 papers | 16 benchmarks
Math Word Problem Solving
6 papers | 13 benchmarks
Visual Reasoning
24 papers | 12 benchmarks
3D Human Reconstruction
48 papers | 10 benchmarks
Semantic Segmentation
8 papers | 149 benchmarks
Code Generation
31 papers | 26 benchmarks
Deblurring
4 papers | 16 benchmarks
Text-To-SQL
23 papers | 10 benchmarks
Source Code Summarization
7 papers | 9 benchmarks
Semantic Segmentation
8 papers | 149 benchmarks
Fraud Detection
15 papers | 12 benchmarks
DeepFake Detection
5 papers | 11 benchmarks
Omniverse Isaac Gym
4 papers | 6 benchmarks
Visual Navigation
2 papers | 6 benchmarks
Math Word Problem Solving
6 papers | 13 benchmarks
Text-to-Image Generation
23 papers | 13 benchmarks
Entity Alignment
38 papers | 10 benchmarks
Multi-modal Entity Alignment
18 papers | 8 benchmarks
Document Summarization
46 papers | 7 benchmarks
Continuous Control
21 papers | 73 benchmarks
Image Super-Resolution
10 papers | 68 benchmarks
Atari Games
11 papers | 64 benchmarks
OpenAI Gym
23 papers | 17 benchmarks
SMAC+
18 papers | 16 benchmarks
DeepFake Detection
5 papers | 11 benchmarks
Music Transcription
40 papers | 6 benchmarks
Audio Super-Resolution
22 papers | 4 benchmarks
Cover song identification
18 papers | 4 benchmarks
Music Auto-Tagging
19 papers | 4 benchmarks
Open-Domain Question Answering
30 papers | 15 benchmarks
Face Detection
25 papers | 13 benchmarks
Handwritten Text Recognition
32 papers | 13 benchmarks
Adversarial Defense
34 papers | 10 benchmarks
Adversarial Robustness
5 papers | 7 benchmarks
multimodal
79 papers | 79 benchmarks
reasoning
61 papers | 57 benchmarks
understanding
47 papers | 48 benchmarks
other
35 papers | 33 benchmarks
knowledge
27 papers | 30 benchmarks
GPU Benchmarks
Latest GPU hardware and software performance evaluations to help you make informed hardware choices
Software Performance
DeepSeek-R1-Distill-Qwen-7B
Environment: vllm
DeepSeek-R1-Distill-Llama-8B
Environment: vllm
DeepSeek-R1-Distill-Qwen-14B
Environment: vllm
DeepSeek-R1-Distill-Qwen-32B
Environment: vllm
DeepSeek-R1-Distill-Llama-70B
Environment: vllm
DeepSeek-R1-Distill-Qwen-7B
Environment: sglang
DeepSeek-R1-Distill-Llama-8B
Environment: sglang
DeepSeek-R1-Distill-Qwen-14B
Environment: sglang
DeepSeek-R1-Distill-Qwen-32B
Environment: sglang
DeepSeek-R1-Distill-Llama-70B
Environment: sglang