Audio Classification On Audio Set
Metrics
Mean AP
Results
Performance results of various models on this benchmark
Model Name | Mean AP | Paper Title | Repository |
---|---|---|---|
VAB-Encodec (Ours) | 38.7 | From Vision to Audio and Beyond: A Unified Model for Audio-Visual Representation and Generation | |
M2D-AS/0.7 | 48.5 | Masked Modeling Duo: Towards a Universal Audio Pre-training Framework | |
LHGNN | 46.6 | LHGNN: Local-Higher Order Graph Neural Networks For Audio Classification and Tagging | - |
0 of 3 row(s) selected.