Video Question Answering On Intentqa
评估指标
Accuarcy
CH
CW
TPu0026TN
评测结果
各个模型在此基准测试上的表现结果
模型名称 | Accuarcy | CH | CW | TPu0026TN | Paper Title | Repository |
---|---|---|---|---|---|---|
VideoChat2_mistral | 81.9 | 86.9 | 82.6 | 77.0 | MVBench: A Comprehensive Multi-modal Video Understanding Benchmark | |
HQGA | 47.7 | 54.3 | 48.2 | 41.7 | Video as Conditional Graph Hierarchy for Multi-Granular Question Answering | |
VGT | 51.3 | 56.0 | 51.4 | 47.6 | Video Graph Transformer for Video Question Answering | |
IntentQA | 57.6 | 65.5 | 58.4 | 50.5 | IntentQA: Context-aware Video Intent Reasoning | |
VideoChat2_HD_mistral | 83.4 | 90.0 | 84.0 | 77.3 | MVBench: A Comprehensive Multi-modal Video Understanding Benchmark | |
Human | 78.5 | 80.2 | 77.8 | 79.1 | IntentQA: Context-aware Video Intent Reasoning |
0 of 6 row(s) selected.