Visual Question Answering (VQA) on ActivityNet
Evaluation Metrics
ClipMatch@1
ClipMatch@5
Contains
ExactMatch
Follow-up ClipMatch@1
Follow-up ClipMatch@5
Follow-up Contains
Follow-up ExactMatch
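
This page does not define the metrics listed above. As a rough illustration only, the sketch below assumes that ExactMatch checks whether the model's answer equals the ground-truth label and that Contains checks whether the label appears as a substring of the answer, both after a simple normalization. The function names (`normalize`, `exact_match`, `contains`, `score`) and the normalization itself are hypothetical and are not taken from the benchmark's code. ClipMatch is left out because its definition cannot be inferred from this page.

```python
def normalize(text: str) -> str:
    """Lowercase and collapse whitespace (an assumed normalization)."""
    return " ".join(text.lower().split())


def exact_match(prediction: str, label: str) -> bool:
    """Assumed ExactMatch: the answer equals the label after normalization."""
    return normalize(prediction) == normalize(label)


def contains(prediction: str, label: str) -> bool:
    """Assumed Contains: the label occurs as a substring of the answer."""
    return normalize(label) in normalize(prediction)


def score(pairs, metric) -> float:
    """Percentage of (prediction, label) pairs for which the metric holds."""
    pairs = list(pairs)
    if not pairs:
        return 0.0
    hits = sum(1 for prediction, label in pairs if metric(prediction, label))
    return 100.0 * hits / len(pairs)
```

Under these assumptions Contains can never score lower than ExactMatch, since every exact match is also a substring match. That ordering is consistent with the Contains and ExactMatch columns reported for the model in the results table.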
Evaluation Results
Performance of each model on this benchmark
Model Name | ClipMatch@1 | ClipMatch@5 | Contains | ExactMatch | Follow-up ClipMatch@1 | Follow-up ClipMatch@5 | Follow-up Contains | Follow-up ExactMatch | Paper Title | Repository |
---|---|---|---|---|---|---|---|---|---|---|
BLIP-2 T5 | 53.39 | 74.71 | 15.70 | 7.07 | 62.02 | 75.13 | 18.09 | 8.84 | Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchy | |