Visual Question Answering (VQA) on WHOOPS!
Evaluation Metrics
BEM
Exact Match
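
As a rough illustration of the Exact Match metric listed above, the following minimal sketch counts a prediction as correct when it equals any reference answer after normalization. The normalization steps (lowercasing, stripping punctuation, collapsing whitespace), the function names, and the 0 to 100 scale are assumptions made for this example; they are not taken from the benchmark's official evaluation code. BEM is a learned answer-equivalence metric and is not sketched here.

```python
import string
from typing import List, Sequence


def normalize(text: str) -> str:
    """Lowercase, strip ASCII punctuation, and collapse whitespace.

    This normalization is an assumption for illustration only.
    """
    text = text.lower()
    text = text.translate(str.maketrans("", "", string.punctuation))
    return " ".join(text.split())


def exact_match(prediction: str, references: Sequence[str]) -> bool:
    """Return True if the normalized prediction equals any normalized reference."""
    pred = normalize(prediction)
    return any(pred == normalize(ref) for ref in references)


def exact_match_score(predictions: Sequence[str],
                      references: Sequence[Sequence[str]]) -> float:
    """Percentage (0 to 100) of predictions that exactly match a reference."""
    if not predictions:
        return 0.0
    hits = sum(exact_match(p, refs) for p, refs in zip(predictions, references))
    return 100.0 * hits / len(predictions)
```

For example, `exact_match_score(["A dog.", "cat"], [["a dog"], ["a kitten"]])` counts the first answer as a match and the second as a miss.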
Evaluation Results
Performance of each model on this benchmark
Model Name | BEM | Exact Match | Paper Title | Repository |
---|---|---|---|---|
BLIP2 FlanT5-XXL (Text-only FT) | 24 | 4 | Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images | - |
BLIP2 FlanT5-XL (Fine-tuned) | 55 | 20 | Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images | - |
OFA Large | 38 | 8 | Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images | - |
BLIP Large | 39 | 6 | Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images | - |
BLIP2 FlanT5-XXL (Zero-shot) | 55 | 15 | Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images | - |
BLIP2 FlanT5-XXL (Fine-tuned) | 57 | 21 | Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images | - |