Explanation Generation on WHOOPS!
Evaluation Metrics
Human (%)
Evaluation Results
Performance of each model on this benchmark
Comparison Table
Model Name | Human (%) |
---|---|
breaking-common-sense-whoops-a-vision-and | 33 |
vlis-unimodal-language-models-guide | - |
breaking-common-sense-whoops-a-vision-and | 15 |
breaking-common-sense-whoops-a-vision-and | 27 |
vlis-unimodal-language-models-guide | - |
breaking-common-sense-whoops-a-vision-and | 68 |
breaking-common-sense-whoops-a-vision-and | 0 |