Explanation Generation On Vcr
评估指标
Human Explanation Rating
评测结果
各个模型在此基准测试上的表现结果
比较表格
模型名称 | Human Explanation Rating |
---|---|
harnessing-the-power-of-multi-task | 68.9 |
harnessing-the-power-of-multi-task | 77.3 |
各个模型在此基准测试上的表现结果
模型名称 | Human Explanation Rating |
---|---|
harnessing-the-power-of-multi-task | 68.9 |
harnessing-the-power-of-multi-task | 77.3 |