
Question Answering on MuLD HotpotQA

Evaluation Metrics

BLEU-1
BLEU-4
METEOR
Rouge-L
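
As a rough illustration of what two of these metrics measure, the sketch below computes a clipped unigram precision (the core of BLEU-1, without the brevity penalty) and a balanced ROUGE-L F1 based on the longest common subsequence. The function names, whitespace tokenization, and the balanced F1 weighting are assumptions made for this example; the scoring scripts actually used for the benchmark may differ, and these functions return values between 0 and 1.

```python
from collections import Counter


def unigram_precision(candidate: str, reference: str) -> float:
    """Clipped unigram precision: the core of BLEU-1, without brevity penalty."""
    cand_tokens = candidate.split()
    if not cand_tokens:
        return 0.0
    ref_counts = Counter(reference.split())
    cand_counts = Counter(cand_tokens)
    # Each candidate token is credited at most as often as it occurs in the reference.
    overlap = sum(min(count, ref_counts[tok]) for tok, count in cand_counts.items())
    return overlap / len(cand_tokens)


def lcs_length(a: list, b: list) -> int:
    """Length of the longest common subsequence, using a rolling DP row."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l_f1(candidate: str, reference: str) -> float:
    """Balanced F1 over LCS-based precision and recall (an assumed weighting)."""
    cand_tokens, ref_tokens = candidate.split(), reference.split()
    lcs = lcs_length(cand_tokens, ref_tokens)
    if lcs == 0:
        return 0.0
    precision = lcs / len(cand_tokens)
    recall = lcs / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    hyp = "the cat sat on the mat"
    ref = "the cat is on the mat"
    print(unigram_precision(hyp, ref))
    print(rouge_l_f1(hyp, ref))
```

BLEU-4 extends the same clipping idea to n-grams up to length 4 and combines the precisions with a brevity penalty, while METEOR additionally aligns stems and synonyms; neither is covered by this sketch.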

Evaluation Results

Performance of each model on this benchmark

Model       BLEU-1  BLEU-4  METEOR  Rouge-L  Paper Title                                  Repository
Longformer  30.38   16.76   4.98    30.49    MuLD: The Multitask Long Document Benchmark  -
T5          28.11   13.63   4.46    27.61    MuLD: The Multitask Long Document Benchmark  -