Question Answering On Muld Hotpotqa

BLEU-1

BLEU-4

METEOR

Rouge-L

评测结果

各个模型在此基准测试上的表现结果

					Paper Title	Repository
Longformer	30.38	16.76	4.98	30.49	MuLD: The Multitask Long Document Benchmark
T5	28.11	13.63	4.46	27.61	MuLD: The Multitask Long Document Benchmark

0 of 2 row(s) selected.