HyperAI超神经

Question Answering on NarrativeQA

Evaluation Metrics

BLEU-1
BLEU-4
METEOR
Rouge-L
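
As a rough illustration of how a score such as Rouge-L is obtained, the sketch below computes an F-measure over the longest common subsequence (LCS) of a candidate answer and one reference answer. It is a minimal sketch, not the scoring script behind this leaderboard: the function names `lcs_length` and `rouge_l`, the lowercase whitespace tokenization, and the default `beta=1.0` are all assumptions made for illustration, and published implementations differ in tokenization, stemming, recall weighting, and how multiple references are handled.

```python
def lcs_length(a, b):
    """Length of the longest common subsequence of two token lists."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                # Tokens match: extend the LCS found without either token.
                curr.append(prev[j - 1] + 1)
            else:
                # No match: keep the better of dropping one token from either side.
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l(candidate, reference, beta=1.0):
    """LCS-based F-measure in [0, 1] for one candidate and one reference.

    Assumption: lowercase whitespace tokenization and beta=1.0 (balanced
    precision and recall); neither is taken from the leaderboard itself.
    """
    cand = candidate.lower().split()
    ref = reference.lower().split()
    if not cand or not ref:
        return 0.0
    lcs = lcs_length(cand, ref)
    if lcs == 0:
        return 0.0
    precision = lcs / len(cand)
    recall = lcs / len(ref)
    return (1 + beta**2) * precision * recall / (recall + beta**2 * precision)
```

Leaderboards like this one typically report such scores multiplied by 100 and averaged over all questions in the test set.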

Results

Performance of each model on this benchmark

| Model | BLEU-1 | BLEU-4 | METEOR | Rouge-L | Paper Title | Repository |
|---|---|---|---|---|---|---|
| Masque (NarrativeQA only) | 48.7 | 20.98 | 21.95 | 54.74 | Multi-style Generative Reading Comprehension | - |
| MHPGM + NOIC | 43.63 | 21.07 | 19.03 | 44.16 | Commonsense for Generative Multi-Hop Question Answering Tasks | |
| DecaProp | 44.35 | 27.61 | 21.80 | 44.69 | Densely Connected Attention Propagation for Reading Comprehension | |
| BERT-QA with Hard EM objective | - | - | - | 58.8 | A Discrete Hard EM Approach for Weakly Supervised Question Answering | |
| FiD+Distil | 35.3 | 7.5 | 11.1 | 32 | Distilling Knowledge from Reader to Retriever for Question Answering | |
| ConZNet | 42.76 | 22.49 | 19.24 | 46.67 | Cut to the Chase: A Context Zoom-in Network for Reading Comprehension | - |
| Masque (NarrativeQA + MS MARCO) | 54.11 | 30.43 | 26.13 | 59.87 | Multi-style Generative Reading Comprehension | - |
| Oracle IR Models | 54.60/55.55 | 26.71/27.78 | - | - | The NarrativeQA Reading Comprehension Challenge | |
| BiAttention + DCU-LSTM | 36.55 | 19.79 | 17.87 | 41.44 | Multi-Granular Sequence Encoding via Dilated Compositional Units for Reading Comprehension | - |
| BiDAF | 33.45 | 15.69 | 15.68 | 36.74 | Bidirectional Attention Flow for Machine Comprehension | |