Question Answering on SberQuAD
Evaluation Metrics
EM
F1
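For readers unfamiliar with the two metrics, the following is a minimal sketch of how EM (exact match) and token-level F1 are commonly computed for extractive question answering in the SQuAD style. It is not the official SberQuAD scorer: the normalization used here (lowercasing, stripping ASCII punctuation, splitting on whitespace) and the function names `normalize`, `exact_match`, and `f1_score` are assumptions made for illustration. Leaderboard figures are typically these per-example scores averaged over the test set and reported as percentages.

```python
import string
from collections import Counter


def normalize(text: str) -> list[str]:
    """Assumed normalization: lowercase, drop ASCII punctuation, split on whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    return text.split()


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized answers are identical, else 0.0."""
    return float(normalize(prediction) == normalize(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Token-level F1: harmonic mean of precision and recall over shared tokens."""
    pred_tokens = normalize(prediction)
    ref_tokens = normalize(reference)
    # If either side is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most as often as it
    # appears in both answers.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)
```

Under this sketch, a prediction that contains the reference answer plus one extra token scores 0 on EM but still earns partial credit on F1, which is why the F1 column in the results is consistently higher than the EM column.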
Results
Performance of each model on this benchmark
Model | EM | F1 | Paper Title | Repository |
---|---|---|---|---|
DeepPavlov R-Net | 60.62 | 80.04 | SberQuAD -- Russian Reading Comprehension Dataset: Description and Analysis | - |
DeepPavlov multilingual BERT | 64.35 ± 0.39 | 83.39 ± 0.08 | SberQuAD -- Russian Reading Comprehension Dataset: Description and Analysis | - |
DeepPavlov RuBERT | 66.30 ± 0.24 | 84.60 ± 0.11 | SberQuAD -- Russian Reading Comprehension Dataset: Description and Analysis | - |