HyperAI超神经

Question Answering On Squad20

评估指标

EM
F1

评测结果

各个模型在此基准测试上的表现结果

模型名称
EM
F1
Paper TitleRepository
BISAN-CC (single model)80.20883.149--
bert (single model)79.97183.184--
PMI-Masking Random Baseline (single model)80.03882.796--
PwP+BERT (single model)80.11783.189--
Tuned BERT Large Cased (single model)82.80385.863--
BERT-Base-DT (single model)74.76977.706--
ELECTRA+RL+EV (single model)89.02191.765--
ALBERT (single model)88.10790.902ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
PMI-Masking Additional Data Random Baseline (single model)80.37783.262--
XLNET-123 (single model)86.43689.086--
mgrc75.34478.381--
SemBERT (single model)84.80087.864--
batch2 (single model)73.74276.858--
Candi-Net+BERT (single model)80.38882.908--
Fusion Adapters TriviaQA NQ Singl78.93381.863--
BERT+AC(single model)78.05281.174--
electra+nlayers+kdav (ensemble)90.00292.497--
BERT + ConvLSTM + MTL + Verifier (single model)84.92488.204--
Ensemble ALBERT-90.123Ensemble ALBERT on SQuAD 2.0
L6Net + BERT (single model)79.18182.259--
0 of 286 row(s) selected.