HyperAI

Open-Domain Question Answering on KILT: ELI5

Metrics

F1
KILT-F1
KILT-RL
R-Prec
ROUGE-L
Recall@5
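
As a rough illustration of how these metrics relate, the sketch below computes simplified versions of F1, R-Prec, Recall@5 and KILT-F1 for a single example. It assumes whitespace tokenization and a single reference answer. The official KILT evaluation scripts also normalize text and handle multiple references and provenance sets, so this is not a reimplementation of them. The function names are hypothetical, and ROUGE-L and KILT-RL are omitted for brevity. Scores are returned in the range 0 to 1, whereas the leaderboard reports them as percentages.

```python
from collections import Counter
from typing import Sequence, Set


def token_f1(prediction: str, reference: str) -> float:
    """Token-overlap F1 on lowercased, whitespace-split text (simplified)."""
    pred_tokens = prediction.lower().split()
    ref_tokens = reference.lower().split()
    # Multiset intersection counts each shared token at most min(count) times.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


def r_precision(ranked_pages: Sequence[str], gold_pages: Set[str]) -> float:
    """Share of the top-R retrieved pages that are gold, with R = number of gold pages."""
    r = len(gold_pages)
    if r == 0:
        return 0.0
    return sum(page in gold_pages for page in ranked_pages[:r]) / r


def recall_at_k(ranked_pages: Sequence[str], gold_pages: Set[str], k: int = 5) -> float:
    """Share of gold pages that appear among the top-k retrieved pages."""
    if not gold_pages:
        return 0.0
    return len(set(ranked_pages[:k]) & gold_pages) / len(gold_pages)


def kilt_f1(prediction: str, reference: str,
            ranked_pages: Sequence[str], gold_pages: Set[str]) -> float:
    """F1 that is awarded only when retrieval is perfect (R-Prec equals 1)."""
    if r_precision(ranked_pages, gold_pages) == 1.0:
        return token_f1(prediction, reference)
    return 0.0
```

Because the KILT-prefixed scores are gated on perfect retrieval, they can never exceed their ungated counterparts. This is consistent with the table below, where KILT-F1 and KILT-RL are much lower than F1 and ROUGE-L.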

Results

Performance results of various models on this benchmark

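| Model Name | F1 | KILT-F1 | KILT-RL | R-Prec | ROUGE-L | Recall@5 | Paper Title | Repository |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| T5-base | 16.1 | 0.0 | 0.0 | 0.0 | 19.08 | 0.0 | KILT: a Benchmark for Knowledge Intensive Language Tasks | |
| GENRE | 0.0 | 0.0 | 0.0 | 15.83 | 0.0 | 25.49 | - | - |
| chriskuei | 0.0 | 0.0 | 0.0 | 17.5 | 0.0 | 25.54 | - | - |
| Wikipedia | 15.91 | 2.38 | 2.46 | 14.83 | 16.45 | 27.69 | - | - |
| RAG | 14.51 | 1.79 | 1.69 | 11.0 | 14.05 | 22.92 | - | - |
| arxiv.org/abs/2103.06332 | 22.88 | 2.34 | 2.36 | 10.67 | 23.19 | 24.56 | Hurdles to Progress in Long-form Question Answering | |
| BART | 19.23 | 0.0 | 0.0 | 0.0 | 20.55 | 0.0 | - | - |
| Training Set Retrieval (top 1) | 21.62 | 0.0 | 0.0 | 0.0 | 18.66 | 0.0 | - | - |
| Sphere | 15.29 | 0.0 | 0.0 | 0.0 | 15.76 | 0.0 | - | - |
| somebody | 27.13 | 3.0 | 2.62 | 10.83 | 24.53 | 27.25 | - | - |
| TABi | 0.0 | 0.0 | 0.0 | 18.33 | 0.0 | 28.21 | - | - |
| multi-task small | 16.4 | 0.0 | 0.0 | 0.0 | 17.67 | 0.0 | - | - |
| BART + DPR | 17.88 | 2.01 | 1.9 | 10.67 | 17.41 | 26.92 | - | - |
| Input Copying | 14.8 | 0.0 | 0.0 | 0.0 | 16.88 | 0.0 | - | - |
| Random Training Set Answer | 17.07 | 0.0 | 0.0 | 0.0 | 15.45 | 0.0 | - | - |
| Multi-task DPR | 0.0 | 0.0 | 0.0 | 15.5 | 0.0 | 27.51 | - | - |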