Question Answering on QuAC
Evaluation Metrics
F1
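For orientation, F1 in extractive question answering is usually a word-overlap score between a predicted answer and a reference answer. The sketch below is a simplified illustration of that idea, not the official QuAC scorer: the function name `token_f1` is hypothetical, tokenization is assumed to be lowercase whitespace splitting, and handling of multiple references or unanswerable questions is left out.

```python
from collections import Counter


def token_f1(prediction: str, reference: str) -> float:
    """Word-overlap F1 between one prediction and one reference (simplified)."""
    # Assumption: lowercase + whitespace split stands in for real normalization.
    pred_tokens = prediction.lower().split()
    ref_tokens = reference.lower().split()

    # If either side is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)

    # Multiset intersection counts each shared token at most as often
    # as it appears on both sides.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0

    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    print(token_f1("in the park", "the park"))
```

A benchmark-level score would typically average such per-question values over the whole evaluation set and report the result as a percentage.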
Evaluation Results
Performance of each model on this benchmark
Model | F1 | Paper Title | Repository |
---|---|---|---|
GPT-3 175B (few-shot, k=32) | 44.3 | Language Models are Few-Shot Learners | |
FlowQA (single model) | 64.1 | FlowQA: Grasping Flow in History for Conversational Machine Comprehension | |