Question Answering On Stepgame
评估指标
1-of-100 Accuracy
评测结果
各个模型在此基准测试上的表现结果
模型名称 | 1-of-100 Accuracy | Paper Title | Repository |
---|---|---|---|
TP-MANN | 52.99 | StepGame: A New Benchmark for Robust Multi-Hop Spatial Reasoning in Texts |
0 of 1 row(s) selected.
各个模型在此基准测试上的表现结果
模型名称 | 1-of-100 Accuracy | Paper Title | Repository |
---|---|---|---|
TP-MANN | 52.99 | StepGame: A New Benchmark for Robust Multi-Hop Spatial Reasoning in Texts |