Question Answering On Strategyqa
评估指标
Accuracy
评测结果
各个模型在此基准测试上的表现结果
比较表格
模型名称 | Accuracy |
---|---|
rethinking-with-retrieval-faithful-large | 77.73 |
chain-of-action-faithful-and-multimodal | - |
transcending-scaling-laws-with-0-1-extra | 76.4 |
least-to-most-prompting-enables-complex | - |
模型 5 | 77.2 |
search-in-the-chain-towards-the-accurate | - |
chain-of-action-faithful-and-multimodal | - |
chain-of-action-faithful-and-multimodal | - |
chain-of-action-faithful-and-multimodal | - |
transcending-scaling-laws-with-0-1-extra | 76.6 |
transcending-scaling-laws-with-0-1-extra | 61.9 |
palm-2-technical-report-1 | 90.4 |