Question Answering on QuALITY
Evaluation Metric
Accuracy
Results
Performance of each model on this benchmark
Model | Accuracy | Paper Title | Repository |
---|---|---|---|
Claude Instant 1.1 (5-shot) | 80.5 | Model Card and Evaluations for Claude Models | - |
Claude 1.3 (5-shot) | 84.1 | Model Card and Evaluations for Claude Models | - |
Claude 2 (5-shot) | 83.2 | Model Card and Evaluations for Claude Models | - |
RAPTOR + GPT-4 (June 2023) | 82.6 | RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval |