HyperAI超神经

Question Answering On Mapeval Api 1

评估指标

Accuracy (%)

评测结果

各个模型在此基准测试上的表现结果

模型名称
Accuracy (%)
Paper TitleRepository
GPT-3.5-Turbo (Chameleon)49.33MapEval: A Map-Based Evaluation of Geo-Spatial Reasoning in Foundation Models-
Claude-3.5-Sonnet (ReAct)64.00MapEval: A Map-Based Evaluation of Geo-Spatial Reasoning in Foundation Models-
0 of 2 row(s) selected.