HyperAI超神经

Question Answering on FinQA

Evaluation Metrics

Execution Accuracy
Program Accuracy

Evaluation Results

Performance of each model on this benchmark

Comparison Table
Model Name                                     Execution Accuracy  Program Accuracy
elastic-numerical-reasoning-with-adaptive      68.96               65.21
finqa-a-dataset-of-numerical-reasoning-over    57.43               55.52
finqa-a-dataset-of-numerical-reasoning-over    65.05               63.52
are-chatgpt-and-gpt-4-general-purpose-solvers  68.79               -
apollo-an-optimized-training-approach-for      71.07               68.94
finqa-a-dataset-of-numerical-reasoning-over    53.71               51.71