Question Answering On Convfinqa
评估指标
Execution Accuracy
评测结果
各个模型在此基准测试上的表现结果
模型名称 | Execution Accuracy | Paper Title | Repository |
---|---|---|---|
General Crowd | 46.90 | Are ChatGPT and GPT-4 General-Purpose Solvers for Financial Text Analytics? A Study on Several Typical Tasks | - |
FinQANet (RoBERTa-large) | 68.9 | ConvFinQA: Exploring the Chain of Numerical Reasoning in Conversational Finance Question Answering | |
GPT-4 (8k) | 76.48 | Are ChatGPT and GPT-4 General-Purpose Solvers for Financial Text Analytics? A Study on Several Typical Tasks | - |
0 of 3 row(s) selected.