Automated Theorem Proving on miniF2F 1
Evaluation Metrics
Pass@64
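The page does not define the metric, so the following is only a minimal sketch. It assumes Pass@64 is the percentage of benchmark problems for which at least one of up to 64 proof attempts succeeds. The function name `pass_at_k`, the input layout, and the toy theorem names are hypothetical and are not taken from any listed paper.

```python
from typing import Dict, List


def pass_at_k(attempts: Dict[str, List[bool]], k: int = 64) -> float:
    """Return the percentage of problems solved within the first k attempts.

    `attempts` maps a problem name to the ordered outcomes of its proof
    attempts (True = a verified proof was found). This layout is an
    assumption made for illustration.
    """
    if not attempts:
        raise ValueError("no problems given")
    # A problem counts as solved if any of its first k attempts succeeded.
    solved = sum(1 for outcomes in attempts.values() if any(outcomes[:k]))
    return 100.0 * solved / len(attempts)


# Hypothetical toy data, not real benchmark results.
toy = {
    "thm_a": [False, True, False],    # solved on the 2nd attempt
    "thm_b": [False, False, False],   # never solved
    "thm_c": [True],                  # solved on the 1st attempt
    "thm_d": [False] * 64 + [True],   # solved only on attempt 65, outside the budget
}

score = pass_at_k(toy, k=64)
print(f"Pass@64 = {score:.1f}")
```

Under this reading, a score such as 42.5 would mean that 42.5% of the problems were proved within the 64-attempt budget.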
Evaluation Results
Performance of each model on this benchmark
Comparison Table
Model name | Pass@64 |
---|---|
hypertree-proof-search-for-neural-theorem | 32.1 |
hypertree-proof-search-for-neural-theorem | 33.6 |
hypertree-proof-search-for-neural-theorem | 30.6 |
hypertree-proof-search-for-neural-theorem | 42.5 |