HyperAI超神经

Automated Theorem Proving On Holist Benchmark

评估指标

Percentage correct

评测结果

各个模型在此基准测试上的表现结果

比较表格
模型名称Percentage correct
holist-an-environment-for-machine-learning-of32.65
learning-to-reason-in-large-theories-without36.55
holist-an-environment-for-machine-learning-of38.88
graph-representations-for-higher-order-logic49.95