HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

HolStep: A Machine Learning Dataset for Higher-order Logic Theorem Proving

Cezary Kaliszyk; François Chollet; Christian Szegedy

HolStep: A Machine Learning Dataset for Higher-order Logic Theorem Proving

Abstract

Large computer-understandable proofs consist of millions of intermediate logical steps. The vast majority of such steps originate from manually selected and manually guided heuristics applied to intermediate goals. So far, machine learning has generally not been used to filter or generate these steps. In this paper, we introduce a new dataset based on Higher-Order Logic (HOL) proofs, for the purpose of developing new machine learning-based theorem-proving strategies. We make this dataset publicly available under the BSD license. We propose various machine learning tasks that can be performed on this dataset, and discuss their significance for theorem proving. We also benchmark a set of simple baseline machine learning models suited for the tasks (including logistic regression, convolutional neural networks and recurrent neural networks). The results of our baseline models show the promise of applying machine learning to HOL theorem proving.

Benchmarks

BenchmarkMethodologyMetrics
automated-theorem-proving-on-holstepSiamese 1D CNN
Classification Accuracy: 0.82
automated-theorem-proving-on-holstepSiamese 1D CNN-LSTM
Classification Accuracy: 0.83
automated-theorem-proving-on-holstep-11D CNN
Classification Accuracy: 0.83
automated-theorem-proving-on-holstep-11D CNN-LSTM
Classification Accuracy: 0.83

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
HolStep: A Machine Learning Dataset for Higher-order Logic Theorem Proving | Papers | HyperAI