HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Generative Language Modeling for Automated Theorem Proving

Stanislas Polu Ilya Sutskever

Generative Language Modeling for Automated Theorem Proving

Abstract

We explore the application of transformer-based language models to automated theorem proving. This work is motivated by the possibility that a major limitation of automated theorem provers compared to humans -- the generation of original mathematical terms -- might be addressable via generation from language models. We present an automated prover and proof assistant, GPT-f, for the Metamath formalization language, and analyze its performance. GPT-f found new short proofs that were accepted into the main Metamath library, which is to our knowledge, the first time a deep-learning based system has contributed proofs that were adopted by a formal mathematics community.

Benchmarks

BenchmarkMethodologyMetrics
automated-theorem-proving-on-metamath-setmmGPT-f
Percentage correct: 56.2

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Generative Language Modeling for Automated Theorem Proving | Papers | HyperAI