HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

PHYRE: A New Benchmark for Physical Reasoning

Anton Bakhtin; Laurens van der Maaten; Justin Johnson; Laura Gustafson; Ross Girshick

PHYRE: A New Benchmark for Physical Reasoning

Abstract

Understanding and reasoning about physics is an important ability of intelligent agents. We develop the PHYRE benchmark for physical reasoning that contains a set of simple classical mechanics puzzles in a 2D physical environment. The benchmark is designed to encourage the development of learning algorithms that are sample-efficient and generalize well across puzzles. We test several modern learning algorithms on PHYRE and find that these algorithms fall short in solving the puzzles efficiently. We expect that PHYRE will encourage the development of novel sample-efficient agents that learn efficient but useful models of physics. For code and to play PHYRE for yourself, please visit https://player.phyre.ai.

Benchmarks

BenchmarkMethodologyMetrics
visual-reasoning-on-phyre-1b-crossDQN
AUCCESS: 36.8
visual-reasoning-on-phyre-1b-withinDQN
AUCCESS: 77.6

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp