HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

ProTo: Program-Guided Transformer for Program-Guided Tasks

Zhao Zelin ; Samel Karan ; Chen Binghong ; Song Le

ProTo: Program-Guided Transformer for Program-Guided Tasks

Abstract

Programs, consisting of semantic and structural information, play animportant role in the communication between humans and agents. Towards learninggeneral program executors to unify perception, reasoning, and decision making,we formulate program-guided tasks which require learning to execute a givenprogram on the observed task specification. Furthermore, we propose theProgram-guided Transformer (ProTo), which integrates both semantic andstructural guidance of a program by leveraging cross-attention and maskedself-attention to pass messages between the specification and routines in theprogram. ProTo executes a program in a learned latent space and enjoys strongerrepresentation ability than previous neural-symbolic approaches. We demonstratethat ProTo significantly outperforms the previous state-of-the-art methods onGQA visual reasoning and 2D Minecraft policy learning datasets. Additionally,ProTo demonstrates better generalization to unseen, complex, and human-writtenprograms.

Code Repositories

Benchmarks

BenchmarkMethodologyMetrics
visual-question-answering-on-gqa-test-stdProTo
Accuracy: 65.14

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp