HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

PeTra: A Sparsely Supervised Memory Model for People Tracking

Shubham Toshniwal Allyson Ettinger Kevin Gimpel Karen Livescu

PeTra: A Sparsely Supervised Memory Model for People Tracking

Abstract

We propose PeTra, a memory-augmented neural network designed to track entities in its memory slots. PeTra is trained using sparse annotation from the GAP pronoun resolution dataset and outperforms a prior memory model on the task while using a simpler architecture. We empirically compare key modeling choices, finding that we can simplify several aspects of the design of the memory module while retaining strong performance. To measure the people tracking capability of memory models, we (a) propose a new diagnostic evaluation based on counting the number of unique entities in text, and (b) conduct a small scale human evaluation to compare evidence of people tracking in the memory logs of PeTra relative to a previous approach. PeTra is highly effective in both evaluations, demonstrating its ability to track people in its memory despite being trained with limited annotation.

Code Repositories

shtoshni92/petra
Official
pytorch

Benchmarks

BenchmarkMethodologyMetrics
coreference-resolution-on-gap-1PeTra
F1: 85.3

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp