HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Overcoming catastrophic forgetting with hard attention to the task

Joan Serrà; Dídac Surís; Marius Miron; Alexandros Karatzoglou

Overcoming catastrophic forgetting with hard attention to the task

Abstract

Catastrophic forgetting occurs when a neural network loses the information learned in a previous task after training on subsequent tasks. This problem remains a hurdle for artificial intelligence systems with sequential learning capabilities. In this paper, we propose a task-based hard attention mechanism that preserves previous tasks' information without affecting the current task's learning. A hard attention mask is learned concurrently to every task, through stochastic gradient descent, and previous masks are exploited to condition such learning. We show that the proposed mechanism is effective for reducing catastrophic forgetting, cutting current rates by 45 to 80%. We also show that it is robust to different hyperparameter choices, and that it offers a number of monitoring capabilities. The approach features the possibility to control both the stability and compactness of the learned knowledge, which we believe makes it also attractive for online learning or network compression applications.

Code Repositories

chilung/hat
pytorch
Mentioned in GitHub
joansj/hat
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
continual-learning-on-20newsgroup-10-tasksHAT
F1 - macro: 0.9521
continual-learning-on-asc-19-tasksHAT
F1 - macro: 0.7816
continual-learning-on-dsc-10-tasksHAT
F1 - macro: 0.8614
continual-learning-on-f-celeba-10-tasksHAT
Acc: 0.5673

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Overcoming catastrophic forgetting with hard attention to the task | Papers | HyperAI