HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Learning Associative Inference Using Fast Weight Memory

Imanol Schlag Tsendsuren Munkhdalai Jürgen Schmidhuber

Learning Associative Inference Using Fast Weight Memory

Abstract

Humans can quickly associate stimuli to solve problems in novel contexts. Our novel neural network model learns state representations of facts that can be composed to perform such associative inference. To this end, we augment the LSTM model with an associative memory, dubbed Fast Weight Memory (FWM). Through differentiable operations at every step of a given input sequence, the LSTM updates and maintains compositional associations stored in the rapidly changing FWM weights. Our model is trained end-to-end by gradient descent and yields excellent performance on compositional language reasoning problems, meta-reinforcement-learning for POMDPs, and small-scale word-level language modelling.

Code Repositories

Benchmarks

BenchmarkMethodologyMetrics
language-modelling-on-penn-treebank-wordAWD-FWM Schlag et al. (2020)
Params: 24M
Test perplexity: 54.48
Validation perplexity: 56.76
language-modelling-on-wikitext-2AWD-FWM Schlag et al. (2020)
Number of params: 37M
Test perplexity: 61.65
Validation perplexity: 54.48
question-answering-on-catbabi-lm-modeAWD-LSTM
Accuracy (mean): 80.15%
question-answering-on-catbabi-lm-modeMetalearned Neural Memory (plastic)
Accuracy (mean): 69.3%
question-answering-on-catbabi-lm-modeFast Weight Memory
Accuracy (mean): 93.04%
question-answering-on-catbabi-lm-modeAWD-Transformer XL
Accuracy (mean): 90.23%
question-answering-on-catbabi-qa-modeMetalearned Neural Memory (plastic)
1:1 Accuracy: 88.97%
question-answering-on-catbabi-qa-modeAWD-LSTM
1:1 Accuracy: 80.88%
question-answering-on-catbabi-qa-modeAWD-Transformer XL
1:1 Accuracy: 87.66%
question-answering-on-catbabi-qa-modeFast Weight Memory
1:1 Accuracy: 96.75%

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp