Autoencoding Variational Inference For Topic Models

Akash Srivastava; Charles Sutton

Abstract

Topic models are one of the most popular methods for learning representations of text, but a major challenge is that any change to the topic model requires mathematically deriving a new inference algorithm. A promising approach to address this problem is autoencoding variational Bayes (AEVB), but it has proven difficult to apply to topic models in practice. We present what is to our knowledge the first effective AEVB-based inference method for latent Dirichlet allocation (LDA), which we call Autoencoded Variational Inference For Topic Model (AVITM). This model tackles the problems caused for AEVB by the Dirichlet prior and by component collapsing. We find that AVITM matches traditional methods in accuracy with much better inference time. Indeed, because of the inference network, we find that it is unnecessary to pay the computational cost of running variational optimization on test data. Because AVITM is black box, it is readily applied to new topic models. As a dramatic illustration of this, we present a new topic model called ProdLDA, which replaces the mixture model in LDA with a product of experts. By changing only one line of code from LDA, we find that ProdLDA yields much more interpretable topics, even if LDA is trained via collapsed Gibbs sampling.
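The "one line of code" difference between the LDA mixture and the ProdLDA product of experts can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the toy sizes K and V, and the random parameters are assumptions made for illustration. The sketch assumes that beta holds unnormalized topic-word weights and theta holds a document's topic proportions. In the mixture, each topic is normalized over the vocabulary first and the resulting distributions are averaged; in the product of experts, the weights are combined in log space and normalized once.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - np.max(x, axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def lda_word_probs(theta, beta):
    # Mixture of multinomials: normalize each topic over the vocabulary,
    # then take the theta-weighted average of the topic distributions.
    return theta @ softmax(beta, axis=1)

def prodlda_word_probs(theta, beta):
    # Product of experts: combine unnormalized topic weights in log space,
    # then normalize once over the vocabulary.
    return softmax(theta @ beta, axis=-1)

# Hypothetical toy sizes and random parameters, for illustration only.
rng = np.random.default_rng(0)
K, V = 3, 8                             # number of topics, vocabulary size
beta = rng.normal(size=(K, V))          # unnormalized topic-word weights
theta = softmax(rng.normal(size=K))     # topic proportions for one document

p_lda = lda_word_probs(theta, beta)
p_prod = prodlda_word_probs(theta, beta)
```

Both functions return a valid distribution over the vocabulary, and they coincide when theta puts all its mass on a single topic. They differ otherwise, because the product form can concentrate mass on words that several topics agree on.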

Code Repositories

shining-spring/nvlda (tf, mentioned in GitHub)
vlukiyanov/pt-avitm (pytorch, mentioned in GitHub)
is0383kk/Dirichlet_VAE (pytorch, mentioned in GitHub)
yjxiao/ProdLDA (pytorch, mentioned in GitHub)
mind-Lab/octis (tf, mentioned in GitHub)

Benchmarks

Benchmark | Methodology | Metrics
topic-models-on-20newsgroups | ProdLDA | C_v: 0.35
topic-models-on-ag-news | ProdLDA | C_v: 0.32, NPMI: -0.22
