Command Palette
Search for a command to run...
Diederik P Kingma; Max Welling

Abstract
How can we perform efficient inference and learning in directed probabilistic models, in the presence of continuous latent variables with intractable posterior distributions, and large datasets? We introduce a stochastic variational inference and learning algorithm that scales to large datasets and, under some mild differentiability conditions, even works in the intractable case. Our contributions are two-fold. First, we show that a reparameterization of the variational lower bound yields a lower bound estimator that can be straightforwardly optimized using standard stochastic gradient methods. Second, we show that for i.i.d. datasets with continuous latent variables per datapoint, posterior inference can be made especially efficient by fitting an approximate inference model (also called a recognition model) to the intractable posterior using the proposed lower bound estimator. Theoretical advantages are reflected in experimental results.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| anomaly-detection-on-mvtec-loco-ad | VAE | Avg. Detection AUROC: 54.3 Detection AUROC (only logical): 53.8 Detection AUROC (only structural): 54.8 Segmentation AU-sPRO (until FPR 5%): 38.2 |
| image-clustering-on-cifar-10 | VAE | ARI: 0.168 Accuracy: 0.291 Backbone: VAE NMI: 0.245 Train set: Train+Test |
| image-clustering-on-cifar-100 | VAE | Accuracy: 0.152 NMI: 0.108 Train Set: Train+Test |
| image-clustering-on-imagenet-10 | VAE | Accuracy: 0.334 NMI: 0.193 |
| image-clustering-on-imagenet-dog-15 | VAE | Accuracy: 0.179 NMI: 0.107 |
| image-clustering-on-stl-10 | VAE | Accuracy: 0.282 NMI: 0.200 Train Split: Train+Test |
| image-clustering-on-tiny-imagenet | VAE | Accuracy: 0.036 NMI: 0.113 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.