Self-Guided Masked Autoencoders for Domain-Agnostic Self-Supervised Learning
Johnathan Xie Yoonho Lee Annie S. Chen Chelsea Finn

Abstract
Self-supervised learning excels at learning representations from large amounts of unlabeled data, demonstrating success across multiple data modalities. Yet, extending self-supervised learning to new modalities is non-trivial because the specifics of existing methods are tailored to each domain, such as domain-specific augmentations which reflect the invariances in the target task. While masked modeling is promising as a domain-agnostic framework for self-supervised learning because it does not rely on input augmentations, its mask sampling procedure remains domain-specific. We present Self-Guided Masked Autoencoders (SMA), a fully domain-agnostic masked modeling method. SMA trains an attention-based model with a masked modeling objective while learning masks to sample without any domain-specific assumptions. We evaluate SMA on three self-supervised learning benchmarks in protein biology, chemical property prediction, and particle physics. We find that SMA learns representations without domain-specific knowledge and achieves state-of-the-art performance on all three benchmarks.
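As a rough illustration of the idea described above, the sketch below trains a small masked autoencoder whose mask positions are chosen from the model's own attention scores rather than a domain-specific sampling rule. This is a minimal sketch, not the authors' released implementation; the module names, the top-k masking rule, the mask ratio, and the mean-squared reconstruction loss are assumptions made for the example.

```python
# Sketch of attention-guided masked modeling over generic tokenized inputs.
# Illustrative only: not the SMA implementation described in the paper.
import torch
import torch.nn as nn
import torch.nn.functional as F


class AttentionGuidedMAE(nn.Module):
    def __init__(self, input_dim: int, hidden_dim: int = 128, n_heads: int = 4,
                 n_layers: int = 2, mask_ratio: float = 0.5):
        super().__init__()
        self.mask_ratio = mask_ratio
        self.embed = nn.Linear(input_dim, hidden_dim)
        self.mask_token = nn.Parameter(torch.zeros(hidden_dim))
        layer = nn.TransformerEncoderLayer(hidden_dim, n_heads,
                                           dim_feedforward=2 * hidden_dim,
                                           batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, n_layers)
        # A single attention pass used only to score tokens for masking.
        self.scoring_attn = nn.MultiheadAttention(hidden_dim, n_heads,
                                                  batch_first=True)
        self.decoder = nn.Linear(hidden_dim, input_dim)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq_len, input_dim) -- generic tokens, no
        # modality-specific preprocessing or augmentations assumed.
        h = self.embed(x)
        # Score each token by the attention it receives (averaged over
        # queries and heads); the highest-scoring tokens are masked.
        _, attn_weights = self.scoring_attn(h, h, h, need_weights=True)
        token_scores = attn_weights.mean(dim=1)          # (batch, seq_len)
        n_mask = max(1, int(self.mask_ratio * x.size(1)))
        mask_idx = token_scores.topk(n_mask, dim=-1).indices
        mask = torch.zeros(x.shape[:2], dtype=torch.bool, device=x.device)
        mask.scatter_(1, mask_idx, True)
        # Replace masked positions with a learned mask token, encode, decode.
        h_masked = torch.where(mask.unsqueeze(-1),
                               self.mask_token.expand_as(h), h)
        recon = self.decoder(self.encoder(h_masked))
        # Reconstruction loss is computed only on the masked positions.
        return F.mse_loss(recon[mask], x[mask])


if __name__ == "__main__":
    model = AttentionGuidedMAE(input_dim=16)
    tokens = torch.randn(8, 32, 16)                      # dummy unlabeled batch
    loss = model(tokens)
    loss.backward()
    print(f"masked reconstruction loss: {loss.item():.4f}")
```

Because the mask is derived from the model's own attention rather than patch grids, span rules, or other modality-specific heuristics, the same training loop can, in principle, be applied unchanged to protein sequences, molecular graphs serialized as token sequences, or particle-physics data.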
Benchmarks
| Benchmark | Method | Metric |
|---|---|---|
| molecular-property-prediction-on | SMA | RMSE: 0.609 |
| molecular-property-prediction-on-bace-1 | SMA | ROC-AUC: 84.3 |
| molecular-property-prediction-on-bbbp-1 | SMA | ROC-AUC: 75.0 |
| molecular-property-prediction-on-esol | SMA | RMSE: 0.623 |
| molecular-property-prediction-on-freesolv | SMA | RMSE: 1.09 |
| molecular-property-prediction-on-hiv-dataset | SMA | AUC: 0.789 |