HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

WILDS: A Benchmark of in-the-Wild Distribution Shifts

WILDS: A Benchmark of in-the-Wild Distribution Shifts

Abstract

Distribution shifts -- where the training distribution differs from the test distribution -- can substantially degrade the accuracy of machine learning (ML) systems deployed in the wild. Despite their ubiquity in the real-world deployments, these distribution shifts are under-represented in the datasets widely used in the ML community today. To address this gap, we present WILDS, a curated benchmark of 10 datasets reflecting a diverse range of distribution shifts that naturally arise in real-world applications, such as shifts across hospitals for tumor identification; across camera traps for wildlife monitoring; and across time and location in satellite imaging and poverty mapping. On each dataset, we show that standard training yields substantially lower out-of-distribution than in-distribution performance. This gap remains even with models trained by existing methods for tackling distribution shifts, underscoring the need for new methods for training models that are more robust to the types of distribution shifts that arise in practice. To facilitate method development, we provide an open-source package that automates dataset loading, contains default model architectures and hyperparameters, and standardizes evaluations. Code and leaderboards are available at https://wilds.stanford.edu.

Code Repositories

skyve2012/DBA
pytorch
Mentioned in GitHub
tigrangalstyan/wilds
pytorch
Mentioned in GitHub
facebookresearch/DomainBed
pytorch
Mentioned in GitHub
p-lambda/wilds
Official
pytorch
Mentioned in GitHub
hlzhang109/ddg
pytorch
Mentioned in GitHub
qiaoruiyt/noiserobustdg
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
image-classification-on-iwildcam2020-wildsEmpirical Risk Minimization (ERM)
Accuracy (Top-1): 71.6

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp