HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

DivideMix: Learning with Noisy Labels as Semi-supervised Learning

Junnan Li Richard Socher Steven C.H. Hoi

DivideMix: Learning with Noisy Labels as Semi-supervised Learning

Abstract

Deep neural networks are known to be annotation-hungry. Numerous efforts have been devoted to reducing the annotation cost when learning with deep networks. Two prominent directions include learning with noisy labels and semi-supervised learning by exploiting unlabeled data. In this work, we propose DivideMix, a novel framework for learning with noisy labels by leveraging semi-supervised learning techniques. In particular, DivideMix models the per-sample loss distribution with a mixture model to dynamically divide the training data into a labeled set with clean samples and an unlabeled set with noisy samples, and trains the model on both the labeled and unlabeled data in a semi-supervised manner. To avoid confirmation bias, we simultaneously train two diverged networks where each network uses the dataset division from the other network. During the semi-supervised training phase, we improve the MixMatch strategy by performing label co-refinement and label co-guessing on labeled and unlabeled samples, respectively. Experiments on multiple benchmark datasets demonstrate substantial improvements over state-of-the-art methods. Code is available at https://github.com/LiJunnan1992/DivideMix .

Code Repositories

jyansir/text2tree
pytorch
Mentioned in GitHub
LiJunnan1992/DivideMix
Official
pytorch

Benchmarks

BenchmarkMethodologyMetrics
image-classification-on-clothing1mDivideMix
Accuracy: 74.76%
image-classification-on-mini-webvision-1-0DivideMix (ResNet-50)
ImageNet Top-1 Accuracy: 74.42 ±0.29
ImageNet Top-5 Accuracy: 91.21 ±0.12
Top-1 Accuracy: 76.32 ±0.36
Top-5 Accuracy: 90.65 ±0.16
image-classification-on-mini-webvision-1-0DivideMix (ResNet-18)
Top-1 Accuracy: 76.08
image-classification-on-mini-webvision-1-0DivideMix (Inception-ResNet-v2)
ImageNet Top-1 Accuracy: 75.20
ImageNet Top-5 Accuracy: 91.64
Top-1 Accuracy: 77.32
Top-5 Accuracy: 91.64
learning-with-noisy-labels-on-cifar-100nDivide-Mix
Accuracy (mean): 71.13
learning-with-noisy-labels-on-cifar-10nDivide-Mix
Accuracy (mean): 95.01
learning-with-noisy-labels-on-cifar-10n-1Divide-Mix
Accuracy (mean): 90.18
learning-with-noisy-labels-on-cifar-10n-2Divide-Mix
Accuracy (mean): 90.90
learning-with-noisy-labels-on-cifar-10n-3Divide-Mix
Accuracy (mean): 89.97
learning-with-noisy-labels-on-cifar-10n-worstDivide-Mix
Accuracy (mean): 92.56

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp