WaveMix: A Resource-efficient Neural Network for Image Analysis

Pranav Jeevan; Kavitha Viswanathan; Anandu A S; Amit Sethi

Abstract

We propose WaveMix, a novel neural architecture for computer vision that is resource-efficient yet generalizable and scalable. While using fewer trainable parameters, less GPU RAM, and fewer computations, WaveMix networks achieve comparable or better accuracy than state-of-the-art convolutional neural networks, vision transformers, and token mixers on several tasks. This efficiency can translate to savings in time, cost, and energy. To achieve these gains, we use a multi-level two-dimensional discrete wavelet transform (2D-DWT) in WaveMix blocks, which has the following advantages: (1) it reorganizes spatial information based on three strong image priors (scale-invariance, shift-invariance, and sparseness of edges); (2) it does so in a lossless manner without adding parameters; (3) it reduces the spatial size of feature maps, which reduces the memory and time required for forward and backward passes; and (4) it expands the receptive field faster than convolutions do. The whole architecture is a stack of self-similar and resolution-preserving WaveMix blocks, which allows architectural flexibility for various tasks and levels of resource availability. WaveMix establishes new benchmarks for segmentation on Cityscapes and for classification on Galaxy10 DECaLS, Places-365, five EMNIST datasets, and iNAT-mini, and it performs competitively on other benchmarks. Our code and trained models are publicly available.
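The properties the abstract attributes to the 2D-DWT (no trainable parameters, lossless, halved spatial size) can be illustrated with a minimal NumPy sketch of a single-level transform. This is not the authors' implementation: the abstract does not name a wavelet, so the Haar wavelet is assumed here, and the function names `haar_dwt2` and `haar_idwt2` are hypothetical.

```python
import numpy as np


def haar_dwt2(x):
    """Single-level 2D Haar DWT of an (H, W) array with even H and W.

    Assumption: Haar wavelet (the abstract does not specify one).
    Returns four (H/2, W/2) sub-bands; no trainable parameters are involved.
    """
    a = x[0::2, 0::2]  # top-left pixel of every 2x2 block
    b = x[0::2, 1::2]  # top-right
    c = x[1::2, 0::2]  # bottom-left
    d = x[1::2, 1::2]  # bottom-right
    approx = (a + b + c + d) / 2.0    # scaled local average
    col_diff = (a - b + c - d) / 2.0  # left column minus right column
    row_diff = (a + b - c - d) / 2.0  # top row minus bottom row
    diag = (a - b - c + d) / 2.0      # diagonal difference
    return approx, col_diff, row_diff, diag


def haar_idwt2(approx, col_diff, row_diff, diag):
    """Invert haar_dwt2 exactly, recovering the (H, W) input."""
    h, w = approx.shape
    x = np.empty((2 * h, 2 * w), dtype=approx.dtype)
    x[0::2, 0::2] = (approx + col_diff + row_diff + diag) / 2.0
    x[0::2, 1::2] = (approx - col_diff + row_diff - diag) / 2.0
    x[1::2, 0::2] = (approx + col_diff - row_diff - diag) / 2.0
    x[1::2, 1::2] = (approx - col_diff - row_diff + diag) / 2.0
    return x


# Toy 8x8 "feature map" for demonstration only.
rng = np.random.default_rng(0)
feature_map = rng.standard_normal((8, 8))

bands = haar_dwt2(feature_map)
print([band.shape for band in bands])                  # each band is 4x4
print(np.allclose(haar_idwt2(*bands), feature_map))    # reconstruction check

# A second level re-applies the transform to the approximation band,
# halving the spatial size again.
level2 = haar_dwt2(bands[0])
print(level2[0].shape)
```

Each level turns one H x W map into four maps of size H/2 x W/2, so the number of values is unchanged while every output value summarizes a 2x2 input region. Repeating the transform on the approximation band doubles that region per level, which is one way to read the abstract's point about the receptive field growing faster than with stacked convolutions.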

Code Repositories

pranavphoenix/WaveMix (official; PyTorch; mentioned in GitHub)

Benchmarks

| Benchmark | Methodology | Metrics |
| --- | --- | --- |
| image-classification-on-caltech-256 | WaveMixLite-256/7 | Accuracy: 54.62 |
| image-classification-on-cifar-10 | WaveMixLite-144/7 | Percentage correct: 97.29 |
| image-classification-on-cifar-100 | WaveMix-Lite-256/7 | Percentage correct: 70.20 |
| image-classification-on-cifar-100 | WaveMixLite-256/7 | Percentage correct: 85.09 |
| image-classification-on-emnist-balanced | WaveMixLite-128/7 | Accuracy: 91.06 |
| image-classification-on-emnist-byclass | WaveMixLite-128/7 | Accuracy: 88.43 |
| image-classification-on-emnist-bymerge | WaveMixLite-128/16 | Accuracy: 91.80 |
| image-classification-on-emnist-digits | WaveMixLite-112/16 | Accuracy (%): 99.82 |
| image-classification-on-emnist-letters | WaveMixLite-112/16 | Accuracy: 95.96 |
| image-classification-on-fashion-mnist | WaveMixLite | Percentage error: 5.68 |
| image-classification-on-galaxy10-decals | WaveMix | PARAMS (M): 28; Top-1 Accuracy (%): 95.42 |
| image-classification-on-imagenet | WaveMix-192/16 (level 3) | Top 1 Accuracy: 74.93% |
| image-classification-on-inat2021-mini | WaveMix-256/16 (level 2) | Top 1 Accuracy: 61.75 |
| image-classification-on-mnist-1 | WaveMixLite | Percentage error: 0.25 |
| image-classification-on-places365-standard | WaveMix-240/12 (level 4) | Top 1 Accuracy: 56.45 |
| image-classification-on-stl-10 | WaveMixLite-256/7 | Percentage correct: 70.88 |
| image-classification-on-svhn | WaveMixLite-144/15 | Percentage error: 1.27 |
| image-classification-on-tiny-imagenet-1 | WaveMixLite-144/7 | Validation Acc: 77.47% |
| scene-classification-on-places365-standard | WaveMix | Top 1 Error: 43.55 |
| semantic-segmentation-on-cityscapes-val | WaveMix-256/16 (Level-4) | mIoU: 82.60 |
| semantic-segmentation-on-cityscapes-val | WaveMix | mIoU: 82.7 |
