HyperAI

Image Classification

Image classification is a fundamental task in computer vision, aiming to understand and categorize entire images by assigning them specific labels. This task typically targets images of single objects and achieves high-precision classification through technologies such as deep learning, with broad application value including content recognition and scene understanding. When classification reaches the instance level, it becomes associated with image retrieval, which also involves finding similar images in large databases.

AP-GeM (ResNet-101)
ResNet-50
WaveMix
AG-Net
µ2Net+ (ViT-L/16)
SimCLR
cFlow
ResMLP-24
HSANR
DINOv2 (ViT-g/14, frozen model, linear eval)
FaMUS
MentorMix
ASF-former-S
SSR
Label-Ranker
FaMUS
MentorMix
WRN-28-2 + UDA+AutoDropout
shreynet
VIT-L/16 (Spinal FC, Background)
SEER (RegNet10B)
SEER (RegNet10B)
CurriculumNet
MLP-DecAug
Entropy-based Logic Explained Network
Sparse-CBM
ViT-Large/16 (384)
ViT-Large/16 (384)
Linear FT(ViT-L/14)
SNN
WaveMixLite-128/7
µ2Net (ViT-L/16)
VGG-5(Spinal FC)
SDGM-D
µ2Net+ (ViT-L/16)
Continued fraction of straight lines
TransBoost-ResNet50
EnGraf-Net101 (G=4, H=1)
CCT-14/7x2
CNN+ Wilson-Cowan model RNN
TransBoost-ResNet50
LRA-diffusion (CLIP ViT)
Our Ensemble Learning-2
WaveMix
CoAtNet-1
Fuzzy Distance Ensemble
E2E-3M
Claude 3 Opus
GAC-SNN MS-ResNet-34
ResNet-50 + UDA+AutoDropout
SparseSwin with L2
MoCo + CaSSLe
BinaryViT
WRN (N=28, k=10)
WRN (N=36, k=5)
EfficientNet-L2-Ns
SqueezeNet + Simple Bypass
ViT-H @224 (DeiT III, 21k)
µ2Net+ (ViT-L/16)
Model soups (ViT-G/14)
WaveMix-256/16 (level 2)
AIMv2-3B (448 res)
InternImage-H
Hiera-H (448px)
ThanosNet
COSMO
V-MoE-H/14 (Every-2)
SEER (RegNet10B)
µ2Net (ViT-L/16)
RADAM (ConvNeXt-XL)
KMNIST-Tiny
HiFuse_Small
CoNAL
L3D_original_2level
Inception-v3
kEffNet-B0 V2 16ch
EfficientNet-B3
Branching/Merging CNN + Homogeneous Vector Capsules
WaveMixLite
PDO-eConv (ours)
PDO-eConv (ours)
CapsNet
STS-ResNet
CoCa
BiT-L (ResNet)
Diffusion Classifier (zero-shot)
NOAH-ViTB/16
ResNet-18 + Vision Eagle Attention
TWIST (ResNet-50)
CeiT-S (384 finetune resolution)
NNCLR
MAE (ViT-H, 448)
InternImage-H(CNN)
SWAG (ViT H/14)
kMobileNet V3 Large 16ch
SAG-ViT
ResNet-152 2x (RS training)
Deep regularization
PropMix
FaMUS
InstanceGM-SS
InstanceGM-SS
Fuzzy rank-based fusion of CNN models using Gompertz function
DL+PCA+GWO
Heinsen Routing
ResNet50
OFSCIL
Model with negotiation paradigm
Max Margin Contrastive
µ2Net+ (ViT-L/16)
TransBoost-ResNet50
E2E-3M
Wide-ResNet-28-10
EGNN+Transduction
Astroformer
UPANets
BiomedCLIP+PubmedBERT
VOLO-D5
ALIGN (50 hypers/task)
PropMix (Ours)
CurriculumNet (InceptionResNet-v2)