Image Classification On Places365
Metrics
Top 1 Accuracy
Results
Performance results of various models on this benchmark
Model Name | Top 1 Accuracy | Paper Title | Repository |
---|---|---|---|
OmniVec(ViT) | 63.5 | OmniVec: Learning robust representations with cross modal sharing | - |
ViC-MAE (ViT-L) | 59.5% | ViC-MAE: Self-Supervised Representation Learning from Images and Video with Contrastive Masked Autoencoders | |
MixMIM-L(ViT-L) | 60.3 | MixMAE: Mixed and Masked Autoencoder for Efficient Pretraining of Hierarchical Vision Transformers | |
MixMIM-B (ViT) | 58.9 | MixMAE: Mixed and Masked Autoencoder for Efficient Pretraining of Hierarchical Vision Transformers | |
InternImage-H(CNN) | 61.2% | InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions | |
µ2Net+ (ViT-L/16) | 59.15 | A Continual Development Methodology for Large-scale Multitask Dynamic ML Systems | |
OmniVec2 | 65.1 | OmniVec2 - A Novel Transformer based Network for Large Scale Multimodal and Multitask Learning | - |
0 of 7 row(s) selected.