Command Palette
Search for a command to run...
Picek Lukáš ; Šulc Milan ; Matas Jiří ; Heilmann-Clausen Jacob ; Jeppesen Thomas S. ; Læssøe Thomas ; Frøslev Tobias

Abstract
We introduce a novel fine-grained dataset and benchmark, the Danish Fungi2020 (DF20). The dataset, constructed from observations submitted to the Atlasof Danish Fungi, is unique in its taxonomy-accurate class labels, small numberof errors, highly unbalanced long-tailed class distribution, rich observationmetadata, and well-defined class hierarchy. DF20 has zero overlap withImageNet, allowing unbiased comparison of models fine-tuned from publiclyavailable ImageNet checkpoints. The proposed evaluation protocol enablestesting the ability to improve classification using metadata -- e.g. precisegeographic location, habitat, and substrate, facilitates classifier calibrationtesting, and finally allows to study the impact of the device settings on theclassification performance. Experiments using Convolutional Neural Networks(CNN) and the recent Vision Transformers (ViT) show that DF20 presents achallenging task. Interestingly, ViT achieves results superior to CNN baselineswith 80.45% accuracy and 0.743 macro F1 score, reducing the CNN error by 9% and12% respectively. A simple procedure for including metadata into the decisionprocess improves the classification accuracy by more than 2.95 percentagepoints, reducing the error rate by 15%. The source code for all methods andexperiments is available at https://sites.google.com/view/danish-fungi-dataset.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| image-classification-on-df20 | Inception-V3 (299) | Top-1: 72.1 Top-3: 86.58 |
| image-classification-on-df20 | SE-ResNeXt-101-32x4d (224) | F1 - macro: 0.66 Top-1: 74.26 Top-3: 87.78 |
| image-classification-on-df20 | ResNet-34 (299) | F1 - macro: 0.60 Top-3: 84.76 |
| image-classification-on-df20 | Inception-ResNet-V2 (299) | F1 - macro: 0.651 Top-1: 74.01 Top-3: 87.49 |
| image-classification-on-df20 | EfficientNet-B1 (299) | F1 - macro: 0.654 Top-1: 74.08 Top-3: 87.68 |
| image-classification-on-df20 | EfficientNet-B3 (224) | F1 - macro: 0.634 Top-1: 72.51 Top-3: 86.77 |
| image-classification-on-df20 | ResNet-50 (299) | Top-1: 73.49 Top-3: 87.13 |
| image-classification-on-df20 | ViT-Large/16 (384) | F1 - macro: 0.743 Top-1: 80.45 Top-3: 91.68 |
| image-classification-on-df20 | EfficientNet-B3 (299) | F1 - macro: 0.673 Top-1: 75.69 Top-3: 88.72 |
| image-classification-on-df20 | ViT-Base/16 (384) | F1 - macro: 0.727 Top-1: 79.48 Top-3: 90.95 |
| image-classification-on-df20 | MobileNet-V2 (299) | Top-1: 69.77 Top-3: 85.01 |
| image-classification-on-df20 | ViT-Large/16 (224) | F1 - macro: 0.675 Top-1: 75.29 Top-3: 88.34 |
| image-classification-on-df20 | SE-ResNeXt-101-32x4d (299) | F1 - macro: 0.693 Top-3: 89.48 |
| image-classification-on-df20 | ResNet-18 | F1 - macro: 0.580 Top-1: 67.13 Top-3: 82.65 |
| image-classification-on-df20 | EfficientNet-B5 (299) | F1 - macro: 0.678 Top-1: 76.1 Top-3: 88.85 |
| image-classification-on-df20 | Inception-V4 (299) | F1 - macro: 0.637 Top-1: 73 Top-3: 86.87 |
| image-classification-on-df20 | EfficientNet-B0 (224) | F1 - macro: 0.613 Top-1: 70.33 Top-3: 85.19 |
| image-classification-on-df20 | EfficientNet-B0 (299) | Top-1: 73.65 |
| image-classification-on-df20 | SE-ResNeXt-101-32x4d | Top-1: 77.13 |
| image-classification-on-df20-mini | ResNet-18 | F1 - macro: 0.514 Top-1: 62.91 Top-3: 81.65 |
| image-classification-on-df20-mini | EfficientNet-B5 (299) | Top-1: 68.76 Top-3: 85 |
| image-classification-on-df20-mini | ResNet-50 (299) | Top-1: 68.49 Top-3: 85.22 |
| image-classification-on-df20-mini | ResNet-34 (299) | F1 - macro: 0.559 Top-3: 83.52 |
| image-classification-on-df20-mini | ViT-Large/16 (384) | F1 - macro: 0.669 Top-1: 75.85 Top-3: 89.95 |
| image-classification-on-df20-mini | Inception-ResNet-V2 (299) | Top-1: 64.67 Top-3: 81.42 |
| image-classification-on-df20-mini | EfficientNet-B3 (224) | F1 - macro: 0.55 Top-1: 67.39 Top-3: 83.74 |
| image-classification-on-df20-mini | EfficientNet-B0 (224) | F1 - macro: 0.531 Top-1: 65.66 Top-3: 83.65 |
| image-classification-on-df20-mini | EfficientNet-B1 (299) | Top-1: 68.35 Top-3: 84.67 |
| image-classification-on-df20-mini | EfficientNet-B0 (299) | F1 - macro: 0.567 Top-1: 67.94 Top-3: 85.71 |
| image-classification-on-df20-mini | ViT-Base/16 (384) | F1 - macro: 0.639 Top-1: 74.23 Top-3: 89.12 |
| image-classification-on-df20-mini | EfficientNet-B3 (299) | F1 - macro: 0.59 Top-1: 69.59 Top-3: 85.55 |
| image-classification-on-df20-mini | ViT-Large/16 (224) | F1 - macro: 0.603 Top-1: 71.04 Top-3: 86.15 |
| image-classification-on-df20-mini | Inception-V3 (299) | F1 - macro: 0.535 Top-1: 65.91 Top-3: 82.97 |
| image-classification-on-df20-mini | SE-ResNeXt-101-32x4d | Top-1: 72.23 |
| image-classification-on-df20-mini | SE-ResNeXt-101-32x4d (224) | F1 - macro: 0.585 Top-1: 68.87 Top-3: 85.14 |
| image-classification-on-df20-mini | MobileNet-V2 (299) | Top-1: 65.58 |
| image-classification-on-df20-mini | Inception-V4 (299) | Top-1: 67.45 Top-3: 82.78 |
| image-classification-on-df20-mini | SE-ResNeXt-101-32x4d (299) | F1 - macro: 0.62 Top-3: 87.28 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.