Counterfactual Attention Learning for Fine-Grained Visual Categorization and Re-identification
Rao Yongming ; Chen Guangyi ; Lu Jiwen ; Zhou Jie

Abstract
Attention mechanism has demonstrated great potential in fine-grained visual recognition tasks. In this paper, we present a counterfactual attention learning method to learn more effective attention based on causal inference. Unlike most existing methods that learn visual attention based on conventional likelihood, we propose to learn the attention with counterfactual causality, which provides a tool to measure the attention quality and a powerful supervisory signal to guide the learning process. Specifically, we analyze the effect of the learned visual attention on network prediction through counterfactual intervention and maximize the effect to encourage the network to learn more useful attention for fine-grained image recognition. Empirically, we evaluate our method on a wide range of fine-grained recognition tasks where attention plays a crucial role, including fine-grained image categorization, person re-identification, and vehicle re-identification. The consistent improvement on all benchmarks demonstrates the effectiveness of our method. Code is available at https://github.com/raoyongming/CAL
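The idea in the abstract, measuring how much the learned attention changes the prediction and then maximizing that effect, can be illustrated with a small NumPy sketch. This is not the authors' implementation (see the repository linked above for that). It assumes a linear classifier on attention-pooled features, uses uniformly random attention as the counterfactual intervention, and applies cross-entropy to the prediction difference. All function names, shapes, and the toy data are hypothetical.

```python
import numpy as np


def attention_pool(features, attention):
    """Pool features (C, H, W) with attention maps (M, H, W) into a vector (M*C,)."""
    height, width = features.shape[1:]
    pooled = np.einsum("mhw,chw->mc", attention, features) / (height * width)
    return pooled.reshape(-1)


def predict(features, attention, weights):
    """Toy linear classifier on the attention-pooled features (an assumption)."""
    return weights @ attention_pool(features, attention)


def attention_effect(features, attention, weights, rng):
    """Prediction with the learned attention minus prediction with random attention."""
    # Counterfactual intervention: swap the learned attention for random maps.
    counterfactual = rng.uniform(0.0, 1.0, size=attention.shape)
    factual_logits = predict(features, attention, weights)
    counterfactual_logits = predict(features, counterfactual, weights)
    return factual_logits - counterfactual_logits


def cross_entropy(logits, label):
    """Numerically stable cross-entropy for a single example."""
    shifted = logits - logits.max()
    log_probs = shifted - np.log(np.exp(shifted).sum())
    return -log_probs[label]


# Toy data: 8 feature channels on a 4x4 grid, 2 attention maps, 5 classes.
rng = np.random.default_rng(0)
features = rng.normal(size=(8, 4, 4))
attention = rng.uniform(size=(2, 4, 4))
weights = rng.normal(size=(5, 2 * 8))

effect = attention_effect(features, attention, weights, rng)
# Minimizing this loss pushes the effect toward the true class,
# which is one way to "maximize the effect" of the attention.
loss = cross_entropy(effect, label=3)
print(effect.shape, float(loss))
```

In a real training loop this term would be computed on network outputs and combined with the usual classification loss, so that gradients reach the attention module; the sketch only shows how the effect and its loss are formed.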
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| few-shot-learning-on-dtd | CAL | 4-shot Accuracy: 40.9; 8-shot Accuracy: 50.4; 12-shot Accuracy: 54.6; 16-shot Accuracy: 57.4 |
| few-shot-learning-on-fgvc-aircraft-1 | CAL | 4-shot Accuracy: 35.2; 8-shot Accuracy: 55.4; 12-shot Accuracy: 67.6; 16-shot Accuracy: 74.3; Harmonic mean: 35.2 |
| few-shot-learning-on-stanford-cars | CAL | 4-shot Accuracy: 42.2; 8-shot Accuracy: 71.8; 12-shot Accuracy: 82.9; 16-shot Accuracy: 88.9 |
| fine-grained-image-classification-on-cub-200-1 | CAL | Accuracy: 90.6 |
| fine-grained-image-classification-on-fgvc | CAL | Accuracy: 94.2 |
| fine-grained-image-classification-on-stanford | CAL | Accuracy: 95.5 |
| mitigating-contextual-bias-on-fgvc-aircraft | CAL | OOD Accuracy (%): 10.2; Top-1 Accuracy (%): 71.0 |
| mitigating-contextual-bias-on-fgvc-aircraft | CAL + ALIA | OOD Accuracy (%): 25.1; Top-1 Accuracy (%): 71.8 |
| person-re-identification-on-dukemtmc-reid | CAL | Rank-1: 90; mAP: 80.5 |
| person-re-identification-on-market-1501 | CAL | Rank-1: 95.5; mAP: 89.5 |
| person-re-identification-on-msmt17 | CAL (ResNet50) | Rank-1: 84.2; mAP: 64 |
| vehicle-re-identification-on-vehicleid-large | CAL | Rank-1: 75.1; mAP: 80.9 |
| vehicle-re-identification-on-vehicleid-medium | CAL | Rank-1: 78.2; mAP: 83.8 |
| vehicle-re-identification-on-vehicleid-small | CAL | Rank-1: 82.5; mAP: 87.8 |
| vehicle-re-identification-on-veri-776 | CAL | Rank-1: 95.4; Rank-5: 97.9; mAP: 74.3 |