a month ago

Axiomatic Attribution for Deep Networks

Sundararajan Mukund Taly Ankur Yan Qiqi

Abstract

We study the problem of attributing the prediction of a deep network to itsinput features, a problem previously studied by several other works. Weidentify two fundamental axioms---Sensitivity and Implementation Invariancethat attribution methods ought to satisfy. We show that they are not satisfiedby most known attribution methods, which we consider to be a fundamentalweakness of those methods. We use the axioms to guide the design of a newattribution method called Integrated Gradients. Our method requires nomodification to the original network and is extremely simple to implement; itjust needs a few calls to the standard gradient operator. We apply this methodto a couple of image models, a couple of text models and a chemistry model,demonstrating its ability to debug networks, to extract rules from a network,and to enable users to engage with models better.

Code Repositories

shyhyawJou/Integrated-Gradient-Pytorch

pytorch

Mentioned in GitHub

sicara/tf-explain

Mentioned in GitHub

yeefan1999/Explainable-Health-Prediction-with-Transfer-Learning

Mentioned in GitHub

hannamw/eap-ig

pytorch

Mentioned in GitHub

nsaphra/acd

pytorch

Mentioned in GitHub

ascillitoe/shap

Mentioned in GitHub

jankrepl/mildlyoverfitted

jax

mindspore-ai/contrib/tree/master/intern/Integrated-Gradient

mindspore

pamflecista/Magisterka

pytorch

Mentioned in GitHub

garygsw/smooth-taylor

pytorch

Mentioned in GitHub

bips-hb/innsight

pytorch

Mentioned in GitHub

tomdyer10/fake_news

pytorch

Mentioned in GitHub

JoHof/IntegratedGradientsTutorial

Mentioned in GitHub

miaolan-xie/shap

Mentioned in GitHub

tleemann/road_evaluation

pytorch

Mentioned in GitHub

cdpierse/transformers-interpret

pytorch

Mentioned in GitHub

allenwind/text-integrated-gradients

Mentioned in GitHub

fcUalberta/UAlberta-Multimedia-Masters-Program-Interpretable-AI-Part_1_2

Mentioned in GitHub

MindSpore-scientific/code-8/tree/main/Integrated-Gradient

mindspore

TooTouch/WhiteBox-Part1

pytorch

Mentioned in GitHub

AlejandroAttento/Pytorch-Captum

pytorch

Mentioned in GitHub

marnifora/magisterka

pytorch

Mentioned in GitHub

austinbrown34/shap

Mentioned in GitHub

ankurtaly/Attributions

Official

Mentioned in GitHub

shaoshanglqy/shap-shapley

Mentioned in GitHub

TianhongDai/integrated-gradient-pytorch

pytorch

Mentioned in GitHub

shyhyawJou/Integrated-Gradient-Tensorflow

Mentioned in GitHub

saivarunr/xshap

Mentioned in GitHub

jemilc/shap

Mentioned in GitHub

pwc-1/Paper-9/tree/main/4/Integrated-Gradient

mindspore

koren-v/Interpret

pytorch

Mentioned in GitHub

shap/shap

Mentioned in GitHub

uhussai7/boldreams

pytorch

Mentioned in GitHub

xiaoyanLi629/ScRNA-seq-integration-by-Heterogeneous-Graph-transformer-neural-network

pytorch

Mentioned in GitHub

MindSpore-scientific-2/code-4/tree/main/Integrated-Gradient

mindspore

suinleelab/path_explain

Mentioned in GitHub

gablabc/shap

Mentioned in GitHub

galdeia/iirsbenchmark

Mentioned in GitHub

andresbecker/master_thesis

Mentioned in GitHub

pytorch/captum

pytorch

Mentioned in GitHub

Benchmarks

Benchmark	Methodology	Metrics
image-attribution-on-celeba	Integrated Gradients	Deletion AUC score (ArcFace ResNet-101): 0.0680 Insertion AUC score (ArcFace ResNet-101): 0.3578
image-attribution-on-cub-200-2011-1	Integrated Gradients	Deletion AUC score (ResNet-101): 0.0728 Insertion AUC score (ResNet-101): 0.0422
image-attribution-on-vggface2	Integrated Gradients	Deletion AUC score (ArcFace ResNet-101): 0.0749 Insertion AUC score (ArcFace ResNet-101): 0.5399
interpretability-techniques-for-deep-learning-1	Integrated Gradients	Insertion AUC score: 0.3578

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started

Hyper Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

Axiomatic Attribution for Deep Networks

Sundararajan Mukund Taly Ankur Yan Qiqi

Abstract

Code Repositories

Benchmarks

Build AI with AI

Hyper Newsletters