Attribution Quality Metrics with Magnitude Alignment

Chase Walker, Dominic Simon, Kenny Chen, Rickard Ewetz

Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence
Main Track. Pages 530-538. https://doi.org/10.24963/ijcai.2024/59

Attribution algorithms play an instrumental role in human interpretation of AI models. These methods measure the importance of input features to a model's output decision, which can be displayed as an attribution map for image classifiers. Perturbation tests are the state-of-the-art approach for evaluating the quality of an attribution map. Unfortunately, we observe that perturbation tests fail to consider attribution magnitude, which translates into inconsistent quality scores. In this paper, we propose Magnitude Aligned Scoring (MAS), a new attribution quality metric that measures the alignment between the magnitude of the attributions and the model response. In particular, the metric accounts for both the relative ordering and the magnitude of the pixels within an attribution. In the experimental evaluation, we compare the MAS metric with existing metrics across a wide range of models, datasets, attributions, and evaluations. The results demonstrate that the MAS metric is 4x more sensitive to attribution changes, 2x more consistent, and 1.6x more invariant to baseline modifications. Our code and the referenced appendix are publicly available at https://github.com/chasewalker26/Magnitude-Aligned-Scoring.
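
To illustrate the magnitude-insensitivity the abstract critiques, the sketch below implements a standard deletion-style perturbation test, not MAS itself (the paper's formulation is in the linked repository). The helper `deletion_score` and all parameter names are hypothetical, and a grayscale image with a scalar-output classifier score is assumed. Because pixels are perturbed purely in descending attribution rank, rescaling the attribution values leaves the score unchanged.

```python
import numpy as np

def deletion_score(model, image, attribution, baseline=0.0, steps=100):
    """Deletion-style perturbation test (illustrative sketch, not MAS).

    Pixels are replaced with `baseline` in descending order of attribution,
    and the model's class score is recorded after each step. Only the
    *ranking* of `attribution` is used, so scaling its values has no effect
    on the result -- the magnitude-insensitivity the abstract describes.
    """
    h, w = attribution.shape
    order = np.argsort(attribution.ravel())[::-1]  # most important pixels first
    perturbed = image.copy()
    scores = [float(model(perturbed))]
    chunk = max(1, (h * w) // steps)
    for start in range(0, h * w, chunk):
        idx = order[start:start + chunk]
        rows, cols = np.unravel_index(idx, (h, w))
        perturbed[rows, cols] = baseline  # remove the next batch of pixels
        scores.append(float(model(perturbed)))
    # Normalized area under the score curve; lower is better for deletion.
    return float(np.trapz(scores, dx=1.0 / (len(scores) - 1)))

# Toy demo: a stand-in "classifier score" on random data.
rng = np.random.default_rng(0)
img = rng.random((8, 8))
attr = rng.random((8, 8))
model = lambda x: x.sum()
print(deletion_score(model, img, attr))
print(deletion_score(model, img, 10 * attr))  # identical: magnitude ignored
```

The two printed values are identical even though the second attribution map asserts ten times the importance everywhere, since `np.argsort` sees the same ordering; per the abstract, MAS is designed to distinguish such cases by aligning attribution magnitude with the model response.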
Keywords:
AI Ethics, Trust, Fairness: ETF: Explainability and interpretability
AI Ethics, Trust, Fairness: ETF: Trustworthy AI
Computer Vision: CV: Interpretability and transparency
Machine Learning: ML: Explainable/Interpretable machine learning