View a PDF of the paper titled Sum-of-Parts: Self-Attributing Neural Networks with End-to-End Learning of Feature Groups, by Weiqiu You and 4 other authors
View PDF
HTML (experimental)
Abstract:Self-attributing neural networks (SANNs) present a potential path towards interpretable models for high-dimensional problems, but often face significant trade-offs in performance. In this work, we formally prove a lower bound on errors of per-feature SANNs, whereas group-based SANNs can achieve zero error and thus high performance. Motivated by these insights, we propose Sum-of-Parts (SOP), a framework that transforms any differentiable model into a group-based SANN, where feature groups are learned end-to-end without group supervision. SOP achieves state-of-the-art performance for SANNs on vision and language tasks, and we validate that the groups are interpretable on a range of quantitative and semantic metrics. We further validate the utility of SOP explanations in model debugging and cosmological scientific discovery. Our code is available at this https URL
Submission history
From: Weiqiu You [view email]
[v1]
Wed, 25 Oct 2023 02:50:10 UTC (33,311 KB)
[v2]
Wed, 2 Oct 2024 23:37:28 UTC (30,418 KB)
[v3]
Fri, 14 Feb 2025 21:20:20 UTC (32,115 KB)
[v4]
Tue, 24 Jun 2025 04:57:39 UTC (12,301 KB)