Paper
Document
Download
Flag content
23

Principled feature attribution for unsupervised gene expression analysis

Save
TipTip
Document
Download
Flag content
23
TipTip
Save
Document
Download
Flag content

Abstract

Abstract As interest in unsupervised deep learning models for the analysis of gene expression data has grown, an increasing number of methods have been developed to make these deep learning models more interpretable. These methods can be separated into two groups: (1) post hoc analyses of black box models through feature attribution methods and (2) approaches to build inherently interpretable models through biologically-constrained architectures. In this work, we argue that these approaches are not mutually exclusive, but can in fact be usefully combined. We propose a novel unsupervised pathway attribution method, which better identifies major sources of transcriptomic variation than prior methods when combined with biologically-constrained neural network models. We demonstrate how principled feature attributions aid in the analysis of a variety of single cell datasets. Finally, we apply our approach to a large dataset of post-mortem brain samples from patients with Alzheimer’s disease, and show that it identifies Mitochondrial Respiratory Complex I as an important factor in this disease.

Paper PDF

This paper's license is marked as closed access or non-commercial and cannot be viewed on ResearchHub. Visit the paper's external site.