ResearchHub | Open Science Community

Generating counterfactual explanations of tumor spatial proteomes to discover effective strategies for enhancing immune infiltration

Zitong Wang et al.Jan 1, 2023

The tumor microenvironment (TME) significantly impacts cancer prognosis due to its immune composition. While therapies for altering the immune composition, including immunotherapies, have shown exciting results for treating hematological cancers, they are less effective for immunologically-cold, solid tumors. Spatial omics technologies capture the spatial organization of the TME with unprecedented molecular detail, revealing the relationship between immune cell localization and molecular signals. Here, we formulate T-cell infiltration prediction as a self-supervised machine learning problem and develop a counterfactual optimization strategy that leverages large scale spatial omics profiles of patient tumors to design tumor perturbations predicted to boost T-cell infiltration. A convolutional neural network predicts T-cell distribution based on signaling molecules in the TME provided by imaging mass cytometry. Gradient-based counterfactual generation, then, computes perturbations predicted to boost T-cell abundance. We apply our framework to melanoma, colorectal cancer (CRC) liver metastases, and breast tumor data, discovering combinatorial perturbations predicted to support T-cell infiltration across tens to hundreds of patients. This work presents a paradigm for counterfactual-based prediction and design of cancer therapeutics using spatial omics data.

Oncology

Immunology

2

Paper

Save

Multi-ContrastiveVAE disentangles perturbation effects in single cell images from optical pooled screens

Zitong Wang et al.Jan 1, 2023

Optical pooled screens (OPS) enable unbiased and cost-effective interrogation of gene function by generating images of millions of cells across thousands of perturbations. However, the analysis of OPS data remains a hurdle because it still mainly relies on hand-crafted features, which can be difficult to deploy across complex data sets. Additionally, most unsupervised feature extraction methods based on neural networks (such as auto-encoders) have difficulty isolating the effect of perturbations from the natural variations among cells. We therefore propose a contrastive analysis framework that is more effective at disentangling the phenotypes induced by perturbation from natural cell-cell heterogeneity present in an unperturbed cell population. By analyzing a significant data set of over 30 million cells across more than 5,000 genetic perturbations, we demonstrate that our method significantly outperforms traditional methods in generating biologically-informative embeddings and mitigating technical artifacts. Furthermore, the interpretable part of our model enables us to pinpoint perturbations that generate novel phenotypes from the ones that only shift the distribution of existing phenotypes. Our approach can be readily applied to other small-molecule and genetic perturbation data sets with highly multiplexed images, enhancing the efficiency and precision in identifying and interpreting perturbation-specific phenotypic patterns, paving the way for deeper insights and discoveries in OPS analysis.

Artificial Intelligence

Biophysics

0

Paper

Artificial Intelligence

Biophysics

0

Save

1

Localization of signaling receptors maximizes cellular information acquisition in spatially-structured natural environments

Zitong Wang et al.Jul 2, 2021

M

Z

Abstract Cells in natural environments like tissue or soil sense and respond to extracellular ligands with intricately structured and non-monotonic spatial distributions, sculpted by processes such as fluid flow and substrate adhesion. In this work, we show that spatial sensing and navigation can be optimized by adapting the spatial organization of signaling pathways to the spatial structure of the environment. We develop an information-theoretic framework for computing the optimal spatial organization of a sensing system for a given signaling environment. We find that receptor localization maximizes information acquisition in simulated natural contexts, including tissue and soil. Receptor localization extends naturally to produce a dynamic protocol for continuously redistributing signaling receptors, which when implemented using simple feedback, boosts cell navigation efficiency by 30-fold. Broadly, our work shows how cells can maximize the fidelity of information transfer by adapting the spatial organization of signaling molecules to the spatial structure of the environment.

Ecology

Molecular Biology

1

Paper

Save

A Supervised Contrastive Framework for Learning Disentangled Representations of Cell Perturbation Data

Xin-Ming Tu et al.Jan 8, 2024

CRISPR technology, combined with single-cell RNA-Seq, has opened the way to large scale pooled perturbation screens, allowing more systematic interrogations of gene functions in cells at scale. However, such Perturb-seq data poses many analysis challenges, due to its high-dimensionality, high level of technical noise, and variable Cas9 efficiency. The single-cell nature of the data also poses its own challenges, as we observe the heterogeneity of phenotypes in the unperturbed cells, along with the effect of the perturbations. All in all, these characteristics make it difficult to discern subtler effects. Existing tools, like mixscape and ContrastiveVI, provide partial solutions, but may oversimplify biological dynamics, or have low power to characterize perturbations with a smaller effect size. Here, we address these limitations by introducing the Supervised Contrastive Variational Autoencoder (SC- VAE). SC-VAE integrates guide RNA identity with gene expression data, ensuring a more discriminative analysis, and adopts the Hilbert-Schmidt Independence Criterion as a way to achieve disentangled representations, separating the heterogeneity in the control population from the effect of the perturbations. Evaluation on large-scale data sets highlights SC-VAE's superior sensitivity in identifying perturbation effects compared to ContrastiveVI, scVI and PCA. The perturbation embeddings better reflect known protein complexes (evaluated on CORUM), while its classifier offers promise in identifying assignment errors and cells escap- ing the perturbation phenotype. SC-VAE is readily applicable across diverse perturbation data sets.

Artificial Intelligence

Biophysics

0

Paper

Artificial Intelligence

Biophysics

0

Save