Cell phenotypes are dictated by both extra- and intra-cellular contexts, and robust identification of context-specific network features that control phenotypes remains challenging. Here, we developed a multi-omics data integration strategy called MOBILE (Multi-Omics Binary Integration via Lasso Ensembles) to nominate molecular features associated with specific cellular phenotypes. We applied this method to chromatin accessibility, mRNA, protein, and phospho-protein time course datasets and focus on two illustrative use cases after we show MOBILE could recover known biology. First, MOBILE nominated new mechanisms of interferon-{gamma} (IFN{gamma}) regulated PD-L1 expression, where analyses suggested, and literature supported that IFN{gamma}-controlled PD-L1 expression involves BST2, CLIC2, FAM83D, ACSL5, and HIST2H2AA3 genes. Second, we explored differences between the highly similar transforming growth factor-beta 1 (TGF{beta}1) and bone morphogenetic protein 2 (BMP2) and showed that differential cell size and clustering properties induced by TGF{beta}1, but not BMP2, were related to the laminin/collagen pathway activity. Given the ever-growing availability of multi-omics datasets, we envision that MOBILE will be broadly applicable to identify context-specific molecular features associated with cellular phenotypes. Graphical Summary O_FIG O_LINKSMALLFIG WIDTH=162 HEIGHT=200 SRC="FIGDIR/small/501297v1_ufig1.gif" ALT="Figure 1"> View larger version (76K): org.highwire.dtl.DTLVardef@12fa3a5org.highwire.dtl.DTLVardef@a0b6a8org.highwire.dtl.DTLVardef@137efb7org.highwire.dtl.DTLVardef@15aec85_HPS_FORMAT_FIGEXP M_FIG C_FIG Multi-Omics Binary Integration via Lasso Ensembles (MOBILE) pipeline yields statistically robust, context-specific association networksThe MOBILE pipeline integrates omics datasets in a data-driven, biologically-structured manner. The pipeline outputs are gene-level, contextspecific association networks. These association networks nominate differentially enriched pathways, subnetworks, and new connections. Broadly applicable to find condition specific networks using multi-omics datasets.
Support the authors with ResearchCoin
Support the authors with ResearchCoin