A caveat of cancer cell line models is that their molecular and functional profiling is subject to laboratory-specific experimental practices and data analysis protocols. The current challenge is how to make an integrated use of omics profiles of cancer cell lines for reliable discoveries. Here, we carried out a systematic analysis of nine types of data modalities using meta-analysis of 53 omics profiling studies across 12 research laboratories for 2018 cell lines. To account for relatively low consistency observed for certain data modalities, we developed a robust data integration approach that identifies reproducible signals shared among multiple data modalities and studies. We demonstrated the power of the integrative analyses by identifying a novel driver gene, ECHDC1, with tumor suppressive role validated both in breast cancer cells and patient tumors. Extension of the approach identified synthetic lethal partners of cancer drivers, including a co-dependency of PTEN deficient cells on RNA helicases. HighlightsO_LIA comprehensive meta-analysis of 53 multi-modal omics profiles of >2000 cancer cell lines from 12 research laboratories C_LIO_LIAn unexpected lack of consistency between TMT-labelled and non-labelled global proteomic profiles C_LIO_LIA non-parametric approach to integrate omics profiles from multiple laboratories and to identify robust molecular patterns in individual cell lines C_LIO_LIThe multi-modal data integration reveals novel drivers and potential therapeutic targets, including ECHDC1 in breast cancers and DDX27 in PTEN mutant cancers. C_LI
Support the authors with ResearchCoin
Support the authors with ResearchCoin