Abstract Single-cell RNA sequencing has revolutionized the study of immuno-oncology, cancer biology, and developmental biology by enabling the joint characterization of gene expression and cellular heterogeneity in a single platform. As of July 2023, the Gene Expression Omnibus now contains over 4000 published single-cell data sets, providing an invaluable opportunity for reanalysis to identify new cell types or cellular states as well as their defining transcriptional programs. To facilitate the reprocessing of these public datasets, we have devised a single-cell RNA sequencing analysis framework for data retrieval, quality control, expression normalization, dimension reduction, cell clustering, and data integration. Additionally, we have developed a Shiny App visualization platform that enables the exploration of gene expression, cell type annotations, and cell lineages through a user interface. We performed a re-analysis of single-cell RNAseq data generated from acute myeloid leukemia and tumor-reactive lymphocytes and found our pipeline to faithfully recapitulated the cell type assignment as well as expected lineage trajectories. Altogether, we present BERLIN, a single-cell RNAseq analysis pipeline that facilitates the integration and public dissemination of results from the reanalysis.
This paper's license is marked as closed access or non-commercial and cannot be viewed on ResearchHub. Visit the paper's external site.