ResearchHub | Open Science Community

An integrated cell atlas of the lung in health and disease

Lisa Sikkema et al.Jun 1, 2023

Abstract Single-cell technologies have transformed our understanding of human tissues. Yet, studies typically capture only a limited number of donors and disagree on cell type definitions. Integrating many single-cell datasets can address these limitations of individual studies and capture the variability present in the population. Here we present the integrated Human Lung Cell Atlas (HLCA), combining 49 datasets of the human respiratory system into a single atlas spanning over 2.4 million cells from 486 individuals. The HLCA presents a consensus cell type re-annotation with matching marker genes, including annotations of rare and previously undescribed cell types. Leveraging the number and diversity of individuals in the HLCA, we identify gene modules that are associated with demographic covariates such as age, sex and body mass index, as well as gene modules changing expression along the proximal-to-distal axis of the bronchial tree. Mapping new data to the HLCA enables rapid data annotation and interpretation. Using the HLCA as a reference for the study of disease, we identify shared cell states across multiple lung diseases, including SPP1 + profibrotic monocyte-derived macrophages in COVID-19, pulmonary fibrosis and lung carcinoma. Overall, the HLCA serves as an example for the development and use of large-scale, cross-dataset organ atlases within the Human Cell Atlas.

Genetics

Immunology

1

Paper

Save

Best practices for single-cell analysis across modalities

Lukas Heumos et al.Mar 31, 2023

Recent advances in single-cell technologies have enabled high-throughput molecular profiling of cells across modalities and locations. Single-cell transcriptomics data can now be complemented by chromatin accessibility, surface protein expression, adaptive immune receptor repertoire profiling and spatial information. The increasing availability of single-cell data across modalities has motivated the development of novel computational methods to help analysts derive biological insights. As the field grows, it becomes increasingly difficult to navigate the vast landscape of tools and analysis steps. Here, we summarize independent benchmarking studies of unimodal and multimodal single-cell analysis across modalities to suggest comprehensive best-practice workflows for the most common analysis steps. Where independent benchmarks are not available, we review and contrast popular methods. Our article serves as an entry point for novices in the field of single-cell (multi-)omic analysis and guides advanced users to the most recent best practices.

Genetics

Molecular Biology

1

Paper

Save

Probe set selection for targeted spatial transcriptomics

Louis Kuemmerle et al.Aug 17, 2022

Abstract Targeted spatial transcriptomics methods capture the topology of cell types and states in tissues at single cell- and subcellular resolution by measuring the expression of a predefined set of genes. The selection of an optimal set of probed genes is crucial for capturing and interpreting the spatial signals present in a tissue. However, current selections often rely on marker genes, precluding them from detecting continuous spatial signals or novel states. We present Spapros, an end-to-end probe set selection pipeline that optimizes both probe set specificity for cell type identification and within-cell-type expression variation to resolve spatially distinct populations while taking into account prior knowledge, as well as probe design and expression constraints. To facilitate data analysis and interpretation, Spapros also provides rules for cell type identification. We evaluated Spapros by selecting probes on 6 different data sets and built an evaluation pipeline with 12 quality metrics to find that Spapros outperforms other selection approaches in both cell type recovery and recovering expression variation beyond cell types. Furthermore, we used Spapros to design a SCRINSHOT experiment of adult lung tissue to demonstrate how probes selected with Spapros identify cell types of interest and detect spatial variation even within cell types. Spapros enables optimal probe set selection, probe set evaluation, and probe design, as a freely available Python package.

Genetics

Artificial Intelligence

1

Paper

Save

Ex vivotissue perturbations coupled to single cell RNA-seq reveal multi-lineage cell circuit dynamics in human lung fibrogenesis

Niklas Lang et al.Jan 16, 2023

ABSTRACT Pulmonary fibrosis develops as a consequence of failed regeneration after injury. Analyzing mechanisms of regeneration and fibrogenesis directly in human tissue has been hampered by the lack of organotypic models and analytical techniques. In this work, we coupled ex vivo cytokine and drug perturbations of human precision-cut lung slices (hPCLS) with scRNAseq and induced a multi-lineage circuit of fibrogenic cell states in hPCLS, which we show to be highly similar to the in vivo cell circuit in a multi-cohort lung cell atlas from pulmonary fibrosis patients. Using micro-CT staged patient tissues, we characterized the appearance and interaction of myofibroblasts, an ectopic endothelial cell state and basaloid epithelial cells in the thickened alveolar septum of early-stage lung fibrosis. Induction of these states in the ex vivo hPCLS model provides evidence that the basaloid cell state was derived from alveolar type-2 cells, whereas the ectopic endothelial cell state emerged from capillary cell plasticity. Cell-cell communication routes in patients were largely conserved in the hPCLS model and anti-fibrotic drug treatments showed highly cell type specific effects. Our work provides an experimental framework for perturbational single cell genomics directly in human lung tissue that enables analysis of tissue homeostasis, regeneration and pathology. We further demonstrate that hPCLS offers novel avenues for scalable, high-resolution drug testing to accelerate anti-fibrotic drug development and translation.

Genetics

Molecular Biology

94

Paper

Save

hadge: a comprehensive pipeline for donor deconvolution in single cell

Fabiola Curion et al.Jul 25, 2023

Single cell multiplexing techniques (cell hashing and genetic multiplexing) allow to combine multiple samples, thereby optimizing sample processing and reducing batch effects. Cell hashing conjugates antibody-tags or chemical-oligonucleotides to cell membranes, while genetic multiplexing allows to mix genetically diverse samples and relies on aggregation of RNA reads at known genomic coordinates. We developed hadge ( ha shing d econvolution combined with ge notype information), a Nextflow pipeline that combines 12 methods to perform both hashing- and genotype-based deconvolution. We propose a joint deconvolution strategy combining the best performing methods and we demonstrate how this approach leads to recovery of previously discarded cells in a nuclei hashing of fresh-frozen brain tissue.

Genetics

Molecular Biology

48

Paper

Save

Cluster efficient pangenome graph construction with nf-core/pangenome

Simon Heumos et al.May 15, 2024

Abstract Motivation Pangenome graphs offer a comprehensive way of capturing genomic variability across multiple genomes. However, current construction methods often introduce biases, excluding complex sequences or relying on references. The PanGenome Graph Builder (PGGB) addresses these issues. To date, though, there is no state-of-the-art pipeline allowing for easy deployment, efficient and dynamic use of available resources, and scalable usage at the same time. Results To overcome these limitations, we present nf-core/pangenome , a reference-unbiased approach implemented in Nextflow following nf-core’s best practices. Leveraging biocontainers ensures portability and seamless deployment in HPC environments. Unlike PGGB, nf-core/pangenome distributes alignments across cluster nodes, enabling scalability. Demonstrating its efficiency, we constructed pangenome graphs for 1000 human chromosome 19 haplotypes and 2146 E. coli sequences, achieving a two to threefold speedup compared to PGGB without increasing greenhouse gas emissions. Availability nf-core/pangenome is released under the MIT open-source license, available on GitHub and Zenodo, with documentation accessible at https://nf-co.re/pangenome/1.1.2/docs/usage . Contact simon.heumos@qbic.uni-tuebingen.de , sven.nahnsen@qbic.uni-tuebingen.de

Signal Processing

Information Systems

0

Paper

Save

An integrated cell atlas of the human lung in health and disease

Lisa Sikkema et al.Mar 11, 2022

ABSTRACT Organ- and body-scale cell atlases have the potential to transform our understanding of human biology. To capture the variability present in the population, these atlases must include diverse demographics such as age and ethnicity from both healthy and diseased individuals. The growth in both size and number of single-cell datasets, combined with recent advances in computational techniques, for the first time makes it possible to generate such comprehensive large-scale atlases through integration of multiple datasets. Here, we present the integrated Human Lung Cell Atlas (HLCA) combining 46 datasets of the human respiratory system into a single atlas spanning over 2.2 million cells from 444 individuals across health and disease. The HLCA contains a consensus re-annotation of published and newly generated datasets, resolving under- or misannotation of 59% of cells in the original datasets. The HLCA enables recovery of rare cell types, provides consensus marker genes for each cell type, and uncovers gene modules associated with demographic covariates and anatomical location within the respiratory system. To facilitate the use of the HLCA as a reference for single-cell lung research and allow rapid analysis of new data, we provide an interactive web portal to project datasets onto the HLCA. Finally, we demonstrate the value of the HLCA reference for interpreting disease-associated changes. Thus, the HLCA outlines a roadmap for the development and use of organ-scale cell atlases within the Human Cell Atlas.

Artificial Intelligence

Oncology

-1

Paper

Artificial Intelligence

Oncology

0

Save