ResearchHub | Open Science Community

Pardis Sabeti

Author with expertise in Ebola Virus Research and Outbreaks

Harvard University, Howard Hughes Medical Institute, Broad Institute


Achievements

Open Access Advocate

Cited Author

Key Stats

Upvotes received:

Publications:

(53% Open Access)

Cited by:

h-index:

i10-index:

204

Reputation

Biology

< 1%

Chemistry

< 1%

Economics

< 1%


Publications

SARS-CoV-2 infected cells present HLA-I peptides from canonical and out-of-frame ORFs

Shira Weingarten-Gabbay et al., Oct 24, 2023


T cell-mediated immunity may play a critical role in controlling and establishing protective immunity against SARS-CoV-2 infection; yet the repertoire of viral epitopes responsible for T cell response activation remains mostly unknown. Identification of viral peptides presented on class I human leukocyte antigen (HLA-I) can reveal epitopes for recognition by cytotoxic T cells and potential incorporation into vaccines. Here, we report the first HLA-I immunopeptidome of SARS-CoV-2 in two human cell lines at different times post-infection using mass spectrometry. We found HLA-I peptides derived not only from canonical ORFs, but also from internal out-of-frame ORFs in Spike and Nucleoprotein not captured by current vaccines. Proteomics analyses of infected cells revealed that SARS-CoV-2 may interfere with antigen processing and immune signaling pathways. Based on the endogenously processed and presented viral peptides that we identified, we estimate that a pool of 24 peptides would provide one or more peptides for presentation by at least one HLA allele in 99% of the human population. These biological insights and the list of naturally presented SARS-CoV-2 peptides will facilitate data-driven selection of peptides for immune monitoring and vaccine development.
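The population-coverage estimate in this abstract can be read as a set-cover style calculation. The sketch below is only an illustration under stated assumptions, not the authors' method: the peptide names, the peptide-to-allele map `presents`, the toy `population` of HLA genotypes, and the helper functions are all hypothetical. It greedily adds the peptide that most increases the fraction of individuals carrying at least one allele that presents at least one pooled peptide.

```python
# Hypothetical toy data: which HLA-I alleles present each peptide.
presents = {
    "pepA": {"A*02:01"},
    "pepB": {"B*07:02"},
    "pepC": {"A*02:01", "A*24:02"},
}

# Hypothetical individuals, each described by the set of alleles they carry.
population = [
    {"A*02:01", "B*07:02"},
    {"A*24:02"},
    {"B*07:02"},
    {"A*02:01"},
]


def is_covered(individual_alleles, pool, presents):
    """True if some pooled peptide is presented by some allele the person carries."""
    return any(
        allele in presents[peptide]
        for peptide in pool
        for allele in individual_alleles
    )


def population_coverage(population, pool, presents):
    """Fraction of individuals covered by at least one peptide in the pool."""
    hits = sum(is_covered(person, pool, presents) for person in population)
    return hits / len(population)


def greedy_pool(population, presents, target=0.99):
    """Greedily pick peptides until the target coverage is reached or no gain remains."""
    pool = []
    remaining = sorted(presents)  # sorted for a deterministic tie-break
    while remaining:
        current = population_coverage(population, pool, presents)
        if current >= target:
            break
        best = max(
            remaining,
            key=lambda p: population_coverage(population, pool + [p], presents),
        )
        if population_coverage(population, pool + [best], presents) <= current:
            break  # no remaining peptide improves coverage
        pool.append(best)
        remaining.remove(best)
    return pool


pool = greedy_pool(population, presents, target=1.0)
print(pool, population_coverage(population, pool, presents))
```

Greedy selection is a common heuristic for set cover and does not guarantee the smallest possible pool. A real analysis would also need allele frequencies for the populations of interest rather than a flat list of genotypes.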

ORFs

Biology

Human Leukocyte Antigen


DNA spike-ins enable confident interpretation of SARS-CoV-2 genomic data from amplicon-based sequencing

Kim Lagerborg et al., Oct 24, 2023


The rapid global spread and continued evolution of SARS-CoV-2 has highlighted an unprecedented need for viral genomic surveillance and clinical viral sequencing. Amplicon-based sequencing methods provide a sensitive, low-cost and rapid approach but suffer a high potential for contamination, which can undermine lab processes and results. This challenge will only increase with expanding global production of sequences by diverse research groups for epidemiological and clinical interpretation. We present an approach which uses synthetic DNA spike-ins (SDSIs) to track samples and detect inter-sample contamination through a sequencing workflow. Applying this approach to the ARTIC Consortium's amplicon design, we define a series of best practices for Illumina-based sequencing and provide a detailed characterization of approaches to increase sensitivity for low-viral load samples incorporating the SDSIs. We demonstrate the utility and efficiency of the SDSI method amidst a real-time investigation of a suspected hospital cluster of SARS-CoV-2 cases.
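The sample-tracking idea in this abstract can be sketched as a simple per-sample check. This is a minimal illustration, assuming each sample was given one known spike-in and that reads have already been counted per spike-in. The function name `check_sample`, the spike-in labels, and the 1% threshold are hypothetical and are not taken from the paper.

```python
def check_sample(expected_sdsi, sdsi_read_counts, max_unexpected_fraction=0.01):
    """Classify one sample from its per-spike-in read counts.

    expected_sdsi: label of the spike-in that was added to this sample.
    sdsi_read_counts: dict mapping spike-in label -> number of reads observed.
    max_unexpected_fraction: hypothetical cutoff on the share of spike-in reads
        that may come from any single spike-in other than the expected one.
    """
    total = sum(sdsi_read_counts.values())
    if total == 0 or sdsi_read_counts.get(expected_sdsi, 0) == 0:
        # The assigned spike-in was not seen: possible sample swap or failed prep.
        return {"status": "expected spike-in missing", "unexpected": {}}

    # Collect other spike-ins whose share of spike-in reads exceeds the cutoff.
    unexpected = {
        name: count / total
        for name, count in sdsi_read_counts.items()
        if name != expected_sdsi and count / total > max_unexpected_fraction
    }
    status = "possible contamination" if unexpected else "ok"
    return {"status": status, "unexpected": unexpected}


# Hypothetical read counts for three samples.
print(check_sample("SDSI_01", {"SDSI_01": 1000, "SDSI_02": 2}))
print(check_sample("SDSI_01", {"SDSI_01": 980, "SDSI_02": 20}))
print(check_sample("SDSI_03", {"SDSI_01": 500}))
```

Because every sample carries a different spike-in, the label of an unexpected spike-in also points to the sample that is the likely source of the contaminating material.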


Co-circulating mumps lineages at multiple geographic scales

Shirlee Wohl et al., May 7, 2020


Despite widespread vaccination, eleven thousand mumps cases were reported in the United States (US) in 2016–17, including hundreds in Massachusetts, primarily in college settings. We generated 203 whole genome mumps virus (MuV) sequences from Massachusetts and 15 other states to understand the dynamics of mumps spread locally and nationally, as well as to search for variants potentially related to vaccination. We observed multiple MuV lineages circulating within Massachusetts during 2016–17, evidence for multiple introductions of the virus to the state, and extensive geographic movement of MuV within the US on short time scales. We found no evidence that variants arising during this outbreak contributed to vaccine escape. Combining epidemiological and genomic data, we observed multiple co-circulating clades within individual universities as well as spillover into the local community. Detailed data from one well-sampled university allowed us to estimate an effective reproductive number within that university significantly greater than one. We also used publicly available small hydrophobic (SH) gene sequences to estimate migration between world regions and to place this outbreak in a global context, but demonstrate that these short sequences, historically used for MuV genotyping, are inadequate for tracing detailed transmission. Our findings suggest continuous, often undetected, circulation of mumps both locally and nationally, and highlight the value of combining genomic and epidemiological data to track viral disease transmission at high resolution.

Context (Archaeology)

Clade

Outbreak


Single-cell profiling of Ebola virus infection in vivo reveals viral and host transcriptional dynamics

Dylan Kotliar et al., Oct 24, 2023


Ebola virus (EBOV) causes epidemics with high case fatality rates, yet remains understudied due to the challenge of experimentation in high-containment and outbreak settings. To better understand EBOV infection in vivo, we used single-cell transcriptomics and CyTOF-based single-cell protein quantification to characterize peripheral immune cell activity during EBOV infection in rhesus monkeys. We obtained 100,000 transcriptomes and 15,000,000 protein profiles, providing insight into pathogenesis. We find that immature, proliferative monocyte-lineage cells with reduced antigen presentation capacity replace conventional circulating monocyte subsets within days of infection, while lymphocytes upregulate apoptosis genes and decline in abundance. By quantifying viral RNA abundance in individual cells, we identify molecular determinants of tropism and examine temporal dynamics in viral and host gene expression. Within infected cells, we observe that EBOV down-regulates STAT1 mRNA and interferon signaling, and up-regulates putative pro-viral genes (e.g., DYNLL1 and HSPA5), nominating cellular pathways the virus manipulates for its replication. Overall, this study sheds light on EBOV tropism, replication dynamics, and elicited immune response, and provides a framework for characterizing interactions between hosts and emerging viruses in a maximum containment setting.

Biology

Viral Replication

Ebola Virus


Model-directed generation of CRISPR-Cas13a guide RNAs designs artificial sequences that improve nucleic acid detection

Sreekar Mantena et al., Sep 21, 2023

Generating maximally-fit biological sequences has the potential to transform CRISPR guide RNA design as it has other areas of biomedicine. Here, we introduce model-directed exploration algorithms (MEAs) for designing maximally-fit, artificial CRISPR-Cas13a guides - with multiple mismatches to any natural sequence - that are tailored for desired properties around nucleic acid diagnostics. We find that MEA-designed guides offer more sensitive detection of diverse pathogens and discrimination of pathogen variants compared to guides derived directly from natural sequences, and illuminate interpretable design principles that broaden Cas13a targeting.


Designing viral diagnostics with model-based optimization

Hayden Metsky et al., Oct 24, 2023


Diagnostics, particularly for rapidly evolving viruses, stand to benefit from a principled, measurement-driven design that harnesses machine learning and vast genomic data, yet the capability for such design has not been previously built. Here, we develop and extensively validate an approach to designing viral diagnostics that applies a learned model within a combinatorial optimization framework. Concentrating on CRISPR-based diagnostics, we screen a library of 19,209 diagnostic–target pairs and train a deep neural network that predicts, from RNA sequence alone, diagnostic signal better than contemporary techniques. Our model then makes it possible to design assays that are maximally sensitive over the spectrum of a virus’s genomic variation. We introduce ADAPT (https://adapt.guide), a system for fully-automated design, and use ADAPT to design optimal diagnostics for the 1,933 vertebrate-infecting viral species within 2 hours for most species and 24 hours for all but 3. We experimentally show ADAPT’s designs are sensitive and specific down to the lineage level, including against viruses that pose challenges involving genomic variation and specificity. ADAPT’s designs exhibit significantly higher fluorescence and permit lower limits of detection, across a virus’s entire variation, than the outputs of standard design techniques. Our model-based optimization strategy has applications broadly to viral nucleic acid diagnostics and other sequence-based technologies, and, paired with clinical validation, could enable a critically-needed, proactive resource of assays for surveilling and responding to pathogens.

CRISPR

Computer Science

Computational Biology


Multi-center integrated analysis of non-coding CRISPR screens

David Yao et al., Oct 24, 2023


The ENCODE Consortium’s efforts to annotate non-coding, cis-regulatory elements (CREs) have advanced our understanding of gene regulatory landscapes which play a major role in health and disease. Pooled, non-coding CRISPR screens are a promising approach for systematically investigating gene regulatory mechanisms. Here, the ENCODE Functional Characterization Centers report 109 screens comprising 346,970 individual perturbations across 13.3Mb of the genome, using a variety of methods, readouts, and statistical analyses. Across 332 functionally confirmed CRE-gene links, we identify principles for screening endogenous, non-coding elements for causal regulatory mechanisms. Nearly all CREs show strong evidence of open chromatin, and targeting accessibility peak summits is a critical component of our proposed sgRNA design rules. We provide experimental guidelines to accurately detect CREs with variable, often low, transcriptional effects. We discover a previously undescribed DNA strand-bias for CRISPRi in transcribed regions with implications for screen design and analysis. Benchmarking five screen analysis tools, we find CASA produces the most conservative CRE calls and is robust to artifacts of low-specificity sgRNAs. Together, we provide an accessible data resource, predesigned sgRNAs targeting 3,275,697 ENCODE SCREEN candidate CREs, and screening guidelines to accelerate functional characterization of the non-coding genome.

ENCODE

Computational Biology

Genome


Identifying Gene Expression Programs of Cell-type Identity and Cellular Activity with Single-Cell RNA-Seq

Dylan Kotliar et al., May 6, 2020

Identifying gene expression programs underlying cell-type identity and cellular processes is a crucial step toward understanding the organization of cells and tissues. Although single-cell RNA-Seq (scRNA-Seq) can quantify transcripts in individual cells, each cell's expression may derive both from programs determining cell-type and from programs facilitating dynamic cellular activities such as cell-division or apoptosis, which cannot be easily disentangled with current methods. Here, we introduce clustered non-negative matrix factorization (cNMF) as a solution to this problem. We show with simulations that it deconvolutes scRNA-Seq profiles into interpretable programs corresponding to both cell-types and cellular activities. Applied to published brain organoid and visual cortex datasets, cNMF refines the hierarchy of cell-types and identifies both expected (e.g. cell-cycle and hypoxia) and intriguing novel activity programs. In summary, we show that cNMF can increase the accuracy of cell-type identification while simultaneously inferring interpretable cellular activity programs in scRNA-Seq data, thus providing useful insight into how cells vary dynamically within cell-types.


Genome-wide functional screen of 3’UTR variants uncovers causal variants for human disease and evolution

Dustin Griesemer et al., Jan 24, 2021

3’ untranslated region (3’UTR) variants are strongly associated with human traits and diseases, yet few have been causally identified. We developed the Massively Parallel Reporter Assay for 3’UTRs (MPRAu) to sensitively assay 12,173 3’UTR variants. We applied MPRAu to six human cell lines, focusing on genetic variants associated with genome-wide association studies (GWAS) and human evolutionary adaptation. MPRAu expands our understanding of 3’UTR function, suggesting that low-complexity sequences predominately explain 3’UTR regulatory activity. We adapt MPRAu to uncover diverse molecular mechanisms at base-pair resolution, including an AU-rich element of linked to potential metabolic evolutionary adaptations in East Asians. We nominate hundreds of 3’UTR causal variants with genetically fine-mapped phenotype associations. Using endogenous allelic replacements, we characterize one variant that disrupts a miRNA site regulating the viral defense gene, and one that alters abundance, nominating a causal variant underlying transcriptional changes in age-related macular degeneration.

Biology

Genetics

Three Prime Untranslated Region


Fine-mapping the Favored Mutation in a Positive Selective Sweep

Ali Akbari et al., May 7, 2020

Methods to identify signatures of selective sweeps in population genomics data have been actively developed, but mostly do not identify the specific mutation favored by the selective sweep. We present a method, iSAFE, that uses a statistic derived solely from population genetics signals to pinpoint the favored mutation even when the signature of selection extends to 5Mbp. iSAFE was tested extensively on simulated data and in human populations from the 1000 Genomes Project, at 22 loci with previously characterized selective sweeps. For 14 of the 22 loci, iSAFE ranked the previously characterized candidate mutation among the 13 highest scoring (out of ~21,000 variants). Three loci did not show a strong signal. For the 5 remaining loci, iSAFE identified previously unreported mutations as being favored. In these regions, all of which involve pigmentation related genes, iSAFE identified identical selected mutations in multiple non-African populations suggesting an out-of-Africa onset of selection. The iSAFE software can be downloaded from https://github.com/alek0991/iSAFE.
