ResearchHub | Open Science Community

Immunopeptidomics for Dummies: Detailed Experimental Protocols and Rapid, User-Friendly Visualization of MHC I and II Ligand Datasets with MhcVizPipe

Kevin Kovalchik et al.Nov 3, 2020

ABSTRACT Immunopeptidomics refers to the science of investigating the composition and dynamics of peptides presented by major histocompatibility complex (MHC) class I and class II molecules using mass spectrometry (MS). Here, we aim to provide a technical report to any non-expert in the field wishing to establish and/or optimize an immunopeptidomic workflow with relatively limited computational knowledge and resources. To this end, we thoroughly describe step-by-step instructions to isolate MHC class I and II-associated peptides from various biological sources, including mouse and human biospecimens. Most notably, we created MhcVizPipe (MVP) ( https://github.com/CaronLab/MhcVizPipe ), a new and easy-to-use open-source software tool to rapidly assess the quality and the specific enrichment of immunopeptidomic datasets upon the establishment of new workflows. In fact, MVP enables intuitive visualization of multiple immunopeptidomic datasets upon testing sample preparation protocols and new antibodies for the isolation of MHC class I and II peptides. In addition, MVP enables the identification of unexpected binding motifs and facilitates the analysis of non-canonical MHC peptides. We anticipate that the experimental and bioinformatic resources provided herein will represent a great starting point for any non-expert and will therefore foster the accessibility and expansion of the field to ultimately boost its maturity and impact.

Genetics

Artificial Intelligence

6

Paper

Save

Data-driven approaches for genetic characterization of SARS-CoV-2 lineages

Fatima Mostefai et al.Sep 29, 2021

Abstract The genome of the Severe Acute Respiratory Syndrome coronavirus 2 (SARS-CoV-2), the pathogen that causes coronavirus disease 2019 (COVID-19), has been sequenced at an unprecedented scale, leading to a tremendous amount of viral genome sequencing data. To understand the evolution of this virus in humans, and to assist in tracing infection pathways and designing preventive strategies, we present a set of computational tools that span phylogenomics, population genetics and machine learning approaches. To illustrate the utility of this toolbox, we detail an in depth analysis of the genetic diversity of SARS-CoV-2 in first year of the COVID-19 pandemic, using 329,854 high-quality consensus sequences published in the GISAID database during the pre-vaccination phase. We demonstrate that, compared to standard phylogenetic approaches, haplotype networks can be computed efficiently on much larger datasets, enabling real-time analyses. Furthermore, time series change of Tajima’s D provides a powerful metric of population expansion. Unsupervised learning techniques further highlight key steps in variant detection and facilitate the study of the role of this genomic variation in the context of SARS-CoV-2 infection, with Multiscale PHATE methodology identifying fine-scale structure in the SARS-CoV-2 genetic data that underlies the emergence of key lineages. The computational framework presented here is useful for real-time genomic surveillance of SARS-CoV-2 and could be applied to any pathogen that threatens the health of worldwide populations of humans and other organisms.

Genetics

Paleontology

26

Paper

Save

The mutational landscape of SARS-CoV-2 variants diversifies T cell targets in an HLA supertype-dependent manner

David Hamelin et al.Jun 3, 2021

+13

J

D

SUMMARY The rapid, global dispersion of SARS-CoV-2 since its initial identification in December 2019 has led to the emergence of a diverse range of variants. The initial concerns regarding the virus were quickly compounded with concerns relating to the impact of its mutated forms on viral infectivity, pathogenicity and immunogenicity. To address the latter, we seek to understand how the mutational landscape of SARS-CoV-2 has shaped HLA-restricted T cell immunity at the population level during the first year of the pandemic, before mass vaccination. We analyzed a total of 330,246 high quality SARS-CoV-2 genome assemblies sampled across 143 countries and all major continents. Strikingly, we found that specific mutational patterns in SARS-CoV-2 diversify T cell epitopes in an HLA supertype-dependent manner. In fact, we observed that proline residues are preferentially removed from the proteome of prevalent mutants, leading to a predicted global loss of SARS-CoV-2 T cell epitopes in individuals expressing HLA-B alleles of the B7 supertype family. In addition, we show that this predicted global loss of epitopes is largely driven by a dominant C-to-U mutation type at the RNA level. These results indicate that B7 supertype-associated epitopes, including the most immunodominant ones, were more likely to escape CD8+ T cell immunosurveillance during the first year of the pandemic. Together, our study lays the foundation to help understand how SARS-CoV-2 mutants shape the repertoire of T cell targets and T cell immunity across human populations. The proposed theoretical framework has implications in viral evolution, disease severity, vaccine resistance and herd immunity.

Genetics

Immunology

25

Paper

Genetics

2

0

Save