ResearchHub | Open Science Community

Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program

Daniel Taliun et al. · Feb 10, 2021

The Trans-Omics for Precision Medicine (TOPMed) programme seeks to elucidate the genetic architecture and biology of heart, lung, blood and sleep disorders, with the ultimate goal of improving diagnosis, treatment and prevention of these diseases. The initial phases of the programme focused on whole-genome sequencing of individuals with rich phenotypic data and diverse backgrounds. Here we describe the TOPMed goals and design as well as the available resources and early insights obtained from the sequence data. The resources include a variant browser, a genotype imputation server, and genomic and phenotypic data that are available through dbGaP (Database of Genotypes and Phenotypes) [1]. In the first 53,831 TOPMed samples, we detected more than 400 million single-nucleotide and insertion or deletion variants after alignment with the reference genome. Additional previously undescribed variants were detected through assembly of unmapped reads and customized analysis in highly variable loci. Among the more than 400 million detected variants, 97% have frequencies of less than 1% and 46% are singletons that are present in only one individual (53% among unrelated individuals). These rare variants provide insights into mutational processes and recent human evolutionary history. The extensive catalogue of genetic variation in TOPMed studies provides unique opportunities for exploring the contributions of rare and noncoding sequence variants to phenotypic variation. Furthermore, combining TOPMed haplotypes with modern imputation methods improves the power and reach of genome-wide association studies to include variants down to a frequency of approximately 0.01%.

Genetics

Molecular Biology

Common genetic variants influence human subcortical brain structures

Derrek Hibar et al. · Jan 20, 2015

The highly complex structure of the human brain is strongly shaped by genetic influences. Subcortical brain regions form circuits with cortical areas to coordinate movement, learning, memory and motivation, and altered circuits can lead to abnormal behaviour and disease. To investigate how common genetic variants affect the structure of these brain regions, here we conduct genome-wide association studies of the volumes of seven subcortical regions and the intracranial volume derived from magnetic resonance images of 30,717 individuals from 50 cohorts. We identify five novel genetic variants influencing the volumes of the putamen and caudate nucleus. We also find stronger evidence for three loci with previously established influences on hippocampal volume and intracranial volume. These variants show specific volumetric effects on brain structures rather than global effects across structures. The strongest effects were found for the putamen, where a novel intergenic locus with replicable influence on volume (rs945270; P = 1.08 × 10^-33; 0.52% variance explained) showed evidence of altering the expression of the KTN1 gene in both brain and blood tissue. Variants influencing putamen volume clustered near developmental genes that regulate apoptosis, axon guidance and vesicle transport. Identification of these genetic variants provides insight into the causes of variability in human brain development, and may help to determine mechanisms of neuropsychiatric dysfunction.

Genetics

Molecular Biology

Identification of common variants associated with human hippocampal and intracranial volumes

Jason Stein et al. · Apr 15, 2012

Paul Thompson and colleagues report a genome-wide association study for hippocampal, intracranial and total brain volume. They identify a locus at 12q24 associated with hippocampal volume and a locus at 12q14 associated with intracranial volume. Identifying genetic variants influencing human brain structures may reveal new biological mechanisms underlying cognition and neuropsychiatric illness. The volume of the hippocampus is a biomarker of incipient Alzheimer's disease [1,2] and is reduced in schizophrenia [3], major depression [4] and mesial temporal lobe epilepsy [5]. Whereas many brain imaging phenotypes are highly heritable [6,7], identifying and replicating genetic influences has been difficult, as small effects and the high costs of magnetic resonance imaging (MRI) have led to underpowered studies. Here we report genome-wide association meta-analyses and replication for mean bilateral hippocampal, total brain and intracranial volumes from a large multinational consortium. The intergenic variant rs7294919 was associated with hippocampal volume (12q24.22; N = 21,151; P = 6.70 × 10^−16) and the expression levels of the positional candidate gene TESC in brain tissue. Additionally, rs10784502, located within HMGA2, was associated with intracranial volume (12q14.3; N = 15,782; P = 1.12 × 10^−12). We also identified a suggestive association with total brain volume at rs10494373 within DDR2 (1q23.3; N = 6,500; P = 5.81 × 10^−7).

Genetics

Epidemiology

Discovery of expression QTLs using large-scale transcriptional profiling in human lymphocytes

Harald Göring et al. · Sep 16, 2007

Genetics

Molecular Biology

Inherited causes of clonal haematopoiesis in 97,691 whole genomes

Alexander Bick et al. · Oct 14, 2020

Age is the dominant risk factor for most chronic human diseases, but the mechanisms through which ageing confers this risk are largely unknown [1]. The age-related acquisition of somatic mutations that lead to clonal expansion in regenerating haematopoietic stem cell populations has recently been associated with both haematological cancer [2–4] and coronary heart disease [5]; this phenomenon is termed clonal haematopoiesis of indeterminate potential (CHIP) [6]. Simultaneous analyses of germline and somatic whole-genome sequences provide the opportunity to identify root causes of CHIP. Here we analyse high-coverage whole-genome sequences from 97,691 participants of diverse ancestries in the National Heart, Lung, and Blood Institute Trans-omics for Precision Medicine (TOPMed) programme, and identify 4,229 individuals with CHIP. We identify associations with blood cell, lipid and inflammatory traits that are specific to different CHIP driver genes. Association of a genome-wide set of germline genetic variants enabled the identification of three genetic loci associated with CHIP status, including one locus at TET2 that was specific to individuals of African ancestry. In silico-informed in vitro evaluation of the TET2 germline locus enabled the identification of a causal variant that disrupts a TET2 distal enhancer, resulting in increased self-renewal of haematopoietic stem cells. Overall, we observe that germline genetic variation shapes haematopoietic stem cell function, leading to CHIP through mechanisms that are specific to clonal haematopoiesis as well as shared mechanisms that lead to somatic mutations across tissues. Analysis of 97,691 high-coverage human blood DNA-derived whole-genome sequences enabled simultaneous identification of germline and somatic mutations that predispose individuals to clonal expansion of haematopoietic stem cells, indicating that both inherited and acquired mutations are linked to age-related cancers and coronary heart disease.

Genetics

Molecular Biology

Long-term neural and physiological phenotyping of a single human

Russell Poldrack et al. · Dec 9, 2015

Psychiatric disorders are characterized by major fluctuations in psychological function over the course of weeks and months, but the dynamic characteristics of brain function over this timescale in healthy individuals are unknown. Here, as a proof of concept to address this question, we present the MyConnectome project. An intensive phenome-wide assessment of a single human was performed over a period of 18 months, including functional and structural brain connectivity using magnetic resonance imaging, psychological function and physical health, gene expression and metabolomics. A reproducible analysis workflow is provided, along with open access to the data and an online browser for results. We demonstrate dynamic changes in brain connectivity over the timescales of days to months, and relations between brain connectivity, gene expression and metabolites. This resource can serve as a testbed to study the joint dynamics of human brain and metabolic function over time, an approach that is critical for the development of precision medicine strategies for brain disorders.

Biochemistry

Molecular Biology

Multi-site genetic analysis of diffusion images and voxelwise heritability analysis: A pilot project of the ENIGMA–DTI working group

Neda Jahanshad et al. · Apr 27, 2013

The ENIGMA (Enhancing NeuroImaging Genetics through Meta-Analysis) Consortium was set up to analyze brain measures and genotypes from multiple sites across the world to improve the power to detect genetic variants that influence the brain. Diffusion tensor imaging (DTI) yields quantitative measures sensitive to brain development and degeneration, and some common genetic variants may be associated with white matter integrity or connectivity. DTI measures, such as the fractional anisotropy (FA) of water diffusion, may be useful for identifying genetic variants that influence brain microstructure. However, genome-wide association studies (GWAS) require large populations to obtain sufficient power to detect and replicate significant effects, motivating a multi-site consortium effort. As part of an ENIGMA–DTI working group, we analyzed high-resolution FA images from multiple imaging sites across North America, Australia, and Europe, to address the challenge of harmonizing imaging data collected at multiple sites. Four hundred images of healthy adults aged 18–85 from four sites were used to create a template and corresponding skeletonized FA image as a common reference space. Using twin and pedigree samples of different ethnicities, we used our common template to evaluate the heritability of tract-derived FA measures. We show that our template is reliable for integrating multiple datasets by combining results through meta-analysis and unifying the data through exploratory mega-analyses. Our results may help prioritize regions of the FA map that are consistently influenced by additive genetic factors for future genetic discovery studies. Protocols and templates are publicly available at http://enigma.loni.ucla.edu/ongoing/dti-working-group/.

Genetics

Artificial Intelligence

Genetic variation in selenoprotein S influences inflammatory response

Joanne Curran et al. · Oct 9, 2005

Chronic inflammation has a pathological role in many common diseases and is influenced by both genetic and environmental factors. Here we assess the role of genetic variation in selenoprotein S (SEPS1, also called SELS or SELENOS), a gene involved in stress response in the endoplasmic reticulum and inflammation control. After resequencing SEPS1, we genotyped 13 SNPs in 522 individuals from 92 families. As inflammation biomarkers, we measured plasma levels of IL-6, IL-1β and TNF-α. Bayesian quantitative trait nucleotide analysis identified associations between SEPS1 polymorphisms and all three proinflammatory cytokines. One promoter variant, −105G → A, showed strong evidence for an association with each cytokine (multivariate P = 0.0000002). Functional analysis of this polymorphism showed that the A variant significantly impaired SEPS1 expression after exposure to endoplasmic reticulum stress agents (P = 0.00006). Furthermore, suppression of SEPS1 by short interfering RNA in macrophage cells increased the release of IL-6 and TNF-α. To investigate further the significance of the observed associations, we genotyped −105G → A in 419 Mexican American individuals from 23 families for replication. This analysis confirmed a significant association with both TNF-α (P = 0.0049) and IL-1β (P = 0.0101). These results provide a direct mechanistic link between SEPS1 and the production of inflammatory cytokines and suggest that SEPS1 has a role in mediating inflammation.

Genetics

Biochemistry

Plasma lipid profiling in a large population-based cohort

Jacquelyn Weir et al. · Jul 19, 2013

Biochemistry

Physiology

Plasma Lipid Profiling Shows Similar Associations with Prediabetes and Type 2 Diabetes

Peter Meikle et al. · Sep 27, 2013

The relationship of lipid metabolism to prediabetes (impaired fasting glucose and impaired glucose tolerance) and type 2 diabetes mellitus is poorly defined. We hypothesized that a lipidomic analysis of plasma lipids might improve the understanding of this relationship. We performed lipidomic analysis measuring 259 individual lipid species, including sphingolipids, phospholipids, glycerolipids and cholesterol esters, on fasting plasma from 117 type 2 diabetes, 64 prediabetes and 170 normal glucose tolerant participants in the Australian Diabetes, Obesity and Lifestyle Study (AusDiab), then validated our findings on 1076 individuals from the San Antonio Family Heart Study (SAFHS). Logistic regression analysis identified associations with type 2 diabetes (135 lipids) and prediabetes (134 lipids) after adjusting for multiple covariates. In addition to the expected associations with diacylglycerol, triacylglycerol and cholesterol esters, type 2 diabetes and prediabetes were positively associated with ceramide and its precursor dihydroceramide, along with phosphatidylethanolamine, phosphatidylglycerol and phosphatidylinositol. Significant negative associations were observed with the ether-linked phospholipids alkylphosphatidylcholine and alkenylphosphatidylcholine. Most of the significant associations in the AusDiab cohort (90%) were subsequently validated in the SAFHS cohort. The aberration of the plasma lipidome associated with type 2 diabetes is clearly present in prediabetes, prior to the onset of type 2 diabetes. Lipid classes and species associated with type 2 diabetes provide support for a number of existing paradigms of dyslipidemia and suggest new avenues of investigation.

Epidemiology

Internal Medicine
