ResearchHub | Open Science Community

A Saturated Map of Common Genetic Variants Associated with Human Height from 5.4 Million Individuals of Diverse Ancestries

Loïc Yengo et al.Jan 10, 2022

ABSTRACT Common SNPs are predicted to collectively explain 40-50% of phenotypic variation in human height, but identifying the specific variants and associated regions requires huge sample sizes. Here we show, using GWAS data from 5.4 million individuals of diverse ancestries, that 12,111 independent SNPs that are significantly associated with height account for nearly all of the common SNP-based heritability. These SNPs are clustered within 7,209 non-overlapping genomic segments with a median size of ~90 kb, covering ~21% of the genome. The density of independent associations varies across the genome and the regions of elevated density are enriched for biologically relevant genes. In out-of-sample estimation and prediction, the 12,111 SNPs account for 40% of phenotypic variance in European ancestry populations but only ~10%-20% in other ancestries. Effect sizes, associated regions, and gene prioritization are similar across ancestries, indicating that reduced prediction accuracy is likely explained by linkage disequilibrium and allele frequency differences within associated regions. Finally, we show that the relevant biological pathways are detectable with smaller sample sizes than needed to implicate causal genes and variants. Overall, this study, the largest GWAS to date, provides an unprecedented saturated map of specific genomic regions containing the vast majority of common height-associated variants.

Genetics

Biology

3

Paper

Save

Refining The Accuracy Of Validated Target Identification Through Coding Variant Fine-Mapping In Type 2 Diabetes

Anubha Mahajan et al.May 31, 2017

Identification of coding variant associations for complex diseases offers a direct route to biological insight, but is dependent on appropriate inference concerning the causal impact of those variants on disease risk. We aggregated coding variant data for 81,412 type 2 diabetes (T2D) cases and 370,832 controls of diverse ancestry, identifying 40 distinct coding variant association signals (at 38 loci) reaching significance (p<2.2x10-7). Of these, 16 represent novel associations mapping outside known genome-wide association study (GWAS) signals. We make two important observations. First, despite a threefold increase in sample size over previous efforts, only five of the 40 signals are driven by variants with minor allele frequency <5%, and we find no evidence for low-frequency variants with allelic odds ratio >1.29. Second, we used GWAS data from 50,160 T2D cases and 465,272 controls of European ancestry to fine-map these associated coding variants in their regional context, with and without additional weighting to account for the global enrichment of complex trait association signals in coding exons. At the 37 signals for which we attempted fine-mapping, we demonstrate convincing support (posterior probability >80% under the 'annotation-weighted' model) that coding variants are causal for the association at 16 (including novel signals involving POC5 p.His36Arg, ANKH p.Arg187Gln, WSCD2 p.Thr113Ile, PLCB3 p.Ser778Leu, and PNPLA3 p.Ile148Met). However, at 13 of the 37 loci, the associated coding variants represent 'false leads' and naïve analysis could have led to an erroneous inference regarding the effector transcript mediating the signal. Accurate identification of validated targets is dependent on correct specification of the contribution of coding and non-coding mediated mechanisms at associated loci.

Genetics

Molecular Biology

0

Paper

Save

Atrial Fibrillation Genetic Risk Differentiates Cardioembolic Stroke from other Stroke Subtypes

Eric Boerwinkle et al.Dec 24, 2017

Atrial fibrillation is a prevalent arrhythmia associated with a five-fold increased risk of ischemic stroke, and specifically the cardioembolic stroke subtype. Genome-wide association studies of these traits have yielded overlapping risk loci, but genome-wide investigation of genetic susceptibility shared between stroke and atrial fibrillation is lacking. Comparing the genetic architectures of the two diseases could inform whether cardioembolic strokes are driven by inherited atrial fibrillation susceptibility, and may help elucidate ischemic stroke mechanisms. Here, we analyze genome-wide genotyping data and estimate SNP-based heritability in atrial fibrillation and cardioembolic stroke to be nearly identical (20.0% and 19.5%, respectively). Further, we find that the traits are genetically correlated (r=0.77 for SNPs with p < 4.4 x 10-4 in a previous atrial fibrillation meta-analysis). Clinical studies are warranted to assess whether genetic susceptibility to atrial fibrillation can be leveraged to improve the diagnosis and care of ischemic stroke patients.

Genetics

Internal Medicine

0

Paper

Genetics

Internal Medicine

0

Save