ResearchHub | Open Science Community

Genome-wide association meta-analysis of 78,308 individuals identifies new loci and genes influencing human intelligence

Suzanne Sniekers et al.May 22, 2017

Danielle Posthuma and colleagues perform a large meta-analysis for intelligence and determine genetic overlap with several neuropsychiatric and metabolic traits. They find 15 new significant loci and implicate 40 new genes, most of which are predominantly expressed in the brain. Intelligence is associated with important economic and health-related life outcomes1. Despite intelligence having substantial heritability2 (0.54) and a confirmed polygenic nature, initial genetic studies were mostly underpowered3,4,5. Here we report a meta-analysis for intelligence of 78,308 individuals. We identify 336 associated SNPs (METAL P < 5 × 10−8) in 18 genomic loci, of which 15 are new. Around half of the SNPs are located inside a gene, implicating 22 genes, of which 11 are new findings. Gene-based analyses identified an additional 30 genes (MAGMA P < 2.73 × 10−6), of which all but one had not been implicated previously. We show that the identified genes are predominantly expressed in brain tissue, and pathway analysis indicates the involvement of genes regulating cell development (MAGMA competitive P = 3.5 × 10−6). Despite the well-known difference in twin-based heritability2 for intelligence in childhood (0.45) and adulthood (0.80), we show substantial genetic correlation (rg = 0.89, LD score regression P = 5.4 × 10−29). These findings provide new insight into the genetic architecture of intelligence.

Genetics

Internal Medicine

3

Paper

Save

Evaluation of Polygenic Prediction Methodology within a Reference-Standardized Framework

Oliver Pain et al.Jul 29, 2020

Abstract Background The predictive utility of polygenic scores is increasing, and many polygenic scoring methods are available, but it is unclear which method performs best. This study evaluates the predictive utility of polygenic scoring methods within a reference-standardized framework, which uses a common set of variants and reference-based estimates of linkage disequilibrium and allele frequencies to construct scores. Methods Eight polygenic score methods were tested: p-value thresholding and clumping (pT+clump), SBLUP, lassosum, LDPred1, LDPred2, PRScs, DBSLMM and SBayesR, evaluating their performance to predict outcomes in UK Biobank and the Twins Early Development Study (TEDS). Strategies to identify optimal p-value threshold and shrinkage parameters were compared, including 10-fold cross validation, pseudovalidation and infinitesimal models (with no validation sample), and multi-polygenic score elastic net models. Results LDPred2, lassosum and PRScs performed strongly using 10-fold cross-validation to identify the most predictive p-value threshold or shrinkage parameter, giving a relative improvement of 16-18% over pT+clump in the correlation between observed and predicted outcome values. Using pseudovalidation, the best methods were PRScs and DBSLMM, with a relative improvement of >10% over other pseudovalidation and infinitesimal methods (lassosum, SBLUP, SBayesR, LDPred1, LDPred2). PRScs pseudovalidation was only 3% worse than the best polygenic score identified by 10-fold cross validation. Elastic net models containing polygenic scores based on a range of parameters consistently improved prediction over any single polygenic score. Conclusion Within a reference-standardized framework, the best polygenic prediction was achieved using LDPred2, lassosum and PRScs, modeling multiple polygenic scores derived using multiple parameters. This study will help researchers performing polygenic score studies to select the most powerful and predictive analysis methods.

Genetics

Artificial Intelligence

75

Paper

Save

Genetic stratification of depression in UK Biobank suggests a subgroup linked to age of natural menopause

David Howard et al.May 5, 2017

Abstract Depression is a common and clinically heterogeneous mental health disorder that is frequently comorbid with other diseases and conditions. Stratification of depression may align sub-diagnoses more closely with their underling aetiology and provide more tractable targets for research and effective treatment. In the current study, we investigated whether genetic data could be used to identify subgroups within people with depression using the UK Biobank. Examination of cross-locus correlations was used to test for evidence of subgroups by examining whether there was clustering of independent genetic variants associated with eleven other complex traits and disorders in people with depression. We found evidence of a subgroup within depression using age of natural menopause variants ( P = 1.69 × 10 −3 ) and this effect remained significant in females ( P = 1.18 × 10 −3 ), but not males ( P = 0.186). However, no evidence for this subgroup ( P > 0.05) was found in Generation Scotland, iPSYCH, a UK Biobank replication cohort or the GERA cohort. In the UK Biobank, having depression was also associated with a later age of menopause (beta = 0.34, standard error = 0.06, P = 9.92 × 10 −8 ). A potential age of natural menopause subgroup within depression and the association between depression and a later age of menopause suggests that they partially share a developmental pathway.

Genetics

Internal Medicine

0

Paper

Save

Examining sex-differentiated genetic effects across neuropsychiatric and behavioral traits

Joanna Martin et al.May 5, 2020

Abstract Background The origin of sex differences in prevalence and presentation of neuropsychiatric and behavioral traits is largely unknown. Given established genetic contributions and correlations across these traits, we tested for a sex-differentiated genetic architecture within and between traits. Methods Using genome-wide association study (GWAS) summary statistics for 20 neuropsychiatric and behavioral traits, we tested for differences in SNP-based heritability (h 2 ) and genetic correlation (r g <1) between sexes. For each trait, we computed z-scores from sex-stratified GWAS regression coefficients and identified genes with sex-differentiated effects. We calculated Pearson correlation coefficients between z-scores for each trait pair, to assess whether specific pairs share variants with sex-differentiated effects. Finally, we tested for sex differences in between-trait genetic correlations. Results With current sample sizes (and power), we found no significant, consistent sex differences in SNP-based h 2 . Between-sex, within-trait genetic correlations were consistently high, although significantly less than 1 for educational attainment and risk-taking behavior. We identified genome-wide significant genes with sex-differentiated effects for eight traits. Several trait pairs shared sex-differentiated effects. The top 0.1% of genes with sex-differentiated effects across traits overlapped with neuron- and synapse-related gene sets. Most between-trait genetic correlation estimates were similar across sex, with several exceptions (e.g. educational attainment & risk-taking behavior). Conclusions Sex differences in the common autosomal genetic architecture of neuropsychiatric and behavioral phenotypes are small and polygenic, requiring large sample sizes. Genes with sex-differentiated effects are enriched for neuron-related gene sets. This work motivates further investigation of genetic, as well as environmental, influences on sex differences.

Genetics

Clinical Psychology

67

Paper

Save

Narrow-sense heritability estimation of complex traits using identity-by-descent information.

Luke Evans et al.Jul 17, 2017

Heritability is a fundamental parameter in genetics. Traditional estimates based on family or twin studies can be biased due to shared environmental or non-additive genetic variance. Alternatively, those based on genotyped or imputed variants typically underestimate narrow-sense heritability contributed by rare or otherwise poorly-tagged causal variants. Identical-by-descent (IBD) segments of the genome share all variants between pairs of chromosomes except new mutations that have arisen since the last common ancestor. Therefore, relating phenotypic similarity to degree of IBD sharing among classically unrelated individuals is an appealing approach to estimating the near full additive genetic variance while avoiding biases that can occur when modeling close relatives. We applied an IBD-based approach (GREML-IBD) to estimate heritability in unrelated individuals using phenotypic simulation with thousands of whole genome sequences across a range of stratification, polygenicity levels, and the minor allele frequencies of causal variants (CVs). IBD-based heritability estimates were unbiased when using unrelated individuals, even for traits with extremely rare CVs, but stratification led to strong biases in IBD-based heritability estimates with poor precision. We used data on two traits in ~120,000 people from the UK Biobank to demonstrate that, depending on the trait and possible confounding environmental effects, GREML-IBD can be applied successfully to very large genetic datasets to infer the contribution of very rare variants lost using other methods. However, we observed apparent biases in this real data that were not predicted from our simulation, suggesting that more work may be required to understand factors that influence IBD-based estimates.

Genetics

Biology

0

Paper

Save

Genomic Influences on Self-Reported Childhood Maltreatment

Shareefa Dalvie et al.Jul 28, 2019

Genetics

Clinical Psychology

0

Paper

Save

Applying polygenic risk scoring for psychiatric disorders to a large family with Bipolar Disorder and Major Depressive Disorder

Simone Jong et al.Jan 27, 2017

We aim to investigate the application of polygenic risk scoring within a family context. Polygenic risk profiles could aid in unraveling the role that common variation confers on disease risk within a pedigree that would have traditionally been viewed through the prism of monogenic inheritance only. We illustrate our discussion by analyzing polygenic risk scores for schizophrenia, major depressive disorder and bipolar disorder in a large pedigree (n~260) in which 30% of family members suffer from major depressive disorder or bipolar disorder. We apply polygenic risk scores to study patterns of assortative mating and anticipation, whereby it appears increased polygenic risk for psychiatric disorders is contributed by affected individuals who married into the family, resulting in an increasing genetic risk over generations in the family. This may explain the observation of anticipation in mood disorders, whereby onset is earlier and the severity of a disease increases over the generations of a family. Joint analyses of both rare and common variation may be the most powerful way to understand the familial genetics of mood and psychiatric disorders.

Genetics

Artificial Intelligence

0

Paper

Genetics

Artificial Intelligence

0

Save

0

Genes associated with neuropsychiatric disease increase vulnerability to abnormal deep grey matter development

Harriet Cullen et al.Jun 12, 2018

Background: Neuropsychiatric disease has polygenic determinants but is often precipitated by environmental pressures, including adverse perinatal events. However, the way in which genetic vulnerability and early-life adversity interact remains obscure. Preterm birth is associated with abnormal brain development and psychiatric disease. We hypothesised that the extreme environmental stress of premature extra-uterine life could contribute to neuroanatomic abnormality in genetically vulnerable individuals. Methods: We combined Magnetic Resonance Imaging (MRI) and genome-wide single nucleotide polymorphism (SNP) data from 194 infants, born before 33 weeks of gestation, to test the prediction that: the characteristic deep grey matter abnormalities seen in preterm infants are associated with polygenic risk for psychiatric illness. Summary statistics from a meta-analysis of SNP data for five psychiatric disorders were used to compute individual polygenic risk scores (PRS). The variance explained by the PRS in the relative volumes of four deep grey matter structures (caudate nucleus, thalamus, subthalamic nucleus and lentiform nucleus) was estimated using linear regression both for the full, mixed-ancestral, cohort and a subsample of European infants. Results: The PRS was negatively associated with: lentiform volume in the full cohort (β=-0.24, p=8x10-4) and the European subsample (β=-0.24, p=8x10-3); and with subthalamic nuclear volume in the full cohort (β=-0.18, p=0.01) and the European subsample (β=-0.26, p=3x10-3). Conclusions: Genetic variants associated with neuropsychiatric disease increase vulnerability to abnormal deep grey matter development and are associated with neuroanatomic changes in the perinatal period. This suggests a mechanism by which perinatal adversity leads to later neuropsychiatric disease in genetically predisposed individuals.

Genetics

Internal Medicine

0

Paper

Save

Effect of cytochrome CYP2C19 metabolizing activity on antidepressant response and side effects: meta-analysis of data from genome-wide association studies

Chiara Fabbri et al.Feb 6, 2018

Cytochrome (CYP) P450 enzymes have a primary role in antidepressant metabolism and variants in these polymorphic genes are targets for pharmacogenetic investigation. This is the first meta-analysis to investigate how CYP2C19 polymorphisms predict citalopram/escitalopram efficacy and side effects. CYP2C19 phenotypes comprise poor metabolizers (PM), intermediate and intermediate+ metabolizers (IM; IM+), extensive and extensive+ metabolizers (EM [wild type]; EM+) and ultra-rapid metabolizers (UM) defined by the two most common CYP2C19 functional polymorphisms (rs4244285 and rs12248560) in Caucasians. These polymorphisms were genotyped or imputed from genome-wide data in four samples treated with citalopram or escitalopram (GENDEP, STAR*D, GenPod, PGRN-AMPS). Treatment efficacy was percentage symptom improvement and remission. Side effect data were available at weeks 2-4, 6 and 9 in three of the investigated samples. A fixed-effects meta-analysis was performed using EM as the reference group. Analysis of 2558 patients for efficacy and 2037 patients for side effects showed that PMs had higher symptom improvement (SMD=0.43, CI=0.19-0.66) and higher remission rates (OR=1.55, CI=1.23-1.96) compared to EMs. At weeks 2-4, PMs showed higher risk of gastro-intestinal (OR=1.26, CI=1.08-1.47), neurological (OR=1.28, CI=1.07-1.53) and sexual side effects (OR=1.52, CI=1.23-1.87; week 6 values similar). No difference was seen at week 9 or in total side effect burden. PMs did not have higher risk of dropout at week 4 compared to EMs. Antidepressant dose was not different among CYP2C19 groups. CYP2C19 polymorphisms may provide helpful information for guiding citalopram/escitalopram treatment, despite PMs are relatively rare among Caucasians (~2%).

Genetics

Pharmacology

0

Paper

Save

COPILOT: a Containerised wOrkflow for Processing ILlumina genOtyping daTa

Hamel Patel et al.Jul 26, 2021

Abstract Background The Illumina genotyping microarrays generate data in image format, which is processed by the platform-specific software GenomeStudio, followed by an array of complex bioinformatics analyses. This process can be time-consuming, lead to reproducibility errors, and be a daunting task for novice bioinformaticians. Results Here we introduce the COPILOT ( C ontainerised w O rkflow for P rocessing IL lumina gen O typing da T a) protocol, which provides an in-depth and clear guide to process raw Illumina genotype data in GenomeStudio, followed by a containerised workflow to automate an array of complex bioinformatics analyses involved in a GWAS quality control (QC). The COPILOT protocol was applied to two independent cohorts consisting of 2791 and 479 samples genotyped on the Infinium Global Screening (GSA) array with Multi-disease (MD) drop-in (~750,000 markers) and the Infinium H3Africa consortium array (~2,200,000 markers) respectively. Following the COPILOT protocol, an average sample quality improvement of 1.24% was observed across sample call rates, with notable improvement for low-quality samples. For example, from the 3270 samples processed, 141 samples had an initial sample call rate below 98%, averaging 96.6% (95% CI 95.6-97.7%), which is considered below the acceptable sample call rate threshold for a typical GWAS analysis. However, following the COPILOT protocol, all 141 samples had a call rate above 98% after QC and averaged 99.6% (95% CI 99.5-99.7%). In addition, the COPILOT pipeline automatically identified potential data issues, including gender discrepancies, heterozygosity outliers, related individuals, and population outliers through ancestry estimation. Conclusions The COPILOT protocol makes processing Illumina genotyping data transparent, effortless and reproducible. The container is deployable on multiple platforms, improves data quality, and the end product is analysis-ready PLINK formatted data, with a comprehensive and interactive summary report to guide the user for further data analyses.

Genetics

Molecular Biology

1

Paper

Genetics

Molecular Biology

0

Save