Genome-wide complex trait analysis

Genome-wide complex trait analysis (GCTA) GREML is a statistical method for variance component estimation in genetics which quantifies the total narrow-sense (additive) contribution to a trait's heritability of a particular subset of genetic variants (typically limited to SNPs with MAF >1%, hence terms such as "chip heritability"/"SNP heritability"). This is done by directly quantifying the chance genetic similarity of unrelated strangers and comparing it to their measured similarity on a trait; if two strangers are relatively similar genetically and also have similar trait measurements, then this indicates that the measured genetic variants causally influence that trait, and the strength of the association indicates how much. This can be seen as plotting prediction error against relatedness.[1] The GCTA framework extends to bivariate genetic correlations between traits;[2] it can also be applied on a per-chromosome basis, with the results compared against chromosome length; and it can be used to examine changes in heritability over aging and development.[3]
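The idea of comparing genetic similarity with trait similarity can be illustrated with a small sketch. The code below is not the GREML algorithm itself: it substitutes Haseman-Elston regression, a simpler moment-based estimator, which regresses the product of two individuals' standardized trait values on their estimated relatedness. The function names, the toy simulation, and its parameters (`n`, `m`, `h2`, `seed`) are assumptions made for illustration and do not come from the GCTA software.

```python
import random
from statistics import mean, pstdev


def standardize(values):
    """Rescale values to mean 0 and (population) standard deviation 1."""
    mu, sd = mean(values), pstdev(values)
    return [(v - mu) / sd for v in values]


def genetic_relationship_matrix(genotypes):
    """Return the n x n relatedness matrix for allele counts in {0, 1, 2}.

    Each SNP is centred by 2p and scaled by sqrt(2p(1-p)), where p is the
    sample allele frequency; SNPs without variation are skipped.
    """
    n, m = len(genotypes), len(genotypes[0])
    scaled = [[] for _ in range(n)]
    for j in range(m):
        p = sum(row[j] for row in genotypes) / (2 * n)
        if p in (0.0, 1.0):
            continue  # monomorphic SNP: carries no relatedness information
        scale = (2 * p * (1 - p)) ** 0.5
        for i in range(n):
            scaled[i].append((genotypes[i][j] - 2 * p) / scale)
    used = len(scaled[0])
    if used == 0:
        raise ValueError("no polymorphic SNPs")
    return [[sum(a * b for a, b in zip(scaled[i], scaled[k])) / used
             for k in range(n)] for i in range(n)]


def haseman_elston(grm, phenotype):
    """Slope of y_i * y_j on relatedness over all pairs i < k.

    The phenotype is assumed to be standardized already; the slope is a
    rough estimate of SNP heritability.
    """
    xs, ys = [], []
    n = len(phenotype)
    for i in range(n):
        for k in range(i + 1, n):
            xs.append(grm[i][k])
            ys.append(phenotype[i] * phenotype[k])
    x_bar, y_bar = mean(xs), mean(ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    return sxy / sxx


def simulate(n, m, h2, seed=0):
    """Toy model: unlinked SNPs and a purely additive trait (hypothetical)."""
    rng = random.Random(seed)
    freqs = [rng.uniform(0.1, 0.5) for _ in range(m)]
    effects = [rng.gauss(0, 1) for _ in range(m)]
    genotypes = [[(rng.random() < p) + (rng.random() < p) for p in freqs]
                 for _ in range(n)]
    genetic = [sum(g * b for g, b in zip(row, effects)) for row in genotypes]
    genetic = [v * h2 ** 0.5 for v in standardize(genetic)]
    noise = [rng.gauss(0, (1 - h2) ** 0.5) for _ in range(n)]
    return genotypes, standardize([g + e for g, e in zip(genetic, noise)])


if __name__ == "__main__":
    genotypes, phenotype = simulate(n=150, m=100, h2=0.5)
    grm = genetic_relationship_matrix(genotypes)
    print("HE-regression estimate:", round(haseman_elston(grm, phenotype), 2))
```

With a sample this small the estimate is very noisy, which is in keeping with the large sample sizes that such analyses need in practice.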

GCTA heritability estimates are useful because they provide a lower bound[4] on the genetic contributions to traits such as intelligence without relying on the assumptions used in twin studies and other family studies and pedigree analyses, thereby corroborating[5][6][7] them, and enabling the design of well-powered genome-wide association studies (GWASes) to find the specific genetic variants. For example, a GCTA estimate of 30% SNP heritability is consistent with a larger total genetic heritability of 70%. However, if the GCTA estimate were ~0%, then that would imply one of three things: a) there is no genetic contribution, b) the genetic contribution is entirely in the form of genetic variants not included, or c) the genetic contribution is entirely in the form of non-additive effects such as epistasis/dominance. Running GCTA on individual chromosomes and regressing the estimates against chromosome length can reveal whether the responsible genetic variants cluster, are distributed evenly across the genome, or are sex-linked. Examining genetic correlations can reveal to what extent observed correlations, such as between intelligence and socioeconomic status, are due to shared genetic influences, and in the case of diseases, can indicate shared causal pathways such as the overlap of schizophrenia with other mental diseases and intelligence-reducing variants.

History

Estimating variance components such as heritability, shared-environment, and maternal effects in biology and animal breeding with standard ANOVA/REML methods typically requires individuals of known relatedness, such as parent/child pairs. Such pedigree data is often unavailable or unreliable, which either prevents the methods from being applied or requires strict laboratory control of all breeding (which threatens the external validity of all estimates). Several authors noted that relatedness could instead be measured directly from genetic markers (and that, if individuals were reasonably related, economically few markers would have to be obtained for statistical power). This led Kermit Ritland to propose in 1996 that directly measured pairwise relatedness could be compared to pairwise phenotype measurements (Ritland 1996, "A Marker-based Method for Inferences About Quantitative Inheritance in Natural Populations"[8]) in order to estimate variance components such as heritability or genetic correlations.[9] The approach was subsequently applied to plants and animals.[10][11][12][13][14][15][16]

As genome sequencing costs dropped steeply over the 2000s, acquiring enough markers on enough subjects for reliable estimates using very distantly related individuals became possible. An early application of the method to humans came with Visscher et al. 2006[17]/2007,[18] which used SNP markers to estimate the actual relatedness of siblings and estimate heritability from the direct genetics. In humans, unlike the original animal/plant applications, relatedness is usually known with high confidence in the 'wild population', and the benefit of GCTA lies more in avoiding the assumptions of classic behavioral genetics designs, verifying their results, and partitioning heritability by SNP class and chromosome. The first use of GCTA proper in humans was published in 2010, finding that 45% of the variance in human height can be explained by the included SNPs.[19][20] (Large GWASes on height have since confirmed the estimate.[21]) The GCTA algorithm was then described and a software implementation published in 2011.[22] It has since been used to study a wide variety of biological, medical, psychiatric, and psychological traits in humans, and has inspired many variant approaches.

Benefits

Robust heritability

Twin and family studies have long been used to estimate variance explained by particular categories of genetic and environmental causes. Across a wide variety of human traits studied, there is typically minimal shared-environment influence, considerable non-shared environment influence, and a large genetic component (mostly additive), which is on average ~50% and sometimes much higher for some traits such as height or intelligence.[23] However, the twin and family studies have been criticized for their reliance on a number of assumptions that are difficult or impossible to verify, such as the equal environments assumption (that the environments of monozygotic and dizygotic twins are equally similar), that there is no misclassification of zygosity (mistaking identical for fraternal & vice versa), that twins are representative of the general population, and that there is no assortative mating. Violations of these assumptions can result in both upwards and downwards bias of the parameter estimates.[24] (This debate & criticism have particularly focused on the heritability of IQ.)

The use of SNP or whole-genome data from unrelated participants (with participants who are too closely related, typically >0.025 or ~fourth-cousin levels of similarity, being removed, and several principal components included in the regression to control for population stratification) bypasses many criticisms of heritability estimates: twins are often entirely uninvolved, there are no questions of equal treatment, relatedness is estimated precisely, and the samples are drawn from a broad variety of subjects.
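The removal of overly related participants can be sketched as a greedy pruning of the relatedness matrix. This is only an illustration under stated assumptions: the function name `prune_related` and the tie-breaking rule are hypothetical, and the GCTA software may select which member of a related pair to drop differently.

```python
def prune_related(grm, cutoff=0.025):
    """Greedily drop individuals until no kept pair has relatedness > cutoff.

    grm is a square matrix (list of lists) of pairwise relatedness.
    Returns the sorted indices of the individuals that are kept.
    """
    keep = set(range(len(grm)))
    if not keep:
        return []
    while True:
        # For each kept individual, count kept partners above the cutoff.
        counts = {i: sum(1 for k in keep if k != i and grm[i][k] > cutoff)
                  for i in keep}
        # Drop the individual with the most such partners;
        # ties are broken by the highest index so the result is deterministic.
        worst = max(counts, key=lambda i: (counts[i], i))
        if counts[worst] == 0:
            return sorted(keep)
        keep.remove(worst)


# Hypothetical example: individuals 0 and 1 are siblings, 1 and 2 are
# distant cousins, and individual 3 is unrelated to everyone.
example = [
    [1.00, 0.50, 0.00, 0.00],
    [0.50, 1.00, 0.03, 0.00],
    [0.00, 0.03, 1.00, 0.00],
    [0.00, 0.00, 0.00, 1.00],
]
print(prune_related(example))
```

Dropping the individual involved in the most violations first tends to retain more of the sample than removing one member of each related pair in arbitrary order.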

In addition to being more robust to violations of the twin study assumptions, SNP data can be easier to collect since it does not require rare twins; as a result, heritability can also be estimated for rare traits (with due correction for ascertainment bias).

GWAS power

GCTA estimates can be used to resolve the missing heritability problem and design GWASes which will yield genome-wide statistically-significant hits. This is done by comparing the GCTA estimate with the results of smaller GWASes. If a GWAS of n=10k using SNP data fails to turn up any hits, but the GCTA indicates a high heritability accounted for by SNPs, then that implies that there are a large number of polygenic variants and thus that much larger GWASes will be required to accurately estimate each SNP's effects and directly account for a fraction of the GCTA heritability.
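A rough power calculation shows why a highly polygenic trait can yield no hits at n=10k despite substantial SNP heritability. The sketch below assumes, purely for illustration, that the SNP heritability is split equally among a hypothetical number of causal variants and uses a two-sided normal approximation at the conventional genome-wide threshold of 5e-8; real effect-size distributions are not uniform.

```python
from statistics import NormalDist

GENOME_WIDE_ALPHA = 5e-8  # conventional genome-wide significance threshold


def power_per_variant(n, snp_h2, n_causal, alpha=GENOME_WIDE_ALPHA):
    """Approximate power to detect one causal SNP in a GWAS of size n.

    Assumes every causal SNP explains the same share of trait variance,
    q2 = snp_h2 / n_causal (an illustrative simplification).
    """
    q2 = snp_h2 / n_causal
    ncp = n * q2 / (1 - q2)  # non-centrality of the 1-df association test
    std_normal = NormalDist()
    z_crit = std_normal.inv_cdf(1 - alpha / 2)
    shift = ncp ** 0.5
    return std_normal.cdf(shift - z_crit) + std_normal.cdf(-shift - z_crit)


def expected_hits(n, snp_h2, n_causal):
    """Expected number of genome-wide significant causal SNPs."""
    return n_causal * power_per_variant(n, snp_h2, n_causal)


# Hypothetical trait: SNP heritability 0.3 spread over 10,000 causal variants.
for sample_size in (10_000, 100_000, 1_000_000):
    print(sample_size, expected_hits(sample_size, 0.3, 10_000))
```

Under these assumptions the expected number of hits is close to zero at n=10k and only becomes substantial at far larger sample sizes, which is the pattern described above.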

Disadvantages

  1. Limited inference: GCTA estimates are inherently limited in that they cannot estimate broad-sense heritability like twin/family studies. Hence, while they serve as a critical check on the unbiasedness of the twin/family studies, GCTAs cannot replace them for estimating total genetic contributions to a trait.
  2. Substantial data requirements: the number of SNPs sequenced per person should be in the thousands and ideally the hundreds of thousands for reasonable estimates of genetic similarity (although this is no longer such an issue for current commercial chips which default to hundreds of thousands or millions of markers); and the number of persons, for somewhat stable estimates of plausible SNP heritability, should be at least n>1000 and ideally n>10000.[25] In contrast, twin studies can offer precise estimates with a fraction of the sample size.
  3. Computational inefficiency: The original GCTA implementation scales poorly with increasing data size, so even if enough data is available for precise GCTA estimates, the computational burden may be infeasible. GCTA can be meta-analyzed as a standard precision-weighted fixed-effect meta-analysis,[26] so research groups sometimes estimate cohorts or subsets and then pool them meta-analytically (at the cost of additional complexity and some loss of precision). This has motivated the creation of faster implementations and variant algorithms which make different assumptions, such as using moment matching.[27]
  4. Need for raw data: GCTA requires genetic similarity of all subjects and thus their raw genetic information; due to privacy concerns, individual patient data is rarely shared. GCTA cannot be run on the summary statistics reported publicly by many GWAS projects, and if pooling multiple GCTA estimates, meta-analysis must be done.
    In contrast, there are alternative techniques which operate on summaries reported by GWASes without requiring the raw data[28] e.g. "LD score regression"[29] contrasts linkage disequilibrium statistics (available from public datasets like 1000 Genomes) with the public summary effect-sizes to infer heritability and estimate genetic correlations/overlaps of multiple traits. The Broad Institute runs LD Hub which provides a public web interface to >=177 traits with LD score regression.[30] Another method using summary data is HESS.[31]
  5. Unreliable confidence intervals: confidence intervals may be incorrect, may extend outside the 0-1 range of heritability, and can be highly imprecise because they rely on asymptotic approximations.[32]
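The precision-weighted fixed-effect pooling mentioned in item 3, and the out-of-range intervals mentioned in item 5, can be sketched as follows. This is a generic inverse-variance meta-analysis with a simple Wald interval, not code from any GCTA release; the function names and the example numbers are hypothetical.

```python
def fixed_effect_meta(estimates, standard_errors):
    """Pool per-cohort estimates with inverse-variance (precision) weights.

    Returns (pooled_estimate, pooled_standard_error).
    """
    weights = [1 / se ** 2 for se in standard_errors]
    total = sum(weights)
    pooled = sum(w * est for w, est in zip(weights, estimates)) / total
    return pooled, (1 / total) ** 0.5


def wald_interval(estimate, standard_error, z=1.96):
    """Symmetric ~95% interval; nothing constrains it to lie within [0, 1]."""
    return estimate - z * standard_error, estimate + z * standard_error


# Hypothetical SNP-heritability estimates from two cohorts.
pooled, pooled_se = fixed_effect_meta([0.2, 0.6], [0.1, 0.2])
print(pooled, pooled_se)

# A small estimate with a large standard error gives a negative lower bound.
print(wald_interval(0.10, 0.08))
```

The more precise cohort dominates the pooled estimate, and the second call shows how a symmetric interval around a small estimate can fall below zero.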

Interpretation

GCTA estimates are often misinterpreted as "the total genetic contribution"; since they are often much lower than the twin study estimates, it is then presumed that the twin studies are biased and that the genetic contribution to a particular trait is minor.[33] This is incorrect, as GCTA estimates are lower bounds.

A more correct interpretation would be that: GCTA estimates are the expected amount of variance that could be predicted by an indefinitely large GWAS using a simple additive linear model (without any interactions or higher-order effects) in a particular population at a particular time given the limited selection of SNPs and a trait measured with a particular amount of precision. Hence, there are many ways to exceed GCTA estimates:

  1. SNP genotyping data is typically limited to 200k-1m of the most common or scientifically interesting SNPs, though 150 million+ have been documented by genome sequencing;[34] as SNP prices drop and arrays become more comprehensive or whole-genome sequencing replaces SNP genotyping entirely, the expected narrow-sense heritability will increase as more genetic variants are included in the analysis. The selection can also be expanded considerably using haplotypes[35] and imputation (SNPs can proxy for unobserved genetic variants which they tend to be inherited with); e.g. Yang et al. 2015[36] finds that with more aggressive use of imputation to infer unobserved variants, the height GCTA estimate expands to 56% from 45%, and Hill et al. 2017 finds that expanding GCTA to cover rarer variants raises the intelligence estimates from ~30% to ~53% and explains all the heritability in their sample;[37] for 4 traits in the UK Biobank, imputing raised the SNP heritability estimates.[38] Additional genetic variants include de novo mutations/mutation load & structural variations such as copy-number variations.
  2. Narrow-sense heritability estimates assume simple additivity of effects, ignoring interactions. As some trait values will be due to these more complicated effects, the total genetic effect will exceed that of the subset measured by GCTA, and as the additive SNPs are found and measured, it will become possible to find interactions as well using more sophisticated statistical models.
  3. All correlation & heritability estimates are biased downwards towards zero by the presence of measurement error; the need to adjust for this leads to techniques such as Spearman's correction for measurement error, as the underestimate can be quite severe for traits where large-scale and accurate measurement is difficult and expensive,[39] such as intelligence. For example, an intelligence GCTA estimate of 0.31, based on an intelligence measurement with test-retest reliability r ≈ 0.65, would after correction (0.31 / 0.65) be a true estimate of ~0.48, indicating that common SNPs alone explain about half of the variance. Hence, a GWAS with a better measurement of intelligence can expect to find more intelligence hits than indicated by a GCTA based on a noisier measurement.
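The measurement-error correction in item 3 amounts to dividing the observed estimate by the reliability of the trait measurement, on the assumption that measurement error is unrelated to genotype and only inflates the phenotypic variance. A minimal sketch follows; the function name is hypothetical, and the reliability of 0.65 is an assumed value chosen because it maps an observed 0.31 to roughly 0.48.

```python
def correct_for_unreliability(h2_observed, reliability):
    """Disattenuate a heritability estimate for phenotype measurement error.

    reliability is the share of observed trait variance that is not
    measurement error (for example, a test-retest reliability).
    """
    if not 0 < reliability <= 1:
        raise ValueError("reliability must be in (0, 1]")
    return h2_observed / reliability


# Assumed reliability of 0.65 for the intelligence measure.
print(round(correct_for_unreliability(0.31, 0.65), 2))
```

Because heritability is a ratio of variances, the observed value is divided by the reliability itself rather than by its square root, as would be done when correcting a correlation for error in one variable.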

Implementations

GCTA
Original author(s): Jian Yang
Initial release: 30 August 2010
Stable release: 1.25.2 / 22 December 2015
Development status: Maintained
Written in: C++
Operating system: Linux (Mac/Windows support dropped at v1.02)
Available in: English
Type: genetics
License: GPL v3
Website: cnsgenomics.com/software/gcta/; forums: gcta.freeforums.net
(As of 22 May 2016)

The original "GCTA" software package is the most widely used; its primary function is GREML estimation of SNP heritability, but it also includes other functionality:

  • Estimate the genetic relationship from genome-wide SNPs;
  • Estimate the inbreeding coefficient from genome-wide SNPs;
  • Estimate the variance explained by all the autosomal SNPs;
  • Partition the genetic variance onto individual chromosomes;
  • Estimate the genetic variance associated with the X-chromosome;
  • Test the effect of dosage compensation on genetic variance on the X-chromosome;
  • Predict the genome-wide additive genetic effects for individual subjects and for individual SNPs;
  • Estimate the LD structure encompassing a list of target SNPs;
  • Simulate GWAS data based upon the observed genotype data;
  • Convert Illumina raw genotype data into PLINK format;
  • Perform conditional & joint analysis of GWAS summary statistics without individual-level genotype data;
  • Estimate the genetic correlation between two traits (diseases) using SNP data;
  • Perform mixed linear model association analysis.

Other implementations and variant algorithms include:

Traits

GCTA analyses frequently yield estimates of 0.1-0.5, consistent with broad-sense heritability estimates (with the exception of personality traits, for which theory & current GWAS results suggest non-additive genetics driven by frequency-dependent selection[54][55]). Traits univariate GCTA has been used on (excluding SNP heritability estimates computed using other algorithms such as LD score regression, and bivariate GCTAs which are listed in genetic correlation) include (point-estimate format: "estimate (standard error)"):

Human

Anthropometric

Social/behavioral

Psychological

Psychiatric

Drug use

Disease

Biological

Neanderthal admixture

Neanderthal admixture as a risk factor for:[197]

Animal/plant


See also

References

  1. Figure 3 of Yang et al 2010, or Figure 3 of Ritland & Ritland 1996
  2. Lee et al 2012, "Estimation of pleiotropy between complex diseases using single-nucleotide polymorphism-derived genomic relationships and restricted maximum likelihood"
  3. "Genetic contributions to stability and change in intelligence from childhood to old age", Deary et al 2012
  4. "A common misconception about SNP-chip heritability estimates calculated with GCTA and LDSC is that they should be similar to twin study estimates, when in reality twin studies have the advantage of capturing all genetic effects—common, rare and those not genotyped by available methods. Thus, the assumption should be that h2SNP < h2TWIN when using GCTA and LDSC, and this is what we observe for PTSD, as has been observed for many other phenotypes.54" --Duncan et al 2017
  5. Eric Turkheimer ("Still Missing", Turkheimer 2011) discusses the GCTA results in the context of the twin study debate: "Of the three reservations about quantitative genetic heritability that were outlined at the outset—the assumptions of twin and family studies, the universality of heritability, and the absence of mechanism—the new paradigm has put the first to rest, and before continuing to explain my skepticism about whether the most important problems have been solved, it is worth appreciating what a significant accomplishment this is. Thanks to the Visscher program of research, it should now be impossible to argue that the whole body of quantitative genetic research showing the universal importance of genes for human development was somehow based on a sanguine view of the equal environments assumption in twin studies, putting an end to an entire misguided school of thought among traditional opponents of classical quantitative (and by association behavioral) genetics (e.g., Joseph, 2010; Kamin & Goldberger, 2002)"; see also Turkheimer, Harden, & Nisbett: "These methods have given scientists a new way to compute heritability: Studies that measure DNA sequence variation directly have shown that pairs of people who are not relatives, but who are slightly more similar genetically, also have more similar IQs than other pairs of people who happen to be more different genetically. These “DNA-based” heritability studies don’t tell you much more than the classical twin studies did, but they put to bed many of the lingering suspicions that twin studies were fundamentally flawed in some way. Like the validity of intelligence testing, the heritability of intelligence is no longer scientifically contentious."
  6. "This finding of strong genome-wide pleiotropy across diverse cognitive and learning abilities, indexed by general intelligence, is a major finding about the origins of individual differences in intelligence. Nonetheless, this finding seems to have had little impact in related fields such as cognitive neuroscience or experimental cognitive psychology. We suggest that part of the reason for this neglect is that these fields generally ignore individual differences.65,66 Another reason might be that the evidence for this finding rested largely on the twin design, for which there have always been concerns about some of its assumptions;6 we judge that this will change now that GCTA is beginning to confirm the twin results." --"Genetics and intelligence differences: five special findings", Plomin & Deary 2015
  7. "Top 10 Replicated Findings From Behavioral Genetics", Plomin et al 2016: "This research has primarily relied on the twin design in which the resemblance of identical and fraternal twins is compared and the adoption design in which the resemblance of relatives separated by adoption is compared. Although the twin and adoption designs have been criticized separately (Plomin et al., 2013), these two designs generally converge on the same conclusion despite being based on very different assumptions, which adds strength to these conclusions...GCTA underestimates genetic influence for several reasons and requires samples of several thousand individuals to reveal the tiny signal of chance genetic similarity from the noise of DNA differences across the genome (Vinkhuyzen, Wray, Yang, Goddard, & Visscher, 2013). Nonetheless, GCTA has consistently yielded evidence for significant genetic influence for cognitive abilities (Benyamin et al., 2014; Davies et al., 2015; St. Pourcain et al., 2014), psychopathology (L. K. Davis et al., 2013; Gaugler et al., 2014; Klei et al., 2012; Lubke et al., 2012, 2014; McGue et al., 2013; Ripke et al., 2013; Wray et al., 2014), personality (C. A. Rietveld, Cesarini, et al., 2013; Verweij et al., 2012; Vinkhuyzen et al., 2012), and substance use or drug dependence (Palmer et al., 2015; Vrieze, McGue, Miller, Hicks, & Iacono, 2013), thus supporting the results of twin and adoption studies."
  8. see also Ritland 1996b, "Estimators for pairwise relatedness and individual inbreeding coefficients"; Ritland & Ritland 1996, "Inferences about quantitative inheritance based on natural population structure in the yellow monkeyflower, Mimulus guttatus"; Lynch & Ritland 1999, "Estimation of Pairwise Relatedness With Molecular Markers"; Ritland 2000, "Marker-inferred relatedness as a tool for detecting heritability in nature"; Thomas 2005, "The estimation of genetic relationships using molecular markers and their efficiency in estimating heritability in natural populations"
  9. pg800-803, ch27 "REML Estimation of Genetic Variances", Genetics and Analysis of Quantitative Traits, Lynch & Walsh 1998; ISBN 0878934812
  10. Mousseau et al 1998, "A novel method for estimating heritability using molecular markers"
  11. Thomas et al 2002, "The use of marker-based relationship information to estimate the heritability of body weight in a natural population: a cautionary tale"
  12. Wilson et al 2003, "Marker-assisted estimation of quantitative genetic parameters in rainbow trout Oncorhynchus mykiss"
  13. Klaper et al 2001, "Heritability of Phenolics in Quercus laevis Inferred Using Molecular Markers"
  14. van Kleunen & Ritland 2004, "Predicting evolution of floral traits associated with mating system in a natural plant population"
  15. van Kleunen & Ritland 2005, "Estimating Heritabilities and Genetic Correlations with Marker-Based Methods: An Experimental Test in Mimulus guttatus"
  16. Shikano 2005, "Marker-based estimation of heritability for body color variation in Japanese flounder Paralichthys olivaceus"
  17. Visscher et al 2006, "Assumption-free estimation of heritability from genome-wide identity-by-descent sharing between full siblings"
  18. Visscher et al 2007, "Genome partitioning of genetic variation for height from 11,214 sibling pairs"
  19. "Common SNPs explain a large proportion of heritability for human height", Yang et al 2010
  20. "A Commentary on ‘Common SNPs Explain a Large Proportion of the Heritability for Human Height’ by Yang et al. (2010)", Visscher et al 2010
  21. "Defining the role of common variation in the genomic and biological architecture of adult human height", Wood et al 2014
  22. "GCTA: A Tool for Genome-wide Complex Trait Analysis", Yang et al 2011
  23. "Meta-analysis of the heritability of human traits based on fifty years of twin studies", Polderman et al 2015
  24. Barnes, J. C.; Wright, John Paul; Boutwell, Brian B.; Schwartz, Joseph A.; Connolly, Eric J.; Nedelec, Joseph L.; Beaver, Kevin M. (2014-11-01). "Demonstrating the Validity of Twin Research in Criminology" (PDF). Criminology. 52 (4): 588–626. ISSN 1745-9125. doi:10.1111/1745-9125.12049.
  25. "GCTA will eventually provide direct DNA tests of quantitative genetic results based on twin and adoption studies. One problem is that many thousands of individuals are required to provide reliable estimates. Another problem is that more SNPs are needed than even the million SNPs genotyped on current SNP microarrays because there is much DNA variation not captured by these SNPs. As a result, GCTA cannot estimate all heritability, perhaps only about half of the heritability. Indeed, the first reports of GCTA analyses estimate heritability to be about half the heritability estimates from twin and adoption studies for height (Lee, Wray, Goddard, & Visscher, 2011; Yang et al., 2010; Yang, Manolio, et al., 2011), and intelligence (Davies et al., 2011)." pg110, Behavioral Genetics, Plomin et al 2012
  26. "Meta-analysis of GREML results from multiple cohorts", Yang 2015
  27. "Phenome-wide Heritability Analysis of the UK Biobank", Ge et al 2016
  28. Pasaniuc & Price 2016, "Dissecting the genetics of complex traits using summary association statistics"
  29. "LD Score Regression Distinguishes Confounding from Polygenicity in Genome-Wide Association Studies", Bulik-Sullivan et al 2015
  30. "LD Hub: a centralized database and web interface to LD score regression that maximizes the potential of summary level GWAS data for SNP heritability and genetic correlation analysis", Zheng et al 2016
  31. "Contrasting the genetic architecture of 30 complex traits from summary association data", Shi et al 2016
  32. "Fast and Accurate Construction of Confidence Intervals for Heritability", Schweiger et al 2016
  33. "Still Chasing Ghosts: A New Genetic Methodology Will Not Find the 'Missing Heritability'", Charney 2013
  34. "Deep Sequencing of 10,000 Human Genomes", Telenti 2015
  35. "Haplotypes of common SNPs can explain missing heritability of complex diseases", Bhatia et al 2015
  36. "Genetic variance estimation with imputed variants finds negligible missing heritability for human height and body mass index", Yang et al 2015
  37. Hill et al 2017, "Genomic analysis of family data reveals additional genetic effects on intelligence and personality"
  38. Evans et al 2017, "Comparison of methods that use whole genome data to estimate the heritability and genetic architecture of complex traits"
  39. Methods of Meta-Analysis: Correcting Error and Bias in Research Findings, Hunter & Schmidt 2004
  40. "Fast linear mixed models for genome-wide association studies", Lippert 2011
  41. "Improved linear mixed models for genome-wide association studies", Listgarten et al 2012
  42. "Advantages and pitfalls in the application of mixed-model association methods", Yang et al 2014
  43. "A lasso multi-marker mixed model for association mapping with population structure correction", Rakitsch et al 2012
  44. "Genome-wide efficient mixed-model analysis for association studies", Zhou & Stephens 2012
  45. "Variance component model to account for sample structure in genome-wide association studies", Kang et al 2012
  46. "Advanced Complex Trait Analysis", Gray et al 2012
  47. "Regional Heritability Advanced Complex Trait Analysis for GPU and Traditional Parallel Architecture", Cebamanos et al 2012
  48. "Efficient Bayesian mixed model analysis increases association power in large cohorts", Loh et al 2012
  49. "Contrasting genetic architectures of schizophrenia and other complex diseases using fast variance-components analysis", Loh et al 2015; see also "Contrasting regional architectures of schizophrenia and other complex diseases using fast variance components analysis", Loh et al 2015
  50. "Mixed Models for Meta-Analysis and Sequencing", Bulik-Sullivan 2015
  51. "Massively expedited genome-wide heritability analysis (MEGHA)", Ge et al 2015
  52. Speed et al 2016, "Re-evaluation of SNP heritability in complex human traits"
  53. Evans et al 2017, "Narrow-sense heritability estimation of complex traits using identity-by-descent information."
  54. "Maintenance of genetic variation in human personality: Testing evolutionary models by estimating heritability due to common causal variants and investigating the effect of distant inbreeding", Verweij et al 2012
  55. "The Evolutionary Genetics of Personality", Penke et al 2007; "The Evolutionary Genetics of Personality Revisited", Penke & Jokela 2016
  56. "Genome partitioning of genetic variation for complex traits using common SNPs", Yang et al 2011
  57. "Estimating the genetic variance of major depressive disorder due to all single nucleotide polymorphisms", Lubke et al 2012
  58. "Inference of the Genetic Architecture Underlying BMI and Height with the Use of 20,240 Sibling Pairs", Hemani et al 2013
  59. "Coordinated Genetic Scaling of the Human Eye: Shared Determination of Axial Eye Length and Corneal Curvature", Guggenheim et al 2013
  60. "First genome-wide association study on anxiety-related behaviours in childhood", Trzaskowski et al 2013
  61. "Testing the key assumption of heritability estimates based on genome-wide genetic relatedness", Conley et al 2014
  62. "Common DNA Markers Can Account for More Than Half of the Genetic Influence on Cognitive Abilities", Plomin et al 2013
  63. "Improved heritability estimation from genome-wide SNPs", Speed et al 2012
  64. "Genome-Wide Estimates of Heritability for Social Demographic Outcomes", Domingue et al 2016
  65. "Dominant Genetic Variation and Missing Heritability for Human Complex Traits: Insights from Twin versus Genome-wide Common SNP Models", Chen et al 2015
  66. "Using Extended Genealogy to Estimate Components of Heritability for 23 Quantitative and Dichotomous Traits", Zaitlen et al 2013
  67. "Genomic architecture of human neuroanatomical diversity", Toro et al 2014 (supplement)
  68. Figure 4, "Like Mother, Like Daughter: Analysis of Parent-Child Phenotypic Correlations for Hundreds of Phenotypic Traits", Pierson et al 2014
  69. "Application of linear mixed models to study genetic stability of height and body mass index across countries and time", Trzaskowski et al 2016
  70. Zaidi et al 2017, "Investigating the case of human nose shape and climate adaptation"
  71. "Heritability and genetic correlations explained by common SNPs for metabolic syndrome traits", Vattikuti et al 2012
  72. "Using Genome Wide Estimates of Heritability to Examine the Relevance of Gene-Environment Interplay", Domingue & Boardman 2013
  73. "What can genes tell us about the relationship between education and health?", Boardman et al 2015
  74. "Finding the missing heritability in pediatric obesity: the contribution of genome-wide complex trait analysis", Llewellyn et al 2013
  75. Willems et al 2017, "Large-scale GWAS identifies multiple loci for hand grip strength providing biological insights into muscular fitness"
  76. Warrington et al 2017, "Maternal and fetal genetic contribution to gestational weight gain"
  77. "Large-scale genotyping identifies a new locus at 22q13.2 associated with female breast size", Li et al 2013
  78. "Molecular genetic contributions to self-rated health", Harris et al 2016
  79. "Heritability and Genome-Wide Association Studies for Hair Color in a Dutch Twin Family Based Sample", Lin et al 2015
  80. "Genetic Prediction of Male Pattern Baldness", Hagenaars et al 2016
  81. Cole et al 2017, "Human Facial Shape and Size Heritability and Genetic Correlations"
  82. Duffy et al 2017, "Novel pleiotropic risk loci for melanoma and nevus density implicate multiple biological pathways"
  83. "Human Fertility, Molecular Genetics, and Natural Selection in Modern Societies", Tropf et al 2015
  84. "Mega-analysis of 31,396 individuals from 6 countries uncovers strong gene-environment interaction for human fertility", Tropf et al 2016
  85. "Assortative mating and differential fertility by phenotype and genotype across the 20th century", Conley et al 2016 (supplement)
  86. "A Genetic Variant Near Olfactory Receptor Genes Associates With Cilantro Preference", Wu et al 2012
  87. "GWAS of 126,559 Individuals Identifies Genetic Variants Associated with Educational Attainment", Rietveld et al 2013
  88. "Genome-wide association study of cognitive functions and educational attainment in UK Biobank (n=112151)", Davies et al 2016
  89. "The genetic architecture of economic and political preferences", Benjamin 2012
  90. "Molecular genetic contributions to socioeconomic status and intelligence", Marioni et al 2014
  91. "Genetic link between family socioeconomic status and children's educational achievement estimated from genome-wide SNPs", Krapohl & Plomin 2016
  92. Davis et al 2014, "The correlation between reading and mathematics ability at age twelve has a substantial genetic component"
  93. "Genetic influence on family socioeconomic status and children's intelligence", Trzaskowski et al 2014b
  94. "Molecular genetic contributions to social deprivation and household income in UK Biobank (n=112,151)", Hill et al 2016
  95. "Assessing causality in the association between child adiposity and physical activity levels: A Mendelian randomization analysis", Richmond et al 2014
  96. Sanchez-Roige et al 2017, "Genetics of the Research Domain Criteria (RDoC): genome-wide association study of delay discounting"
  97. "Genetic contributions to self-reported tiredness", Deary et al 2016
  98. "Genome-wide association analysis identifies novel loci for chronotype in 100,420 individuals from the UKBiobank", Lane et al 2016 (supplement)
  99. "Unraveling the genetic etiology of adult antisocial behavior: A genome-wide association study", Tielbeek et al 2012
  100. "Exploring the Genetic Etiology of Trust in Adolescents: Combined Twin and DNA Analyses", Wootton et al 2016
  101. "Genome-Wide Association Study of Loneliness Demonstrates a Role for Common Variation", Gao et al 2016
  102. "A genome-wide association study of behavioral disinhibition", McGue et al 2013
  103. "Three mutually informative ways to understand the genetic relationships among behavioral disinhibition, alcohol use, drug use, nicotine use/dependence, and their co-occurrence: Twin biometry, GCTA, and genome-wide scoring", Vrieze et al 2013
  104. "Estimating the heritability of reporting stressful life events captured by common genetic variants", Power et al 2013
  105. "Mapping the Genetic Variation of Regional Brain Volumes as Explained by All Common SNPs from the ADNI Study", Bryant et al 2013
  106. "Heritability of Neuroanatomical Shape", Ge et al 2015
  107. "Partitioning heritability analysis reveals a shared genetic basis of brain anatomy and schizophrenia", Lee et al 2016
  108. "Genome-wide association studies establish that human intelligence is highly heritable and polygenic", Davies et al 2011
  109. "Most Reported Genetic Associations with General Intelligence Are Probably False Positives", Chabris et al 2012
  110. "Intelligence indexes generalist genes for cognitive abilities", Trzaskowski et al 2013
  111. "DNA Evidence for Strong Genome-Wide Pleiotropy of Cognitive and Learning Abilities", Trzaskowski et al 2013b
  112. "Results of a 'GWAS Plus': General Cognitive Ability Is Substantially Heritable and Massively Polygenic", Kirkpatrick et al 2014
  113. "DNA evidence for strong genetic stability and increasing heritability of intelligence from age 7 to 12", Trzaskowski et al 2014a
  114. "Childhood intelligence is heritable, highly polygenic and associated with FNBP1L", Benyamin et al 2014
  115. "Genetic contributions to variation in general cognitive function: a meta-analysis of genome-wide association studies in the CHARGE consortium (n=53949)", Davies et al 2015
  116. "A genome-wide analysis of putative functional and exonic variation associated with extremely high intelligence", Spain et al 2015
  117. "Epigenetic age of the pre-frontal cortex is associated with neuritic plaques, amyloid load, and Alzheimer's disease related cognitive functioning", Levine et al 2015
  118. "The genetic architecture of pediatric cognitive abilities in the Philadelphia Neurodevelopmental Cohort", Robinson et al 2015
  119. "A genome-wide association study for extremely high intelligence", Zabaneh et al 2017
  120. "Fine mapping genetic associations between the HLA region and extremely high intelligence", Zabaneh et al 2017
  121. "Word Reading Fluency: Role of Genome-Wide Single-Nucleotide Polymorphisms in Developmental Stability and Correlations With Print Exposure", Harlaar et al 2014
  122. "Genetic contributions to trail making test performance in UK Biobank", Hagenaars et al 2017
  123. "Why do we differ in number sense? Evidence from a genetically sensitive investigation", Tosto et al 2013
  124. "Molecular genetics and subjective well-being", Rietveld et al 2013
  125. "Personality Polygenes, Positive Affect, and Life Satisfaction", Weiss et al 2016
  126. "Global Genetic Variations Predict Brain Response to Faces", Dickie et al 2014
  127. "Common SNPs explain some of the variation in the personality dimensions of neuroticism and extraversion", Vinkhuyzen et al 2012
  128. "Meta-analysis of genome-wide association studies for neuroticism, and the polygenic association with major depressive disorder", De Moor et al 2015
  129. "Heritability estimates of the Big Five personality traits based on common genetic variants", Power & Pluess 2015
  130. "Genome-wide analysis of over 106 000 individuals identifies 9 neuroticism-associated loci", Smith et al 2016
  131. "Genetic contribution to two factors of neuroticism is associated with affluence, better health, and longer life", Hill et al 2017
  132. "Meta-analysis of Genome-Wide Association Studies for Extraversion: Findings from the Genetics of Personality Consortium", van den Berg et al 2015
  133. "Genetic risk variants for social anxiety", Stein et al 2017
  134. "Knowns and unknowns for psychophysiological endophenotypes: Integration and response to commentaries", Iacono et al 2014
  135. "No genetic influence for childhood behavior problems from DNA analysis", Trzaskowski et al 2013
  136. "Genetics of Callous-Unemotional Behavior in Children", Viding et al 2013
  137. "Single Nucleotide Polymorphism Heritability of a General Psychopathology Factor in Children", Neumann et al 2016
  138. "Describing the genetic architecture of epilepsy through heritability analysis", Speed et al 2014
  139. "Genetic relationship between five psychiatric disorders estimated from genome-wide SNPs", Lee et al 2013
  140. "Genome-Wide Meta-Analyses Of Stratified Depression In Generation Scotland And UK Biobank", Hall et al 2017
  141. "Familiality and SNP heritability of age at onset and episodicity in major depressive disorder", Ferentinos et al 2015
  142. "Contribution of Common Genetic Variants to Antidepressant Response", Tansey et al 2013
  143. "Estimating the proportion of variation in susceptibility to schizophrenia captured by common SNPs", Lee et al 2012
  144. "Genome-wide association analysis identifies 13 new risk loci for schizophrenia", Ripke et al 2013
  145. "Genome-Wide Association Study of Schizophrenia in Ashkenazi Jews", Goes et al 2015
  146. "Additive Genetic Variation in Schizophrenia Risk Is Shared by Populations of African and European Descent", Candia et al 2013
  147. "Partitioning heritability of regulatory and cell-type-specific variants across 11 common diseases", Gusev et al 2014a; see also "Regulatory variants explain much more heritability than coding variants across 11 common diseases", Gusev et al 2014b
  148. "Genome-wide association study of 40,000 individuals identifies two novel loci associated with bipolar disorder", Hou et al 2016
  149. "Estimating missing heritability for disease from genome-wide association studies", Lee et al 2011
  150. "Quantifying missing heritability at known GWAS loci", Gusev et al 2013
  151. "Genome-wide analyses of borderline personality features", Lubke et al 2014
  152. "Partitioning the heritability of Tourette syndrome and obsessive compulsive disorder reveals differences in genetic architecture", Davis 2013
  153. "Genome-wide analyses of empathy and systemizing: heritability and correlates with sex, education, and psychiatric risk", Warrier et al 2016
  154. "Genetic risk for autism spectrum disorders and neuropsychiatric variation in the general population", Robinson et al 2015
  155. "Common genetic variants, acting additively, are a major source of risk for autism", Klei et al 2012
  156. "Most genetic risk for autism resides with common variation", Gaugler et al 2014
  157. "Variability in the common genetic architecture of social-communication spectrum phenotypes during childhood and adolescence", St Pourcain et al 2014
  158. "Pleiotropic Mechanisms Indicated for Sex Differences in Autism", Mitra et al 2016
  159. "Shared genetic influences between dimensional ASD and ADHD symptoms during child and adolescent development", Stergiakouli et al 2017
  160. "Single nucleotide polymorphism heritability of behavior problems in childhood: genome-wide complex trait analysis", Pappa et al 2015
  161. "Polygenic transmission and complex neuro developmental network for attention deficit hyperactivity disorder: genome-wide association study of both common and rare variants", Yang et al 2013
  162. "Genetic influences on ADHD symptom dimensions: Examination of a priori candidates, gene-based tests, genome-wide variation, and SNP heritability", Bidwell et al 2017
  163. "A genome-wide approach to children's aggressive behavior: The EAGLE consortium", Pappa et al 2015b
  164. "A genome-wide association meta-analysis of preschool internalizing problems", Benke et al 2014
  165. "Heritability and genome-wide analyses of problematic peer relationships during childhood and adolescence", St Pourcain et al 2015
  166. "Heritability of Individual Psychotic Experiences Captured by Common Genetic Variants in a Community Sample of Adolescents", Sieradzka 2015
  167. "Web-based genome-wide association study identifies two novel loci and a substantial genetic component for Parkinson's disease", Do et al 2011
  168. "Using genome-wide complex trait analysis to quantify 'missing heritability' in Parkinson's disease", Keller et al 2012
  169. "Genome-wide analysis of genetic correlation in dementia with Lewy bodies, Parkinson's and Alzheimer's diseases", Guerreiro et al 2016
  170. "Largest GWAS of PTSD (N=20070) yields genetic overlap with schizophrenia and sex differences in heritability", Duncan et al 2017
  171. "Genome-wide meta-analysis identifies six novel loci associated with habitual coffee consumption", The Coffee and Caffeine Genetics Consortium et al 2014
  172. "The genetic aetiology of cannabis use initiation: a meta-analysis of genome-wide association studies and a SNP-based heritability estimation", Verweij et al 2013
  173. "Heritability, SNP- and Gene-Based Analyses of Cannabis Use Initiation and Age at Onset", Minca et al 2015
  174. "Molecular genetic influences on normative and problematic alcohol use in a population-based sample of college students", Webb et al 2017
  175. "Genome-wide association study of alcohol consumption and genetic overlap with other health-related traits in UK Biobank (N=112,117)", Clarke et al 2017
  176. "Genome-wide association study of Alcohol Use Disorder Identification Test (AUDIT) scores in 20,328 research participants of European ancestry", Sanchez-Roige et al 2017
  177. "Examining the role of common genetic variants on alcohol, tobacco, cannabis and illicit drug dependence: Genetics of vulnerability to drug dependence", Palmer et al 2015
  178. "Shared Genetic Architecture Of Asthma With Allergic Diseases: A Genome-wide Cross Trait Analysis Of 112,000 Individuals From UK Biobank", Zhu et al 2017
  179. "Genome-wide association analyses identify new risk variants and the genetic architecture of amyotrophic lateral sclerosis", van Rheenen et al 2016
  180. "Whole genome prediction and heritability of childhood asthma phenotypes", McGeachie et al 2016
  181. "Estimating the proportion of variation in susceptibility to multiple sclerosis captured by common SNPs", Watson et al 2012
  182. "Estimation and partitioning of (co)heritability of inflammatory bowel disease from GWAS and immunochip data", Chen et al 2014
  183. "Common variants explain a large fraction of the variability in the liability to psoriasis in a Han Chinese population", Yin et al 2014
  184. "Bayesian inference analyses of the polygenic architecture of rheumatoid arthritis", Stahl et al 2012
  185. "The contribution of rare variation to prostate cancer heritability", Mancuso et al 2015
  186. "Heritability Estimates Identify a Substantial Genetic Contribution to Risk and Outcome of Intracerebral Hemorrhage", Devan et al 2013
  187. "Estimating the respective contributions of human and viral genetic variation to HIV control", Bartha et al 2015
  188. "Germline genetic contributions to risk for esophageal adenocarcinoma, Barrett's Esophagus, and gastroesophageal reflux", Ek et al 2013
  189. "National clinical audit data decodes the genetic architecture of developmental dysplasia of the hip", Hatzikotoulas et al 2017
  190. "Whole-genome sequence–based analysis of high-density lipoprotein cholesterol", Morrison et al 2013
  191. "Genetic heritability of ischemic stroke and the contribution of previously reported candidate gene and genome-wide associations", Bevan et al 2012
  192. "Type 2 Diabetes Risk Prediction Incorporating Family History Revealing a Substantial Fraction of Missing Heritability", Gim et al 2016
  193. "Genome-Wide Contribution of Genotype by Environment Interaction to Variation of Diabetes-Related Traits", Zheng et al 2013
  194. "Genetic and Environmental Factors Are Associated with Serum 25-Hydroxyvitamin D Concentrations in Older African Americans", Hansen et al 2015
  195. "Whole-genome sequence-based analysis of thyroid function", Taylor et al 2015
  196. "Estimating Telomere Length Heritability in an Unrelated Sample of Adults: Is Heritability of Telomere Length Modified by Life Course Socioeconomic Status?", Faul et al 2016
  197. "Neanderthals’ DNA legacy linked to modern ailments: Humans inherited variants affecting disease risk, infertility, skin and hair characteristics", Stephanie Dutchen, 2014-01-29
    "The phenotypic legacy of admixture between modern humans and Neandertals", Corinne N. Simonti et al, 2016-02-11
  198. "Divergent Ah receptor ligand selectivity during hominin evolution", Troy D. Hubbard et al, 2016-08-02
    "Smoke signals: DNA adaptation helped early humans deal with toxic fumes", Naomi Stewart, 2016-08-02
  199. "Analysis of the genetics of boar taint reveals both single SNPs and regional effects", Rowe et al 2014
  200. "Genome-Wide Association Study on Body Weight Reveals Major Loci on OAR6 in Australian Merino Sheep", Al-Mamun et al 2014
  201. "The genetic basis of host preference and indoor resting behavior in the major African malaria vector, Anopheles arabiensis", Main et al 2016
  202. "Genome-wide association and prediction reveals the genetic architecture of cassava mosaic disease resistance and prospects for rapid genetic improvement", Wolfe et al 2015

This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.