Selection Against Variants in the Genome Associated PNAS PLUS with Educational Attainment

Selection against variants in the genome associated PNAS PLUS with educational attainment Augustine Konga,b,1, Michael L. Friggea, Gudmar Thorleifssona, Hreinn Stefanssona, Alexander I. Youngc, Florian Zinka, Gudrun A. Jonsdottira, Aysu Okbayd,e, Patrick Sulema, Gisli Massona, Daniel F. Gudbjartssona,b, Agnar Helgasona,f, Gyda Bjornsdottira, Unnur Thorsteinsdottira,g, and Kari Stefanssona,g,1 adeCODE genetics/Amgen Inc., Reykjavik 101, Iceland; bSchool of Engineering and Natural Sciences, University of Iceland, Reykjavik 101, Iceland; cWellcome Trust Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, United Kingdom; dDepartment of Applied Economics, Erasmus School of Applied Economics, Erasmus University Rotterdam, 3062 PA Rotterdam, The Netherlands; eInstitute for Behavior and Biology, Erasmus University Rotterdam, 3062 PA Rotterdam, The Netherlands; fDepartment of Anthropology, University of Iceland, Reykjavik 101, Iceland; and gFaculty of Medicine, University of Iceland, Reykjavik 101, Iceland Edited by Andrew G. Clark, Cornell University, Ithaca, NY, and approved December 5, 2016 (received for review July 22, 2016) Epidemiological and genetic association studies show that genet- 62 cohorts were used to determine the weightings for POLYEDU. ics play an important role in the attainment of education. Here, we We computed POLYEDU for over 150,000 Icelanders who were investigate the effect of this genetic component on the reproduc- directly genotyped with chip arrays and imputed for additional tive history of 109,120 Icelanders and the consequent impact on sequence variants discovered through whole-genome sequencing the gene pool over time. We show that an educational attainment of 8,453 Icelanders (20) (Materials and Methods). POLYEDU was polygenic score, POLYEDU, constructed from results of a recent scaled to an SD of 1, hereafter referred to as standard units (SUs). −100 study is associated with delayed reproduction (P < 10 ) and When applied to 46,079 Icelanders with educational attainment fewer children overall. The effect is stronger for women and re- data POLYEDU was found to explain 3.74% of the trait variance − mains highly significant after adjusting for educational attain- (P < 10 300). By contrast, the strongest single variant only explains ment. Based on 129,808 Icelanders born between 1910 and 1990, 0.10% of the variance, indicating that educational attainment is a we find that the average POLYEDU has been declining at a rate of complex trait influenced by many variants in the genome and ∼ EVOLUTION 0.010 standard units per decade, which is substantial on an evo- highlighting the increased power of using the polygenic score for lutionary timescale. Most importantly, because POLY only cap- EDU our analyses. Our first analysis focused on 109,120 individuals tures a fraction of the overall underlying genetic component the (58,560 females and 50,560 males) with year of birth (yob) be- latter could be declining at a rate that is two to three times faster. tween 1910 and 1975 (Fig. S2). The genealogical database was used to obtain the number of children (NC) and, where applicable, selection | educational attainment | genes | fertility | sequence variants the age at first child (AGFC) and the average age at child birth (AACB) for this set. The estimated effects of POLYEDU on these pidemiological studies have estimated that the genetic com- reproductive traits, adjusted for yob and 20 principal components ponent of educational attainment can account for as much as SOCIAL SCIENCES E (21), are presented in Table 1 for females and males separately. 40% of the trait variance (1). Recent meta-analyses (2, 3) yielded sequence variants contributing to the underlying genetic component. A negative correlation between educational attainment Significance and number of children has been observed in many populations (4–7). A recent study of ∼20,000 genotyped Americans born be- Epidemiological studies suggest that educational attainment is af- tween 1931 and 1953 provided direct evidence that the genetic fected by genetic variants. Results from recent genetic studies allow propensity for educational attainment is associated with reduced us to construct a score from a person’s genotypes that captures a fertility (8, 9), supporting previously postulated notions (10) that portion of this genetic component. Using data from Iceland that the population average of the genetic propensity for educational include a substantial fraction of the population we show that in- attainment and related traits must be declining. Here, using a dividuals with high scores tend to have fewer children, mainly population-wide sample that is both much larger and covers a because they have children later in life. Consequently, the average substantially greater time span, and with additional auxiliary in- score has been decreasing over time in the population. The rate of formation, we aim to estimate the change of the genetic pro- decrease is small per generation but marked on an evolutionary pensity of educational attainment in the Icelandic population over timescale. Another important observation is that the association the last few decades, starting with an in-depth investigation of the between the score and fertility remains highly significant after relationship between a measurable genetic component of educa- adjusting for the educational attainment of the individuals. tional attainment and various aspects of reproduction (11–14). Author contributions: A.K. and K.S. designed research; A.K., H.S., A.I.Y., G.A.J., A.O., P.S., G.M., Results D.F.G., A.H., G.B., and U.T. performed research; A.K., M.L.F., G.T., and F.Z. analyzed data; A.K. derived the mathematical results in Materials and Methods; M.L.F. prepared the figures and The number of living Icelanders is ∼317,000 (Fig. S1). A genea- tables for publication; H.S. provided IQ data and references; G.A.J. processed the IQ data to a logical database of Icelanders (15–17) that is very close to com- form suitable for analyses; A.I.Y. assisted in deriving the mathematical results in Materials and plete for individuals born after 1910 (Materials and Methods)is Methods; A.O. provided meta-analysis results with various cohorts removed; P.S., G.M., and D.F.G. contributed to processing the Icelandic genotype data for analysis; A.H. provided key references used in this study. Probands used for the genetic analyses here are and contributed to writing the Discussion; G.B. collected and processed Icelandic education data limited to those with both parents and all four grandparents listed and provided key references; U.T. oversaw the generation of the genotype data in the labora- in the genealogy. For the fertility studies, only children who sur- tory; A.K. wrote the paper; and K.S. contributed to the writing of the final version of the paper. vived their first year are counted. The first step was to use results The authors declare no conflict of interest. from a recent genome-wide association study (GWAS) of educa- This article is a PNAS Direct Submission. tional attainment (3) to determine the per-locus allele-specific Freely available online through the PNAS open access option. weightings of 620,000 markers used to calculate a polygenic score 1To whom correspondence may be addressed. Email: [email protected] or kari.stefansson@ (18, 19), POLYEDU (see Materials and Methods for details on decode.is. polygenic score construction). After excluding the Icelandic co- This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10. horts in the GWAS to avoid confounding, 278,948 samples from 1073/pnas.1612113114/-/DCSupplemental. www.pnas.org/cgi/doi/10.1073/pnas.1612113114 PNAS Early Edition | 1of6 Downloaded by guest on September 30, 2021 Table 1. Estimated effects of POLYEDU on fertility traits Among the genotyped individuals with yob between 1910 and Female Male 1975, information about educational attainment is available for 25,794 females and 19,903 males. For these individuals, the ef- Trait n Effect PnEffect P fects of POLYEDU and educational attainment (EDU) itself on − × −43 − × −15 the reproductive traits were estimated individually, through NC 58,560 0.084 1.0 10 50,560 0.054 2.2 10 separate regressions, and jointly, through regressions including AGFC 55,208 0.59 5.3 × 10−155 45,669 0.44 6.2 × 10−57 − − both as predictors (Table 2). We coded EDU as in a recent meta- AACB 55,208 0.46 1.0 × 10 117 45,669 0.37 6.5 × 10 50 analysis (3). Individuals fall into four categories: 10, 13, 15, and = = = POLYEDU is in standard units (SU). NC denotes number of children, AGFC 20 years (mean 14.0 and SD 3.4 for males and mean 13.4 denotes age at first child, and AACB denotes average age at child birth. and SD = 3.7 for females). The first category corresponds to the mandatory minimum education in Iceland and the last corresponds to a college degree. For females, when analyzed sepa- For females, an increase of 1 SU of POLYEDU corresponds to an − rately, each SU increase of POLYEDU decreases expected NC by P = × 43 − average decrease of 0.084 children [ 1.0 10 , calculated 0.097 (P = 1.7 × 10 23), whereas each year increase in EDU with genomic control adjustment (22)], and for those with children −56 − corresponds to a reduction of 0.045 (P = 5.0 × 10 ). When AGFC and AACB increased by 0.59 years (P = 5.3 × 10 155)and P = × −117 analyzed jointly, the estimated effect of POLYEDU on NC ad- 0.46 years ( 1.0 10 ), respectively. A similar, albeit weaker, justed for EDU reduces to −0.071, a shrinkage that is meaningful − pattern of results was observed for males. The finding of a sub- but not drastic, and remains highly significant (P = 7.2 × 10 13). stantially stronger association for AGFC than NC suggests that the Similar results were observed for AGFC and AACB.

Selection Against Variants in the Genome Associated PNAS PLUS with Educational Attainment

Identifying Intergenerational Risk Factors for ADHD Symptoms Using Polygenic Scores in the Norwegian Mother, Father and Child Cohort

Common Genetic Variation and Schizophrenia Polygenic Risk

Polygenic Methods and Their Application to Psychiatric Traits

The BRCA1 Tumor Suppressor

Within-Sibship GWAS Improve Estimates of Direct Genetic Effects

LDL Polygenic Risk Scores in Practice

Clinical Use of Current Polygenic Risk Scores May Exacerbate Health Disparities

Genomic Prediction of 16 Complex Disease Risks Including Heart Attack, Diabetes, Breast and Prostate Cancer Louis Lello1, Timothy G

Genetic Risk Score Based on Statistical Learning Florian Prive

A Complex Trait Genomics Approach to Investigating Amyotrophic Lateral Sclerosis

An Epigenome-Wide Association Study Meta-Analysis of Educational Attainment

A Guide to Performing Polygenic Risk Score Analyses 2 Shing Wan Choi1, Timothy Shin Heng Mak2, Paul F