Supplementary Information
Total Page:16
File Type:pdf, Size:1020Kb
1 Supplementary Information 2 “Large-scale genomic analysis identifies 12 loci harbouring genes for human reproductive 3 behaviour and infertility” 4 Table of Contents 5 TABLE OF CONTENTS ................................................................................................................................... 1 6 1. HUMAN REPRODUCTIVE BEHAVIOUR MOTIVATION AND PHENOTYPE DEFINITION .... 3 7 1.1 PHENOTYPE MOTIVATION ........................................................................................................................... 3 8 1.2 EVOLUTIONARY CAUSES OF GENETIC VARIANCE IN FERTILITY .................................................................. 5 9 1.3 ADDITIVE GENETIC VARIATION IN FERTILITY ............................................................................................. 6 10 1.4 DOMINANT GENETIC VARIATION IN FERTILITY .......................................................................................... 7 11 1.5 ENVIRONMENTAL VARIATION IN FERTILITY ............................................................................................... 8 12 1.6 PHENOTYPE DEFINITION .............................................................................................................................. 9 13 1.7 INSTRUCTIONS FOR CONTRIBUTING COHORTS ........................................................................................ 10 14 2. PRIMARY GWAS OF HUMAN FERTILITY ......................................................................................... 11 15 2.1 OVERVIEW OF HUMAN FERTILITY ANALYSES ........................................................................................... 11 16 2.3 GENOTYPING AND IMPUTATION ............................................................................................................... 13 17 2.4 ASSOCIATION ANALYSES ........................................................................................................................... 13 18 2.5 QUALITY CONTROL ................................................................................................................................... 13 19 FILTERS .................................................................................................................................................................................. 14 20 DIAGNOSTIC CHECKS ............................................................................................................................................................ 16 21 2.6 META ANALYSES ........................................................................................................................................ 17 22 3. BIVARIATE AND CONDITIONAL ANALYSIS OF THE TWO FERTILITY-RELATED TRAITS 23 19 24 4. TESTING FOR POPULATION STRATIFICATION ......................................................................... 20 25 4.1 LD SCORE INTERCEPT TEST ..................................................................................................................... 20 26 4.2 STATISTICAL SIGNIFICANCE OF THE POLYGENIC SCORES IN A WF REGRESSION ................................... 22 27 5. SEX-SPECIFIC GENETIC EFFECTS IN FERTILITY ........................................................................ 23 28 5.1 GENETIC OVERLAP AMONG SEXES USING LD SCORE BIVARIATE REGRESSION ........................................ 24 29 5.2 GENETIC OVERLAP AMONG SEXES USING GCTA ...................................................................................... 24 30 BIVARIATE GREML ANALYSIS ........................................................................................................................................... 25 31 5.4 DISCUSSION ............................................................................................................................................... 26 32 6. POLYGENIC SCORES PREDICTION ................................................................................................. 27 33 6.1 LINEAR POLYGENIC SCORES FOR AFB AND NEB ..................................................................................... 27 34 6.2 LINEAR POLYGENIC SCORES FOR INFERTILITY ......................................................................................... 28 35 6.3 ADDITIONAL STATISTICAL MODELS FOR CENSORING AND SELECTION ................................................ 28 36 6.4 LINEAR PREDICTION OF AGE AT MENARCHE AND AGE AT MENOPAUSE USING AFB LINEAR SCORE ... 29 1 1 7 GENETIC CORRELATIONS .................................................................................................................. 30 2 7.1 ESTIMATING GENETIC OVERLAP USING LD SCORE REGRESSION ............................................................. 30 3 7.2 ESTIMATING THE GENETIC CORRELATION BETWEEN AFB AND NEB .................................................... 31 4 7.3 RESULTS: PHENOTYPIC CORRELATIONS WITH HUMAN REPRODUCTIVE BEHAVIOUR ........................ 32 5 7.4 DISCUSSION ............................................................................................................................................ 32 6 7.4.1 HUMAN DEVELOPMENT ........................................................................................................................................ 32 7 7.4.2 CARDIOMETABOLIC TRAITS ................................................................................................................................. 33 8 7.4.3 ANTHROPOMETRIC TRAITS .................................................................................................................................. 36 9 7.4.4 AFB AND EDUCATIONAL ATTAINMENT ............................................................................................................. 37 10 7.4.5 AFB, PERSONALITY AND NEUROPSYCHIATRIC DISORDERS ............................................................................ 37 11 7.4.6 SMOKING BEHAVIOUR ........................................................................................................................................... 38 12 7.4.7 LIMITATIONS OF LD SCORE REGRESSION GENETIC CORRELATIONS ............................................................. 38 13 8. LOOK-UP OF LEAD SNPS IN AFB GWAS FOR AGE AT MENOPAUSE AND AGE AT 14 MENARCHE .................................................................................................................................................... 39 15 9. BIOLOGICAL ANNOTATION ............................................................................................................. 41 16 9.1. IDENTIFYING POTENTIALLY CAUSAL VARIANTS .................................................................................. 41 17 9.2. GENE-BASED GWAS ANALYSIS ............................................................................................................ 41 18 9.3. EQTL AND MQTL ANALYSES ................................................................................................................ 42 19 I. GENOTYPE DATA ........................................................................................................................................................... 42 20 9.3.2 RNA DATA PREPARATION, SEQUENCING AND QUANTIFICATION .................................................................. 42 21 9.3.3 METHYLATION DATA GENERATION, MAPPING AND NORMALIZATION. ........................................................ 43 22 9.3.4 EQTL AND MQTL ANALYSIS ................................................................................................................................ 43 23 9.4. ENDEAVOUR, METARANKER, TOPPGENE AND DEPICT ANALYSES .................................................. 44 24 9.4.1 GENE PRIORITIZATION USING FOUR BIOINFORMATICS APPROACHES .......................................................... 44 25 9.5. FUNCTIONAL NETWORK AND ENRICHMENT ANALYSES ....................................................................... 45 26 10. ADDITIONAL AKNOWLEDGMENTS ............................................................................................. 46 27 11. LIST OF TABLES ................................................................................................................................. 68 28 12. LIST OF FIGURES ............................................................................................................................... 69 29 REFERENCES ..................................................................................................................................................... 72 30 31 2 1 1. HUMAN REPRODUCTIVE BEHAVIOUR MOTIVATION AND PHENOTYPE 2 DEFINITION 3 1.1 Phenotype Motivation 4 5 Human reproductive behaviour – measured by age at first birth (AFB) and number of 6 children ever born (NEB) – is a core topic of research across the medical, social and 7 biological sciences.1 Two central indicators are the tempo of childbearing of age at first birth 8 (AFB) and the quantum or number of children ever born (NEB). NEB is also often referred to 9 in biological research as life-time reproductive success,2 number of offspring3 or as ‘fitness’ 10 in evolutionary studies, which is the function of the number of children of a person in relation 11 to the number of children of peers of the same birth cohort.4,5 Due to improvements in 12 hygiene and the reduction in prenatal, infant and child mortality in industrialized societies, 13 NEB has emerged as the gold standard to measure lifetime reproductive success indicating 14 biological fitness.5 15 16 AFB and NEB are complex phenotypes related not