Polygenic Inheritance, GWAS, Polygenic Risk Scores, and the Search for Functional Variants PERSPECTIVE Daniel J

Total Page:16

File Type:pdf, Size:1020Kb

Polygenic Inheritance, GWAS, Polygenic Risk Scores, and the Search for Functional Variants PERSPECTIVE Daniel J PERSPECTIVE Polygenic inheritance, GWAS, polygenic risk scores, and the search for functional variants PERSPECTIVE Daniel J. M. Croucha and Walter F. Bodmerb,1 Edited by Mary-Claire King, University of Washington, Seattle, WA, and approved July 2, 2020 (received for review May 22, 2020) The reconciliation between Mendelian inheritance of discrete traits and the genetically based correlation between relatives for quantitative traits was Fisher’s infinitesimal model of a large number of genetic variants, each with very small effects, whose causal effects could not be individually identified. The de- velopment of genome-wide genetic association studies (GWAS) raised the hope that it would be possible to identify single polymorphic variants with identifiable functional effects on complex traits. It soon be- came clear that, with larger and larger GWAS on more and more complex traits, most of the significant associations had such small effects, that identifying their individual functional effects was essentially hope- less. Polygenic risk scores that provide an overall estimate of the genetic propensity to a trait at the individual level have been developed using GWAS data. These provide useful identification of groups of individuals with substantially increased risks, which can lead to recommendations of medical treatments or behavioral modifications to reduce risks. However, each such claim will require extensive investigation to justify its practical application. The challenge now is to use limited genetic association studies to find individually identifiable variants of significant functional effect that can help to understand the molecular basis of complex diseases and traits, and so lead to improved disease prevention and treatment. This can best be achieved by 1) the study of rare variants, often chosen by careful candidate assessment, and 2) the careful choice of phenotypes, often extremes of a quantitative variable, or traits with relatively high heritability. GWAS | association mapping | polygenic scores The key to Mendel’s successful demonstration of the This was entitled “The correlation between relatives discrete nature of inheritance, without which it would on the supposition of Mendelian inheritance.” He not be possible to maintain the genetic variability re- showed that the observed correlations could be quired by Darwinian evolution by natural selection, explained by a model in which a large number of dis- was his careful choice of clearly dichotomous pheno- crete inherited differences were each inherited types for study. Only in this way could he have ob- according to Mendel’s laws and where each had a served the simple ratios that defined his laws. That is small effect on the quantitative trait in question. The why we refer to clearly inherited, often extreme, dif- cumulative effect of such variants at many loci could ferences that are obviously familial, as “Mendelian.” be assumed to lead to a normal distribution, and it was The ultimate reconciliation between Mendelian in this paper that Fisher introduced the term variance discrete inheritance and the correlation between as we now know it and the concept of the analysis of relatives for continuously inherited traits, such as variance. This “infinitesimal model” is commonly con- height or weight, which had been clearly observed by sidered to be the founding principle of quantitative Francis Galton and his biometrician followers and genetics. It explains observed correlations between which implied inherited tendencies for such traits, was relatives based on Mendelian genetics but does not provided by R. A. Fisher (1) in his seminal 1918 paper. attempt to identify the causal effects of particular aJDRF/Wellcome Diabetes and Inflammation Laboratory, Wellcome Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, United Kingdom; and bCancer and Immunogenetics Laboratory, Weatherall Institute of Molecular Medicine, John Radcliffe Hospital, University of Oxford, Oxford OX3 9DS, United Kingdom Author contributions: W.F.B. designed research; D.J.M.C. and W.F.B. performed research; D.J.M.C. and W.F.B. analyzed data; and D.J.M.C. and W.F.B. wrote the paper. The authors declare no competing interest. This article is a PNAS Direct Submission. Published under the PNAS license. 1To whom correspondence may be addressed. Email: [email protected]. This article contains supporting information online at https://www.pnas.org/lookup/suppl/doi:10.1073/pnas.2005634117/-/DCSupplemental. www.pnas.org/cgi/doi/10.1073/pnas.2005634117 PNAS Latest Articles | 1of10 Downloaded by guest on October 1, 2021 genetic variants on the quantitative trait being studied. Fisher defined genes with recognizably large effects on the chance of returned only once to the analysis of quantitative inheritance us- getting diabetes. ing the infinitesimal model, and this was in the context of plant breeding (2). Here, his aim was to show how statistical approaches HLA and Disease Associations based on analyzing the quantitative distribution of a trait could be While there were claims of associations between ABO types and useful even, as he put it, “when individual factors cannot certain diseases starting in the early 1950s, it was the really striking be recognized....” associations of HLA types with autoimmune diseases, notably At that time, Fisher had become very interested in the HLA-B*27 with ankylosing spondylitis (6), that changed the whole serological techniques then being developed by Charles Todd, field of studying multifactorial inheritance by looking for associa- which were the basis for the later recognition of red cell blood tions between phenotypes and specific, possibly causal, genetic groups. Fisher suggested in a letter to Todd, with remarkable variants. The first suggestion that linkage disequilibrium (LD) with foresight, that these techniques might be able to detect “the di- an unobserved causal variant could account for associations be- rect products of individual genes rather than (those that) have tween a genetic variant and a disease was made by Bodmer (7), secondary reactions.” In this correspondence with Todd, Fisher based on early data on Hodgkin’s disease and this idea was fur- later said that he believed that such “work is going to lead to a ther developed in the context of autoimmune diseases by greater advance, both theoretical and practical, in the problems of McDevitt and Bodmer (8). The role of HLA in the control of the human genetics than can be expected from any further work on immune response provided a clear rationale for testing these as- biometrical or genealogical lines.” Here, he was foreseeing the sociations. Early examples of the use of LD with common HLA possibility of identifying biochemically the products of genes polymorphic alleles for identifying causal variants in novel closely whose variants determine a given human trait, and so identifying linked genes are the discovery of the key role of an HLA-DQ allele the effects of a given variant on the trait at the molecular level. in susceptibility to type I diabetes through the initial disease as- The transition from the “work on biometrical or genealogical sociation with HLA-B*8,15 and then HLA-DR*3,4 (9,10),and lines” to the wish to identify the direct products of genes epito- the discovery of HFE gene variants as the main underlying cause mizes the prevailing conflict in studying the genetics of complex of hereditary hemochromatosis through its association with traits. This conflict is in the contrast between analyses that do not HLA-A*3 (11). aim to identify the contributions of individual genetic variants and those that aim to understand the underlying molecular basis of the Genome-Wide Association Studies trait variation through the identification of specific variants, in It was an obvious possibility to extend the HLA and disease as- genes with defined functions, that have clearly defined effects sociation studies to looking for the association between any on the trait. The concept of a gene in the context we are discus- marker and a disease on the assumption that this could lead, by sing, is any defined mapped DNA sequence in the genome that LD, to the discovery of the functionally relevant variant in the vi- can have some functional effect and so within which a variant cinity of the gene carrying the associated variant (12). Thus, the sequence may have some detectable differential effect on a given interpretation of the reason for an observed association between phenotype. It is this dichotomy of approaches between not iden- a given identified variant and a given phenotype is that either tifying the contribution of individual variants, and understanding there is a direct effect of the variant on the phenotype or that the their molecular function, which we analyze in this paper. association is due to the effect of one or more as-yet-not- identified variants all in strong LD with observed associated vari- Polygenic Inheritance and the Location of Polygenes ant, and collectively with a larger effect on the phenotype being The first use of polygenic in the context of quantitative inheritance studied than the originally associated variant. was by Mather (3) in 1941. He talked of polygenic characters and Many initial studies looking for such disease associations in polygenic variation and postulated that this type of inheritance, genes chosen as likely functionally relevant candidates proved following Fisher’s infinitesimal model, was controlled by a differ- difficult to repeat. This
Recommended publications
  • Response to Selection in Finite Locus Models with Nonadditive Effects
    Journal of Heredity, 2017, 318–327 doi:10.1093/jhered/esw123 Original Article Advance Access publication January 12, 2017 Original Article Response to Selection in Finite Locus Models with Nonadditive Effects Hadi Esfandyari, Mark Henryon, Peer Berg, Jørn Rind Thomasen, Downloaded from https://academic.oup.com/jhered/article/108/3/318/2895120 by guest on 23 September 2021 Piter Bijma, and Anders Christian Sørensen From the Center for Quantitative Genetics and Genomics, Department of Molecular Biology and Genetics, Aarhus University DK-8830 Tjele, Denmark (Esfandyari, Berg, Thomasen, and Sørensen); Seges, Danish Pig Research Centre, Copenhagen, Denmark (Henryon); School of Animal Biology, University of Western Australia, Crawley, Australia (Henryon); Nordic Genetic Resource Center, Ås, Norway (Berg); VikingGenetics, Assentoft, Denmark (Thomasen); and Animal Breeding and Genomics Centre, Wageningen University, Wageningen, The Netherlands (Bijma). Address correspondence to H. Esfandyari at the address above, or e-mail: hadi.esfandyari@ mbg.au.dk. Received May 18, 2016; First decision June 7, 2016; Accepted January 2, 2017. Corresponding Editor: C Scott Baker Abstract Under the finite-locus model in the absence of mutation, the additive genetic variation is expected to decrease when directional selection is acting on a population, according to quantitative-genetic theory. However, some theoretical studies of selection suggest that the level of additive variance can be sustained or even increased when nonadditive genetic effects are present. We tested the hypothesis that finite-locus models with both additive and nonadditive genetic effects maintain more additive genetic variance (VA) and realize larger medium- to long-term genetic gains than models with only additive effects when the trait under selection is subject to truncation selection.
    [Show full text]
  • Directional Selection and Variation in Finite Populations
    Copyright 0 1987 by the Genetics Society of America Directional Selection and Variation in Finite Populations Peter D. Keightley and ‘William G. Hill Institute of Animal Genetics, University of Edinburgh, West Mains Road, Edinburgh EH9 3JN, Scotland Manuscript received April 12, 1987 Revised copy accepted July 16, 1987 ABSTRACT Predictions are made of the equilibrium genetic variances and responses in a metric trait under the joint effects of directional selection, mutation and linkage in a finite population. The “infinitesimal model“ is analyzed as the limiting case of many mutants of very small effect, otherwise Monte Carlo simulation is used. If the effects of mutant genes on the trait are symmetrically distributed and they are unlinked, the variance of mutant effects is not an important parameter. If the distribution is skewed, unless effects or the population size is small, the proportion of mutants that have increasing effect is the critical parameter. With linkage the distribution of genotypic values in the population becomes skewed downward and the equilibrium genetic variance and response are smaller as disequilibrium becomes important. Linkage effects are greater when the mutational variance is contributed by many genes of small effect than few of large effect, and are greater when the majority of mutants increase rather than decrease the trait because genes that are of large effect or are deleterious do not segregate for long. The most likely conditions for ‘‘Muller’s ratchet” are investi- gated. N recent years there has been much interest in the is attained more quickly in the presence of selection I production and maintenance of variation in pop- than in its absence, and is highly dependent on popu- ulations by mutation, stimulated by the presence of lation size.
    [Show full text]
  • Selection Response in Finite Populations
    Copyright 0 1996 by the Genetics Society of America Selection Responsein Finite Populations Ming Wei, Armando Caballero and William G. Hill Institute of Cell, Animal and Population Biology, University of Edinburgh, King’s Buildings, Edinburgh EH9 3JT, United Kingdom Manuscript received November 28, 1995 Accepted for publication September 5, 1996 ABSTRACT Formulae were derived to predict genetic response under various selection schemes assuming an infinitesimal model. Accountwas taken of genetic drift, gametic (linkage) disequilibrium (Bulmereffect), inbreeding depression, common environmental variance, and both initial segregating variance within families (a;wo)and mutational (&) variance. The cumulative response to selection until generation t(C8) can be approximated as where N, is the effective population size, = Ne& is the genetic variancewithin families at the steady state (or one-half the genic variance, which is unaffected by selection), and D is the inbreeding depression per unit of inbreeding. & is the selection response at generation 0 assuming preselection so that the linkage disequilibrium effect has stabilized. P is the derivative of the logarithm of the asymptotic response with respect to the logarithm of the within-family genetic variance, ie., their relative rate of change. & is the major determinant of the short term selection response, but aL, Ne and p are also important for the long term. A selection method of high accuracy using family information gives a small N, and will lead to a larger- response in the short term and a smaller response in the long term, utilizing mutation less efficiently. ENETIC response to one cycle of selection for a 1975) using all information available are formally the G quantitative character is solely a function of selec- methods to achieve the maximum response for one tion accuracy (the correlation between the selection generation of selection.
    [Show full text]
  • The Qtn Program and the Alleles That Matter for Evolution: All That’S Gold Does Not Glitter
    PERSPECTIVE doi:10.1111/j.1558-5646.2011.01486.x THE QTN PROGRAM AND THE ALLELES THAT MATTER FOR EVOLUTION: ALL THAT’S GOLD DOES NOT GLITTER Matthew V. Rockman1,2 1Department of Biology and Center for Genomics and Systems Biology, New York University, 12 Waverly Place, New York, NY 10003 2E-mail: [email protected] Received April 20, 2011 Accepted September 30, 2011 The search for the alleles that matter, the quantitative trait nucleotides (QTNs) that underlie heritable variation within populations and divergence among them, is a popular pursuit. But what is the question to which QTNs are the answer? Although their pursuit is often invoked as a means of addressing the molecular basis of phenotypic evolution or of estimating the roles of evolutionary forces, the QTNs that are accessible to experimentalists, QTNs of relatively large effect, may be uninformative about these issues if large-effect variants are unrepresentative of the alleles that matter. Although 20th century evolutionary biology generally viewed large-effect variants as atypical, the field has recently undergone a quiet realignment toward a view of readily discoverable large- effect alleles as the primary molecular substrates for evolution. I argue that neither theory nor data justify this realignment. Models and experimental findings covering broad swaths of evolutionary phenomena suggest that evolution often acts via large numbers of small-effect polygenes, individually undetectable. Moreover, these small-effect variants are different in kind, at the molecular level, from the large-effect alleles accessible to experimentalists. Although discoverable QTNs address some fundamental evolutionary questions, they are essentially misleading about many others.
    [Show full text]
  • Alleles Versus Mutations: Understanding the Evolution of Genetic Architecture Requires a Molecular Perspective on Allelic Origins
    Alleles versus mutations: Understanding the evolution of genetic architecture requires a molecular perspective on allelic origins By: David L. Remington Remington, D. L. (2015). Alleles versus mutations: understanding the evolution of genetic architecture requires a molecular perspective on allelic origins. Evolution: international journal of organic evolution. doi:10.1111/evo.12775 "This is the peer reviewed version of the following article: Remington, D. L. (2015). Alleles versus mutations: understanding the evolution of genetic architecture requires a molecular perspective on allelic origins. Evolution: international journal of organic evolution. doi:10.1111/evo.12775, which has been published in final form at http://dx.doi.org/10.1111/evo.12775. This article may be used for non-commercial purposes in accordance with Wiley Terms and Conditions for Self-Archiving." Abstract: Perspectives on the role of large-effect quantitative trait loci (QTL) in the evolution of complex traits have shifted back and forth over the past few decades. Different sets of studies have produced contradictory insights on the evolution of genetic architecture. I argue that much of the confusion results from a failure to distinguish mutational and allelic effects, a limitation of using the Fisherian model of adaptive evolution as the lens through which the evolution of adaptive variation is examined. A molecular-based perspective reveals that allelic differences can involve the cumulative effects of many mutations plus intragenic recombination, a model that is supported by extensive empirical evidence. I discuss how different selection regimes could produce very different architectures of allelic effects under a molecular-based model, which may explain conflicting insights on genetic architecture from studies of variation within populations versus between divergently selected populations.
    [Show full text]
  • 100 Years of Quantitative Genetics Theory and Its Applications: Celebrating the Centenary of Fisher 1918
    100 years of quantitative genetics theory and its applications: celebrating the centenary of Fisher 1918 Nick Barton Institute for Science and Technology, Klosterneuburg, Austria The infinitesimal model In his 1918 paper, Fisher introduced the analysis of variance, and reconciled Mendelian genetics with continuous biometric variation. He also showed that when variation is due to very many additive genetic loci, selection hardly changes the genetic variance. This has come to be known as the “infinitesimal model”, and is the foundation for practical animal breeding. Yet, organisms are plainly not simply the sum of their genes, and so it is not clear when the use of the infinitesimal model can be justified. I review the history of the infinitesimal model, and show how it can be extended to include dominance, epistasis, and linkage. Sharon Browning University of Washington, Seattle, Washington, USA Identity by descent and the correlation between distant relatives Identity by descent and the correlation between distant relatives Related individuals have correlated trait values because they share parts of their genomes identical by descent. Because the genome is inherited in large chunks, punctuated by crossovers, the identity by descent (IBD) occurs in long segments. IBD segments inherited from more distant ancestors tend to be shorter due to more opportunities for crossing over. So-called “unrelated” individuals are in fact distant relatives, and share short segments of IBD. I will describe the detection of segments of IBD in population samples using genome-wide genetic data, and the application of these segments to IBD mapping and to estimating heritability. Edward Buckler USDA-ARS, Ithaca, New York, USA How to get to Plant Breeding 4.0, given that Fisher was right? As global population continues to grow and standards of living rise, the world food supply needs to almost double protein production.
    [Show full text]
  • Animal Breeding from Infinitesimal Model to Mas: the Case of a Backcross Design in Dairy Sheep (Sarda X Lacaune) and Its Possible Impact on Selection
    MARKER ASSISTED SELECTION: A FAST TRACK TO INCREASE GENETIC GAIN IN PLANT AND ANIMAL BREEDING? SESSION II: MAS IN ANIMALS ANIMAL BREEDING FROM INFINITESIMAL MODEL TO MAS: THE CASE OF A BACKCROSS DESIGN IN DAIRY SHEEP (SARDA X LACAUNE) AND ITS POSSIBLE IMPACT ON SELECTION G. Pagnacco and A. Carta [email protected] [email protected] Summary Despite the great advancements achieved by quantitative techniques, selection in some species and for some traits, as in dairy sheep, can be very expensive and requires great efforts. In this paper we review the options offered by quantitative methods and by marker-assisted selection and we discuss the possible impact of a QTL detection design based on a backcross experimental population (Sarda X Lacaune) on sheep breeding. Keywords Infinitesimal model, Quantitative Trait Loci, Marker-assisted selection, dairy sheep, Backcross Design Abstract Selection under the infinitesimal model Standard animal breeding techniques have been largely successful in improving the performance of domestic animals in the last century. The success of these techniques finds its roots in the theoretical work of Fisher and Wright who showed how selection, migration and mutation could be derived from the simple Mendelian laws. All that theory was then transferred into the operational application in animal breeding by J. L. Lush (Animal Breeding Plans, 1945) and by his many “sons” and “grandsons”. In more recent times another landmark book, Introduction to Quantitative Genetics, (Falconer, 1960) probably represents the official birth of the quantitative genetics in plant and especially animal breeding. Plant and animal breeding have based their tools of investigation on this “central paradigm for the analysis of phenotypic variation” (Lynch and Walsh, 1998) even if their methods have radically diverged probably because of the different population structure in crop and domestic animals.
    [Show full text]
  • Modeling Linkage Disequilibrium 1 Increases Accuracy of Polygenic
    1 Modeling Linkage Disequilibrium 2 Increases Accuracy of Polygenic Risk 3 Scores 4 Bjarni J. Vilhjálmsson1,2,3,4,*, Jian Yang5,6, Hilary K. Finucane1,2,3,7, 5 Alexander Gusev1,2,3, Sara Lindström1,2, Stephan Ripke8,9, Giulio 6 Genovese3,8,10, Po-Ru Loh1,2,3, Gaurav Bhatia1,2,3, Ron Do11,12, Tristan 7 Hayeck1,2,3, Hong-Hee Won3,13, Schizophrenia Working Group of the 8 Psychiatric Genomics Consortium, the Discovery, Biology, and Risk of 9 Inherited Variants in Breast Cancer (DRIVE) study, Sekar Kathiresan3,13, 10 Michele Pato14, Carlos Pato14, Rulla Tamimi1,2,15, Eli Stahl16, Noah 11 Zaitlen17, Bogdan Pasaniuc18, Gillian Belbin11,12, Eimear Kenny11,12,19,20, 12 Mikkel H. Schierup4, Philip De Jager3,21,22 , Nikolaos A. Patsopoulos3,21,22, 13 Steve McCarroll3,8,10, Mark Daly3,8, Shaun Purcell15, Daniel Chasman23, 14 Benjamin Neale3,8, Michael Goddard24,25, Peter M. Visscher5,6, Peter 15 Kraft1,2,3,26, Nick Patterson3, Alkes L. Price1,2,3,26,* 16 17 18 1. Department of Epidemiology, Harvard T.H. Chan School of Public 19 Health, Boston, 02115 MA, USA. 20 2. Program in Genetic Epidemiology and Statistical Genetics, Harvard 21 T.H. Chan School of Public Health, Boston, 02115 MA, USA. 22 3. Program in Medical and Population Genetics, Broad Institute of 23 Harvard and MIT, Cambridge, 02142 MA, USA. 24 4. Bioinformatics Research Centre, Aarhus University, 8000 Aarhus, 25 Denmark. 26 5. Queensland Brain Institute, The University of Queensland, 27 Brisbane, 4072 Queensland, Australia. 28 6. The Diamantina Institute, The Translational Research Institute, 29 University of Queensland, Brisbane, 4101 Queensland, Australia.
    [Show full text]
  • Fisher's Infinitesimal Model: a Story for the Ages
    Theoretical Population Biology 118 (2017) 46–49 Contents lists available at ScienceDirect Theoretical Population Biology journal homepage: www.elsevier.com/locate/tpb Commentary Commentary: Fisher's infinitesimal model: A story for the ages Michael Turelli Department of Evolution and Ecology, University of California, Davis, CA, USA article info a b s t r a c t Article history: Mendel (1866) suggested that if many heritable ``factors'' contribute to a trait, near-continuous variation Received 18 September 2017 could result. Fisher (1918) clarified the connection between Mendelian inheritance and continuous trait Available online 5 October 2017 variation by assuming many loci, each with small effect, and by informally invoking the central limit theorem. Barton et al. (2017) rigorously analyze the approach to a multivariate Gaussian distribution of Keywords: the genetic effects for descendants of parents who may be related. This commentary distinguishes three Polygenic traits Gaussian approximations nested approximations, referred to as ``infinitesimal genetics,'' ``Gaussian descendants'' and ``Gaussian Selection response population,'' each plausibly called ``the infinitesimal model.'' The first and most basic is Fisher's ``infinites- Additive effects imal'' approximation of the underlying genetics – namely, many loci, each making a small contribution to Epistasis the total variance. As Barton et al. (2017) show, in the limit as the number of loci increases (with enough Linkage disequilibria additivity), the distribution of genotypic values
    [Show full text]
  • The Infinitesimal Model: Definition, Derivation, and Implications Nick Barton, Alison Etheridge, Amandine Véber
    The infinitesimal model: definition, derivation, and implications Nick Barton, Alison Etheridge, Amandine Véber To cite this version: Nick Barton, Alison Etheridge, Amandine Véber. The infinitesimal model: definition, derivation, and implications. Theoretical Population Biology, Elsevier, 2017. hal-02113158 HAL Id: hal-02113158 https://hal.archives-ouvertes.fr/hal-02113158 Submitted on 27 Apr 2019 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. The infinitesimal model: definition, derivation, and implications N H Barton∗ Institute of Science and Technology Am Campus I A-3400 Klosterneuberg Austria Email [email protected] A M Etheridgey Department of Statistics University of Oxford 24{29 St Giles Oxford OX1 3LB, UK A V´eberz Centre de Math´ematiquesAppliqu´ees Ecole´ Polytechnique Route de Saclay 91128 Palaiseau Cedex France May 16, 2017 Abstract Our focus here is on the infinitesimal model. In this model, one or several quantitative traits are described as the sum of a genetic and a non-genetic component, the first being distributed within families as a normal random variable centred at the average of the parental genetic components, and with a variance independent of the parental traits.
    [Show full text]
  • 100 Years of Quantitative Genetics Theory and Its Applications: Celebrating the Centenary of Fisher 1918”
    Report on the meeting “100 years of quantitative genetics theory and its applications: celebrating the centenary of Fisher 1918” October 9, 2018 Wolfson Hall, Royal College of Surgeons, Nicolson St., Edinburgh EH8 9DW Sponsored by The Fisher Memorial Trust, the Genetics Society, the Galton Institute, the London Mathematical Society and the Royal Statistical Society Jessica G. King Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh Ronald A. Fisher’s 1918 paper, entitled “The correlation between relatives on the supposition of Mendelian inheritance” and published in the Transactions of the Royal Society of Edinburgh, set the foundations for the study of the genetics of quantitative traits. 100 years later, we celebrate Fisher’s contribution and reflect on the advances made since this classical paper first emerged. Context Prior to R.A. Fisher’s famous contribution, the genetic basis of evolutionary change was vigorously disputed between biometricians and Mendelians. In support of Darwin’s theory of evolution by natural selection, biometricians believed evolution to be a continuous process, having developed much of modern statistical methods such as regression and correlation to describe the inheritance of biometric (continuous or quantitative) traits. After the rediscovery of Mendel’s work on inheritance, the Mendelians argued against these views by vehemently supporting discontinuous evolution via Mendelian (discontinuous) traits controlled by the segregation of major genetic factors. The first attempts to reconcile the two opposing schools of thought were made independently by George Udny Yule in 1902 and Wilhelm Weinberg in 1910, whose studies were largely overlooked by both biometricians and Mendelians, blinded by the ongoing conflict.
    [Show full text]
  • An Infinitesimal Model for Quantitative Trait Genomic Value Prediction
    An Infinitesimal Model for Quantitative Trait Genomic Value Prediction Zhiqiu Hu1, Zhiquan Wang2, Shizhong Xu1* 1 Department of Botany and Plant Sciences, University of California Reverside, Reverside, California, United States of America, 2 Department of Agricultural Food and Nutritional Science, University of Alberta, Edmonton, Alberta, Canada Abstract We developed a marker based infinitesimal model for quantitative trait analysis. In contrast to the classical infinitesimal model, we now have new information about the segregation of every individual locus of the entire genome. Under this new model, we propose that the genetic effect of an individual locus is a function of the genome location (a continuous quantity). The overall genetic value of an individual is the weighted integral of the genetic effect function along the genome. Numerical integration is performed to find the integral, which requires partitioning the entire genome into a finite number of bins. Each bin may contain many markers. The integral is approximated by the weighted sum of all the bin effects. We now turn the problem of marker analysis into bin analysis so that the model dimension has decreased from a virtual infinity to a finite number of bins. This new approach can efficiently handle virtually unlimited number of markers without marker selection. The marker based infinitesimal model requires high linkage disequilibrium of all markers within a bin. For populations with low or no linkage disequilibrium, we develop an adaptive infinitesimal model. Both the original and the adaptive models are tested using simulated data as well as beef cattle data. The simulated data analysis shows that there is always an optimal number of bins at which the predictability of the bin model is much greater than the original marker analysis.
    [Show full text]