Pseudosampling Method

Total Page:16

File Type:pdf, Size:1020Kb

Pseudosampling Method Proc. Nat. Acad. Sci. USA Vol. 80, pp. 1048-1052, February 1983 Genetics Selective constraint in protein polymorphism: Study of the effectively neutral mutation model by using an improved pseudosampling method (population genetics/neutral theory of molecular evolution/polyallelic random drift) MOTOo KIMURA AND NAOYUKI TAKAHATA National Institute of Genetics, Mishima, 411 Japan Contributed by Motoo Kimura, September 30, 1982 ABSTRACT To investigate the pattern of allelic distribution tiallelic frequencies when mutation, selection, and random drift in enzyme polymorphismwith special reference to the relation- balance each other (see ref. 14, p. 394, for a review). Later, ship between the mean (H) and the variance (VH) of heterozy- Watterson (15) and Li (16) developed a method for calculating gosity, we used the model of effectively neutral mutations involv- moments of allele frequencies from Wright's formula. Theo- ing multiple alleles in which selective disadvantage of mutant retically, the formula and the method could be applied to treat alleles follows a r distribution. A simulation method was devel- the present model. It turned out, however, that when the pos- oped that enables us to study efficiently the process of random sible number of selectively different alleles involved is large or drift in a multiallelic genetic system and that saves a great deal the product of the effective population size and selection coef- ofcomputer time. Itis an improved version ofthe pseudosampling- ficient is large, or both are large, the application ofthis method variable (PSV) method [Kimura, M. (1980) Proc. NatL Acad. Sci. is very difficult. USA 77, 522-526] previously used to simulate random drift in a Here, we intend to present a method that can simulate the diallelic system. This method will be useful for simulating many process ofpolyallelic random drift very efficiently. This method models of population genetics that involve behavior of multiple (17) and alleles in a finite population. By using this method, it was shown is an improved version ofthe pseudosampling method that, as compared with the model ofstrictly neutral mutations, the it contains a device (to be called a "telescoping" method) ofsam- present model gives the reduction ofboth H and VH and an excess pling multiple alleles. As compared with other methods (18- of rare variant alleles. The results were discussed in the light of 20), it can treat a multiallelic system much more easily. Fur- recent observations on protein polymorphism with special refer- thermore, it has the merit of enabling us to incorporate a suit- ence to the functional constraint of proteins involved. able adjustment for rare alleles easily (see also ref. 21), which makes the simulation valid over all the combination of param- From the standpoint of the neutral mutation-random drift hy- eter values. This adjustment is particularly important when we pothesis (1) (or the neutral theory, for short; see refs. 2 and 3 try to obtain a satisfactory distribution for a whole domain of for review), protein polymorphism is a transient phase of mo- allele frequencies. lecular evolution (4); therefore, it is expected to be influenced by selective constraint in a similar way as in molecular evolution Telescoping method of sampling multiple alleles (5). In other words, the stronger the functional and structural constraints to which a protein is subject, the smaller the prob- The essential idea underlying the PSV method is that we sim- ability of an amino acid change by mutation being selectively ulate the diffusion model of the process of gene frequency neutral (i.e., not harmful) and, therefore, the lower the het- change by generating a sequence ofsome simple random num- erozygosity at its locus. bers, rather than faithfully following binomial or multinomial The observation that loci for substrate-specific enzymes are process by drawing, in each generation, individual gametes on the average less heterozygous than those for substrate-non- from a finite population. For example, in the simplest case of specific enzymes (6) is consistent with this expectation. Fur- two neutral alleles in a diploid population ofsize N, we generate thermore, Yamazaki (7) and more recently Gojobori (8) showed in each generation a random number from the uniform distri- that substrate specific enzymes not only have lower mean het- bution having the mean 0 and the variance x(l - x)/(2N), where erozygosity but also have smaller variance of heterozygosity x is the frequency ofone ofthe alleles at a given moment. Then, among species than expected from the standard neutral infinite this random number is added to x to form the allelic frequency allele model (9). in the next generation. This saves computer time enormously In this paper, we intend to show that these observations can and allows us to simulate easily a process of change in a very be explained by the model ofeffectively neutral mutations (10) large population. Also, we can make many replicate trials with- in which selective constraint (negative selection) is incorpo- out prohibitive computing time. Originally, the PSV method rated. This model is based on the idea that selective neutrality was intended to simulate the diffusion process itselfrather than is the limit when the selective disadvantage becomes indefi- the discrete binomial sampling process ("Fisher-Wright model") nitely small (5) and is an extention ofOhta's model (11) in which for which the diffusion model is usually regarded as an approx- the selection coefficients against the mutants follow an expo- imation (17). nential distribution. These models are ultimately traced to In the present paper, we shall be concerned with a multial- lelic system and assume that there are K possible allelic states Ohta's hypothesis (12, 13) that very slightly deleterious muta- of ef- tions as well as the strictly neutral mutations play an important (K > 2) at a locus in a random-mating diploid population fective size Ne. Let Ak be the kth allele (k = 1, 2, .. ., K), whose role in molecular evolution and polymorphism. We first sam- One problem in our approach is that rigorous mathematical frequency in the population before sampling is Xk. study of the model is rather difficult. Previously, Wright de- rived a general formula for the equilibrium distribution ofmul- Abbreviation: PSV, pseudosampling variable. 1048 Genetics:Genetics:KimuraKimuraandandTakahataTakahata~~~~Proc.Nati. Acad. Sci. USA 80 (1983) 1049 pie allele Al so that its number follows binomial distribution we obtain Yk' by if is small, B(illn, x1 =()x~"1 -i [1] Yk' =- 7lk/nk nkYk or [6b] where n = 2Ne is the total number of gametes that contributes Yk' = 1 - ~k/nk if ~k(I - y~) is small, *to the next generation and il is the number ofgametes that con- tains allele Al after sampling. Actually, there is no need for the where ilk and 4k are Poisson random numbers with the mean distribution to follow faithfully the binomial distribution, but nk Yk and lk(1 - yk), respectively (see also ref. 21). we simply generate a uniform random number, U1 with mean The approximations given either by Eqs. 6a or 6b are satis- 4)~ and variance unity, and substitute the- frequency of Al after factory when nk is reasonably large, say larger than 20. If, on sampling (corresponding to il/n) by the other hand, nk 45 20, we must resort to a more exact pro- cedure, and in this paper, we adopt the following, method: we Xr= x1 + U1 V{[xi(l - xi)]/n}. [2] generate nk random variates that follow the uniform distribution in count number of such variates that we is is the range (0, 1), and the One caution need to take here that when Al represented is a binomial a small number of individuals so that is less than happen to fall in the range (0, yb). This number, ik, by very nx, 3, random so we let say, then a more exact sampling procedure must be used (see number, below). A similar caution is also needed when x1 is near unity, Yi' = iknk (if nk -< 20), [6c] so that n(l - x1) is small, say less than 3. Next, we sample A2 so that the number of gametes (i2) containing this allele follows from which we obtain Xk' by using Eq. 5b. the distribution B0i21n, Y2), where n2 = n - il is the remainder The advantage ofthe telescoping method in saving computer of the gametes after AI is removed, and Y2 = X2/(1 - xj). Like- time comes from the fact that we can obtain each of the Yk'S wise, A3 is sampled so that the number ofgametes (i)containing (and therefore XkI, the frequency of Ak, by using Eq. 5b) by this allele follows the distribution B(iajn3, Y3), where n3 = n generating only one random number when nk is larger than 20. - il - i2 and y3 = X3/(1 - XI- X2). A similar procedure is re- In addition, the accuracy and reliability are ensured by incor- peated until all K - 1 alleles are sampled. In general, if ik (k porating -adjustments 6b and 6c for treating rare variant alleles. - 1, 2,..,K - 1) is the number ofAk-bearing gametes in the These adjustments are particularly pertinent ifwe note that, for sample, this number follows Yk =:0 or 1, application ofEq. 6a frequently produces Yk' values either ~negative or larger than 1, thus creating a possibility of serious bias in a simulation experiment. Such adjustments were not in the treatments Kimura and Ma- where = Xl- incorporated by. (17) by nk n-il-i2...--ikl1andYk Xk/(1- X2 ruyama and Takahata (22). Later, the problem was treated in . .. = n and = for k =1). Then, it can -Xk-1). (Note: nj yi x, a heuristic manner by Maruyama and Nei (19) and Takahata (20). be shown that Their heuristic treatment is concerned with allele frequency E1ik = n~,[4] changes due to mutation; thus, in the absence ofmutation, their simulation methods ofrandom drift are not satisfactory for treat- Varlik} = flXk(l - X) ing the changes of rare alleles.
Recommended publications
  • IN EVOLUTION JACK LESTER KING UNIVERSITY of CALIFORNIA, SANTA BARBARA This Paper Is Dedicated to Retiring University of California Professors Curt Stern and Everett R
    THE ROLE OF MUTATION IN EVOLUTION JACK LESTER KING UNIVERSITY OF CALIFORNIA, SANTA BARBARA This paper is dedicated to retiring University of California Professors Curt Stern and Everett R. Dempster. 1. Introduction Eleven decades of thought and work by Darwinian and neo-Darwinian scientists have produced a sophisticated and detailed structure of evolutionary ,theory and observations. In recent years, new techniques in molecular biology have led to new observations that appear to challenge some of the basic theorems of classical evolutionary theory, precipitating the current crisis in evolutionary thought. Building on morphological and paleontological observations, genetic experimentation, logical arguments, and upon mathematical models requiring simplifying assumptions, neo-Darwinian theorists have been able to make some remarkable predictions, some of which, unfortunately, have proven to be inaccurate. Well-known examples are the prediction that most genes in natural populations must be monomorphic [34], and the calculation that a species could evolve at a maximum rate of the order of one allele substitution per 300 genera- tions [13]. It is now known that a large proportion of gene loci are polymorphic in most species [28], and that evolutionary genetic substitutions occur in the human line, for instance, at a rate of about 50 nucleotide changes per generation [20], [24], [25], [26]. The puzzling observation [21], [40], [46], that homologous proteins in different species evolve at nearly constant rates is very difficult to account for with classical evolutionary theory, and at the very least gives a solid indication that there are qualitative differences between the ways molecules evolve and the ways morphological structures evolve.
    [Show full text]
  • Experiments on the Role of Deleterious Mutations As Stepping Stones In
    Experiments on the role of deleterious mutations as PNAS PLUS stepping stones in adaptive evolution Arthur W. Covert IIIa,b,c,d, Richard E. Lenskia,c,e,1, Claus O. Wilkec,d, and Charles Ofriaa,b,c,1 aProgram in Ecology, Evolutionary Biology and Behavior, Departments of bComputer Science and Engineering and eMicrobiology and Molecular Genetics, and cBEACON Center for the Study of Evolution in Action, Michigan State University, East Lansing, MI 48824; and dCenter for Computational Biology, Institute for Cellular and Molecular Biology, and Department of Integrative Biology, University of Texas at Austin, Austin, TX 78712 Contributed by Richard E. Lenski, July 16, 2013 (sent for review April 22, 2013) Many evolutionary studies assume that deleterious mutations spread unless they are tightly linked (5, 6). In an asexual pop- necessarily impede adaptive evolution. However, a later mutation ulation, however, the fortuitous combination, once formed, will that is conditionally beneficial may interact with a deleterious be stably inherited, thereby providing a simple way to traverse a predecessor before it is eliminated, thereby providing access to fitness valley (5–10). adaptations that might otherwise be inaccessible. It is unknown Previous research on a variety of systems, both biological and whether such sign-epistatic recoveries are inconsequential events computational, has documented many examples of epistatic inter- or an important factor in evolution, owing to the difficulty of actions between mutations (3, 6, 11, 14–24), including a few cases monitoring the effects and fates of all mutations during experi- of mutational pairs that are individually deleterious but jointly ments with biological organisms.
    [Show full text]
  • The Neutral Theory of Molecular Evolution
    Copyright 2000 by the Genetics Society of America Perspectives Anecdotal, Historical and Critical Commentaries on Genetics Edited by James F. Crow and William F. Dove Thomas H. Jukes (1906±1999) James F. Crow Genetics Department, University of Wisconsin, Madison, Wisconsin 53706 OM Jukes accepted our invitation to write a Perspec- duced an immediate outcry from traditional students Ttives on the early history of molecular evolution, of evolution, undoubtedly abetted by the title. In the and in August 1999 he sent a rough beginning con- ensuing polemics, Kimura played the major role. King taining some now-forgotten early history. He planned an died prematurely in 1983 and Jukes wrote mainly about extensive revision and continuation, but on November 1 other things, although he did participate in one joint his death intervened. We have decided to publish his paper (Jukes and Kimura 1984). One of his interests early draft, realizing that it was but a start toward the was the evolution of the genetic code (Jukes 1983). I article that he had planned. particularly liked his showing how, in an orderly sequen- Tom, along with Jack L. King and Motoo Kimura, tial way, mutation pressure in the codon and anti-codon formulated the neutral theory of molecular evolution. could produce the unexpected codes in bacteria and Earlier, the idea had been foreshadowed by Sueoka mitochondria (Jukes 1985). He also developed a widely (1962) and Freese (1962). They had each suggested used correction for multiple undetected changes in evo- mutation pressure of near-neutral changes to account lutionary base substitutions (Jukes and Cantor 1969).
    [Show full text]
  • Population Genetics of Neutral Mutations in Exponentially
    The Annals of Applied Probability 2013, Vol. 23, No. 1, 230–250 DOI: 10.1214/11-AAP824 c Institute of Mathematical Statistics, 2013 POPULATION GENETICS OF NEUTRAL MUTATIONS IN EXPONENTIALLY GROWING CANCER CELL POPULATIONS By Rick Durrett1 Duke University In order to analyze data from cancer genome sequencing projects, we need to be able to distinguish causative, or “driver,” mutations from “passenger” mutations that have no selective effect. Toward this end, we prove results concerning the frequency of neutural mutations in exponentially growing multitype branching processes that have been widely used in cancer modeling. Our results yield a simple new population genetics result for the site frequency spectrum of a sample from an exponentially growing population. 1. Introduction. It is widely accepted that cancers result from an accu- mulation of mutations that increase the fitness of tumor cells compared to the cells that surround them. A number of studies [Sj¨oblom et al. (2006), Wood et al. (2007), Parsons et al. (2008), The Cancer Genome Atlas (2008) and Jones et al. (2008, 2010)] have sequenced the genomes of tumors in or- der to find the causative or “driver” mutations. However, due to the large number of genes being sequenced, one also finds a large number of “passen- ger” mutations that are genetically neutral and hence have no role in the disease. To explain the issues involved in distinguishing the two types of mutations, it is useful to take a look at a data set. Wood et al. (2007) did a “discovery” screen in which 18,191 genes were sequenced in 11 colorectal cancers, and then a “validation” screen in which the top candidates were sequenced in 96 additional tumors.
    [Show full text]
  • Effectively Neutral Mutations
    Proc. Natl. Acad. Sci. USA Vol. 76, No. 7, pp. 3440-3444, July 1979 Genetics Model of effectively neutral mutations in which selective constraint is incorporated (molecular evolution/protein polymorphism/population genetics/neutral mutation theory) MOTOO KIMURA National Institute of Genetics, Mishima 411, Japan Contributed by Motoo Kimura, April 25, 1979 ABSTRACT Based on the idea that selective neutrality is however, Ohta's model has a drawback in that it cannot ac- the limit when the selective disadvantage becomes indefinitely commodate enough mutations that behave effectively as neutral small, a model of neutral (and nearly neutral) mutations is pro- when the population size gets large. This difficulty can be posed that assumes that the selection coefficient (s') against the overcome by assuming that the selection coefficients follow a mutant at various sites within a cistron (gene) follows a r dis- r distribution. tribution; Ass') = aPe-as's.-l/P(,B), in which a = fi/i and s' is the mean selection coefficient against the mutants (?'> 0; 1 _ ft > 0). The mutation rate for alleles whose selection coef- MODEL OF EFFECTIVELY NEUTRAL ficients s' lie in the range between 0 and 1/(2Ne), in which Ne MUTATIONS is the effective population size, is termed the effectively neutral mutation rate (denoted by ye). Using the model of "infinite Let us assume that the frequency distribution of the selective sites" in population genetics, formulas are derived giving the disadvantage (denoted by s') of mutants among different sites average heterozygosity (he) and evolutionary rate per generation follows the r distribution (kg) in terms of mutant substitutions.
    [Show full text]
  • Genomic Insights Into Positive Selection
    Review TRENDS in Genetics Vol.22 No.8 August 2006 Genomic insights into positive selection Shameek Biswas and Joshua M. Akey Department of Genome Sciences, University of Washington, 1705 NE Pacific, Seattle, WA 98195, USA The traditional way of identifying targets of adaptive more utilitarian benefits, each target of positive selection evolution has been to study a few loci that one has a story to tell about the historical forces and events hypothesizes a priori to have been under selection. that have shaped the history of a population. This approach is complicated because of the confound- Several genome-wide analyses for positive selection ing effects that population demographic history and have been performed in a variety of species. In this review, selection have on patterns of DNA sequence variation. In we summarize some of the recent studies, primarily principle, multilocus analyses can facilitate robust focusing on humans, critically evaluate what genome- inferences of selection at individual loci. The deluge of wide scans for selection are and are not likely to find and large-scale catalogs of genetic variation has stimulated suggest future avenues of research. A brief overview of many genome-wide scans for positive selection in statistical methods used to detect deviations from several species. Here, we review some of the salient neutrality is summarized in Box 1. For more detailed observations of these studies, identify important chal- discussions, see Refs [6,7]. lenges ahead, consider the limitations of genome-wide scans for selection and discuss the potential significance Thinking genomically of a comprehensive understanding of genomic patterns Positive selection perturbs patterns of genetic variation of selection for disease-related research.
    [Show full text]
  • Evolution on Neutral Networks Accelerates the Ticking Rate of the Molecular Clock Arxiv:1307.0968V2 [Q-Bio.PE] 24 Sep 2014
    Evolution on neutral networks accelerates the ticking rate of the molecular clock Susanna Manrubia1 and Jos´eA. Cuesta2;3 Grupo Interdisciplinar de Sistemas Complejos (GISC), Madrid 1 Dept. de Biolog´ıade Sistemas, Centro Nacional de Biotecnolog´ıa(CSIC) c/ Darwin 3, 28045 Madrid, Spain. 2 Dept. de Matem´aticas,Universidad Carlos III de Madrid 28911 Legan´es,Madrid, Spain. 3 Instituto de Biocomputaci´ony F´ısicade Sistemas Complejos (BIFI) Universidad de Zaragoza, 50009 Zaragoza, Spain. June 17, 2021 Abstract Large sets of genotypes give rise to the same phenotype because phenotypic expression is highly redundant. Accordingly, a population can accept mutations without altering its phenotype, as long as the genotype mutates into another one on the same set. By linking every pair of genotypes that are mutually accessible through mutation, geno- types organize themselves into neutral networks (NN). These networks are known to be heterogeneous and assortative, and these properties affect the evolutionary dynamics of the population. By studying the dynamics of populations on NN with arbitrary topology we analyze the arXiv:1307.0968v2 [q-bio.PE] 24 Sep 2014 effect of assortativity, of NN (phenotype) fitness, and of network size. We find that the probability that the population leaves the network is smaller the longer the time spent on it. This progressive \phenotypic entrapment" entails a systematic increase in the overdispersion of the process with time and an acceleration in the fixation rate of neutral mutations. We also quantify the variation of these effects with the size of the phenotype and with its fitness relative to that of neighbouring alternatives.
    [Show full text]
  • A Study on a Nearly Neutral Mutation Model in Finite Populations
    Copyright 0 1991 by the Genetics Society of America A Study on a Nearly Neutral Mutation Model in Finite Populations Hidenori Tachida National Institute of Genetics, Mishima, Shizuoka-ken 41 1,Japan Manuscript received August 16, 1990 Accepted for publication January 19, 199 1 ABSTRACT As a nearly neutral mutation model, the house-of-cards model is studied in finite populations using computer simulations. The distribution of the mutant effect is assumed to be normal. The behavior is mainly determined by the product of the population size, N, and the standard deviation, u, of the distribution of the mutant effect. If 4Nu is large compared to one, a few advantageous mutants are quickly fixed in early generations. Then most mutation becomes deleterious and very slow increase of the average selection coefficient follows. It takesvery long for the population to reach the equilibrium state. Substitutions of alleles occur very infrequently in the later stage. If 4Na is the order of one or less, the behavior is qualitatively similar to that of the strict neutral case. Gradual increase of the average selection coefficient occurs and in generations of several times the inverse of the mutation rate the population almost reaches the equilibrium state. Both advantageous and neutral (including slightly deleterious) mutations are fixed. Except in the early stage, an increase of the standard deviation of the distribution of the mutant effect decreases the average heterozygosity. The substitution rate is reduced as 4Nu is increased. Three tests of neutrality, one using the relationship between the average and the variance of heterozygosity, another using the relationship between the average heterozygosity and the average number of substitutions and Watterson’s homozygosity test are applied to the consequences of the present model.
    [Show full text]
  • Neutral Evolution of Model Proteins: Diffusion in Sequence Space and Overdispersion
    Neutral evolution of model proteins: diffusion in sequence space and overdispersion. Ugo Bastolla(a)∗, H. Eduardo Roman(b) and Michele Vendruscolo(c) (a)HLRZ, Forschungszentrum J¨ulich, D-52425 J¨ulich, Germany (b)Dipartimento di Fisica and INFN, Universit`adi Milano, I-20133 Milano Italy (c)Department of Physics of Complex Systems, Weizmann Institute of Science, Rehovot 76100, Israel We simulate the evolution of model protein sequences subject to mutations. A mutation is considered neutral if it conserves 1) the structure of the ground state, 2) its thermodynamic stabil- ity and 3) its kinetic accessibility. All other mutations are considered lethal and are rejected. We adopt a lattice model, amenable to a reliable solution of the protein folding problem. We prove the existence of extended neutral networks in sequence space – sequences can evolve until their similarity with the starting point is almost the same as for random sequences. Furthermore, we find that the rate of neutral mutations has a broad distribution in sequence space. Due to this fact, the substitution process is overdispersed (the ratio between variance and mean is larger than one). This result is in contrast with the simplest model of neutral evolution, which assumes a Poisson process for substitutions, and in qualitative agreement with biological data. I. INTRODUCTION A recent study on the Protein Data Bank (PDB) showed that the distribution of pairwise sequence identity between structurally homologous proteins presents a large Gaussian peak at 8.5% sequence identity, only slightly larger than what expected in the purely random case1 (Rost, 1998). This is an interesting result which means that the structural similarity does not imply sequence similarity.
    [Show full text]
  • “Neutralism” Elsevier Handbook in Philosophy of Biology Stephens and Matthen, Editors Anya Plutynski Fall 2004 in 1968, Moto
    “Neutralism” Elsevier Handbook in Philosophy of Biology Stephens and Matthen, Editors Anya Plutynski Fall 2004 In 1968, Motoo Kimura submitted a note to Nature entitled “Evolutionary Rate at the Molecular Level,” in which he proposed what has since become known as the neutral theory of molecular evolution. This is the view that the majority of evolutionary changes at the molecular level are caused by random drift of selectively neutral or nearly neutral alleles. Kimura was not proposing that random drift explains all evolutionary change. He does not challenge the view that natural selection explains adaptive evolution, or, that the vertebrate eye or the tetrapod limb are products of natural selection. Rather, his objection is to “panselectionism’s intrusion into the realm of molecular evolutionary studies”. According to Kimura, most changes at the molecular level from one generation to the next do not affect the fitness of organisms possessing them. King and Jukes (1969) published an article defending the same view in Science, with the radical title, “Non- Darwinian Evolution,” at which point, “the fat was in the fire” (Crow, 1985b). The neutral theory was one of the most controversial theories in biology in the late twentieth century. On the one hand, the reaction of many biologists was extremely skeptical; how could evolution be “non-Darwinian”? Many biologists claimed that a “non-Darwinian” theory of evolution was simply a contradiction in terms. On the other hand, some molecular biologists accepted without question that many changes at the molecular level from one generation to the next were neutral. Indeed, when King and Jukes’ paper was first submitted, it was rejected on the grounds that one reviewer claimed it was obviously false, and the other claimed that it was obviously true (Jukes, 1991).
    [Show full text]
  • Molecular Evolution? Molecular Evolution • Evolution at the Molecular Level
    BIOL2007 What is molecular evolution? Molecular Evolution • Evolution at the molecular level Kanchon Dasmahapatra [email protected] Modes of molecular evolution Modes of molecular evolution Gene duplication INDELS: insertions and deletions Slippage in tandem repeats 1 Modes of molecular evolution Substitutions GCG ACG GGG GAG • Single base pair changes, substitutions or point mutations GCG ACA GGG GA G • Insertions or deletions, also known as indels 64 triplet codons coding for 20 amino acids • Gene duplications - formation of multigene GT T CG T TGG Tryptophan families and pseudogenes Histidine GT C CG C Proline Cysteine • Slippage – microsatellite length changes GT A TGC GT G • Chromosomal mutations Twofold degenerate Fourfold degenerate NON- SYNONYMOUS SYNONYMOUS SUBSTITUTION SUBSTITUTION (silent substitution) Classical vs. Balance schools Who is right? • Classical school • Data in the form of allozymes showed that lots of – polymorphisms are rare polymorphisms are present. – because selection gets rid of less fit alleles • But .... causes the problem of genetic load • Balance school 30,000 to 50,000 genes in humans – polymorphisms are common If only 1000 are homozygous – because of balancing selection If selective coefficient = 0.01 Fitness per locus = 0.99 Summed over 1000 loci, fitness = (0.99) 1000 = 0.00004 2 The neutral theory Neutralists vs. selectionists • Proposed by Kimura (1968) and King & Jukes Neutralists Selectionists (1969) • Majority of mutations that spread through a Deleterious population have no effect on fitness Neutral Advantageous • Therefore, genetic drift NOT natural selection drives molecular evolution • Mutations fixed by • Mutations fixed by genetic drift selection Kimura’s calculations Predictions from neutral theory µ = mutation rate per gene per generation • Molecular clock N = population size (effective) • rate of substitution ∝ 1 No.
    [Show full text]
  • Neutral Mutation and Selective Constraint Play Major Role in Dictating the Codon Bias in Breast Cancer Risk Genes
    International Journal of Computational Biology ISSN: 2229-6700 & E-ISSN:2229-6719, Volume 5, Issue 1, 2014, pp.-061-067. Available online at http://www.bioinfopublication.org/jouarchive.php?opt=&jouid=BPJ0000220 NEUTRAL MUTATION AND SELECTIVE CONSTRAINT PLAY MAJOR ROLE IN DICTATING THE CODON BIAS IN BREAST CANCER RISK GENES MAZUMDER T.H. AND CHAKRABORTY S.* Department of Biotechnology, Assam University, Silchar- 788 011, Assam, India. *Corresponding Author: Email- [email protected] Received: September 24, 2014; Accepted: October 30, 2014 Abstract- Breast cancer is ranked the first cancer in women worldwide and the second leading cause of death after cervical cancer particular- ly in developing countries. Specific genes are associated with breast cancer risk in human genome. Understanding the codon usage bias (CUB) with compositional dynamics of coding sequence is of great importance in gaining clues to predict the level of gene expression and genome characterization. In this study, we have analyzed the complete nucleotide coding sequences of fifteen breast cancer risk genes with the help of various genetic indices. Our analysis revealed that both neutral mutation and selective constraint play major roles in the codon usage patterns of breast cancer risk genes. Our results further show that gene expression level is linked with alterations in the nucleotide skewness. Breast cancer gene products showed the dominance of three amino acids namely serine, leucine and glutamate in their composi- tion but the least usage of two amino acids namely tryptophan and methionine. Moreover, the level of breast cancer gene expression meas- ured by RCBS revealed a significant negative correlation with highly used amino acids.
    [Show full text]