Infinite Allele Model with Varying Mutation Rate (Protein Polymorphism/Heterozygosity/Genetic Distance) MASATOSHI Nei, RANAJIT CHAKRABORTY, and PAUL A

Total Page:16

File Type:pdf, Size:1020Kb

Infinite Allele Model with Varying Mutation Rate (Protein Polymorphism/Heterozygosity/Genetic Distance) MASATOSHI Nei, RANAJIT CHAKRABORTY, and PAUL A Proc. Natl. Acad. Sci. USA Vol. 73, No. 11, pp.4164-4168, November 1976 Genetics Infinite allele model with varying mutation rate (protein polymorphism/heterozygosity/genetic distance) MASATOSHI NEi, RANAJIT CHAKRABORTY, AND PAUL A. FUERST Center for Demographic and Population Genetics, University of Texas at Houston, Houston, Tex. 77030 Communicated by Motoo Kimura, September 7, 1976 ABSTRACT Available data suggest that the variation in tone IV which has an unusually low rate of amino acid substi- mutation rate among protein loci follows the gamma distribu- tion. Thus, taking. into account this variation, formulae are de- tution. The mean and standard deviations of this distribution veloped for the distribution of allele frequencies, mean and are 2.47 X 10-7 and 2.51 X 10- per year, respectively. We variance of heter'ozygosity, expected number of alleles, pro- fitted the following gamma distribution to these data: portion of polymorphic loci, and genetic distance. These for- mulae should be more appropriate for the analysis of gene fre- quency data for protein loci than equivalent formulae with f(z) r(a) e Zi, [1] constant mutation rate. where a = i2/V5 and , = z/V,, in which z and V. are the In the last 10 years statistical methods based on the so-called mean and variance of the variate in question (z), respectively. infinite allele model (1, 2) have been used extensively to study The maximum likelihood estimates of a and ,3 obtained are 0.95 the mechanism of maintenance of protein polymorphism. and 3.9 X 106, respectively. It is clear that the gamma distri- Because each allele in a population behaves in a random fashion bution fits the data reasonably well (Fig. 1), though the number comparable to a molecule in a mass of gases, these methods are of polypeptides used is very small. applied to a collection of alleles from a large number of loci. Another way to get a rough idea about the distribution of They are, however,' based on two unrealistic assumptions when mutation rate is to examine the distribution of -molecular applied to the current gene frequency data, most of which have weights of protein subunits, by assuming that the mutation rate been obtained by electrophoresis. First, in this model all new at a locus is proportional to the molecular weight of the poly- mutations are assumed to be novel, whereas at the level of peptide produced. This assumption seems to be only roughly electrophoresis some backward mutations may occur. Recently, correct. Data on amino acid substitutions in polypeptides in Fig. Ohta and Kimura (3) introduced a new mutation model 1 indicate that there is a significant correlation between the (stepwise mutation model), which is presumably appropriate substitution rate per polypeptide and molecular weight but the to electrophoretic data. Second, in the application of the classical correlation coefficient is only 0.53. At any rate, we have ex- infinite allele model it is assumed that the mutation rate is the amined the distribution of molecular weights of 119 protein same for all loci. This assumption is certainly incorrect, and an subunits in mammalian species, by using the data compiled by enormous variation of the rate of amino acid substitution among Darnall and Klotz (7). The distribution obtained is given in Fig. different proteins suggests that the rate of mutations that can 2. The mean and standard deviation of this distribution are be incorporated into the population varies considerably with 45,102 and 24,531, respectively. The mean molecular weight gene loci. The purpose of this paper is to develop a theory which is much higher than that for the polypeptides in Fig. 1 (ca is applicable to a collection of alleles from different loci. 15,000) but close to that of proteins which are often used in electrophoresis (8). The shape of the distribution of molecular Distribution of mutation rate weights is somewhat different from that of Fig. 1, but the To incorporate the variation of mutation rate, we need some gamma distribution again fits the data surprisingly well. In this idea about the distribution of mutation rate among loci. In case the maximum likelihood estimates of a and 13 are 3.7 and practice, virtually nothing is known about this distribution. A 8.14 X 10-5, respectively. Clearly, the coefficient of variation priori, one might assume that it is normally distributed. In the for Fig. 2 is much smaller than that for Fig. 1. This is probably case of mutation rate, however, the normal distribution is not due to the fact that mutation rate is not strictly proportional to very suitable, because mutation rate never becomes smaller molecular weight and is affected by a number of other factors. than 0. With this restriction, the alternative candidate is the Therefore, it is likely that the a value for actual mutation rate gamma distribution. In fact, as will be discussed below, there is closer to the value for the rate of amino acid substitution is some evidence for this. rather than that for molecular weights. Although the mutation rates for most protein loci are not In practice, what we really need is not the distribution of known at present, they can be'estimated under certain as- absolute mutation rate but that of M = 4Nv, where N is the sumptions. 'First, if we assume that a majority of gene substi- effective size of a population and v is the mutation rate per tutions in evolution are due to random fixation of selectively generation. We note then that v can be obtained by multiplying neutral genes, the mutation rate to such alleles may be estimated the mutation rate per year (vp) by generation time (g), if the from the rate of amino acid substitution in proteins (4). This mutation rate per year rather than per generation is constant, assumption may be incorrect, but is sufficient for testing the as seems to be the case with protein loci (4). Clearly, the coef- neutral mutation hypothesis. Dayhoff (5) has given the rates ficient of variation of M is identical to that'of the mutation rate of amino acid substitution per residue per year for 20 different per year, and if vy follows the gamma distribution, M also fol- kinds of polypeptides. The mutation rate per polypeptide lows the gamma. In this case, the parameter a is the same for (locus) can therefore be estimated by multiplying this rate by both M and vy,, since a is the reciprocal of the squared coeffi- the number of amino acids in each polypeptide (6). Fig. 1 gives cient of variation (z2/Vj). On the other hand, ,3 is given by the distribution of mutation rate thus obtained, excluding his- i/Vz, so that this value for the distribution of M is 4Ng times 4164 Downloaded by guest on September 30, 2021 Genetics: Nei et al. Proc. Natl. Acad. Sci. USA 73 (1976) 4165 10- 15 = 5. 30 LAt 0 5- 0 5x 10-7 1-6 RATE OF AMINO ACID SUBSTITUTION FIG. 1. Distribution of the rate of amino acid substitutions per polypeptide per year. The total number of polypeptides used is 19. The gamma distribution fits the data very well (x2(1) = 0.23; P > 0.60). 01i 0 50 100 150 smaller than that for vy. Namely, unlike a, ,B depends on pop- MOLECULAR WEIGHT IN THOUSANDS ulation size and generation time. FIG. 2. Distribution of molecular weights of protein subunits in Let M and VM be the mean and variance of M among loci mammalian species. The total number of proteins used is 119. The for a particular population, which is at steady state. We assume gamma distribution fits the data very well (x2(9) = 6.43; P > 0.65). that M follows a gama distribution. Then, M may be estimated from the average heterozygosity for randomly chosen loci, as tion of constancy of mutation rate when rate will be seen later. Furthermore, if we know the value of a, the this actually varies. variance of M is given by VM = Al2/a, and ,3 = a/M. Our Thus, we first computed the average heterozygosity (H) with given values of and VM, using formula study on the distributions of the rate of amino acid substitution [4] below, and then estimated M = - and molecular weight suggests that a is about 1 to 2. the value by Mc H/(1 H). With this Mc value, we computed the distribution [2] and compared it with Distribution of allele frequencies distribution [3], in which the M value was used. In this com- putation we considered two values MA, i.e., = 0.1 Consider a randomly mating population of effective size N, and of M and M = 1.0. In both cases a 1 was average assume that mutation and random genetic drift are balanced. Am2/VM - assumed. The M = = are Let 4tM (X) be the distribution of allele for a locus heterozygosities for 0.1 and M 1.0 0.084 and 0.404, frequencies respectively (Table 1). with a particular value of M = 4Nv, such that 4M(x)dx repre- sents the expected number of alleles whose frequency is in the The results obtained are given in Fig. 3. The distributioJ3] is given by solid lines, [2] = range from x to x + dx. Wright (1) and Kimura and Crow (2) whereas by broken lines. When M have shown that 0.1, the difference between [3] and [2] is so small, that the two distributions are practically indistinguishable. When M is 1.0, however, there is considerable difference between them. In this ' -M(X)= M(1 -X)M-' X-.
Recommended publications
  • What Is a Recessive Allele?
    What Is a Recessive Allele? WernerG. Heim ONE of the commonlymisunderstood and misin- are termedthe dominant[dominirende], and those terpreted concepts in elementary genetics is that which becomelatent in the processrecessive [reces- sive]. The expression"recessive" has been chosen of dominance and recessiveness of alleles. Many becausethe charactersthereby designated withdraw students in introductory courses perceive the idea or entirelydisappear in the hybrids,but nevertheless that the dominant form of a gene is somehow stron- reappear unchanged in their progeny ... (Mendel ger than the recessive form and, when they are 1950,p. 8) together in a heterozygote, the dominant allele sup- Two important concepts are presented here: (1) presses the action of the recessive one. This belief is Statements about dominance and recessiveness are Downloaded from http://online.ucpress.edu/abt/article-pdf/53/2/94/44793/4449229.pdf by guest on 27 September 2021 not only incorrectbut it can lead to a whole series of statements about the appearanceof characters,about further errors. Students, for example, often errone- what is seen or can be detected, not about the ously conclude that because the dominant allele is the underlying genetic situation; (2) The relationship stronger, it therefore ought to become more common between two alleles of a gene falls on a continuous in the course of evolution. scale from one of complete dominance and recessive- There are other common misconceptions, among ness to a complete lack thereof. In the latter case, the them that: expression of both alleles is seen in either of two ways 1) Dominance operates at the genotypic level.
    [Show full text]
  • Basic Genetic Terms for Teachers
    Student Name: Date: Class Period: Page | 1 Basic Genetic Terms Use the available reference resources to complete the table below. After finding out the definition of each word, rewrite the definition using your own words (middle column), and provide an example of how you may use the word (right column). Genetic Terms Definition in your own words An example Allele Different forms of a gene, which produce Different alleles produce different hair colors—brown, variations in a genetically inherited trait. blond, red, black, etc. Genes Genes are parts of DNA and carry hereditary Genes contain blue‐print for each individual for her or information passed from parents to children. his specific traits. Dominant version (allele) of a gene shows its Dominant When a child inherits dominant brown‐hair gene form specific trait even if only one parent passed (allele) from dad, the child will have brown hair. the gene to the child. When a child inherits recessive blue‐eye gene form Recessive Recessive gene shows its specific trait when (allele) from both mom and dad, the child will have blue both parents pass the gene to the child. eyes. Homozygous Two of the same form of a gene—one from Inheriting the same blue eye gene form from both mom and the other from dad. parents result in a homozygous gene. Heterozygous Two different forms of a gene—one from Inheriting different eye color gene forms from mom mom and the other from dad are different. and dad result in a heterozygous gene. Genotype Internal heredity information that contain Blue eye and brown eye have different genotypes—one genetic code.
    [Show full text]
  • Basic Genetic Concepts & Terms
    Basic Genetic Concepts & Terms 1 Genetics: what is it? t• Wha is genetics? – “Genetics is the study of heredity, the process in which a parent passes certain genes onto their children.” (http://www.nlm.nih.gov/medlineplus/ency/article/002048. htm) t• Wha does that mean? – Children inherit their biological parents’ genes that express specific traits, such as some physical characteristics, natural talents, and genetic disorders. 2 Word Match Activity Match the genetic terms to their corresponding parts of the illustration. • base pair • cell • chromosome • DNA (Deoxyribonucleic Acid) • double helix* • genes • nucleus Illustration Source: Talking Glossary of Genetic Terms http://www.genome.gov/ glossary/ 3 Word Match Activity • base pair • cell • chromosome • DNA (Deoxyribonucleic Acid) • double helix* • genes • nucleus Illustration Source: Talking Glossary of Genetic Terms http://www.genome.gov/ glossary/ 4 Genetic Concepts • H describes how some traits are passed from parents to their children. • The traits are expressed by g , which are small sections of DNA that are coded for specific traits. • Genes are found on ch . • Humans have two sets of (hint: a number) chromosomes—one set from each parent. 5 Genetic Concepts • Heredity describes how some traits are passed from parents to their children. • The traits are expressed by genes, which are small sections of DNA that are coded for specific traits. • Genes are found on chromosomes. • Humans have two sets of 23 chromosomes— one set from each parent. 6 Genetic Terms Use library resources to define the following words and write their definitions using your own words. – allele: – genes: – dominant : – recessive: – homozygous: – heterozygous: – genotype: – phenotype: – Mendelian Inheritance: 7 Mendelian Inheritance • The inherited traits are determined by genes that are passed from parents to children.
    [Show full text]
  • Glossary/Index
    Glossary 03/08/2004 9:58 AM Page 119 GLOSSARY/INDEX The numbers after each term represent the chapter in which it first appears. additive 2 allele 2 When an allele’s contribution to the variation in a One of two or more alternative forms of a gene; a single phenotype is separately measurable; the independent allele for each gene is inherited separately from each effects of alleles “add up.” Antonym of nonadditive. parent. ADHD/ADD 6 Alzheimer’s disease 5 Attention Deficit Hyperactivity Disorder/Attention A medical disorder causing the loss of memory, rea- Deficit Disorder. Neurobehavioral disorders character- soning, and language abilities. Protein residues called ized by an attention span or ability to concentrate that is plaques and tangles build up and interfere with brain less than expected for a person's age. With ADHD, there function. This disorder usually first appears in persons also is age-inappropriate hyperactivity, impulsive over age sixty-five. Compare to early-onset Alzheimer’s. behavior or lack of inhibition. There are several types of ADHD: a predominantly inattentive subtype, a predomi- amino acids 2 nantly hyperactive-impulsive subtype, and a combined Molecules that are combined to form proteins. subtype. The condition can be cognitive alone or both The sequence of amino acids in a protein, and hence pro- cognitive and behavioral. tein function, is determined by the genetic code. adoption study 4 amnesia 5 A type of research focused on families that include one Loss of memory, temporary or permanent, that can result or more children raised by persons other than their from brain injury, illness, or trauma.
    [Show full text]
  • Evolution at Multiple Loci
    Evolution at multiple loci • Linkage • Sex • Quantitative genetics Linkage • Linkage can be physical or statistical, we focus on physical - easier to understand • Because of recombination, Mendel develops law of independent assortment • But loci do not always assort independently, suppose they are close together on the same chromosome Haplotype - multilocus genotype • Contraction of ‘haploid-genotype’ – The genotype of a chromosome (gamete) • E.g. with two genes A and B with alleles A and a, and B and b • Possible haplotypes – AB; Ab; aB, ab • Will selection at the A locus affect evolution of the B locus? Chromosome (haplotype) frequency v. allele frequency • Example, suppose two populations have: – A allele frequency = 0.6, a allele frequency 0.4 – B allele frequency = 0.8, b allele frequency 0.2 • Are those populations identical? • Not always! Linkage (dis)equilibrium • Loci are in equilibrium if: – Proportion of B alleles found with A alleles is the same as b alleles found with A alleles; and • Loci in linkage disequilibrium if an allele at one locus is more likely to be found with a particular allele at another locus – E.g., B alleles more likely with A alleles than b alleles are with A alleles Equilibrium - alleles A locus, A allele p = 15/25 = 0.6 a allele q = 1-p = 0.4 B locus, B allele p = 20/25 = 0.8; b allele q = 1-p = 0.2 Equilibrium - haplotypes Allele B with allele A = 12; A without B = 3 times; AB 12/15 = 0.8 Allele B with allele a = 8; a without B = 2 times; aB 8/10 = 0.8 Equilibrium graphically Disequilibrium - alleles
    [Show full text]
  • BIOL 116 General Biology II Common Course Outline
    South Central College BIOL 116 General Biology II Common Course Outline Course Information Description This course covers biology at the organismal. population and system level. It will emphasize organismal diversity, population and community ecology and ecosystems. Students will gain an understanding of how evolutionary advances have occurred among organisms within a kingdom due to natural selection. This course involves a weekly three hour lab. (prerequisites: Score of 86 or above on the Sentence Skills portion of the Accuplacer test or ENGL 0090 and score of 50 or above on College Level Math portion of the Accuplacer test or MATH 0085) MNTC area 3 Total Credits 4.00 Total Hours 80.00 Types of Instruction Instruction Type Credits Lecture Lab Pre/Corequisites Prerequisite Score of 50 or above on the College Level Math portion of the Accuplacer test or MATH 0085 Prerequisite Score of 86 or above on the Sentence Skills portion of the Accuplacer test or ENGL 0090 Course Competencies 1 Appreciate and explain the process of scientific discovery and methodology Learning Objectives List and describe the steps of the scientific method Demonstrate the process of scientific discovery in the lab 2 Develop the skills necessary to engage in the scientific method Learning Objectives Formulate a hypothesis based on observations Develop a method to test a hypothesis Collect and analyze data Interpret data and form a conclusion Communicate scientific findings Common Course Outline - Page 1 of 4 Monday, September 24, 2012 3:42 PM 3 Explain the theory of
    [Show full text]
  • Evolutionary Forces: Generation X Simulation to Launch the Genx
    Biol 303 1 Lecture Notes Evolutionary Forces: Generation X Simulation To launch the GenX software: 1. Right-click ‘My Computer’. 2. Click ‘Map Network Drive’ 3. Don’t worry about what drive letter is assigned in the upper part of the dialogue box that pops up. In the lower part, enter: \\hopper\labshare. You can probably get this without typing by clicking the little triangle on the right of the field. 4. Using Windows Explorer, Click on the Labshare folder, then on Biology 303 folder, then on the GenX icon to start. A. Recall from lecture: Evolution is a change over time in allele (gene) frequencies within a population. There are 4 main evolutionary forces: Mutation - new allele arises by physical change in structure of DNA Genetic drift - random change in allele frequency by chance, important mainly in small populations (remember that variance ↑ as sample size ↓) Isolation - two populations that exchange members (even just 1 disperser per generation) do not diverge genetically. Isolated populations can diverge due to drift or natural selection Natural selection - differential survival and reproduction of individuals with different phenotypes Also recall that a locus is fixed for a single allele if any allele reaches a frequency of one. When a population holds only one allele, evolution cannot occur (until an alternative allele arrives by mutation or immigration). B. Background for Generation X simulations: Generation X is a computer model of evolution. It allows you to alter the four evolutionary forces, either one at a time or in combination, and see the effect on genotype and allele frequencies.
    [Show full text]
  • A Glossary of Terms for Restoration Genetics
    Paul R. Salon Allele – The specific composition of DNA at each gene is known as an allele. Multiple alleles of a gene maybe A Glossary of Terms for Restoration Genetics USDA-NRCS Syracuse, NY generated by mutations which are structural or chemical changes in DNA at a specific location on a chromosome (locus), this generates genetic variation. Genetic shift – A change in the germplasm balance of a cross pollinated variety, usually caused by Biodiversity - The total variability within and among species of living organisms and the ecological complexes environmental selection pressures, or nursery practices and selection. that they inhabit. Biodiversity has three levels - ecosystem, species, and genetic diversity reflected in the Genetic vulnerability - Having a narrow range of genetic diversity and reacting uniformly to diverse external number of different species, the different combination of species, and the different combinations of genes within conditions. (Applied to breeding populations of varieties or species). each species. Genotype - The genetic constitution of an individual or group of plants. It is the set of alleles it possesses at a Biotype - A group of individuals within a population occurring in nature, all with essentially the same genetic certain locus or over particular or all loci. constitution. A species usually consists of many biotypes. See also “ecotype”. Germplasm – Genetic material that determines the morphological and physiological characteristics of a species. Chromosomes - Are thread like DNA and protein-based structures in cells whose function is the orderly duplication and distribution of genes during cell division. Heterozygote – If alleles at a locus are different. Cultivar - The international term cultivar denotes an assemblage of cultivated plants that is clearly distinguished Homozygote – If alleles at a locus are the same, the locus is homozygous and the organism is a homozygote for that by any characters (morphological, physiological, cytological, chemical, or others) and when reproduced (sexually gene or trait.
    [Show full text]
  • Module 2: Genetics
    Genetics Module B, Anchor 3 Key Concepts: - An individual’s characteristics are determines by factors that are passed from one parental generation to the next. - During gamete formation, the alleles for each gene segregate from each other so that each gamete carries only one allele for each gene. - Punnett squares use mathematical probability to help predict the genotype and phenotype combinations in genetic crosses. - The principle of independent assortment states that genes for different traits can segregate independently during the formation of gametes. - Mendel’s principles of heredity, observed through patterns of inheritance, form the basis of modern genetics. - Some alleles are neither dominant nor recessive. Many genes exist in several different forms and are therefore said to have multiple alleles. Many traits are produced by the interaction of several genes. - Environmental conditions can affect gene expression and influence genetically determined traits. - The DNA that makes up genes must be capable of storing, copying, and transmitting the genetic information in a cell. - DNA is a nucleic acid made up of nucleotides joined into long strands or chains by covalent bonds. - DNA polymerase is an enzyme that joins individual nucleotides to produce a new strand of DNA. - Replication in most prokaryotic cells starts from a single point and proceeds in both directions until the entire chromosome is copied. - In eukaryotic cells, replication may begin at dozens or even hundreds of places on the DNA molecule, proceeding in both directions until each chromosome is completely copied. - The main differences between DNA and RNA are that (1) the sugar in RNA is ribose instead of deoxyribose; (2) RNA is generally single-stranded, not double-stranded; and (3) RNA contains uracil in place of thymine.
    [Show full text]
  • Glossary in Evolutionary Biology Compiled by Prof
    Glossary evolutionary biology. Page 1 Glossary in Evolutionary Biology Compiled by Prof. Dieter Ebert This list contains terms, which a student in evolutionary biology should know. The terms denoted with an * are for an advanced level (Courses in evolutionary and quantitative genetics). This Glossary has been compiled with the help of the following books: • J.R. Krebs & N.B. Davies; An Introduction to Behavioural Ecology. 3. Ed., Blackwell UK. 1993. • S.C. Stearns & R.F. Hoeckstra; Evolution: An Introduction. Oxford University Press. 2005. • D.A. Roff; The Evolution of life histories. Chapman & Hall. 1992. ____________________________________________________________________ Adaptation: A state that evolved because it improved reproductive performance, to which survival contributes. Also the process that produces that state. Adaptive evolution: The process of change in a population driven by variation in reproductive success that is correlated with heritable variation in a trait. *Additive genetic variance: The part of total genetic variance that can be modelled by allelic effects whose influence on the phenotype in heterozygotes is additive (Additive means that the phenotype of the heterozygote is halfway between the phenotype of the two homozygotes). This part of genetic variance determines the response to selection by quantitative traits. Aging (=Ageing): (See Senescence). Allele: One of the different homologous forms of a single gene; at the molecular level, a different DNA sequence at the same place in the chromosome. Allele frequency: Proportion the copies of a given allele among all alleles at the locus of interest. Allometry: Relationship between the size of two organisms or their parts. E.g. larger organisms produce larger offspring.
    [Show full text]
  • "TOP/BOT" Strand and "A/B" Allele
    Illumina® SNP Genotyping TECHNICAL NOTE “TOP/BOT” Strand and “A/B” Allele A guide to Illumina’s method for determining Strand and Allele for the GoldenGate® and Infinium™ Assays. INTRODUCTION consistently designate the same SNP orientation and To address DNA strand designation and orientation for allele calls even if public SNP databases and genome both human and non-human species, Illumina has devel- assemblies change. This will enable researchers world- oped a consistent and simple method to ensure uniformi- wide to easily correlate the genotype calls made today to ty in the reporting of genotype calls. research that may have been completed several years ago. The method that Illumina has developed uses the top Researchers can be confident that the same genotype (TOP) and bottom (BOT) designations based on the poly- calls are being made across time. morphism itself, or the contextual surrounding Additionally, although the human genome has been sequence. This document provides a description of the annotated and extensively SNP genotyped, the study of TOP/BOT method, as well as the generalized nomencla- non-human species is growing rapidly. Much of this work ture of “Allele A” and “Allele B” within the Illumina geno- is being done on species for which SNP databases are in typing system. Beginning in mid-2005, dbSNP also their initial stages or do not yet exist. Many researchers adopted this TOP/BOT nomenclature and has included that study both humans and non-human species also rely this designation for all SNP entries. on proprietary SNP sequences that may not yet have been incorporated into public databases or for which assembly BACKGROUND and orientation have not been established.
    [Show full text]
  • Genetics, DNA, and Heredity
    Genetics, DNA, and Heredity The Basics What is DNA? It's a history book - a narrative of the journey of our species through time. It's a shop manual, with an incredibly detailed blueprint for building every human cell. And it's a transformative textbook of medicine, with insights that will give health care providers immense new powers to treat, prevent and cure disease." - Francis Collins What Does DNA Look Like? A T G C Every cell in our body has the same DNA…. Eye cell Karyotype Lung cell Toe cell How much DNA is in one cell? Genome = 46 chromosomes Genome = approx. 3 billion base pairs One base pair is 0.00000000034 meters DNA sequence in any two people is 99.9% identical – only 0.1% is unique! What makes one cell different from another? DNA = “the life instructions of the cell” Gene = segment of DNA that tells the cell how to make a certain protein. Allele = one of two or more different versions of a gene Sequence for normal adult hemoglobin: Sequence for mutant hemoglobin: Wild-type Hemoglobin Protein Mutant Protein Normal Red Blood Cell Abnormal Red Blood Cell The Human Genome Project Goals • To sequence (i.e. determine the exact order of nucleotides (A,T,G,C) for ALL of the DNA in a human cell • To determine which sections of DNA represent individual genes (protein-coding units). The HGP: International effort to decipher the blueprint of a human being. How It Was Done Samples sent to Human Genome DNA samples collected from Project centers across the world thousands of volunteers Scientists at centers perform DNA sequencing and analysis • February 2001: Draft of the sequence published in Nature (public effort )and Science (Celera – private company).
    [Show full text]