The Emergence of New DNA Repeats and the Divergence of Primates (Genetic Noveltles/Pseudogenes/Irreversible Events/Evolutionary Rates/Phylogenies) PHILLIP P
Total Page:16
File Type:pdf, Size:1020Kb
Proc. Natl. Acad. Sci. USA Vol. 90, pp. 1872-1876, March 1993 Evolution The emergence of new DNA repeats and the divergence of primates (genetic noveltles/pseudogenes/irreversible events/evolutionary rates/phylogenies) PHILLIP P. MINGHETTI*t AND ACHILLES DUGAICZYK*t *Department of Biochemistry, University of California, Riverside, CA 92521; and tDepartment of Physiology-Human Genome Mapping Center, University of California School of Medicine, San Francisco, CA 94143 Communicated by Frank W. Putnam, November 30, 1992 (receivedfor review September 29, 1992) ABSTRACT We have identified four genetic novelties that genome (5-7). They have been known for some time, but they are fixed in specific primate lineages and hence can serve as remain somewhat enigmatic, and their usefulness as evolu- phylogenetic time markers. One Alu DNA repeat is present in tionary markers has not been fully appreciated. Although the human lineage but is absent from the great apes. Another early studies on the a- and f-globin gene regions revealed no Alu DNA repeat is present in the gorilla lineage but is absent differences in the location of 14 specific Alu elements be- from the human, chimpanzee, and orangutan. A progenitor tween human, chimpanzee, and gorilla (8, 9), later studies did Xbal element is present in the human, chimpanzee, gorilla, and reveal differences in the location of specific DNA repeats orangutan, but only in the human lineage did it give rise to a between various primate lineages. For instance, in the work transposed progeny, Xba2. The saltatory appearance of Xba2 by Fitch et al. (10), involving sequence determination in the is an example of a one-time event in the evolutionary history of ,B-globin gene region, a truncated Li sequence and two Alu a species. The enolase pseudogene, known to be present as a elements were found to specifically distinguish the spider single copy in the human, was found to be present in four other monkey from the human and the apes. Dugaiczyk and primates, including the baboon, an Old World monkey. Using coworkers (11, 12) found an Xba and an Alu DNA repeat in the accepted value of 5 x 10-9 nucleotide substitutions per site the human a-fetoprotein (AFP) gene that were absent from per year as the evolutionary rate for pseudogenes, we calcu- the gorilla AFP gene. Additional human-specific Alu se- lated that the enolase pseudogene arose 14 million years ago. quences have also been found, and their species-specific The calculated age for this pseudogene and its presence in the presence or absence was demonstrated by DNA hybridiza- baboon are incongruent with each other, since Old World tion and PCR (13). In addition, a gorilla-specific Alu element monkeys are considered to have in the 3,S-globin region was absent from the human, chim- diverged from the hominid panzee, and macaque (14). Ilneage some 30 milion years ago. Thus the rate ofevolution in The process of new DNA repeats arising in the genomes the enolase pseudogene is only about 2.5 x 10-9 substitutions continues today, as has been demonstrated by a contempo- per site per year, or half the rate in other pseudogenes. It is rary de novo insertion of an Alu repeat into the neurofibro- concluded that rates of substitution vary between species, even matosis gene (15) and a de novo insertion of an Li sequence for similar DNA elements such as pseudogenes. We submit that into the factor VIII gene (16) in man. We have extended the new DNA repeats arise in the genomes ofspecies in irreversible work on the human-specific AFP Alu repeat to include and punctuated events and hence can be used as molecular time additional species-specific DNA elements and to screen small markers to decipher phylogenies. populations of higher primates for these DNA sequences. The study was undertaken in an effort to establish a corre- If it is true that evolution is punctuated with rapid changes lation between the emergence of new DNA repeats and the and new species arise in saltatory events rather than by divergence of primate species. gradual accumulation ofpoint mutations (1), then it should be possible to identify at the molecular level the genetic events MATERIALS AND METHODS that underlie these evolutionary punctuations. For example, DNA. Human DNA samples from 16 individuals were genomic rearrangements could change the timing of gene provided by Richard Gatti (Department of Pathology, Uni- expression and thus profoundly affect development of the versity of California Medical School, Los Angeles). DNA host; the new phenotype could sufficiently differ to start a samples from individual primate animals were provided by new speciation process. Similarly, DNA rearrangements Oliver Ryder (Research Department, San Diego Zoo) and by even in nonfunctional regions could disrupt meiotic chromo- Jerry Slightom (Division of Molecular Biology, Upjohn, some pairing, leading to reproductive isolation of a previ- Kalamazoo, MI). ously interbreeding population. The specific genetic changes PCR. Cycling conditions used in PCR experiments are that punctuate the evolutionary process have yet to be described in figure legends. identified, but perhaps the closest examples of such saltatory Cloning of PCR Products. DNA amplified by PCR was events in our genomes is the spreading of repetitive DNA cloned into the Hph I site of the cloning vector pCR1000 elements. It therefore seems worthwhile to analyze these (Invitrogen) according to the manufacturer's protocol. Re- events in some detail, because they can provide us with combinant clones were selected on kanamycin/5-bromo-4- genetic markers timing the evolutionary process. chloro-3-indolyl f-D-galactopyranoside plates and grown in The most prominent among the various repetitive DNA liquid culture, and plasmid DNA was isolated. elements are members ofthe Alu family. They are considered Sequencing of Recombinant Plasmid DNA. The Hph I to be pseudogenes that are ancestrally related to 7SL RNA (2, cloning site in pCR1000 is flanked by EcoRI and HindIII 3) or 4.5S RNA (4), and it appears they arose from an as yet restriction sites. These two sites were therefore used to label unidentified founder gene. They are specific to primate the DNA with 32p, and the labeled fragments were sequenced species, where they are found at an estimated 106 copies per by the chemical degradation method (17). The publication costs ofthis article were defrayed in part by page charge Abbreviations: AFP, a-fetoprotein; Myr, million years. payment. This article must therefore be hereby marked "advertisement" tPresent address: Department of Medicine, University of Alabama, in accordance with 18 U.S.C. §1734 solely to indicate this fact. Birmingham, AL 35294. 1872 Downloaded by guest on October 3, 2021 Evolution: Minghetti and Dugaiczyk Proc. Natl. Acad. Sci. USA 90 (1993) 1873 Direct Sequencing of PCR-Amplified DNA. DNA amplified A in the was labeled with 32P as op PCR concomitantly follows. 567 One of the primers was first end-labeled by using T4 poly- 540 504 nucleotide kinase and ['t-32P]ATP prior to addition to the PCR 456 mixture. The end-labeled PCR product was purified from 434 unincorporated primers by polyacrylamide gel electrophore- sis, eluted from the gel, and sequenced (17). RESULTS 267 Human-SpecificAlu Repeat in the AFP Gene. A specific Alu 234 21 3 repeat is present in the human AFP gene but absent from the 192 same (orthologous) position in the gorilla (11, 12). To extend 1684 these studies to include other primates, and to see whether the presence of this Alu repeat is fixed or polymorphic in the B - Alti Boundaries; 304 hbp-- -- human population, genomic DNA from human, chimpanzee, Term Repeat Terrim Repeat gorilla, and orangutan was amplified by PCR. The primers GGATGTTGTGGGGCCGGGCGCGGT ......lAltil..... .. AAAAAAAAAAAAAGGATGTTGTGG (D-534 and D-533; see legend to Fig. 1) for this reaction were from intron 4 of the human AFP gene sequence (11). This set of primers should amplify a 530-bp Eagment if the Alu Human: \' sequence is present, or 215 bp in the absence of this Alu AGATACACAGACATGGCAAAGGTTTACTGAACACTTA AAACATGTCTGCTTGCACAGAGGATCAGATTAACACT Human DNA an repeat. indeed yielded amplification product Chi mpanzee: Target Site of the expected size. On the other hand, amplified DNA -- GGATGTTGIGG --- products derived from genomic DNA from the chimpanzee, Goril la: gorilla, and orangutan were small (215 bp), indicative of the ------------------------------------- GGATGTTGTGG------------------------------------ absence of this Alu character in these primates (Fig. 1A). Orangutan: To verify that the amplified DNA products originateC from --------------------------- GGATGTTGTGG - orthologous sites in the four species, the DNA bands were excised from the gel and their sequences were determined. FIG. 1. PCR and sequence analysis of orthologous AFP loci in The sequences surrounding the Alu insertion site are shown primates. (A) Electrophoretic separation of PCR products. Ingredi- in Fig. 1B. Within the stretch of 85 nucleotides shown, the ents forthe PCR included (in 100 id) 20 mM Tris-HCI (pH 8.3), 50 mM are the for one mutation in the KCI, 2.5 mM MgCl2, 250 ,uM each dNTP, 0.2 ,LM each primer (D-534 sequences same, except and D-533), and 1 ug of genomic DNA. Amplification of DNA was orangutan. The 11-bp unoccupied target site is identical in performed in an Ericomp thermal cycler under the following condi- three primates, but in the human this target site was dupli- tions: 98°C for 2 min, pause at 80°C (2.5 units of Taq polymerase was cated and gave rise to the terminal repeats, conceivably added at this point), followed by 30 cycles of denaturation (96°C for during the event ofthe Alu transposition. The results indicate 30 sec), annealing (55°C for 30 sec), and elongation (62°C for 30 sec). that the primers specifically amplify orthologous sites in The last cycle was followed by afmal elongation step (65C for5 min), genomic DNA from the four primates.