Amplification of an Ancestral Mammalian LI Family of Long
Total Page:16
File Type:pdf, Size:1020Kb
Proc. Nati. A(-ad. Stci. USA Vol. 87, pp. 9481-9485, December 1990 Evolution Amplification of an ancestral mammalian LI family of long interspersed repeated DNA occurred just before the murine radiation (LINE sequence/murine evolution/transposons/taxonomy) ESTERINA PASCALE*, EULALIA VALLEt, AND ANTHONY V. FURANOt Section on Genomic Structure and Function, Laboratory of Biochemical Pharmacology, National Institute of Diabetes and Digestive and Kidney Diseases. National Institutes of Health, Building 8, Room 203, Bethesda, MD 20892 Communicated by Herbert Tabor, September 12, 1990 (re eiyedfor review July 2, 1990) ABSTRACT Each mammalian genus examined so far con- However, we report here that an ancestral rodent Li tains 50,000-100,000 members ofan Li (LINE 1) family oflong family, which we call Lx, was extensively amplified -=10 interspersed repeated DNA elements. Current knowledge on million years ago and that -60,000 copies of Lx are present the evolution of LI families presents a paradox because, in various modern murine genomes (Old World rats and although Li families have been in mammalian genomes since mice). The occurrence of this amplification identifies those before the mammalian radiation -80 million years ago, most murine genera that shared a common ancestry and, therefore, members of the Li families are only a few million years old. defines the murine node in the lineage of modern muroid Accordingly it has been suggested either that the extensive rodents. The possible mechanism and the implications of amplification that characterizes present-day Li families did not repeated amplification of Li elements are discussed. occur in the past or that old members were removed as new ones were generated. However, we show here that an ancestral MATERIALS AND METHODS rodent Li family was extensively amplified '10 million years ago and that the relics (""60,000 copies) of this amplification General. The preparation of clones, genomic DNA, hy- have persisted in modern murine genomes (Old World rats and bridization probes, agarose gels, nitrocellulose blots, and mice). This amplification occurredjust before the divergence of restriction enzyme digests was done using standard tech- modern murine genera from their common ancestor and niques (14). Hybridizations to blots were carried out at the identifies the murine node in the lineage of modern muroid temperatures indicated in the legend to Fig. 2 in a solution rodents. Our results suggest that repeated amplification of LI containing 0.5 mM Na2EDTA, 0.2 M sodium phosphate (pH elements is a feature of the evolution of mammalian genomes 6.8), 0.125% (wt/vol) SDS, denatured salmon sperm DNA at and that ancestral amplification events could provide a useful 50,4g/ml, 0.05% (wt/vol) Ficoll 400 (Pharmacia), and 0.05% tool for determining mammalian lineages. (wt/vol) polyvinylpyrrolidone. The polymerase chain reac- tion (PCR) (15) was performed by using a kit from Perkin- Elmer/Cetus and conditions suggested by the supplier. The All mammalian genomes studied to date contain 50,000- sequences of the primers used to amplify genomic Li se- 100,000 members of an Li (LINE 1) family of mobile DNA quences from the rat or mouse genomes were as follows: rat elements (1-3). Full-length elements are 6-7 kilobases (kb) LiRN C1, ATAGGATCCGCCCCACAGGTGGCCCAT, and contain two protein encoding regions [open reading and LiRN C2, ATAGGATCCAGTGGCTTAGTCCCTG- frame (ORF) I and ORF II, see Fig. 1A], which are highly GA; mouse L1MD C1, ATAGGATCCATACACTAGCAA- conserved among mammalian Li families and are presum- GATTTT, and L1MD C2, ATAGGATCCGTCAAGAGCTC- ably important for Li function (4-7). The putative protein CGGGGTA. The primers used to amplify Lx sequences from encoded by ORF II is homologous to proteins encoded by murine genomes were LX1B C1, ATAGGATCCCATCCA- transposable elements found in plants, fungi, as well as GAGACTACCTCACCT, and LX1 C2, ATAGGATCCTCT- insects and other invertebrates (8). All of them contain TCCTATGGGGTTGAAAAC. The first nine nucleotides of sequence motifs typical of DNA polymerases (8), and at least each primer are not complementary to either Li or Lx and one functions as a reverse transcriptase (9). Therefore Li contained an ATA followed by a BamHI site. This facilitated elements are apparently the mammalian counterpart of, or radiolabeling the ends of the amplified material by use of the were derived from, a genetic element that arose very early in Klenow polymerase after digestion with BamHI endonucle- evolution. ase. The present-day Li families evolved independently from DNA Melting Curves. An %=230-base-pair (bp) portion of Li an ancestral Li element that predated the mammalian radi- or Lx DNA beginning =='100 bp 3' of ORF II (see Fig. 1A) was ation 80-100 million years ago (3). However, in spite of their amplified from various genomes by PCR using oligonucleo- length oftime in the genome each present-day family is rather tides specific for each family. A- small portion of each DNA new-i.e., the DNA sequences of most of the randomly was radiolabeled at the 3' end with the Klenow polymerase. sampled members are >90% identical, indicating that these members were generated withinjust the last few million years Abbreviations: ORF, open reading frame; PCR, polymerase chain (7, 10-12). To account for this apparent paradox it has been reaction; tim melting temperature (of double-stranded nucleic acid). suggested that either extensive amplification of Li did not Li or LINE 1, family of long interspersed repeated DNA elements; occur in the past (1, 13) or that old members are removed as L1Rn, rat (Rattus norvlegic us) Li family; LlMd, mouse (Mus new ones are generated (7, 11, 12). domesticus) Li family; Lx, ancestral rodent Li family. *Present address: Cattedra di Chimica e Microscopia Clinica, Uni- versita degli Studi, L'Aquila, Italy. The publication costs of this article were defrayed in part by page charge TPresent address: Department of Functional Biology, Faculty of payment. This article must therefore be hereby marked "advertisement" Medicine, c/Julian Claveria S/N, 33071, Oviedo, Spain. in accordance with 18 U.S.C. §1734 solely to indicate this fact. TTo whom reprint requests should be addressed. 9481 Downloaded by guest on September 30, 2021 9482 Evolution: Pascale et al. Proc. Natl. Acad. Sci. USA 87 (1990) After denaturation by boiling for 10 min each DNA was Fig. 1A shows a diagram of an Lx element (Lxi) compared hybridized in 0.3 M sodium phosphate buffer, pH 6.8, at with a typical rat Li element. Lxi is 1.5 kb, and the first 770 either 550C or 650C until at least 10 times the Cot1/2 was bp of Lx is homologous to the last one-fifth of the ORF II attained. Cot is the product of the initial concentration of region of the rat or mouse (Mus domesticus) Li elements DNA (in mol of nucleotide per liter) and the time of hybrid- (LiRn and LlMd, respectively; see Fig. 1B). An ancestral ization (in sec). The Cot1/2 is that value by which 50% of the relationship between Lx and both the LiRn and LlMd starting DNA has formed duplex (16). The samples were elements is evident when the DNA sequences 3' of the diluted to 0.12 M sodium phosphate buffer and applied to a termination codon of ORF II are compared. Lxi is more jacketed hydroxyapatite column equilibrated in the same closely related to LiRn 3 or LlMd A2 than either is to each buffer at 50'C. The temperature was then raised in 3YC other (see Fig 1B 3, 4, and 1, respectively). In terms of increments, and the amount of eluted DNA was determined percent similarity, the Lx sequence 3' of ORF II is -65% at each step. At least 95% of the radioactivity that bound to similar to either LiRn or LlMd, whereas the corresponding each column was recovered. regions ofthese sequences are only 55% similar to each other (see legend to Fig. 1). Fig. 1B2 shows the relationship between two typical members of the LiRn family, and a RESULTS similar relationship is found when typical members of the Identification of an Ancestral Li Element. In studying LlMd family are compared (12). different genomic clones of the rat (Rattus norvegicus) L1 In the remainder of this paper we shall show that Lxi family (LlRn), we determined the DNA sequence of several belongs to a family of sequences amplified much earlier than DNA elements (unpublished work) that were homologous to the present-day Li families of rat or mouse. Therefore, the LiRn but were not typical members of the LiRn family. Lx family is an old or ancestral rodent Li family from which Typical LlRn members are at least 90% identical with each the present-day rat and mouse Li families descended. To other, whereas these elements, which we've called Lx ele- facilitate discussion we shall refer to this ancestral Li family ments, were only -65% or 85% identical to LiRn members, and its members as Lx and to the present-day rat or mouse depending on the region compared. Li family and its members as LiRn or LlMd, respectively. A reqg on compared aqrt RE Pr I I I1 in-LZIORF LaiZORF 1 kb I - B FIG. 1. Dot-matrix analysis of the rat Li (LlRn), mouse Li (LlMd), and Lx nucleo- ' tide sequences. (A) Schematic representa- 3 and] tion of an archetypical full-length rat Li element and of the Lxi sequence shown below. Pr, promoter; Pu:Py tract, polypu- 3'ad rine:polypyrimidine region; A-rich tail, ade- sr.i nine-rich region. Although most LiRn ele- ments are full-length as depicted, ORFs ofall ~odon rat elements sequenced so far are interrupted by one or more genetic defects.