J Hum Genet (1998) 43:77–84 © Jpn Soc Hum Genet and Springer-Verlag 1998

MINIREVIEW

Yoshio Miki

Retrotransposal integration of mobile genetic elements in human diseases

Received: February 5, 1998 / Accepted: March 4, 1998

Abstract Approximately one-third of the mammalian genome is composed of highly repeated DNA sequences, of which the two major families, the long and short interspersed nucleotide elements (LINEs and SINEs), are represented in humans by L1 and Alu elements, respectively. Both types of element are considered to be retrotransposable and to play significant roles in genomic function and evolution. The majority of inserted elements are truncated and often rearranged relative to full-length elements; usually, such retrotransposed sequences are flanked by target-site duplications of various lengths and contain 3′ polyA tracts, common characteristics of retrotransposal integration. Retrotransposal integrations of Alu and L1 sequences into biologically important genes appear to play significant roles in some human diseases. Most of the inserted sequences that cause human diseases seem to belong to one or a few subsets of each type of retrotransposon, suggesting that only a few active elements can function as templates for retrotransposition. Integrations observed in oncogenes and in tumor suppressor genes may participate in carcinogenesis by altering the activity of the affected genes. The exact mechanism of these events is unclear; however, retrotransposal integration may be a general mechanism of mutation in humans.

Key words Retrotransposon · Repeated DNA sequence · Mobile genetic element · L-1 sequence · Alu sequence · Insertion mutation · Human disease

Y. Miki
Department of Molecular Diagnosis, The Cancer Institute, Japanese Foundation for Cancer Research, 1-37-1 Kami-ikebukuro, Toshima-ku, Tokyo 170, Japan
Tel. and Fax +81-3-5394-3926
e-mail: [email protected]

Introduction

The mammalian genome contains several families of repeated DNA sequences, some of which appear to be transposable elements. Retrotransposal integrations of sequences such as Alu and L-1 into biologically important genes appear to play significant roles in some human genetic diseases. Retrotransposons are a specific group of movable genetic elements, structurally and functionally related to retroviruses, that transpose by way of an RNA intermediate; i.e., the element is transcribed to RNA, reverse-transcribed to DNA, and reintegrated elsewhere in the genome.

Eukaryotic retrotransposons are of two general types, Class I and Class II (Fanning and Singer 1987). The coding regions of Class I elements, such as Ty in yeast or copia in Drosophila, are flanked by long terminal repeats (LTRs) and contain reverse transcriptase- and/or integrase-related sequences that possibly initiate their own retrotransposition. There is no apparent DNA sequence specificity for host integration sites, although some preference has been noted (Wilson and Young 1975). Class I retrotransposal elements are very similar to retroviruses, but the viruses possess envelope-protein genes to help them move from one cell to another. Reverse transcription of retroviruses and LTR-retrotransposons is an evolutionarily conserved process (Telesnitsky and Goff 1993). During integration, characteristic sequence changes occur in both the viral DNA and the host DNA: two nucleotides are deleted at each end of the viral DNA, and four to six base pairs of host DNA are duplicated at the integration site.

Class II retrotransposons do not contain LTRs, have polyadenylic acid (polyA) or A-rich sequences at the 3′ ends, and are flanked by target-site duplications of various lengths. In addition, these elements have at least one open reading frame (ORF) that is similar to the reverse transcriptases of retroviruses and Class I retrotransposal elements (Xiong and Eickbush 1990; Doolittle et al. 1989). Class II elements can be subdivided into sequence-specific and non-sequence-specific groups. The insertion mechanisms of the R2 element from Bombyx mori and the mitochondrial group II introns aI1 and aI2 from Saccharomyces cerevisiae implicate target site-primed reverse transcription at a sequence-specific single-strand break (Luan and Eickbush 1995; Moran et al. 1995). The most numerous transposable genetic elements in mammals are the short and long interspersed nucleotide elements (SINEs and LINEs), represented in the human genome by Alu and L1. Class II retrotransposons, such as L1, and other elements, such as Alu and processed pseudogenes, are considered to move by retrotransposition and lack LTRs and target-site duplications of specific lengths; however, Class II retrotransposons may encode their own reverse transcriptases. This characteristic differentiates them from Alu sequences and processed pseudogenes (Temin 1985). Evidence is convincing that both Alu and L1 have been spread by insertion throughout the human genome over time through retrotransposition, wherein new copies of active elements can be reinserted into the genome at new locations. Depending on its genomic location, the new element may disrupt the normal expression of a gene and result in disease.

Insertion mutations of L1 elements in human diseases

L1, a common family of repetitive sequences in the human genome, is considered to be a transposable element that can be transcribed into RNA, reverse-transcribed into cDNA, and then reintegrated into genomic DNA. About 10^5 copies of L1 sequence are present in all mammalian genomes, comprising roughly 5% of genomic DNA (Fanning and Singer 1987). They are up to 7 kilobases long, flanked by target-site duplications that vary from 5 to 15 bases. Unlike retroviral transposons, they carry no long terminal repeats (LTRs), but A-rich regions, usually preceded by polyadenylation signals, are present at their 3′ ends. A consensus human L1 sequence constructed from genomic L1 sequences (Scott et al. 1987) comprises 6.0 kb and contains a 5′ untranslated region (5′UTR) with an internal promoter (Swergold 1990), two long open reading frames (ORF-1 and ORF-2), and a 3′UTR. ORF-1 encodes a 40-kDa RNA-binding protein, but its precise function is unknown (Holmes et al. 1992). ORF-2 contains a region which is highly homologous to retroviral reverse transcriptase and RNase H (Hattori et al. 1989; Loeb et al. 1986; Xiong and Eickbush 1990). These features indicate that L1 elements may be “nonviral retrotransposons” (Schwarz-Sommer et al. 1987), which transpose through an RNA intermediate with reverse transcriptase activity, in a mechanism similar to that postulated for generating pseudogenes (Sharp 1983). However, because most L1 elements are heterogeneously truncated from their 5′ ends (up to 95% in mammals; Schwarz-Sommer et al.

[...]

when they detected de novo insertion of a truncated L1 element into exon 14 of the Factor VIII gene on the X chromosome in two patients with hemophilia A. Each of these insertions contained a 3′ portion of the L1 sequence; one was composed of a 3.8-kb truncated L1 sequence followed by a 57-bp polyA tract, creating an A-rich target-site duplication of at least 12 base pairs of the Factor VIII sequence. The other insertion was composed of a 2.3-kb 3′ end of the L1 sequence, flanked by an A-rich target-site duplication of at least 13 nucleotides. However, this L1 sequence contained an internal rearrangement. On the basis of its unique 3′ trailer sequence, the inserted element was closely related to members of the Ta subset of the L1 family (Skowronski and Singer 1986). The full-length precursor (LRE1) to one of the Factor VIII insertions was identified on chromosome 22 (Dombroski et al. 1991) and shown to contain the two ORFs predicted by the L1 consensus sequence. ORF1 encoded a protein of unknown function in a transient expression assay (Holmes et al. 1992); ORF2 encoded a protein with reverse transcriptase activity in a yeast assay system (Mathias et al. 1991).

Two groups have reported insertions of L1 sequence into the dystrophin gene that resulted in Duchenne muscular dystrophy (DMD). An insertion identified in exon 48 of the dystrophin gene by Holmes et al. (1994) comprised a total of 1968 bp, including 1401 bp of truncated and rearranged L1 element followed by a 37-bp polyA tail; 530 bp of non-L1 sequence flanked the 3′ end of the element. A perfect 12- to 15-bp target-site duplication of sequence from exon 48 surrounded the entire insertion. Since the 3′ non-L1 sequence of this insertion showed no obvious homologies with other known sequences, it was designated a “unique sequence component” (USC). Analysis of genomic DNA from normal individuals revealed that the USC was adjacent to the progenitor L1 element in the genome. It appears that read-through transcription resulted in an L1-USC transcript capable of retrotransposition. The L1 sequence inserted into this patient’s dystrophin gene (dystrophin L1) was shown to belong to the Ta subset of the L1 family, and therefore was closely related to the L1 sequence inserted into the Factor VIII gene mentioned before (Factor VIII L1). However, since the full-length progenitor of the Factor VIII L1, LRE1, differed in eight nucleotides from the dystrophin L1, LRE1 was not the precursor in this case. LRE2, the actual progenitor, was identified on chromosome 1q using the USC of dystrophin L1.

Narita et al. (1993) reported evidence of retrotransposition of L1 into the dystrophin gene of two Japanese brothers with DMD. In these patients the dystrophin gene was disrupted by insertion of a truncated L1 sequence into exon 44. The 606–608 bp L1 fragment was identical to the inverse complement of the 3′ portion of the Ta subset of L1 elements (Skowronski and Singer 1986). The insertion lay within an A-T rich region in the dystrophin gene, and its