Matrix Attachment Regions and Structural Colinearity in the Genomes of Two Grass Species Zoya Avramova1,*, Alexander Tikhonov1, Mingsheng Chen2 and Jeffrey L
Total Page:16
File Type:pdf, Size:1020Kb
1998 Oxford University Press Nucleic Acids Research, 1998, Vol. 26, No. 3 761–767 Matrix attachment regions and structural colinearity in the genomes of two grass species Zoya Avramova1,*, Alexander Tikhonov1, Mingsheng Chen2 and Jeffrey L. Bennetzen1,2 1Department of Biological Sciences and 2Purdue Genetics Program, Purdue University, West Lafayette, IN 47907, USA Received October 14, 1997; Revised and Accepted December 4, 1997 ABSTRACT Besides being structural elements, therefore, MARs are believed to bear functional information as well. In order to gain insights into the relationship between For obvious reasons researchers have focused mainly on the spatial organization of the genome and genome gene-containing fraction of the genome. Hence, most of the function we have initiated studies of the co-linear existing information on the structural organization of the genome Sh2/A1-homologous regions of rice (30 kb) and describes individual genes and their immediate surroundings. sorghum (50 kb). We have identified the locations of Information on domain organization and chromosome folding at matrix attachment regions (MARs) in these homologous a supragenic level in animal systems is limited to only a few chromosome segments, which could serve as anchors studies: a 320 kb continuum of the Drosophila rosy–Ace locus for individual structural units or loops. Despite the fact (13), an 800 kb region of Drosophila chromosome 1 (14,15), the that the nucleotide sequences serving as MARs were 240 kb amplicon of the chinese hamster dihydrofolate reductase not detectably conserved, the general organizational gene (16) and 200 kb around the mouse heavy chain IgH locus patterns of MARs relative to the neighboring genes (17). The first study in plants devoted to the higher order were preserved. All identified genes were placed in structural organization of a large chromosomal continuum was individual loops that were of comparable size for focused on a 290 kb region around maize Adh1 (18). The location homologous genes. Hence, gene composition, gene of MARs along this chromosomal segment defined several loops orientation, gene order and the placement of genes of various sizes and a strong, although not perfect, correlation into structural units has been evolutionarily conserved between MAR locations and the junctions of repetitive and low in this region. Our analysis demonstrated that the copy number DNA blocks. The distribution of the different occurrence of various ‘MAR motifs’ is not indicative of classes of DNA within this continuum (19) with respect to the MAR location. However, most of the MARs discovered structural loops revealed that the long stretches of mixed classes in the two genomic regions were found to co-localize of highly repetitive DNAs are segregated into topologically with miniature inverted repeat transposable elements sequestered units (18). It was interesting, therefore, to study the (MITEs), suggesting that MITEs preferentially insert possible loop folding of grass genomic regions void of highly near MARs and/or that they can serve as MARs. repetitive DNA blocks in their intergenic space, as is the case for the Sh2/A1-homologous regions of sorghum and rice (20–22). Earlier we showed that the Adh1 gene was positioned in an INTRODUCTION individual loop (18). However, lack of information regarding the presence and location of other genes in the region did not allow The higher level structural organization of the genome is believed us at that time to pursue a possible correlation between the to be important both for compaction of chromosomes in the structural organization of genes and their function. We have nucleus and for regulating genome functions. According to the addressed this question here by examining the putative higher chromatin domain model (1–3) the genome is folded into level structural organization of two large genomic continuums of structural domains (loops), the bases of which are attached to a known gene composition. This is the first attempt to compare the proteinaceous nuclear skeleton (matrix). Such loops are believed possible spatial organization of homologous genomic regions in to provide an additional 1000-fold compaction of the genome, two different species. necessary for its accommodation into the interphase nucleus. The A characteristic feature of eukaryotic genomes is the enormous DNA sequences (matrix attachment regions, or MARs) variation in genome size, which bears little relation to differences in anchoring loops of heterogeneous size to this matrix are organism complexity or to the number of genes that code for proteins considered to be important structural elements of the genome. (23). Much of this variation is due to differences in the amount of Their ability to affect gene expression has been shown (3–6). In repetitive DNA. Recombinational mapping of different grass plants MARs have been reported to play a role in reducing both genomes has indicated extensive conservation of both gene content position effects and homology-dependent gene silencing (7–12). and gene map order (24,25), despite great variation in genome size *To whom correspondence should be addressed: Tel: +1 765 494 9837; Fax: +1 765 496 1496; Email: [email protected] 762 Nucleic Acids Research, 1998, Vol. 26, No. 3 Figure 1. Maps of the Sh2/A1-homologous regions of rice and sorghum. (A) The black horizontal arrows indicate the location of the Sh2, X and A1 homologs on the rice contig. The four overlapping clones covering 30 kb of the rice region are shown above. The sites for the restriction enzymes used in the screening are shown: X, XhoI; S, SacI; RI, EcoRI; M, MluI; Hp, HpaI; H, HindIII. The small black head-to-head triangles illustrate the MITEs identified in the region: T, tourist; St, stowaway; W, wanderer; E, explorer; Sn, snabo (20). The two arrowheads upstream of the A1 homolog show the size and location of a pair of direct repeats, while the vertical empty triangle marks the position of simple sequence repeats (20). The restriction fragments binding to the matrix are shown as open boxes with a star under the bar. The numbers flanking the boxes correspond to the map positions of the restriction cuts generating the respective fragment. The location of the 1.1 kb MAR inside the larger 4.8 kb BamHI–XhoI MAR, 3′ of gene X, is shown as a box inside a larger box. The putative loops and their lengths are shown at the lower level. The distribution patterns of A/T are shown in the boxed area on the same scale as the restriction map above. (B) Five adjacent clones covering 50 kb of the colinear Sh2/A1 region of sorghum are shown on top. Small vertical bars show the BamHI sites used for their subcloning. The restriction enzymes used in the analysis are: B, BamHI; K, KpnI; P, PacI; X, XhoI; RV, EcoRV; RI, EcoRI; H, HindIII. S-1–S-5 are putative new MITEs defined through their ability to form snap-back structures and/or because they were found around other genes in the databases (22; A.Tikhonov, unpublished results). The solo LTR between the two duplicated A1 homologs is shown as a double-headed light arrow. All other designations are as in (A). and chromosome number (26). Recent studies have shown that previously (20,21; Fig. 1). Restriction enzymes and T4 poly- interspecies gene content and order have often been preserved nucleotide kinase were from New England Biolabs. Calf intestine also at the 200–500 kb level (20–22,27–29). alkaline phosphatase (Pharmacia) was used for dephosphorylation. With this background we have asked three questions. What will be the potential of large chromosomal regions of known sequence Nuclear and matrix preparations composition to fold into individual structural units? Given the micro-colinearity of grass genomes, will the folding of colinear Leaves from 3-week-old rice (variety Teqing) and sorghum × regions into structural units follow a similar pattern? Will there be (cultivar BT 623) seedlings were used for isolation of nuclei. DNA sequence motifs that are common and/or conserved in plant Excised leaves were immediately frozen in liquid nitrogen and MARs? finely ground in a mortar. Nuclear isolation was according to the About 50 and 30 kb of sorghum and rice DNA respectively, protocol established previously (30). Aliquots containing 3–5 containing A1/Sh2-homologous regions, were screened for the A260 units of nuclei were dispersed in 50 mM Tris, 0.1 M NaCl, location and distribution of MARs as anchors for the bases of 5 mM MgCl2 buffer,_ pH 7.2, in 70% glycerol and stored for putative loops (domains). Several MAR-containing fragments (four several months at –80 C. Nuclear matrices were prepared by the in rice and seven in sorghum) were isolated and their sequences high salt extraction procedure, as described (30). compared with each other and with reported characteristics of animal MARs. The results indicated significant preservation of MAR binding experiments structural organization but no detected conservation of sequences or Each of the subclones was digested with a combination of restriction motifs responsible for folding of this region. enzymes as shown in the legend to Figure 2. The fragments resulting from restriction were dephosphorylated and end-labeled with MATERIALS AND METHODS [γ-32P]ATP (Amersham) in One-Phore All Buffer PLUS Materials (Pharmacia). The in vitro binding method was used to screen the rice and sorghum clones for the presence of MARs, essentially as Subclones of rice and sorghum bacterial artificial chromosomes described previously (30). In pilot experiments the amount of (BACs) containing Sh2 and A1 homologs have been described nuclear matrices and competitor DNA were established with the 763 Nucleic AcidsAcids Research,Research, 1994, 1998, Vol. Vol. 22, 26, No. No. 1 3 763 purpose of carrying out the binding assays under reproducible conditions and to eliminate weak and background associations.