Conserved Sequence Elements in the 5' Region of the Ultrabithorax Transcription Unit

The EMBO Journal vol.6 no.5 pp.1393-1401, 1987 Conserved sequence elements in the 5' region of the Ultrabithorax transcription unit C.Deborah Wilde and Michael Akam regulation of Ubx also requires the products of many other ident- required early in development Cambridge CB2 3EH, UK ified genes; these include functions Department of Genetics, Downing Street, to establish the initial spatial pattern of expression (Ingham et Communicated by M.Ashburner al., 1986; Ingham and Martinez-Arias, 1986; White and Leh- Clones homologous to the 5' region of the Ultrabithorax gene mann, 1986) and others whose products are required to main- of Drosophda melanogaster have been isolated from D. pseudo- tain the repression of Ubx in those segments where the gene is obscura, D. funebris and Musca domestica. Regions that en- not normally active (e.g. extra sex combs, Polycomb) (Lewis, code most of the Ubx protein have been sequenced in all 1978; Struhl, 1981; Ingham, 1985; Jiirgens, 1985; Struhl and three of these species, and the 5' upstream region has been Akam, 1985). Ubx is also regulated directly or indirectly by other sequenced in D. funebris to a point - 1000 bases upstream homeotic genes, including the abdominal-A and abdominal-B of the probable mRNA start site. Here we compare these functions of the BX-C, and itself regulates genes of the Antenna- sequences with those described elsewhere for D. melanogaster. pedia complex (Hafen et al., 1984; Harding et al., 1986; Struhl Deduced amino acid sequences of the Ubx protein show 8% and White, 1986). (D. pseudoobscura), 15% (D. funebris) and 22% (M. dom- A highly conserved 180-bp region of the Ubx 3' region is found estica) divergence from D. melanogaster. However, these fig- in many homeotic genes. This encodes the 60 amino acid homeo- ures mask very different rates of evolution in different regions box, a protein domain which probably mediates DNA binding of the protein. A glycine-rich ('hinge') region is conserved (Laughon and Scott, 1984; McGinnis et al., 1984; Scott and in each of these species, although its length is variable. Com- Weiner, 1984; Shepherd et al., 1984). parison of D. funebris and D. melanogaster sequences in the To identify other regions of both the Ubx protein and the Ubx long 5' untranslated leader region of the mRNA, and in the transcription unit that are critical for normal function, we have region immediately upstream of the start point of transcrip- isolated and sequenced clones homologous to the 5' region of tion, reveals tightly conserved elements embedded in an other- the Ubx gene from three other Dipteran species, Drosophila wise non-homologous sequence. These conserved elements include a 118-bp region that spans the mRNA start site, an Cyclorrhaphan internally repetitive (TAA), region in the untranslated leader Diptera and a short repeated motif immediately upstream of the ATG 1' codon that initiates the major open reading frame of the Ubx protein. Two other conserved elements were identified upstream of the transcription start site; both elements have structural features consistent with a role as recognition sites for regulatory proteins. Key words: bithorax complex/diptera/evolution/upstream el- M Year ements Introduction The Ultrabithorax (Ubx) gene of the Drosophila bithorax complex (BX-C) controls segment identity in parasegments 5 and 6 of the Drosophila embryo, and plays a minor role in the more posterior abdominal segments (Lewis, 1978, 1981; Casanova et al., 1985; Sanchez-Herrero et al., 1985). Within each of these regions, and within each major germ layer, Ubx gene products show different patterns of expression (Akam, 1983; Akam and Martinez-Arias, 1985; White and Wilcox, 1984, 1985a; Beachy et al., 1985). Most, and possibly all functions of the Ubx gene are mediated by homeoproteins encoded in long 5' and 3' exons, and in short Musca microexons located within the 70-kb intron of the Ubx transcrip- Drosophila Drosophila Drosophila tion unit (Bender et al., 1983; Akam et al., 1985; Beachy et al., melanogaster pseudoobscura funebris domestica 1985; Hogness et al., 1985). Genetic (Lewis, 1951, 1963, 1978), and more recently molecular evidence shows that the correct ex- Fig. 1. Relationships of the Dipteran species used for sequence comparisons the (Throckmorton, 1975; Hennig, 1981). Estimates of divergence times for the pression of these proteins requires other sequences within calyptrate and acalyptrate Cyclorrhapha, for the different subgenera of the Ubx transcription unit, and the close linkage of DNA in the bxd genus Drosophila, and for the melanogaster and obscura species groups region of the Ultrabithorax domain (Bender et al., 1983, 1985; within the subgenus Sophophora are based on immunological comparisons of Hogness et al., 1985; White and Wilcox, 1985b). The normal the haemolymph proteins (Beverley and Wilson, 1984). IRL Press Limited, Oxford, England 1393 C.D.Wilde and M.Akam ' r !''.i ! -i -r---l-- --l r- 1 ---i ,C,lr.-. .i .. .. ... .. .r-8. ., l,,,, ;' .; ..,.*s,,_...C-e'......,rwo.>s..-i;Oi.;.-E...;_...P.S,,v..-.._,-._.v/*W{;<k i _._; a::*...,.b., .4 A B C Fig. 2. Genomic Southern hybridizations to EcoRI-digested DNA from different Drosophila and other Dipteran species. (A) Hybridization of the Ubx 5' probe at moderate stringency. Specific Ubx homologues are evident in all the Drosophila species, and are just detectable in Musca and Calliphora. Arrowheads indicate the cloned fragments in D. pseudoobscura, D. funebris and Musca. (B) Hybridization of the same probe at low stringency to Musca and D. melanogaster DNA. (GGX)n sequences in the Ubx probe now reveal repeated elements present in both species. (C) Hybridization of the Musca Ubx clone to Musca and Drosophila DNA at low stringency. The Musca clone hybridizes strongly to the Ubx fragments of Drosophila sequences, and more weakly to a background of repeated sequences. Each track contains 2.5 Itg of DNA. The Musca genome is - 10-fold larger than that of Drosophila, and therefore the molar concentration of unique sequences is lower in the Musca track. The lineage tree above the species names indicates the phylogenetic grouping of the species used, with no attempt to show divergence times. The Drosophila species melanogaster, simulans, erecta and yakuba are all in the melanogaster species group. D. saltans, D. willistoni and D. pseudoobscura are all in the subgenus Sophophora. D. virilis, D. hydei and D. funebris are all in the subgenus Drosophila, and so should share a common ancestor with D. melanogaster. Calliphora, Sarcophaga and Musca are all in the same family, Calliphoridae. pseudoobscura, D. funebris and Musca domestica. D. pseudo- containing the strongly homologous fragments were isolated from obscura is a member of the same sub-genus as D. melanogaster phage libraries of D. funebris and D. pseudoobscura DNA by (the Sophophora), but these two species probably diverged 40-60 standard techniques (see Materials and methods). million years ago (Throckmorton, 1975; Beverley and Wilson, Initially we had some difficult distinguishing a unique Ubx- 1984). D. funebris is in a different sub-genus (Drosophila), and homologous fragment in Musca DNA. Under moderately strin- M. domestica is a distantly related member of the same sub-order gent hybridization conditions (Figure 2) our probe hybridized of the Diptera, the Cyclorrhapha (Figure 1). The relationship to a prominent pattern of multiple bands. However, progressively of all of these species to D. melanogaster is sufficiently distant more stringent washes of parallel filters revealed a 1.8-kb frag- that unconstrained DNA sequences have diverged extensively, ment hybridizing most stably to the Ubx probe (Figure 2). One allowing functional elements to be identified by sequence con- of four clones isolated with the same probe from a Musca gen- servation. omic library hybridized specifically to this fragment and, when Results hybridized to Drosophila DNA, uniquely to the 5' Ubx fragment originally used as probe. We were therefore confident that this Isolation and sequencing of Ubx homologous clones clone contained a specific Musca homologue of Ubx. The other Preliminary genomic Southern hybridization experiments revealed Musca clones isolated in the same screen hybridized strongly to that each of 10 tested species of the genus Drosophila contained repeated sequences in the Musca genome, and more weakly to unique restriction fragments strongly homologous to a probe span- multiple bands in the Drosophila genome. It is now clear that ning the 5' exon of the Ubx gene (Figure 2). These fragments the homology to repetitive sequences observed with these Ubx were clearly distinguishable above a background of heterogeneous probes is mediated by the stretch of GGX repeats in the Ubx fragments showing weak homology to the probe. Genomic clones 5' exon (see below, and R.Weinzierl and M.Akam, in prep- 1394 Ubx sequence conservation H R AUG H . D. mel. in l_ or m Bg B AUG _" H Bg p R D. pse. I Ll ._I_ _.. _. 1--- x AUG R PI R A A w D. fun. II.,L , 1J * - - ----, i- R Musca 1~~~~~~~ ---4 6 ---4 200 bp Fig. 3. Organization of conserved elements in the Ubx 5' region. (A) D. melanogaster, (B) D. pseudoobscura, (C) D. funebris, (D) M. domnestica. Sequenced genomic fragments from the four species are aligned by homology of the major open reading frame (cross-hatched box). The locations of other conserved elements are shown by boxes. These correspond to the regions highlighted in Figure 4. Solid boxes are used for upstream elements and conserved sequences in the Ubx intron. A stippled box shows the partially conserved TAA repeat. Splice donor sites are shown by arrowheads, and the transcription start site is shown with a bold arrow. Sequencing strategies for each fragment are also shown. fragments obtained by sonication, cloned into the SnaI site of M13mp8.

Load more