Contains Introns Shared by Src Homology 2 Domains in Many Unrelated Proteins
Total Page:16
File Type:pdf, Size:1020Kb
Copyright 2003 by the Genetics Society of America Phospholipase C-␥ Contains Introns Shared by src Homology 2 Domains in Many Unrelated Proteins Charlene M. Manning,1 Wendy R. Mathews,2 Leah P. Fico and Justin R. Thackeray3 Biology Department, Clark University, Worcester, Massachusetts 01610 Manuscript received September 10, 2002 Accepted for publication March 13, 2003 ABSTRACT Many proteins with novel functions were created by exon shuffling around the time of the metazoan radiation. Phospholipase C-␥ (PLC-␥) is typical of proteins that appeared at this time, containing several different modules that probably originated elsewhere. To gain insight into both PLC-␥ evolution and structure-function relationships within the Drosophila PLC-␥ encoded by small wing (sl), we cloned and sequenced the PLC-␥ homologs from Drosophila pseudoobscura and D. virilis and compared their gene structure and predicted amino acid sequences with PLC-␥ homologs in other animals. PLC-␥ has been well conserved throughout, although structural differences suggest that the role of tyrosine phosphorylation in enzyme activation differs between vertebrates and invertebrates. Comparison of intron positions demon- strates that extensive intron loss has occurred during invertebrate evolution and also reveals the presence of conserved introns in both the N- and C-terminal PLC-␥ SH2 domains that are present in SH2 domains in many other genes. These and other conserved SH2 introns suggest that the SH2 domains in PLC-␥ are derived from an ancestral domain that was shuffled not only into PLC-␥, but also into many other unrelated genes during animal evolution. OUR distinct types of phospholipase C (PLC) pro- X and Y catalytic domains that is unique among PLC F tein, , ␥, ␦, and ε, are produced in mammals by a subtypes. This central region contains one src homology rather diverse family of more than 10 genes. All PLCs 3 (SH3) and two SH2 domains within a split PH domain, catalyze the hydrolysis of the membrane phospholipid implying a series of shufflings and duplications to pro- phosphatidyl inositol 4,5-bisphosphate [PI(4,5)P2] into duce the modern PLC-␥ structure from an ancestral inositol 1,4,5-trisphosphate (InsP3) and diacylglycerol. PLC form. These events were apparently completed be- The former is bound by specific receptors embedded fore the parazoan-eumetazoan split, because the sponge in the endoplasmic reticular membrane, leading to a Ephydatia fluviatilis has a PLC-␥ homolog with identical transient increase in intracellular calcium, while the structure to all other animal forms (Koyanagi et al. latter is a direct activator of protein kinase C. The pat- 1998). The SH2 and SH3 domains of PLC-␥ are typical tern of expression, mechanism of activation, and cellu- examples of the many widely distributed modules that lar function vary considerably among the four types are found in many different proteins involved in signal (reviewed by Rebecchi and Pentyala 2000; Lopez et transduction; each domain is presumed to have arisen al. 2001; Song et al. 2001). All PLC proteins have three once and then been spread both by gene duplication domains in common: a C2 domain and catalytic domains and by being co-opted into existing genes by exon shuf-  ␥ ␦ X and Y; three of the PLC types, , , and , also share fling via retrotransposition, illegitimate recombination, an N-terminal pleckstrin homology (PH) domain and or long interspersed nuclear element-mediated 3Ј trans- ␦ EF hands. PLC- may be the ancestral form, because duction (Long 2001). In the case of PLC-␥ these ac-  ␥ ε , , and types are absent from plants and simple quired domains permit specific protein:protein interac- eukaryotes such as yeast. ␥ ␥ tions: the SH2 domains are thought to allow PLC- to PLC- is particularly interesting from an evolutionary bind to specific phosphorylated tyrosine residues on the standpoint, because it has a central region between the cytoplasmic face of activated receptor tyrosine kinases, whereas the SH3 domain is essential in allowing PLC-␥ to activate the phosphatidylinositol-3-OH kinase [PI(3)K] Sequence data from this article have been deposited with the enhancer (Ye et al. 2002). EMBL/GenBank Data libraries under the accession nos. AF543827 and AF543828. PLC-␥ is involved in regulating many aspects of cell 1Present address: Department of Genetics, HHMI, Harvard Medical physiology, including proliferation, differentiation, and School, 200 Longwood Ave., Boston, MA 02115. motility (reviewed by Rebecchi and Pentyala 2000). 2 Present address: Department of Biology, Johns Hopkins University, PLC-␥ activation is triggered by the binding of a wide 3400 N. Charles St., Baltimore, MD 21218. 3Corresponding author: Biology Department, Clark University, 950 variety of growth factors, cytokines, and immunoglobu- Main St., Worcester, MA 01610. E-mail: [email protected] lins to their membrane-bound receptor. Two distinct Genetics 164: 433–442 ( June 2003) 434 C. M. Manning et al. PLC-␥ homologs encoded by different genes—PLC-␥1 further rounds of plating and hybridization. Purified plaques and PLC-␥2—are present in mammals, whereas all inver- were obtained and -genomic DNA was extracted by the plate lysate method (Sambrook et al. 1989). tebrates described to date have only a single homolog. DNA sequencing: Restriction fragments containing sl exons In Drosophila, the PLC-␥ homolog is encoded by small were identified by Southern blot hybridization using a variety wing (sl; Emori et al. 1994; Thackeray et al. 1998), a of probes corresponding to the sl open reading frame (ORF). gene first identified by Bridges in 1915 (Lindsley and For the D. virilis gene, several of these fragments were sub- ϩ Zimm 1992). Flies lacking sl function are viable, with cloned into pBluescript KS (Stratagene) and sequenced on both strands by primer walking. Most of the D. pseudoobscura mild defects in eye and wing development due to over- gene was sequenced by transposon tagging of a 4.8-kb PstI/ activation of the epidermal growth factor receptor sig- HindIII subclone with the EZ::TNϽKAN-2Ͼ kit, using the naling pathway (Thackeray et al. 1998). This weak phe- conditions recommended by the manufacturer (Epicentre notype contrasts markedly with the more significant Technologies, Madison, WI). The 5Ј end of the D. pseudoob- effects resulting from loss of PLC-␥ function in verte- scura gene sequence was determined by primer walking within ␥ an overlapping 2.1-kb PstI/SacI fragment. For each species, brates: PLC- 1 knockout mice die in early embryogene- the subcloned fragments included the entire sl homolog. Se- sis (Ji et al. 1997), while mice lacking PLC-␥2 are viable, quencing reactions on purified plasmid DNA were performed but have defects in B cell development (Hashimoto et using the ABI Prism Big Dye terminator cycle sequencing kit al. 2000; Wang et al. 2000). The amino acid sequence (Applied Biosystems, Foster City, CA). Sequencing reactions of Sl is generally similar to its mammalian homologs were purified by ethanol precipitation with pellet paint (Nova- gen, Madison, WI) and separated on a Perkin-Elmer (Norwalk, (Emori et al. 1994), suggesting that its mode of activa- CT) ABI Prism 377 DNA sequencer. Sequences were edited tion and cellular function are conserved. However, sub- with Sequencher 4.1 (Gene Codes, Ann Arbor, MI) and fur- tle differences in its activation may exist; for example, ther analyzed with MacVector 6.01 (Accelrys, Madison, WI). two of the three tyrosines in mammalian PLC-␥1, which Phylogenetic analysis: Amino acid sequences were aligned become phosphorylated during activation, are missing using either ClustalW 1.4 (Thompson et al. 1994) or ClustalX 1.6 (Thompson et al. 1997) and in some cases improved by eye. in Sl. In an effort to determine the functional signifi- Unrooted trees were generated from the aligned sequences by cance of these and other differences between vertebrate PAUP (version 4.0b10; Swofford 2002) using bootstrap with and invertebrate PLC-␥, we cloned and sequenced two heuristic search and 1000 replicates; gaps were treated as miss- additional Drosophila PLC-␥ homologs. We present ing; parsimony was used as the optimality criterion; starting here a comparison of both the amino acid sequence and trees were obtained by stepwise addition; and branch swapping was by tree-bisection reconnection. the gene structure of these sl homologs in Drosophila, Tyrosine phosphorylation prediction: The probability of a as well as among a variety of other invertebrate and given tyrosine being phosphorylated was determined using vertebrate PLC-␥ homologs. Our analysis not only sheds the NetPhos algorithm (http://www.cbs.dtu.dk/services/ light on PLC-␥ function and evolution, but also provides NetPhos/), which uses a neural network method to compare tyrosines and their local context to tyrosines known to be evidence for a common origin of SH2 domains in all phosphorylated (Blom et al. 1999). A score Ͼ0.5 predicts that proteins. a tyrosine is in a context that will be phosphorylated in vivo. MATERIALS AND METHODS RESULTS -library construction and screening: -libraries of Drosophila Isolation of sl homologs from D. virilis and D. pseudoob- pseudoobcura and D. virilis genomic DNA were constructed in scura: We constructed -libraries of D. pseudoobscura and FIXII (Stratagene, La Jolla, CA) and GEM11 (Promega, D. virilis genomic DNA and screened them with various Madison, WI), respectively. Briefly, genomic DNA was pre- ␥ pared from adults by standard methods, partially digested with genomic probes from the D. melanogaster PLC- homo- DpnII, and separated on a 0.5% agarose gel. Fragments in the log encoded by sl. Several clones were isolated and 10- to 20-kb range were recovered onto dialysis membrane mapped by Southern blotting and various restriction and the ends partially filled in with dCTP and dTTP. The fragments were subcloned and sequenced from each genomic fragments were then ligated overnight at 4Њ to XhoI- digested DNA that had been partially filled in with dATP and species. We compared the two putative sl homologs with dGTP. The ligated DNA was packaged in vitro using Packagene the D.