Neurexin Ilia: Extensive Alternative Splicing Generates Membrane-Bound and Soluble Forms Yuri A
Total Page:16
File Type:pdf, Size:1020Kb
Proc. Natl. Acad. Sci. USA Vol. 90, pp. 6410-6414, July 1993 Biochemistry Neurexin IlIa: Extensive alternative splicing generates membrane-bound and soluble forms YURi A. USHKARYOV AND THOMAS C. SUDHOF* Howard Hughes Medical Institute and Department of Molecular Genetics, University of Texas Southwestern Medical School, Dallas, TX 75235 Communicated by Michael S. Brown, March 26, 1993 ABSTRACT The structure ofneurexin lIa was elucidated and 13-neurexins have identical C termini including the from overlapping cDNA clones. Neurexin lIa is highly ho- 0-linked sugar region, transmembrane region, and cytoplas- mologous to neurexins la and Ha and shares with them a mic domain (2). distinctive domain structure that resembles a cell surface Structures of the neurexins are suggestive of cell surface receptor. cDNA cloning and PCR experiments revealed alter- receptors. Indeed, the neurexins were originally found be- native splicing at four positions in the mRNA for neurexin HIa. cause of the identity of the sequence of neurexin Ia and Alternative splicing was previously observed at the same po- sequences from the high molecular weight subunit of the sitions in either neurexin Ia or neurexin Ila or both, suggesting receptor for a presynaptic neurotoxin, a-latrotoxin (7). Im- that the three neurexins are subject to extensive alternative munocytochemistry ofrat brain frozen sections revealed that splicing. This results in hundreds of different neurexins with neurexin I is highly enriched in synapses in agreement with variations in small sequences at similar positions in the pro- the localization of a-latrotoxin (2). These findings suggest teins. The most extensive alternative splicing of neurexin Ma that at least neurexins Ia and I13 are synaptic proteins that, was detected at its C-terminal site, which exhibits a minimum based on their structure and homologies, may represent a of 12 variants. Some of the alternatively spliced sequences at class of neuronal cell-interaction molecules (2). this position contain in-frame stop codons, suggesting the We now report the identification, structure, and expression synthesis of secreted proteins. None of the sequences of the of neurexin IIIa.t By extending the neurexin family of other splice sites in this or the other two neurexins include stop proteins, our results demonstrate the generality of the fea- codons. RNA blot analysis demonstrate that neurexin IlIa is tures of this protein family: their conserved alternative splic- expressed in a brain-specific pattern. Our results suggest that ing patterns, brain-specific expression, and similar domain the neurexins constitute a large family of polymorphic cell structure with distinct patterns of shared and divergent surface proteins that includes secreted variants, indicating a sequences. The expanded number of expressed neurexins possible role as signaling molecules. lends further support to a putative role of the neurexins as polymorphic identifier molecules. The surprising finding, The development and function of the nervous system are however, was that neurexin IIIa may be produced in secreted dependent on an intricate network of cell signaling and forms, suggesting a higher degree of complexity than indi- cell-cell interaction mechanisms (1). We have identified (2) a cated by the structures of neurexins I and II. family of cell surface receptors named neurexins with char- acteristics that suggest a role in such cell-cell interactions. Two genes for neurexins were identified; each gene con- EXPERIMENTAL PROCEDURES tains two independent promoters that drive transcription of cDNA Cloning and Sequencing. cDNA cloning and mapping two classes ofmRNAs. The two classes oftranscripts encode were performed as described using 32P-labeled oligonucleo- proteins with different N termini; the larger proteins are tides as probes (8). DNA sequencing was performed on referred to as a-neurexins and the shorter are l3-neurexins. single-stranded DNA from M13 subclones using the dideox- The expression of the neurexins is brain-specific and con- ynucleotide-termination method (9) with fluorescent primers centrated in neurons. Neurexins are highly polymorphic due and Taq polymerase (Promega). Reaction products were to extensive differential splicing with the potential to express analyzed on an ABI 373A automatic DNA sequencer. hundreds of different proteins. PCRs. First-strand cDNA synthesis was performed in 100 Neurexins have a distinctive domain structure. In the ,ul by standard procedures (10) using 10 ug -of poly(A)+- a-neurexins, a signal sequence is followed by three copies of enriched RNA from rat forebrain, cerebellum, or spinal cord. a long repeat that make up >75% of the total protein. Each PCR mixtures contained 2-5 ,ul of cDNA, 2.5 mM MgCl2, 10 ofthe long repeats contains a central epidermal growth factor mM Tris'HCl (pH 8.3), 50 mM KCI, 20 pmol of each primer, (EGF) domain flanked by right and left arms that are weakly all four dNTPs (each at 200 AM), and AmpliTaq DNA homologous to each other. Data bank searches (GenBank polymerase (Perkin-Elmer) at 30 units/liter. PCRs were per- Release 70 and Protein Identification Resource Release 30) formed in 32 cycles, with the first two cycles at 95°C for 5 min, demonstrated that the extracellular repeats in neurexins are 60°C for 1 min, and 72°C for 5 min, followed by 30 cycles at related to sequences found in several extracellular matrix the same temperatures but for 1 min, 1 min, and 2 min. proteins that participate in cell-cell interactions during de- Neurexin III was initially identified with two oligonucleotides velopment, including laminin A (3), agrin (4, 5), and slit (6). corresponding to sequences that are highly conserved among After the three long repeats, a-neurexins contain an 0-linked neurexins [(nucleotides in parentheses denote redundant sugar domain, a single transmembrane region, and a short positions with multiple nucleotides) GGAGATCTAGT- cytoplasmic tail. The N-terminal sequence of f-neurexins TGCGGTACTTGTACATGGC(G/A)TAGAG(G/A)AG- differs from that of the a-neurexins but is spliced into the GAT and AGCAAGCTTGGCACCGTGCCCATCGCCAT- a-neurexin sequences at the end ofthe third EGF domain. a- Abbreviation: EGF, epidermal growth factor. The publication costs of this article were defrayed in part by page charge *To whom reprint requests should be addressed. payment. This article must therefore be hereby marked "advertisement" tThe sequences reported in this paper have been submitted to in accordance with 18 U.S.C. §1734 solely to indicate this fact. GenBank (accession nos. L14851). 6410 Downloaded by guest on October 6, 2021 Biochemistry: Ushkaryov and Siidhof Proc. Natl. Acad. Sci. USA 90 (1993) 6411 CAA]. Subsequent PCRs to map the extent of alternative is followed by a long open reading frame. This identification splicing were carried out with oligonucleotides I, II, and III is supported by the fact that the putative initiator methionine in Fig. 3 (I, GCGAAGCTTGTTTCATCTGCTGAATGCT; is preceded by an in-frame stop codon (underlined in Fig. 2) II, GCGGCATGCCACATTCTTGAAAGTTGC; III, and that its sequence context conforms well to the consensus GCCTCTAGAACCCGTCTGATTCCTGGCT). Two types site of initiator methionines (12). of PCR experiments were performed: (i) PCRs were per- Six of the cDNA clones had poly(A) tails at their 3' end. formed in duplicate with one labeled primer and one unla- However, these poly(A) tails initiated at three positions in the beled primer. The products of such reactions were then nucleotide sequence (Fig. 2). None of the poly(A) tails was analyzed by denaturing PAGE and autoradiography to de- preceded by a classical polyadenylylation consensus se- termine the minimal number of fragments generated that quence. Therefore, it is doubtful that these poly(A) tracts incorporate both primers. This avoids artifacts caused by represent the poly(A) tails of the neurexin IIIa mRNA. The single-oligonucleotide priming or heteroduplex formation 3' untranslated region of the neurexin IIIa mRNA is rich in (11). (ii) PCRs were performed with unlabeled primers, adenosines, suggesting that the poly(A) tails in the cDNAs analyzed by nondenaturing PAGE, and stained by ethidium may represent oligo(dT)-priming sites that are not necessarily bromide. PCR products were then subjected to Southern blot poly(A) tracts. analysis with internal oligonucleotides or excised and cloned Alternative Splicing of Neurexin Ma. The 12 cDNA clones into M13 vectors and sequenced. All of the PCR products of that were sequenced contained four internal positions at Fig. 3 were sequenced except for products C, E, and G, which different clones exhibited sequence variations (Figs. 1 which were identified on the basis of their size in relation to and 2). These differences consisted in the presence or ab- the sequenced other products and of their hybridization with sence of a sequence stretch, suggesting that they arose by an insert-specific oligonucleotide (ACCAAGTACAGGTAG- alternative splicing. Comparison of the sequences of neurex- GTC). ins Ia, Ila, and IIIa demonstrates that, at each of these four RNA Blot Analysis. This procedure was performed (2) using positions, neurexin Ia, neurexin Ila, or both are also alter- total RNA from various rat tissues and single-stranded 32p- natively spliced (2). This result raises the possibility that the labeled DNA probes. Blots were rehybridized with a probe to three neurexins are alternatively spliced at similar positions. cyclophilin to control for RNA loading. A simple pattern of