(12) Patent Application Publication (10) Pub. No.: US 2005/0239060A1 Tang Et Al
Total Page:16
File Type:pdf, Size:1020Kb
US 2005O239060A1 (19) United States (12) Patent Application Publication (10) Pub. No.: US 2005/0239060A1 Tang et al. (43) Pub. Date: Oct. 27, 2005 (54) NOVEL NUCLEICACIDS AND (21) Appl. No.: 10/122,851 POLYPEPTIDES (22) Filed: Apr. 12, 2002 (76) Inventors: Y. Tom Tang, San Jose, CA (US); Chenghua Liu, San Jose, CA (US); Related U.S. Application Data Vinod Asundi, Foster City, CA (US); Jie Zhang, Campbell, CA (US); Feiyan (60) Division of application No. 09/620,312, filed on Jul. Ren, Cupertino, CA (US); Rui-hong 19, 2000, now Pat. No. 6,569,662, which is a con Chen, Foster City, CA (US); Qing A. tinuation-in-part of application No. 09/552,317, filed Zhao, San Jose, CA (US); Tom on Apr. 25, 2000, now abandoned, which is a con Wehrman, Stanford, CA (US); Aidong tinuation-in-part of application No. 09/488,725, filed J. Xue, Sunnyvale, CA (US); on Jan. 21, 2000, now abandoned. Yonghong Yang, San Jose, CA (US); O O Jian-Rui Wang, Cupertino, CA (US); Publication Classification Ping Zhou, San Jose, CA (US); 7 Yunqing Ma, Sunnyvale, CA (US); (51) Int. Cl. ............................ C12O 1/68; CE,...; E. WE Miles. AARs). (52) U.S. Cl. ............................. 435/6; 435/69.1; 435/183; Radoje T. Drmanac, Palo Alto, CA 435/320.1; 435/325; 536/23.2 (US) Correspondence Address: (57) ABSTRACT Leslie A. Mooi HYSEQ, INC. The present invention provides novel nucleic acids, novel 670 Almanor Avenue polypeptide Sequences encoded by these nucleic acids and Sunnyvale, CA 94085 (US) uses thereof. US 2005/0239060A1 Oct. 27, 2005 NOVEL, NUCLEC ACIDS AND POLYPEPTDES 0009. The present invention relates to a collection or library of at least one novel nucleic acid Sequence assembled 1. CROSS REFERENCE TO RELATED from expressed sequence tags (ESTs) isolated mainly by APPLICATIONS Sequencing by hybridization (SBH), and in Some cases, Sequences obtained from one or more public databases. The 0001. This application is a continuation-in-part applica invention relates also to the proteins encoded by Such tion of U.S. application Ser. No. 09/552,317, filed Apr. 25, polynucleotides, along with therapeutic, diagnostic and 2000, which in turn is a continuation-in-part application of research utilities for these polynucleotides and proteins. U.S. application Ser. No. 09/488,725, filed Jan. 21, 2000, These nucleic acid sequences are designated as SEQID NO: both of which are incorporated herein by reference in their 1-1104 and are provided in the Sequence Listing. In the entirety. nucleic acids provided in the Sequence Listing, A is adenos ine; C is cytosine; G is guanosine; T is thymine; and N is any 2. BACKGROUND OF THE INVENTION of the four bases. In the amino acids provided in the Sequence Listing, * corresponds to the Stop codon. 0002) 2.1 Technical Field 0010. The nucleic acid sequences of the present invention 0003. The present invention provides novel polynucle also include, nucleic acid Sequences that hybridize to the otides and proteins encoded by Such polynucleotides, along complement of SEQID NO: 1-1104 under stringent hybrid with uses for these polynucleotides and proteins, for ization conditions, nucleic acid Sequences which are allelic example in therapeutic, diagnostic and research methods. variants or Species homologues of any of the nucleic acid 0004 2.2 Background Sequences recited above, or nucleic acid Sequences that encode a peptide comprising a specific domain or truncation 0005 Technology aimed at the discovery of protein fac of the peptides encoded by SEQ ID NO: 1-1104. A poly tors (including e.g., cytokines, Such as lymphokines, inter nucleotide comprising a nucleotide Sequence having at least ferons, CSFS, chemokines, and interleukins) has matured 90% identity to an identifying sequence of SEQ ID NO: rapidly over the past decade. The now routine hybridization 1-1104 or a degenerate variant or fragment thereof. The cloning and expression cloning techniques clone novel poly identifying Sequence can be 100 base pairs in length. nucleotides “directly' in the sense that they rely on infor mation directly related to the discovered protein (i.e., partial 0011. The nucleic acid sequences of the present invention DNA/amino acid Sequence of the protein in the case of also include the Sequence information from the nucleic acid hybridization cloning; activity of the protein in the case of Sequences of SEQ ID NO: 1-1104. The Sequence informa expression cloning). More recent “indirect cloning tech tion can be a segment of any one of SEQID NO: 1-1104 that niques Such as Signal Sequence cloning, which isolates DNA uniquely identifies or represents the Sequence information of Sequences based on the presence of a now well-recognized SEO ID NO: 1-1104. Secretory leader Sequence motif, as well as various PCR based or low Stringency hybridization-based cloning tech 0012. A collection as used in this application can be a niques, have advanced the State of the art by making collection of only one polynucleotide. The collection of available large numbers of DNA/amino acid Sequences for Sequence information or identifying information of each proteins that are known to have biological activity, for Sequence can be provided on a nucleic acid array. In one example, by Virtue of their Secreted nature in the case of embodiment, Segments of Sequence information is provided leader Sequence cloning, by virtue of their cell or tissue on a nucleic acid array to detect the polynucleotide that Source in the case of PCR-based techniques, or by virtue of contains the Segment. The array can be designed to detect Structural Similarity to other genes of known biological full-match or mismatch to the polynucleotide that contains activity. the Segment. The collection can also be provided in a computer-readable format. 0006 Identified polynucleotide and polypeptide Sequences have numerous applications in, for example, 0013 This invention also includes the reverse or direct diagnostics, forensics, gene mapping; identification of muta complement of any of the nucleic acid Sequences recited tions responsible for genetic disorders or other traits, to above, cloning or expression vectors containing the nucleic assess biodiversity, and to produce many other types of data acid Sequences, and host cells or organisms transformed and products dependent on DNA and amino acid Sequences. with these expression vectors. Nucleic acid sequences (or their reverse or direct complements) according to the inven 3. SUMMARY OF THE INVENTION tion have numerous applications in a variety of techniques known to those skilled in the art of molecular biology, Such 0007. The compositions of the present invention include as use as hybridization probes, use as primers for PCR, use novel isolated polypeptides, novel isolated polynucleotides in an array, use in computer-readable media, use in Sequenc encoding Such polypeptides, including recombinant DNA ing full-length genes, use for chromosome and gene map molecules, cloned genes or degenerate variants thereof, ping, use in the recombinant production of protein, and use especially naturally occurring variants Such as allelic Vari in the generation of anti-sense DNA or RNA, their chemical ants, antisense polynucleotide molecules, and antibodies analogs and the like. that specifically recognize one or more epitopes present on Such polypeptides, as well as hybridomas producing Such 0014. In a preferred embodiment, the nucleic acid antibodies. sequences of SEQID NO: 1-1104 or novel segments or parts of the nucleic acids of the invention are used as primers in 0008. The compositions of the present invention addi expression assays that are well known in the art. In a tionally include vectors, including expression vectors, con particularly preferred embodiment, the nucleic acid taining the polynucleotides of the invention, cells geneti sequences of SEQID NO: 1-1104 or novel segments or parts cally engineered to contain Such polynucleotides and cells of the nucleic acids provided herein are used in diagnostics genetically engineered to express Such polynucleotides. for identifying expressed genes or, as well known in the art US 2005/0239060A1 Oct. 27, 2005 and exemplified by Vollrath et al., Science 258:52-59 generation of anti-sense DNA or RNA, their chemical ana (1992), as expressed sequence tags for physical mapping of logs and the like. For example, when the expression of an the human genome. mRNA is largely restricted to a particular cell or tissue type, polynucleotides of the invention can be used as hybridiza 0.015 The isolated polynucleotides of the invention tion probes to detect the presence of the particular cell or include, but are not limited to, a polynucleotide comprising tissue mRNA in a Sample using, e.g., in Situ hybridization. any one of the nucleotide sequences set forth in the SEQ ID NO: 1-1104; a polynucleotide comprising any of the full 0021. In other exemplary embodiments, the polynucle length protein coding sequences of the SEQ ID NO: 1-1104; otides are used in diagnostics as expressed Sequence tags for and a polynucleotide comprising any of the nucleotide identifying expressed genes or, as well known in the art and Sequences of the mature protein coding Sequences of the exemplified by Vollrath et al., Science 258:52-59 (1992), as SEQ ID NO: 1-1104. The polynucleotides of the present expressed Sequence tags for physical mapping of the human invention also include, but are not limited to, a polynucle genome. otide that hybridizes under Stringent hybridization condi tions to (a) the complement of any one of the nucleotide 0022. The polypeptides according to the invention can be sequences set forth in the SEQ ID NO: 1-1104; (b) a used in a variety of conventional procedures and methods nucleotide Sequence encoding any one of the amino acid that are currently applied to other proteins. For example, a Sequences Set forth in the Sequence Listing; (c) a polynucle polypeptide of the invention can be used to generate an otide which is an allelic variant of any polynucleotides antibody that Specifically binds the polypeptide.