wno^/ta* 3 Una
t h cDNA^y /
a e
w-m 12^3 ^
NEDOBIS T98099
NE co mm ■ mm B &
$ # im 1 (ICAWc...... l 2 ...... 5 3 jo ...... 6
1 s i. i %±&cmkmmfejkum^(nm.m(Dmm^m...... 7 i. 2 ^&cDKAmmfeRTfmfc7-(Dmyo(Dmife^m...... 8 1. 3 ^^ScDNAE^lJ^:^...... 10 12 1 . 5 #$fl§#?#ffflcDNA:r — *7 ~-'<'—7s ...... 14
mm...... 16 1. 7 AAcDNA?)#&E?lJ#:^(D#%...... 18 1. 8 Ifelftiftfc hcDNA^V J±Mtif...... 19 1. 9 A#cDNA(C#&K?iJ#:^ cDNA###:$ij(7)##, Rtf DNA-r-f ^ nTH'f^Sglff ...... 22 1 . 10 DN A-^/f ?uTU4 ...... 23 i. ii &mcrnA(Dm,mmvfcfeRtfmmmft\m-tz>%\®.&ffimm...... 25 i. 12 m#cDNA 2 m ^mcDNAmmfcRtfmfcT-nmmomm^m Its...... 62 62 64 70 ...... 70 i S4$ 55£:EcDNAgfl?lJ&£ 4. 1 *tS ...... 71 4. 2 71 4. 3 73 4. 4 151m...... 76 4. 5 78 #5# 5. 1 ItS...... 81 5. 2 tlLfcy7l>')i7, iM...... 81 5. 3 's^y- J±(DWi^...... 84 5. 4 ...... 91 5. 5 ##3C#...... 96 S6$ ####mcDNAT-f 6. 1 Its...... 97 6. 2 ###!#;...... 98 6 . 3 *5m ...... 101 6 . 4 #S...... 107 6. 5 ...... Ill e. 6 nmjcm...... 112 7 s ^ScDNAOSSE^J^^S-r/^SSE^cDNA-T — * ——X'>X t A(D# 7. 1 ItS...... ;•• 113 7. 2 £;EcDNA 7. 3 ^zEcDNA t2'—* '>Xt" AtDfM...... 122 7. 4 wa# z i44 7. 5 ...... 146 8 S A&cDNA(7)^m@a^J^:^(D#% 8 . 1 ItS...... 149 8. 2 ##### 150 8. 3 Sm!5^:fc cfclMS...... 155 8. 4 l5!^io J: 171 8. 5 ...... 172 #9# hcDNAy;j*MVf 9. 1 Its...... 173 9. 2 mmt'ntex...... 174 9. 3 177 li a cn cn cn cn cn ►U. ►fa. CO CO CO CO CO to to to to tco JC |C |C |C |C O 0 0 0 0 0 co co # # # # # # 4u CO K) CO ro 4^ CO ro I—1 4^ CO ro I—1 cn 4^ CO ro cn 4^. CO ro 1—1 CJi 4u gt # # Xm cru z5 4} 4} 4> 4i} Xffl # ■ttF # D> m # Hi uu Xm Xm 3E # OU m cjll Xm 4H S M M > m m M z3 0 4> 4> H> H> a m n 3S # Hi nn|i 4 tffl # m null s 35 » nu|i 4 ##* null > 0 a W Xm Xm Xm Xm h> 0 0 0 0 n B & # Wr rv 44 & H n M z & & 4 H j* 4 Z H 0 0 a O 3* Mn S5 # m SI W Z Z > m a v* # Q 4 > # z z z z % m M> 4§ B E > > V j> a mtti; 4 n tt ca a la ra cfl to a h’ h- FT ru ru ru ru ^ h> BK S n ^ ^ ^ K # a # Q V a #a BH h to m S ruTW ^ ^ ^ ^ * si ^4 4 # ^4 V c^rt # Ev iM 4} 4} 4} 4} a m M 4 gw 9-r- 53i % 4} & Cm 4} Q q Q » a % # # # # a a * a Hi Hi Hi # ^ Hi § 44 -H 44 m 4 S a a IS 5 5> a m a 5F m a a m a 4 a 4J $ $ Si o< & $ M 44 4 m m m % IS K' m >j w 4i y- # H # >4 n d Hi ^ is 44 4 # 4 ^ H CO to to to to to to to to to to to to to to to to to to to O O cO CO oc Oc c/i C/1 Ui ■u 4^ ^ to to 0 0 0 CO CO 00 00 00 00 O CO 00 4u to ■O to CO to o <1 o 00 •0 CZ) CO to 0 CO c/1 Oc Oc Ul Vl 15. 5 mcDNAT - f ...... 302 15. 6 ^ScDNA(7)SSE^J^:^S.r/ ±##m^cDNAT-^ * '>* 71 ...... 303 15. 7 AAcDNA(D#&Ef'J#:^(D#%...... 304 15. 8 ^ftcDNA(7)SSE^J^:^...... 305 15. 9 A#cDNA(D#&E?IJ#:J& cDNAW£ftffjlJ DNAv'i 9 306 15. 10 DNA-7-T f n T Ix^T (C Z 6#faf 307 15. ii &mcDKA(D&mmfcfeRummMVr\z.m-tz>m®.&ffifflm...... 308 15. 12 A#cDNA(D####(cMi-6K#M^...... 309 1 ] 310 [##] b&...... 323 IV ffs m m 1. # cable 2010¥(e(t25^R ^-ei^^ft-f-5 t=f <0 . ^tUe^y 5 S/f 2005^ariemmamAf5fb TV^5. mt LT*re»§^t hcDNA^v V' 9 oi^it hcDNAf n —y (7)i£&gfl?iJ£:&£U 5 ^le^(7)-^(Z)B^IWW^fTo/c0 *y p y ^ y h (DfemW-M £ Ulle^-fo j:^-!; y ^^#%^(DcDNA(D#me^(t6^-A-r(l. jEm yxmixmmtsthtit g^s cDNA7 ^ ^7 !i "l^ol't, ^SSSE^lJ^^:SL^0 d'T&DNA#%m#^-A-f#, 4- 5kbpa±<7)##^ P-y|e^i>-CME?U£$:;t U ^ <7) 41 6-gRilBU U £i£XE?iJ sosui#^emL^^y/<^!g(7)#mfm^^$re^^. mmr/y-y 3y ^^Lfc^ScDNAgfl^J^I&^L. Bf&fiW teSf-f 6 ±%L — — y '>^rA$;liLfco ATAC-PCRj^(e g-esam#Lfcy p-y£iiffl u Mtf WfiWv^y- y ^-y *m mLtz0 &ti. mmtfjte'Mfcx-mmxvx jy/^-y y bmvr pTtg^cDNA-^y * pTW (D\/E$L$Lffi*m±L\^tz.0 $ 'MfcXkl&Bb y k°yf u, ya^y- * %M$tx$5*mtL lita z bizmmMvt^n*xv b^n^mt lx. jz< fflnyy-^TAj^WL, £M£S^6#WtoSIF?)&6WM£{£iSL/::0 o y cc ^ h Tti^f**e9,071fl9(DcDNA^SE^J £ . 2,000f@(DS^SSE^J^r^:S L fc (My'-y 0o &f±(7mmEfW#f^m&m(c^iy £/e¥fi£n¥ioti ^6¥tiU2¥ 4^ it^yy-v'T -Me t N l+7,604i@(7)^SSSE^Jy — ^ L^o #^^FR&12^2 ,3 22 0 , 2,212#(7)^AAcDNA^ p - y (e o ly x S l^y — 9 y T* & 6 g #DNAT —? / tztmm&jt hcDNAyy ^hw^ibj; v ^^12^3^310 £ X) t £>tzb(DXfoZ)0 -1 - as* amsam NEDO ■ SS#cDNAro«SE?iJ8i$j:#fig te)'")v'7xm%.m (Bt)fr-fgDNAgf%gr $a5fe«#^s# H1 r^fflStl'cDNA‘yVAf£:fflft^Mf§J^Pi;i^l' 1 ##f : 9071 —> J-—A ^±ft : 8951 #E:120 E*4Sf7 —A : 3950 ^'MXWf—A : 5001 A'-TSW Bii-y-j 77v>- ±m mam#X&MM B 3l $;ia a## X>X PX^EX «7» 1928 965 1057 2501 500 1500 500 120 -2- Summary of Full-length Human cDNA Sequencing Project Current market potential of biotechnology is about 10 billion dollars, and it would be 250 billion dollars by the year of 2010, and business opportunities would expand tremendously with its growth. In order to satisfy and accelerate such expectation, so called “Millennium Project ” has been established by the Japanese government in collaboration with four ministries and an agency with a budget of 20 billion dollars in total by the year of 2005. Considering significance of genome information derived from human cDNA, a new project, supported by the supplementary budget, "Full-length human cDNA Sequencing" has started. This project aimed at not only determining about 10,000 independent full-length or long human cDNA clones isolated at Japanese research institutes, but also analyzing partially their functions. The project formation including member companies and collaborating research institutes which also supply clones to be analyzed are summarized in the figure 1. The member companies were supplied with clones from Institute of Medical Science, The University of Tokyo (IMS) and Helix Research Institute Inc. (Helix), and these teams determined entire base sequence from the libraries of full-length human cDNA prepared by oligo- capping method developed by Maruyama & Sugano. A team of Kazusa DNA Research Institute (Kazusa) determined partial sequence of mostly 4-5kbp-long cDNAs, and also determined entire sequence of some of their selected clones by using shotgun-cloning method. Sequence data obtained were analyzed by homology search, and translated into amino acid coding sequences where applicable, and then analyzed protein functions using prediction programs such as SOSui. A large comprehensive cDNA database was constructed to analyze full-length cDNA sequences for functional annotation. Concerning analysis of gene functions, expression profile was quantitatively examined by ATAC-PCR method developed by Kato, and database was constructed. In order to construct integrated gene expression profile, we also established technologies to manufacture cDNA microarray for high throughput assay. It is important to analyze relationship between disease and its relevant genes, and we tried in silico chromosonal mapping implementing genetic information database. Moreover, as a follow-up of this project, we have organized a consortium to utilize the outcome of the project, aiming at protection of intellectual property of cDNA information obtained, such as for example, through patent application with some functional analysis. In this Project, 9,123 clones of full-length human cNDA sequence were determined and 2,000 partial sequence data were also determined. The results of sequence analysis by member companies were summarized in Table 1. Total 7,604 clones of full-length sequencing data were disclosed to the consortium members during a period from October, 1999 to April, 2000. Full- length cDNA sequences of 2,212 human genes were published in the DNA Data Bank of Japan (DDBJ; an international database located in the National Institute of Genetics) on February 22nd, 2000, which may contribute to the progress of international genome research. -3- MITI / Agency of Industrial Science and Technology NEDO Technology Research Association for Biotechnology Committee Full-length cDNA Sequence and Long cDNA Sequence and Functional Analysis Functional Analysis Institute of Medical Science. University of Tokyo Helix Research Institute. Kazusa DNA Research Institute • Kyowa Hakko Kogyo Co., Ltd. • Otsuka Pharmaceutical Co., Ltd. • Hitachi, Ltd. •Aisin Cosmos R&D Co., Ltd. • Mitsui Knowledge Industry Co., Ltd. • Hitachi Science Systems, Ltd. • Nisshinbo Industries, Inc. •UNITECH. Co., Ltd. ■Daiichi Pharmaceutical Co., Ltd. I (Joint Research) •Takara Shuzo Co., Ltd. • Graduate School of biological • Hitachi Software Engineering Sciences, Nara Institute of Science and Technology Figure 1. Full-length human cDNA Sequencing Project < Project Formation) Table 1. Results of cDNA Sequencing of member companies Total : 9071 clones Long cDNA Full-length cDNA : 8951 : 1 20 Team Kazusa DNA Helix Research Inst : Inst. Medical Science : Research 5001 3950 Inst. : 1 20 Kyowa Otsuka Hitachi Member Takara Nisshinbo Hakko Pharmac Fujiya Hitachi Science Aisin Cosmos Companies Shuzo Industries Kogyo eutical Systems H7$t 1928 965 1057 2501 500 1500 500 120 -4 2. %%%# (*). xm&m (*) ZL--T*y* (*) % (*) BSSMtar fob: h cDNAY/A (#) B3[^x>XyXfAX (*). $£i£ (#) - =lX^EX9F%3r yy yy ^--7T y y x& m2 / w y y n -5- 3. flF3e»BteJ:Vfia m 9 E I* s i 5! 1 1 1 . Ttfe* c DNAE?IJS:$St>**€^-togtl(D tto#pisro 2.9 e.±M c DNAE^iJSrSSUiSe^-roS^ro 3. DNAE^IJ^S (#F-S 4. DNABaM*g&m#Gfm4W rL-—x y y# 5. cDNAT-?-^-^m% 6.4*cDNA ro$XSS?lJft$»V*«gE-6- c DNAr—9 —/<—H (# 0 VzSfEBfr 7. A# c DN 8. FcDNAy/Aj|f|f 0 9. ^cDNA©tWfe cDNA#*& SfiiSW MDNA^^nTKfl 1 0. DNA-v^DTWdttSiie^-tSilgro 0iz:yy 1 1. ## c DN A(Dmm@E9iJ%S&«:#### 1 2 . ftE C D N A l 3. ft-&pS#WF9E su / n — 6 — SB 1 * J$*roe$ 1. 1 1. fc hy/AWty^- (f 0f?e0r!S >>-f A-^WSe® ^(*#+577,677? n-yo^-y ^7“ cDNA»5'**ga?ii^fc t e, &- 3- K-f 5 "TBEttOiSiV'cDNA? D — y 5. ilft Sn/c-S(7)*ffl®exicSI LT, £ b i i t)K, Zhb 6. ^*SiM 5r^£LfccDNA?D-yJ81 L, (%# L t. Full-length cDNA sequencing and functional prediction of gene products. 1. As the candidates for analysis of the full-insertion sequences of human cDNA clones, 8,143 clones were selected by analyzing their S'-terminal sequences with bioinformatics approaches from 77,677 clones of the cDNA collections in Institute of Medical Sciences, University of Tokyo, which were collected from cDNA libraries constructed by the oligo-capping method. 2. Among the selected clones, 4,957 kinds of cDNA clones were distributed to the sequencing teams, Fujiya Bioscience Institute, Otsuka pharmaceutical, and Kyowa Hakko Kogyo. 3. After confirming the identity of the terminal sequence of each clone with that used for the clone selection, the full-insertion sequences of 1,928 clones were determined. 4. Homology analysis, prediction and evaluation of the coding sequences, evaluation of the full-rates, and prediction of signal sequences and transmembrane domains were performed in 3,950 clones. 5. Further analyses in their expression profiles and functions were performed in a part of the clones. 6. These cDNA clones was collected and stored in containers for future studies. -7- 1 . 2 mmmm, ie #¥#%#, ies7ji, j^ws, mmm&Qm&znft h h^±#cDNA7< 7 7 U — b7jM $ tlfc^^:5cDNA7 □ — > 0 £t£SE7J £ 77< 7 —7 j: D^^b/co biSlTH,f^>tl7°7X$HgiJt{ligI (Genesis Workstation) \Z £ 0 7° 7 X ^ HDNA^iHl, '> — 7 X/XKS * tA, ¥ (CPE-ABltt®! 07 ^ t: 7 U “ MDNA '>— 7X/f“ (ABI3700) CT« E$E7J£#tSbfco ftEte, EteEE26.5%, ###E f*J&3.0%, IE^/MltoEl7.8%> IES¥JI§3.2%, E^$bE¥^® 8.0%, #38.8%7^6o ¥ cDN AE 6 M 7 E b 7 2kbp <£ 0 l,926bp7& o fco 70 7 t>807 ^ / ^b(70ORF#b^l/2 t 0^1±#:068%7$) o 7. BMJT'te. ¥^fl shies/mi, s^mm¥#m, mm, m, jes/ess EE(DJE^S< , -en^'ni,327bp, l,481bp, l,560bp, l,559bp, 2,059bp, 2,187bp, 2,355bp tt£~D T^&o 5'-###R#%0#$ ^¥^349bp, 3'-###R#%0#$ ^¥^805bp7$)O fc0 ORF0¥*9E6te798bp (S>£ 7 > A° 7 RS2667 ^ / ^g)7$>6. ±###7H, GenBank, UniGene, GENSEQ& ESt7—7 A —X thy b b/clSc^^#:(7)E72%tcE ^7^0 ftTz b, b 0±BB#7j:##8 0Bfl7£7 7—7& ^#:(D^46%(Cf8^T^>o 75%tegenomicE7Jfi¥0 &0, /^TSDNAE^ l^# ##(X7 b <5 KIAA7 D — >43 fg # tE ft] 0 unknown-cDNAE 7J 7 $> <5 „ b 4l b 0 E 7J tiESTth M#(b7:g;^####M%$4l7b^b7 3->7&6. B D 025%^S^Jf7S 0#, f-7^-Xt:Si$tl/;^D“>l:i^t?)o h hESTx-7 te$tiEl687fr (1999.12.17$^) Ii$n7^7 b hSfK705^67S7E^7;5 bSt)tL7V^o 70<£7&¥7££:3iiJ*i^0ESTb t: y hb^t^0^l>§b(hit b07 -8- Full-length cDNA sequencing and functional prediction of gene products Otsuka Pharmaceuticals sequenced full-length cDNAs developed by Institute of Medical Science in The University of Tokyo. We used primer-walking method for sequencing. We extracted plasmid DNAs and purify after sequencing reactions by TECAN Genesis Workstation and sequenced them by PE-ABI 3700 Autosequencer. Clones are derived from adipose tissue (26.5%), umbilical vein (3.0%), proximal urinary tubule (2.6%), small intestine mucosa (17.8%), small intestine (3.2%), cardiac coronary artery (8.0%) and lung (38.8%). Average length of cDNA is 1,926 bp. 68 % of the clones have more than 80 amino acid. The length of cDNA of each tissues are adipose tissue (1,506 bp), umbilical vein (2,059 bp), proximal urinary tubule (1,559 bp), small intestine mucosa (2,355 bp), small intestine (1,327 bp), cardiac coronary artery (1,481 bp) and lung (2,187 bp). The length of 5 -UTR (untranslated region) is 349 bp, that of 3 -UTR is 805 bp and ORE is 798 bp long on the average. Homology searches were done against nucleotide databases, such as GenBank, UniGene, GENSEQ. 72% of clones show homology with EST, however, few of them have annotation about their function. One quarter of the clones matched registered clones after end-sequence selection. In summary, about 30% of clones have no homology with ESTs. Conversely, 19% of clones match patent database and half of them are not registered in public database. -9- 1 . 3 *1*1,057* D->^gH^lJ^^^#tb^o 7 U->(7)|*Iim> katoIIlEJI£fi*(kat)4107 □ - >, (kaia)9* □ ->, 1ESAMJ&ti* (coif) 148* U, W M±&;Mfi;fc(hrc) 253* □->> SM¥MMEti;fc(cas)827 □->, fifSSJ9&6 3fc(hsi)l55 7 D — >, In stl,057* □ —7T&-3 fc = 1:1 uit 96 a y°u- httso^-Sv-a.T;v&> oiAGENkitm^^^!t#(#Mu^m, wmatm, # %m&fe®Mn,05i\zm Lrcz-t-v bsue l,000£r* 1J y bfdo ^SE^J BB^iJx-vfe>yj < gB?iJ£j$IS£-ti"SS/^pJIE&ATGCV 7 h yxy^s/su mmit, mmt, Mmmtttm^tz. y- yfi^^L/to c(Dcan, am ® 7 n - > 0#mDNAmm, 7-7x77^^, y-^yyt>yu>y, ammm# gB^j(D##^6^^mmgB^A#mm^%(D&mm(7)mm^mfb^gi6a, 1,057® ©7 □ 6##l,798,208%&(DgByj^##^^L^o ^ nB M 1,000® tl *f L T 105.7%© ±B(D&oi-^ssn/tcDNAgBym> r#%%k ScDNAyy ^ mfumn >v-yyAmi:M^2:Ra cat: 1^3 >v-y yAmm^gByjm^, gByj^^y-A^^T 5,i53> 2,129(2000^ 2^AB^&)T$,*. i>#DNAy-^/X>71:T^MT6C(!:l:j;3T, 51^ E^lJ^^y-A^#:T2,212, ^340 (2000^3 ^ 23 B T $> Z>» -10- Full-length cDNA sequencing A total of 1057 clones newly screened by eliminating the repetition of the same clone as previously known genes were subjected to sequence analysis. The new clones contained 410 clones derived from katoIII cell (kat), 9 clones from normal small-intestinal mucosal cells (kaia), 148 from normal colorectal cells (coif), 253 from epithelial cells in the renal cortex (hrc), 82 from coronary arterial smooth muscular cells (cas) and 155 from testicular cells (hsi). To prepare the template DNA for sequencing, we compared the efficacy between Fujiya manual method using a 96-well plate and QIAGEN kit method. The latter was found more appropriate in respect of convenience although both methods were available for preparation of template DNA suitable for Sanger's reaction from quantitative and qualitative aspects and data of high quality were obtainable. The number of clones of which sequences had been determined reached 1,057, which was beyond the initial goal number of 1,000. The max and min lengths of those sequences were 4,690 bp and 346 bp, respectively and the mean was 1680. Determination of the sequences for 998 of them were made only by primer walking method, whereas that for 59 clones was done by the method in combination with random method such as shotgun or transposon insertion. Assembly of the sequence data was made using ATGC software, which allows to rapidly and accurately combine with the sequences and the sequencing became high-speed, convenient and highly accurate. By starting unidirectional walking from both ends strand-independently, it became possible to design a primer with ease and few artificial mistakes, and manage the project, indicating that the walking strategy was effective to control the status of clones and reduce the burden to construct and edit the consensus sequence assembly. Shotgun and transposon insertion techniques were introduced since those were able to read regions that were difficult to read because they contained some repeated sequences of which primer-designing was impossible or sequences which terminate the Sanger's reaction. Transposon insertion was mainly used to cover the primer walking method for 59 clones and shotgun method was further applied for 4 of them. For the unreadable region because of a long poly A stretch, five different primers, (T)19 (V/A/C/G/T) were additionally used to make complete determination of both strands with accuracy. Speed-up for the respective steps of large-scaled sequence analysis including purification of template DNA, sequencing reaction, data assembling and accurate construction of final sequences were achieved. Thus, base sequences for 1,798,208 bp in total from 1,057 clones were finally determined. This corresponds to the achievement ratio of 105.7% for 1000 as the goal of clone number to determine. The outcomes of cDNA sequences analyzed as mentioned above were disclosed to an utilization consortium, leading to an establishment of intellectual assets. The total number of sequences disclosed to the consortium was 5,153 for all sequence analysis groups and 2,129 for group IMS alone (at the end of Feb. 2000). Thereafter, an international contribution was made through opening the outcomes into a public DNA data bank. The number of data opened to DDBJ was 2,212 for all groups and 340 of them were analyzed by Fujiya ( on March 23, 2000). 11- 1 . 4 5S±ScDNAE5iJi*5g£tfae?g#)CD«»ll6^;!->7.xAroM5S sosui&i*© #rL©$@A. raSScDNACSLfclBff ->XtA4, #»©CPU &iS*LfciStt|g7 ‘j;l7-XXT--y a >£, !/--> a A AvXAAti: *95000* ©cDNAEf>J£-ffiLTl~ 2 0 J£bJI6T*-5„ *«*0X37^> LTA#©T-X©4= A^6##©&V— Tl>5. &^##©f-X^-Xlc79bT#M###&e^o)hlK*E*3 C£K j: 9 , Orthlogitfs^ ££>*■ 9 < #ST#5<, cDNAEfJ £iM IC*:* L fc © 5 . yy AEfl- ESTEfJ. 75 y SEMS^t X y?:#*bT!#9#lt. X37©S^l: -12- Development of functional analysis system of gene products for Full-length cDNA sequencing In this research, the system which automatically did a suitable analysis for FLcDNA was constructed by using suitable relational database (Sybase) for the handling of an efficient workstation equipped with two or more CPU and a large amount of data including a new technique such as SOSui in addition to a traditional technique like a homology and a motif search, etc. This system can analyze the FLcDNA sequences of about 5000 at high speed within one or two days. It also be possible to have a look at an analysis result by extracting necessary information. Moreover, not only the public data base such as GenBank, SWISS-PROT, PIR, etc. can be automatically imported, but also the newly developed analysis method can be added to the system very easy. This system can be divided roughly into two parts, i.e., core program and the interface program. The a core program where various function prediction programs are executed, a huge input FLcDNA sequences, and the results of function prediction are efficiently stored, retrieved, and displayed. (1) Core program The core program is roughly divided into "Nucleic acid sequence analysis part" and "Amino acid sequence analysis part". Each analysis program is automatically executed, and the result is stored on a relational database (Sybase). In the N.A. part, the following programs can be automatically executed, i.e, a homology search program (BLAST2), a detection tool of the repeats, the function analysis tool of protein translation area (detection of ORF, Testcode, homology search against EST data base). In the A.A. part, translation program (Translate) and other verious analytical programs (SOSui, Motifs, ProfileScan, CoilScan, HTHScan, SPScan) were implemented. (2) Interface program The interface was designed to handle a large amount data with high-speed process.The user gets on this system taking the FLcDNA sequences in the lump by specifying a storage place through the network. As for sequences, it can be handled after dividing them into two or more groups, and the addition of other sequences is also possible at any time. The user only selects sequences and the analytical programs, and all processing will be automatically done. Each parameter of an analysis program can be arbitrarily determined. The score and the character string of an analysis results become a key to add the examination from various aspects to an analytical result. The Ortholog relationships can be displayed comprehensibly by the result of homology searching against the data base of various species (human, mouse, worm, E.coli, etc.). All the results were displayed linear in order to overview the whole results together including homology search against the public data base of genomes and the EST, and other amino acid analysis programs. Moreover, the result of the amino acid analysis to the protein translation region displays the part to which the function and the structure was identified after the translated area is displayed linear to another kind. In addition, three reading frames can be translated into the amino acid, the result of the homology search executed for the known amino acid be arranged, display, and the frame shift may be detected. The progress condition of an analysis each program can be confirmed at any time as a system management tool by the job monitor. 13- 1 . 5 *###TmcDNAT-f r*l8»t hcDNATVFL7"D->';c7 8rS^S*cDNA7 o->®±#E9i|A%^e*fi6. ctuiimcE^jyt $2*1 Site ^dtSBgSfi&ff 7lc$,/tO. ■5-ffl*a^D7T'f U & 8«riHTofEaisss5*siLt, 7-+f> • 7'n-y -# WT-&-5S* #J$lcS*58#PiH, RNAS43 *1*3 x K0,'S^B*yn-7x7 h® ±3 tC#- ATAC-PCR (Adaptor Tagged Competitive PCR) B. ^'MOit > ~f f^ip S iaSST^S!# **$*fcU £6£«ft«*¥K*¥®OTStitW± (Cj;oTIS%SnS:g*tt6Cit$#?T$.%„ *E%TB, ATAC-PCR®>\< X *- 7' 7 h- ft'>XxA®«ISSrff FL7/n-7x7 h B1W91-ATE?iJ9i$2 fl7c 7 n- >® %8 fillg*iRWL, *(®#¥lfrffi©cDNAx-7*-X*#fi£-f 3 d tSSSi Lfc„ A -f X ;lr - 7r 7 h • yXfifflliCtt, 96* to^-yfcfxU-Sr&ofcBiii DNA Analyzer*#* U +F > 7P;U® M *96*e?¥fi-PfT X 5 * 7 CSC# L ft. ATAC-PCR*ffl®Sfi^#SS9*,7'f 7 —StoEffV 7 h^f-^fflSV 7 h& 7S1V7 h©E%t>ff-ofc» 2©iBS, 6B86Mlcd3B4«®e®ievt%ayn7 7'f * *ig$fc0l00»fi^lC7iVrTS**5^ 2*lB, / -If >7’n 7 h &*tt#LTK + (g®*m#e* *£, ATAC-PCRi /-+f >7'D 7 ht© tkKStBtrfrofc *2 3, ATAC-PCR® fi A MICE A 'Stlfc. %$*$»!£**$*®/ts6c atac-pcr®S«t6](Ditlsi1® 6*B<$*l*i§E#zp* L/t#IS®S6ff, (D <*»©iMR»*©«ffl, ©8J$W®1f >:/$«S®&M. ® v tv --> g > • *-7 ’*^s£L, 2 *1: *oT##%©A *7 BfJofc. B?W® 7 D-/l;7UTIi, E^H*$®ESi:* 6ICATAC-PCRC =k5SI6*ff ®- *#%1=&©TB. 1739afiT®#J6^6941*e*ICO®T, 7B«Po1®fga^n 7 7 -i *%-/0 7 7<;l/d3*aE^#*A^5®0RF*#m*®##*@*: ^5., «i6»«McDNAx-7*-X*##bfco C®###®A®T-7*-XB#a me?®####t:&/co, ii#i:*oTt)*»T#(8T*9, 4-s®8gJ!i»^n •5. -14- Development of cDNA database for functional analysis In the 'Applied technological development of strategic Human cDNA' FL Project by IMSUT(Institute of Medical Science, The University of Tokyo) Team nucleotide sequence of newly identified Full-length cDNA clones were determined. As a functional analysis of novel genes, expression profiling is extremely important. Although Northern Blotting Method is the most reliable and common method to study gene expression profiling between tissue, the method is difficult to be applied in the case of this project which studies lots of genes, because it takes a lot of time and cost. ATAC-PCR (Adaptor Tagged Competitive PCR) developed in Japan by Dr. Kikuya Kato (Nara Institute of Science and Technology) only need little sample, is highly quantitative and sensitive, and this method result in high repetition occurrences. The research constructed the ATAC-PCR High-Through Put System, and FL Project IMSUT Team collected information on the novel clones, and developed a database of functional analysis. In order to construct High-Through Put System, UNITECH introduced high-speed DNA Analyzer with 96 capillary and made it possible for each unit with 96 gene to handle all samples. Also developments of gene specific primer planning software, data processing software and data control software were made for ATAC-PCR. As a result, we made it possible to fix quantity of 100 gene per day as highly reliable expression profile between 6 tissue. This is by far, reliable processing ability as compared to Northern Blot Method. The comparison experimentation was made between ATAC-PCR and Northern Blot Method and the result showed that ATAC-PCR has greater detect capability to Northern Blot Method. Expression level of newly identified gene is generally low and in order to find fix quantity on a stable basis, ATAC-PCR was technologically developed. They are; l.to measure gene of the original tissue as an internal standard, 2.use of multiple restriction enzyme, 3.examine the concentration of each sample at the measurement, and these studies resulted to improve the frequency in acquiring samples. Also the use of three-step concentration from original gene of internal standard enabled to make calibration curve, and to get high standard data which brought the highly reliable fix quantity samples. The clones distributed from the IMSUT were studied with ATAC-PCR, and for this particular research, 941 genes out of 1739 were used between 7 tissue to get appearance information. The ORF estimation from nucleotide sequence, and ATAC-PCR analysis result were put together as cDNA database for functional analysis. This exceedingly reliable database will be useful for the development of functional analysis of new genes in the medical industry. -15- 1 - 6 ±#cDNA(DmmaE9]&2:t###ecDNAf-f-/<-XS/%:rA(D4#m *5*£ttEl2Stt3r Ltz.rn.fci-(Dmmmwit z && ;m LT. ^ y * y t h^scDNA^ 04t>fxy^^fAX, 0 t&y LT. 5,001^ n-ycDS ®EyiJ£fc£Lfc0 0 £ gift iff T? ft. r <7)\H2,50\? L/c0 $ fc, ±%#A&cDNAT —^ A#cDNA(D##^T y T —y 3 y&iftfifrSt-fT 5 ft«S:M|gi"5 r £ ^ J; >9 . Lfc£gcDNAK?iJ£te*A (1) ^ficDNA(7)BflyiJ^:S ##gi^7 y “DNA'>—yyy —<7)^At-«t o , y 7 yyy-^-Eii-^oiigo, sayij^r -Exy/HcSequencher&mV'^yy^ i& S E yiJ ^ i£ £ LTE ffit/co mm&rn. y &~y (2) ±m#^C^cDNAy—y ^^7" A(D## 1) ±m#y-y -<-;%##?)im#mife=i Hyiyits#!©)!#, RTfMVri&miz.m^'it&mmM&ft V\ r cb^^T^^y^yA^^I^L/Co ^*DB3£^<7) £«><£> g i&T y y — y 3 yy*y a £ lx n, DDBJ^o^ftcDNAy — y iftfflor-y — ## y y. y A ^ L T ^ orft yy — y 3 yyyy an y —y^< —7.y — y (-&<(<##Tyy-y3yy%yA, ^y^xyyyy-yayy^yA^M^L, y 2) it, ±m^y k°yyyy y-y a yy^yA(D^m(-, 3) 5,000 y o — yw^fcDNAC^t^t^f-^ 7, t LX. _hEStify —y £: E^#J -16- Sequence analysis of full-length cDNA clones and construction of a large systematic cDNA database The purpose of the project is to construct intellectual bases in Japan by analyzing gene functions based on the sequences of full-length cDNA clones which were isolated at Helix Research Institute (HRI). The project planned to determine the sequences of 5001 clones at Hitachi Ltd., TaKaRa Shuzo Co. Ltd., Hitachi Science systems Co. Ltd. and Nisshinbo Co. Ltd. Hitachi Ltd. analyzed 2501 clones, and also we constructed a database system to analyze full- length cDNA sequences for functional annotation. (1) Sequence analysis of full-length cDNA clones High throughput analysis system was established by introducing some instruments that is Hydra micro-dispensers, Gene Amp PCR system and multi-capillary DNA sequencer. Primer walking method was used for the analysis. Primers were synthesized based on the selected sequences from sequencing results with a software of Oligo, and data were assembled with a software of Sequencher. We reported the sequencing data of 2501 clones after solving several problems resulted from sequences and clones. (2) Construction of a large systematic cDNA database 1) We developed large-scale database techniques including a data pre-processing system, a registration system to public databases and an integrated analysis and display system. We are able to confirm sequencing accuracy, estimate a start codon and edit sequences based on the analysis data using a data pre-processing system. For the purpose of registration to public databases, we constructed a system, which helps registering full-length cDNA sequences to DDBJ as auto-annotation system. ORF annotation system, a functional annotation system based on the database search and a mapping annotation system were integrated systematically. 2) We constructed a system to identify disease-related genes and polymorphic region based on the mapping annotation system described above. This enabled us to pick up the sequences of the related genes and information. 3) An integrated analysis and display system was constructed to analyze the sequences of 5001 full-length cDNA clones. -17- 1 . 7 ±#cDNA(DS*aE9i&2(D#9E ax ^ v y ? T. ^ y (4^ n->/) ^#mi-6^^L-C^-C##Lybo a#&y7 7-7(D#@3: o tz\fi) tz Wc L o *#%e@L-C. -fy^-r X 5Ktt£0)#tt:ll£|l£®tfc-f-5 — ? ^ y \/ 's Jf&ffli & x $r)VT y y-f-S L Sequences analysis of full-length cDNA clones Our objective in this project is that both strand sequences of 500 full-length cDNAs, which were established by the Helix Research Institute, are analyzed by using the primer-walking method. These cDNA clones are originated from the libraries of the established cell lines and human organ tissues such as NT2-cell, ovary, placcenta and thyroid. The sequences of all but 36 clones have been read without troubles. The templates of the thirty six clones have obstacles such as lots of false-primming sites, secondary-structure formation, impurity and long repeat sequence. We managed to read the sequences by long time-consuming tasks in which lots of primers were designed or subclones were made by the shotgun method. The difficult sequence intends to need lots of primes. The ratio of designed primer number to the useful one for each clone let us know that NT2RM4, NT2RP3 and THTROl libraries may have difficult sequence clones. From these results we established the accurate sequences of all 500 full-length cDNAs and also improved our sequencing ability. —18 — 1 . 8 ±ficDNA®t6SE9J3fc$ *e>ti>3 3Ef5rJS#itfSDNAl^;|/T-8SilWT£-5=k5 ffiox, bh8e^c>f£$BE?iHiim, e« >X6b*®|S]±S0-5„ M #: 69 C It 10,000bp/day T'&-z> fcMtirffeJ] & 35,000bp $ T Bl £ _h If •5. ^S6cDNA©#*E?JftSiaHTro=t x icffofco 56f, -\'J y?XiMi'5>SS bfcMftJ: D ->-7->X£jS©«ffli&.|>X‘7X3 FDNA£M8bfe. 7#C, -/XX5 KdnaCX □ —- >X,snTtA57c3::6cDNA©5fcSt6SE?|J6 8t6b, # BflfcSSE fij&mcxxf • if/$bzt. cfi&miATtab*i=*;sbZ:#*Em=# v-«K4BlECbfc. «*», HPLC£ffl b/t #87)1-2-$T$> 0 , |s|#l:#.» ©XX'l'7-$:#84 5©tt8©TMT, 6'7ltffiAsW'oTb£. 4©7bA, 77-f 7-0»BAiy--)->X»fff©*acai.; tai^MT^fcOT, d; 0 ffiflE7)^ffl®Zj: X x< 7-#8%&*ab7b. ttt08S. *f7bC#Stbrcffig*-h U -y yT0*SJtt C± 0 , 20*©:/7f 7-£30»T«B-r3 st/SHiJIIC&ofc. ciiia$E*S©6f8©S $T$ 0 , #$69%%*Efil©##C*#<##b^. se-c, =ko#e7)^oamt:%*EM&*)g-#-&-x< , pcrki $, y;pE68-^'>--y > X*#CO©T©#M&lf oTh. —-St, y-'r>X'rJH:-l-*A75 h'5: #9of &CCj; 0. i«GC®*©E5iJ£J; D»EBC&Jt-f-5 S $7c, X-X-> XKJStotH X ;l/mS7&U7< V-9--x;l/tt-C©->-X>XSS$fi:'5 AtCiD. ##%### UAEm©##ca#69b 7b. SBC, #$©Au#*E9'j©*#tg Nb/t. $7c, SffcCABI 3104-7 27 D-v-dr >+)—£* A b 7b S tea D, #C/\- FffiT ©###*7)17 "j -fbtzMfS' D T& < , 4 CfflV^fcLI-CORttSas 4a-'ABia8©8-'> —X >-9--©#tt&S^bTfRE4ff 7 CtCJ; 0 , «xnc«ff^feC*i:C7aiP7-5St*i -19- otz. a wft b O^rt^tTo fcJ&H. ft) 10,000bp/day "C - >r y X il& A £ 35,000 bp/daytc $ "C 31 #-t If 5 Z i: & t±j^/i0 t LX. 0 b Sequence analysis of full-length cDNA clones The molecular biology, which has been drastically advanced in recent years, has opened the way to explain the mechanisms of many biological events at DNA level. Thus, information of human DNA sequence seems to have a potential to create novel businesses, such as developing new medication and gene therapy. Therefore, it is necessary to promote characteristic analysis related human DNA to lead the potential market. To address this request, the strategic analysis of “complete ” length cDNA, directly linked to the functional studies of proteins, has started as a national project. In this project, we Nisshinbo planed to sequence 500 complete-length cDNA clones, constructed and retained by Helix institute. In parallel with this analysis, we also had a plan to enhance overall sequence capacity up to 35,000 bp/day, 3.5-fold increase from the beginning. The complete-length cDNA sequences have been analyzed as follows. First, we prepared plasmid DNA that contains complete-length cDNA insert as a template for sequencing analysis. Next, we determined terminal sequences of the insert (first run). The sequence data from the far end of the first run could be used to prepare the primer for the second run into the region of insert. In this way, the new primers were designed and prepared around 100-bp from far end, and applied for the analysis of subsequent sequences. This process, called primer walking, can be repeated many times to sequence extensive tracts of insert cDNA. Finally, the whole sequence of complete-length cDNA was determined by assembling all fragmental sequences that were obtained by the primer walking analysis. As a result, we Nisshinbo successfully finished to sequence all 500 clones within the period of this project. We established the new simplified purification system of primers, since it requires considerable time and complicated steps using conventional HPLC method when deal with a number of primers. Thus, construction of new system was an important factor for rapid sequencing. The new system using a simple cartridge enabled the short-time purification. With this process, 20 primers were purified within 30 minutes. It allowed 6 times speed up and significantly easy treatment, and enabled effective sequencing analysis. To determine the sequences more precisely and speedy, we have been made improvements in PCR condition, gel composition, and sequencing condition. For examples, -20- formamide addition into gel has brought better results for GC-rich region, and non-cycling reaction named isothermal method was effective in sequencing long polyA region. On the other hand, introduction of new ABI 310 capillary sequencer has increased sequencing capacity, moreover, taking advantages of individual sequencers was useful in accurate determination of difficult region to sequence. As a consequence, we successfully improved the sequencing efficiency from 10,000 bp/day at initial to 35,000 bp/day after the conditions optimized. In conclusion, Nisshinbo has successfully determined all 500 complete-length cDNA clones and improved on overall sequencing ability to 3.5-fold increase from the beginning. It means that all of our purposes for this project have been accomplished, and our approach to this project can be achieved because we produced the expected results. -21- 1 . 9 £6cDNA0fflSR9J9fc$> cDNA^$g#$lO##. &tfDNA"7-f £ □ 7 flFll t$«r ¥^10¥SNEDO^3^:SlE^STfTtin/r hcDNAy/Afbm##M5Ej ^ U rf^-f v 7° & £ < — X U 7 n — > ££ STc^ScDNACDSSEyj&^/^EE&E^L IH%-t-7T$>5 = QSEja (#) A u '7 7 XE^lE^T5,000{@ 7h. ®/^100%(C#^%(AgBW7 PCR^^#(7)4%#^# L < M^h(D (±fo(D6%& T> E2l Hi % ME FbI ft ^ 5,oooSE cd cdna Sequence analysis of full-length cDNA clones , Establishment for stock and delivery system, and DNA microarray process technology The main theme of operational technology development of human cDNA genome application, which is financialy supported by NEDO grant 1998, is sequence determination of full length cDNA made by oligo cap cloning method. TAKARA Shuzo Co.,Ltd proceeded with three project, namely. ©Sequencing analysis for 1,500 full length cDNA clones from 5,000 cDNA discovered by Helix Institute. ©Establishment for stock and delivery system for 5,000 cDNA clones discovered by Helix Institute. ©Amplification and purification of PCR fragment from cDNA for making DNA array,with the corporation of Hitachi Software Co.,Ltd. The result is as follow ® & ©are completely finished and ©is finished at 94% . The reason why ©is not completely finished is that 6% of all clones has very poor yield and we could not make all the PCR product in the period of this project. -22- 1.10 cDNA77 7 □ 7 U7 Ic«t5 HSV7 h»i7i>y-7U *E5ETtt. 'x'J y 7XE%0rl:T#6n^h hcDNA^'f 75'J C'fi'?., 5S±S*s £fceStl7c5,000H©cDNA7 D — 5cSficDNA"7'f >D7 L7 (EfiK&WtoilliE i, -?7 » nr I/'HC j:%%S)Rt1r*?iro#iERy:seES BlSi: LTV>5.. CCTil. i6 s.R&DNA-r'f zui’V'isen. satt©ff«. &tf»e y^BSStirtoHiESfTofc, DNA7 7 ^ □TK ff J5$ftW©tlu ltt&Zc D5,0007.#-y H6i$S£Lfc7l/7©ff®©£ettti:, 320~3507 l/7/fl tz. $/t. f#^b^7k7©7%@M^#%#ie (ttffl-iJI67V"fZfffi£7l/'f) tt¥i$ 93% tti-Dtz (lli)©fFST48ttfF/$) . btt 6 =fc 0 , 19 —&an*^3>fi$gtt©i6HDNA 7 7 7 □ 7 1-7 DNA@$ffl3- MrilUfcXrH K# 7 X £5#E<£ffl V. LfciBS, ^ V -L- U 7>n- hySeL-TM8tt©A(xca^*UML^. #5%j; 0«i@bZ:MfW]©a%:6 77 >n7H JttBlA'f 7*V 77-fer-Zg >L7cSS %, 7 1/41 3#S©fii$©7 —y-y hDNASSSfflViT/N'f 7U 77 if— -> g > lfc£8S, iSi$© SSg&EBSSJlto LZt. 2wg, 5Kg*©mRNA^6i@SLyc7 —7 y h DNAiSESrtil ©T A7 7 U 77-b'—-> g >£fi7A 7 —y y hDNAi§f$6JIBIC#il&mRNAiitti;2«g£fi7b *5fi75:Mf£;B#M£#ISbfc„ A<7vy -23- Development of gene expression analysis technology for the use of DNA microarray This project uses fully sequenced five thousands (5,000) of full length Human cDNA clones among the cDNA libraries obtained at the Helix Research Institute and targets to establish technologies and techniques to produce DNA microarray for the use and verification for gene expression analysis of those cDNA clones. For those objectives, we have studied conditions, stability, and reproducibility to produce a high quality DNA microarray, and proved its effectiveness for the gene expression analysis. 1. Establishment of a DNA microarray production technique A quantity productivity of a DNA microarray with five thousands (5,000) DNA clones on one microarray was three hundreds twenty (320) to three hundreds fifty (350) per month. And a ratio of a usable microarray to defective one (yield rate) was average 93% (forty eight (48) microarraies can be produced by one (1) production operation). A fundamental technology for quantity production of the homogeneous and highly reproducible DNA microarray has been established. (1) The poly-L-lysine coating was found out to be the most stable and reproducible method after the comparison of five different kind of DNA stabilization coating method. (2) The DNA microarray lasts for a little less than two months after the production, according to a series of hybridization experiments at a different date(s) from the production. (3) The most suitable composition of the target DNA solution was found out by comparing the three DNA solutions at different kind of composition. (4) The most suitable amount of mRNA was confirmed to be two (2) fig, after comparing the hybridization experiments with a target DNA prepared from two (2) and five (5) fig mRNA, respectively. (5) The most suitable hybridization time was confirmed by comparing hybridization results at different time period. (6) The most suitable hybridization washing condition was confirmed by comparing different kind of washing conditions. 2. Evaluation of stability and reproducibility Randomly selected DNA microarray was applied for competitive hybridization at the same experimental condition, and dispersion of the results was evaluated. The results have confirmed that the DNA microarray production and the efficiency of the hybridization show the high level of stability and reproducibility of the DNA microarray. 3. Verification for the use of gene expression analysis Two different kind of target Human DNA (Liver and Brain) was applied for competitive hybridization against the produced DNA microarray, and the measurement of the amount of transcription of each gene was attempt. As a result, a gene(s) expressed only in the brain, in liver, or in both was detected to prove that the DNA microarray can detect the variation of the gene expression with high reproducibility. -24 1.11 6EcDNA<0i££E$im$&tX8te«*Tlc|!|-f fc5££tt7-f • 3X^EXE%9r ^ LtlI^cDNA04J1:\ t M>S(7)SmcDNAc7)j^SgdyiJ(7)^:S^ff ofCo £ /c, cDNA S L^o SEcDNAOj^SSByiJ^^^X^ft, t h t>E(7)SEcDNAy d —y (4—5kb) 2000#(DM ^(D^E^^&^SrfTofc, r (D^^SaM(Dtf^^^Jffl It, (D##y d - y 120#(DilglJ^^^ L/^o P-yl-OV'T, y a y hyfy?£lc«t *9 ^SSEyiJcD^^^rff V^x 120#(7)j^SEyiJ^:^^^T L/c0 E^T-^ld:, Long length cDNA sequencing and development of new technologies for functional analysis In the analysis for genome, it is important to analyze cDNA as information for the investigation of gene function. We have achieved the sequencing of long length cDNA of human spleen that Kazusa DNA Research Institute possesses. We have also studied the technology of targeted end-labeling by using three stranded DNA formation and detection of fluorescent signals, as the basic technology necessary for making the DNA chip of long length cDNA. We accomplished the sequencing of both 5’ and 3' end sequence of human spleen long length cDNA that has 4-5kb length. Utilizing this information of the partial sequence, we selected 120 clones of the possible new gene by comparison to several data bases. Then, we completed the whole sequence of each clone by the shotgun method. These informations of the whole sequence of long length cDNA are useful as the new data base for the analysis of gene function. Using the end labeling method for DNA with three strand DNA formation directed by recA protein, we studied the targeted end labeling technology of long length DNA and the fixation technology of the end labeled DNA on a solid surface. We could get the high efficiency of the end labeling through the optimization of the reaction to introduce efficiently an amino base into the -25- one end of a long length DNA strand. The optimization of the fixation reaction was also carried out to fix efficiently the end labeled DNA on the surface of a slide glass with a covalent bond. As a result of using new technology contained the end labeling and the fixation by the covalent bond, we could obtain the same detection capacity as that of PCR method. In the study of the detection of fluorescent signals on a solid surface, we constructed the trial detector using cooled CCD camera and macro irradiation system. This trial detector possesses the basic performance. -26- i.i2 aacDNA »efme? «*M#e>nntf. 7«&©#»d e&s tJBtJtis. b holes -f-j'^-XOMIM»^6$«lcE-fal*S$5ISm-rc«. *EflS®fi¥69#HT$.a» e#:A-> KIWS^WfflT'^a. fit, E?iHi$e*IC3 >ba-5'l: j: «$r*65ln SilicoSS&ftT -y b > 7*©#ft £&W L ft. *£#:*&+»* 1C it. MBitfi o, bioscouT-esmufc L T, UniGene £ GeneMap©* IR&K^ftLfcSa^J Mi T y y*#©T-^<-%# L^b. *iCr-?M«fil:*#it sain Silico*6#:t y fcf > d/SrHISttif S £ itfffijfefc. me? a;bAic». y y adnah ©d- >y;b e^f njxt &&. a€tt5$S© Development functional analysis of long length cDNA In case we search for relationship between gene and disease, genetic information is very important. If we can get chromosomal band location, then we can get the information about human disease information from OMIM. We build the integrated genetic information database by bioSCOUT. UniGene has chromosomal band location and sequence data. GeneMap has physical mapping information but does not have sequence data. We need the sequence data when we try In Silico Mapping. So we extracted the sequence data from the integrated genetic information database. We built two kinds of databases. One database is the sequence database including UniGene and GeneMap. Another database is the integrated map information database. We established In Silico chromosomal mapping system. When we get evidence for relationship between gene and disorder, we need genomic DNA samples. We collected the 20 samples of 5 families, which we got informed consent. We extracted genomic DNA from 20 samples. -27- i 3 Horn###.?)##. ^#$m ^ m-o fL. |gP^fb(D#^(caj^LT^igM#(/)#%# SrfT o tz-o *9 t. t tbfco j£-?u'y F L, &W^(D%}&}MiM1t&\xl 5 tz. &6(-8^ 27 0 3 v y ”— v TA^iiifco 5Fj5£l2¥2^ 22 0 , 2212i@(7)^^ScDNA^ cr - > ^ $ gd £'J t# $ £ S^X - 9 "<“ * "C^)6 0 *DNAir-f/<>f (DDBJ) (ddkMLfco To promote the project, the technology committee was established. The committee consists of seven members. Director of the committee is assistant professor Sugano, Institute of Medical Science, University of Tokyo. During the fiscal year, the technology committee meetings of all 12 companies were held twice, usual technology committee meetings were held five times, and sub-committee meetings were held twelve times to mission complete. Collaborative research agreements were made between RAB (Research Association for Biotechnology) and Institute of Medical Science, University of Tokyo, Helix Research Institute, Inc., Kazusa DNA Research Institute, and Graduate School of biological Sciences, Nara Institute of Science and Technology. R&D information was collected by attending scientific meetings and international symposiums, and technical information was exchanged. Member of the consortium was 33 at the end of March, 2000. The information collected by those attendees and the project results was summarized. A consortium was organized to utilize results of this project on 27th August, 1999, as intellectual property rights such as patents will be protected. Full-length cDNA clones of 2212 human genes were prepared and the base sequences were determined. These sequences were published in the DNA Data Bank of Japan (DDBJ; an international database located in the National Institute of Genetics) on February 22, 2000. -28- i.i4 tStolBSlStto 3CmE2l0T T194-8533 3C3fCfWEBmM0T3-6-6 T 771-0192 SSW@Srfl)Hl*|IITj)ll*aSf 463-10 W^F-E T 257-0031 #$)IIWSSr1?Eg228 siSSgR T164-8555 %RS43EKS41$f2-7-14 3-3. —y /? (#) T 277-0005 TH*#TStt^'l,fS367-2 T185-8601 SRS@^erH$^^Sl-280 n-w?i® T312-8504 ^ m % & fr fU rff T 882 WBSfO-'l'IX'XxXxAX' DNAliWS T185-8601 *S:$E»eS*S»iai-280 «S)B3t«fE@r4I*E%BrlS xXXh U T312-8504 *tti»U'7tc'fe&A>rUTffH&882 W B 0rMI2§**B&ft 0 im### T123-8585 lM$Si£EBSf#*eTl-18-l -29- T525-0055 Sgfg§##2257 T 240-0005 #^r#^±X#E#FBT 134 t: X A— X 'i — X h X 7 — 14F (#y< >> • nx T292-0812 f ##7KM#r#^#1698 + >-fe>X“ 3.4#' )(']#*% T 448-8650 #^0#)('J#T#A$fKr5-5O 101-0002 =P##^#m5-2-ll #^#2bf;i/ m-mmm mmmnm T134-8630 SSCfBEFJHE^S @1-16-13 ■Mm%m 292-0814 —30— i.i5 BFstsasKe & ^ Mum • s«« E%J*B [#*%#!##] h a tii *«E%0r ^^ScDNaE^'J^S BB g«E5ESr ittW^E* &txa€5©g#®#ig^fl'i tfjn @ ift«W?Wi E%« si* Mm SSff%9f E%M *1* IE til E%» Men M ggESSffi E3EM 45 *f ggEFESI E%» #0 % $«E?50r E?EM ;n% j*«E?E0r E3SM 1% ^ fnft g«E%m sum WEB SSESERr S|g« gSESESr SlEtt /MB gf * mW5E» SIB8$ ftMte. 6AH g«E?Sf3t Stgtt mm iff* jKmESESt #%#(#*) mm # gSESEffi E?EM(*K) *m 5 g«E5E0? E3E*(«ft) ^±ftcDNAE5i|tt56KO?a e± it ±#GEN#^Bf 0f§ Gf #| mm A ±#GEN#^Mff /j'» g# *«GenEFErH E%* CWV-S] «• wmxESEm-mft 7cA#cDNA?IJ#t% Llj#IE% A'mmxESE9r-£tt A--WmxESEffi-±tt A* IWimEFEBt 883#? A'-fMi/XE%0r-£tt. ^SScDNAE^IfteSDt # 3# V')i-y3/*®V «yXfA86ESft8 A®M% ## EJa V'Jj-y3>*S/V^yXTA^ V'Ji-yay^^AMtyXrA^ —31 — & £ am - ft— M V'Ji-y3>*gBA' ^-yXfASBS LilP 9S V'Ji-ygy^^^^V-^V'Ji-ygy^ [X——t v 7 ] Pimmi ###tff^cDNA \P E^m x — 7 ^ — 7 fF np #5%m /hS «EE^il Am a# mmmu Uj* #A mzzm^ m$cM%m #* m E^m [(WB*»ffr9?J £6cDNA ##It rJ ^^E^ AfA^AxAE^^# Sill #A ^AE^E £{fE3£M pm ^AE^E -32- R £ mm• e^b M mmE^mr AMR e# /J'l^ mmE^m mnm mm ^^mnm E^m *# s— **kw$cm AEE55R mm e^e^e AEE^R I'tm mmE^m E^R ## **E$Si?r AEE^SR DNA##' r EES® 5 BE DMA##! i AMR me DNA##1 i #a DNA##] r ^gfp ♦tin DNAM! r AEKM mn #R DNA mWi [ AMR MME DNA##1 [ AMR #BB DNA##'i AMR [iBu^X>XyXfAX] AficDNA<7)jS$gB?IJ8t£<7) S — DNA##^R E5% fX/ >Jlt-RA>^DNA#M?f^AEE^J ft AEE^R ## ®Z DNA##^ ¥M m DNA##^ E2lR *A # DNA##r^ EtfrMR ma sm DNA^UfS E#^#R lumm#] Ip* # 20lf )\-T r i\-T')-r A6cDNA<7)S$gB^J&S E^r* fi — 201? )\~T E^R Em w$zmm>t- 2QW)\-T E%R ## m E^mm>* 201^ Jh7° E%R ^jn me E^M^W- 201#* )\-T E#R EUU L(7)^ 20%y A-7° E^R mein 201^ )h 7° E^R e* mm 201^ )h7° E^R $i® Em 20%y Jk7’ E%R f%m m# E^M^W- 201* A-7° E^R E^M^W- 20ir Jh7° E^lR -33- ft « mums ft is as 201)'»-7’ »%* m;n 201)')k7" ffl%* m#jii # 203)'*-?' *%m 203)'*-7' w%* A# #? 203)' 1-7’ »5?M [SMtifc)] ^ScDNA cdSSE^J#^ - cDNA#$q#:$U(D##, »ZR it ^7°^ KStir-t >7 — -fe>7—6 &£/-DNA7-f 7 07 k ±m m / w ft«%m £&«%* mm si mm mm A'ft ft*x-f *7Hr >i'-iftWJEM ft# wz *n «* *t» *s- AJHrkS'- m# a* @e m nx OjT »5E «G;?«#r-t>5'- fil/R JnL {— AK ##ft y *?— ±bb R# Sf m# mn * &BB SEE m# ##? m Wffl M# ftf*3 %# e# *;n x« i*GT##r-b>5'- □"ill #s ^t7x-r >7— St BE 3 m# e± UU* Mf — A -34- a & SrR • Stt« e%j*b A*g fflj- /W:*E%0r M# S /"H :*E%Sr SEB IE# ^"Xx/ TjJH zXX — rWOj IE# /'W ^XxV 7j iHr >X — StK ^"Xx/ ;£UI/*fc>X — WEB % /W ^Xx^ rWS /'W ^-Xx/ *;i/-fe>^- TxHI lEfrJ A<^"Xx/ — ® A# A<^"Xx/ %)]/-£>& — m* $? /'W^Xx'f — [% /W^Xx-r £})]/'&>'?'- [BiL'jy h^x7X/y-7 U DMA 7 -fXnyi/'fe <£■£>* 7-f7lMx>XjtmS 5# m 7<7th #± * — m%±a»-x $* [WT'f->> ■ 3XfX»5Si] fiS8cDNA®SSEE9i$ UU* »5EE%BB)V « • If iMfrS? e6SE%* S# $ S| « • If A° If I • fl? • Ef « • If iMftif £ttE9SM A# E%M%S;V # • If E5EE%S;V « • If l*8$E%Sr £ffi«?cM ®E%E% 3AA E^Ammma m* -35- ft 4S mm■®#* #IXM»?Ei?T ±ttW3SM f # ## iiJ*S$»%9f gf# imxmwftm pm [emis-semi iXglJ ftS e® esse m# yy Ad¥ff«%as iwss Mm MX yy AJStfWSSSC HISS ULllW -yy AfWtfE%ee w%* -36- %2m 2. i tin #(:c Mwit, *kfitn¥ HD^y AE?iJ ^ij-e^do t h^yv«k#t>^T^6o l frb^mmmwtsMz.QimkcDwmteznip ’rfcmmmxh'Q. ^<(om^x-\mt^(Dm^ Mwtstix\,'fa\ x ?'-&mi'xftm-i-z> tcdna^-t9 , ^^-e#6. Efwmie) t h^v y y-y)f##±6^ J: ? X 6 (D 7')V-X. y ? xmizmxtt. *'J 3'^r-t 'yXfeXftmZtitiZMmocDKAyXXy V-(DMtitftte SftT&^x rn^(DcDNAy^y'y V-(c:ti#SE z «b * #a a\ ^ ^ *w?zmiiB-cfi, Ef«-enxs$ii/ccDNAy n-y^)W^fLx mm^±^xh6^mmm^^ti zmmnm&mm lt^mse^jzswommfcfeltc0 $ 6<9 mtrmef# /:o H^AAcDNA(!:^(Df##(j:^ -37 2. 2 (l) #### EfWTti, d>-Y -yycDNA7^7"7 V £n/i^ft(7)cDNA^ n - y &UM £ +1 ^ScDNA(Diisu^fTV\ e+w^-a (mwmz#*^#x *^++^% ^) LX%34,000MM<7)cDNAy xi'—y(full insertion sequences; FIS) %:;&:tL~$'6 Z + <+ L/c0 y-(t, V'-fftfct: «Jf;lit t F^y A#^ftyy —y)^##% mw^iisE, mi) o rftib<7)cDNA7^yy y —AS(7)^1+77,677y n —y <+ L fc0 il&^+lfccDNAy n-yft, E^J%^#(7)^ F^yxi^L-^yT^yL, -e ii^n^E^J^^a^dkSl-SBS-t6 ^ + £ Lfdo mi) *9F^E^^^#^L^cDNAy D-V(7)6S y n — y y^yyy-6^ #yo—y# t mg urn# ADXX 4>ftW 4,154 t FTSibEft&M CAE f ## 1,367 t CAS [+++# 9,331 t F 1E^±# COL [#+# 6,593 HepG2?M HEP 4=tt» 15,528 HRC W» 8,784 fc F/J'E HSI t+tflf 7,444 t HUV 835 KatoIII## KAT (+++# 8,839 t F# LNG 4=t-t» 7,306 t F#i&f&mm#±^ REC C+t'+W 816 t F lE^y:S COLF 3,498 t hjESlHl Siting KAIA ### 3,182 3-ft 77,677 (2) cDNA y o —^(7)5'^%E^l&mv^#ei+++# E##^6##^^(+^:cDNAy D-y(7)5'^7)^(7) l y"7-ADynaclust£rffl VT, cDNAy n —y Mt LxmiRLtz.0 mmwx\**w$zrmmmnm*\^'xb 31 ^e^Mcdna ^ d -xdkp fTbiLTWc/^*, Sf^^cDNA^ D —ycDga^lJ^^nboT^fcS-a ii:(ts %\tzt^Dt>^fcgE^lJ W "C E^IJim± 7k 6E^j^6&6 o) mmmfcx-£=i- Fi-z^mfecom' cdka? v-y (4) K-td^ttOi^V^cDNA^ n-ytDil^B ^(DATG(c#L-cmmM^ v>bL^^wm-t^xu^y ATranStart^# 6 ^H^LTVVc0 -£ r -CTranStart£ V > T#cDNAd7 a - y(D5'^%gE^lJSrWfU e3- Kf-e^rntt^m^cDNA^ n-y(7)#mi-6z i^ibx x (5) Hy y^7^/yW&CZ6y 3 y h^yy^yy U-of## GPS-1 Genome Priming System (NEBtiSD &m\ ###^TcDNA^ o —y(c h y yy^/y (J^TTn^^E) mtUX^'no/c„ Tn#-^f$^(ipGPSl.l (KanIt^-7-^7-) Tn^fi -39- #cDNA^D-y(COV^T. H16? ct-y (6) mm-?7X * KDNA(7)|SIM cDNA^ p —y^i^#-f-d^fllSSro^#:^2ml(D2xYTiStii!^rAiLfc96y\7::V—y^^/uyi/ — H^SLx —«9 x -7/kf-y ^ 9 —>96-^71/ - 7 h y — '>3 yy y — h uy^rttSi) ^fflWcyy x 5: Ktitt&^yy-^yBioMei^ooo^fflvTSiti fkL&yyyA, (Pi-50) y-yy^^iAo# KDNA(D#m^fT^yho Kd, (7) 8#y-f cDNAC05'5k^ £ 3'*#^ 6 <7)tkSE?IJ£te(C ti, y — A y * -fy y t L TME761FW (5'fJ, 5'- TGT GGG AGG TTT TTT CTC TA -3') $ A(tME1250RV (3'#L 5'- TAG GGA AGT GTT ACT TCT GC-3') &rmv^o -Ax ^WmTn^m^^-C^L/bygy h^yy^yyy-cDE^J^-C Itx Primer N (5’- ACT TTA TTG TCA TAG TTT AGA TCT ATT TTG -3') £ A ttPrimer S (5'- ATA ATC CTT AAA AAC TCC ATT TCC ACC CCT -3') Tnjf ASBffci^ Tn#Ad7 p —y(t x ii$A y ?'AIZ16? p — y &mtk Lt k° 5/ ^ T y y Lx Eyi&^l-i&Lfco i“^^^#cDNA^ P — y{^o#x ^f 5'^3'A^E^i(CTnf# A^ P — y ^ ^ # b ti 5 32 A (7) y ■- y y y £ ip x. tz & f+34 A <£> E?'J £ E?'J & £ <£> tz g> <£> T-fer y A' y y y tc f& L/b. (6) (8) «fc5J£&E?iJ (9 ) mmEfijoT'tyyy ABI377 66by —yy-y —A^SE^IJ(7)t-f (A Phred/PhrapjoZI/Sequenchert'T7 ty7'/^|fo/: 0 Phred/Phrap^fflV^/cTy yy" 9 yyy'(i x E^lJ^99.99%^± (Phrap40) && Lfz0 -Ax Sequencher Aery y A y y^fli, [^#(7)E^J^ -40- T-f&g -mSSLT#^A/ (10) In Silico-C(7)cDNAA&gaflJ(D##f ^r^^tl/ccDNA^ n —y^^gfl^Ji^^L, GenBankjo «LL^GENESEQN-r-7XfabfML /cSt^KDmRNAgE^J^T —^rBLASTN (GCG-ver.10) VT, 3=t GenPept, SwissProt, GENESEQP t - 7 fM L E 6 ffT 5: 7 Ega^JtDT 2'—7 ^<—X GENCORE FrameSearch (GENCORE Frame_N2P ver. 4.5) £ tz. ft BLAST? (GCG-ver.10) Srffll'Tff ofco SPCIAS ^DNA#) BlRp-^y 7l/-A'>7 h^ — <7)ftf#f£ffofc0 £ 6(^&££;h,/tcDNA7 P — ytD^gaW^SHBET*® yEE^iJtcmL-C, Signal? (f y^-fZ#*) SrfflV Vc'> TTh/^ga jo Z t/SOSui (^#%±) V V::EHiMW)Zi|iJ£fTo tza 2. 3 HRISmfc' £7n-y£ili£U #E^J^%^A#(cE#i-6f##. 2)^2,0007 p-y( 7)A#E^m^#, 3)^^$jT,/ccDNA(DA#E^J(D##, 4)-^^D“70iW, 5) cDNA##^#yXT A<&r%^ Lfc0 L/c9f% (i) AAga^'j 5cDNA7 d —ydoiiBU !B$i6ez-7 D-y^6«^5iifcEf4mW7)7M; V-(D5'7kmiJ(7)^S # (Eim^ Ky^-at^y n-ycDit^) tz 50~70%BS£WS£;h,Ti'5o j;9 ^^m-c^^jT^cDNAy-ryy ^ocDNA^zyy 19 t>{g;v\ b t (^^^ScDNA^W#^^^ 15ZJ (c#AA7 p-y^mjT(c< ^7 p-y^#<^$4bTV\5o $^m&(DrnRNA##im#T(j: y/ADNAie^y n—y^#^RNA(DzmA&7cA(-#(t5^ <^#LV\ 7" 7 ADNAl#^^ n —y <&r3- KLTV^^m#^i#S< x y F p y^ £;ftTiy5o Sfc7774 yyy'^rSttrw^v^^fAiRNAtiScoy d —yrity y h p y^oti^ ^;h,TV'6mRNAE?ii(DT-?-kyF -41- cDNA£il&-f5r ^ VTcDNA ^ n - y (7)5'5k^E^J £ B¥#fU r^ScDNA^L^j (DWflfiSrfroT^ n — ySriigrfSr cDNAy D-y^rilSU L/t0 iSfK-f"dWl^ti60i^V^cDNAy n —y(D##(l, L X Tran Start 5r ffl Wc 0 ##%(C677,677y n — >#29,630(7)^ 7^^ — (#) £ tl/cy :0^^l y 7^2 — htz. <9 iy D-yfo, E+8,l43SE 0f(^wy n —y^e>il&£iiTV5cDNAy p —y ^SWUESrffo/Co ##%(cW+4,95?y n — u. ^-bb^se ms£s ^60tc b/c 9 bb^l^ (m) „ sb y n-y'MfcCiffl V6 ft/c5l^6B?iJ -42- Ef-W'C&S^tl/ccDNA^ u~-'y (Ell) ^ (#2) ^ p — y ______f 7 7. f — No iHt£ 0 »#g 1 05.02 36,765 9,936 (1) 9,936 1,682 GS (a) 2 05,11 43.492 2 07.27 36.765 12.148 (2) 10.797 2.030 GS36.2. UH-cds #84 (h) 4 08.1 1 6.727 3.323 2.153 265 GS36.4. UH-cds#88 5 10.20 15.292 5.136 2.851 590 GS37.3, UH-cds#96 6 12.17a 18,940 7,427 (3) 3,893 1,216 GS-cds 38., UH-cds#102 (c) 12.17b 35,294 14,625 (4) 2,293 GS-cds 38., UH-cds#l02 (c) 12.17c 14.610 66 f5) GS-cds 38.. UH-cds#102 (c) -6-S+ 77,677 29,630 8,143 2.6. #=10.5% (I) mnmv @E*n*e^l!fc£&a;asifi £ ft fcffiJW. (2) 5/11 ?)####(DK?lJ#d^COLF,KAIA(DBa?llV'7b b -43- 49I2)^a%^,E?fi^P@%¥,S^I%A£^ ^MdM ^ "j_o#m#^¥^^G)-4/:¥##T/44 3mm v£]3-^WPl--n»4fi imm °;m3cwpg:Limw (a^^m^d^) ^ m%nm “--i^^mm^ax-nc — * 4VNCP (C) AS 6> %0I £W‘8 LL9'LL occ %Zl ZL£ Z2V£ VIVX WW«@S3M ^ ZSl %6 l££ 86fr‘£ J103 S¥^3H l ££ %8 99 918 33tl $fTamma%8 4 q AIS %6 6L9 90 CL ONI W4 l 96P %0l 168 6£8‘8 1VX ^SIIIIoibx 9£ %0I 98 S£8 AQH se#^wm#^M4 i 9PZ %t7 6l£ ISH ilhM 1 SSP %6 Z9L P2L‘2 3HH SA l‘l %ll 0AA‘l 8CS‘Sl dHH ^tffflf^OddH 698 %IC L9£‘ I £69‘9 103 f8¥$3H 3 681 %8 09L l££‘6 SV3 Bg#g^dim#m41 9 L %6 ZZl A9£‘l 3V3 W^Mlil4 i ££P %Sl 6£9 tz£l‘tz xxav #m##4i ^##P@ ### -&■§# — Cl 4; y 4z ^^0-6 44/4R"#<-a^VNa3^^$^m (t^^) AS6> eon 6Sfr P9 L'\ ISI 8frS C££ 4S-^«fttPS 6iri SP£ 681 09P cc ost OS 9S£‘ l £89 96 £L£ 8 C£l P9 *»»¥ ZS£‘Z ZL9 PL\ l £6 ICl 99C 81Z +#^ 9 9 P £ £ Z 1 °N^E/t-n ^ L\l\ p/\ i SC/8 81/A SI/A Cl/S QSPI y 09 §05 y 0t §0£ y 0c y 0i S5S t (m) Z^^K^^-a±yyy-7-^^#i-L-CE^ £r£tou ^tL^^^Dx-T^ftBE^iJ^r^tic^-tir/Cc -en^^^^tilB^PWcD^TcDNAcD^SE ^E^j&^-eM^&pwcT)^n—'s^tz.*)co^mmmt\.4mur -v-3*BS) tz'Dtz. ms) o Tnm(D^TAA@a^J^^:^-e#^:ej (g]2) ms) Tn&(CZ y g yh^yy^^y !)-f^#(7)^# ^ c —y#: ttm %m Ltz. 9 n — y<7)## 1190 100% y g y h^/yy^ yy D o-y 1143 96% y g v h^y&(7)A#E^j^^(7)0;# f a — y# itm yg^/h^yy^yy!) -^^$1%?#/::^ o-y 1143 100% yg ^ h^yy-^y^^lf-rA^E^U^^^L^^n-y 418 37% ug^y^y -7 — ^-yy-cA^E^u^^^ Lyh^ n —y 433 38% 177 15% 3[n]]^J:07P7-/f ~ ft ;j— ^y/^K'lffof:^ P-y 115 10% ^SE^^^BE^'o^cDNA^ D-y <1: L*m, 1) y-^y^EfS^ii^-Cit £oT L£ 9 b ^-t-r^te^smztitct'#mLtz0 temvMLmm^titiz>m&tt'£r7±>-7'Mm Lv\±, Tnm-eE^%^^L^#^ic (i, TnWAum'bm*)m lw£<7XMM#£*eii-cztz-tfrb, ^tz.bob&'itc (BI4) o $/cHSIl 1686 Tit, <7)EM^ n— y (7)214# B <7)£ESTA£C t°“^ ^r^LTWct)(7)^x Tnf#A^ d —ymA(7)^ n — y bC(D? p — y(;:2#ffcLfc (EI5) . Z(7)C^^^E##-cm#^^-CV^eA-C, ±##(7)^T^m^^LTV^t(7)«!:# x.bMz. 0 Z(7)#^(C(t, #^](7)tf-^(7)lt$A^6C##fL^o —45— Ho. 1 Ho. 11 Ho. 15 No. 16 Ho. 14 Ho. 4 OFF JOE a > C0L02253. 2047bp. 406aa COL01871 (3371 bp, 739aa) *• (EI3) Tn&&PW&(D#mTcDNA(0*egEM^*f TnlO TrJD9 CWH> IfB t- }?AfSSK8^' yjiic|£?i|®□. =.>y h s^S'l (279/282tg®^-0[) (B4) 47 (3) cDNAi^iESE^lJcDftW "f^b^E6@<7)S ZZT(i#^#^#^$-eKAT(D##f^#TLyc 3,754^ n — i) cDNAE^lJfttDStff 7^yy ]J — ZT (#6) y A y y V — Zf JW# L/i 7-ryy D- Efm^agdk# □ —>-©|g# (bp) t MgKiMS ADXX xmmm 253 1,476 CAE 71 2,022 CAS 151 1,512 t hiES^Ii COL BW%@ 779 2,317 HepG2>M HEP mn%& 724 2,560 t b"#^#±^#aE HRC 422 1,828 t b/H§ HSI 237 1,506 HUV *»»* 27 2,097 KatoIH#E KAT 413 1,583 t Hffi LNG 317 2,167 REC 25 1,560 t MES’^S COLE 148 1,873 t h EASE ting KAIA ^F —S 187 2,377 (bp) 3,754 2,048 D — 3'#;%(D# V Ay F V y L/to -49- ii) Z%^L/ccDNA7P —yo^Ef'J/PL, ATGT^^O^ K>-C#t»680aaJ^±(D#m# ^r(7)yo->/(D[#T#±#(Dt(D#^m^T^y^E^J^Ll:#^L^o £170/^1+3,754 7 u — V(D 9 ^2,9427 n —y (78%) T:'E (7)S*$l£#l/c-f-fMSlRT ^ / EBd^J (ADXX) T#70%£T(E]o/c (#7) o (#7) 80aaa±(7)fg)tBET ^ /^E?iJ£r^cDNAy o —y L/c >=80aa(7)#^mme^J 7^77 V y 0 — y -51- (W8) St£flmRNASfiy|j£0'7~'— 7 ^-510 7 p — y >=80aa BLASTND&nifcl y-r^y 9- mm 7 p —y#c BtiaK?iJ mmmm t b flg toSi.^ ADXX 253 150 24 (16%) 72 (48%) 54(36%) t HW fN& tM CAE 71 63 6 (10%) 38 (60%) 19(30%) t bm#m##mm^ CAS 151 122 23 (19%) 49 (40%) 50(41%) t b lESTcB COL 779 553 44 (8%) 287 (52%) 222(40%) HepG2^B/!S HEP 724 611 56 (9%) 418 (68%) 137(22%) t b HRC 422 358 72 (20%) 192 (54%) 94(26%) t b /H# HSI 237 173 26 (15%) 89 (51%) 58(34%) HUV 27 24 2 (8%) 13 (54%) 9(38%) KatoIIIiM KAT 413 357 51 (14%) 211 (59%) 95(27%) t bW LNG 317 237 6 (3%) 127 (54%) 104(44%) th#&em*#±A REC 25 21 4 (19%) 10 (48%) 7(33%) t biES^S COLE 148 130 15 (12%) 63 (48%) 52(40%) t hEffi-msttS KAIA 187 143 7 (5%) 82 (57%) 54(38%) £8+ 3,754 2,942 336 (1 1%) 1,651 (56%) 955(32%) iv) cDNAb 7)T#J fr-rZDKAWftffitmmizspciAs%m\ y^wa^yy-Ayy b^wr^tto fcQ ii)T#mL^:80aaJ^±(D#$(D#^^T ^ /^Kf'j&^acDNAf n —y(c#L, SERp~ y y yy##?#^fro^ n-y(7)it#^#;+Lyc^ C5. 200aa^ ±(D#^(DT ^ y^SE^J"efciuf8o%a±co^ p — y ifclx Hcififli (im?) . #%^^T#J$^L^V^7 cz —yoitW^ZnoTb,, L^^LSPCIAS^ffl WdWW"C(1100aaiUT7)Mv^ - K#%eiEmrT#li-5Z6^#LV^V^ - ^DNA#%^f • A'WM±frb (76%) £ ii) -e^ L^80aa^±(DSSEET ^ 7 EE7IJ ^11, V/c;*.^:: 7 p - y £ l$13:|i?l CWJ^tf L/c0 t/^u 80aai^±(7)ft^^r ^yMByiJcbSi^p-xy yy'«(7)fS]^^Mai^tL/c:y n- y(±2,617yp —y (70%) (T-ft^-f) o i6%(/)y p-y-ef±T#j^^iybm^3-Ty yy#%^ 2c>j^±(D-i:yy y £/ci4%<7)7 p —y-Cbiy y~2^yy b<7)«JHEt£^^£tifc (^9) 0 ##%Nix y y —Ayy b ^y y°y/f y y b p ycoEWEDNAE^ftTvs^SEte t)^fW£ii/c0 r. -52- s % >.. XI w X SB #5- I I ^ 9 d X X % T » X ^ M 8*- t 38 SE |$ 04 Op^* m m $ rv ^ 4i 22 X ov M: ^ 0 $ ^ rtr« # y- ft $ a 0 d 0 M rV >66 Zx , a # # 22 C0^-cu?--/'LfSOfUsW T * ^ fV ft 6 VNO^-Xi* : r z nr a £ ' 4 M" 2 '% £ i ! a >|f ^ - % n r iP [ d a V r\4 M 5 < ft -B V fi a* aS % 5 0 8 i d S3 r BB \r r< t ? r r Ky 3* 22 s? * X xl cr d n a rt X> a % ^ m V <- ^j OHD) xi ^ r It 04 V Ai d cf* z~ ( a z %' # XiflB!$EG)dOIS~DlVtoT» 'SW W W ffTtoW B^fT^^ElTTffEBlW TTTZlRZHr A 03r ^ ( S 5 Xy IM # XI m Si V H ( >4 xli --/'tfgfiBisP 2 S $ ^ H l (#9 ) 7 n — y HEffl*# *8 #?«##«y y — A y 7 b 7-ryy v- mm 7 o —ym 7 n — y ?»#] 7 o — y k2 h flU ADXX 253 147 29 (20%) 21 (14%) CAE 71 59 9 (15%) 1 (2%) b tmmm^mKmm CAS 151 128 8 (6%) 11 (9%) t ME$*» COL 779 508 72 (14%) 54 (11%) HepG2#^ HEP 724 604 88 (15%) 80 (13%) t hM#±##aa HRC 422 350 60 (17%) 44 (13%) fc b /J^# HSI 237 176 27 (15%) 25 (14%) b Hmamam&am HUV 27 26 5 (19%) 7 (27%) KatoIIlM KAT 413 374 35 (9%) 43 (11%) t h LNG 317 202 45 (22%) 40 (20%) b Ftteiafi® REC 25 21 2 (10%) 4 (19%) t HE£*B COLE 148 124 29 (23%) 23 (19%) b hjE&BUStolE KAIA 187 144 39 (27%) 38 (26%) s-n- 3,754 2,863 448 (16%) 391 (14%) SPCIAS£ffli>Ti^SJ n-yt7ix —Ay7 — > (m\0) 7l/“A'>7h^fiM§ixfc^D-y^#f 7 y —A y 7 b E£iJ^ cDNAE?lj##f L/bMm C-COLO1887 189 naii&L C-COL04151 1,227 G -> AtoSSSS C-COL05258 570 586%? 7:^ (-lbp) C-COL05623 471 430 - 435 7:7^ 473%:'7:7c (-7bp) C-COL05962 517 518%* 7:7y 450 K 3 bp#A (+2bp) C-COL06078 470 470%? 7:^ (-lbp) C-COL0739 1 1,542 Pp^M^L C-COL08 164 149 149#i&%?#y'#A C-COL08657 697 691%:' 7:£ (-lbp) C-COL09356 587 581-2%:' 7;£ (-2bp) SPClAS$rmv^-C7 lx-Ay 7 -54- BLASTN^rffl E y h L/ccDNA^ n —y(D 9 H UE^O^SW^T ^ / - ^X 1^-fdFrameSearch (GENCORE Frame_N2P ver. 4.5) WcfBiWfiW"?' H)-(^E y F /mE^J(D H **~y) ^(DiUB^cD^giEE^U r (D ^ n — y (DFrameSearch-ecD T ?Xbffi 9HH^<£oTH^3bH St)i^^^^KatoIIlM6S (KAT) (DyXXyV-XU67%, FFEMSi ^ d —y^-eco^TiMyo-efco/c (#11) 0 vO y E mfeZtitzT* ymmm^mLX. Signal? (r^-7-^II4*) £ ffl t' * ->^'>/i/E^iJ(7)I^J&J: mosiii^mv^#R###(Dfmij^e^/c (#12) 0 80aaa±(D#^mmE^j^f m'j^ n/ccDNA^ n — >±fo(D\$%X\ is £ Hfz0 tztz LSignalP-milMlES 6 m^MLxtmcx o^ k -tscdna ^p- xDmtznxd i>'pte^t> (1G M 6 6«-efc6 $ ttfco (#11) 7^yy V- E hflltoES ADXX 253 6 4 (67%) 2 (33%) t CAE 71 10 4 (40%) 6 (60%) t hmshm^-mamm CAS 151 25 9 (36%) 16 (64%) E HE#*# COL 779 97 37 (38%) 60 (62%) HepG2|fflJ|& HEP 724 55 19 (35%) 36 (65%) HRC 422 60 25 (42%) 35 (58%) E h *# HSI 237 20 7 (35%) 13 (65%) t HUV 27 4 0 (0%) 4 (100%) KatolII#^ KAT 413 72 48 (67%) 24 (33%) E Hffi LNG 317 11 4 (36%) 7 (64%) E h REC 25 0 0 (-) 0 (-) E HE#*# COLF 148 26 14 (54%) 12 (46%) E HE#@#ti)E KAIA 187 17 6 (35%) 1 1 (65%) ei+ 3,754 403 177 (44%) 226 (56%) -55- (mi2) z/t y-r^y v- >=80aa mt t hmtoE^ ADXX 253 150 27 (18%) 28 (19%) t hSbEft&EflS CAE 71 63 6 (10%) 15 (24%) t hjSBJlWrtffiifflJia CAS 151 122 9 (7%) 14 (11%) t MESAS COL 779 553 70 (13%) 128 (23%) HepG2#ga HEP 724 611 45 (7%) 111 (18%) t HRC 422 358 44 (12%) 91 (25%) t MES HSI 237 173 19 (11%) 23 (13%) t HW$iMm&iWia HUV 27 24 0 (0%) 0 (0%) KatoIIllfflJja KAT 413 357 41 (11%) 73 (20%) t hM LNG 317 237 16 (7%) 43 (18%) t EWia^mm#A^ REC 25 21 1 (5%) 0 (0%) fc MESAS COLE 148 130 8 (6%) 25 (19%) fc MES0S&S KAIA 187 143 12 (8%) 38 (27%) •s-n- 3.754 2,942 298 (10%) 589 (20%) (4) Vv otf>OcDNA* n-y^JE, Wd'^iiAo SAAffl^^ffllVccDNAy n —yii^T, COSEA&& A (#&*#!) o (5) ^ n —y^Stictl/cDNA^ u — >(D;<>?lk p-y£6SU (£WLA0 R?'l&^(D^&^i|X#L;W)NAf n —y (6) h^^T(D^%mA £9^ W,t}Ltz0 4'y^—^v b3o ckTM y hy^y bT'^Jffi-C^ 5 r t £ 0 tf)b LT, 4 y? — y a^—y ^LTWEBy^^'f-^mv^yy^Ty b/f—/i-iy^^A^L/bo ^yo^^y b-ei^mm ±mmm^9'^-^z:t\z.^ y#- b lxm%. '>7fA(t &£5;hAccDNA<7)£EyiJ£^HM L tz0 Z(D£ 5^'>^T-A^-Ctfcr ^ Jdct >9, L/hcDNAE^ij^mm^K## L, ^ 2. 4 $aa& 677,677 y n — y (DcDNA(D5'^)%E^jt» ^ (C##T K-f"d oT^tt(7)i^V^cDNAy n —y(7M£# 8,143 y n-y£ii$U 3,950## (D cDNA(DA#EyiJ (FIS) $r%6A4CLybo — gl5G9cDNAy n — A&Eyj(D##^^:Xi^ ?>, K-rsy n-y^ ^ - k l -c i ^ 5't&frti&b (Dbfo'itztiK ^2,600##(D#fm#^f 3 - K-t6 cbSbiLdcDNAy D-y(7)EyiJ^B^ fb^-?- (CjoVT, b ^^8,000###^^^ ^ Z 6 iy WZnl^fm^ 6^4ci-5Z ^#-?###*# v\ zii,^(DcDNAyo-yiiA-c#m^yy-i:m^%A$^/y:^'C/<>yfk^^i:*3i9. ^ m#c-cvx^. ^&E^m^-e(±, (Tn&) bEyij^#(iT^!ti^%7yyA^#A^^6^#t)^'C^6^\ AT^m^^V^# Wifol^fgtf) btlfzo y n —y(cZ^T(±cDNAEyiJ<7)#^/c#%^ff fr'QM'P LxffiA£fotzb(Dbfo'otz 0 t.tz, Afiff^r^^'Cffl^^tL/ccDNAy D — y<7)- -57 cDNA? ^E?iJ&3Ea — ^♦S^MSI-t'Sr lW-&C:,kdST#/bo ^ftcDNA^ IU>frcDNAy $ X'yX • • • - '-ryy^ • 3) - • • • 2) • • • • • (False 1 (FalsePositive 5'^W>tb5#@ #%(DATG(DfM:#^5'^%X)^ ATG^ 5'E?'J(D#g(Di#sv'Ems3s< 5'Eyj(D#^^m<. cDNA^(Dmm-e^#, ^P^RNAtiSy ) ^y7t-7r^'7^yiasfii$ yy 5'E^ij^^v^^seyij^s^MS<, iTOLfcy <&wm?)5wm< 'MtkLtz? y ORF^tf xn-y -CcDNA n A^f/ty — Negative n-y^£Tii&-e£TV5 y###, Ky^^^Eyii^mK) — (D P-y£V'5fc{t-eteft^ofc„ k&'VZtZo 7^yyy^(cz5i-mm# o n y (Cf6 t — — o o — LTStSX:'^ £-C7)ATG(eo^Tj$#fytfo/';:/-;:tf)S$£ilfr^o/';:y y^#p^RNAA^y y^^^oE^JcDxyy^ n ff^^cDNA^ — E^J — Tnm^#%(^fT9Z«k-e, y^ii&L/ey ^ftEyU&^l-^&ir&oT ^ y #A. KUTV^^n-y z y^7MSmRNAtc£S*tdy la 3 £ . ^ — o 5'E^IJ ^fcTn^rfflV'Sr. #A, /egeyo £ KbTV^^jife-e^&^ofcy y^^«k'^g#/b«k#t6^5y-^o LTLio/'ey £.&&&!&&:% "t 6 /£ - yh. t . 5 500i£S.iyFW&lyy m^^SiJ^-e#^75^^y--y £ 2 — ^ t /ey p~yjifet;itefr5fafi££a)Rg y> o ^ t>#;U::< 0 — tests -58- — yy^x y-^yy (T#j#g[) (?##&) y> — ^l:ioT, P-y $ 0 Ltz& ^ y £ ) 0 v\ xn n/e ) L^ofcy- V — Ty d 's^k y D ^z^ tcSr-Xo - — ^ y^ii^L/cy- h-efcofcy- * y(iS#^^ je 0 = /^SDiSLE^J^'atp^ — n ^ — ^L-CV &~f 0 0 y x Lb^X — Q — * 0 7t — L/e/cAte#^^ 0 y (D? o u-y^m 7 n — D yco - $: 5'^E^J-e#v^ga^j#^(D#^^mv^(D-e(i, ^=»-KL-cv^ct^jo^-f yo zfiw3$^ m^t^c/b E^*Jfz0 l /nx y- >y y y ti^kclyr-^ M£fT 9 & V x jlji MXlttei\ L^Lr (D l /n7y —^ry 7(:S^< n —y£ii$Lty< ±Xlt, X: y n - yil&(-jo It 5 ffiM&vmfeV. 5'^SSd^J^^fctj-e(m^E^J^(7)^-ti^^J®r-e^^ i'b(Dbh^tz .0 xtut*V ^>-r yV'fevit—ftktfitecxmhy 4Vy V —cDfES^-tt^-rs'^^ LTcDNA(D5'^^(DE^JL^^^c,yb^\ —yTticDNA(D 3'^#tC^^## # cdna^ v-ym'M^m riL(m^ite^--e(7)W®-efc <9 , iztsitz^&mnz r £-e^< ^er-e *m&m%xtttLfa\iim^mm£\z-t%%±&cYMA?v-i'(Dm&iit^tz.0 1111^\m 100%(Dm4:#-e^A&^ n-y^#^A&^o-yeeBUi-6Z ^(1 ^mXhZ)0 ®t$nmfc^ t?teio%mgxh'i£i o ^tu;i*j-u l y n - y ^ £ntz ±&mt44%fzotz 0 jotf 6 ^ n -y#%f##-C(±, K?#ms#< L^^^E^iJ-e Lfrtei'5'M$iifr b Z ^ tc j: D , Mite^-cb LXtm^titz.^ igr3 - Ki-5 (b@^6f o- y #-#-&Z ^«b v^9 gsfW'cDNAy^yy Z6 t, ^#(7)##^ L-cm^r$)5 9o yy K$tL5ite^-(DtfE(j:, t-tx> h & y^t^mm^RNAmmi^t lt^¥ £jx, &\*ll^tol'XX y Fnywli, jo j;m'^^(DCAP#i&(7){#[l^6 W-3'#^m^(7) #y AE^U JojcBmRNA tte^ fcfi t-jiBS'iliil L £ tlX V #'/—J*\Z.£ <9 ’g&ffcfrftfrtlZo L fztf'iX. tbttbti L /cRNA/^ 6 aK y AE^IJ ^Wf6RNA^##L^: fit-, tMJ J:oTCAPfSiS^W-t5mRNAil^6tl(-cDNA^^/jjci-5 cb, E6R(7)£ mE^^3-Ki-5^^cDNA^#^%t-f#^jT,6Z«b^m#^fL6o LXi^L, btelc L/ccDNAy D —ycD^t-tl, te'y AE^Jt-^^-f 6 cbSbiLdy O —y^-r y h n >^^At-I^^^TV^V^n-yj)@SA 6^^o ^(D±#^miS(D—^(bLT, mRNA(D##&(- jy(t 6Pm#^#^6ji,/:o #^(DK#-e(j:maaRRNA(7)^^#mi-5Z(b^ -59- ■et&V'o -er-e^P^lRNA'b'a^^RNA^ttmL/'cEt-^L ’ ^dT#y A £ V' TmRNA£ffiMX6 2 y/ A^-^#^RNA&^A(C%lp|^< Z «k(±#L< ^60 C(DZ 9 &RNASW£rlto t LTcDNA££*f-5 t, ^ n-y ^£ C5 :: Ift^cDNA^^cB(7)-f-- mikm^mmtc^tz^ n-y 6bb^^5n, ue &^otl t.'itzM&b— %$X%$.tbbfrifc0 ^a^K/y-et±#(DcDNA(7)A#E^J##yo^]:^ h ^S^^ilTV^o i/3L? bfrb XfrlbO nmt-wcLXL^'Dfzbobh'Dti 0 s b—Wtbtchob&'ifZo i 60 2. 5 SlfflXSt (1) 1. Maruyama & Sugano, Gene 138 171-174 1994. (2) • Dynaclust. (^'4 ~Y ^ M>) • BLAST ver.2.0 Altschul, S.F., Lipman, D.J. et.al. Nucleic Acids Res. 25:3389-3402. 1997. (http://www.ncbi.nlm.nih.gov/BLAST/ ) • NetStart A.G.Pederson & H.Nielsen. ISMB 5 226-233 1997. • TestCode J.W.Fickett Nucleic Acids Res. 10 5301-5318 1982. • MasudaScore Y.Iida & T.Masuda Nucreic Acids Res. 24 3313-3316 1996. • TranStart. E, MS • RepeatMasker. Phil Green (http://www.genome.washington.edu/UWGC/analysistools/phrap.htm ) • Cross match. Phil Green (http://www.genome.washington.edu/UWGC/analysistools/phrap.htm ) • Phred/Phrap/Consed. Phil Green (http://www.genome.washington.edu/UWGC/analysistools/phrap.htm ) • Autofinish. Phil Green (http://www.genome.washington.edu/UWGC/analysistools/phrap.htm ) • Sequencher ver.3.0. (Gene Codes Inc.) • Hybsimulator ver.3.0. (AGTC Inc.) • Amplify. Bill Engels (rM^n>-y • BLASTN. (GCG Package ver.10.) • BLASTP. (GCG Package ver.10.) • GENCORE FrameSearch (Compugen GENCORE Frame N2P ver. 4.5) • SPCIAS. /NK /StR (X,^$DNA#^Bf) • SOSui. (^#%±#) • Signal?. Nielsen et al. (t>-7“ f %#^:) (3) * • GenBank (http://www.ncbi.nlm.nih.gov/Genbank ) • GenPept (http://www.ncbi.nlm.nih.gov/ ) • UniGene Gregory D. Schuler J .Mol .Med 75(10),694-698 (http://www.ncbi.nlm.nih.gov/UniGene ) • GENESEQN (Derwent#) • GENESEQP (Derwent#) • SwissProt (http://expasy.hcuge.ch/sprot ) —61 — SE£6cDNAE9Jt*$&tfit£^ 3. 1 1&m bcDNAX'7 b (J^TFL-y n ^oc y birl&IEf 6) fi¥b£l\¥4B 1 0 «t <9 ¥$12^3^ 31 0 £ T<7)i^f^T\ 12ttJ; 9 &53y-Aj&s#n 77 y n — y (7) h: b ^i^ScDNAtDEy'Jz&Ati £f Er 9 T j;A(D## ZigltotZo AT'It4,000 y n — >(D#f#^A&cDNA(D#iB^:^ E ayfa'- X2^r n * (D^^gcDNA&^llUl.OOOy n — y COE^l $:£ £ B S Hfc„ a * te ^ }± (i,oooy n-y) ScDNA (500 y n — y ) (Dft 1,500 y o — y <7>1$bE£ 3 y fc° 3. — y — £ l' T i^S'J't' 5 r. #*$)6y&BmL/b. yd v32y bT£<^t±}£tidW*&B<7>t b^A^cDNAKiB-fa###^###^ -e^<, g#. m#, (Dkyb(c^(9#5 t)(D 6X9# LTV^6o 3. 2 k b^A#cDNA7-ry7 % - m~F, gf4f 7^7^ y -tmm-fZ>) (Y.Suzuki et.al.)^m^T#BlJ cdna y p — y (#n,oooy p-y) (D^i^SEyiJ^y 7Ty t-^y^'libi *9 & ^i-60 #6^,^Eyii7-y l±DNAy^:/^T(D^To 9 E«k4)KT^y@g Eyii(-^#Ly y/ (1) #m## mu%m^M£ 0 (adxx ) , (huv ) , & (RFC) , lES/hStoS (KAIA) , jES/hS (HIS) x (CAS) , M (LNG) b^A&cDNA7-ry7!;-zn6#$R2 (^jo, Eb^A#cDNA7-ry7V-(D##joZU(^AA yn —y(7)#Kfb#Kci^T(±^2$^#R8^ ) (2) ^•^^IcDNAy yf - bli^y y — pME18SFL3(GenBank: accession no. AB009864) (D^)V^ y u y y"yy b ftODrallHb'T b(CACNNN J, GTG)(d^A^tlT^o <9 , 7 yy — b(7)5’iiij(De?ij^£y 6s(iy yib — beyo(DttmH-ix^^-bi/cyyy — ME774FW (5 -CTTCTGCTCTAAAAGCTGCG-3'), 3 ’ M S®tK£70 pl£*Dx, 2^^%5I L, EHXffloy lx- b liy77 S KDNA#$£^ltm 6o n hMz.-? 7 7, % KDNAOl|X*(i^n.5-2 pg, #^0.02-0.03 pg / pi b & *9 , ^ 7)10 pl£^-£ b. M^OSB^l^:^ PE-ABl%#OBig Dye terminator 2.5 pi, 2.5 pM ^ £ £—~f 7 4 ~7 — 1 pi, plasmid DNA 10 pi, 10 x buffer 1.5 pK H20 5 pl£iE^ L, PCR^l£(95°C 2 min, 30 cycles of 95 °C 30 sec, 50°C 20 sec, 60°C 150 sec) £ IT 9 « E^$(-TE 15 pi, HL reg (Edge BioSystemsbiM, precipitation reagent HL) 5 pi, ^ £ / —-/L 50 pi £ X , MW4~ 4> 0 2,800 rpm, 20 minis >£ L tz \k , ±'(W£^T, L, SStK 50 pl£#xT#S t£ 95 °C, lmin y T L , PE-ABM±M —63— Sequencher y y h £ V' X BS?'J yy^-7-No.i«fNo.5^-cE &WC. y.T^y3T(±yy^-7-No.2^No.6(CTE^e^:^L. ## % (C(±(M%7^^ y If — f A#: (■Cl iS*t" 6 ^ "C# "9 iS L T VA <( 0 d. '>“^xy^7-^(Dfl BB^iJ-r — y t) co> jo ck ^y y — sbyy t y y y — bbyy cd s "t--5 y p — y ^ ii lib tf y y t y 7* L, ^LML-cy-y^y^(Dm^^fTo^. Z(o#m=^yUT-L^y-y^yy.T- y Ity D“y|i;f-^fI'>^rA|;m, SS&ftjl-fT 9o 3. 3 SSK* 6Efc (1) y p-yc£>ES ^yp-yij:i,oi7i:*ofc 0 BByy&SftJdSm^&vT^-r t°y y -i^-yiyt- A#Ti@m L/t. T¥m(c^V^T(iim^K##L965 y o-yT L. •£<79 9 f)960y p-y {Col'T^W L/d^S^Sy (#1) o m y p - y (dbbsb ^ BByim^jm 5 8 6f} 78 8fl 98 10,9 11 8 12,9 1 8 28 ¥S fMt S y n — y <79 BBSS: 64 140 373 96 683 1,356 y p — y (7) BBSS If Sc 64 64 204 577 577 577 673 673 1356 1356 1356 s — y p — y 25 20 98 32 164 339 ##f##y n — ysif 39 39 159 434 434 434 498 498 1017 1017 1017 1,017 BBym^sc 5 27 7 0 140 108 126 105 112 330 960 e?ij^6»si+ 5 32 39 39 179 287 413 498 630 960 r t 5^t7^fdABi3 7oo^r-^t p7y “-S!y-^3iy'^-(7)fikp*n«t9 y# rn/c#(d#m L/Ce (2) 3yk°^ — y-#my^TA(D## ^ttyy-lDNAy-yiyf- (AB13700, pe-abiI±§J) =k y f#6iL/cBByijT — y (3#%BT#LANA It, ¥ y (Sun Ultra60: 360MHz x 1 CPU) (C i# #^fL6. 9-yy-b(7)# M t Sequencher V y V % ^ y y cyfef&ftM bffi ft *s — yxy^W/ty^ —64 — V —SrfTVX (D FL-/n^x^ b-r —^ WS'> X 7-Me y p — y SU (C £ it It (02) o E^T — f d^#T —^ /< y y , -tteibh, GenBank/EMBL/DDBJtClEB £ it/cE^Jy — y 1*1 (C # 7 'y n — KLn r iUC*j- ItBLASTNCi dDNAl/^/V A##m (GENESEQ, fA)l:^ltt)^^ny-^i^lTofco &3o\ y=' - y /< - x (D M Sf 1-M LT(tGenBanklt 0 * , EMBLft&i®, GENESEQdl^iiT'Mff $ it, 1b-/< — c$ frl&o Wf #1 :!£ iBit /*;: t — y ^ 03-i (7)^^ g (lift 1,000 y p — y “ 7^^“^, NONST (non-est non-redundant: E S T 7b X Hi IS E 7fJ £r ^ V' id b (D) t — 7^—X , GSS x HTGS (genomic database) , 7^ # y — *7 — 7* F*J (D # ff y — 7^“ ^ > UniGene-Human> UniGene-Mouse, UniGene-Rat, io j; Cb, _hf£ T S y &E*mBLASTXi£V'ti:BLASTP(C «fc V , ±E#St'- 7 * £T&it[zffi$co&mT'-?'<-xt —65— -Oth-h SS?>J 2 kbp A 4 B A, B: — 1 ~8 : gene-specific 3?7 'f "7— HI 1 ~J~7 -f 7—^7 --- ^ ^ v7 /i GenBank/EMBLy DDBJetc. #8*% —9 /* —X 7f\t □ S/—"tt—^ Et h e r N et FL-p reject ABI3 7 00 (Sun Ultra60: 360MHz x 1 CPU) ■ n—*7 v — h (7)W@ • V->7X>X««f HI 2 3 2/1° jl—6* i/X’rJx (in-house) —66 — °9 (#^%0E) 3%0 '>#5^?7 9^a.mm%06-%0E2)^^-f)q^O)##jyO94-#^^#VNa3 ^ 64.4 ^44^^^.p 6 'f) V^W^S 'ffK'SHT °--/ "1 >^%dqss£‘z 'dq^si‘3 vdq6g0‘3 'dq^gg'l vdq90 g‘l 'dqigf/'i 'dq[[E'i^^^^ '>^2)miO)###^/^3 "# '^T a#^?}3sa '## '#N/#3[:Bg#&zk a 4ES90)%89 0$)Wf)0^ q/Tf ? ^ 3^OC0TY?T^ 4 34 08^.^0)^ °^c.($a.dq 936'I 3^1)#tk%^f)^#VNa3^±OX-a ^0964# °9vA41^^#^ /^Z^Tk ' (%OW () ) #3%0#Xg "#:#VNa3^7k '-6 4 .4^4 '9^T44:^ '%vi5f:2)3^f)Q(?^a)^^##a)4-a/;0964# °9<$a_4- d4E6E (DN1) # '4 —a<;63 (SV3) Sg#t7k##*E '4-a^0E (SIH) 3 '4-041/.1 (viv%) '4-a4^3 03%) "4 — n ^63 (AHH) ^MSiS^tff '4— a ^gS3 (flSaV 'HSQV '3SQV vV%aV) 0{CT9^2)3^^mWO)#E$a)l4^14^ 'cz)4-a ^096^3 °9149^% 4 - 4-a 4#-^k '<^9^4^ #? 9#%hP ? % 9#N[i4P@o^^^ 6 T2)^^#^04 44-^^4E^__6.4 ':)% vic^2)4-a4(Z)9V% 4f)J.%4 4 Jp—K4-^.^4/: 0)#:^/|!#WU3)U03 gg g '%i#2)#4 —a4 #### l#^^^n^P@^^^(Z)4- a 4;96^14$M# 6, T - 6 4 4 ^4VNQ3 2)81 °9$J-^?%^#a.^WG)T^T%86^gm00t7^2iT '%#OOS-OOE °9^J-^U4^?794-^^^ri^P@G)4/4/<-4^8E '4/4^4k2) 8 1 '%v1Pf2) (BB#%4:) 47}#^a^^003EI8V-4<^4-^VMam— 6 4.3^-^: W^^Sfi45@ (E) 2 y -r y y v - y p — y 3p# ORF(%) f #T ^ / m cDNA# 6! (aa) S(bp) flBto (ACXX) 255 1,506 156 (61) 230 (hu v) 29 2,059 24 (83) 342 /H#%# (KAIA) 171 2,355 130 (76) 345 25 1,559 21 (84) 276 (REC) svimm^mm (cas ) 77 1,481 65 (85) 284 lE^/hE (HIS) 30 1,327 20 (65) 207 B (LNG) 373 2,187 235 (63) 238 1#+ 960 1,926 651 (68) 267 £/c. ORFfc btltz.653 ? n — >(D^X\ 5'-#%^##(D# $ #349bp, 3’-#SE1W ^c(D# ^ (±805 bp-efco;fc 0 ORF(D5F^ESti798bp (#^^ EA 266T ^ / #) X&'itz. o ORF^ £ ftfc ^ p t. ORF/cDNA(%)^iSt:|£t^(7)'e(j:Jt^^ p — yZ,S%(D^A# He j:6mRNA(Df-xrX(D ORF^-a Ay-ev^ 6 . i&VMi, * y^° y £ fi&T L t> V' x. ft i' z £ *^ p v" —)W#f ESTt"— y /< —x o tzQ £6, NONSTx — £ t y h (D|^46%(e ti^t^>0 Z <7) 9 h , 75%(t genomicE^lJ 6 %k (D ; No-hit, 101 -800 (D b 7(Z^#:^b-Cl^V^t(D^^35%, #^P^—(D&6t(7b ^V^(2, EST#(D DNA^rb ^ — g[ L6 & (DZ,S46%. ^ ^ (21^ —B: L^: (7)(219%(7) 183 ^ P — y "C, ^(DcHeilNONSTT ——^ — as y. (7)^.7?^ < GENESEQ(D 2 9 ^ 7. ^(DiF — £> b <5 o Zb2 "Cb, O y^-(D$)6 y ^ -69 H r^r tn n '> CO beta-glucanase , CO acety lglutamate kinase, '> h 'y 3 y /< :& CO glucosyl transferase, t h CO O-linked GlcNAC trnasferase , ^ ^ CO phospholipase C#co k h 5 ^ # x. 6 fl <5 t h #f , ^7’> COphospahatidylinositol triphosphate binding protein, ## CO GTP binding protein , # # CO iron-transport protein , t h CO Ca-dependent phospholipid binding protein , y y h COurea transporter# (O h y V y zK — $ w-JHqp'S ^wmm?. mmmm, #co#m6#m* o V h^RT-PCR(CZ FISH, i&lMt, RH y t° > ? {Z. £ % %k&fc{\L #C0^^, ]) 'S \f -r > y ?'s ;< y %(Dm% t £3?5t£<0|pl£, V , KO-^yy^m k F##(7)9 fimfrb$u&t 3. 4 -^T, (ccv^T, ^ ^/ 3. 5 simxtt 1) Suzuki Y, Yoshitomo-Nakagawa K, Maruyama K, Suyama A, Sugano S;Genc 1997 Oct 24;200( 1-2): 149-56;Construction and characterization of a full length-enriched and a 5’-end-enriched cDNA library 2) Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ;J Mol Biol 1990 Oct 5;21 5(3):403-10;Basic local alignment search tool. -70 *4* 5±ficDNAEM3fc$ 4.1 MS «m Ic 9t$ L £ £ L ZtS& :»ia®«ti»y/A«%#»T»555£»cDNASFOTF%5£ 6 CffLB8i •§£®4:ftS*#ICfflSSeL =k7 6t»tt. yVA®«|gBy-®#Bg^f6fflS*M%®fci6®ttMS$6BC*y-5r t^Biet" 4. 2 tmatx*» *11 -y k>y&icj:5cDNA7'f y? u -a. 0f« y y ;ex m ^scfif e. c -71- 1057 5-f75U-SiJ ±icy^-f 7—^^% 998 ...... -...... £fr 480 kat (Katoin tans) 410 /mm 7.2 kaia 9 2.61 coif 148 ±lC7>^A;i 59 hrc (K$K±6IBga) 253 5>^A;$¥i6|E«$ 5.36 cas (a$!iE¥;ft®s$oea) 82 hsi (fit*) GC ## 50.4% 155 w-m&mm 1680 ...... —...... 1798208 a it 1057 -72- 96, 373XL, $;tHDNA Analyzer 3700H=koT##fL, 7X7X$r#/c. ^(:77< 7 —j;oT7 D-7^^ 0 1##&/:(% ATGC7-7X7X7 vt77U 7 7 h 7X7 (7 7 h7x7IM)&mWTE?iJ^-7 7 vi:777J77a37-t:77-X7-7X7X(Ai##, y7<7-^fi&l:T #Am(7,8)aw-D^7>yA&^#mL^. 4. 3 eaa##a (1) mmi£%mmMk(D%m 7°7x^ MDNAmmm^LT^zirn^-3.7;i/& (El) £ rfcE ±TM#(:f7vx lE^I^E^^n^SSISc^itgeL/t(E2) 0 E243qiagen _0^cd ^:7^ QIAGEN kit(D@# (REAL: R E A L. prep96 kit, Turbo: QIAprep96 Turbo kit, Ultra: QIAwell96 Ultra kit) £ , +G50ti96A7°l/- M±4£T'0Sephadex-G5OX t:>7 L/c C £&. +ExSAP^Exonuclease I&U^X hf 6 ^Alkaline PhosphataseMS L ft Z. £. . E£0 BJ ZfrfeJ: 7 \Z, ^72a7M, QIAGEN kitfe H & $f & 7“ 7 n°oSI: 4A& »MDNA*ai§aPlffiTf*ofc^, QIAGEN kit&7^M£#X £ tlAc. (2) E^m^mm^i^cmug ##1,000^7 u 7 b AcoZ. 7-M¥E7.27:& o Ac. -A, (D? EmmmM5.36T&o^(E3) mi). PEttDNA Sequencer 373XL, 377-963; Ac te|Rj#DNA Analyzer3700 i2 T gR # E ?'J 8t £ £ f7 7 E (D# # ^ ff W, 1A > Af— ix S 4^1 (2 UPEti: (D AmpliTaqFS-BigDye terminator &, Amershamtt(0Thermo Sequenasell-ET terminaterii $:## LA:. E^j(D###%T(Z)^A#^## A^f#^lTW6AL 7-7X77 7777 7«h(D3 7^7/ t: emmu/c. (3) E^J7-77vt77U77 E^J7-77'y-fe>7U LkmmT#^A -73- 1 yyx -r-4 — jiJl'Zf \y — h Mc&1ml0ia£#|JCA 2XLB ae @=i nx-£/nt§tmi@ 37 t, 20- 24 h%# v 1500g, 15»IWC«*afc* ±»€:ttT« StX 300ul(DPlyx''7 7 7-*i £AHXx j;<#^ 300ul(Z)R2/X^7 7-* 2 ^JfJDX. Tape pad£(iU8-10IU;S-tf3 ESTSttmttE 300ul (D P3A777 — * 3 &AQX. Tape pad£li08-1 OHJS-tf-5 4°CT10$Wcg r?v h7>7^;uy-yi/-h (77002806) /t+i-A?—/i\—vi/ k icx'f —y^ijuyu — h&t v h Ux ±(:7 V h7>7< jl/^-yi/- 7v hv>7^;i/^-yi/-Hc@#mm& ssoui 3is daic 600ui fvyo/w Tape pad £[*0* 3-50*5^ | 70%x^/ —)\z$t sooui roinx^ I 6000gx S^PbIjS/Ox ;5t» I %%&15#f8x I S-^xJUlC 75 - 10Oul DNA>S>$ V^U^X^U-y HVN45 (*) (*) )]7AP-y^ffll'TShephadex G50 4rfn#>x 330ul (7) dsHzO&AQx 3#fWm±##mx 950g 5 950gx S^iayOTdiS v (*1) PI : QAGEN # 1 9051 (*2) P2 . QAGEN #19052 y (*3) P3 : QAGEN #19053 mm DNAy>y;i/ -74- =IS8*#*«*Skelt , -f >D— h S*SA'6M«a4tC^'0#*ro|||*|n](D4.^x®y5'f T-iSffSrMter^iiltS^SfflLTc (04). c®ct«. v-Ei+*[6i$sse-sc«*., E^JttA^fcTciaswWttE^Jtd^n S-SiXi-y/cftSrSWU *#®7 o —7®A#KRS#aA5±X\ $7c7 y-b >7" V Sfl fen >-b>+fXE?iJ©*®, S*tcMt>5ft<@$6*t"-5±T*$6T$o/i. S«»gi:Sl »£7 □ — >fil:#ltyn y h bfct>©*i05T»5. frSfc D ¥i^480#*®E*® 0 TW«ife5ttf fTfcnfc (*1) . (4)#i$gt$B«A«*®to*®o * X 7 A 7 7 -T v - *5IS tt ¥ °J m ft R * E SJ * ft tt-A > il - R JS £ ff it $ A -5 E FI 9 £ M sstfcwtffefe, ->a v ham?*, A®e$, 597d-> tCftlAT, h5>X«V>SXtt$:^*CfflVX 7*7 1-X-SsfifiCTflS Lft (*l) . C® 7-647n->lcliBUTI$Se>tC7 3 y HA Uft. SSI# V AX M/y A®ft»E¥ tfeW'fttAfgi^lCltibTti, (T) 19 (V/A/C/G/T) f; H o ft 5 S W ® f# @J X 7 ¥ 7 — £ #f # S H L TWWSifctlSttSSriiJiigft-BLitift. 4?D->©*UAX hi/y 91® ¥^*11281? $>o fc 7s. AtVB#8lj7,7'f 7 —SgLft427 □ —MICKl LTt2, A©¥#S6X8, #XS**136T $> o ft. 4.4 SSS ¥#l< >D— f-:S2kb8#©cDNA7 n —MSAMtc^rSEJiiln^ttS'f 5ft*l:ii, 3AE ?iJ$ftg-r5.fcii)®->-7x>->>7*ffl«SDNAe, 6 • Sfi»f*fft5CitfegT»4. 962t¥l/- K±8ICT7)l/* U SDSft £(t7 79 X 5 b*#ES®ffi%$7tttSXCJ:0. -A-fttMS. • MS*St^of-?7'jt/9'JCI ff#&;&t£;ift©i*,S»©BBAE?iJx-7©#f#£nJ(it;:l.ft. $ft, «#bJj|6ftM-7l> '>>7*7'>>®®^W->»J:Dfl,«fe6.n-5»ffoI|g^*X«#:ictMt'T, -f >D— MW *#^6iwsi$6iLic*7#x®ie*i=]®*^roX7'f7-se$M66-r5.#iii8esffli"5.r LftU, y>7Xtt®*CT^:ftE?iJ&ft^lffi AESJ ® AS £ # 0 7 M 7'A C ¥ x -6 M a y f-AMft®*»i, #S^#@2^©X*^&Ct:< Wft6#$ljT&S. -«tLT06C. 7 0->Colfl506(^7 7-&*5018$S) SrXEtCMS T-y>7'Alctt®-®o/tE®tt¥lR 0 8866.6® ASS:¥Uc»X h 5 >X# VXSXtttttE 0 M666® ASG@@A^±C@-6^*)6. £ft7 □ — MftDNA©## X*. X77— IS fflb>ft®fPLfflttfc. May f A >&® SWEATS&. Lft Uftft'6*#—MTSSflAfto ft —76~ 7>yA&aLTiifm#% k:kL ^#^T&3/Co -MaLTimvC, ^O->hrcO14960#MS?'J(21OO#&)&^fl@#t:#L T/\—7°o^ ^M^o —>^i99#&(D@g^J^io[g#0)g^4i'5##[ #%Mmaa?'jLT&o, #$m^UA X h (T)l9(V/A/C/G/T) 6Wo/:5##(D#gijy7 HcDNA^YX x V —^ □ — >!!£: UTltsf 105?E1(D h hcDNAX n —>&± 3'%^UAXM/'yf-^#W^^^GC#m^50.4%T& o/%. ^ ^:<(ht,39@79^n —>^|%#mRNA^A^LTW6W#B%^^#^^l^:o L#7bc gg^J^^^-A^#:T5153, A#%T2,129(2000^2^ABm%)T&6. W'&m, DDBji: ^^6^^^(2000^3^23B^&) H, ag^J^^f—A#:#:T2,212(^^%^^:&^340)T$)6. ^#^1, $3* 4^ vH>^^cDNA7< 6^^ 4. 5 (1) Paul, S. and Declan, B. (2000) Celera turns to public genome data to speed up endgame. Nature 403:119 (2) Maruyama, K. and Sugano, S. (1994) Oligo-capping: A simple method to replace the cap structure of eukaryotic mRNAs with oligoribonucleotides. Gene 138 : 171-174 (3) Suzuki, Y., Yoshimoto-Nakagawa, K., Maruyama, K., Suyama, A. and Sugano, S. (1997) Construction and characterization of a full length-enriched and a 5-end-enriched cDNA library. Gene 2QQ: 149-156 (4) #:@'ilRi#u #*^^(l999)#M§%t: hcDNAfB^K#T^%7°Oi/J:X h E: Y /- V —Vol.57,No.7,pp47-48 -78- (5) Deininger, P.L. (1983) Random subcloning of sonicated DNA: application to shotgun DNA sequence analysis. Anal Biochem 129 : 216-223 (6) Yamazaki, M., Ono, A., Watanabe, K., Sasaki, K., Tashiro, H. and Nomura, T. (1995) Nucleotide sequence surrounding the locus marker D21S246 on human chromosome 21. DNA Res 2: 187-189 (?) Stellwagen, A.E. and Craig, N.L. (1997) Gain-of-function mutation in TnsC, an ATP-dependent transposition protein that activates the bacterial transposon Tn7. Genetics 145: 573-585 (8) Stellwagen, A.E. and Craig, N.L. (1997) Avoiding self: two Tn7-encoded proteins mediate target immunity in Tn7 transposition. EMBO J16: 6823-6834 -80- ms* cDNA 5.1 mm ^^□yxn K£^T-^fSlT4OOO*&±0cDNAffim*ft££n* £ 6 12^H«¥PB1 e,^lTW6. awTkL sosui ft£'(7)frL $£>, tc:£ScDNAI:: j§ L fcMfirZ g ®fr#j Hff 7 yX x A £, ##(D CPU£MLfci65ttf£&7-XXT--'> 3 > 3 tJl/r-^^-x^M^tlilfco A 7 x X AH^JSOOO*(D cDN aE^iJ£XSUTi- .kxC^oTW^,. % ##?#& t> fe0if^Tt^o fX h^lfo/co 5. 2 ttfflUitV7h15x7, 1«MI *7Xf-ATmw^V7h7x7, ##g#$rJ!XT(:^T. (i) mmmtii'jy h^xx BLAST 2.0 (NCBI) [1] National Center for Biotechnology Information (NCBl) 12 &£Q& ProfileScan(GCG) [2] Michael Gribskov^#^ L fc 7° □ 7 7 -f )l/Mtif 7 )V zf U X' A f21^ U X, PROSITEft ckZX SWISS-PROT^ 6%#^ft^7°D 7 7< ;l/X—X< —X^M^T, f filWiie • -81- SPScan (GCG) [3,4] w&a% HTHScan(GCG) [5] SSK F X ^ 7 7 7 ^ U ~ CD T — X ^ — X Pfam Release2.0(D "f DaraC^ lysR> HomeoBox (D&7 7 ^ U £f£j5K£ftfcfi^fT?y£ffltAT, C(D^^6l3^#^Ll:^\U'yXX • x^xmcv-FL, m tl?% o CoilScan (GCG) [6] gttiKDmm'Z&ztifcM&ffM&m^TZ'rju Frn^Hijg®## shirrs- c# F7<;Fi:mm^a, m-ws^MVfrtDii^x f 7 7 F £ S o n ^ ;f F 3 ^ jf& <£ tx\ 30 TestCode (GCG) [7] M6E^6S6MSRM£^ST5fc&DyDX*-7 Ao3SSSi;:tiig|-t5S$ mMmi±#(D#^6(: y xxwjxdx 9Ao%^(D#&T^*^%gam(D4:(:mmm^u 7 xx&?#LTi/^7#^^$, c, S08uiTtiSS6(DiL#: $> <5 o (2) g^J^-X^-X GenBank Rel. 116.0[9] EST_human, H sapiens, Mmusclus, Celegans ^r^fflo 3^ 20 0 (CNCBIck 0 %# SWISS-PROT Rel. 38[10] PROSITE Rel.l6[ll] REPBASE(#^ — #^^&)[12] 3 (3) ##77 F7%7 Sybase Adaptive Server Enterprise 11.9.2 Sybase Adaptive Server iXRDBMS ( U 1/ — 7 a 7" )V • 7 “ X ^— X • 7 ^ — 7X7 F * 7 X7-A) ^<^-^(D#m-^^#^(D|%^7X^A^##T6^a^T#6o ^(D^:^), IBX:# f-^(D^l@&eD*7Xf-A(D< 7^7 7 F 7%7A L T i6 jS T <5 o Apache 1.3.6 tf—/7 7 7 1lX L7>7lW >77 X — X £ jf T £ & HHTTP^rfiJffl LTWWW7* 7 7 1f&S(D7 7 m w 6>nT^&a C5T&a. C(D j:7^< > 7 7x —XeHST^^^toHTTPth-yX-^: LT^< (7^#^#3Apache^#m LTW Netscape Navigator 4.7 *>XxAll, %> >7 7x-X(h LTWWWX7 7ifL/c# ITl^. Netscape Navigator H WWW7' 7 7 if H 43 tt £ X 7 7 7 FX7 >7"— F (4) #m7"7^ F7^--A Fujitsu GP7000S model45 (UltraSPARC-II 400MHz x 4, MEMORY 2.5GB, HDD 18GB x 5) *>XxAll^r(D#%±, U £ (5) &mnm Perl 5.005 02 SUN JDK 1.1 SUN Workshop Compiler 5.0 -83- 5. 3 ->XxA 3.-V- l:yXrA —84 — (1) 37M WlgTSISrfTO ICcDNAttiST'Si|Ofc&ro"SKSa5lJ(B#f«» •t. B6EJStfr©yc#>®l8Esn7c75 y KE?'ilc#76"7 3 y M7JJ#tfTg65r'lcW B tl -5* o -en-en®i»#r7Di''7Attg*))Mce»)3ti, ismtiu 7-737;t/7-7^-x± cfe^^n-So a. a-1. S«E^Jc9#to43 H *0 ® £ f* E ?IJ £ & ffl T -5 Tc tb 1C Jurka iC SMStlTV^yy AE5SIJ- ESTE?iJ. 7 S y mEM#t:#LTf f y 7'&*mLTh BLAST&IC j=5#R###&fT7. a-3. ga*«Effi«®^ai cDNAE?iJ®*6M8Ee«i: UTSE*t£®ORF$*fitti, $ 5> lccDNAlE?iJ®569^ e, K >$T£7ft-e*tl©E7#.t£iC:i3IAT#fitiiL, £gaK*8ES«±:-f 5, ’ C T#$Stiycg6K#E®«tttt® b-!T#ESnyc7 5 y KE?iJlcoHT^-tl^!nHT©)Sff7,n7*7A6SlfT"r5 = b-2-i. misi##m©%e IS *D ® 7 3 y BSEM (CM L TBLASTIC =t 5 £fr 7 . b-2-2. a'fyAlDg'rt Motifs, ProfileScan, CoilScan, HTHScan, SPScan, SOSui-$r'7"7l7:7lif|f77 •£>» (2) y >7 — 7 I — X»# <77-7 x-xsta*e® 7-7 s-ieur. jzxicm fl-dtirtiO . h7-7&iy LTf 7C©@$±©WEB77 7-!fT)T6#$& ff tt A-5 J; 7 (CTS: o TVi-5. a. cDNAE?U©®! (I C* cDNAE?!JttSSJSeirS:fB$75 C chlcj; 0 —# L T* 7 X 7 A ± leg? 0 E ^lt$ME®7'Jb-yic5)-tyT® O&tf C j:T)SiiII6T'*0 , E^©Blnt)tone=I|gT'$,-E.. b. ##yo7'7A©m&A i-f-ttl»#f#St74ii5'iy)l'-yi)B#7py7ASI}?L, HR#7 7$7 V 77-OKE) 8BE<£> *rn E5tJ?;v-? y'JU-yroiltlR j test40 jRttESmttitDKg utf-hBe?'kD«tii x1&5-r^'5V -«tt5Ftousu(eP invertebrate <*■ AND OR mammalian _^j 7T-;^^^LSkValio Utr-h<7)V^.y T5 P" U&U »0E9j(c*f^ P oy-tr-ftomfr 8HeHK«r gyy #R$tiy=75yKE9jl: P ORF)6tiK7)*fT RI%3P>|TTG.CTG.ATG **jk:alOiTAA.TAG.TGA '> — P ri it_stop <• -* — * -* P SOSuTOXfi- P SOSui0)Xf7 |P SPScanWXM fP SPScanCDXM TT^i/a^l |P CoilScang)#m P CoilScang)X?T Motifs,SOSui,SPScan 7f3f>a>l P HTHScan<7)XfT P HTHScan(Dm% btz?i/*yl H^y-ay-I p ProfileScang)Xfr P ProfiteScanflKfT ^•^'>a>l ^z?->a>l Hfrl W7 * 7. E2:«#ryD^7A®ES ej c.r 1»< % p ...... ,3H<«...... TW____ • ueosHIH: TK1____ |(8d< q • ueoSdS; Ttt____ |)SglE#« • '"SOS 35$ IM±X • (.? U3Dc;a;nc.H Tft____ . 3‘* 35$ |lii±2£ • Ttt____ 1*34^ • 35$ N±X • ; -6- f* — TW____ |*34^3 . ue0SdS: j ; J flt'Sil 3T1____ |&S*ES8« • '"SOS j : J j\ L3A0|.M .35$ |li£±3£ • ueo S5l!^°Jdi J : J -I \ 28 A01M TW____ |834<^ • : 35$ llii±V • SfUO w; >3A0HJ TM____ |K34^3 . : 35$ |li£:feX ♦ [UIUI - |xeui 9H|PA-3 • : t - A- 6; Hit _J j j ji 3Tld9 ____ I • ew: J J J J J J fr 9 A0^ . 35$ IMiX • |uuu - |xpui an|P/\-3 . : } - 4»--4tJi'l' M2B8 SSt. d d d J d ITw d d : d J d 2* 3J J-00 d d : d J d: WIN / V d d i d J di "FTn (HJiHfV (5)»F3iunumioo <0)£^ IZHi! adeost^ - suwa( ,|nsai ias*^ d d ■ d l d Tqfw d 4 d d d d d J d 2Tw d d j d 1 d d Ww a d i d: | £1f'^ a d d d d : d J d: Ww j uaiebss j i i j j J J i J; |S»t rw 3^rr. I 13S3H | nv | S8!^--a riwriT |%* | 86* fIji?=7T d* | H-^O-hiS C"!nsos 'jnowr swtitlsjBt) ZaJZ-LiL ifv (S)jo)got^jiuioo (3)W» <3>1t,KC ?2 c-2. &cDNA«©*#T*SS©M-J l s-cDNAe© Jew @%(&) X?F RJ'i sequence Iramc2 | Jrame3 | ORFW% | TeslCode | a & ±%) human P BLAST H sapiens ;ACmii0'OT BLAST swiss pro t OC8S63IMRP3 RAT [CANAL I CtlL AH MULT 1 SPEC IFIC ORGANIC AlffoN TRANSPORTER 2 (MULTI DRUG RES 1STANCE-ASSOC 1 [Q1 5496 i MRP3 HUMAN [CAWAlTcu La R MULT ISPECI Fie" MCANIC'Tit IOMTRANSPORTER"2"^(MULTIORUCRESI STANCE- ASSOC I mtaztitc^T-y 1063120 IMRP2 RAT.... (CiNAriMAR'IurnMciF'lc'oMAMi'C ANION"TRAMSPORTER"T'(MULTL6M'WistANCE-ASSbCI 032887 IMRP2 HUNAN[CANALICULAR MULTI SPECIFIC ORGANIC ANION TRANSPORTER 1 (MULTIDRUG RESISTANCE-ASSOCf iP33527 iiiiiPT iUMAN ;MULf IDRUG RTTlSTANCE-ASSdcl ATEO PROTEIN 1...... :Stert_pos i t Ion :Stop_po*l t Ion Mot If .Sequent ATP/GTR-bindins site motif A (P-l< ATP/GTP-bindin* Start_poa i t i< [CFfOLVLSALPHALXXvLSACVL jSECONDARY j j L A L A L V A L L F A PA L V L T V L f H C Q f PRI MARY | SOSui \Z J: y ^ mztitz&nm LTiiAi/h JET. ca ^ $ b IC X ^ □ — JUT ProfileScan, HTHScan, SPScan ti&lf&zns—AOT’Sv'&R^J :& cDNA —K c-4. TestCode(:=k •So TestCodebckS^ ;] 0 7:TestCode \Z ck S H: (5 MM tfl (D iffiffi 5. 4 (1) FL3 > V — '> 7 A l*J T'm ^ £ ft 7c5151*(D cDNA@B^J (2) a. REPBASE^) ——^/a L^, — V y#^30bpm±(Dg;^H^%E^ja LT7X/7 L. # b. ##Em:#T&4M%#m(2(#fm50#) a.TV%/7 ^l(DcDNAE^J&, GenBankgenomef ^ b^a Sbh, m&(DE^J(:*LTfB[wHt#m^e3/Co M#(:GenBankCmm^^T bS b hESTE7J(:MTS##,SWISS-PROTb##$aTbS7 $ / EE 7Jb *1 b T b fg L/C. c. &^M:33WTI%#;=] ^ck^cDNAE^J(D^M^6#%](7) K Ll50bpl%±(D&(D#^##(D&(D&m8*#B:Rm^b LT#m b ?bo d. 7ay#E^(:^'fSm#g##f(#|5l%#mi5Nr^l5^, ^(D#(D#^##Tl7^^30^) #tR$;K;t#axm;RM#=ka'30(D^#(DEfij&7 a y EE^jcm^b,### #T&f7o ^:. f8M###^SWISS-PROT(:##$^TWS7 ^ / EE^'JCM bT^ff b^:. -91- (4) a. SE^'J 2000bp](: H—y 4000bpa±(D@B^'Jt)139*^< 0 , ##(D t,(D^ 6?oibpT&6. 10007^ %#6o*'a'a;fiTw&. j;e, V^o &&> lSObpRTcD^toHS^HSEE^B: LTflffl LTV^j:Vx 5 0 0 0 'a 4 5 0 0 & UK 4 0 0 0 *« * (5\ 3 5 0 0 ra 3 0 0 0 n m 2 5 0 0 Sa 2 0 0 0 e u 15 0 0 m 10 0 0 EM 5 0 0 0 0 2000 4000 6000 8000 cDN ABE m £ (bp) 9:cDNA SB^J S \Z M't Z> $ S ESmSOME b. MEyij(D#m ^m$nfcs«si^jtt^T©at)T*So ^w-ii3SSos«E^j^5ii67*»w$n, £ ft ti^TcDNABE £0 41217bp lm0il^T25S, 01O:fctfJ£ftfc££E?IJ -93- c. (phases) ^fl^fl(7)cDNAE?iJ 3500 3000 2500 2000 1500 1000 -ft tt*(%) 0 11: t hy y cDNA E^J (D^SIS^:© —IM d. Motifs,ProfileScan (7)^^: S6K(lSE$il/t5139*CDgH^JH^LTMotifs> ProfileScan^rUfi 1 L £CF Motif E?m Profile Prokar Lipoprotein 157 g beta repeats 66 Prenylation 17 rnp 1 31 Atp Gtp A 13 mitoch carrier 20 Gram Pos Anchoring 5 protein kinase tyr 16 Cytochrome C 4 cadherin 13 Tonb Dependent Ree l 4 thioredoxin 13 Ig Mhc 3 zinc finger c3hc4 13 Mitoch Carrier 3 lim domain 12 Ribosomal L35ae 3 dead atp helicase 11 Thiol Protease His 3 ubiquitin conjugal 11 Aldehyde Dehydr Cys 2 amp binding 9 Pa2 His 2 dnajnter 9 Ribosomal S14 2 c2 domain 8 Sugar Transport 1 2 fkbp ppiase 2 8 Ubiquitin 1 2 atp bind transport 7 Zinc Finger C2h2 2 bromodomain 7 34 om 492 258 1+3039* 746 l:^a -94- e. SOSuiH £ 5 MS 6 SI wBIJ fciRsnfcra i3m±ommm^u t¥$['£tlfcb < 300 o 200 4 5 6 7 8 9 10 11 12 13 14 15 ^ "j Hi2:sosuin HTHScan CoilScan SPScan cDNA * 230 1001 763 #& SPScanT^£B£ tltc i/ ^ 7 H£J^T IZtjsT 0 0 5 1015 20 25 30 35 40 45 50 55 60 65 70 -95- 5. 5 1. Altschul et al. (1997) Nucleic Acids Res. 25(17):3389-3402 2. Gribskov et al. (1988) CABIOS 4(1): 61-66 3. von Heijne, G. (1987) Sequence Analysis in Molecular Biology: Treasure Trove or Trivial Pursuit. 4. McGeoch, D. (1985) Virus Research 3:271-286 5. Claverie, J. M. and Audic S. (1996) CABIOS 12(5):431-439 6. Lupas, A. (1996) In Methods in Enzymology 226:513-525 7. Pickett, J. (1982) Nucleic Acids Res. 10(17):5303-5318 8. Mitaku, S. (1998) Bioinformatics 14(4):378-379 9. Wheeler, DL et al. (2000) Nucleic Acids Res. 28(1):15-18 10. Apweiler, R. et al. (2000) Nucleic Acids Res. 28(1):15-18 11. Bairoch, A. et al. (1999) Nucleic Acids Res. 27(1):215-219 12. Jurka, J (1998) Curr. Opin. Struct. Biol. 8:333-337 #56* cDNAf-f a—-ry 6.1 *S thy / A(C#cir£^:Fl'CIA/5#6*(l, mRNA£IET 5* >/‘(7SI£&oT':E‘®$ll6£ cDNA##:a@m#f acan. c®^###ei:mKmicm#i-*£ »SCi;5iT#5fcs6tt»T*MT*5. f»IC*(*±4l:J:5*U^v "J tf >y'ri <1)-¥)ttWA^W± 5> K±5+t'7yh7'V/1-tt %S:ftcDNA®f*#$, »''rSDNAW%Rif®/hEft,,Vj;£5SSroS«AiE%SnT*; 0, cDNA®£ftE?'J®tlf% ifc. cfi6#A,-rm6f®*myo?7<;H:MT-5f-^®e# t>egti£ieLT#fc= mmET®%###^aaLT. /-tf> • ynyxy > <-*WT*-S*s, »#®*6f l:o®T®##t:*LTti, #6 eg •*ss#ra> RNAS&itfnx h®A^6wm*im#T&a. S656SS#*¥K**®ttlS*tilW±lC 5"^- X (£-f-5 ;: £ £ Bis i L fc. A < X * - y y h • ->x X A ® WSIC te, 96*® + X tf 7 U - £ fe o fcS*DNA Analyzer £ *§ A 1/. th > y;h®®S US® £96i#edLSti:TfTA 5 <£ 7 C& £*® #8liFfSJ6®8!i*-ft£tt®75:= ifc. ATAC-PCRSffl®ite*#SW7,X'f v— 6toE WV7 M3t-X®9V7 hdoit^x-^eav 7 h®E%6m$CfTo7t„ *#%?(!. ATAC-PCRIC=t5>$S®(SStt, #$tt® —M®|S]±£Btt£ UT, ATAC-PCR®ft SB «Zp 1C, 3®@lcaS£XxZc®(s:Pfi5fcfflto£M®-5 £ £T4L-r U y 1/ —-> a > • * —7’£ -97- 6. 2 (1) ATAC-PCR ATAC-PCR (Adaptor-tagged competitive PCR) tH X [A5. 7 7 7 7 —] [B^7X7^“] ADl-F^S^P do##) adi -fN^S^F ( lfoli) AD2-B$1 ( 1##) AD2-B$3 ( 3#i1) AD3-ftgRS*P (3##) AD3-E#2 ( 3#li) AD4-E#2 (1##) AD4-F^I^## ( 3#i1) AD5-E#3 ( lfoM) AD5-B$1 ( 3#ii) AD6-P^SS^F ( i##) AD6-F^^#$ (lOfnl1) 7yT"^-(D#mE^$r^trm:^#7yy^-y7<7-6:m^f #m%y7<7-c j: OPCR^ff 7 o d. 6@E0PCR##&70th< X(: j;o T^#T-5o ABI3700& <7 U^GeneScan V 7 h (PEA' <^y%7AX)^mvx j;opcRm#(T)#w:^#j^76c 7/^67^ (2) 7—7^-7## ATAC-PCR# i:j: 9 #6a^*e70%#m^##C^A&cDNA(7)mmE^^ 63> — 7 — $:^V^T#EL/T#6ft-5i##&ftA0L^:7—7^\ — 7$r##7-&„ [IU1] ATAC-PCR(7)JIS W&1 ti^cDNA AAAA TTTT-b XA m#2A$cDNA 4 Mbo-l — GATC--- AAAA GATC - AAAA —CTAG --- TTTT-b)—0 TTTT-b CTAG 79V9-A $:#3A$cDNA AAAA TTTT-b)—0 —Zf~p y(~7— —GATC------AAAA CTAG —_ —. TTTT-b xb^hP^iP^A Aa—* GATC -AAAA CTAG -TTTT-b PCR A/w GATC AAAA CTAG <= -TTTT-b -99- m• -r 7x-7£fi#tf z>o A-7# #cmi\ z\- F ^ x 7 • 7 7 h7x7W@^6Eff'>7f-A(7)ilSl£E7T<7)ffi Off 7, a. /\-M7x7S/77A## (D 3>bf3.-^i>xyA##^j:U:'e^6(D4;v E7-7fE (2) publicDB^ 6©T“^A?, b. 7 7E7x7i/77A## (D &77bx-^^77ATOy7'y h^-A(D#% © © M^oem:B#;sb&v7h(Z)ffGE (3) © 7V7h(D#^ © x — y^< —xixst © j;V^7> - 7- < >^-7x -7## c. a- k7xT/y 7 wx.7mmzintcz>m%$kftmm(DmiL (3) mm • mm a. b bum h —^;i/RNA B (CLONTECH #64020-1,Lot #8040076), S (CLONTECH #64090-1,Lot #6120263) > /hS (CLONTECH #64039-1,Lot #7080714), 'L'E (CLONTECH #64025-1,Lot #8040067) A# (BioChain #061003,Lot #A20804l) > (BioChain #061005,Lot #A212066) WE (BioChain #061008,Lot #A20604l), EE (BioChain #061009,Lot #A205002) B A# (BioChain CDP0610125,Lot #A21102l) b. mm HepG2(A B *SIE)> Katom (A H *EB)> lESk h jfi{£6 h-^;i/RNAte, &mm • m^rco ie#mm^6(Dh-y;ERNA(D#m^^&i:j;oe (1) A<77l/-yv h • 5>7 AA## a. yXfAi^ ATAC-PCR il 43 , A )V W] X' 0 7* > 7" )!/ M #T £ A f 7 )V — 7° y MET %>tc*b\Z ABI3700£SA Lfco f£^CDABI377(PEAf t'>7f AX) Tft. f JUlfe^f; D ©^1) ^48th>y;i/T, lB$,^:0l92 th>7°;i/SS C fU: AO A T, 7°7f 7 —f:II-^PCR^F<7)17>7°;^ilSH::96A70ly — ^2AlmT£&c^:\ 7°7f 96ae^MfiTff 7> -101- )\< Xlb — ~f«j Mfc£it©Ac. ATAC-PCRX ffl © •$. FAM$t6 *5 J: W'HEXfi )l£tt©7'7'(''7 —S^tl^FtlfiSSr^xT ABI3700X#mUT#*#l%U/cb C6. Y-©»SCtt*L LAc#6U>:yt3, 0~1.5fmol/MlT$>3 Ac. ABI377Tt3, 0~8fmol/« IT* o Ac C £ *' B , ABI3700©®S* i»V^C©-y->^’JF©#R^S©^W*5£>S t7j:o /to PCRS#t3300l68 gC#R7^^#Ai&0, Ztli^-KL. *Ac. yX IzU-I/COUTtt, SXOT'y'f' 7 —£fflWtfE6£L AcPCRSteS-SiigttCBKS^X E4©j; 3 epcRBttoaatafttttf Jtw-f *>£«£ l Tf# BtlAc. ATAC-PCR C35 (7 5PCRS#it 5: ABI3700$ffl tAT #§g U/c->X 7Ax##xe&c £*mmxe/c. ;ffly7ri»*iii, ^A**i8#*$R**£© *RHJf?ECJ;OfTofc. b. UX7A©K#$# #eL/t->Xy-A©W«j3 7"7 U - £dnrH5>7)i. »e^#SW^X'r7-(3E?iJ^i:CSSI7-r%£'B^*0. HtSCVX b $■ L 7c = ® #mWXy<7-©###M£ UT(3, cDNAE5iJ©3'*>8C-#;fiV>M®gN!l ©«Wr8M4/oB3'*s6$TS:»9 tirlU, 7?X?-A73 7-tfflTml©g^!4tKrtT S>-5lf©4«a*«Ac7J;3AiX3'f 7-4fbJ*UAc. CfflB, 7 7 7-£» gf ?< 7-A<$@3m%xy< 7-^76A^#©MAmB&4*Be(7.a f*.6A'7)#g0t,©S«#(LAc. © $l©8i®B W (Mbo- I )T©XT' 7 —»i|$StT£Aj:7b>3 Acig o', Hap-II, Bfa- I ©KMC -102- [03] FAM.HEX^-fV-COiSSjr^^S s6 1 r' m 4 n ♦ FAM lM 9 ■ HEX "2 it*♦ ■ » n I V - o 1 :l 3 S$ (fmol/yl) [04] pCRj$tt»jt®jfe8 « 50 ♦ H 40 JS 30 ♦ 20 ♦ # 10 « 0 ( 5 10 15 20 pcRSitofi(ii$jt) -103- ffilft-bB-f diT, JWffT'S 6-fd 8#£tT. Mbo- I©*T t>m^S:#:©l5%l$i:S6UZc^,M®S*e30Cie^1-d«h T. 1259«E^=K 7ilEf ££(0, 4#®l«KTt lx.5dti!t#t. ® fE$SW©|6j±©fcl6, iSSc©E?iJlcoHT®-}6ffia$nIlgCL, 290, SJfgys g'jc«$stt(iy i/- h96#E) dticsatT,^7'f T-®-K^tb*r?>=t7iCLfco d. x-7M ® ATAC-PCRx— 9 COffifff V 7 h ATAC-PCR E =k 0 #f>4l-5»|gx — 7(1, - eme?©pcR##+i-ixi:*iSLtmm (##) - %# a®#) T&6. W/c, dtie.4. PCR##tM' XM. £/c##x-7 (% 1/7 hD7xD^7A) (025r#!l), d ©i»x-7 © + ;!P E> tf-7 £ *& s&*efdii;dnsiwtL, *#tt©-Kafb&. © ATAC-PCRx —7##f* a## ©TMELfcVX h$:*IC, •AS7Xft“. BS7X^f-©fflffl©mi;Ef-7IC#L/-C(DfTf&. • A/BS • a/bS5 £©ofc. #&7=-7/:#AA'6%##**%^?©-a©#f$&Web±©##?e X.5#i*$HSU£:. (2) ATAC-PCRd £ l/t5. £#?Wgefl5ilg?Cd>V»T©SI»*aT©#«St V < tSBODYMAPJ; 9%91)8$7!>W>iie®$8A,;<£<,Ef4E^ 6©%S#cDNa7 d-> ^6taoRF7li900bpK±©7 n — >&MRLfco © ATAC-PCR lC o £. 2 8E©)lfiT -104 "XT' ~f 7 tf* — z.t\ t> (d '4l\t\^ (D j — 1f > 7° □ y hcox —*? ^ 2 y 7 — ATAC-PCR(7)T-7 C j; 0 > ATAC-PCRUct 3 X% ZtlftT — ? 0 ff $114 Northern blotch 15 EIGHTS) 6ch#A6^l6. M5(:, ^ ft ATAC-PCRT' % m M & & f# ft 26SIM (D [S ## IS 7b SB PI H 'O tb T fc Northern blot £ ff 3^chC5, C41611, mm(7)mmT%#^#^C(h^T #75^3^^^), ^^bb^ATAC-PCRchb(:#(-761:M675^3^. a±$r^(h^6(h^l(D j; 3 l:7^6o ^ 6 t:RT-PCRT^3##(7)7]: y 7 9 MlIORF^^U&fsTlZ^ WTt)20 @#@^Northern blot^ff ^:< S/7'7;i/^#6^l7^^3^o (3) oe* a. m, aM&j^Katom $lj;g##^Mbo-I^c LTm W6 7 ch t L Mbo-IT iS V) fc g£ |f fii T # & tb S □* 14, Hap- II . Bfa-lSrffl lAfco (D ATAC-PCROUff ^^ii^^uziawmmchT'E,. mm-mmii, ±m, /j\m, HepG2 Katoin, 6@mo7y7°f-&mvx m2i:^$^^7yy^ © K&sm### U 7^-73 #im#mo.9a±(DKT& o ^#^lC3l^T(D^7'—7(D%#^ffVX A$ 7 77^—45 j;Z7B$ 7 77"-^— tIJ aE L ft In ^c & # # 7 — 7 8U(DmgG##(D##7k^^fT3Tm2^ai#(7)mij^ L ft o b. 7-7(7)%##:^ g##j: 0 @S^b&7 ItTh, 1259m^f COUTmij^^eiX 694*^7 c. ATAC-PCR(DT-^#m$(:3UTli. Mbo-I^m^mm^fC3WT11. 6%^ ^1:45WT1461%T&3^. Cfl2% #-*#'E##3j;^B'aA#'Katoni(7)&3m^ ^chT^6ch70%, 65%T&o/h. ATAC-PCRl4i#1#Mbo-I(7)^&mv^Tfft)4l6Cch^^ W7 *E2cT'ldb Hap-II , Bfa-I -105- **r-?ffl®MSS*3(:St. (4) ttfgfSttfflcDNAx-^-T, »i*5T-?^-XV7 hi:LToracle8£SfflU 7 'S-X 0 £©l§*. E?i)©aft76*D. E?U©V7 htoftfW ffiaotfl'3ft5*!St*SI*tl*iTS-70T-?'X-XtLTi*t5Ci# *4ic, f ©x-7-©-ae^T. a. %#mtmei:#3(7%EMx-7## it 6.4 *# a. 7XxA##^f ©#f### *7n-7x7 Hi. #»©k Fi#ef ©*##%## ^ATAC-PCRt:Z OR# L. x- C 0 . ^©SlSBt UT. ATAC-PCR©/\< 77F-7'v Hb&eo%. RyXfATIl, 10afefeD967X7'ly- MnoitT- 7;b©8c«l^oIffitC7S7 ©7-©B6-e-eExT2S-f7#6-r-5 tttlH, lB*/c t)6B«ra©f6a*S^100» sa^y'ft; y 7 lz>-70#$SB^5>, it >7p;F©**3f©*jg(i£#-5 £ trfT #. g 6t:-*©#m#m&96mefeaLT%*mic#au. surf-sr tAmm *fc. #>»®7'f75u-a>6E9iJs#fcae?-c-7©T©*»^s, ABI3700* ffl (A T w# b ft ATAC-PCR v 7, Xh, A<^)t B 0 ## L T © -5 b Wiffi7? # It. lA'L, l'<7i'©M,«MX4nt. SBltctt. #i%xm©*ef ^skso a* »5t, =P-p U 7>-->a > • *-7AH^t*T#A£©oJ|gttASt USAcfc. ##rA«T#fc: C© j:7%#eti. giJtofflttSStFStti: LAcO , StofSBiSS -107- #1 Northern blot(7)f$l|l Northern blofC ATAC-PCRi: ir— signal£1$tti 5ESti:6<-» BtoiSE^ 19/(25) 17 89.5% 6/(26) N.D. N.D. EWE^iJ 2 B • HI* • ITHiroSte B • BlfcJ* • Katom©SJ$ 6«an©is a; Bt- Tyyf —i m@##(xi)rt«5«P(xlO) m@B#m(xi) m*@#(xs) rw-2 IF (xl) ITl»(x3) «(xl) KatoDI (x3) # (xl) T5V5—3 ^E(x3) m^##(x3) 6A#(x3) HIB(xl) 4 H*(xl) ma##(x3) 6A#(xl) m*a#(x3) ifii(xi) TfVf —5 fff*(xl) W(x3) KatoDI (xl) «(x3) B(xl) 6 m@B##(xi) rtw*(xio) mem#(xi)ma#m(xio) 6 A^(xl) 7 (^m) KatoDI (xl) IBS Mbo-I Hap-II,Bfa-I m# mmmm. m# 541 473 241 253 ^j'ii 131 113 46 50 HepG2 331 306 220 224 KatoDI 379 316 228 259 site 138 108 60 63 20 18 8 8 25 23 12 12 85 85 41 41 WAS 89 89 31 31 -n It 1739 1531 887 941 -108- 1 %*: pu= dWO (W 5|:Z%#glS& U) S5KOV<0 - Oiv: WS Sti0 (C) : o) m*i^a*R(oa.$wa)pu»e\»^:duo «> (?*.? : -^H (s) u) i? 660 660 loqifl i to to 83 to to t **08 63 8t08 938000V 860 660 IldeH i to to 98 31 ** 60 9 11 8 *tl S38000V *60 *60 loqi/M i *8 0 6* 80 80 80 S68t 6* 6381 tt8000V 860 001 IoqiAl i 30 *0 13 0 61 61 389 16* 361 608000V 960 860 i°qv\i i 86 to 99 81 63 t'3 96 11 *8 8 111 608000V 660 001 i°qi/M i t *0 *8 81 St 60 t88t 8*91 681 908000V 001 860 IoqiAl i tt *0 60 61 61 *0 8*0 1 88 not 666000V 001 001 i°qi/M i t* 0 0 91 61 81 6S*t 681 838 1 866000V 001 001 Ioq^| i 30 30 88 81 *0 60 886 68 006 966000V 001 660 1° n/t. # >tti8£ffl©fcffesmE|B|-©yn h3-;i/T©#Jg-ett, *#%©*%**! 20~30%SgT$>ofc„ f:T, cDNAH >-/E©!®® £ ± If TSIST 5 & £ ©«I1 tc J; 0 60~70%a*IC$?|6]±d-ti:5C -110- ABI3700 frZO&tjT—PfrZ ATAC-PCRW #rV7 hZgWj&JlzMMZftZftZ _t|d OTntcD^&^x^Lfco MS cmtAacDNAom&m^LT, aswa, ^(7, c >>, ®E >y;i/ 6. 5 ATAC-PCR^/\ &(D]Elfef3LfeM — 111 — #W, D^titi¥Sr[B]±d1i5C -h^T'SZc, $7t, 3@I@C«S$XxZcafiTS **»©m@##&$m L7c*fuxu--y 3>-*-xxMi. f -X©»&%#f & C tic$oT, 4-0#)*lfcS8*«T-?^-Xtt*SttiiSHt) ©t&O fc„ BfWX n —> COtATIl, E9l|##©E#t 6 t ICATAC-PCRIC $ -5 #J £ $ fr tA, 1739 «fi;?ci#j$^6 94iaG^iconT, 7mmim(o%myD77i';i/c6^Te/2. *#yo? «0g*WfflcDNA % — — X#L7i. h h%8yn ~7 7 'f A-5r$l-(t;U^:±T^:MICiKSU75:'T — 5” ^-Xtl, , C(Dft*tt©i«Gx-^^-Xtt*fTSaGG(D«®*#TC$»fc 0. eUl'Mc&o T fcSidTWfflT&S 4-»©BBi:UTH, *5»*T04 tW^StlfcVX h$r t— ^91 6H-&. f**5-fe» (DATAC-PCRlC^HTffiM-r 5 ffl I® » SI i: K tt L 75: 7 7 -fx-, ©ORFtftJr, ®T5 /$E*iJ©tt6, 7<£f©#jn1f«^sa?iJ«®. BS^a 7 r -f ;i'£#tcSE£L7i:x — — X©WIST35-So ATAC-PCRtl, t#x&. $75;, RNA©S*S$8 ^^»as*©i6S6*x%^, cn^e.©#x hgv atsstt-bs# #&7h A ©##%?&%. LfrL. WSXn S n-;^©<*$|d 75:5, 2 tlT IA 5 t ftH*.fctA©tf8mT<& -5. *E5ET 11, ATAC-PCR&^Stf 6R%#7)m £« me Slit*?? i5154yXfAOftH«-C SZi^-3 4-#H, ATAC-PCR©fWMfbt#W1-&;&K#:&& 6H&. g*ag©«nZci$«T$5ATAC-PCR$:ffiUfc#6g#StfrfflcDNA©-#'<-Xi$, f © «Stt. 4i$, E ^JttSmfftiil^^gtogiSitGGCXiUTfc, ATAC-PCRlC j=5f — LT ff x d £ T, *#9n?##L75:f — —XMet. $ fcKWC 'b^EH^-HT© < £ 6. 6 SlfflSltt (1) Maruyama,K.,Sugano,S.: Gene,138,171-174 (1994) (2) Carninci,P. et al.: Genomics,37,327-336 (1996) (3) Ohara,0.,Nagase,T.,Ishikawa,K.,Nakajima,D.,M.,Seki,N.,Nomura,N.: DNA Res., 4,53-59 (1997) (4) Kato, K.: Nucleic acid Res. 25, 4694-4696 (1997) (5) Matoba, R., el al.: Gene ,24, 1125-131 (2000) (6) Kawamoto S.,Onishi,T.,Kita,H.,Chisaka,0.,Okubo,K.: Genome Res.,9,1305-1312 (1999) -112- f 7f /<-X(D4#m 7.1 ## iil-iiJgU i*41ISE?iJll!f#^*i$jxoofc5[l]o 5o ±E^ y ASSSfl^lJtf S £ am LTX ifi^cDNAI^I^II^ofc^o cDNAfi. yy Agg?'j#4:-c\ ^scdna^ &im\ mamcmm LT##B##fimc?T 9 c o, A&cDNAiimm# #ft^^T$)66#XL6ft6o AAcDNA(Dm##@#-C&9, Zft^-Cfj: cDNA(D»f^@E^j^^m#(ciR#i-62«k^fT^^iL-c#ybo n%)\07j mt mfe £ ft TV'5 aft ^*DNAE3W--y ^<-;*-efc5GenBankft-f-XM::l£e £ ft T l' £ A Fb1 (D ^^cDNA(D BS (1^] 8,000-C h 6 o -A, Se^ftM^-T'SB#^!!) LT ^6#ef(D#(±io,ooommm^^#%.6fti:v^o %^-c, mf-mmmBrrn^cDNA ^^-e#ft(f. A&cDNA^m%A^ ihMftjJDiSi-5 r do cDNAfiWft^V'Tft, ^ !i y 9 *$ DNA f-^lTit^fcDNAoitf^ii^ftoo^^o :ft^iit©Msa^j^^git^ < r £ aft 6 A $fttl'5o -ft6k hA#cDNAe##ft±M, Bit^xy^y^fAX, r, 5,001 y n — >l a 0 e r (Di*i2,5oi y n - y (DSS Am#A#cDNAT-y^-y%#^m#i-6<&:#ft, A# cDNA(D#imTyy-y 3 cDNAE?iJ&teirt — y 7st a0 &*>, y ^ y * cDNA(D##^^ 0 y y yef%^f^lTVft 0 (Dm&E^j^##ri-6o $ A, 7. 2 A#cDNA(DS&E9i&e (i) DNA##%m^wc^e • stfffflyy h y -113- a. 79X3 PI-100 E 7 7^7 (##$%# (*)) b. (Ei) 1029-40-2AW ( □ H > X) c. unmw>mw (02) GeneAmp PCR system 9700 (PE Biosystems) d. b° 7 U —DNA'>"X >th— (03) Prism 3700 (PE Biosystems /HITACHI) e. vx h7%7 Sequencher™ Ver. 3.1.1 ( 0 £ V 7 h X x 7 X > X~ 7 U > 7*$c^£ft) f. 77<7-m#fmV7h7%7 Oligo™ Primer Analysis Software Ver.6 (SSjato^^-til) (2) ## cDNA^jf Abfc77X 5. L ZcTcJiB^ (DH5 a ) 25007 □ — > (D 7* 1Hz P-/I/X h y 7£, (3) a. f###m ESE7J^S(7)ElB§^E4iI^1-o ##:%##7;i/^ U fcfc: £ 0 ifltti L fc 7°7 X a H DNAiC, &A%77< 7 —, ATPCR^ffVX 7* > 7“ & \Z £ 0 DNA£#fi T£o 7 XTV 7XG5O0XA7# xA&m ViT, j: Dm&L, M$#^7-7%77— b. yxxa MDNA#m#m (D ##:#^#$r3mL(DLB%%4:T, 37%: • 15#^ $E Utco 7^fn —X77l/S^^1c$jilti*¥7 77 U >77 7°(D$<$j1£@ (Mupid-2 : 7Xt /W^-(m) m#(D/%77-&^^/:^mm#^?^5,000m#T^6C aaL, 7^O-X7;i/#^^0.8%aL^:. 77X^MDNAa#(:DNAT- vr -114- c. dnau tu'&'tnu (D PCR^j^ 7°7 X ^ KDNAlSOng, 7°7 7 7 — 3.2pmol, 2mM^ft X X7 '> 7 A , 40mMh'J7ii pH9.0, X7^v7X2jLtL^llD^_##^10RLAL, 96"CTl^^%#, 96"C 10#, 50"C 5#, eor 4#(DMi&&25[R]moigL&m, xi/^^xx^m^Lx (X > ABI Prism BigDye™ Terminator Cycle Sequencing Ready Reaction Kit (BDT) -§: 572 Wc= —ABI Prism dGTP BigDye™ Terminator Ready Reaction Kit (dGDT), & -117- (4) a. 3 9. -m(:450^^)550m*(D##^ojmT$,6. C(D^SM> 3200#* #(D^n->(:oWTkL b. j;m@E?iJ ##(D BDTTf J t£ fc ”GT-rich” $> Z> lA te ”G-rich” (7)E^-^> b \Z < £ tAtedRTSrffl b£ b t \Z&0 . ffltfr&'M&fto C. VaM^SME^ #M(D^UA(7)]a#TH"slippage"0^:A, 120- FOOti X IF ROO y 7 — T (1 )W tif 15 H & ^ d — > (ItllO^n — y) |;o^Tlt ^ v ypy^^ F# fT(D#f (CjoV^T, Ztl 6(7)10^ P —yiCoVX5-10#(7)P Pzz —F,DNA&f5#Hii-6 C^(CZK). 2/5X)^5/5(D#g-e^%(DE^##^^^-e^6Z^^^A^/2. P — Z ^ DNA^^-T^v^ L/: t)(D ^ #y F)^,6o (5) Mmm#2:fLyc2500f P -X7)[fXl, ^ 9 y ^ & -Bit f\ (0.2%), —zz y ym^T-TX 7 T ^ F 6 t (D, ^#@d^Jlc j: 9 ihL/bt,(Dyi^23^D —> (0.88%) ^ A 5 cDN A y 4 7"7 V “©f ^ It ^ p '— V ft [Z. o V ' X (± ft # ? p “ X- ^^T##$r#y)mmm2500^ p -^(7)#^^ B m L^o HI 1^6^ (±^#(7)^#^#^ (D#AX 7^(l##f^(7)37:^±(f("B$^^#4'L, ^#^^##f(i8^y)^6^#Ly2o 9 ^ X)X(±^^^ L/c Z 9 F:^ 4^#7$^400^ p —X7)^ —2^ ^(^(192%^^ #TL;X ^#2501 ^p-X7)T-^^^9y ypy^:^ ^,^^p-y^g3z^yc^. #Uj ^ P — ># d: #^c*% #m 0 #mm M#T 3mm $#(- #RW H11.9 468 19 449 449 18% HI 1.10 300 13 287 736 29% HI 1.1 1 406 5 401 1137 45% HI 1.12 490 1 489 1626 65% H12.1 376 0 376 2002 80% H 12.2 296 0 296 2298 92% 203 0 203 2501 100% ^{d-^P —^#(:2524 (^#^ifih#23) 7. 3 ilcDNAf—M-7y7fA0^fiE (i) y^yA(Dj%^ m(zc/xyA(7)a^x,w% ^-7E^^y:Ammii^(:#Li:, ^#cDNA(±m^y(/;m^A#{:)f(cm:^W)nTy)5z^7)^m^^aTjoi9, Z(Dfr)Ufk(Z .^.y 67)'6T^6, %oT, cDNAgd^^M^^y ^ y -122 7L, L^L, ^ [al^f 0 , 7 7 &ff 7#g#C& ^>. -E-^T, cDNAE^#R%$r["]±^#^^:^^E^j(zMLT^c^-69 ^^iCckoT, bft o i>#DB (DDBJ) i>#DB##x7xA^j:3Tlf3o *7o XJ:7 j:9^:A#(D±#cDNA@a^J(Di>*DB^(7)@a^j##^, ^6T, -ecT, %\ ±#(7)^:#cDNA@2^J(7)^#DB^(7)E^m#^@^. tb t£ '> 7 X A £ $t IS b 7c o VX 3:&cDNAkL *fRf6bT(D##^^T^# LTW6(DT, ^(D###T-5C^(:j;3T. ^cr^iMTkL cDNAE^c*LTmmoj#g^#APR(D##T^e m/:, ^emfmoAmcDNACMT&^^m&^T-o- 3M,T^<(D^#$^#V^T, ^=Pf@(D^:i#cDNA(7) cDNA@B^J^±#cDNAX —7^—7 mTTn, 1) ±m#x-7<-7^#(Dm^, 2) 3) m^x-7<-7(D##. m±T^^L/:S/7xA0 (2) ±m#x-7<-7^#(DM^ a. x' “ 7 tu 50; 11 '> 7 x A CD ?M % *y%xAT^, cDNAEm#^&n%±2t!-'5/:&w:» (cLT^^o lEsi^xL/t^TCDMfiT&^Mi~ AATGprf-ct Zffltfr, 2) #^^^l/c33(DORF±(:^^6, DNASB7J -123- ^)im(Drn otlHiot, X7-(D#^€:ff 7 j:7 7^ - KXy 2 b. l^DBl#'/^fA©|^ ^^DBlIyXfAH DNA DataBank of Japan (^T%# : DDBJ) ^ h h©cDNAT m### (77 7-7a >## • &SK7J##) (DXtl • SIE • ^)o (D m^rnm ^77 7A(Dm*mf#^aTOm^T&6. 1) 777-7a777<^ (7yE^U0777h) (DO-M't-7 2) SXE7J7 T -f ;l/ (PASTA# 73 OD-H • t“7 3) 7“‘7 7 pf — 7 y h 7 x y 7 7“J1/ rParserJ CDSSS) 4) I"ParserJ \t, DDBJT'E# $ tlT lA 5 #ff7 5) &7—70HS 6) 7$/mm^## OKyoh^ -125- 0 1 y K^##0 -127- h Lt. 3) SKgKlWCx-^A-XICSfl&Bjifg&X*-Vy hTS#^?-^$^B6iiiigcLfc„ c:ne>o«S6c &MVrmmtzmi2iz*-t£v t. orfxxx -/3>yXrA, 7-j'^-X1t-fLCS73' #mf--5 7aL, -7a-x©fiiw$bbl, #©% ft T ©ft ©SB## £ X--3.7>S^T©#|gf5J:tf#IEe=J66^ Lfc. © 0RF77f— >3 >'>XtAOBS *©4@m©@g7u##&*ef 1) I»te3 H>tW1ff : Mten h* >©tig^MSrff 5. 2) t7-7#m : 7>A7K©7 5yK©E7iJ^B, ##&?#!-f-5. #1:. *©2#S ©X— 7A— X S. • Prosite • Pfam 3) 7>/w*©sftsef#i: 7>a-7»©75 y ke^b, -eft#** jaw©t*©s 4) 7>A7*©g$e«8Mi^#J : 7>A*7«©75 yg!E?iJ^6, RSI'N'JyJ^f SJ -f 5. cn6©«ff^$ttT-7^-xcsssn, c:nB©E?iJ -128- J55 cDt I A- DB 3 Search Svc-tem Menu - Netscape 3sE #* 3. Annotation Analysis Analyze Motif if-lAtl Add Prosite (PS) 08 Condition Motif }HIH mRAi F AM A H (Average hydrophobicity) Integrated Analysis P Value 1 E FO Clustering Result Quality P High (H) P Mic Very Low(VL) HTH A RAC FAMILY 1, F’ATTERN Bacterial regulRt: ry prot /yi) ZINC,FINGER C2H2; PATTERN Zinc finger, C2H2 type.doi ZINC.PROTEASE, PATTERN. Neutral zinc metallopeptidas-= 2FE2S_FERREDOXIN; PATTERN. 2Fe~2S ferredoxins, iron- Add Pfam (PF) DB Condition 4FE4S„FERREDOXIN; PATTERN. 4Fe~4S ferredoxins, iron- Domain |AhpC/TSA family AA_TRANSFER,CLASS.1; PATTERN Aminotransferases cl E Value [0 AA_TRANSFER,CLASS„2; PATTERN Aminotransferases cl _AddJ AA_TRANSFER_CLASS.5; PATTERN. Aminotransferases cl AA_TRNA_LIGASE.I, PATTERN. Aminoacyl-transfer RNA s AA_TRNA_LIG ASE.II.2, PATTERN. Ami noacyl-transfer RNA And / Or Condition No. A p No.B IF " A And B" Add | * A Or B* Add Integrated Analysis - Netscape EMEl Oz-OKF) mm Result of Integrated Analysis F DB Search & Sequence F Expression Profile r Mapping AA sequence code : a-bb / a~bbn a:0-0riginal / R-Revised(by "Check System") ,tf-7 bb:F0,F1 ,F2,R0,R1 ,R2(6 Frame by FrameMaker), FAn,RAn(ATGpr Forward/Reverse No.n),FX(Fixed) Seap^ffce RetrietW^Ad Score Sequence Motif Name ATGpr Psort Sosui ' Pros it.g Pfam 0-FA1 0-FA1I O-FO 0-F1 0-FA1 0-FA11 / 0-RA3 C-HEMBA1 000001 0-FA1 0-FX none 0-FA1 L pattern ^§2.599998%:nuclear 1 Domain > 0.810000 1 3.0RF7/f-y3>yXfA(l*lt5 130 (0i6). 1) ammaeux h 0 Swiss-prot 0 ft—V— K$?#T fit aIi L ft Swiss-prot 0 ft “ *7 — K1W ft -3 T > H ill ft -S Swiss-prot 0 X > h U — ft B# %0i#i^< >AftXft-^0AmcDNAE^J0X> h U-^-^^^ft^,o #Swss-prot0EC No.^?t/f fa^lftcEC No. ft Swiss-prot0 X > h U — ft## L, BiMO^lW >A^f- X 0A#cDNA@g7'J0X> h • Swiss-prot 0 ft S; ft ft X h ## m^L7cft#0ftftX Sft^^Swiss-prot0Xft h U-ft##L, B##0i%Wft ft A ftXft-ft 0£6cDNAE7lJ0Xft h U -ft-KSftft5o # Swiss-prot 0 ft fa^L/7cft%@ft#^0$)^) Swiss-prot 0 X ft h U — ft ## L» Bitift 0 ?6 ft ft 7N ft Xft^ft 0±#cDNA@g^J0Xft h U -ft^K^ftft6. #Owl0ftSftftX h ffltft f^^L^cft#0ftftX h ft^a tfOw!0Xft h U -ft##L, B##0i^VW ft/\ftXft “ ft 0 ft: HcDNAE7J <7)X > h #Owl0ft#@^B## j#%L/cft#@l:M#0&6Owl0Xft h U -ft##L, B##0^^ft ftAftXft- ft 0 ft 6 cDNAE 70 0 X ft h #mRNA(h(7)|p|—1±## fa£ LfcrnRHA^Xj-ft'S IrJ—ftftTX — M ft## L, # ^ T 6 ft >A^J“ X0ft6cDNAE7O 0x ft h u £o #mRNA^0^ft###T m^L^mRNA(:^ft-6^ft#0^ftft^T3- Mft##L, m^ft-^ftftAftX^- ft 0 ft 6cDNAE 70 0 X ft b U “ft-H^ftf £o ^-###^^ft### j: 0#6^6ft ft/\ftXx^ft0ft&cDNAE7iJ0Xft h U - 0-Kfti$cftU 2) M#k<;i/(7)m-# c e-mi, ^toi/-#m(7)^m#6a6miRi#(7)X3 7^e)#m$^^Mmix<;i/(7) mW1]fe\Z.-O^TmmtZ>o NCBI (National Center for Biotechnology Information) L /cblastxft fOffl ~T %> <, d ^ 0 o'^Oii'Tc ##ET0ftft ftAft%07a 7 Swiss-prot#0ft ftAftM0^-ft^- X (FASTAX;*- —VyH ^-0^^(hbT, ft^-xm0XfthU-##^. BM0fB4# 3 ft-fc >thxs> Identity 0^ft t, @ -132- >gi|l 169338 |sp|P2191 2|DHSB_HUMAN SUCCINATE DEHYDROGENASE (UBIQUINONE) IRON-SULFUR PROTQN PRECURSOR (IP) (IP SUBUNIT OF COMPLEX II) Length = 280 Score = 581 bits (1481), Expect = e-166 Identities = 278/281 (98%). Positives = 7tK)/281 (98%) Query 134 MWVALSLRf^PATTLGGACLC^SRGAQTAAATAPRIKKFAlYRWDPDKAGDKPHMQT 313 MAAWALSLRRRLPATTLGGACLQAS^GAQTAAATAPRIKKFAIYRWDPDKAGDKPHMQT Subject: 1 MAAVMALSLRRRLPAXTLGGACLQASRGAQTAAATAPRIKKFAIYRWDPDKAGDKPHMQT 60 P-V alue = logl O(Expectfi) I dentity ex.) logl0(e-166) Decision Tree: id <= 53 : | id <= 47 : | | pval <= -41.4202 : | | | pval <= -44.6383 : c (15.0) | | | pval > -44.6383 : | | | | id <= 44 : c (4.0) | | | | id > 44 : b (2.0) | | pval > -41.4202 : | | | pval <= -0.0409586 : c & 14.0/5.0) | | | pval > -0.0409586 : 0i 8. y :iu x'a -134- 3) ^ScDNAE^L RtXSwiss-protOX> h U—+ —7— — OCD^ScDNABB^HC^ B LT, t^U'J — &f&0j^t#f>nfcSwiss-prot©X > h V-, m^U'OV, &£V^£(DX.> h U-^^'ti'^-7-^-7- H era CIM % y f L, lx/<;i/^4e(D4:-7 - El9(DS^'f# 6 4lS = T&:b*> , ^rficDNABB^J HE — 7— -5 o CD 7^T'cf-5o 7c H-E-S^TEEn- HB:te, zh^E IDh, IDm, IDK NID04P ^)VX^ U T lA 5 „ # £ IDh |1| —14 iW < > NID11 IrI —14 ^ 5) ^AE## ^AE##"C11, mRNABB^iJ B:(D^^t4^^1-^E3- F cDNABB £0 -135- / \SPX> MJ 4=©*- r ^ScDNA 82?l| K-1 1 K-2\ K-3 K-4 K-n C-XXX< POO; A B C-XXX2 C : D i C D C-XXX< J(B> : D D A C-XXX< , D A C-XXX5 D C B C C-XXX6 D D C-XXX7 B C [ C \ C-XXX8 D D C C-XXX9 B \ i D C-XXX1 0 \ / B c ■' • ' ** V' •: * *• * "■ " • • 0 1 9.ific DNAlB^lJi ^r— V~ KOH# Ic / Is >= 90% Tzt^ytyV ^ >= 300bp Ic / Is < 50% Identity > 95% NIDdH-'ttfcU X Ic : zq>tZ>-yX^. Is : Query R?lJ£Subjectfi£?lj0*sU fiO> K?'JH 0 2 0. IrI—I4S*^1"5>E=-- K 3’ 5 nasq query sbject 02 1. ^gcDNASB^J (hmRNA&U^ EST(CO^T nast NF Bn . . hpH NS $ t h_2 2_G (20) PF $th_21_G (5) 4° $th 11 G $th 12 G nasq (5) (20) El 2 2 — K -137- • OwiESt-^X bmtir • Swiss-prot 41—9 — K 4^-9- M«k^#1x^71/a^a. • Swiss-prot EC No.###E EC No.tmfa U'Ol%Mtir4kfrt'?Z>o • Swiss-prot • Swiss-protES "X* ^ X h E^x^X H2:M#1x<;i/$:##^EaTa. • OwlESx^X h###E E#x4^X • f7^JH' MB! hy h L^^^cDna SH^J cdx > ^Ta. 023^^f \z, OT^a. (1) ##C43WT#mL^^E(D#^ (2) ##(D*@m#Sa&^#cDNAE9J(DX> h U (3) ^ScDNAE^J(DX> h U — M T a Swiss-prot ^ 7c ^ / ^3 j: ^ Owl (D # # Swiss-prot (C D T$> a o >X •^srapRX ^x^tc^tta^^Dv—v >x<> f/J'tDy —th—f O h77pby h (7) X > h U — ## Acc (X/h'J — ^0 'J >Xf4)> EC No. 0%EC No.(h4^—9 — M(7)M#^#^(7) U > X #) > DeffinitioiK Keyword OK 4^—9 —K ^K±b v EC No.^3i:r;+“9- H £ >bl;, M#k^7l/A, B (ab), C (ABC) 1C 43 O'a ^ Swiss-protX > h U “-ffc £:I^ thTT a o $ 6 (C , <7) S tk 7^ 6 > *€“ u >XTa. • ^EDy-X-fO h'y9°b'y h<7779-fX> H43 j;^M#lx^7l/## lx ^\)l/ (R-revel) b, 4r4l$r et#^ a^:)6(7)^-@/i9X'—Xo -138- Owlicilf 0 t D $/ — th — f 7 h © x > h u — Accx EC No.x Definition> Keyword by'fhy h©7v-f/> m24lzmmr / x-va LTx +-y-Fl:fgLMftf pJ|gT$>^o C(D=k3(:x ^yXfAlc^^m @B?m-e - E^TS^ScDNAge^lJOlWISM^^ # L Xio (3) a. < >/\9XT#m$#l^cDNA(Z)±&d3 6^6. a-c^x mA -5yu y fy -a-ij, *ist ^Sd^Jlf > F 7-^7 chWX^o #&x European Bioinformatics Institute (EBl) Ttl te/W Z? U y H 7 — t) — £rx“7 ^ — L x J T^t)^x EBI^NCBI©X-7e##76 C 61: j;oTx ##%0±©%##/\< 7*U y H v-73-i-^SUT^^dbEST-^dbSTS0gB^J1WW5:5j<6b^ ^ $ 5KX ^nt> (026)0 ±sb©xo©u (0 25x 026)x < >/\7%©@B7J##^^a##a u >7^#^ c a^oj#c^6o < >A^7XtCd3V^TE#L^cDNAOBB^JtC^ST^^fe#:±(7Mir fl1W#(DS^^x ^C1:U >/7T6^m##om^^B%aT%0 #l:x ^@##6L/Tx #m%0^ ^^x m^TZ'f >/\7x©x-7£j# ###©^±^0-5o $ bl:x LT^Xx v bf >y$a6<>AC7X^:#cDNA6:^m##^^^ormCT6o 027CT v H>^##r©^66©x-7^U >^©##^^"7o C fl 6 ©##(:: j;oTx ^:#cDNA(:^T & < o X-lf-A ^(DfmV^btHZ^LTx Smt~5^ScDNA£x Ty ¥>?<&&$& tmMffimtt&lZ 028CTvbf>y##(D##^m:A7jim@(De!ie^fo C(Dm@^6genernap -139- ~>r 15) WB §iiaWP 'if >cn»-x Integrated Annotation Analysis / Modify Query cDNA-DB -9-K A^J Search System Menu ECNoA* W Select leeclem CSP : Series-prel) SP Keyword ifetottA* SP EC No. SP Ortansm Taxocu SP Text f Or R-Le«el » A B 7'M-r A^ AND/OR Lofk«l Owety ECNo 0 2 3.EtET/T-'>3 8tX£#cDNA (P30G41 i-23460 P44411 P5697S (PI 8476 P21762 P36700 006B*r\P3S704 P35705 P48822 P52552 P80002 P91883 0 04120 0 77834 PI 8476 P21762 P26830 P30041 P34227 P48537 i 02 4.mHZ7 /-T-v3 jr-9-nc#g L/"cEEttf-7'> -140- mm a £ile Edit yiew Qo Qpnmumcatw bate esse Mapping Analysis Select Sequence Pand is RHMap SG3 (cR) r RHMap GB4 (cR) DB Sdect P dbSTS P dbEST P GenBankEST f~ GenBank mRNA P GenBank genomic f~ GenBank unfinished genomic Blast ______E Value Range fo . (To Method Sdect Sequence by Name Sequence Name jc-HEHBAiOOOOOl I* Sdect Sequence by Chromosome Number and Distance Chromosome Number 121 (SG3:0-1630cR GB4:0-266.94cR) Distance from top (cR) |l0290 o Map li$t< dbSTS,dbEST ^ST,genomic ) mm Otoe No. Sequence Name tor SG3 1 C-HEMBAl 000001 21 44.3-49.9 1308 P0.7 genomic AF064859. 1 5 polymorphism 13_ 2 C-HEMBA1000001 21 34.7-35.4 888 F genomicAP000044. 1 5 polymorphism 1 3 C-HEMBAl 000001 "21 34.7-35.4 888 F genomic JC604 59.1 5 polymorphism 1 4 C-HEMBAl 000001 21 34.7-35.4 891 PO.O genomic AP000044.1 5 polymorphism 1 5 C-HEMBA1000001 21 34.7-35.4 891 PO.O genomic JC60459.1 5 polymorphism 1 lamtHeT «hw ' 112250 Bone dysplasia with medullary fibrosarcoma 147660 Interferon, alpha, deficiency i 238300 Hyperglytanerraa, nonkitotic, type I ; 247640 Leukemia, acute lymphoblastic > K^( 02 8 -143- /itilcDNA© >J X h ^i28^ C^to &±#cDNAt:#LTkL (028). b. iieOyXfAi^^T, £#cDNAgB?iJ #, A#cDNAE^(D^m##a^^^C(b^Te6. cn&$UmLT, XvtfXX#^ T#^yc^#cDNAE^(D^m##<**^ XXXA&im^L/:. 028(:i-(D#f f. ^#cDNAE^J(:*LT^m##m&^^T6. (4) iWa-T-f <-X(Z)## ^±E%L/tStff'>XT-A^E^UTfiJfflT#^'>X7LA^MBU^ (029). 03O(: 0RF7/f-y 3 >> f'"57/<-X'tt-f l"I^'< SlbT / 7 — XgX, Tvkf>^7/T-y3>(D"e^l^^U:^LT##^m:$:j@^L"r, 7fl6^± T^fef^ficDNAga^J^U X h oJt^T'fe^o (5) f^^s/x^Ac ±#cDNA###(:MT5 7/ 7-X a X cc-rn, £t\ #6^i^±#cDNAga^j(D#m^ ##f L/c. ^6gB^J#(D^$^, A U y X XEX □- >£j 4000@B^J(:3^T^A^$§^^[g3i(:^T. gB^Jiooo#*#^ 63000#*# (c*(c^#L (FL)= it 3& (D #) H , ^#^-X<-XT&6 UniGene (7) ^ # mRNA (UniGene) > S H 2 tl tz H X V ^DNAE^u^tDfiEcDNA (KIAA) <£> # £ £ 7 L . Z.(D£o\Z, * X U X x X h TS 6 nfc±#cDNAte> £ il £ T CD 2 £2" £ & X D X x X h T# 6*l7;:£#cDNA 7. 4 emaatf^&omaRB (1) $§# M5$ -144- frfejj&tLTURLrzo ^n->©n>^^^©rajs ^$>o^^x 2^^AT2298Xn->(D^ SE^IJ^S^^TL, 3^*IttSt)201^n->jiJPU, ^H-2499^7 D-yOllr^Tl fto b. ^S^iacDNAf“^^“XyXfA(Dii ® IT, "T —^IiJ&LIIvX'tAx B :> X 7 A x ###3 j; Uf$a;#(Z)Mi A X7A^^^Lyc« f“^iy®lyXfA(5;l t^x Hz/UMTa&mmaWx evx caawm%i/x^A#BL;t. ffijT y f“'> a yyXfAiITti, DDBJ^O^HcDNAx — X SBffltiD'fr — XfEfi£5: ##7X7AALTx 0RF7/7-7g>7X7Ax 7-X<-X It — 7 (C S < SlbT yf —ya yyXtA, X y H > X 7 / 7 — v 3 yyXtA$:l^ U 7tl 6 (D^-o ###7 7X 7 A 1/yho © LTx ±iax v ^>^7/7-7a >XX7A(7)^m(:x ^m7-X<-X(D##$:m^^t)1t6 C6 la-at, ^Sx^iMyXrA^iiLfco ® 5,OOOX n-XD^r^cDNA^ttf £^£-x — x^ — X £ IT, _LEjW#r-T-X (2) ^#^lx ^^Utc^ScDNABB^iJ-tWIgAX LTx c M##x %#y/A##g;&x $6cm^Lyh7-x<-x7X7A^M%LTv^ W. ^ tl 'b o 7. 5 am## [1] Wadman, M., Human genome deadline cut by two years, Nature, 395, 207, 1998. [2] Maruyama, K., Sugano, S., Oligo-capping: a simple method to replace the cap structure of eukaryotic mRNAs with oligoribonucleotides, Gene, 138, 171, 1994. [3] Ota, T., et al., "Full-Length cDNA Project toward a High Throughput Functional Analysis", Microbial & Comparative Genomics, 2, 204, 1997. [4] Nishikawa, T., et al., Database and Analysis System of cDNA Clones obtained from Full- length enriched cDNA Library, Genome Mapping, Sequencing & Biology, 160, 1998. -146- m##^cDNA7-7<-X>X7A0##&^7L^. *7o>%7 /\7XH/-7v hE^J##>X7A#*^a#(C, cDNAO#m7#j&fT3^^)(7)& a. £ScDNA£>BE?IJ&5£ 96*7 7 &^711/7 4r t U -DNA> —7 — 01igo> IE 3\ CO 7iz 7 7 )V \Z (XSequencher^r ffi 17c 7° 7 'f ^ — 7 ^ — 7" > 7>£ ^ ^ SBE ^0 7D-XD3>7^##0^# fo'Dfctf, Lfro in 722987 n — y(Dlk sie^j^^^^7i, 3n \z\%mniou7 u-ymnh, 3-^24997 o->o##f #557 u tC o b. 7^#^C-acDNA7-7^-X>X7A(T)## (D 7%#7 — 7 -7 — L T, 7^# 7 — 7 > — 7 # #>X7A, m#43j;U:^m ffvx i>#DB##(D/c&!)(Dg #77 7->3>>X7AE:LTkL DDBJ^(D±#cDNA7-7##m (1) & L (2) 7:# ^ L (3) (4) (7>X%^#) 0 *E^|fr^2000¥2^ 19 0 m77##2200#^^M (/W777 7o>-me, *A#co#M:{Um±) 147- #58* £ficDNA®*SE9J3fc$©BFa: m a6tt 0 tL v -r x v 7 . •> t. x a x s. i *m t h^V AtoSEtt, 1!t5FMiB$T2003iF£BSibT:ie® BtlTIAS. eETttBS Sitfifl Li 0 , $i5r3:Ttt;3:tA;M$S®ST£ig!§1"5>73:.h*X fcf- H SriSATHS. C© toSli/H trJ n->*-®#Sf*Sllgt t>M«f -5*1. y / A#i@b*ef t, * iDNAga?ij©BA#tltsisntnj SNPs^flfflf^fcAICfc, % A: 73: gBt'l 7)1^51:73-5. i©is6X J A ®j*£^7)tyy Agfi?ii$ifei66fc6©5> hv-9 bUTasstirn-e.. $i. ctie. *6f©#m&B9B*4:t"%Abt:j:0, #;#©#*, B*S®M%, SBCttSrS #®im*% #*#A730#-5%©T&^. C®»,S^6/t-f ^-f-y y a U y X %#%§T*#®%6#cDNA50009 D —>®S$gE?i]$:gt$t"5>ZcS)C, (#) Hi MftBu (WBu-tMiyXy^fAX, 3iBS(#0. B ##*« (#) ®4tt^ 6/«£ 5 E%#7$J£t8$LTAU y X XE^fffSW© t b SSrftcDNAtogBtiJgWSX X - h SMi 7C o *S«ffl%Cj3tj'-$»(#) Hi-9-i x>X AXiAX® BSli, A >J y XXEtE0T7mi Li3g±#cDNA©y< X'y U - ® ft, 500X n->®ifi*gB?ij$ »S"T5 ^ b T$>5„ — #*%DNA®##r*&b LT, -> a y b tf > S A y I1 X- —i * — X > X'te**® B tl TiA-5, W#}1. ttRMfti&IM X'®DNAC, f LTftttt, gckbp® % ® KiS L T V> 5. iti'iibt'ScDNA® ¥ii31M X*tt2.5 kbpT$>5 C tfri, (Sffiffii; LTX’y'l'"?- X * — * > XSSrffl U5 C b C Li. A ©iSti, $t*cDNA©*#5008$8#©gB?iJ Sftgl, RgE?iJT*Ein,iXy1'v-T*®5Kl500tt*®E?iJ$E*- S*$69ICg*fE ®*#*f*«t5bO-i?^5. ^OffiyP-JBlCSt. y-i»I>-»y4f- x7-fe>7';v^Xy'r7-»n-^Xy'fx-^s& ®lSI5fflli, A*7kXy -149- Xff%0rT-«6i©SBSStj-fc. *1*. 5fl 4="mcSBi[Eg©X n->Ett£$l:hW#r£X 5'-h3-a-fc. aoro»D->tl8fl. lOfl ICE#dn/t„ MteMrobROffl 1$. x> f F- h®OTS^5‘Ra:3’sS«roSB^E?il (FOORtfROO) ®MS®fc**fe755&< , 8fl =fc V $□ &bh)sbfz. BllcSt" =k 5 IC3fl ST" 1:5007 n — >©($#? ISSS-B fiS Lfc. f$8e-ett:®5007 D->©SffS$Ry:y7'f 8. 2 **##%& (1) jg^ftcDNAX D — >5) 500H®cDNA©6ife*5 7 D — >|$S:glteSt"= NT-2* JET 1/ f J -i >###©# f*Tig»«j©SJP (2) T->f f- F©8@M xFfF-FicLTfJSB^rofxXS KSfflLfc. 7,7X3ftt, LLF® 4; 5 letttti LTJ 61:7# n-x*S»*jl:J: 0 7" >-9-- h®SS»eai$U7k = (a) 7/57. 3 F'ttffi f 7X5 H®#aieltT,7X5 Fttiti# (PI-1002 : 7 7f7K : SrflH* fc. iwseii, ws•ss• ‘f’fnfciimae •««• if. *JMB©B&tt»£3ml©LB«il6l:*BL, leTlSStraSj: 5#*f 7 Ff 5i, *JBBIi»i'#gl:i0liSfftT7JW U • SDSffi CiOM^ns. OUT-, 7xy-;H|©|it*Ci 0*#B®DNA^a6*75:£©7F *#7)5##LTm75^TfxX= F$7"@-U±#755#6f&. *%#, f 7X 5 F'Bf 7 f □ /v-;n: Jit) «*«•!*;* ft. i*y-jHe «fcOjfcgsns. x7y-JF£*3SliSS l fc&. TE«»«ief7X 5 F*5»«LttmxS7i5^Tf •&. : 5til6tt7 7 S ■y&roRBf x hSSl't. ft 43, fxX5 F* ©##!$, -20t:Tff o fc. (b) MSRlX-f >-9— F©i6Sfi®tl6 SBKL-fcf 7X5 HOM*Rt7'f >-9— F©g®S$r*VD"f"7U >7# f OJ§t«lK* (•Rf "felH92PlM : /Ht7 7 FttK) T7# D-xyjH»# ITM L fc. fxxs Fii^xx-&A&5k±m*A7)5?i%5,ooom*m#T*^ca7)7 6 7yfD- XfR*gS-0.8%i Lft. f 7 X 5 F* t# ICDNAtH X 7 — * — £ -150- b -X A b A A A <=^ H b -A bi □ -X 1 rx ate 4g A IK m 88 ,U ^ A 1J^<—□£ 1 5 0 0 ^D —>(7)6* MAM MAI t humb « 1 1 NT2RM2 nt -2*. y y NT2RM4 m± 3 3 NT2RP2 NT-2*, y > >### 9 2 NT2RP3 m± 8 5 NT2RP4 m± 8 9 NT2RP5 m± 9 OVARC1 t 5 0 PLACE1 t 8 5 THYROl t t'EPttBtt*® 4 8 VESEN1 t h jflvgrtt&meE® t-7 -f v v -tg*«)!s 1 Y79AA1 h h retinoblastoma InSl^tBSS Y79 6 500 TSIIT dBDT 03 -152- (3) 7—7X7777 (a) 7-7X7777 tt, Dye-terminator# £SEffl L/to ^(CPE-ABI%#®Big Dye Terminator Cycle Sequencing Kit (J7TBDT b B§ 7) L- fz0 ® #, dGTP Big Dye Terminator Cycle Sequencing Kit (J^TdBDT b fl§ 7) ^ 7 7 7 -V A GO Thermo Saquenase II Terminator Cycle Sequencing Kit (J7TTSIIT b Pl§*7) RC^DYEnamic ET terminator cycle Sequencing Kit (17TEST b "7) 5: V h |E3(C7jKt*o BDTii> Poly T/A£1£ tf BEIX7—7*7 X$E Mtftltl&Z. htffo&o CtKC^LTTSIITUx |IJE^SBOTH til Poly T/A cnmwz. LT®5mme##®#%^^B (bdt: TaqDNApolymearse, TSIIT: Sequenase) & 7h , W ^ 'y h # (CGC-rich7t#%Tr It, cm:#LT:kL dGBDT^M^bfCo ESTtt, TSIIT® S&%>Energy-Transfer 1: <7 0 A &tzb(D7\ BDTT^Stfc+|->70;i/ch|^^iinFf2iC^^^'>-XX>+h— \Z7 7° 7 7 7 WtftteZlT — MB*. &t5pmol/l b L tz. o 7> *C IrI 7P1/—b 7 )V'*)'7 7 7— (GenAmp PCR system 9600M : PE-ABltfc®0 IIt'>M/T, i^#)70n 7*7 A LfclM 7 ll/RS® £ U fc = 7i7Sephadex G-50 ( 7 7 11/ 7 y7ftS) MultiScreen-96- well-Filtration Plate ($ U #7%#) (CR22FM : 8 31%#%#) T950Xg,5^®^ff:T^6LZho 7 ® T# S nfcSjM2:^i>X/3# Iz — 7 (CE1DM : 8 37%#%#) 1:j;0 H777^7°L7^0 (b) 7 — 7 X 7+1 — 7-7 X7t)-—It, ^ ^ 7 7 7° ® 377DNA-Sequencer (PE-ABI%# : tf 7 U-77 7°®3700DNA-Sequencer (PE-ABI%# : tiCE^fSTL) 57a 'y H^7#T#e,#Zc1t77 0-7 fiJffiL7c0 377® 7-7X77^^11/It, 48cm&m®2##7X#®fWH:4%7 7 U Jl/7 ^ (PE-ABiti:® 7°n h o;H;:fi£l>IJIE) £}fALTf7S£Lfc= &&, 7 —7# It ^1:64 -153- 2 ( 1) Big Dye Terminator SJfrfflfiE KfS&ft :r > 7° 1/ — h 2.5 fxl D : 96t: 10 # 7°7<7- 0.5 nl A : 50t: 5 # BDT 1.0 Hi E : 60t: 4 ft ### 1.0 h ! ME* 5.0 Hi 2 ( 2 ) Thermo Sequnase II Terminator KfSEfiK KfS^fr 7 7 7° 1/ ” b 3.0 h ! H : 96t: 1 ft 7°7<7— 2.0 Hi D : 96°C 30 # TSIIT 1.0 Hi A : 45t: 15 # ME* 5.0 Hi E : 60°C 4 ft 3 oii-<^;i/ H : Hold D : Denature A : Annealing E : Extension #s 1) 7*77 (DSSS 19^^2 2 bp 1 3 ‘dim | 5 (kcal/mol).£lT I Aldim | 5 (kcal/mol)J7.T Hairpin 5 or^T 3)Tm{6 5 6fab 64V 4)GC^* 4 Ofab 60% K2.3,4:f&ftlffi# 04 -154- (4) 7 ^ 7 i! > 7 JHi > Sequencher (Gene Codes Corporation,Inc : B AL V 7 Mt: 16 %) V 7 h ^©^77 h^-AilT^PCyXjA (MAC-PMG3> 21-CRT,MO) 79 nix%X o^v^co^A^ x^7> aLT, fm^&awaT''^ 7-(DRmm-6: L/z. 7 7-r;i/i^^E^ESU7cSB^jx-^^7-t>y;i/r^o mmtr (5) ^7<7-mm- '>-7X>'>>^7°7-r 7“ Ote, Tmffi> 7° 7 < 7 — ^ (DT& #m&%7GC%^ ^(D#%T T"7< "7-(D2^:#m^y7< dimmer & Hairpin# igT & 0, PCR£S£EWT5o Ctl6(D# ig£Sfi£T5^S^ te> dimmer & 17hairpin (D # ig (D ^ % T # % $ tl T V ^ 6. Dimmer 3 ’ ^ $!]2$l £X iit $ $1 jg 5: $ T9 7 7 (D 1& Stti > Oligo (Natinal Bioscience,Inc. ; $ @ jg 16 jE ) V 7 h L /to ■€-CD7°7 '7 h^ — A £ l/t^PCyXrA (MAC-PMG3> 21-CRT,MO) f8 L7c. M 7 7Mt Uftmmfr6707-i’?~-&w$:mn?z>t&m&)\z±.mmm$:m%'Tz>o fit ^3(:#^"T6 t)(D$r#%L7h. (6) 7°7< 7 — 77^7-ft SSE^J^m^X-;i/HTB5:H-SJ||-th-HX (%) EL7co s. 3 (1) 7-^(D##^&(D## (a) T9 7 7— 7^* — 7 cDifl^Tf mcmtf (iXfiF{iij *W^T^>X(C7Ff j: 7 7^7°7<7 —##7 b°— H (D^(AA&2r# -155- StSCifcLfc. (b) xx77-#@6& X9 7x-©###t7MtcLT. tf'7>tS:8tx-X 2 6(:XD->#©##&lE#f *4tcS-f >n->«, ^x x-6, *sb. cDNAigs5& i; # s 2 n a ± ? c u t. (c) f-J'otasa 05lc»tfr*g*©a®«S; (ffiil'MI-y XXX *-■?••)/S) &[X###©f-X'gm* s$)K-r. x □->&, cdnaexs , »&b, xt©##A)6#a2 nto^ct, #5»'e.3 h-&a-x M,t. w t, xx7 -7-©EHRy:ss3S©T -X; (2) 500X D —>©cDNA&U=#MX97 X —©## 500X O — X©cDNA©## 6Wt, SflrlCttfflUitX^x- —©!NH4£ X57T-wa*©x-x&sic#if ut„ (a) #WSS*IC#-3*<500X □ — >©#tt cDNA©##ft tGC%lktf'( >•*)■— b ©poly A©^^ £ 0 6 CSt. £ fc, SbtC^tl f fl©T*l*:#:&TX9 7 XX U cDNA©ffi$Sli, */Mi/^638bp, # ±@#6178bp?f f%2438bpT&3 it. &jo, 7 XX — b ©f %@H2462bpT&o t. $£:, GC % It - *X30% , #*69%T¥i§46%T$>o 7t. 7 >X — b ©polyAtii, #/b8bp, # ±125bpT?#24bpT&o t. 97X9U —XD — > $ lb & WMAMA1. NT2RM2, VESEN1 & XS Y79AA1 £■ Kt < t, cDNAtottX SRtA GC*l:^('t77^7 lJ-#OWt^l5ftt. NT2, THYROl, PLACE1 f Lt OVARCl©STcDNA©¥i9S*ft7)t@ < , 2 6 ICNT2iE8ST(i 1/9^ / 7 cDNA©¥J9$*ft*tft< 7j;5>f*[p)lc$. 0 , $ Scm##©%##M^&<%6 (57 X 9 U-6(:#WTW*©##^A#Ai&) lcaoT¥i9SSft^*i8inLTV>-5. ORE Iz^y 7 >8©SSCJ:0#@MflSlc#Hb-r-5)7c6?)lcJ;0* S&X >AX»7'o*£6j$;2;hTW-5,b3|'X 6ft3„ $7c, ¥«3GC%ti, cDNA©?^#% ftt(JjSCOVARCl*25 R%AS* * < . OWT'THROl. 14X2 R tX PLACE 1 © (E C X 2 < & o T Wit. NT2MflS"Cli 1/^77 >#M##CGC%*i(@i!mN|p]IC&6. -156- 4-a, 7'f7'7'J-«©#*at*t (b) SEffly^-fv-CRtt 500 7 n->oMl;Sffl Lfc^7-f7- (BI1-L;t77 7'T-Tt><#1ff (2E1TS/3 fr-otzbc DtESU) (35773# T. 17 P — > S 7t 0 ¥0111 7 7 7 T — Sr&g £ b t, Ztl ^77^7 © 3kF (5: fiS 7 $> -6dimmer^t/hairpin^iaSfi6(7)fs^ ( I 3 ’dim I , I Aldim I % ^Hairpin). Tm«&.U:&$Bj$£ia 7 IC5S"T. B t hj E © 7 7 7 7-(3*312^7 Eg# 0ti®SSSl/TO5i!!, «)9 7 5 85 # E W (2 <£ 9 ■'?> tl £ f# 73 ’dim,AldimS !>*GC % $: E flfcE'TEH'LZj;ttnif/j;6Zj:/6'o Ttfcro^'-en-e'tlZdZ, 1969&7378#& 0 , i'fftfc^7'f'7-tLTtEBT8Tir>fccti)'6, Eg#©#(2*7#3: 9lt < iftSiiWofc, (3) BEX n —> D X h 5007 n - >£(5t A/£*7777-7;t-*>7ffiTF.S!)8&<*#rTSfc7)b SffiOB b©RifMBE7 O —7*^36#it. £©*(:: (3, 77 7 7 —7 7 — =£ ZT'ftTttStfrT Jfya*; h # >#£ tB^bti-TStff bit fc © t> $> 9 , 77 7 7-7 * -* >7S© ffi iO/li^EfFito'SS® 9 Cftofc. *St7D->©«ttt-e©SHRtXW*®*7tcSf. $fc, 77 77 9 -#©B*7 □ — >©ffiig¥TSCSt. BEXO — >©tti8S(3, ¥i§ 0.072 ( = 36/500) T£> 7 □ > g(©SV‘NTRM4. NTRP2. NT2RP3- NT2RP4, OVARC1, PLACElS. 7 THTROlTitiS^Srftlitf-5 t, NTP2©=fc 7 tC BE 7 □ - > b fifr o 7t 7 7 7 7 U — t>a6£75*. NT2RP37>0VARC1© *H©S t)VJxS © 11, $/t7 n->S©BBS$6S6<)lcSaLTUti: Wci6, 777 7 9-#T©BEX D->©#mSx.6C t^TS^oit. fct, iR7o-> ®iiSJ£i»i:85 r ££ttt3bt. (4) ##fB#K©#»3 «0iU>7D->Ttt, EM-bTt*mTS%^ot:777T-. BD56ti7j:i£S 7-76h*^ fc ©*1#< ft5#(n]C$o it. £fc. 777T-=5;t9 WETS i5ifi#S&:fcBEXP->T(3fi #f77 7T-*/#m77 7 7-») K.0577 7 V-3t 9 WETSSSSSICOIaT# 13 bit, 08K:777t-©*| s@* 9&tf777T-3;t9©WEttl£» (= (7 >7- h©E *») Z (SI7777-S-2)) ©J3%&4f7. 0^2K±. S*,, 3t5tl(il:««777 T-g:©2fg«±£Eff US(3flB» Sai'pfctifflli. 327 D ->T$>oit„ -75. 777 T-3/t 0 ©WEffi$Scli¥^366bpT, 300bPyT(:®St" & £RW# X o->(342#T&oit. fCT. -etl-e'ft©X n->£ 9 X h7y7LTS6 ICGC%^'60J7± (307 □->) © U X h ttfclSbT, S«7 -5 7 □ - > » * M ^ it. f ©i(S*S:®9lC*-r. BE 7 D->©32# (377 7 T-ijig $ 9 7>t2.b(-h© t> ©(2, 580© -157- Kit* T.lshfoashi 9d->8: NT2RP4001207 **No S0001 : PME18SFL3 xv^u-h-y-'fXiis? 1 **$ > 0 »S9f>1 4649 (Max 5.5 kb) f 9X7^7-* (TtK) 9 Input No. > 2 Contig size 4395 cDNAit4’X:*5(nt) M7X7<7-* F* 9 Insert It'fX 4375 4358 HTJy'f?-* R« 9 Kit? fia<0^7-f7 tt?E*l 7 -f 7 - 0 tttt fcMH; 990620 -t** (YES:1NO:( (X7-f7-No.) e m Size Position 3’dim< Alldim Hairf Tm (A+l (G+C) Memo S0001F00 1 S0001F01 S0001F01 CAGGGCTCATCACCAACTTC BDT 20 453 0 3.1 0 62 45 55 K7H: 990831 1 SOOOIFOIr S0001R09 ATCTGGTTTATTGGTTCATTG BDT 21 472 1.9 1.9 0 56 67 33 S0001R00 1 S0001R01 S0001R01 CACCCCT GCACT GAGGAAC BDT 19 431 1.9 6.9 32 62 37 63 HighAWi=6.9 ilKttiS 1 ioo 1 SOOOIROIr S0001F09 TTACCCTTCTCAGGTTCCTC BDT 20 443 0 4.7 31 60 50 50 htMX(kb) 4649 S0001F01 1 S0001F02 S0001F02 CTTTTG GACTTCTTG GATCAG BDT 21 519 1.6 4.7 0 60 57 43 f»K:tX7<7-KK 18 1 S0001F02r S0001R08 ATCCAAGAAGTCCAAAAGCAC BDT 21 516 1.3 3.1 0 60 57 43 K:t^7<7-R 18 S0001R01 1 S0001R02 S0001R02 CTTGGTAATGTGTATTTGTCAC BDT 22 340 3.2 34 1 60 64 36 Note:(-b7UA17^IBA) 1 S0001R02r S0001F08 GTGACAAATACACATTACCAAG BDT 22 340 0 34 1 60 64 36 S0001F02 1 S0001F03 S0001F03 GATGCTGTGTATAATGCTGTTG BDT 22 496 0 3.5 0 62 59 41 (ex. No poly A 1 S0001F03r S0001R07 CAACAGCATTATACACAGCAT C BDT 22 496 0 3.5 0 62 59 41 Irr.Sjunc.....) S0001R02 1 S0001R03 S0001R03 GGCAGGCTGGGTTACTCAG BDT 19 420 3.5 3.5 9 62 37 63 1 S0001R03r S0001F07 CTGCCGTTTGTATGTGAGAG BDT 20 405 1.6 3.6 0 60 50 50 @KA*tl S0001F03 1 S0001F04 S0001F04 AAAGAGTCAACATGCAGATAC BDT 21 374 1.3 69 0 58 62 38 HighAldi=6.9 (nawHtmicAa 1 S0001F04r S0001R06 TATGGTCGTATCTGCATGTTG BDT 21 381 1.9 6.9 1 60 57 43 HigtiALdi=6.9 (2) K7X7'f7-*ktt, S0001R03 1 S0001R04 S0001R04 TTCAGGCTGGGTAATGTTGTC BDT 21 433 0 35 0 62 52 48 %0**2:LTf#IKAt7 1 S0001R04r S0001F06 TGGGTTGATCGAATGGACAAC BDT 21 448 5.1 68 42 62 52 48 HighAldi=6 8 (3) ^6^7^7-17)** S0001F04 1 S0001F05 S0001F05 ACATACGCAGTAGGTCTATG BDT 20 379 44 4.4 24 58 55 45 ffi»(T)X7'l'7-A't,(7)-b7U 1 S0001F05r S0001R05 AGACCTACTGCGTATGTCTG BDT 20 376 0 4.5 25 60 50 50 7-5-^3-tfLT. #4 S0001R04 1 S0001R05 S0001R05 l^-Xhtt.WSLT 1 SOOOIROSr S0001F05 #rLl'i<7>£AtJ S0001F05 1 S0001F06 S0001F06 (4) M"*^y2*J6l±<0 1 S0001F06r S0001R04 X7>f7-(7)»ft. S0001R05 1 S0001R06 S0001R06 «*£mtr 1 S0001 R06r S0001F04 AtJ„**7BMH*«IE S0001F06 1 S0001F07 S0001F07 1 S0001F07r S0001R03 S0001R06 1 S0001R07 S0001R07 1 S0O0IRO7r S0001F03 S0001F07 1 S0001F08 S0001F08 1 SOOOIFOSr 0 S0001R07 1 S0001R08 S0001R08 1 SOOOIROSr 0 0 1 0 0 1 0 0 0 1 0 0 1 0 0 0 1 0 0 1 0 0 0 1 0 0 1 0 0 1 >> NG-Primer IU Date I Putative primer! Subsequence riPrimeT sequence ISCt^jsize I Position 13 dim j Alldiml HanlTm l(A+ll(G+C)lMe S0001R01 AGGAACCTGAGAAGGGTAAG 20 444 1.6 4.7 31 60 50 50 85 stteaa wmHo. &&&>% 87B #&(nt) Note Insert(nt) S0001 100 NT2RP4001207 PME18SFL3 990831 4358 4375 4649 S0002 100 NT2RP4001210 PME18SFL3 990831 2940 2958 3007 S0003 100 NT2RP4001213 PME18SFL3 000131 2732 2749 2904 S0004 100 NT2RP4001219 PME18SFL3 990831 2761 2789 2803 S0005 100 NT2RP4001228 PME18SFL3 991029 2678 2697 2904 S0006 100 NT2RP4001235 PME18SFL3 991029 2603 2625 2704 S0007 100 NT2RP4001256 PME18SFL3 991029 2617 2635 2607 S0008 100 NT2RP4001260 PME18SFL3 991029 2899 2911 3112 S0009 100 NT2RP4001274 PME18SFL3 990831 4384 4404 4782 S0010 100 NT2RP4001276 PME18SFL3 990831 2491 2518 2416 soon 100 NT2RP4001313 PME18SFL3 990930 2646 2669 2704 S0012 100 NT2RP4001315 PME18SFL3 990831 3303 3336 3549 S0013 100 NT2RP4001336 PME18SFL3 991227 2019 No-HelixROO 2088 2233 S0014 100 NT2RP4001339 PME18SFL3 990831 2715 2732 2904 S0015 100 NT2RP4001343 PME18SFL3 991029 2647 NotHelixROO 2714 2904 S0016 100 NT2RP4001345 PME18SFL3 991029 2681 2721 2607 S0017 100 NT2RP4001351 PME18SFL3 990831 3558 3575 3549 S0018 100 NT2RP4001353 PME18SFL3 991029 2922 2938 2803 S0019 100 NT2RP4001372 PME18SFL3 991029 2308 2324 2233 S0020 100 NT2RP4001373 PME18SFL3 990831 2913 2928 2904 %f5Y7- S0001 ££K?lJiMrXh 05 o ^□vx^h^wyu SQQQ1 FIN------jsnofti C-NT2RP4001207.seq Contig[0001] V J SOOOl CHR C-NT2RP4001207.fasta > t*xk 7t>ou SOOOl TXT J. E95 "T—^##7^—T'Vh 200 X 1 D 501- 1001- 1501- 2001- 2501- 3001- 3501- 4001- 4501- 5001- 5501- 6000- 1000 1500 2000 2500 3000 3500 4000 4500 5000 5500 6000 6500 SSS(bp) GC% me GC%^U^ X I D polyACOi^SS(bp) #6 &%7'(?:7,J-&(DcDNARlS'f>+t— h (bp) cDNA 0-9 —b polv A AT GC £i* 500 2438 2462 24 54 46 MAMMA1 1 1714 1746 32 51 49 NT2 309 2623 2647 24 54 46 # SR. NT2RM2 1 2956 2974 18 61 39 NT2RM4 33 2260 2282 22 56 44 NT2RP2 92 2400 2425 25 53 47 NT2RP3 85 2693 2714 21 54 46 NT2RP4 89 2846 2874 28 54 46 NT2RP5 9 3314 3339 25 52 48 OVARC1 50 1964 1993 29 51 49 PLACE1 85 2044 2066 22 55 45 THYR01 48 2433 2454 21 53 47 Y79AA1 6 2510 2527 17 49 51 VESEN1 1 3273 3290 17 45 55 -161- 3 dim | (kcal/mol) TmlB GC% | A1 dim | (kcal/mol) AT%, GC% I 12)7 1-5 6-10 11- 21- 26- 31- 36- 41- 46- Hairpin (°C) -162- g 7 1 0 V No tit 1 S0028 NT2RP4001447 F02,F04,R02,3,4')AB^A(N#L) fales-priming siteA^lx 2 S0049 NT2RP4001644 fales-priming siteA^lx ##%Primeri^RR 3 S0065 NT2RP4001849 F01,F03\R01,R03&ff£*A(N£L) fales-priming siteA*£lx #^kPrimerf$RR 4 S0084 NT2RP4002083 F06\7’,R06”&fi^A(N£L) fales-priming siteA*£Ux #^kPrimerf$RK 5 S0096 NT2RP5003512 14SI»:11 6 S0100 OVARC1000001 Fm,2,R01*B?&(N$L) fales-priming siteA*£lx #^(Primer{$RK 7 SO110 OVARC1000109 F00 >6BE£ ■r>3#’U—KEn1 8 SO112 OVARC 1000114 F04,4'.F05’;6B^6(N#L) fales-priming siteA*£lx £f&PrimerfEfi£ 9 S0128 OVARC 1000408 F00 &BE£ f^^L-hEo p—> 10 S0132 OVARC 1000440 F02,RC2'^B?A(N$L) fales-priming siteA*£lx SHtPrimer^fijt 11 S0160 PLACE1006037 F02.F03,R01,R03’;&B:FA(N£L) fales-priming siteA^Lx ##Primer{$R% 12 S0174 PLACE 1006248 R01,F02;AB^A(N$L) fales-priming siteA^Lx Primer 13 S0180 PLACE1006414 F01,1'.F03;A#^A(N#L) fales-priming siteA^Lx ##&Primer{$R% 14 S0215 PLACE1007112 F02,3,4,R03,4,5’;j6B^M(N^L) fales-priming siteA^^Lx ##%Primerf$!$ 15 S0225 NT2RP3000439 F02,3.5,R02.3.5';AB^A(N$L) fales-priming siteA^0Lx ##%Primerf$Rt 16 S0228 NT2RP3000487 F1,R05,6;AB^A(N^L) fales-priming siteA*£lx ^%Primer{$RK 17 S0229 NT2RP3000526 F03,F04;6^^a(N^U) fales-priming siteA^Lx #^%PrimeH$R)c 18 S0237 NT2RP3000603 F01,F02’;6^^6(N^L) GC-rich,fales-priming siteA^Lx £i$tPrimerf£fdt 19 S0245 NT2RP3000739 R00:Vectorok>;j£ff2Ec3 20 S0249 NT2RP3000826 R02.R03AR05';AB^A(N#L) fales-priming siteA^L' ^H%Primerf#^c 21 S0261 NT2RP3004670 F01;6BSS(N£l').®jX-ti"f GC-richT'iSf+ilig/K ^AP^L Shotogun 22 S0354 PLACE1010761 F01,R03,R04’;£ff2*A(N£L) fales-priming siteA^Lx ##%Primerf#R)c 23 S0359 PLACE1010833 F01,F03,R01’;6ff^A(N£L) fales-priming siteA^Lx £8fcPrimerfEfi£ 24 S0399 NT2RM4002504 R00:ployTflti&;&ff2E£ Singlclony$)A&L> Shotogun 25 S0408 NT2RP3000418 R03:plovTffi&M£E£ Singlclony^tiA&L,. Shotogun 26 S0412 NT2RP3000512 F04':G-richHlST;j6BSTOP BDT,TS$g&L= dBDT-Comp. dBDT-G^60°C(7)SI^^ 27 S0417 NT2RP3000628 F02:GC-rich$B#-e#BSTOP BDTTSS&A&U dBDT-Comp. dBDT-G^60oC(7)J5JS& 28 S0429 NT2RP3001383 2230-N)NU E5i]ttS? TS&l/dBDT&i:<7)j&j££ 29 S0447 THYR01000092 F03,R03.4;j$ff^A(N£L) fales-priming siteA^Ux #-#kPrimer^1%E 30 S0459 THYR01000625 R00-R0lRg##?KM fales-priming siteTSSIt^W Shotogun 31 S0479 THYR01001426 R04,;£B:FA(N£L) fales-priming siteA^Lx ##%Primeri$f%K 32 S0481 THYR01001480 R01-T-rich#%±#-C^BStop TS^VdBDT^ (5) Mm2u-XDffitirM MWt2 u->MZTIBH^-To (a) 7 □ — >£ : OVARClOOOOOl A 7 D — > 11, false-primming site0 X — 7 T$> Z> o 010-llC^f «t 7 KlSOlOORO4’0g& m*&^0K:f6j;b£>t\ UthjE^&tUyfCo C0maML^(hC6, Vector 1:# momw&zrcfttftfryfzo xzx\ ±mzmr^vectoret m l /t -m 0 & 0 (b) 2 □ — >£ : THYR01001480 *7 □ — >!!> ElO-20$S-T —7 7 IdS0481R01 0SE^fj0T-richE $S ■PWanfc^r-XTJbS. ll£#mT5 Z.b.tfX%t£frytZo XZX\ 2,yXX-&T-nch%U&xmmLXi/-2-> |gl:^l-j:7 l:m#0SL^l$r#<'#-C6:^T#^. (c) 7 □ — > £ : NT2RP3000418 *7 n->ll, ®io-3£^f < V-^EIf-T^T, '> 3 '7 h #>&(;:<£ D #e>tl/z+h7#7 □->0E^iJ^rfiJffl UZco # (d) 7 □—£ : NT2RM4002504 A 7 □ — >!!, 12110-4 7 Upoly T/A0lu^'r^S^S7l V MffiX t Ufr y tzX -164- 450 1.0-1.4 1.6-1.! 2.0-2.4 2.6-2.! 3.0-3.4 3.5-3.! 4.0-4.4 200 250 300 350 400 450 500 ms & 2>:32 300> :42 >60:30 G99 -165- 8 V p — y V 7s h A B C D E 2.0—2.4 2.5-2.9 3.0-3.4 3.5-39 4.0-4.4 ^7 a —^ NT2RM4002504 MAMMA1002094 NT2RP3000418 OVARC1000001 NT2RP3000526 NT2RP3000439 NT2RP3000487 NT2RP3004670 PLACE1006248 NT2RP3000603 NT2RP3000512 NT2RP3004670 THYRO1001480 NT2RP3000826 NT2RP4001849 OVARC1000440 NT2RP4001447 OVARC1000114 PLACE1006037 NT2RP4001644 PLACE1006414 PLACE1010833 NT2RP4002083 PLACE1010761 THYR01001426 PLACE1007112 THYR01000625 THYRO1000092 Y79AA1000346 Y79AA 1000802 : A:2, B:2.5, C:3, D:3.5, E:4 ktS k 7 'i~f=7 y — £7): (1)NT2RP3: 18 (2) PLACE 1 :17 (3) THYR01: 11.5 (4) OVARCl:9 (5) NT2RP4: 8.5 (6)Y79AA1:4.5 (7)MAMMA1: 2.5 (8)NT2RM4: 2 $9 7— >70— AT% GC% cDNA WEtiESSt MAMMA1 1 51 49 1714 2.9 249 NT2 309 54 46 2623 1.2 365 f*3SR: NT2RM2 1 61 39 2956 1.1 249 NT2RM4 33 56 44 2260 1.3 339 NT2RP2 92 53 47 2400 1.1 368 NT2RP3 85 54 46 2693 1.3 353 NT2RP4 89 54 46 2846 1.2 380 NT2RP5 9 52 48 3314 1.1 388 OVARC1 50 51 49 1964 1.2 368 PLACE1 85 55 45 2044 1.2 376 THYRO1 48 53 47 2433 1.3 362 Y79AA1 6 49 51 2510 1.6 329 VESEN1 1 45 55 3273 1.1 387 -166- T>7"lx-h0^&^#0E7iJ0## (poly-A^A-rich#^a(7)2^:#m#^) m#TkL Ittf, 0~f n% poly ^'S fco T0fco £ (Expressed Sequence Tags) ®TJt*"TT^ T “a ” T $) <5 7. b. ib £>. ACDlSM^##^ tlTho (f) T/P-y^g : NT2RM4000634 *7D->kt. ^io-6C^Td;7l:5^j;D < DmLE^J^900bpm^^#y7<7- amm-T^%^D--7T&a. 3'%j;D7'7<7-7^-7^m^. (6) mm^o-xD^sa^m aigncDj;D ##7 o->0 i! B T # ^ 0 t) 0 d. false-primming site (237 D—7). E[3 % 7° V T 7— 0 fj£ St @E ^'J 77^'f#6 tlfc 0 7* “ 7 T& £o 3 V^TgE^J## (57 □ — >) T. NT2RM40025047 □ “ >0ckD KlgE^JklJ; D 2^:#)S^#^LT#ai##^^I^^^TD--XT&6o f btr >7°l/— h0^F& (47 □ —>), GC-rich (37 □" >). RZ>'60 < 0 jELgB^J (l7 □ — >) (DMT&5. GC-richVlg&b 2frffij£I&W’C$> QMZ&]&W #0#3% DftfotfMfe-3/to GC-rich0 7* - X T ti2^^iS ^7jc #IS^T&6^^. iDDMSO^rSjja L fco TtlT t)#M^7^0#(l. dBDT^^Jffi LTto £ Dc. NT2RM4002504 0 j: 7 gE ^J ^ (D polyA^ polyT^ ^6#^0 2^#)g|#$T teDMSO 0 S M 7^' & tD -170- 7 O —7 —L/to (c) ^ □ —>(D#^Tti¥^SSS^2438bp> GC%^46%x ^ >+h~ h /to (e) 36i@£>EEft D->(Di!SS£^WU 7 C (b^##L/to NT2RM4, NT2RP3& //THYROl T, +8*%C##(DE L U— >&&< ^ tffftfr-D fro (f) E#ft D— 2/ E?iJ (2) (a) a a ^ false-primming site & # # % # # "#" -5« (w E#f 8 . 5 #3fXM 1) #j@% 7X hl/'ff-y HN o .3J (1996.6) 2) rpcR#m-*#^#^(1997 .6) 3) ”www.wsu.edu/Adnaguy/primer.desin.html ’ 4) ”genome- www.stanford.edu/ Aspellman/information.html 5) Y.Suzuki et al. ,Gene, 200,pl49 (1997) -172- * 9 * ^#cDNA©«SE$ijge$ 9.1 *3 ££*©&?> ft$a#7)lDNAb A 5t:^0 3o&&. aft. a© “^©igw-ia ” T&syyA»ey© SSK?m. fTlc*#H(l)^»#(2). 81jl(3)&£T-S:EW9;$Stt, hhy/A (4) t>2ooo¥#icii9o%© K77 s->-y>7i'*a«sn5. yy Ai*eft©#*E?iJtt ©#m##^fr?yyA##©%m«. f©##&«m ttitfojmtts.v, >/iy Miit!)T*?i). -efflfc*. L A'L & *16 , yyA*e^©95%« “5>\>V” DNAT& D . »i3, y 3/19K&3 - h* %e#±l:K*LTV^&. H#*^y y A*ef ©&*E 9ljz)<±T #RgBTt. ^©iee#lgz)i3- Hf >A9K©#E##r(cliS#SU?5;H„ S5>t. aifgoh hyyAEma&< it-fe K77 h->-y>xT$> 9 . mmo^E^T»& nJSB»t$5©T, ?>zi7*«3-Ht5fi)«ffliEllftSiE«®)l#ri!i!1l4*SSI #«ot< yyAEM©##zmafm$. ±g2©@scj: ycDNAizag^sw 93 3* 5= cDNAtift*E*gic#syi3< yKicgmsti-syy asse^ijt^-s, f AT. m^6cDNA©m*E9'l##*iet»BTU&z)1. f©$%©8B^-E^©»©# #?yf'L'T&5. ioT, ©##eeT'5 A 6^?nl#T$ 9. yVA me? ©*%E^t©Mmg4f & At & @#T&3. -#. cDNA©7t£&*#E?'J£ft;t'T-5 A £«, *ftf ##B##©#imi:»?SI^T $>-5tc4)SBt> 6"f. #l:^6A©cDNAlC3DTIl. -e©$tf#®K*69&BilS ©75:6 fc anrv>-e.©75tattT$.o/t„ t haftTisio~i477®ESftUTv>5ystinTv>-5y. %sc#ffi»i i9944u=#ea*Liizm%L/:%i-v d'ft v 'y yffi(4)n«*± < ^Scdnasmt @6^BT%< . cDNA»5f -173- © 5 %REttz)$iW© 500i' n — £75 C ££Bg£ UT{ffSe£$iS66/i„ ->-y>X|g*©[6U:£E5Ci: tilSilt. M#6®C ti 3 8110,000 bp/dayT* o felSff S6^7-S:35,000 bp/day C 31 £ :iCLfc. 9 . 2 (1) DNA+b>7'Jb©l@R a. -9->y;b©§« -trn-;i/X b -y X©ST(#)a'J y ^xe%0r^6SfiVZt. cDNA1b>X;b®Sfit|B|SfC, 6-X d->©* iSg25iJ©X □ x h 771'Jl'fefftTSai/:. b. ^7X3 b#®& . y-y>XMlSI:#fg'X'9Xa bDNA©M#*e#ML/:. ##©y XXlb SSXtBS. -9->X;H@SI;tiX fcf- btffiOEttySijiSttS. l»iL&y e, ->-y> |B|ll#lC-!t>7';b©iS«R -err, sttffiflj©Li-coRas->--y x-tb-rfl ns/t osox d - «S<8yr1"^> tU5*#ESTT*®-9->y;bH8S*?iSl±6t^WVfc„ $£, SttTti, in* T+ry >a®QIA Prep Spin ColumnTSI® L/tlb > 7,1b (TE^ft®) o, aM§gt-*©y7y>a®i8B£'y bt©it ® X7£'XaB8«!'/7X5 • 96-9->y;HiW$3® • am-fb#@@l: jc^M® © +7y>a« OIAwell 96 Turbo Miniprep Killed; -5198 • 96+b>y;bM#m® • ®5l8iMT©l@8 ® +77>ttB QIA8 Turbo Miniprep Kit 1C =fc&3@® • 81b>7';HB|S#i@8 • at' • BSItoifS SAiTSIB ® y 7 y >a® QIA Prep Spin ColumnfC 174- @ f#6nfc7°7X5 Kd NA£1 u 1ED, 1%7 th >7X1/0 &43> ##b^7°7X$ MDNA^TE/iv7 7 —-ZO^CT## L/c. (2) X"7<7 —0^^43j;U(## *yoy%/7hT^, ^±&cDNA(Dm&R^j##^f77n^^o, mm:*am0m# 6y7<7-X^-^>Xto(:T#^r^ff 7 L7h. Xx<7-x^-4:>Xto(:j; SBSgE^^t&STtex * )M£iJ£ fiWf 5 0 H >& S fc ^ X X — gfi^ij ^0 7°7X 7 — 2® H (Forward4o <=£ (^Reverse) (to) a IJ v X X#f^Br j; 0 ^#L7h^±&cDNA0(3 pME18SFL^ X X — ^ X □ —X>X£tlT^£#X C(D Communicator^) AA^CH)
l^iXABI Prism dRhodamine Terminator Cycle Sequencing Ready Reaction Kit (dRT, PE Applied Biosystems) $: A ft o (2) PCR##(D## 967\(D7;i/7X7U —> (MAHVN4550$U^7(#)) i2t7 7%XXG50 (17-0042- 042777^A77;F777(*)) —7%JI/^/c045//L^#L, ##7K300RL^AOX., l600gT5^^^f-5 2:(l:[2j:0, #mu/:DNAm d. 7—7%>X>X ±izG###(DM#eME%##7^;i/A7^ 7-X77- — ABI prism 3700£ AT E^’J %'fefeLfto '> — 7 X > +h — CO + T tf 7 U — gte50cmch L , (2 POP-6 5: IT 7. — if — 7 7 zl 7 T CD fl >T?7K Ifr ^ f#1 T 7 — 77777X&1TO mu, ^7tf7u-g;(D^^&m#^#(D5or^64or(2^ML, Isfto e. X7<7-a2# ±izGL/hV7h'7%7&^AT, 7°7<7 —^(Dt;>777 —Jl/, 7"7<7 —l^KDll/ — C