wno^/ta* 3 Una

t h cDNA^y /

a e

w-m 12^3 ^

NEDOBIS T98099

NE co mm ■ mm B &

$ # im 1 (ICAWc...... l 2 ...... 5 3 jo ...... 6

1 s i. i %±&cmkmmfejkum^(nm.m(Dmm^m...... 7 i. 2 ^&cDKAmmfeRTfmfc7-(Dmyo(Dmife^m...... 8 1. 3 ^^ScDNAE^lJ^:^...... 10 12 1 . 5 #$fl§#?#ffflcDNA:r — *7 ~-'<'—7s ...... 14

mm...... 16 1. 7 AAcDNA?)#&E?lJ#:^(D#%...... 18 1. 8 Ifelftiftfc hcDNA^V J±Mtif...... 19 1. 9 A#cDNA(C#&K?iJ#:^ cDNA###:$ij(7)##, Rtf DNA-r-f ^ nTH'f^Sglff ...... 22 1 . 10 DN A-^/f ?uTU4 ...... 23 i. ii &mcrnA(Dm,mmvfcfeRtfmmmft\m-tz>%\®.&ffimm...... 25 i. 12 m#cDNA

2 m ^mcDNAmmfcRtfmfcT-nmmomm^m

Its...... 62 62 64 70 ...... 70

i S4$ 55£:EcDNAgfl?lJ&£ 4. 1 *tS ...... 71 4. 2 71 4. 3 73 4. 4 151m...... 76 4. 5 78

#5# 5. 1 ItS...... 81 5. 2 tlLfcy7l>')i7, iM...... 81 5. 3 's^y- J±(DWi^...... 84 5. 4 ...... 91 5. 5 ##3C#...... 96

S6$ ####mcDNAT-f 6. 1 Its...... 97 6. 2 ###!#;&#...... 98 6 . 3 *5m ...... 101 6 . 4 #S...... 107 6. 5 ...... Ill e. 6 nmjcm...... 112

7 s ^ScDNAOSSE^J^^S-r/^SSE^cDNA-T — * ——X'>X t A(D# 7. 1 ItS...... ;•• 113 7. 2 £;EcDNA

7. 3 ^zEcDNA t2'—* '>Xt" AtDfM...... 122 7. 4 wa# z i44 7. 5 ...... 146

8 S A&cDNA(7)^m@a^J^:^(D#% 8 . 1 ItS...... 149 8. 2 ##### 150 8. 3 Sm!5^:fc cfclMS...... 155 8. 4 l5!^io J: 171 8. 5 ...... 172

#9# hcDNAy;j*MVf 9. 1 Its...... 173 9. 2 mmt'ntex...... 174 9. 3 177

li a

cn cn cn cn cn ►U. ►fa. CO CO CO CO CO to to to to tco JC |C |C |C |C O 0 0 0 0 0 co co # # # # # # 4u CO K) CO ro 4^ CO ro I—1 4^ CO ro I—1 cn 4^ CO ro cn 4^. CO ro 1—1 CJi 4u gt # # Xm cru z5 4} 4} 4> 4i} Xffl # ■ttF # D> m # Hi uu Xm Xm 3E # OU m cjll Xm 4H S M M > m m M z3 0 4> 4> H> H> a m n 3S # Hi nn|i 4 tffl # m null s 35 » nu|i 4 ##* null > 0 a W Xm Xm Xm Xm h> 0 0 0 0 n B & # Wr rv 44 & H n M z & & 4 H j* 4 Z H 0 0 a O 3* Mn S5 # m SI W Z Z > m a v* # Q 4 > # z z z z % m M> 4§ B E > > V j> a mtti; 4 n tt ca a la ra cfl to a h’ h- FT ru ru ru ru ^ h> BK S n ^ ^ ^ K # a # Q V a #a BH h to m S ruTW ^ ^ ^ ^ * si ^4 4 # ^4 V c^rt # Ev iM 4} 4} 4} 4} a m M 4 gw 9-r- 53i % 4} & Cm 4} Q q Q » a % # # # # a a * a Hi Hi Hi # ^ Hi § 44 -H 44 m 4 S a a IS 5 5> a m a 5F m a a m a 4 a 4J $ $ Si o< & $ M 44 4 m m m % IS K' m >j w 4i y- # H # >4 n d Hi ^ is 44 4 # 4 ^ H

CO to to to to to to to to to to to to to to to to to to to O O cO CO oc Oc c/i C/1 Ui ■u 4^ ^ to to 0 0 0 CO CO 00 00 00 00 O CO 00 4u to ■O to CO to o <1 o 00 •0 CZ) CO to 0 CO c/1 Oc Oc Ul Vl 15. 5 mcDNAT - f ...... 302 15. 6 ^ScDNA(7)SSE^J^:^S.r/ ±##m^cDNAT-^ * '>* 71 ...... 303 15. 7 AAcDNA(D#&Ef'J#:^(D#%...... 304 15. 8 ^ftcDNA(7)SSE^J^:^...... 305 15. 9 A#cDNA(D#&E?IJ#:J& cDNAW£ftffjlJ DNAv'i 9 306 15. 10 DNA-7-T f n T Ix^T (C Z 6#faf 307 15. ii &mcDKA(D&mmfcfeRummMVr\z.m-tz>m®.&ffifflm...... 308 15. 12 A#cDNA(D####(cMi-6K#M^...... 309

1 ] 310

[##] b&...... 323

IV ffs m m

1. # cable

2010¥(e(t25^R ^-ei^^ft-f-5 t=f <0 . ^tUe^y 5 S/f 2005^ariemmam&#Af5fb TV^5.

mt LT*re»§^t hcDNA^v V' 9 oi^it hcDNAf n —y

(7)i£&gfl?iJ£:&£U 5 ^le^(7)-^(Z)B^IWW^fTo/c0 *y p y ^ y h (DfemW-M £ Ulle^-fo j:^-!; y ^^#%^(DcDNA(D#me^(t6^-A-r(l. jEm yxmixmmtsthtit g^s cDNA7 ^ ^7 !i "l^ol't, ^SSSE^lJ^^:SL^0 d'T&DNA#%m#^-A-f#, 4- 5kbpa±<7)##^ P-y|e^i>-CME?U£$:;t U ^ <7) 41 6-gRilBU U £i£XE?iJ

sosui#^emL^^y/<^!g(7)#mfm^^$re^^. mmr/y-y 3y ^^Lfc^ScDNAgfl^J^I&^L. Bf&fiW teSf-f 6 ±%L — — y '>^rA$;liLfco ATAC-PCRj^(e g-esam#Lfcy p-y£iiffl u Mtf WfiWv^y- y ^-y *m mLtz0 &ti. mmtfjte'Mfcx-mmxvx jy/^-y y bmvr pTtg^cDNA-^y * pTW (D\/E$L$Lffi*m±L\^tz.0 $ 'MfcXkl&Bb y k°yf u, ya^y- * %M$tx$5*mtL lita z bizmmMvt^n*xv b^n^mt lx. jz< fflnyy-^TAj^WL, £M£S^6#WtoSIF?)&6WM£{£iSL/::0 o y cc ^ h Tti^f**e9,071fl9(DcDNA^SE^J £ . 2,000f@(DS^SSE^J^r^:S L fc (My'-y 0o &f±(7mmEfW#f^m&m(c^iy £/e¥fi£n¥ioti ^6¥tiU2¥ 4^ it^yy-v'T -Me t N l+7,604i@(7)^SSSE^Jy — ^ L^o #^^FR&12^2 ,3 22 0 , 2,212#(7)^AAcDNA^ p - y (e o ly x S l^y — 9 y T* & 6 g #DNAT —? /

tztmm&jt hcDNAyy ^hw^ibj; v ^^12^3^310 £ X) t £>tzb(DXfoZ)0

-1 - as* amsam

NEDO ■

SS#cDNAro«SE?iJ8i$j:#fig

te)'")v'7xm%.m (Bt)fr-fgDNAgf%gr

$a5fe«#^s#

H1 r^fflStl'cDNA‘yVAf£:fflft^Mf§J^Pi;i^l'

1

##f : 9071 —> J-—A ^±ft : 8951 #E:120 E*4Sf7 —A : 3950 ^'MXWf—A : 5001 A'-TSW Bii-y-j 77v>- ±m mam#X&MM B 3l $;ia a## X>X PX^EX «7» 1928 965 1057 2501 500 1500 500 120

-2- Summary of Full-length cDNA Sequencing Project

Current market potential of biotechnology is about 10 billion dollars, and it would be 250 billion dollars by the year of 2010, and business opportunities would expand tremendously with its growth. In order to satisfy and accelerate such expectation, so called “Millennium Project ” has been established by the Japanese government in collaboration with four ministries and an agency with a budget of 20 billion dollars in total by the year of 2005. Considering significance of information derived from human cDNA, a new project, supported by the supplementary budget, "Full-length human cDNA Sequencing" has started. This project aimed at not only determining about 10,000 independent full-length or long human cDNA clones isolated at Japanese research institutes, but also analyzing partially their functions. The project formation including member companies and collaborating research institutes which also supply clones to be analyzed are summarized in the figure 1. The member companies were supplied with clones from Institute of Medical Science, The University of Tokyo (IMS) and Helix Research Institute Inc. (Helix), and these teams determined entire base sequence from the libraries of full-length human cDNA prepared by oligo- capping method developed by Maruyama & Sugano. A team of Kazusa DNA Research Institute (Kazusa) determined partial sequence of mostly 4-5kbp-long cDNAs, and also determined entire sequence of some of their selected clones by using shotgun-cloning method. Sequence data obtained were analyzed by homology search, and translated into coding sequences where applicable, and then analyzed functions using prediction programs such as SOSui. A large comprehensive cDNA database was constructed to analyze full-length cDNA sequences for functional annotation. Concerning analysis of functions, expression profile was quantitatively examined by ATAC-PCR method developed by Kato, and database was constructed. In order to construct integrated gene expression profile, we also established technologies to manufacture cDNA microarray for high throughput assay. It is important to analyze relationship between disease and its relevant , and we tried in silico chromosonal mapping implementing genetic information database. Moreover, as a follow-up of this project, we have organized a consortium to utilize the outcome of the project, aiming at protection of intellectual property of cDNA information obtained, such as for example, through patent application with some functional analysis. In this Project, 9,123 clones of full-length human cNDA sequence were determined and 2,000 partial sequence data were also determined. The results of sequence analysis by member companies were summarized in Table 1. Total 7,604 clones of full-length sequencing data were disclosed to the consortium members during a period from October, 1999 to April, 2000. Full- length cDNA sequences of 2,212 human genes were published in the DNA Data Bank of Japan (DDBJ; an international database located in the National Institute of Genetics) on February 22nd, 2000, which may contribute to the progress of international genome research.

-3- MITI / Agency of Industrial Science and Technology

NEDO

Technology Research Association for Biotechnology Committee

Full-length cDNA Sequence and Long cDNA Sequence and Functional Analysis Functional Analysis

Institute of Medical Science. University of Tokyo Helix Research Institute. Kazusa DNA Research Institute • Kyowa Hakko Kogyo Co., Ltd. • Otsuka Pharmaceutical Co., Ltd. • Hitachi, Ltd. •Aisin Cosmos R&D Co., Ltd. • Mitsui Knowledge Industry Co., Ltd. • Hitachi Science Systems, Ltd. • Nisshinbo Industries, Inc. •UNITECH. Co., Ltd. ■Daiichi Pharmaceutical Co., Ltd. I (Joint Research) •Takara Shuzo Co., Ltd. • Graduate School of biological • Hitachi Software Engineering Sciences, Nara Institute of Science and Technology

Figure 1. Full-length human cDNA Sequencing Project < Project Formation)

Table 1. Results of cDNA Sequencing of member companies

Total : 9071 clones

Long cDNA Full-length cDNA : 8951 : 1 20 Team Kazusa DNA Helix Research Inst : Inst. Medical Science : Research 5001 3950 Inst. : 1 20

Kyowa Otsuka Hitachi Member Takara Nisshinbo Hakko Pharmac Fujiya Hitachi Science Aisin Cosmos Companies Shuzo Industries Kogyo eutical Systems

H7$t 1928 965 1057 2501 500 1500 500 120

-4 2.

%%%# (*). xm&m (*)

ZL--T*y* (*) % (*) BSSMtar fob: h cDNAY/A (#) B3[^x>XyXfAX (*). $£i£ (#)

- =lX^EX9F%3r

yy yy ^--7T y y x&

m2 / w y y n

-5- 3. flF3e»BteJ:Vfia

m 9 E I* s i 5! 1 1 1 . Ttfe* c DNAE?IJS:$St>**€^-togtl(D tto#pisro

2.9 e.±M c DNAE^iJSrSSUiSe^-roS^ro

3. DNAE^IJ^S (#F-S

4. DNABaM*g&m#Gfm4W

rL-—x y y# 5. cDNAT-?-^-^m%

6.4*cDNA ro$XSS?lJft$»V*«gE-6- c DNAr—9 —/<—H (# 0 VzSfEBfr

7. A# c DN

8. FcDNAy/Aj|f|f 0 9. ^cDNA©tWfe cDNA#*& SfiiSW MDNA^^nTKfl

1 0. DNA-v^DTWdttSiie^-tSilgro 0iz:yy

1 1. ## c DN A(Dmm@E9iJ%S&«:####

1 2 . ftE C D N A

l 3. ft-&pS#WF9E su / n

— 6 — SB 1 * J$*roe$

1. 1

1. fc hy/AWty^- (f 0f?e0r!S >>-f A-^WSe® ^(*#+577,677? n-yo^-y ^7“ cDNA»5'**ga?ii^fc t e, &- 3- K-f 5 "TBEttOiSiV'cDNA? D — y-R>IK*a

5. ilft Sn/c-S(7)*ffl®exicSI LT, £ b i i t)K, Zhb

6. ^*SiM 5r^£LfccDNA?D-yJ81 L, (%# L t.

Full-length cDNA sequencing and functional prediction of gene products.

1. As the candidates for analysis of the full-insertion sequences of human cDNA clones, 8,143 clones were selected by analyzing their S'-terminal sequences with bioinformatics approaches from 77,677 clones of the cDNA collections in Institute of Medical Sciences, University of Tokyo, which were collected from cDNA libraries constructed by the oligo-capping method. 2. Among the selected clones, 4,957 kinds of cDNA clones were distributed to the sequencing teams, Fujiya Bioscience Institute, Otsuka pharmaceutical, and Kyowa Hakko Kogyo. 3. After confirming the identity of the terminal sequence of each clone with that used for the clone selection, the full-insertion sequences of 1,928 clones were determined. 4. Homology analysis, prediction and evaluation of the coding sequences, evaluation of the full-rates, and prediction of signal sequences and transmembrane domains were performed in 3,950 clones. 5. Further analyses in their expression profiles and functions were performed in a part of the clones. 6. These cDNA clones was collected and stored in containers for future studies.

-7- 1 . 2

mmmm, ie #¥#%#, ies7ji, j^ws, mmm&Qm&znft h h^±#cDNA7< 7 7 U — b7jM $ tlfc^^:5cDNA7 □ — > 0 £t£SE7J £ 77< 7 —7 j: D^^b/co biSlTH,f^>tl7°7X$HgiJt{ligI (Genesis Workstation) \Z £ 0 7° 7 X ^ HDNA^iHl, '> — 7 X/XKS * tA, ¥ (CPE-ABltt®! 07 ^ t: 7 U “ MDNA '>— 7X/f“ (ABI3700) CT« E$E7J£#tSbfco ftEte, EteEE26.5%, ###E f*J&3.0%, IE^/MltoEl7.8%> IES¥JI§3.2%, E^$bE¥^® 8.0%, #38.8%7^6o ¥ cDN AE 6 M 7 E b 7 2kbp <£ 0 l,926bp7& o fco 70 7 t>807 ^ / ^b(70ORF#b^l/2 t 0^1±#:068%7$) o 7. BMJT'te. ¥^fl shies/mi, s^mm¥#m, mm, m, jes/ess EE(DJE^S< , -en^'ni,327bp, l,481bp, l,560bp, l,559bp, 2,059bp, 2,187bp, 2,355bp tt£~D T^&o 5'-###R#%0#$ ^¥^349bp, 3'-###R#%0#$ ^¥^805bp7$)O fc0 ORF0¥*9E6te798bp (S>£ 7 > A° 7 RS2667 ^ / ^g)7$>6. ±###7H, GenBank, UniGene, GENSEQ&

ESt7—7 A —X thy b b/clSc^^#:(7)E72%tcE ^7^0 ftTz b, b 0±BB#7j:##8 0Bfl7£7 7—7&

^#:(D^46%(Cf8^T^>o 75%tegenomicE7Jfi¥0 &0, /^TSDNAE^ l^#

##(X7 b <5 KIAA7 D — >43 fg # tE ft] 0 unknown-cDNAE 7J 7 $> <5 „ b 4l b 0 E 7J tiESTth M#(b7:g;^####M%$4l7b^b7 3->7&6. B D 025%^S^Jf7S 0#, f-7^-Xt:Si$tl/;^D“>l:i^t?)o h hESTx-7 te$tiEl687fr (1999.12.17$^) Ii$n7^7 b hSfK705^67S7E^7;5 bSt)tL7V^o 70<£7&¥7££:3iiJ*i^0ESTb t: y hb^t^0^l>§b(hit b07 — 0$>^> t)0, ^cV¥i, EST# 0DNA#7 b b t) 0 (X 3 7101-800) #46%, ##7 — 7 ttete — McLfthQ (X 3 7801bl±) til9%01837 3 — >7, 70 7 7 teNONSTx — 7< —X7##$77V^7##f7—7< —X##0t)0#^¥^$)3/cb b# b, &

-8- Full-length cDNA sequencing and functional prediction of gene products

Otsuka Pharmaceuticals sequenced full-length cDNAs developed by Institute of Medical Science in The University of Tokyo. We used primer-walking method for sequencing. We extracted plasmid and purify after sequencing reactions by TECAN Genesis Workstation and sequenced them by PE-ABI 3700 Autosequencer. Clones are derived from adipose tissue (26.5%), umbilical vein (3.0%), proximal urinary tubule (2.6%), small intestine mucosa (17.8%), small intestine (3.2%), cardiac coronary artery (8.0%) and lung (38.8%). Average length of cDNA is 1,926 bp. 68 % of the clones have more than 80 amino acid. The length of cDNA of each tissues are adipose tissue (1,506 bp), umbilical vein (2,059 bp), proximal urinary tubule (1,559 bp), small intestine mucosa (2,355 bp), small intestine (1,327 bp), cardiac coronary artery (1,481 bp) and lung (2,187 bp). The length of 5 -UTR (untranslated region) is 349 bp, that of 3 -UTR is 805 bp and ORE is 798 bp long on the average. Homology searches were done against databases, such as GenBank, UniGene, GENSEQ. 72% of clones show homology with EST, however, few of them have annotation about their function. One quarter of the clones matched registered clones after end-sequence selection. In summary, about 30% of clones have no homology with ESTs. Conversely, 19% of clones match patent database and half of them are not registered in public database.

-9- 1 . 3

*1*1,057* D->^gH^lJ^^^#tb^o 7 U->(7)|*Iim> katoIIlEJI£fi*(kat)4107 □ - >, (kaia)9* □ ->, 1ESAMJ&ti* (coif) 148* U, W M±&;Mfi;fc(hrc) 253* □->> SM¥MMEti;fc(cas)827 □->, fifSSJ9&6 3fc(hsi)l55 7 D — >, In stl,057* □ —7T&-3 fc =

1:1 uit 96 a y°u- httso^-Sv-a.T;v&> oiAGENkitm^^^!t#(#Mu^m, wmatm, #M^pJigT'& DQiagen kitii^'S

%m&fe®Mn,05i\zm Lrcz-t-v bsue l,000£r* 1J y bfdo ^SE^J #/J\#(l346#&, ¥ instil,680S$ T$>o7c. 3^y9<'7-^ff&(7)^T^^L^77D->A^998, tz (1 v a v h # > (D y > y A & £ #f b tz 7 D — > #59T $>^Tz0

BB^iJx-vfe>yj < gB?iJ£j$IS£-ti"SS/^pJIE&ATGCV 7 h yxy^s/su mmit, mmt, Mmmtttm^tz. y-

yfi^^L/to c(Dcan, am ® 7 n - > yu ^n7c3 a±T#m-c&o;t. yxyyy y-^m#;Mm%MmgBym;t#if >#-^&eik2#&gBfij#& hy>x^'y>#A&^mmL^o foaam, 597 D-XC^V^r > h y >y ’/y'f'?-&frfelZTffiyuLTzo C(D7^47D —>l:MLT^$61:y3 v h #>#&## L&. mm^UAA H%vA(D KlIHI ITU (T) 19 (V/A/C/G/T) (D5@#(D7°7<

0#mDNAmm, 7-7x77^^, y-^yyt>yu>y, ammm# gB^j(D##^6^^mmgB^A#mm^%(D&mm(7)mm^mfb^gi6a, 1,057® ©7 □ 6##l,798,208%&(DgByj^##^^L^o ^ nB M 1,000® tl *f L T 105.7%©

±B(D&oi-^ssn/tcDNAgBym> r#%%k ScDNAyy ^ mfumn >v-yyAmi:M^2:Ra : 1^3 >v-y yAmm^gByjm^, gByj^^y-A^^T 5,i53> 2,129(2000^ 2^AB^&)T$,*. i>#DNAy-^/X>71:T^MT6C(!:l:j;3T, 51^ E^lJ^^y-A^#:T2,212, ^340 (2000^3 ^ 23 B T $> Z>»

-10- Full-length cDNA sequencing

A total of 1057 clones newly screened by eliminating the repetition of the same clone as previously known genes were subjected to sequence analysis. The new clones contained 410 clones derived from katoIII cell (kat), 9 clones from normal small-intestinal mucosal cells (kaia), 148 from normal colorectal cells (coif), 253 from epithelial cells in the renal cortex (hrc), 82 from coronary arterial smooth muscular cells (cas) and 155 from testicular cells (hsi). To prepare the template DNA for sequencing, we compared the efficacy between Fujiya manual method using a 96-well plate and QIAGEN kit method. The latter was found more appropriate in respect of convenience although both methods were available for preparation of template DNA suitable for Sanger's reaction from quantitative and qualitative aspects and data of high quality were obtainable. The number of clones of which sequences had been determined reached 1,057, which was beyond the initial goal number of 1,000. The max and min lengths of those sequences were 4,690 bp and 346 bp, respectively and the mean was 1680. Determination of the sequences for 998 of them were made only by primer walking method, whereas that for 59 clones was done by the method in combination with random method such as shotgun or transposon insertion. Assembly of the sequence data was made using ATGC software, which allows to rapidly and accurately combine with the sequences and the sequencing became high-speed, convenient and highly accurate. By starting unidirectional walking from both ends strand-independently, it became possible to design a primer with ease and few artificial mistakes, and manage the project, indicating that the walking strategy was effective to control the status of clones and reduce the burden to construct and edit the consensus sequence assembly. Shotgun and transposon insertion techniques were introduced since those were able to read regions that were difficult to read because they contained some repeated sequences of which primer-designing was impossible or sequences which terminate the Sanger's reaction. Transposon insertion was mainly used to cover the primer walking method for 59 clones and shotgun method was further applied for 4 of them. For the unreadable region because of a long poly A stretch, five different primers, (T)19 (V/A/C/G/T) were additionally used to make complete determination of both strands with accuracy. Speed-up for the respective steps of large-scaled sequence analysis including purification of template DNA, sequencing reaction, data assembling and accurate construction of final sequences were achieved. Thus, base sequences for 1,798,208 bp in total from 1,057 clones were finally determined. This corresponds to the achievement ratio of 105.7% for 1000 as the goal of clone number to determine. The outcomes of cDNA sequences analyzed as mentioned above were disclosed to an utilization consortium, leading to an establishment of intellectual assets. The total number of sequences disclosed to the consortium was 5,153 for all sequence analysis groups and 2,129 for group IMS alone (at the end of Feb. 2000). Thereafter, an international contribution was made through opening the outcomes into a public DNA data bank. The number of data opened to DDBJ was 2,212 for all groups and 340 of them were analyzed by Fujiya ( on March 23, 2000).

11- 1 . 4 5S±ScDNAE5iJi*5g£tfae?g#)CD«»ll6^;!->7.xAroM5S

sosui&i*©

#rL©$@&#A. raSScDNACSLfclBff ->XtA4, #»©CPU &iS*LfciStt|g7 ‘j;l7-XXT--y a >£, !/--> a A AvXAAti: *95000* ©cDNAEf>J£-ffiLTl~ 2 0

-cmmzMVr-tz>z.trt»smT°$>v, lt-st*5±e>c 4^WJ. WfcSrSIc^MSftfcEfiJx-X©IR9& * # # * 2 n t> © £ & o T © -5. *yXri>ii8* ®«ig^SI^DX'5A4#fT-r?>a 7®»t, |g*/j:AA cDNAE^J < ##, tl5S. 8W5-f>5'-7x-XS»i; C*g!JT®-&. (1) 37» «E;P«l£fT&5 37l»ll**t«'):cDNA©#m?mi|©^A©"«KE9i|##@B#" t. #ig*ff©7cie)©#iRsnycT5 ymEm:**&"75 x^E?ij»ws»"c»(te> n-5o ■en^ntofSff^D^AAiiseitocfiiijsn, X±l:##$fi3. K$0©EMC* * 5 *1 loltt $ S (BLAST2), S 0 M S K @ #( © ¥ #J( ORF# tti, ESTE fJ © M 9 # If. Testcode ©*g) aio^D?7A))!*ssntn4. $7c, T5 x®E^J»#f«»t?U7 5 ;® Efj^©##R (Translate) An tf y A (SOSui, Motifs, ProfileScan, CoilScan, HTHScan, SPScan) 5")lf IS L fcQ (2) -f >^-7x-XBB» T >?-7x-X#(l*i0f-JS-fi!/T, iSjSAvD84*6XxA±C® 0a*tl5. E9HHS*©if;ir-7 pC»ltTflSt0atr C £ ao a—tf-tt)Sff*f*i;-r-5E?iJt»#f7'n 77A5I«b, >£X U -7 ?-r3fc*W-T*8T0ffl*ri*firfc:bft-5. SffT'nT’ 7A0S/177-? (fflWtt^*©x-X A-X,ttfflCPUSS-g-tf)ttBB±A' e.tt*CIS

J£bJI6T*-5„ *«*0X37^> LTA#©T-X©4= A^6##©&V— Tl>5. &^##©f-X^-Xlc79bT#M###&e^o)hlK*&#E*3 C£K j: 9 , Orthlogitfs^ ££>*■ 9 < #ST#5<, cDNAEfJ £iM IC*:* L fc © 5 . yy AEfl- ESTEfJ. 75 y SEMS^t X y?:#*bT!#9#lt. X37©S^l:

-12- Development of functional analysis system of gene products for Full-length cDNA sequencing

In this research, the system which automatically did a suitable analysis for FLcDNA was constructed by using suitable relational database (Sybase) for the handling of an efficient workstation equipped with two or more CPU and a large amount of data including a new technique such as SOSui in addition to a traditional technique like a homology and a motif search, etc. This system can analyze the FLcDNA sequences of about 5000 at high speed within one or two days. It also be possible to have a look at an analysis result by extracting necessary information. Moreover, not only the public data base such as GenBank, SWISS-PROT, PIR, etc. can be automatically imported, but also the newly developed analysis method can be added to the system very easy. This system can be divided roughly into two parts, i.e., core program and the interface program. The a core program where various function prediction programs are executed, a huge input FLcDNA sequences, and the results of function prediction are efficiently stored, retrieved, and displayed.

(1) Core program The core program is roughly divided into " sequence analysis part" and "Amino acid sequence analysis part". Each analysis program is automatically executed, and the result is stored on a relational database (Sybase). In the N.A. part, the following programs can be automatically executed, i.e, a homology search program (BLAST2), a detection tool of the repeats, the function analysis tool of protein translation area (detection of ORF, Testcode, homology search against EST data base). In the A.A. part, translation program (Translate) and other verious analytical programs (SOSui, Motifs, ProfileScan, CoilScan, HTHScan, SPScan) were implemented. (2) Interface program The interface was designed to handle a large amount data with high-speed process.The user gets on this system taking the FLcDNA sequences in the lump by specifying a storage place through the network. As for sequences, it can be handled after dividing them into two or more groups, and the addition of other sequences is also possible at any time. The user only selects sequences and the analytical programs, and all processing will be automatically done. Each parameter of an analysis program can be arbitrarily determined. The score and the character string of an analysis results become a key to add the examination from various aspects to an analytical result. The Ortholog relationships can be displayed comprehensibly by the result of homology searching against the data base of various species (human, mouse, worm, E.coli, etc.). All the results were displayed linear in order to overview the whole results together including homology search against the public data base of and the EST, and other amino acid analysis programs. Moreover, the result of the amino acid analysis to the protein translation region displays the part to which the function and the structure was identified after the translated area is displayed linear to another kind. In addition, three reading frames can be translated into the amino acid, the result of the homology search executed for the known amino acid be arranged, display, and the frame shift may be detected. The progress condition of an analysis each program can be confirmed at any time as a system management tool by the job monitor.

13- 1 . 5 *###TmcDNAT-f

r*l8»t hcDNATVFL7"D->';c7 8rS^S*cDNA7 o->®±#E9i|A%^e*fi6. ctuiimcE^jyt $2*1 Site ^dtSBgSfi&ff 7lc$,/tO. ■5-ffl*a^D7T'f U & 8«riHTofEaisss5*siLt, 7-+f> • 7'n-y -# WT-&-5S* #J$lcS*58#PiH, RNAS43 *1*3 x K0,'S^B*yn-7x7 h® ±3 tC#-

ATAC-PCR (Adaptor Tagged Competitive PCR) B. ^'MOit > ~f f^ip S iaSST^S!# **$*fcU £6£«ft«*¥K*¥®OTStitW± (Cj;oTIS%SnS:g*tt6Cit$#?T$.%„ *E%TB, ATAC-PCR®>\< X *- 7' 7 h- ft'>XxA®«ISSrff FL7/n-7x7 h B1W91-ATE?iJ9i$2 fl7c 7 n- >® %8 fillg*iRWL, *(®#¥lfrffi©cDNAx-7*-X*#fi£-f 3 d tSSSi Lfc„ A -f X ;lr - 7r 7 h • yXfifflliCtt, 96* to^-yfcfxU-Sr&ofcBiii DNA Analyzer*#* U +F > 7P;U® M *96*e?¥fi-PfT X 5 * 7 CSC# L ft. ATAC-PCR*ffl®Sfi^#SS9*,7'f 7 —StoEffV 7 h^f-^fflSV 7 h& 7S1V7 h©E%t>ff-ofc» 2©iBS, 6B86Mlcd3B4«®e®ievt%ayn7 7'f * *ig$fc0l00»fi^lC7iVrTS**5^ 2*lB, / -If >7’n 7 h &*tt#LTK + (g®*m#e* *£, ATAC-PCRi /-+f >7'D 7 ht© tkKStBtrfrofc *2 3, ATAC-PCR® fi A MICE A 'Stlfc. %$*$»!£**$*®/ts6c atac-pcr®S«t6](Ditlsi1® 6*B<$*l*i§E#zp* L/t#IS®S6ff, (D <*»©iMR»*©«ffl, ©8J$W®1f >:/$«S®&M. ® v tv --> g > • *-7 ’*^s£L, 2 *1: *oT##%©A *7 BfJofc. B?W® 7 D-/l;7UTIi, E^H*$®ESi:* 6ICATAC-PCRC =k5SI6*ff ®- *#%1=&©TB. 1739afiT®#J6^6941*e*ICO®T, 7B«Po1®fga^n 7 7 -i *%-/0 7 7<;l/d3*aE^#*A^5®0RF*#m*®##*@*: ^5., «i6»«McDNAx-7*-X*##bfco C®###®A®T-7*-XB#a me?®####t:&/co, ii#i:*oTt)*»T#(8T*9, 4-s®8gJ!i»^n •5.

-14- Development of cDNA database for functional analysis

In the 'Applied technological development of strategic Human cDNA' FL Project by IMSUT(Institute of Medical Science, The University of Tokyo) Team nucleotide sequence of newly identified Full-length cDNA clones were determined. As a functional analysis of novel genes, expression profiling is extremely important. Although Northern Blotting Method is the most reliable and common method to study gene expression profiling between tissue, the method is difficult to be applied in the case of this project which studies lots of genes, because it takes a lot of time and cost. ATAC-PCR (Adaptor Tagged Competitive PCR) developed in Japan by Dr. Kikuya Kato (Nara Institute of Science and Technology) only need little sample, is highly quantitative and sensitive, and this method result in high repetition occurrences. The research constructed the ATAC-PCR High-Through Put System, and FL Project IMSUT Team collected information on the novel clones, and developed a database of functional analysis. In order to construct High-Through Put System, UNITECH introduced high-speed DNA Analyzer with 96 capillary and made it possible for each unit with 96 gene to handle all samples. Also developments of gene specific primer planning software, data processing software and data control software were made for ATAC-PCR. As a result, we made it possible to fix quantity of 100 gene per day as highly reliable expression profile between 6 tissue. This is by far, reliable processing ability as compared to Northern Blot Method. The comparison experimentation was made between ATAC-PCR and Northern Blot Method and the result showed that ATAC-PCR has greater detect capability to Northern Blot Method. Expression level of newly identified gene is generally low and in order to find fix quantity on a stable basis, ATAC-PCR was technologically developed. They are; l.to measure gene of the original tissue as an internal standard, 2.use of multiple restriction enzyme, 3.examine the concentration of each sample at the measurement, and these studies resulted to improve the frequency in acquiring samples. Also the use of three-step concentration from original gene of internal standard enabled to make calibration curve, and to get high standard data which brought the highly reliable fix quantity samples. The clones distributed from the IMSUT were studied with ATAC-PCR, and for this particular research, 941 genes out of 1739 were used between 7 tissue to get appearance information. The ORF estimation from nucleotide sequence, and ATAC-PCR analysis result were put together as cDNA database for functional analysis. This exceedingly reliable database will be useful for the development of functional analysis of new genes in the medical industry.

-15- 1 - 6 ±#cDNA(DmmaE9]&2&#:t###ecDNAf-f-/<-XS/%:rA(D4#m

*5*£ttEl2Stt3r

Ltz.rn.fci-(Dmmmwit z && ;m LT. ^ y * y t h^scDNA^ 04t>fxy^^fAX, 0 t&y LT. 5,001^ n-ycDS ®EyiJ£fc£Lfc0 0 £ gift iff T? ft. r <7)\H2,50\? L/c0 $ fc, ±%#A&cDNAT —^ A#cDNA(D##^T y T —y 3 y&iftfifrSt-fT 5 ft«S:M|gi"5 r £ ^ J; >9 . Lfc£gcDNAK?iJ£te*A

(1) ^ficDNA(7)BflyiJ^:S ##gi^7 y “DNA'>—yyy —<7)^At-«t o , y 7 yyy-^-Eii-^oiigo, sayij^r -Exy/HcSequencher&mV'^yy^ i& S E yiJ ^ i£ £ LTE ffit/co mm&rn. y &~y? 2,501 J^JtSr^T L/c0

(2) ±m#^C^cDNAy—y ^^7" A(D## 1) ±m#y-y -<-;%##?)im#mife=i Hyiyits#!©)!#, RTfMVri&miz.m^'it&mmM&ft V\ r cb^^T^^y^yA^^I^L/Co ^*DB3£^<7) £«><£> g i&T y y — y 3 yy*y a £ lx n, DDBJ^o^ftcDNAy — y iftfflor-y — ## y y. y A ^ L T ^ orft yy — y 3 yyyy an y —y^< —7.y — y (-&<(<##Tyy-y3yy%yA, ^y^xyyyy-yayy^yA^M^L, y

2) it, ±m^y k°yyyy y-y a yy^yA(D^m(-,

3) 5,000 y o — yw^fcDNAC^t^t^f-^ 7, t LX. _hEStify —y £: E^#J

-16- Sequence analysis of full-length cDNA clones and construction of a large systematic cDNA database

The purpose of the project is to construct intellectual bases in Japan by analyzing gene functions based on the sequences of full-length cDNA clones which were isolated at Helix Research Institute (HRI). The project planned to determine the sequences of 5001 clones at Hitachi Ltd., TaKaRa Shuzo Co. Ltd., Hitachi Science systems Co. Ltd. and Nisshinbo Co. Ltd. Hitachi Ltd. analyzed 2501 clones, and also we constructed a database system to analyze full- length cDNA sequences for functional annotation.

(1) Sequence analysis of full-length cDNA clones High throughput analysis system was established by introducing some instruments that is Hydra micro-dispensers, Gene Amp PCR system and multi-capillary DNA sequencer. Primer walking method was used for the analysis. Primers were synthesized based on the selected sequences from sequencing results with a software of Oligo, and data were assembled with a software of Sequencher. We reported the sequencing data of 2501 clones after solving several problems resulted from sequences and clones.

(2) Construction of a large systematic cDNA database 1) We developed large-scale database techniques including a data pre-processing system, a registration system to public databases and an integrated analysis and display system. We are able to confirm sequencing accuracy, estimate a start codon and edit sequences based on the analysis data using a data pre-processing system. For the purpose of registration to public databases, we constructed a system, which helps registering full-length cDNA sequences to DDBJ as auto-annotation system. ORF annotation system, a functional annotation system based on the database search and a mapping annotation system were integrated systematically.

2) We constructed a system to identify disease-related genes and polymorphic region based on the mapping annotation system described above. This enabled us to pick up the sequences of the related genes and information.

3) An integrated analysis and display system was constructed to analyze the sequences of 5001 full-length cDNA clones.

-17- 1 . 7 ±#cDNA(DS*aE9i&2(D#9E

ax

^ v y ? &&cDKA? v-> f- — h (DpolyA(i24bp'C & o fc0 & t A; £'(D ^ o —waR8@% < 36^ o — n — ytfeo fc0 false-primming site#ft fc#< . o V' X*2fcmfeMl& Id «t 6 flSit P$^ T\

T. ^ y (4^ n->/) ^#mi-6^^L-C^-C##Lybo a#&y7 7-7(D#@3: o tz\fi) tz

Wc L o *#%e@L-C. -fy^-r X 5Ktt£0)#tt:ll£|l£®tfc-f-5 — ? ^ y \/ 's Jf&ffli & x $r)VT y y-f-S L

Sequences analysis of full-length cDNA clones

Our objective in this project is that both strand sequences of 500 full-length cDNAs, which were established by the Helix Research Institute, are analyzed by using the primer-walking method. These cDNA clones are originated from the libraries of the established cell lines and human organ tissues such as NT2-cell, ovary, placcenta and thyroid. The sequences of all but 36 clones have been read without troubles. The templates of the thirty six clones have obstacles such as lots of false-primming sites, secondary-structure formation, impurity and long repeat sequence. We managed to read the sequences by long time-consuming tasks in which lots of primers were designed or subclones were made by the shotgun method. The difficult sequence intends to need lots of primes. The ratio of designed primer number to the useful one for each clone let us know that NT2RM4, NT2RP3 and THTROl libraries may have difficult sequence clones. From these results we established the accurate sequences of all 500 full-length cDNAs and also improved our sequencing ability.

—18 — 1 . 8 ±ficDNA®t6SE9J3fc$

*e>ti>3 3Ef5rJS#itfSDNAl^;|/T-8SilWT£-5=k5 ffiox, bh8e^c>f£$BE?iHiim, e«b##jTS5.„ t nae^toSSE ##©*Mib&m7T##4T»bi)^$Ai a^'H»g©t h«e^©^ftE5iJS-ft£f-5. $7c, satR#C7—X

>X6b*®|S]±S0-5„ M #: 69 C It 10,000bp/day T'&-z> fcMtirffeJ] & 35,000bp $ T Bl £ _h If •5. ^S6cDNA©#*E?JftSiaHTro=t x icffofco 56f, -\'J y?XiMi'5>SS bfcMftJ: D ->-7->X£jS©«ffli&.|>X‘7X3 FDNA£M8bfe. 7#C, -/XX5

KdnaCX □ —- >X,snTtA57c3::6cDNA©5fcSt6SE?|J6 8t6b, # BflfcSSE fij&mcxxf • if/$bzt. cfi&miATtab*i=*;sbZ:#*Em=#

v-«K4BlECbfc. «*», HPLC£ffl b/t #87)1-2-$T$> 0 , |s|#l:#.» ©XX'l'7-$:#84 5©tt8©TMT, 6'7ltffiAsW'oTb£. 4©7bA, 77-f 7-0»BAiy--)->X»fff©*acai.; tai^MT^fcOT, d; 0 ffiflE7)^ffl®Zj: X x< 7-#8%&*ab7b. ttt08S. *f7bC#Stbrcffig*-h U -y yT0*SJtt C± 0 , 20*©:/7f 7-£30»T«B-r3 st/SHiJIIC&ofc. ciiia$E*S©6f8©S $T$ 0 , #$69%%*Efil©##C*#<##b^.

se-c, =ko#e7)^oamt:%*EM&*)g-#-&-x< , pcrki $, y;pE68-^'>--y > X*#CO©T©#M&lf oTh. —-St, y-'r>X'rJH:-l-*A75 h'5: #9of &CCj; 0. i«GC®*©E5iJ£J; D»EBC&Jt-f-5 S $7c, X-X-> XKJStotH X ;l/�mS7&U7< V-9--x;l/tt-C©->-X>XSS$fi:'5 AtCiD. ##%### UAEm©##ca#69&#b 7b. SBC, #$©Au#*E9'j©*#tg Nb/t. $7c, SffcCABI 3104-7 27 D-v-dr >+)—£* A b 7b S tea D, #C/\- FffiT ©###*7)17 "j -fbtzMfS' D T& < , 4 CfflV^fcLI-CORttSas 4a-'ABia8©8-'> —X >-9--©#tt&S^bTfRE4ff 7 CtCJ; 0 , «xnc«ff^feC*i:C7aiP7-5St*i

-19- otz. a wft b O^rt^tTo fcJ&H. ft) 10,000bp/day "C - >r y X il& A £ 35,000 bp/daytc $ "C 31 #-t If 5 Z i: & t±j^/i0 t LX. 0 b

Sequence analysis of full-length cDNA clones

The molecular biology, which has been drastically advanced in recent years, has opened the way to explain the mechanisms of many biological events at DNA level. Thus, information of human DNA sequence seems to have a potential to create novel businesses, such as developing new medication and gene therapy. Therefore, it is necessary to promote characteristic analysis related human DNA to lead the potential market. To address this request, the strategic analysis of “complete ” length cDNA, directly linked to the functional studies of , has started as a national project. In this project, we Nisshinbo planed to sequence 500 complete-length cDNA clones, constructed and retained by Helix institute. In parallel with this analysis, we also had a plan to enhance overall sequence capacity up to 35,000 bp/day, 3.5-fold increase from the beginning. The complete-length cDNA sequences have been analyzed as follows. First, we prepared plasmid DNA that contains complete-length cDNA insert as a template for sequencing analysis. Next, we determined terminal sequences of the insert (first run). The sequence data from the far end of the first run could be used to prepare the primer for the second run into the region of insert. In this way, the new primers were designed and prepared around 100-bp from far end, and applied for the analysis of subsequent sequences. This process, called primer walking, can be repeated many times to sequence extensive tracts of insert cDNA. Finally, the whole sequence of complete-length cDNA was determined by assembling all fragmental sequences that were obtained by the primer walking analysis. As a result, we Nisshinbo successfully finished to sequence all 500 clones within the period of this project. We established the new simplified purification system of primers, since it requires considerable time and complicated steps using conventional HPLC method when deal with a number of primers. Thus, construction of new system was an important factor for rapid sequencing. The new system using a simple cartridge enabled the short-time purification. With this process, 20 primers were purified within 30 minutes. It allowed 6 times speed up and significantly easy treatment, and enabled effective sequencing analysis. To determine the sequences more precisely and speedy, we have been made improvements in PCR condition, gel composition, and sequencing condition. For examples,

-20- formamide addition into gel has brought better results for GC-rich region, and non-cycling reaction named isothermal method was effective in sequencing long polyA region. On the other hand, introduction of new ABI 310 capillary sequencer has increased sequencing capacity, moreover, taking advantages of individual sequencers was useful in accurate determination of difficult region to sequence. As a consequence, we successfully improved the sequencing efficiency from 10,000 bp/day at initial to 35,000 bp/day after the conditions optimized. In conclusion, Nisshinbo has successfully determined all 500 complete-length cDNA clones and improved on overall sequencing ability to 3.5-fold increase from the beginning. It means that all of our purposes for this project have been accomplished, and our approach to this project can be achieved because we produced the expected results.

-21- 1 . 9 £6cDNA0fflSR9J9fc$> cDNA^$g#$lO##. &tfDNA"7-f £ □ 7 flFll t$«r

¥^10¥SNEDO^3^:SlE^STfTtin/r hcDNAy/Afbm##M5Ej ^ U rf^-f v 7° & £ < — X U 7 n — > ££ STc^ScDNACDSSEyj&^/^EE&E^L IH%-t-7T$>5 =

QSEja (#) A u '7 7 XE^lE^T5,000{@CDj!XgB?lJ£&^f £c ©5,000i@(DcDNA7 □ - >(D## |fr>t ^rPCRE^TiSifig LME1"^ =

7h. ®/^100%(C#^%(AgBW7 PCR^^#(7)4%#^# L < M^h(D (±fo(D6%& T> E2l Hi % ME FbI ft ^ 5,oooSE cd cdnat £ £ S T

Sequence analysis of full-length cDNA clones , Establishment for stock and delivery system, and DNA microarray process technology

The main theme of operational technology development of human cDNA genome application, which is financialy supported by NEDO grant 1998, is sequence determination of full length cDNA made by oligo cap cloning method. TAKARA Shuzo Co.,Ltd proceeded with three project, namely. ©Sequencing analysis for 1,500 full length cDNA clones from 5,000 cDNA discovered by Helix Institute. ©Establishment for stock and delivery system for 5,000 cDNA clones discovered by Helix Institute. ©Amplification and purification of PCR fragment from cDNA for making DNA array,with the corporation of Hitachi Software Co.,Ltd. The result is as follow ® & ©are completely finished and ©is finished at 94% . The reason why ©is not completely finished is that 6% of all clones has very poor yield and we could not make all the PCR product in the period of this project.

-22- 1.10 cDNA77 7 □ 7 U7 Ic«t5

HSV7 h»i7i>y-7U

*E5ETtt. 'x'J y 7XE%0rl:T#6n^h hcDNA^'f 75'J C'fi'?., 5S±S*s £fceStl7c5,000H©cDNA7 D — 5cSficDNA"7'f >D7 L7 (EfiK&WtoilliE i, -?7 » nr I/'HC j:%%S)Rt1r*?iro#iERy:seES BlSi: LTV>5.. CCTil. i6 s.R&DNA-r'f zui’V'isen. satt©ff«. &tf»e y^BSStirtoHiESfTofc,

DNA7 7 ^ □TK ff J5$ftW©tlu ltt&Zc D5,0007.#-y H6i$S£Lfc7l/7©ff®©£ettti:, 320~3507 l/7/fl tz. $/t. f#^b^7k7©7%@M^#%#ie (ttffl-iJI67V"fZfffi£7l/'f) tt¥i$ 93% tti-Dtz (lli)©fFST48ttfF/$) . btt 6 =fc 0 , 19 —&an*^3>fi$gtt©i6HDNA 7 7 7 □ 7 1-7 DNA@$ffl3- MrilUfcXrH K# 7 X £5#E<£ffl V. LfciBS, ^ V -L- U 7>n- hySeL-TM8tt©A(xca^*UML^. #5%j; 0«i@bZ:MfW]©a%:6 77 >n7H JttBlA'f 7*V 77-fer-Zg >L7cSS %, 7 1/41 3#S©fii$©7 —y-y hDNASSSfflViT/N'f 7U 77 if— -> g > lfc£8S, iSi$© SSg&EBSSJlto LZt. 2wg, 5Kg*©mRNA^6i@SLyc7 —7 y h DNAiSESrtil ©T A7 7 U 77-b'—-> g >£fi7A 7 —y y hDNAi§f$6JIBIC#il&mRNAiitti;2«g£fi7b *5fi75:Mf£;B#M£#ISbfc„ A<7vyMi$m©ik#ie©#M&e©. sie&issasgbfc. sett. *8tt©W« ffi;tl;:iI/bZ£7 U7 £fflbT. [b|-:#:#TM£a7 7'U 77-e-->g >b. -e©iB*© »»tt£SI*b/L. S8$J; 0 . 7 1/7 fEl$KU7\7 7 U 77 -b'- -> g >#*^AbSett. m^vtZKLtzz. tttmmiEfttz, me?%8##©*m b Zt 7 7 7 D 7 1/ 7 iC M b T > 2%M © 7 — y *y h DNA (Human Liver, Human Brain) T*s$SA7 7*V77t:-->3 >$(7©. ©K#:#©#ie&K»Zc. Human Brain 7©»*#LT©&mef. Human LiverT© »%8 b T © 5 «fiy, SffBiTJtllT %8©%fb&N#ttm<#aiemT&6 c

-23- Development of gene expression analysis technology for the use of DNA microarray

This project uses fully sequenced five thousands (5,000) of full length Human cDNA clones among the cDNA libraries obtained at the Helix Research Institute and targets to establish technologies and techniques to produce DNA microarray for the use and verification for gene expression analysis of those cDNA clones. For those objectives, we have studied conditions, stability, and reproducibility to produce a high quality DNA microarray, and proved its effectiveness for the gene expression analysis. 1. Establishment of a DNA microarray production technique A quantity productivity of a DNA microarray with five thousands (5,000) DNA clones on one microarray was three hundreds twenty (320) to three hundreds fifty (350) per month. And a ratio of a usable microarray to defective one (yield rate) was average 93% (forty eight (48) microarraies can be produced by one (1) production operation). A fundamental technology for quantity production of the homogeneous and highly reproducible DNA microarray has been established. (1) The poly-L-lysine coating was found out to be the most stable and reproducible method after the comparison of five different kind of DNA stabilization coating method. (2) The DNA microarray lasts for a little less than two months after the production, according to a series of hybridization experiments at a different date(s) from the production. (3) The most suitable composition of the target DNA solution was found out by comparing the three DNA solutions at different kind of composition. (4) The most suitable amount of mRNA was confirmed to be two (2) fig, after comparing the hybridization experiments with a target DNA prepared from two (2) and five (5) fig mRNA, respectively. (5) The most suitable hybridization time was confirmed by comparing hybridization results at different time period. (6) The most suitable hybridization washing condition was confirmed by comparing different kind of washing conditions. 2. Evaluation of stability and reproducibility Randomly selected DNA microarray was applied for competitive hybridization at the same experimental condition, and dispersion of the results was evaluated. The results have confirmed that the DNA microarray production and the efficiency of the hybridization show the high level of stability and reproducibility of the DNA microarray. 3. Verification for the use of gene expression analysis Two different kind of target Human DNA (Liver and Brain) was applied for competitive hybridization against the produced DNA microarray, and the measurement of the amount of of each gene was attempt. As a result, a gene(s) expressed only in the brain, in liver, or in both was detected to prove that the DNA microarray can detect the variation of the gene expression with high reproducibility.

-24 1.11 6EcDNA<0i££E$im$&tX8te«*Tlc|!|-f

fc5££tt7-f • 3X^EXE%9r

^ LtlI^cDNA04J1:\ t M>S(7)SmcDNAc7)j^SgdyiJ(7)^:S^ff ofCo £ /c, cDNA

S L^o SEcDNAOj^SSByiJ^^^X^ft, t h t>E(7)SEcDNAy d —y (4—5kb) 2000#(DM ^(D^E^^&^SrfTofc, r (D^^SaM(Dtf^^^Jffl It, (D##y d - y 120#(DilglJ^^^ L/^o P-yl-OV'T, y a y hyfy?£lc«t *9 ^SSEyiJcD^^^rff V^x 120#(7)j^SEyiJ^:^^^T L/c0 E^T-^ld:, U 6z&^-e^yb. #mLtyc-yDNA&;%7^ -tdac *(d x pcR&^im^^#m^#5z^as-c#^. @#±

Long length cDNA sequencing and development of new technologies for functional analysis

In the analysis for genome, it is important to analyze cDNA as information for the investigation of gene function. We have achieved the sequencing of long length cDNA of human spleen that Kazusa DNA Research Institute possesses. We have also studied the technology of targeted end-labeling by using three stranded DNA formation and detection of fluorescent signals, as the basic technology necessary for making the DNA chip of long length cDNA. We accomplished the sequencing of both 5’ and 3' end sequence of human spleen long length cDNA that has 4-5kb length. Utilizing this information of the partial sequence, we selected 120 clones of the possible new gene by comparison to several data bases. Then, we completed the whole sequence of each clone by the shotgun method. These informations of the whole sequence of long length cDNA are useful as the new data base for the analysis of gene function. Using the end labeling method for DNA with three strand DNA formation directed by recA protein, we studied the targeted end labeling technology of long length DNA and the fixation technology of the end labeled DNA on a solid surface. We could get the high efficiency of the end labeling through the optimization of the reaction to introduce efficiently an amino base into the

-25- one end of a long length DNA strand. The optimization of the fixation reaction was also carried out to fix efficiently the end labeled DNA on the surface of a slide glass with a covalent bond. As a result of using new technology contained the end labeling and the fixation by the covalent bond, we could obtain the same detection capacity as that of PCR method. In the study of the detection of fluorescent signals on a solid surface, we constructed the trial detector using cooled CCD camera and macro irradiation system. This trial detector possesses the basic performance.

-26- i.i2 aacDNA

»efme? «*M#e>nntf. 7«&©#»d e&s tJBtJtis. b holes -f-j'^-XOMIM»^6$«lcE-fal*S$5ISm-rc«. *EflS®fi¥69#HT$.a» e#:A-> KIWS^WfflT'^a. fit, E?iHi$e*IC3 >ba-5'l: j: «$r*65ln SilicoSS&ftT -y b > 7*©#ft £&W L ft. *£#:*&+»* 1C it. MBitfi o, bioscouT-esmufc L T, UniGene £ GeneMap©* IR&K^ftLfcSa^J Mi T y y*#©T-^<-%&## L^b. *iCr-?M«fil:*#it sain Silico*6#:t y fcf > d/SrHISttif S £ itfffijfefc. me? a;bAic». y y adnah ©d- >y;b e^f njxt &&. a€tt5$S©7*- A h*3>t> h©f#e.nTUa5S*20^ffli'7HTlK*$fi:^y y ADNAtottttiSff o ft. —S©-y->7';Hc#viTSNPsSSrS5liSLfc.

Development functional analysis of long length cDNA

In case we search for relationship between gene and disease, genetic information is very important. If we can get chromosomal band location, then we can get the information about human disease information from OMIM. We build the integrated genetic information database by bioSCOUT. UniGene has chromosomal band location and sequence data. GeneMap has physical mapping information but does not have sequence data. We need the sequence data when we try In Silico Mapping. So we extracted the sequence data from the integrated genetic information database. We built two kinds of databases. One database is the sequence database including UniGene and GeneMap. Another database is the integrated map information database. We established In Silico chromosomal mapping system. When we get evidence for relationship between gene and disorder, we need genomic DNA samples. We collected the 20 samples of 5 families, which we got informed consent. We extracted genomic DNA from 20 samples.

-27- i 3

Horn###.?)##. ^#$m

^ m-o fL. |gP^fb(D#^(caj^LT^igM#(/)#%# SrfT o tz-o *9 t. t tbfco j£-?u'y F L, &W^(D%}&}MiM1t&\xl 5 tz. &6(-8^ 27 0 3 v y ”— v

TA^iiifco 5Fj5£l2¥2^ 22 0 , 2212i@(7)^^ScDNA^ cr - > ^ $ gd £'J t# $ £ S^X - 9 "<“ * "C^)6 0 *DNAir-f/<>f (DDBJ) (ddkMLfco

To promote the project, the technology committee was established. The committee consists of seven members. Director of the committee is assistant professor Sugano, Institute of Medical Science, University of Tokyo. During the fiscal year, the technology committee meetings of all 12 companies were held twice, usual technology committee meetings were held five times, and sub-committee meetings were held twelve times to mission complete. Collaborative research agreements were made between RAB (Research Association for Biotechnology) and Institute of Medical Science, University of Tokyo, Helix Research Institute, Inc., Kazusa DNA Research Institute, and Graduate School of biological Sciences, Nara Institute of Science and Technology. R&D information was collected by attending scientific meetings and international symposiums, and technical information was exchanged. Member of the consortium was 33 at the end of March, 2000. The information collected by those attendees and the project results was summarized. A consortium was organized to utilize results of this project on 27th August, 1999, as intellectual property rights such as patents will be protected. Full-length cDNA clones of 2212 human genes were prepared and the base sequences were determined. These sequences were published in the DNA Data Bank of Japan (DDBJ; an international database located in the National Institute of Genetics) on February 22, 2000.

-28- i.i4

tStolBSlStto

3CmE2l0T T194-8533 3C3fCfWEBmM0T3-6-6

T 771-0192 SSW@Srfl)Hl*|IITj)ll*aSf 463-10

W^F-E

T 257-0031 #$)IIWSSr1?Eg228

siSSgR T164-8555 %RS43EKS41$f2-7-14

3-3. —y /? (#)

T 277-0005 TH*#TStt^'l,fS367-2

T185-8601 SRS@^erH$^^Sl-280

n-w?i®

T312-8504 ^ m % & fr fU rff T 882

WBSfO-'l'IX'XxXxAX' DNAliWS T185-8601 *S:$E»eS*S»iai-280 «S)B3t«fE@r4I*E%BrlS

xXXh U T312-8504 *tti»U'7tc'fe&A>rUTffH&882 W B 0rMI2§**B&ft

0 im###

T123-8585 lM$Si£EBSf#*eTl-18-l

-29- T525-0055 Sgfg§##2257

T 240-0005 #^r#^±X#E#FBT 134 t: X A— X 'i — X h X 7 — 14F

(#y< >> • nx

T292-0812 f ##7KM#r#^#1698 + >-fe>X“ 3.4#' )(']#*% T 448-8650 #^0#)('J#T#A$fKr5-5O

101-0002 =P##^#m5-2-ll #^#2bf;i/ m-mmm mmmnm T134-8630 SSCfBEFJHE^S @1-16-13

■Mm%m 292-0814

—30— i.i5 BFstsasKe

& ^ Mum • s«« E%J*B [#*%#!##] h a tii *«E%0r ^^ScDNaE^'J^S BB g«E5ESr ittW^E* &txa€5©g#®#ig^fl'i tfjn @ ift«W?Wi E%« si* Mm SSff%9f E%M *1* IE til E%» Men M ggESSffi E3EM 45 *f ggEFESI E%» #0 % $«E?50r E?EM ;n% j*«E?E0r E3SM 1% ^ fnft g«E%m sum WEB SSESERr S|g« gSESESr SlEtt

/MB gf * mW5E» SIB8$ ftMte. 6AH g«E?Sf3t Stgtt mm iff* jKmESESt #%#(#*) mm # gSESEffi E?EM(*K) *m 5 g«E5E0? E3E*(«ft)

^±ftcDNAE5i|tt56KO?a e± it ±#GEN#^Bf 0f§ Gf #| mm A ±#GEN#^Mff

/j'» g# *«GenEFErH E%* CWV-S] «• wmxESEm-mft 7cA#cDNA?IJ#t% Llj#IE% A'mmxESE9r-£tt A--WmxESEffi-±tt A* IWimEFEBt 883#? A'-fMi/XE%0r-£tt.

^SScDNAE^IfteSDt # 3# V')i-y3/*®V «yXfA86ESft8 A®M% ## EJa V'Jj-y3>*S/V^yXTA^ V'Ji-yay^^AMtyXrA^

—31 — & £ am - ft— M V'Ji-y3>*gBA' ^-yXfASBS LilP 9S V'Ji-ygy^^^^V-^V'Ji-ygy^

[X——t v 7 ]

Pimmi ###tff^cDNA \P E^m x — 7 ^ — 7 fF np #5%m /hS «EE^il Am a# mmmu

Uj* #A mzzm^ m$cM%m #* m E^m [(WB*»ffr9?J £6cDNASS6B£lJ£to #A ^9km%mx#m##g; A#A 7+ffx>xiii»Mi It IE# n-sw Amm 7f 7tK%>A#m; i$HM«j| IA ms 7f 7iff X>ASS^ if 7f 7+h%##; m b #± 7f 7 Iff %> mw A 7f 7tHx>XjtiI#IIM^*g& ## 7f 7-y-f x>A#m$#mm%*mm AJU 7f 7thf X77#m##^^%*^m S 7iEA#m##g;M%*g;m #A A 7f 77-f x>xitSSSSE^A^S

##It rJ ^^E^ AfA^AxAE^^# Sill #A ^AE^E £{fE3£M pm ^AE^E

-32- R £ mm• e^b M mmE^mr AMR e# /J'l^ mmE^m mnm mm ^^mnm E^m *# s— **kw$cm AEE55R mm e^e^e AEE^R I'tm mmE^m E^R ## **E$Si?r AEE^SR DNA##' r EES® 5 BE DMA##! i AMR me DNA##1 i #a DNA##] r ^gfp ♦tin DNAM! r AEKM mn #R DNA mWi [ AMR MME DNA##1 [ AMR #BB DNA##'i AMR [iBu^X>XyXfAX] AficDNA<7)jS$gB?IJ8t£<7) S — DNA##^R E5% fX/ >Jlt-RA>^DNA#M?f^AEE^J ft AEE^R ## ®Z DNA##^

¥M m DNA##^ E2lR *A # DNA##r^ EtfrMR ma sm DNA^UfS E#^#R lumm#] Ip* # 20lf )\-T r i\-T')-r A6cDNA<7)S$gB^J&S E^r* fi — 201? )\~T E^R Em w$zmm>t- 2QW)\-T E%R ## m E^mm>* 201^ Jh7° E%R

^jn me E^M^W- 201#* )\-T E#R EUU L(7)^ 20%y A-7° E^R mein 201^ )h 7° E^R e* mm 201^ )h7° E^R $i® Em 20%y Jk7’ E%R f%m m# E^M^W- 201* A-7° E^R

E^M^W- 20ir Jh7° E^lR

-33- ft « mums ft is as 201)'»-7’ »%*

m;n 201)')k7" ffl%*

m#jii # 203)'*-?' *%m

203)'*-7' w%*

A# #? 203)' 1-7’ »5?M

[SMtifc)] ^ScDNA cdSSE^J#^

- cDNA#$q#:$U(D##, »ZR it ^7°^ KStir-t >7 — -fe>7—6 &£/-DNA7-f 7 07 k

±m m

/ w ft«%m £&«%* mm si mm mm

A'ft ft*x-f *7Hr >i'-iftWJEM ft# wz

*n «*

*t» *s- AJHrkS'-

m#

a* @e

m nx

OjT »5E «G;?«#r-t>5'-

fil/R JnL {—

AK ##ft y *?—

±bb

R# Sf

m#

mn *

&BB SEE

m# ##?

m

Wffl M#

ftf*3 %#

e#

*;n x« i*GT##r-b>5'-

□"ill #s ^t7x-r >7—

St BE 3

m# e±

UU* Mf

— A

-34- a & SrR • Stt« e%j*b

A*g fflj- /W:*E%0r M# S /"H :*E%Sr

SEB IE# ^"Xx/ TjJH zXX — rWOj IE# /'W ^XxV 7j iHr >X —

StK ^"Xx/ ;£UI/*fc>X — WEB % /W ^Xx^ rWS /'W ^-Xx/ *;i/-fe>^- TxHI lEfrJ A<^"Xx/ —

® A# A<^"Xx/ %)]/-£>& — m* $? /'W^Xx'f —

[% /W^Xx-r £})]/'&>'?'-

[BiL'jy h^x7X/y-7 U DMA 7 -fXnyi/'fe <£■£>* 7-f7lMx>XjtmS

5# m 7<7thX#mgB *» Stii 7<7thX#mgg @BR SB *s 7-r7-y-'fx>x#BS a* lUT & 7<7thX#mgG x-f 7lMx>x}tiiS ^rn 4=@ SB 7<7thX#mgg gg# A« % 7<7iEX#mgG 8BR E5SSB ±6E%* * E%a s@

#± * — m%±a»-x $* [WT'f->> ■ 3XfX»5Si] fiS8cDNA®SSEE9i$ UU* »5EE%BB)V « • If iMfrS? e6SE%* S# $ S| « • If A° If I • fl? • Ef « • If iMftif £ttE9SM A# E%M%S;V # • If E5EE%S;V « • If S *$ W3iH^gfA' # • If Mttm mnM. # • If

l*8$E%Sr £ffi«?cM ®E%E% 3AA E^Ammma m*

-35- ft 4S mm■®#* #IXM»?Ei?T ±ttW3SM f # ## iiJ*S$»%9f gf# imxmwftm pm [emis-semi iXglJ ftS e® esse m# yy Ad¥ff«%as iwss Mm MX yy AJStfWSSSC HISS ULllW -yy AfWtfE%ee w%*

-36- %2m

2. i tin

#(:c Mwit, *kfitn¥(Dm&bLxtz:i'tx-te<' m#±-et#f \ft±.^bfrb, LTl^ 5o jit h^'/A <7)E?U&££2003¥1:X:1^7£-ti:5£^LTVdo ^StD^y^-r-^SCelera Genomics#

HD^y AE?iJ

^ij-e^do t h^yv«k#t>^T^6o l frb^mmmwtsMz.QimkcDwmteznip ’rfcmmmxh'Q. ^<(om^x-\mt^(Dm^ Mwtstix\,'fa\ xm&(D-'Hx 7z±temmf$i&zz ^Scdna^^j;< \^m:X%tt^m-f bti% 0 m4m^7 www±b #M«Ltz*V #$7 < ^AAcDNA^m#-e# 5o t-tc^&cDNAvXX? V

?'-&mi'xftm-i-z> tcdna^-t9 , ^^-e#6. Efwmie) t h^v y y-y)f##±6^ J: ? X 6 (D 7')V-X. y ? xmizmxtt. *'J 3'^r-t 'yXfeXftmZtitiZMmocDKAyXXy V-(DMtitftte SftT&^x rn^(DcDNAy^y'y V-(c:ti#SE z «b * #a a\ ^ ^

*w?zmiiB-cfi, Ef«-enxs$ii/ccDNAy n-y^)W^fLx mm^±^xh6^mmm^^ti zmmnm&mm lt^mse^jzswommfcfeltc0 $ 6<9 mtrmef#

/:o H^AAcDNA(!:^(Df##(j:^

-37 2. 2

(l) #### EfWTti, d>-Y -yycDNA7^7"7 V £n/i^ft(7)cDNA^ n - y &UM £ +1

^ScDNA(Diisu^fTV\ e+w^-a (mwmz#*^#x *^++^%

^) LX%34,000MM<7)cDNAy xi'—y(full insertion sequences; FIS) %:;&:tL~$'6 Z + <+ L/c0 y-(t, V'-fftfct: «Jf;lit t F^y A#^ftyy —y)^##% mw^iisE, mi) o rftib<7)cDNA7^yy y —AS(7)^1+77,677y n —y <+ L fc0 il&^+lfccDNAy n-yft, E^J%^#(7)^ F^yxi^L-^yT^yL, -e ii^n^E^J^^a^dkSl-SBS-t6 ^ + £ Lfdo

mi) *9F^E^^^#^L^cDNAy D-V(7)6S

y n — y y^yyy-6^ #yo—y# t mg urn# ADXX 4>ftW 4,154 t FTSibEft&M CAE f ## 1,367 t CAS [+++# 9,331 t F 1E^±# COL [#+# 6,593 HepG2?M HEP 4=tt» 15,528 HRC W» 8,784 fc F/J'E HSI t+tflf 7,444 t HUV 835 KatoIII## KAT (+++# 8,839 t F# LNG 4=t-t» 7,306 t F#i&f&mm#±^ REC C+t'+W 816 t F lE^y:S COLF 3,498 t hjESlHl Siting KAIA ### 3,182 3-ft 77,677

(2) cDNA y o —^(7)5'^%E^l&mv^#ei+++# E##^6##^^(+^:cDNAy D-y(7)5'^7)^(7) l y"7-ADynaclust£rffl VT, cDNAy n —y, S'^CgE^y — ^ 7 —coft^gE

Mt LxmiRLtz.0 mmwx\**w$zrmmmnm*\^'xb 31 ^e^Mcdna ^ d -xdkp fTbiLTWc/^*, Sf^^cDNA^ D —ycDga^lJ^^nboT^fcS-a ii:(ts %\tzt^Dt>^fcgE^lJ W "C

E^IJim±

7k 6E^j^6&6

o) mmmfcx-£=i- Fi-z^mfecom' cdka? v-y&5gE?Uc7)^&£ {±j LTy-- y£ffr&Lfc0 rixP)0T“*{3\ n — #(7)% — s<—t Ffza ^IT Dynaclust £: ffl l \ #cDNA^ n — y (D5'^%gE^j ^ ^ V-t LT, *j"f- 5 BLASTS $?!£?? I \ Effi^l x lO^TTL y h LtzM& Z

(4) K-td^ttOi^V^cDNA^ n-ytDil^B ^(DATG(c#L-cmmM^ v>bL^^wm-t^xu^y ATranStart^# 6 ^H^LTVVc0 -£ r -CTranStart£ V > T#cDNAd7 a - y(D5'^%gE^lJSrWfU e3- Kf-e^rntt^m^cDNA^ n-y(7)#mi-6z ^mt^X Z> '> 2± b L -C##TV'7tfd^^c (k £x EfW(-^V^T(tEELTSrScDNA^ D-y(7)it%S/5^TbiLTV>^r

i^ibx x

(5) Hy y^7^/yW&CZ6y 3 y h^yy^yy U-of## GPS-1 Genome Priming System (NEBtiSD &m\ ###^TcDNA^ o —y(c h y yy^/y (J^TTn^^E) mtUX^'no/c„ Tn#-^f$^(ipGPSl.l (KanIt^-7-^7-) Tn^fi

-39- #cDNA^D-y(COV^T.

H16? ct-y

(6) mm-?7X * KDNA(7)|SIM cDNA^ p —y^i^#-f-d^fllSSro^#:^2ml(D2xYTiStii!^rAiLfc96y\7::V—y^^/uyi/ — H^SLx —«9 x -7/kf-y ^ 9 —>96-^71/ - 7 h y — '>3 yy y — h uy^rttSi) ^fflWcyy x 5: Ktitt&^yy-^yBioMei^ooo^fflvTSiti fkL&yyyA, (Pi-50) y-yy^^iAo# KDNA(D#m^fT^yho Kd,

(7) 8#y-f cDNAC05'5k^ £ 3'*#^ 6 <7)tkSE?IJ£te(C ti, y — A y * -fy y t L TME761FW (5'fJ, 5'- TGT GGG AGG TTT TTT CTC TA -3') $ A(tME1250RV (3'#L 5'- TAG GGA AGT GTT ACT TCT GC-3') &rmv^o -Ax ^WmTn^m^^-C^L/bygy h^yy^yyy-cDE^J^-C Itx Primer N (5’- ACT TTA TTG TCA TAG TTT AGA TCT ATT TTG -3') £ A ttPrimer S (5'- ATA ATC CTT AAA AAC TCC ATT TCC ACC CCT -3') Tnjf ASBffci^ Tn#Ad7 p —y(t x ii$A y ?'AIZ16? p — y &mtk Lt k° 5/ ^ T y y Lx Eyi&^l-i&Lfco i“^^^#cDNA^ P — y{^o#x ^f 5'^3'A^E^i(CTnf# A^ P — y ^ ^ # b ti 5 32 A (7) y ■- y y y £ ip x. tz & f+34 A <£> E?'J £ E?'J & £ <£> tz g> <£> T-fer y A' y y y tc f& L/b. (6)&tfiMLtc0 y /i4li®*fi&tmSephadex G50 (7 7^yTtt») £ffllAc0 ZTSSSEyj^^LA 0

(8) «fc5J£&E?iJfc^ Phred/Phrap (9 yy fyA#) "CT-feyA>^fToZcS^(^(t x E^JM^of£v^^(7)^SE^i?rAutofinish (yyyfyA^) em V^THxot L/c0 —Ax Sequencher (Gene Codes) "CTir y AA'AIto fzWaoi^iix ^ yf -f 7E^J^ yy&6M±A#W'EWsm#-CV'%t'm#(ccV'-Cx (CLHybsimulator (AGCT Inc.) t Amplify (y>f 7nyyy^) V 'Xzfy 4 —£rlxf+ L tz0 y -yyyyyA-7-(±x #$^18-20m&m^x GC'al:^40~65%fMx Tm^55~60°Ci: A 6 «t 9 C^:#fLx

(9 ) mmEfijoT'tyyy

ABI377 66by —yy-y —A^SE^IJ(7)t-f (A Phred/PhrapjoZI/Sequenchert'T7 ty7'/^|fo/: 0 Phred/Phrap^fflV^/cTy yy" 9 yyy'(i x E^lJ^99.99%^± (Phrap40) && Lfz0 -Ax Sequencher Aery y A y y^fli, [^#(7)E^J^

-40- T-f&g -mSSLT#^A/

(10) In Silico-C(7)cDNAA&gaflJ(D##f ^r^^tl/ccDNA^ n —y^^gfl^Ji^^L, GenBankjo «LL^GENESEQN-r-7XfabfML /cSt^KDmRNAgE^J^T —^rBLASTN (GCG-ver.10) VT, 3=t

GenPept, SwissProt, GENESEQP t - 7 fM L E 6 ffT 5: 7 Ega^JtDT 2'—7 ^<—X GENCORE FrameSearch (GENCORE Frame_N2P ver. 4.5) £ tz. ft BLAST? (GCG-ver.10) Srffll'Tff ofco SPCIAS ^DNA#) BlRp-^y 7l/-A'>7 h^ — <7)ftf#f£ffofc0 £ 6(^&££;h,/tcDNA7 P — ytD^gaW^SHBET*® yEE^iJtcmL-C, Signal? (f y^-fZ#*) SrfflV Vc'> TTh/^ga jo Z t/SOSui (^#%±) V V::EHiMW)Zi|iJ£fTo tza

2. 3 HRISmfc'

£7n-y£ili£U #E^J^%^A#(cE#i-6f##. 2)^2,0007 p-y( 7)A#E^m^#, 3)^^$jT,/ccDNA(DA#E^J(D##, 4)-^^D“70iW, 5) cDNA##^#yXT A<&r%^ Lfc0 L/c9f%

(i) AAga^'j 5cDNA7 d —ydoiiBU !B$i6ez-7 D-y^6«^5iifcEf4mW7)7M; V-(D5'7kmiJ(7)^S # (Eim^ Ky^-at^y n-ycDit^) tz 50~70%BS£WS£;h,Ti'5o j;9 ^^m-c^^jT^cDNAy-ryy ^ocDNA^zyy 19 t>{g;v\ b t (^^^ScDNA^W#^^^

15ZJ (c#AA7 p-y^mjT(c< ^7 p-y^#<^$4bTV\5o $^m&(DrnRNA##im#T(j: y/ADNAie^y n—y^#^RNA(DzmA&7cA(-#(t5^ <^#LV\ 7" 7 ADNAl#^^ n —y <&r3- KLTV^^m#^i#S< x y F p y^

£;ftTiy5o Sfc7774 yyy'^rSttrw^v^^fAiRNAtiScoy d —yrity y h p y^oti^

^;h,TV'6mRNAE?ii(DT-?-kyF

-41- cDNA£il&-f5r ^ VTcDNA ^ n - y (7)5'5k^E^J £ B¥#fU r^ScDNA^L^j (DWflfiSrfroT^ n — ySriigrfSr cDNAy D-y^rilSU L/t0 iSfK-f"dWl^ti60i^V^cDNAy n —y(D##(l, L X Tran Start 5r ffl Wc 0 ##%(C677,677y n — >#29,630(7)^ 7^^ — (#) £ tl/cy :0^^l y 7^2 — htz. <9 iy D-yfo, E+8,l43SE*9, r-eii&^iifcy v — y

0f(^wy n —y^e>il&£iiTV5cDNAy p —y ^SWUESrffo/Co ##%(cW+4,95?y n —

u. ^-bb^se ms£s ^60tc b/c 9 bb^l^ (m) „ sb y n-y'MfcCiffl V6 ft/c5l^6B?iJ

-42- Ef-W'C&S^tl/ccDNA^ u~-'y (Dffift t~f (Dynaclust) / ft ^ gE # % (BLASTN) (TranStart7^ if) 1 - m# (E## - ##%##) i ? n — y(Dt° y ?T y ~f / V n — ygfiTn 1 I I cDNAf D —

(Ell) ^

(#2) ^ p — y ______f 7 7. f — No iHt£ 0 »#g 1 05.02 36,765 9,936 (1) 9,936 1,682 GS (a) 2 05,11 43.492 2 07.27 36.765 12.148 (2) 10.797 2.030 GS36.2. UH-cds #84 (h) 4 08.1 1 6.727 3.323 2.153 265 GS36.4. UH-cds#88 5 10.20 15.292 5.136 2.851 590 GS37.3, UH-cds#96 6 12.17a 18,940 7,427 (3) 3,893 1,216 GS-cds 38., UH-cds#102 (c) 12.17b 35,294 14,625 (4) 2,293 GS-cds 38., UH-cds#l02 (c) 12.17c 14.610 66 f5) GS-cds 38.. UH-cds#102 (c) -6-S+ 77,677 29,630 8,143 2.6. #&#=10.5%

(I) mnmv @E*n*e^l!fc£&a;asifi £ ft fcffiJW. (2) 5/11 ?)####(DK?lJ#d^COLF,KAIA(DBa?ll&#V'7b b 60o (c) GENESEQg^SE?lJ<7) 5 ^cds^^o b

-43- 49I2)^a%^,E?fi^P@%¥,S^I%A£^

^MdM ^ "j_o#m#^¥^^G)-4/:¥##T/44 3mm v£]3-^WPl--n»4fi imm °;m3cwpg:Limw (a^^m^d^) ^ m%nm “--i^^mm^ax-nc — * 4VNCP (C)

AS 6> %0I £W‘8 LL9'LL occ %Zl ZL£ Z2V£ VIVX WW«@S3M ^ ZSl %6 l££ 86fr‘£ J103 S¥^3H l ££ %8 99 918 33tl $fTamma%8 4 q AIS %6 6L9 90 CL ONI W4 l 96P %0l 168 6£8‘8 1VX ^SIIIIoibx 9£ %0I 98 S£8 AQH se#^wm#^M4 i 9PZ %t7 6l£ ISH ilhM 1 SSP %6 Z9L P2L‘2 3HH SA l‘l %ll 0AA‘l 8CS‘Sl dHH ^tffflf^OddH

698 %IC L9£‘ I £69‘9 103 f8¥$3H 3 681 %8 09L l££‘6 SV3 Bg#g^dim#m41

9 L %6 ZZl A9£‘l 3V3 W^Mlil4 i ££P %Sl 6£9 tz£l‘tz xxav #m##4i

^##P@ ### -&■§# — Cl 4; y 4z

^^0-6 44/4R"#<-a^VNa3^^$^m (t^^)

AS6> eon 6Sfr P9 L'\ ISI 8frS C££ 4S-^«fttPS 6iri SP£ 681 09P cc ost OS 9S£‘ l £89 96 £L£ 8 C£l P9 *»»¥

ZS£‘Z ZL9 PL\ l £6 ICl 99C 81Z

+#^ 9 9 P £ £ Z 1 °N^E/t-n ^ L\l\ p/\ i SC/8 81/A SI/A Cl/S QSPI y 09 §05 y 0t §0£ y 0c y 0i S5S

t (m) Z^^K^^-a±yyy-7-^^#i-L-CE^ £r£tou ^tL^^^Dx-T^ftBE^iJ^r^tic^-tir/Cc -en^^^^tilB^PWcD^TcDNAcD^SE ^E^j&^-eM^&pwcT)^n—'s^tz.*)co^mmmt\.4mur -v-3*BS) tz'Dtz. ms) o Tnm(D^TAA@a^J^^:^-e#^:ej (g]2)

ms) Tn&(CZ y g yh^yy^^y !)-f^#(7)^# ^ c —y#: ttm %m Ltz. 9 n — y<7)## 1190 100% y g y h^/yy^ yy D o-y 1143 96% y g v h^y&(7)A#E^j^^(7)0;# f a — y# itm yg^/h^yy^yy!) -^^$1%?#/::^ o-y 1143 100% yg ^ h^yy-^y^^lf-rA^E^U^^^L^^n-y 418 37% ug^y^y -7 — ^-yy-cA^E^u^^^ Lyh^ n —y 433 38% 177 15% 3[n]]^J:07P7-/f ~ ft ;j— ^y/^K'lffof:^ P-y 115 10%

^SE^^^BE^'o^cDNA^ D-y <1: L*m, 1) y-^y^EfS^ii^-Cit £oT L£ 9 b 2) ±#%#i9mL#mdS#$;h,T^5 t)(D mi%) x 3) #mcAiy#UAEW#)y n-y (^J0.5%) ft £'&%$#) btlfr0 y-^y^El^^^f-C±^6m@^ LTCL GC# l:/)W'E?ik "9 JM-l'EfL mv# «9 %g L@a^ij^#V^ T^V ^5EfU DMSO^n^^dGTPEJt>^^ LT^SSa^JtD^^^rM Atz.tK -%

^-t-r^te^smztitct'#mLtz0 temvMLmm^titiz>m&tt'£r7±>-7'Mm Lv\±, Tnm-eE^%^^L^#^ic

(i, TnWAum'bm*)m lw£<7XMM#£*eii-cztz-tfrb, ^tz.bob&'itc (BI4) o $/cHSIl 1686 Tit, <7)EM^ n— y (7)214# B <7)£ESTA£C t°“^ ^r^LTWct)(7)^x Tnf#A^ d —ymA(7)^ n — y bC(D? p — y(;:2#ffcLfc (EI5) . Z(7)C^^^E##-cm#^^-CV^eA-C, ±##(7)^T^m^^LTV^t(7)«!:# x.bMz. 0 Z(7)#^(C(t, #^](7)tf-^(7)lt$A^6C##fL^o

—45— Ho. 1 Ho. 11 Ho. 15 No. 16 Ho. 14 Ho. 4

OFF JOE a >

C0L02253. 2047bp. 406aa

COL01871 (3371 bp, 739aa) *• 7, j ------y 1------V ^ T 1 i ------52L 4------—' | h5 yxrf'Jy 4----- % —; ! y a v h)} y 1= ^ , 1 v . i y-^r y a 1 1 ,_____5Z_____, i 1 1 'f&—1 1 1 ', • ' y 1 1 1 <--- 'y-^yx 1 1 szjtm i eofl S6d I.JKd i.«d i.sii j.fi a 3, iba 3-ifd ;uo 4»a *xa S.TJH *31 d 4*43 — i i inf^ 14 * -----M-4------W—------1 t-iri ■Ht=------f 1 -f^jA—-^F m~i in il l iiii'— — 1 1 1---- 1 ttjjHH +-• r^m r r 1 -rrr n i ff in F f mTllr #1*lr ffn mi iri H 1 Ii iii i ii ll niim -rn— 11 ii ii i" Fff— n I IllS 1 Tt 1 i 1 —" 1 TWTfri'll HTIIH iFTffH—¥=i ii i H ii iiTHiri ini# f ~ rFr ~=f=F=l I All l1 “r

(EI3) Tn&&PW&(D#mTcDNA(0*egEM^*f TnlO TrJD9

CWH> IfB

t- }?AfSSK8^' yjiic|£?i|®□. =.>y h s^S'l

(279/282tg®^-0[)

(B4)

47 (3) cDNAi^iESE^lJcDftW "f^b^E6@<7)S ZZT(i#^#^#^$-eKAT(D##f^#TLyc 3,754^ n — i) cDNAE^lJfttDStff 7^yy ]J — ZT S(t 6 j: o TcDNA^ n - y o TV'fc (IH6) 0 #t^V^(D^HepG2#EA^^ o —y (HEP)

(#6) y A y y V — Zf

JW# L/i 7-ryy D- Efm^agdk# □ —>-©|g# (bp) t MgKiMS ADXX xmmm 253 1,476 CAE 71 2,022 CAS 151 1,512 t hiES^Ii COL BW%@ 779 2,317 HepG2>M HEP mn%& 724 2,560 t b"#^#±^#aE HRC 422 1,828 t b/H§ HSI 237 1,506 HUV *»»* 27 2,097 KatoIH#E KAT 413 1,583 t Hffi LNG 317 2,167 REC 25 1,560 t MES’^S COLE 148 1,873 t h EASE ting KAIA ^F —S 187 2,377 (bp) 3,754 2,048 D — 3'#;%(D# V Ay F V y L/to

-49- ii) Z%^L/ccDNA7P —yo^Ef'J/PL, ATGT^^O^ K>-C#t»680aaJ^±(D#m# ^r(7)yo->/(D[#T#±#(Dt(D#^m^T^y^E^J^Ll:#^L^o £170/^1+3,754 7 u — V(D 9 ^2,9427 n —y (78%) T:'E (7)S*$l£#l/c-f-fMSlRT ^ / EBd^J (ADXX) T#70%£T(E]o/c (#7) o

(#7) 80aaa±(7)fg)tBET ^ /^E?iJ£r^cDNAy o —y

L/c >=80aa(7)#^mme^J 7^77 V y 0 — ycDNA7 d -y-c^5E^#L/b. E#^ 1 x Kt5^D-y^lt/: 0 E7'J(a+:, %y7^yy7^#^5/ E #;t hti5 0 £ Mfi?iJ£$:)6L/ccDNA7 n —y<7) 9 t?s SOaaHiKDUvElHlRT X 7 e£E?iJ^fga6 btltz b

-51- (W8) St£flmRNASfiy|j£0'7~'— 7 ^-510

7 p — y >=80aa BLASTND&nifcl y-r^y 9- mm 7 p —y#c BtiaK?iJ mmmm t b flg toSi.^ ADXX 253 150 24 (16%) 72 (48%) 54(36%)

t HW fN& tM CAE 71 63 6 (10%) 38 (60%) 19(30%) t bm#m##mm^ CAS 151 122 23 (19%) 49 (40%) 50(41%) t b lESTcB COL 779 553 44 (8%) 287 (52%) 222(40%) HepG2^B/!S HEP 724 611 56 (9%) 418 (68%) 137(22%) t b HRC 422 358 72 (20%) 192 (54%) 94(26%) t b /H# HSI 237 173 26 (15%) 89 (51%) 58(34%) HUV 27 24 2 (8%) 13 (54%) 9(38%) KatoIIIiM KAT 413 357 51 (14%) 211 (59%) 95(27%) t bW LNG 317 237 6 (3%) 127 (54%) 104(44%) th#&em*#±A REC 25 21 4 (19%) 10 (48%) 7(33%) t biES^S COLE 148 130 15 (12%) 63 (48%) 52(40%) t hEffi-msttS KAIA 187 143 7 (5%) 82 (57%) 54(38%) £8+ 3,754 2,942 336 (1 1%) 1,651 (56%) 955(32%) iv) cDNAb 7)T#J fr-rZDKAWftffitmmizspciAs%m\ y^wa^yy-Ayy b^wr^tto fcQ ii)T#mL^:80aaJ^±(D#$(D#^^T ^ /^Kf'j&^acDNAf n —y(c#L, SERp~ y y yy##?#^fro^ n-y(7)it#^#;+Lyc^ C5. 200aa^ ±(D#^(DT ^ y^SE^J"efciuf8o%a±co^ p — y ifclx Hcififli (im?) . #%^^T#J$^L^V^7 cz —yoitW^ZnoTb,, L^^LSPCIAS^ffl WdWW"C(1100aaiUT7)Mv^ - K#%eiEmrT#li-5Z6^#LV^V^ - ^DNA#%^f • A'WM±frb

(76%) £ ii) -e^ L^80aa^±(DSSEET ^ 7 EE7IJ ^11, V/c;*.^:: 7 p - y £ l$13:|i?l CWJ^tf L/c0 t/^u 80aai^±(7)ft^^r ^yMByiJcbSi^p-xy yy'«(7)fS]^^Mai^tL/c:y n- y(±2,617yp —y (70%) (T-ft^-f) o i6%(/)y p-y-ef±T#j^^iybm^3-Ty yy#%^ 2c>j^±(D-i:yy y £/ci4%<7)7 p —y-Cbiy y~2^yy b<7)«JHEt£^^£tifc (^9) 0 ##%Nix y y —Ayy b ^y y°y/f y y b p ycoEWEDNAE^ftTvs^SEte t)^fW£ii/c0 r. ^^p:y-(±M,^yc$a^X,^^o -cmmL^Eywm^!)^io%J^±t)y-y-yyp:y-^$)6^(l#yE<

-52- s % >.. XI w X SB #5- I I ^ 9 d X X % T » X ^ M 8*- t 38 SE |$ 04 Op^* m m $ rv ^ 4i 22 X ov M: ^ 0 $ ^ rtr« # y- ft $ a 0 d 0 M rV >66 Zx , a # # 22 C0^-cu?--/'LfSOfUsW

T * ^ fV ft 6 VNO^-Xi*

: r z nr a £ ' 4

M" 2 '% £ i ! a

>|f ^

- % n r iP [ d a V r\4 M 5

< ft -B V fi a* aS % 5 0 8 i d S3 r BB \r r< t ? r r Ky 3* 22 s? * X xl cr d n a rt X> a % ^ m V <- ^j4

OHD) xi ^ r It 04 V Ai d cf* z~ ( a z %' # XiflB!$EG)dOIS~DlVtoT» 'SW W W ffTtoW B^fT^^ElTTffEBlW TTTZlRZHr

A 03r ^ ( S 5

Xy IM #

XI m Si V H ( >4 xli --/'tfgfiBisP 2 S $ ^ H l (#9 )

7 n — y HEffl*# *8 #?«##«y y — A y 7 b 7-ryy v- mm 7 o —ym 7 n — y ?»#] 7 o — y k2 h flU ADXX 253 147 29 (20%) 21 (14%) CAE 71 59 9 (15%) 1 (2%) b tmmm^mKmm CAS 151 128 8 (6%) 11 (9%) t ME$*» COL 779 508 72 (14%) 54 (11%) HepG2#^ HEP 724 604 88 (15%) 80 (13%) t hM#±##aa HRC 422 350 60 (17%) 44 (13%) fc b /J^# HSI 237 176 27 (15%) 25 (14%) b Hmamam&am HUV 27 26 5 (19%) 7 (27%) KatoIIlM KAT 413 374 35 (9%) 43 (11%) t h LNG 317 202 45 (22%) 40 (20%) b Ftteiafi® REC 25 21 2 (10%) 4 (19%) t HE£*B COLE 148 124 29 (23%) 23 (19%) b hjE&BUStolE KAIA 187 144 39 (27%) 38 (26%) s-n- 3,754 2,863 448 (16%) 391 (14%) SPCIAS£ffli>Ti^SJ n-yt7ix —Ay7 — >

(m\0) 7l/“A'>7h^fiM§ixfc^D-y^#f

7 y —A y 7 b E£iJ^ cDNAE?lj&###f L/bMm C-COLO1887 189 naii&L C-COL04151 1,227 G -> AtoSSSS C-COL05258 570 586%? 7:^ (-lbp) C-COL05623 471 430 - 435 7:7^ 473%:'7:7c (-7bp) C-COL05962 517 518%* 7:7y 450 K 3 bp#A (+2bp) C-COL06078 470 470%? 7:^ (-lbp)

C-COL0739 1 1,542 Pp^M^L C-COL08 164 149 149#i&%?#y'#A C-COL08657 697 691%:' 7:£ (-lbp) C-COL09356 587 581-2%:' 7;£ (-2bp) SPClAS$rmv^-C7 lx-Ay 7

-54- BLASTN^rffl E y h L/ccDNA^ n —y(D 9 H UE^O^SW^T ^ / - ^X 1^-fdFrameSearch (GENCORE Frame_N2P ver. 4.5) WcfBiWfiW"?' H)-(^E y F /mE^J(D H **~y) ^(DiUB^cD^giEE^U r (D ^ n — y (DFrameSearch-ecD T ?Xbffi

9HH^<£oTH^3bH St)i^^^^KatoIIlM6S (KAT) (DyXXyV-XU67%, FFEMSi ^ d —y^-eco^TiMyo-efco/c (#11) 0 vO y E mfeZtitzT* ymmm^mLX. Signal? (r^-7-^II4*) £ ffl t' * ->^'>/i/E^iJ(7)I^J&J: mosiii^mv^#R###(Dfmij^e^/c (#12) 0 80aaa±(D#^mmE^j^f m'j^ n/ccDNA^ n — >±fo(D\$%X\ is £ Hfz0 tztz LSignalP-milMlES 6 m^MLxtmcx o^ k -tscdna ^p-

xDmtznxd i>'pte^t>

(1G M 6 6«-efc6 $ ttfco

(#11)

7^yy V- E hflltoES ADXX 253 6 4 (67%) 2 (33%) t CAE 71 10 4 (40%) 6 (60%) t hmshm^-mamm CAS 151 25 9 (36%) 16 (64%) E HE#*# COL 779 97 37 (38%) 60 (62%) HepG2|fflJ|& HEP 724 55 19 (35%) 36 (65%) HRC 422 60 25 (42%) 35 (58%) E h *# HSI 237 20 7 (35%) 13 (65%) t HUV 27 4 0 (0%) 4 (100%) KatolII#^ KAT 413 72 48 (67%) 24 (33%) E Hffi LNG 317 11 4 (36%) 7 (64%) E h REC 25 0 0 (-) 0 (-) E HE#*# COLF 148 26 14 (54%) 12 (46%) E HE#@#ti)E KAIA 187 17 6 (35%) 1 1 (65%) ei+ 3,754 403 177 (44%) 226 (56%)

-55- (mi2) z/t

y-r^y v- >=80aa mt t hmtoE^ ADXX 253 150 27 (18%) 28 (19%) t hSbEft&EflS CAE 71 63 6 (10%) 15 (24%) t hjSBJlWrtffiifflJia CAS 151 122 9 (7%) 14 (11%) t MESAS COL 779 553 70 (13%) 128 (23%) HepG2#ga HEP 724 611 45 (7%) 111 (18%) t HRC 422 358 44 (12%) 91 (25%) t MES HSI 237 173 19 (11%) 23 (13%) t HW$iMm&iWia HUV 27 24 0 (0%) 0 (0%) KatoIIllfflJja KAT 413 357 41 (11%) 73 (20%) t hM LNG 317 237 16 (7%) 43 (18%) t EWia^mm#A^ REC 25 21 1 (5%) 0 (0%) fc MESAS COLE 148 130 8 (6%) 25 (19%) fc MES0S&S KAIA 187 143 12 (8%) 38 (27%) •s-n- 3.754 2,942 298 (10%) 589 (20%)

(4)

Vv otf>OcDNA* n-y^JE, Wd'^iiAo SAAffl^^ffllVccDNAy n —yii^T, COSEA&&

A (#&*#!) o

(5) ^ n —y^Stictl/cDNA^ u — >(D;<>?lk p-y£6SU (£WLA0 R?'l&^(D^&^i|X#L;W)NAf n —y

(6) h^^T(D^%mA £9^ W,t}Ltz0 4'y^—^v b3o ckTM y hy^y bT'^Jffi-C^ 5 r t £ 0 tf)b LT, 4 y? — y a^—y ^LTWEBy^^'f-^mv^yy^Ty b/f—/i-iy^^A^L/bo ^yo^^y b-ei^mm

±mmm^9'^-^z:t\z.^ y#- b lxm%.

'>7fA(t &£5;hAccDNA<7)£EyiJ£^HM L tz0 Z(D£ 5^'>^T-A^-Ctfcr ^ Jdct >9, L/hcDNAE^ij^mm^K## L, ^

2. 4 $aa&

677,677 y n — y (DcDNA(D5'^)%E^jt» ^ (C##T K-f"d oT^tt(7)i^V^cDNAy n —y(7M£# 8,143 y n-y£ii$U 3,950## (D cDNA(DA#EyiJ (FIS) $r%6A4CLybo — gl5G9cDNAy n — A&Eyj(D##^^:Xi^

?>, K-rsy n-y^ ^ - k l -c i ^ 5't&frti&b (Dbfo'itztiK ^2,600##(D#fm#^f 3 - K-t6 cbSbiLdcDNAy D-y(7)EyiJ^B^ fb^-?-

(CjoVT, b ^^8,000###^^^ ^ Z 6 iy WZnl^fm^ 6^4ci-5Z ^#-?#&###*# v\ zii,^(DcDNAyo-yiiA-c#m^yy-i:m^%A$^/y:^'C/<>yfk^^i:*3i9. ^

m#c-cvx^.

^&E^m^-e(±, (Tn&)

bEyij^#(iT^!ti^%7yyA^#A^^6^#t)^'C^6^\ AT^m^^V^# Wifol^fgtf) btlfzo y n —y(cZ^T(±cDNAEyiJ<7)#^/c#%^ff fr'QM'P LxffiA£fotzb(Dbfo'otz 0 t.tz, Afiff^r^^'Cffl^^tL/ccDNAy D — y<7)-

-57 cDNA? ^E?iJ&3E&#a — ^♦S^MSI-t'Sr lW-&C:,kdST#/bo

^ftcDNA^

IU>frcDNAy $ X'yX • • • - '-ryy^ • 3) - • • • 2) • • • • •

(False

1

(FalsePositive

5'^W>tb5#@ #%(DATG(DfM:#^5'^%X)^ ATG^ 5'E?'J(D#g(Di#sv'Ems3s< 5'Eyj(D#^^m<. cDNA^(Dmm-e^#, ^P^RNAtiSy ) ^y7t-7r^'7^yiasfii$ yy 5'E^ij^^v^^seyij^s^MS<, iTOLfcy <&wm?)5wm< 'MtkLtz? y

ORF^tf xn-yK gfc£P»fc^£r=*~-

-CcDNA n

A^f/ty — Negative

n-y^£Tii&-e£TV5

y###, Ky^^^Eyii^mK) —

(D P-y£V'5fc{t-eteft^ofc„

k&'VZtZo 7^yyy^(cz5i-mm# o n

y

(Cf6 t — — o

o

— LTStSX:'^ £-C7)ATG(eo^Tj$#fytfo/';:/-;:tf)S$£ilfr^o/';:y y^#p^RNAA^y y^^^oE^JcDxyy^ n

ff^^cDNA^ — E^J — Tnm^#%(^fT9Z«k-e, y^ii&L/ey ^ftEyU&^l-^&ir&oT

^ y

#A. KUTV^^n-y z y^7MSmRNAtc£S*tdy

la

3 £ .

^

o 5'E^IJ

^fcTn^rfflV'Sr.

#A, /egeyo £ KbTV^^jife-e^&^ofcy

y^^«k'^g#/b«k#t6^5y-^o LTLio/'ey

£.&&&!&&:%

"t 6 /£ -

yh.

t

. 5

500i£S.iyFW&lyy

m^^SiJ^-e#^75^^y--y £ 2 —

^ t /ey p~yjifet;itefr5fafi££a)Rg y> o

^ t>#;U::<

0 —

tests

-58- —

yy^x y-^yy (T#j#g[) (?##&)

y>

— ^l:ioT,

P-y $ 0 Ltz&

^ y

£ ) 0

v\

xn n/e

)

L^ofcy- V —

Ty d

's^k y

D ^z^

tcSr-Xo

- —

^ y^ii^L/cy- h-efcofcy-

* y(iS&##^^ je

0 = /^SDiSLE^J^'atp^ —

n

^

— ^L-CV &~f 0 0 y x

Lb^X — Q — *

0 7t —

L/e/cAte#^^ 0 y

(D? o

u-y^m 7

n

D yco - $:

5'^E^J-e#v^ga^j#^(D#^^mv^(D-e(i, ^=»-KL-cv^ct^jo^-f yo zfiw3$^ m^t^c/b

E^*Jfz0

l /nx y- >y y y ti^kclyr-^ M£fT 9 & V x jlji

MXlttei\ L^Lr (D l /n7y —^ry 7(:S^< n —y£ii$Lty< ±Xlt, X:

y n - yil&(-jo It 5 ffiM&vmfeV. 5'^SSd^J^^fctj-e(m^E^J^(7)^-ti^^J®r-e^^

i'b(Dbh^tz .0 xtut*V ^>-r yV'fevit—ftktfitecxmhy 4Vy V —cDfES^-tt^-rs'^^

LTcDNA(D5'^^(DE^JL^^^c,yb^\ —yTticDNA(D 3'^#tC^^##

#

cdna^ v-ym'M^m tn-k.btitz.0 *m^T%mvmi'tz.mmU

riL(m^ite^--e(7)W®-efc <9 , iztsitz^&mnz r £-e^< ^er-e *m&m%xtttLfa\iim^mm£\z-t%%±&cYMA?v-i'(Dm&iit^tz.0 1111^\m 100%(Dm4:#-e^A&^ n-y^#^A&^o-yeeBUi-6Z ^(1 ^mXhZ)0 ®t$nmfc^

t?teio%mgxh'i£i o ^tu;i*j-u l y n - y ^ £ntz ±&mt44%fzotz 0 jotf 6 ^ n -y#%f##-C(±, K?#ms#< L^^^E^iJ-e Lfrtei'5'M$iifr b

Z ^ tc j: D , Mite^-cb LXtm^titz.^ igr3 - Ki-5 (b@^6f o- y &##-#-&Z ^«b v^9

gsfW'cDNAy^yy Z6 t, ^#(7)##^ L-cm^r$)5 9o

yy K$tL5ite^-(DtfE(j:, t-tx> h & y^t^mm^RNAmmi^t lt^¥ £jx, &\*ll^tol'XX y Fnywli, jo j;m'^^(DCAP#i&(7){#[l^6 W-3'#^m^(7) #y AE^U

JojcBmRNA tte^ fcfi t-jiBS'iliil L £ tlX V #'/—J*\Z.£ <9

’g&ffcfrftfrtlZo L fztf'iX. tbttbti L /cRNA/^ 6 aK y AE^IJ ^Wf6RNA^##L^:

fit-, tMJ J:oTCAPfSiS^W-t5mRNAil^6tl(-cDNA^^/jjci-5 cb, E6R(7)£

mE^^3-Ki-5^^cDNA^#^%t-f#^jT,6Z«b^m#^fL6o LXi^L,

btelc L/ccDNAy D —ycD^t-tl, te'y AE^Jt-^^-f 6 cbSbiLdy O —y^-r y h n >^^At-I^^^TV^V^n-yj)@SA 6^^o ^(D±#^miS(D—^(bLT, mRNA(D##&(- jy(t 6Pm#^#^6ji,/:o #^(DK#-e(j:maaRRNA(7)^^#mi-5Z(b^

-59- ■et&V'o -er-e^P^lRNA'b'a^^RNA^ttmL/'cEt-^L ’ ^dT#y A £ V' TmRNA£ffiMX6 2 y/ A^-^#^RNA&^A(C%lp|^< Z «k(±#L< ^60 C(DZ 9 &RNASW£rlto t LTcDNA££*f-5 t, ^ n-y ^£ C5 ::

Ift^cDNA^^cB(7)-f--

mikm^mmtc^tz^ n-y 6bb^^5n, ue &^otl t.'itzM&b— %$X%$.tbbfrifc0 ^a^K/y-et±#(DcDNA(7)A#E^J##yo^]:^ h ^S^^ilTV^o i/3L? bfrb XfrlbO nmt-wcLXL^'Dfzbobh'Dti 0 s b—Wtbtchob&'ifZo

i

60 2. 5 SlfflXSt

(1) 1. Maruyama & Sugano, Gene 138 171-174 1994.

(2) • Dynaclust. (^'4 ~Y ^ M>) • BLAST ver.2.0 Altschul, S.F., Lipman, D.J. et.al. Nucleic Acids Res. 25:3389-3402. 1997. (http://www.ncbi.nlm.nih.gov/BLAST/ ) • NetStart A.G.Pederson & H.Nielsen. ISMB 5 226-233 1997. • TestCode J.W.Fickett Nucleic Acids Res. 10 5301-5318 1982. • MasudaScore Y.Iida & T.Masuda Nucreic Acids Res. 24 3313-3316 1996. • TranStart. E, MS • RepeatMasker. Phil Green (http://www.genome.washington.edu/UWGC/analysistools/phrap.htm ) • Cross match. Phil Green (http://www.genome.washington.edu/UWGC/analysistools/phrap.htm ) • Phred/Phrap/Consed. Phil Green (http://www.genome.washington.edu/UWGC/analysistools/phrap.htm ) • Autofinish. Phil Green (http://www.genome.washington.edu/UWGC/analysistools/phrap.htm ) • Sequencher ver.3.0. (Gene Codes Inc.) • Hybsimulator ver.3.0. (AGTC Inc.) • Amplify. Bill Engels (rM^n>-y • BLASTN. (GCG Package ver.10.) • BLASTP. (GCG Package ver.10.) • GENCORE FrameSearch (Compugen GENCORE Frame N2P ver. 4.5) • SPCIAS. /NK /StR (X,^$DNA#^Bf) • SOSui. (^#%±#)

• Signal?. Nielsen et al. (t>-7“ f %#^:)

(3) * • GenBank (http://www.ncbi.nlm.nih.gov/Genbank ) • GenPept (http://www.ncbi.nlm.nih.gov/ ) • UniGene Gregory D. Schuler J .Mol .Med 75(10),694-698 (http://www.ncbi.nlm.nih.gov/UniGene ) • GENESEQN (Derwent#) • GENESEQP (Derwent#) • SwissProt (http://expasy.hcuge.ch/sprot )

—61 — SE£6cDNAE9Jt*$&tfit£^

3. 1 1&m

bcDNAX'7 b (J^TFL-y n ^oc y birl&IEf 6) fi¥b£l\¥4B 1 0 «t <9 ¥$12^3^ 31 0 £ T<7)i^f^T\ 12ttJ; 9 &53y-Aj&s#n 77 y n — y (7) h: b ^i^ScDNAtDEy'Jz&Ati £f Er 9 T

j;A(D## ZigltotZo AT'It4,000 y n — >(D#f#^A&cDNA(D#iB^:^ E ayfa'-

X2^r n * (D^^gcDNA&^llUl.OOOy n — y COE^l $:£ £ B S Hfc„ a * te ^ }± (i,oooy n-y) ScDNA (500 y n — y ) (Dft 1,500 y o — y <7>1$bE£ 3 y fc° 3. — y — £ l' T i^S'J't' 5 r. #*$)6y&BmL/b. yd v32y bT£<^t±}£tidW*&B<7>t b^A^cDNAKiB-fa###^###^ -e^<, g#. m#, (Dkyb(c^(9#5 t)(D 6X9# LTV^6o

3. 2

k b^A#cDNA7-ry7 % - m~F, gf4f 7^7^ y -tmm-fZ>) (Y.Suzuki et.al.)^m^T#BlJ cdna y p — y (#n,oooy p-y) (D^i^SEyiJ^y 7Ty t-^y^'libi *9 & ^i-60 #6^,^Eyii7-y l±DNAy^:/^T(D^To 9 E«k4)KT^y@g Eyii(-^#Ly y/

(1) #m##

mu%m^M£ 0 (adxx ) , (huv ) , & (RFC) , lES/hStoS (KAIA) , jES/hS (HIS) x (CAS) , M (LNG) b^A&cDNA7-ry7!;-zn6#$R2 (^jo, Eb^A#cDNA7-ry7V-(D##joZU(^AA yn —y(7)#Kfb#Kci^T(±^2$^#R8^ )

(2) ^•^^IcDNAy yf - bli^y y — pME18SFL3(GenBank: accession no. AB009864) (D^)V^ y u y y"yy b ftODrallHb'T b(CACNNN J, GTG)(d^A^tlT^o <9 , 7 yy — b(7)5’iiij(De?ij^£y 6s(iy yib — beyo(DttmH-ix^^-bi/cyyy — ME774FW (5 -CTTCTGCTCTAAAAGCTGCG-3'), 3 ’ M *§&it 7 yib — —ME12I5RV (5-CGACCTGCAGCTCGAGCACA- (J^TT(TME774FW&r7ME1215RVy 7 ^ "7 —£^<£ £ — y 7 4 %) o 5’toSB^Jli — ^(D-<4 £ “SB£'J(GAATTCCTCGAGCACTGT), r yy£ “SB?iJ(tggcctactgg )(d^, 4 >-4 — y -4- b$mtW:< 0 a. y 7 ^ ^ KDNAOtotti ^ ; K £ "a tf £: # # f T 2 ml LB## (bacto-tryptone 10 g, bacto- yeast extract 5 g x NaCl 10 g / 1, pH7) (C y —#%## L, KSi (Genesis Workstation) l:t, SDS/7;^ y SlU ^7 7 ^ K DNA^#BB##i-6o HUli, #*$£3,000 rpm, lS^is-l' L, ±i^f^7T-'> 5 XiC j: 0 8#^i-6o lx ^ b £ 1 y £ ^ L^#, ^ lx y b (20 mM Tris-HCl, 10 mM EDTA, pH8)£80 plWDL, ^n>f7^£|T90 #; , S$2 (0.2 N NaOH, 1% SDS)£80 plMO L, fc0^ y X 4 4 £" £f?V\ iM2^#tt50 5 6 x #$3 (3.6 M Potassium acetate, 6 M Acetate, 60 pg/ml RNaseA)£80 pl'^stjn L, t°^ y£ 4 'y 7' £f? 0 o b“£/H80 pl£ ^ y >yr|±|!iMSNA7^;U^ —y lx— bO # £ ^ /L (C # L , 5 L, 150 plOBinding^$(8 M guanidine HCl)£^atfc: MSFBy?^^ yix — b(C^lt6o MSFBP^OS$£B^L, 1 3IT5, 80 % ^ £ y —/tz 200 pliB# L, 1.55^^31 L, f*t^, 80 % ^ £ / —/U 200 pi wo, 5e^%sii-6o y^-bom£jrA£^/^T^<. 5#mm9iL, yix- b 0*£ ^ A £ < o # # (c, io#f8K %SI L, 7/^-^£^l:iit5o

S®tK£70 pl£*Dx, 2^^%5I L, EHXffloy lx- b liy77 S KDNA#$£^ltm 6o n hMz.-? 7 7, % KDNAOl|X*(i^n.5-2 pg, #^0.02-0.03 pg / pi b & *9 , ^ 7)10 pl£^-£ b. M^OSB^l^:^ PE-ABl%#OBig Dye terminator 2.5 pi, 2.5 pM ^ £ £—~f 7 4 ~7 — 1 pi, plasmid DNA 10 pi, 10 x buffer 1.5 pK H20 5 pl£iE^ L, PCR^l£(95°C 2 min, 30 cycles of 95 °C 30 sec, 50°C 20 sec, 60°C 150 sec) £ IT 9 « E^$(-TE 15 pi, HL reg (Edge BioSystemsbiM, precipitation reagent HL) 5 pi, ^ £ / —-/L 50 pi £ X , MW4~ 4> 0

2,800 rpm, 20 minis >£ L tz \k , ±'(W£^T, L, SStK 50 pl£#xT#S t£ 95 °C, lmin y T L , PE-ABM±M- £:xy-tb- (ABI3700)y T y T-T L, SB £'J $: £ £ ?T o fzD AB13700 O $ £ {4 It & K) ®J± 5,200V, 55°C , 10,000#TfT^ T O5o C> Tl tz $ Jf2 £—£ {T Sequencher y 7 b (Sequencher 4.0; Gene Codes Corporation f±) £m^y##L, E£'jT-££FL-yDy^£ by-£#my^yAicm#i-6o c. yyT-^ —£^- — ^££-y7^-7-£mv^y##rL/: 5'to«k3'#j(7)^400-600m&(7), S bt ifi: 7) # $ SBT'J y “ £ £ T> ^ y$i)20meri7)gene-specificyy y 3 DNA £'□/$ L , a.y # #L/:yy^^ KDNA£y^yix-b(!:Ly, #2^yyy7)y-£^yyKiT;£fTV\ #m(D^&yy@B^j£^^y5o A#(7)@B^m^^^Ty6$yz(D^£^y£

—63— Sequencher y y h £ V' X BS?'J -^xy^ort of; 0 MUt^y 4 V * — y y'?££^5k¥ll-S Ltz b (DX*,

yy^-7-No.i«fNo.5^-cE &WC. y.T^y3T(±yy^-7-No.2^No.6(CTE^e^:^L. ## % (C(±(M%7^^ y If — f A#: (■Cl iS*t" 6 ^ "C# "9 iS L T VA <( 0 d. '>“^xy^7-^(Dfl

BB^iJ-r — y

t) co> jo ck ^y y — sbyy t y y y — bbyy cd s "t--5 y p — y ^ ii lib tf y y t y 7* L, ^LML-cy-y^y^(Dm^^fTo^. Z(o#m=^yUT-L^y-y^yy.T- y Ity D“y|i;f-^fI'>^rA|;m, SS&ftjl-fT 9o

3. 3 SSK* 6Efc

(1) y p-yc£>ES

^yp-yij:i,oi7i:*ofc 0 BByy&SftJdSm^&vT^-r t°y y -i^-yiyt- A#Ti@m L/t. T¥m(c^V^T(iim^K##L965 y o-y&#T L.

•£<79 9 f)960y p-y {Col'T^W L/d^S^Sy (#1) o

m y p - y (dbbsb ^ BByim^jm

5 8 6f} 78 8fl 98 10,9 11 8 12,9 1 8 28 ¥S fMt S y n — y <79 BBSS: 64 140 373 96 683 1,356 y p — y (7) BBSS If Sc 64 64 204 577 577 577 673 673 1356 1356 1356 s — y p — y 25 20 98 32 164 339 ##f##y n — ysif 39 39 159 434 434 434 498 498 1017 1017 1017 1,017 BBym^sc 5 27 7 0 140 108 126 105 112 330 960 e?ij^6»si+ 5 32 39 39 179 287 413 498 630 960 r t 5^t7^fdABi3 7oo^r-^t p7y “-S!y-^3iy'^-(7)fikp*n«t9 y# rn/c#(d#m L/Ce

(2) 3yk°^ — y-#my^TA(D##

^ttyy-lDNAy-yiyf- (AB13700, pe-abiI±§J) =k y f#6iL/cBByijT — y (3#%BT#LAN&#A It, ¥ y (Sun Ultra60: 360MHz x 1 CPU) (C i# #^fL6. 9-yy-b(7)# M t Sequencher V y V % ^ y y cyfef&ftM bffi ft *s — yxy^W/ty^

—64 — V —SrfTVX (D FL-/n^x^ b-r —^ WS'> X 7-Me y p — y SU (C £ it

It (02) o E^T — f d^#T —^ /< y y , -tteibh, GenBank/EMBL/DDBJtClEB £ it/cE^Jy — y 1*1 (C # 7 'y n — KLn r iUC*j- ItBLASTNCi dDNAl/^/V A##m

(GENESEQ, fA)l:^ltt)^^ny-^i^lTofco &3o\ y=' - y /< - x (D M Sf 1-M LT(tGenBanklt 0 * , EMBLft&i®, GENESEQdl^iiT'Mff $ it, 1b-/< —

c$ frl&o Wf #1 :!£ iBit /*;: t — y ^ ^-'7LA(7)iij@-b C#6(D^ 7 A

03-i (7)^^ g (lift 1,000 y p — y T — 7^ s<—X \~£ x itit, EST (expression seqence tag) T

“ 7^^“^, NONST (non-est non-redundant: E S T 7b X Hi IS E 7fJ £r ^ V' id b (D) t — 7^—X , GSS x HTGS (genomic database) , 7^ # y — *7 — 7* F*J (D # ff y — 7^“ ^ > UniGene-Human> UniGene-Mouse, UniGene-Rat, io j; Cb, _hf£

T S y &E*mBLASTXi£V'ti:BLASTP(C «fc V , ±E#St'- 7 * 9 6 7 1/-Af7 ^ yfifcETUlC ^#L^7^y#E7U(c#L7, ^ritfit^^Diy-##^fT9T;i%=fVXA-e$)5. 0 (D^iitD# 7AtliNow Status

£T&it[zffi$co&mT'-?'<-xtb, EybL^^-yy 03-2-mBLASTPtC Z 5NONSTt2'— ^ ^ t (D-f — X y77A^T(C^i-j:9lC, #t)^3T(Di#X)^^t(D(CcV^l:, A^l E^i (Efiquery) (C^LT, y-yyb(DT^y#E^i (T#) (DTy^y^b^^^ it-c#^ ^m^ii-mLycT^y^^mmy^ym ( + ) zcD#^i± t h a -fetoprotein enhancer binding proteinCDC^W^ 53%(D 7 d y “ ^r|'&tf)tza

—65— -Oth-h SS?>J 2 kbp

A

4 B A, B: — 1 ~8 : gene-specific 3?7 'f "7—

HI 1 ~J~7 -f 7—^7 --- ^ ^ v7 /i

GenBank/EMBLy DDBJetc.

#8*% —9 /* —X 7f\t □ S/—"tt—^

Et h e r N et FL-p reject ABI3 7 00 (Sun Ultra60: 360MHz x 1 CPU)

■ n—*7 v — h (7)W@ • V->7X>X««f

HI 2 3 2/1° jl—6* i/X’rJx (in-house)

—66 — °9 (#^%0E) 3%0 '>#5^?7 9^a.mm%06-%0E2)^^-f)q^O)##jyO94-#^^#VNa3

^ 64.4 ^44^^^.p 6 'f)

V^W^S 'ffK'SHT °--/ "1 >^%dqss£‘z 'dq^si‘3 vdq6g0‘3 'dq^gg'l vdq90 g‘l 'dqigf/'i 'dq[[E'i^^^^ '>^2)miO)###^/^3 "# '^T a#^?}3sa '## '#N/#3[:Bg#&zk a 4ES90)%89 0$)Wf)0^ q/Tf ? ^ 3^OC0TY?T^ 4 34 08^.^0)^ °^c.($a.dq 936'I 3^1)#tk%^f)^#VNa3^±OX-a ^0964# °9vA41^^#^ /^Z^Tk ' (%OW () ) #3%0#Xg "#:#VNa3^7k '-6 4 .4^4 '9^T44:^ '%vi5f:2)3^f)Q(?^a)^^##a)4-a/;0964# °9<$a_4- d4E6E (DN1) # '4 —a<;63 (SV3) Sg#t7k##*E '4-a^0E (SIH) 3 '4-041/.1 (viv%) '4-a4^3 03%) "4 — n ^63 (AHH) ^MSiS^tff '4— a ^gS3 (flSaV 'HSQV '3SQV vV%aV) 0{CT9^2)3^^mWO)#E$a)l4^14^ 'cz)4-a ^096^3 °9149^% 4 - 4-a 4#-^k '<^9^4^ #? 9#%hP ? % 9#N[i4P@o^^^ 6 T2)^^#^04 44-^^4E^__6.4 ':)% vic^2)4-a4(Z)9V% 4f)J.%4 4

Jp—K4-^.^4/: 0)#:^/|!#WU3)U03 gg g '%i#2)#4 —a4 #### l#^^^n^P@^^^(Z)4- a 4;96^14$M# 6, T - 6 4 4 ^4VNQ3

2)81 °9$J-^?%^#a.^WG)T^T%86^gm00t7^2iT '%#OOS-OOE °9^J-^U4^?794-^^^ri^P@G)4/4/<-4^8E '4/4^4k2) 8 1 '%v1Pf2) (BB#%4:) 47}#^a^^003EI8V-4<^4-^VMam— 6 4.3^-^: W^^Sfi45@ (E) 2 y -r y y v - y p — y 3p# ORF(%) f #T ^ / m cDNA# 6! (aa) S(bp) flBto (ACXX) 255 1,506 156 (61) 230 (hu v) 29 2,059 24 (83) 342 /H#%# (KAIA) 171 2,355 130 (76) 345 25 1,559 21 (84) 276 (REC) svimm^mm (cas ) 77 1,481 65 (85) 284 lE^/hE (HIS) 30 1,327 20 (65) 207 B (LNG) 373 2,187 235 (63) 238 1#+ 960 1,926 651 (68) 267

£/c. ORFfc btltz.653 ? n — >(D^X\ 5'-#%^##(D# $ #349bp, 3’-#SE1W ^c(D# ^ (±805 bp-efco;fc 0 ORF(D5F^ESti798bp (#^^ EA 266T ^ / #)

X&'itz. o ORF^ £ ftfc ^ p t. ORF/cDNA(%)^iSt:|£t^(7)'e(j:Jt^^

p — yZ,S%(D^A# He j:6mRNA(Df-xrX(D

ORF^-a Ay-ev^ 6 . i&VMi, * y^° y £ fi&T L t> V' x. ft i' z £

*^ p v" —)W#f ESTt"— y /< —x o tzQ £6, NONSTx — £ t y h (D|^46%(e ti^t^>0 Z <7) 9 h , 75%(t genomicE^lJ 6

%k Uy

(D ; No-hit, 101 -800 (D b

7(Z^#:^b-Cl^V^t(D^^35%, #^P^—(D&6t(7b ^V^(2, EST#(D DNA^rb ^ — g[ L6 & (DZ,S46%. ^ ^ (21^ —B: L^: (7)(219%(7) 183 ^ P — y "C, ^(DcHeilNONSTT ——^ — as

y. (7)^.7?^ < GENESEQ(D 2 9 ^ 7. ^(DiF — £> b <5 o Zb2 "Cb, O y^-(D$)6 y ^

-69 H r^r tn n '> CO beta-glucanase , CO acety lglutamate kinase, '> h 'y 3 y /< :& CO glucosyl transferase, t h CO O-linked GlcNAC trnasferase , ^ ^ CO phospholipase C#co k h 5 ^ # x. 6 fl <5 t h #f , ^7’> COphospahatidylinositol triphosphate binding protein, ## CO GTP binding protein , # # CO iron-transport protein , t h CO Ca-dependent phospholipid binding protein , y y h COurea transporter# (O h y V y zK — $ w-JHqp'S

^wmm?. mmmm, #co#m6#m*

o V h^RT-PCR(CZ , i&lMt, RH y t° > ? {Z. £ % %k&fc{\L #C0^^, ]) 'S \f -r > y ?'s ;< y %(Dm% t £3?5t£<0|pl£, V , KO-^yy^m k F##(7)9

fimfrb$u&t'9 £ii*u • #i# • 5 ot^< 60

3. 4

-^T, (ccv^T, ^ ^/

3. 5 simxtt

1) Suzuki Y, Yoshitomo-Nakagawa K, Maruyama K, Suyama A, Sugano S;Genc 1997 Oct 24;200( 1-2): 149-56;Construction and characterization of a full length-enriched and a 5’-end-enriched cDNA library 2) Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ;J Mol Biol 1990 Oct 5;21 5(3):403-10;Basic local alignment search tool.

-70 *4* 5±ficDNAEM3fc$

4.1 MS

«m Ic 9t$ L £ £ L ZtS& nrtfccDNAy D-yroycssaE^ifeStt. est y© J; 5 t£^&9 D->®*5cSStt®S:iC)n-xy m#9>/';%##®7kAE^m&^iE#i:R6a#%®ct. ita m&?tm&t%(tr)t>v3’ mum#w%A##fi:i%o#®ck. k&*l£.ALJUi#SftCJ;oTM5e2tt7k:*U =f=E i> y k >yffi (2) 1C <£ S^ftcDNAyd" y yu-cMLTtt. -eoa&ttt-ry@icj;^%±*cDNA7s$:ai%tifikut. wraloooa#£ @ x. scdnap o->®AameEM*%&g#t L^^AAcDNAEMm&&%mkA:.

:»ia®«ti»y/A«%#»T»555£»cDNASFOTF%5£ 6 CffLB8i •§£®4:ftS*#ICfflSSeL =k7 6t»tt. yVA®«|gBy-®#Bg^f6fflS*M%®fci6®ttMS$6BC*y-5r t^Biet"

4. 2 tmatx*»

*11 -y k>y&icj:5cDNA7'f y? u -a. 0f« y y ;ex m ^scfif e. c *«E^J&«fflUTttft%SilS*SW%0rictiV>Tg$a®fiTi:®aS®Sm*5 =tr/SSiJ*5Syfi^tl/tJ'D->#$m««»i;tfc(3K^ □->#SrbtiEtiFi:UTE?ijft S&lltoL.fcy-A$:EfWy-Ai:ll¥#Lfc. *#*-e®EM*@y-Aac®mt:^\U 7 7 yfff 1 — Aii'fJ DNASf? — A#i£>5>). 6#R@cDNA?y y? U —®6jfel#gaRti, katoIII SflS(kat). IES/J'#toMHS(kaia). iES*llliBfla(colf) , »6K±6iEe(hrc). BttM¥» ®«ge(cas), «*MS6(hsi)Ta65(#i)„ tti^WWTcfeSyyXS HSWfWIi. 2xLBlgi6$^aa4-rol8#ICT1g#®E. ®4®*ft ICT«Sf7"7X5 H (pME18S-FL3) ©MSff 5 »IC£ D , k fc. f# Gtlfey 7X5 I’DNA 5? £* M M [I. BigDye Terminator Cycle Sequencing FS Ready Reaction kit (PE Biosystems) t5 Zp lyii' 7 iZf 1% ^ ®IrI TtDNA Sequencer 377-

-71- 1057 5-f75U-SiJ

±icy^-f 7—^^% 998 ...... -......

£fr 480 kat (Katoin tans) 410

/mm 7.2 kaia 9

2.61 coif 148

±lC7>^A;i 59 hrc (K$K±6IBga) 253

5>^A;$¥i6|E«$ 5.36 cas (a$!iE¥;ft®s$oea) 82

hsi (fit*) GC ## 50.4% 155 w-m&mm 1680 ...... —......

1798208 a it 1057

-72- 96, 373XL, $;tHDNA Analyzer 3700H=koT##fL, 7X7X$r#/c. ^(:77< 7 —j;oT7 D-7^^ 0 1##&&#/:(% ATGC7-7X7X7 vt77U 7 7 h 7X7 (7 7 h7x7IM)&mWTE?iJ^-7 7 vi:777J77a37-t:77-X7-7X7X(Ai##, y7<7-^fi&l:T

#Am(7,8)aw-D^7>yA&^#mL^.

4. 3 eaa##a

(1) mmi£%mmMk(D%m 7°7x^ MDNAmmm^LT^zirn^-3.7;i/& (El) £ rfcE7— hS^PCRB^SS A&^rfinxAc E-Xn->^^S^1-^)^MDNA0lS^^^tl^n96A7V- h

±TM#(:f7vx lE^I^E^^n^SSISc^itgeL/t(E2) 0 E243qiagen _0^cd ^:7^ QIAGEN kit(D@# (REAL: R E A L. prep96 kit, Turbo: QIAprep96 Turbo kit, Ultra: QIAwell96 Ultra kit) £ , +G50ti96A7°l/- M±4£T'0Sephadex-G5OX t:>7 L/c C £&. +ExSAP^Exonuclease I&U^X hf 6 ^Alkaline PhosphataseMS L ft Z. £. . E£0 BJ ZfrfeJ: 7 \Z, ^72a7M, QIAGEN kitfe H & $f & 7“ 7 n°oSI: 4A& »MDNA*ai§aPlffiTf*ofc^, QIAGEN kit&7^M£#X £ tlAc.

(2) E^m^mm^i^cmug ##1,000^7 u 7 b AcoZ. 6 (Z) S AE&4,690 ES, #/J\#(±346E&, ¥EEtel,680E$T'$>-3 Ac= 7 % 7°7 A 7-^7t&©X^7>#A&^^^^3 v Ac 7 n->te59T£o Ac. 7°7<7—

7-M¥E7.27:& o Ac. -A, (D? EmmmM5.36T&o^(E3) mi). PEttDNA Sequencer 373XL, 377-963; Ac te|Rj#DNA Analyzer3700 i2 T gR # E ?'J 8t £ £ f7 7 E (D# # ^ ff W, 1A > Af— ix S 4^1 (2 UPEti: (D AmpliTaqFS-BigDye terminator &, Amershamtt(0Thermo Sequenasell-ET terminaterii $:## LA:. E^j(D###%T(Z)^A#^## A^f#^lTW6AL 7-7X77 7777 7«h(D3 7^7/ t: emmu/c.

(3) E^J7-77vt77U77 E^J7-77'y-fe>7U LkmmT#^A

-73- 1 yyx

-r-4 — jiJl'Zf \y — h

Mc&1ml0ia£#|JCA 2XLB ae @=i nx-£/nt§tmi@ 37 t, 20- 24 h%# v 1500g, 15»IWC«*afc* ±»€:ttT« StX 300ul(DPlyx''7 7 7-*i £AHXx j;<#^ 300ul(Z)R2/X^7 7-* 2 ^JfJDX. Tape pad£(iU8-10IU;S-tf3 ESTSttmttE 300ul (D P3A777 — * 3 &AQX. Tape pad£li08-1 OHJS-tf-5 4°CT10$Wcg

r?v h7>7^;uy-yi/-h (77002806)

/t+i-A?—/i\—vi/ k icx'f —y^ijuyu — h&t v h Ux ±(:7 V h7>7< jl/^-yi/- 7v hv>7^;i/^-yi/-Hc@#mm& ssoui 3is daic 600ui fvyo/w Tape pad £[*0* 3-50*5^

| 70%x^/ —)\z$t sooui roinx^ I 6000gx S^PbIjS/Ox

;5t» I %%&15#f8x I S-^xJUlC 75 - 10Oul

DNA>S>$

V^U^X^U-y HVN45 (*)

(*) )]7AP-y^ffll'TShephadex G50 4rfn#>x 330ul (7) dsHzO&AQx 3#fWm±##mx 950g 5

950gx S^iayOTdiS v

(*1) PI : QAGEN # 1 9051 (*2) P2 . QAGEN #19052 y (*3) P3 : QAGEN #19053 mm DNAy>y;i/ -74- =IS8*#*«*Skelt , -f >D— h S*SA'6M«a4tC^'0#*ro|||*|n](D4.^x®y5'f T-iSffSrMter^iiltS^SfflLTc (04). c®ct«. v-Ei+*[6i$sse-sc«*., E^JttA^fcTciaswWttE^Jtd^n S-SiXi-y/cftSrSWU *#®7 o —7®A#KRS#aA5±X\ $7c7 y-b >7" V Sfl fen >-b>+fXE?iJ©*®, S*tcMt>5ft<@$6*t"-5±T*$6T$o/i. S«»gi:Sl »£7 □ — >fil:#ltyn y h bfct>©*i05T»5. frSfc D ¥i^480#*®E*® 0 TW«ife5ttf fTfcnfc (*1) .

(4)#i$gt$B«A«*®to*®o * X 7 A 7 7 -T v - *5IS tt ¥ °J m ft R * E SJ * ft tt-A > il - R JS £ ff it $ A -5 E FI 9 £ M sstfcwtffefe, ->a v ham?*, A®e$, 597d-> tCftlAT, h5>X«V>SXtt$:^*CfflVX 7*7 1-X-SsfifiCTflS Lft (*l) . C® 7-647n->lcliBUTI$Se>tC7 3 y HA Uft. SSI# V AX M/y A®ft»E¥ tfeW'fttAfgi^lCltibTti, (T) 19 (V/A/C/G/T) f; H o ft 5 S W ® f# @J X 7 ¥ 7 — £ #f # S H L TWWSifctlSttSSriiJiigft-BLitift. 4?D->©*UAX hi/y 91® ¥^*11281? $>o fc 7s. AtVB#8lj7,7'f 7 —SgLft427 □ —MICKl LTt2, A©¥#S6X8, #XS**136T $> o ft.

4.4 SSS

¥#l< >D— f-:S2kb8#©cDNA7 n —MSAMtc^rSEJiiln^ttS'f 5ft*l:ii, 3AE ?iJ$ftg-r5.fcii)®->-7x>->>7*ffl«SDNAe, 6 • Sfi»f*fft5CitfegT»4. 962t¥l/- K±8ICT7)l/* U SDSft £(t7 79 X 5 b*#ES®ffi%$7tttSXCJ:0. -A-fttMS. • MS*St^of-?7'jt/9'JCI ff#&;&t£;ift©i*,S»©BBAE?iJx-7©#f#£nJ(it;:l.ft. $ft, «#bJj|6ftM-7l> '>>7*7'>>®®^W->»J:Dfl,«fe6.n-5»ffoI|g^*X«#:ictMt'T, -f >D— MW *#^6iwsi$6iLic*7#x®ie*i=]®*^roX7'f7-se$M66-r5.#iii8esffli"5.r ft®m##E&#m-f&±T. $ft7y-b>7*usnfc3>-b>tfxE?iJ ©#§Sg. SSlcMb5.AW€8«1-5.±T*8fn?6-yfc. 7>^Ai£ICj;5DNA'>-7x>->>7'ttfflJlffiibT. May f-AMfti: f7>X-f7>

LftU, y>7Xtt®*CT^:ftE?iJ&ft^lffi AESJ ® AS £ # 0 7 M 7'A C ¥ x -6 M a y f-AMft®*»i, #S^#@2^©X*^&Ct:< Wft6#$ljT&S. -«tLT06C. 7 0->Colfl506(^7 7-&*5018$S) SrXEtCMS T-y>7'Alctt®-®o/tE®tt¥lR 0 8866.6® ASS:¥Uc»X h 5 >X# VXSXtttttE 0 M666® ASG@@A^±C&#@-6^*)6. £ft7 □ — MftDNA©## X*. X77— IS fflb>ft®fPLfflttfc. May f A >&® SWEATS&. Lft Uftft'6*#—MTSSflAfto ft

—76~ 7>yA&aLTiifm#%

&#k:kL

^#^T&3/Co -MaLTimvC, ^O->hrcO14960#MS?'J(21OO#&)&^fl@#t:#L T/\—7°o^ ^M^o —>^i99#&(D@g^J^io[g#0)g^4i'5##[ #%Mmaa?'j&#LT&o, #$m^UA X h (T)l9(V/A/C/G/T) 6Wo/:5##(D#gijy7

HcDNA^YX x V —^ □ — >!!£: UTltsf 105?E1(D h hcDNAX n —>&± 3'%^UAXM/'yf-^#W^^^GC#m^50.4%T& o/%. ^ ^:<(ht,39@79^n —>^|%#mRNA^A^LTW6W#B%^^#^^l^:o

L#7bcv-7

gg^J^^^-A^#:T5153, A#%T2,129(2000^2^ABm%)T&6. W'&m, DDBji: ^^6^^^(2000^3^23B^&) H, ag^J^^f—A#:#:T2,212(^^%^^:&^340)T$)6. ^#^1, $3* 4^ vH>^^cDNA7< 6^^

4. 5

(1) Paul, S. and Declan, B. (2000) Celera turns to public genome data to speed up endgame. Nature 403:119

(2) Maruyama, K. and Sugano, S. (1994) Oligo-capping: A simple method to replace the cap structure of eukaryotic mRNAs with oligoribonucleotides. Gene 138 : 171-174

(3) Suzuki, Y., Yoshimoto-Nakagawa, K., Maruyama, K., Suyama, A. and Sugano, S. (1997) Construction and characterization of a full length-enriched and a 5-end-enriched cDNA library. Gene 2QQ: 149-156

(4) #:@'ilRi#u #*^^(l999)#M§%t: hcDNAfB^K#T^%7°Oi/J:X h E: Y /- V —Vol.57,No.7,pp47-48

-78- (5) Deininger, P.L. (1983) Random subcloning of sonicated DNA: application to shotgun DNA sequence analysis. Anal Biochem 129 : 216-223

(6) Yamazaki, M., Ono, A., Watanabe, K., Sasaki, K., Tashiro, H. and Nomura, T. (1995) Nucleotide sequence surrounding the marker D21S246 on human 21. DNA Res 2: 187-189

(?) Stellwagen, A.E. and Craig, N.L. (1997) Gain-of-function mutation in TnsC, an ATP-dependent transposition protein that activates the bacterial transposon Tn7. Genetics 145: 573-585

(8) Stellwagen, A.E. and Craig, N.L. (1997) Avoiding self: two Tn7-encoded proteins mediate target immunity in Tn7 transposition. EMBO J16: 6823-6834

-80- ms* cDNA

5.1 mm

^^□yxn K£^T-^fSlT4OOO*&±0cDNAffim*ft££n* £ 6 12^H«¥PB1

e,^lTW6. awTkL sosui ft£'(7)frL $£>, tc:£ScDNAI:: j§ L fcMfirZ g ®fr#j Hff 7 yX x A £, ##(D CPU£MLfci65ttf£&7-XXT--'> 3 >

3 tJl/r-^^-x^M^tlilfco A 7 x X AH^JSOOO*(D cDN aE^iJ£XSUTi-

.kxC^oTW^,. % ##?#& t> fe0if^Tt^o

fX h^lfo/co

5. 2 ttfflUitV7h15x7, 1«MI *7Xf-ATmw^V7h7x7, ##g#$rJ!XT(:^T.

(i) mmmtii'jy h^xx BLAST 2.0 (NCBI) [1]

National Center for Biotechnology Information (NCBl) 12

&£Q&2.oTiif t 7 x^#itlfc&m&*jm tWP»5iti6, ^00ck’5*A:*©E^J'r-^&*#f*r-5©fcabfcy x h7x7 T$»6. Motifs (GCG Wisconsin PackagedTGCG) LT,$;^4;-yA#(D Amos Bairoch^ E §<£> S SB AX£ T fX /$ L fc PROSITE£D ^tlTV^o Motifs llPROSITET SS$n^A°X-X2 0V^TS6H^MSL, geXJ^EX-X£^SX5 CttfTZZo £

ProfileScan(GCG) [2] Michael Gribskov^#^ L fc 7° □ 7 7 -f )l/Mtif 7 )V zf U X' A f21^ U X, PROSITEft ckZX SWISS-PROT^ 6%#^ft^7°D 7 7< ;l/X—X< —X^M^T, f filWiie •

-81- SPScan (GCG) [3,4]

w&a% HTHScan(GCG) [5] SSK F X ^ 7 7 7 ^ U ~ CD T — X ^ — X Pfam Release2.0(D "f DaraC^ lysR> HomeoBox (D&7 7 ^ U £f£j5K£ftfcfi^fT?y£ffltAT, C(D^^6l3^#^Ll:^\U'yXX • x^xmcv-FL, m tl?% o CoilScan (GCG) [6] gttiKDmm'Z&ztifcM&ffM&m^TZ'rju Frn^Hijg®## shirrs- c#

F7<;Fi:mm^a, m-ws^MVfrtDii^x f

7 7 F £ S o n ^ ;f F 3 ^ jf& <£ tx\ 30 F =H ;w;i WxtiT

TestCode (GCG) [7] M6E^6S6MSRM£^ST5fc&DyDX*-7 Ao3SSSi;:tiig|-t5S$

mMmi±#(D#^6(: y xxwjxdx 9Ao%^(D#&T^*^%gam(D4:(:mmm^u 7 xx&?#LTi/^7#^^$, c, S08uiTtiSS6(DiL#:

$> <5 o

(2) g^J^-X^-X GenBank Rel. 116.0[9] EST_human, H sapiens, Mmusclus, Celegans ^r^fflo 3^ 20 0 (CNCBIck 0 %# SWISS-PROT Rel. 38[10] PROSITE Rel.l6[ll] REPBASE(#^ — #^^&)[12] 3

(3) ##77 F7%7 Sybase Adaptive Server Enterprise 11.9.2 Sybase Adaptive Server iXRDBMS ( U 1/ — 7 a 7" )V • 7 “ X ^— X • 7 ^ — 7X7 F * 7 X7-A) ^<^-^(D#m-^^#^(D|%^7X^A^##T6^a^T#6o ^(D^:^), IBX:# f-^(D^l@&eD*7Xf-A(D< 7^7 7 F 7%7A L T i6 jS T <5 o Apache 1.3.6 tf—/7 7 7 1lX L7>7lW >77 X — X £ jf T £ & HHTTP^rfiJffl LTWWW7* 7 7 1f&S(D7 7 m w 6>nT^&a C5T&a. C(D j:7^< > 7 7x —XeHST^^^toHTTPth-yX-^: LT^< (7^#^#3Apache^#m LTW

Netscape Navigator 4.7 *>XxAll, %> >7 7x-X(h LTWWWX7 7ifL/c# ITl^. Netscape Navigator H WWW7' 7 7 if H 43 tt £ X 7

7 7 FX7 >7"— F

(4) #m7"7^ F7^--A Fujitsu GP7000S model45 (UltraSPARC-II 400MHz x 4, MEMORY 2.5GB, HDD 18GB x 5) *>XxAll^r(D#%±, U £r X 7X^-X£ U ^(DTc#), x^ X7^Xt U (D X7--7X;i/^%#^oT#T, mm#'b@#TWaMI/\-F'7x7&#mL;t:. fy >7>XxA^#m#^##(C#a^Solaris2.6^#mL^.

(5) &mnm Perl 5.005 02 SUN JDK 1.1 SUN Workshop Compiler 5.0

-83- 5. 3 ->XxAa^-;i/T-5,^-x±ic«ssnT^o, *io*ffe*i;sLTi@ii tcfflii, MH£fT 5 X-if-ti;*-;/ hl7-9 &66LT*->X^AS#ff f & H t)0WS6T*D . y77^W-f XX-Xx-xeiiCT&SMSSEfSAi:^ 7#6. * 77xAHiaac«ffi«W4ff7nrasa-t, emt-^osoc*^*s*®*^t lAofc-f >d7 —7x —XSA

3.-V- l:yXrA

—84 — (1) 37M WlgTSISrfTO ICcDNAttiST'Si|Ofc&ro"SKSa5lJ(B#f«» •t. B6EJStfr©yc#>®l8Esn7c75 y KE?'ilc#76"7 3 y M7JJ#tfTg65r'lcW B tl -5* o -en-en®i»#r7Di''7Attg*))Mce»)3ti, ismtiu 7-737;t/7-7^-x± cfe^^n-So a. a-1. S«E^Jc9#to43 H *0 ® £ f* E ?IJ £ & ffl T -5 Tc tb 1C Jurka iC

SMStlTV^yy AE5SIJ- ESTE?iJ. 7 S y mEM#t:#LTf f y 7'&*mLTh BLAST&IC j=5#R###&fT7. a-3. ga*«Effi«®^ai cDNAE?iJ®*6M8Ee«i: UTSE*t£®ORF$*fitti, $ 5> lccDNAlE?iJ®569^ e, K >$T£7ft-e*tl©E7#.t£iC:i3IAT#fitiiL, £gaK*8ES«±:-f 5, ’ C T#$Stiycg6K#E®«tttt® L&fcttSl Tf5. b. 73 yKE?'J*#fa» b-1. 7 3 y $E5IJ'\©»E cDNAEM©37i©m»#, *i j:ysa-3T7 ;«dtiycga*#Ee«©E?ij$;eti<:'n7 5 y #@2mcmm-7&. b-2. &###yo7'7AIC

b-!T#ESnyc7 5 y KE?iJlcoHT^-tl^!nHT©)Sff7,n7*7A6SlfT"r5 = b-2-i. misi##m©%e IS *D ® 7 3 y BSEM (CM L TBLASTIC =t 5 £fr 7 . b-2-2. a'fyAlDg'rt Motifs, ProfileScan, CoilScan, HTHScan, SPScan, SOSui-$r'7"7l7:7lif|f77 •£>»

(2) y >7 — 7 I — X»# <77-7 x-xsta*e® 7-7 s-ieur. jzxicm fl-dtirtiO . h7-7&iy LTf 7C©@$±©WEB77 7-!fT)T6#$& ff tt A-5 J; 7 (CTS: o TVi-5. a. cDNAE?U©®! (I C* cDNAE?!JttSSJSeirS:fB$75 C chlcj; 0 —# L T* 7 X 7 A ± leg? 0 E ^lt$ME®7'Jb-yic5)-tyT® O&tf C j:T)SiiII6T'*0 , E^©Blnt)tone=I|gT'$,-E.. b. ##yo7'7A©m&A i-f-ttl»#f#St74ii5'iy)l'-yi)B#7py7ASI}?L, HR#7 7$7 V 77-OKE) 8BE<£> *rn7 Communicator^) AA^CH)

E5tJ?;v-?3K

y'JU-yroiltlR j test40 jRttESmttitDKg utf-hBe?'kD«tii x1&5-r^'5V -«tt5Ftousu(eP

invertebrate <*■ AND OR mammalian _^j 7T-;^^^LSkValio Utr-h<7)V^.y T5 P" U&U »0E9j(c*f^

P oy-tr-ftomfr 8HeHK«r gyy

#R$tiy=75yKE9jl: P ORF)6tiK7)*fT RI%3P>|TTG.CTG.ATG **jk:alOiTAA.TAG.TGA '> — P ri it_stop <• -* — * -* K$ *JfiORF*fT50 (bp) ORFJKSBH P ^ny —tr-e^ien g^Koa HR “Itestp m g#j2 testp

P SOSuTOXfi- P SOSui0)Xf7 |P SPScanWXM fP SPScanCDXM TT^i/a^l |P CoilScang)#m P CoilScang)X?T Motifs,SOSui,SPScan 7f3f>a>l P HTHScan<7)XfT P HTHScan(Dm% btz?i/*yl H^y-ay-I p ProfileScang)Xfr P ProfiteScanflKfT ^•^'>a>l ^z?->a>l Hfrl W7 * 7.

E2:«#ryD^7A®ES ej c.r 1»< % p ...... ,3H<«......

TW____ • ueosHIH:

TK1____ |(8d< q • ueoSdS;

Ttt____ |)SglE#« • '"SOS

35$ IM±X • (.? U3Dc;a;nc.H Tft____ . 3‘* 35$ |lii±2£ • Ttt____ 1*34^ •

35$ N±X • ; -6- f* — £-zir//ri£2B<2’-|.*B Ttt____| $3^ ^ . ueasHIH!

TW____ |*34^3 . ue0SdS: j ; J flt'Sil 3T1____ |&S*ES8« • '"SOS j : J j\ L3A0|.M .35$ |li£±3£ • ueo S5l!^°Jdi J : J -I \ 28 A01M TW____ |834<^ •

: 35$ llii±V • SfUO w; >3A0HJ TM____ |K34^3 .

: 35$ |li£:feX ♦ [UIUI - |xeui 9H|PA-3 • : t - A- 6; Hit _J j j ji 3Tld9 ____ I • ew: J J J J J J fr 9 A0^ . 35$ IMiX • |uuu - |xpui an|P/\-3 . : } - 4»--4tJi'l' M2B8 SSt. d d d J d ITw d d : d J d 2* 3J J-00 d d : d J d: WIN / V d d i d J di "FTn (HJiHfV (5)»F3iunumioo <0)£^ IZHi! adeost^ - suwa( ,|nsai ias*^ d d ■ d l d Tqfw d 4 d d d d d J d 2Tw d d j d 1 d d Ww a d i d: | £1f'^ a d d d d : d J d: Ww j uaiebss j i i j j J J i J; |S»t rw

3^rr. I 13S3H | nv | S8!^--a riwriT |%* | 86* fIji?=7T d* | H-^O-hiS C"!nsos 'jnowr swtitlsjBt) ZaJZ-LiL ifv (S)jo)got^jiuioo (3)W» <3>1t,KC

?2 c-2. &cDNA«©*#T*SS©M-J l

s-cDNAe©

Jew @%(&) X?F Commmicatar*2) s$, nn»£» *-a m* »<(•- enei shop jjjj r „^*7>i77-9 ^ttp-//str>00hy(jra.mki,cojpfl0e5/cDNA/ndex.cdnahtm~ ~7] (fjjrBBiltHh : 5151 seq d?lU-4 : coni isa«00002.

RJ'i sequence Iramc2 | Jrame3 | ORFW% | TeslCode | a & ±%)

human

P BLAST H sapiens

;ACmii0'OT

BLAST swiss pro t

OC8S63IMRP3 RAT [CANAL I CtlL AH MULT 1 SPEC IFIC ORGANIC AlffoN TRANSPORTER 2 (MULTI DRUG RES 1STANCE-ASSOC 1 [Q1 5496 i MRP3 HUMAN [CAWAlTcu La R MULT ISPECI Fie" MCANIC'Tit IOMTRANSPORTER"2"^(MULTIORUCRESI STANCE- ASSOC I mtaztitc^T-y 1063120 IMRP2 RAT.... (CiNAriMAR'IurnMciF'lc'oMAMi'C ANION"TRAMSPORTER"T'(MULTL6M'WistANCE-ASSbCI 032887 IMRP2 HUNAN[CANALICULAR MULTI SPECIFIC ORGANIC ANION TRANSPORTER 1 (MULTIDRUG RESISTANCE-ASSOCf iP33527 iiiiiPT iUMAN ;MULf IDRUG RTTlSTANCE-ASSdcl ATEO PROTEIN 1......

:Stert_pos i t Ion :Stop_po*l t Ion Mot If .Sequent

ATP/GTR-bindins site motif A (P-l<

ATP/GTP-bindin*

Start_poa i t i< [CFfOLVLSALPHALXXvLSACVL jSECONDARY j

j L A L A L V A L L F A PA L V L T V L f H C Q f PRI MARY | SOSui \Z J: y ^

mztitz&nm LTiiAi/h JET. ca ^

$ b IC X ^ □ — JUT ProfileScan, HTHScan, SPScan ti&lf&zns—AOT’Sv'&R^J

:& cDNA —K c-4. TestCode(:=k

•So

TestCodebckS^ ;]

0 7:TestCode \Z ck S H: (5 MM tfl (D iffiffi

5. 4

(1)

FL3 > V — '> 7 A l*J T'm ^ £ ft 7c5151*(D cDNA@B^J

(2) a. REPBASE^) ——^/a L^, — V y#^30bpm±(Dg;^H^%E^ja LT7X/7 L. # b. ##Em:#T&4M%#m(2(#fm50#) a.TV%/7 ^l(DcDNAE^J&, GenBankgenomef ^ b^a Sbh, m&(DE^J(:*LTfB[wHt#m^e3/Co M#(:GenBankCmm^^T bS b hESTE7J(:MTS##,SWISS-PROTb##$aTbS7 $ / EE 7Jb *1 b T b fg L/C. c. &^&#M:33WTI%#;=] ^ck^cDNAE^J(D^M^6#%](7) K Ll50bpl%±(D&(D&##^##(D&(D&m8*#B:Rm^b LT#m b ?bo d. 7ay#E^(:^'fSm#g##f(#|5l%#mi5Nr^l5^, ^(D#(D#^##Tl7^^30^) #tR$;K;t#axm;RM#=ka'30(D&#^#(DEfij&7 a y EE^jcm^b,### #T&f7o ^:. f8M###^SWISS-PROT(:##$^TWS7 ^ / EE^'JCM bT^ff b^:.

-91- (4) a. SE^'J 2000bp]&#(: H—y 4000bpa±(D@B^'Jt)139*^< 0 , ##(D t,(D^ 6?oibpT&6. 10007^ %#6o*'a'a;fiTw&. j;e, V^o &&> lSObpRTcD^toHS^HSEE^B: LTflffl LTV^j:Vx

5 0 0 0 'a 4 5 0 0 & UK 4 0 0 0 *« * (5\ 3 5 0 0 ra 3 0 0 0 n m 2 5 0 0 Sa 2 0 0 0 e u 15 0 0 m 10 0 0 EM 5 0 0 0 0 2000 4000 6000 8000 cDN ABE m £ (bp)

9:cDNA SB^J S \Z M't Z> $ S ESmSOME b. MEyij(D#m ^m$nfcs«si^jtt^T©at)T*So ^w-ii3SSos«E^j^5ii67*»w$n, £ ft ti^TcDNABE £0 41217bp lm0il^T25S,

01O:fctfJ£ftfc££E?IJ

-93- c. (phases) ^fl^fl(7)cDNAE?iJ'fy&tfcMfffr'Ofr’DfZ b -O Tz0

3500 3000 2500 2000 1500 1000

-ft tt*(%)

0 11: t hy y cDNA E^J (D^SIS^:© —IM d. Motifs,ProfileScan (7)^^: S6K(lSE$il/t5139*CDgH^JH^LTMotifs> ProfileScan^rUfi 1 L £CF

Motif E?m Profile Prokar Lipoprotein 157 g beta repeats 66 Prenylation 17 rnp 1 31 Atp Gtp A 13 mitoch carrier 20 Gram Pos Anchoring 5 protein kinase tyr 16 Cytochrome C 4 cadherin 13 Tonb Dependent Ree l 4 thioredoxin 13 Ig Mhc 3 zinc finger c3hc4 13 Mitoch Carrier 3 lim domain 12 Ribosomal L35ae 3 dead atp helicase 11 Thiol Protease His 3 conjugal 11 Aldehyde Dehydr Cys 2 amp binding 9 Pa2 His 2 dnajnter 9 Ribosomal S14 2 c2 domain 8 Sugar Transport 1 2 fkbp ppiase 2 8 Ubiquitin 1 2 atp bind transport 7 Zinc Finger C2h2 2 bromodomain 7 34 om 492 258 1+3039* 746 l:^a

-94- e. SOSuiH £ 5 MS 6 SI wBIJ fciRsnfcra i3m±ommm^u t¥$['£tlfcb'ofco

< 300 o 200

4 5 6 7 8 9 10 11 12 13 14 15 ^ "j

Hi2:sosuin mnmfflitL^n

HTHScan CoilScan SPScan cDNA * 230 1001 763 #&

SPScanT^£B£ tltc i/ ^ 7 H£J^T IZtjsT 0

0 5 1015 20 25 30 35 40 45 50 55 60 65 70

-95- 5. 5

1. Altschul et al. (1997) Nucleic Acids Res. 25(17):3389-3402 2. Gribskov et al. (1988) CABIOS 4(1): 61-66 3. von Heijne, G. (1987) Sequence Analysis in Molecular Biology: Treasure Trove or Trivial Pursuit. 4. McGeoch, D. (1985) Virus Research 3:271-286 5. Claverie, J. M. and Audic S. (1996) CABIOS 12(5):431-439 6. Lupas, A. (1996) In Methods in Enzymology 226:513-525 7. Pickett, J. (1982) Nucleic Acids Res. 10(17):5303-5318 8. Mitaku, S. (1998) Bioinformatics 14(4):378-379 9. Wheeler, DL et al. (2000) Nucleic Acids Res. 28(1):15-18 10. Apweiler, R. et al. (2000) Nucleic Acids Res. 28(1):15-18 11. Bairoch, A. et al. (1999) Nucleic Acids Res. 27(1):215-219 12. Jurka, J (1998) Curr. Opin. Struct. Biol. 8:333-337 #56* cDNAf-f

a—-ry

6.1 *S

thy / A(C#cir£^:Fl'CIA/5#6*(l, mRNA£IET 5* >/‘(7SI£&oT':E‘®$ll6£ cDNA##:a@m&##f acan. c®^###ei:mKmicm#i-&#*£ »SCi;5iT#5fcs6tt»T*MT*5. f»IC*(*±4l:J:5*U^v "J tf >y'ri <1)-¥)ttWA^W± 5> K±5+t'7yh7'V/1-tt %S:ftcDNA®f*#$, »''rSDNAW%Rif®/hEft,,Vj;£5SSroS«AiE%SnT*; 0, cDNA®£ftE?'J®tlf% ifc. cfi6#A,-rm6f®*myo?7<;H:MT-5f-^®e# t>egti£ieLT#fc= mmET®%###^aaLT. /-tf> • ynyxy > <-*WT*-S*s, »#®*6f l:o®T®##t:*LTti, #6 eg •*ss#ra> RNAS&itfnx h®A^6wm*im#T&a. S656SS#*¥K**®ttlS*tilW±lCl±. C* It 5 »£*»?!!«*£IE*C«®ST®*f 2>ft* PCREtl®-tM ftTSD, i*fi^iSS6. riSBSWt hcDNAy/Cjsur. FLyDyx? hESSff-ATU. frSre^ficDNA^ n->®^SE5iJftS^fTtenfc. etttiSfrSCEMStStsnsiSfi 5c*fco. -e®*ayo77y«mATmm^##£#x6fi fco WSTH, ATAC-PCR® Ay 1 X;|/ — y y Mb :> X x A ® #E , FL^D y % hE»»yi-AT:E?il8t$dtifci' D->ro%gl*|g$«*L. «|g(SWffl®cDNAx-

5"^- X (£-f-5 ;: £ £ Bis i L fc. A < X * - y y h • ->x X A ® WSIC te, 96*® + X tf 7 U - £ fe o fcS*DNA Analyzer £ *§ A 1/. th > y;h®®S US® £96i#edLSti:TfTA 5 <£ 7 C& £*® #8liFfSJ6®8!i*-ft£tt®75:= ifc. ATAC-PCRSffl®ite*#SW7,X'f v— 6toE

WV7 M3t-X®9V7 hdoit^x-^eav 7 h®E%6m$CfTo7t„ *#%?(!. ATAC-PCRIC=t5>$S®(SStt, #$tt® —M®|S]±£Btt£ UT, ATAC-PCR®ft SB «Zp 1C, 3®@lcaS£XxZc®(s:Pfi5fcfflto£M®-5 £ £T4L-r U y 1/ —-> a > • * —7’£ C. ATAC-PCR ICd3(t6#K$tRl±®^:*®A&£#ML^. ESW®X n —yicoivTli. E^JIWWtoEIR £ £ % ICATAC-PCRIC#lS£fi-V>. 7 aM®%ayD7 7< 71/£#&££!: LZ=. Wtt®snsiya7 y/f ;hd3J;0:E?i| ##^6®ORFf#I6£®##K*^6. *###meDNAf-yA-X£##L7t<, £ ffl x-X^-Xii. STSi8fi*®«IE«SrC$fcO . eSSC $> o T 6 ® £ S;S:i 7)sW1#Srt-5„

-97- 6. 2

(1) ATAC-PCR ATAC-PCR (Adaptor-tagged competitive PCR) tH X 9 > a. cDNA©^^> Wifi it h “ 7 JI/RNA& 6 S# \Z <£ 9 3'^%^ 6 <£> cDNA □“ Jjfc&tf 7 <, H77>fc L fcdT7° 7< 7 b. (D9 iW'fammfmoEBuaiTeaj;5#mE^j^#tf7yX7-$: cDNA^f it ± ft \Z # fin f § o c. +h>7°;i/(7)^'S*^ cfc^PCR 7 -f X — y 3 >||©51a“7'lC7 hl/Xh7H7>#^7777^ 7 7 H — X £: tin X > ^ U A£ S 7> 7 7* 7°7 — ft jtocDNA^: SS'J *75 0 S § 7 7*7° 7 — £ n^6SE(D-tt>7°;i/^|Rl-(D7n-yi-iM^T^o Z ZH\ ##70&&&&*&&FW## mti, #irat^7^y^“Oia^t,

[A5. 7 7 7 7 —] [B^7X7^“]

ADl-F^S^P do##) adi -fN^S^F ( lfoli) AD2-B$1 ( 1##) AD2-B$3 ( 3#i1) AD3-ftgRS*P (3##) AD3-E#2 ( 3#li) AD4-E#2 (1##) AD4-F^I^## ( 3#i1) AD5-E#3 ( lfoM) AD5-B$1 ( 3#ii) AD6-P^SS^F ( i##) AD6-F^^#$ (lOfnl1)

7yT"^-(D#mE^$r^trm:^#7yy^-y7<7-6:m^f #m%y7<7-c j: OPCR^ff 7 o d. 6@E0PCR##&70th< X(: j;o T^#T-5o ABI3700& <7 U^GeneScan V 7 h (PEA' <^y%7AX)^mvx j;opcRm#(T)#w:^#j^76c 7/^67^ 3 . E2t^> A5 7X7^-H

(2) 7—7^-7## ATAC-PCR# i:j: 9 #6a^*e70%#m^##C^A&cDNA(7)mmE^^ 63> — 7 — $:^V^T#EL/T#6ft-5i##&ftA0L^:7—7^\ — 7$r##7-&„ [IU1] ATAC-PCR(7)JIS

W&1 ti^cDNA AAAA TTTT-b XA m#2A$cDNA 4 Mbo-l — GATC--- AAAA GATC - AAAA —CTAG --- TTTT-b)—0 TTTT-b CTAG 79V9-A $:#3A$cDNA AAAA TTTT-b)—0

—Zf~p y(~7— —GATC------AAAA CTAG —_ —. TTTT-b xb^hP^iP^A Aa—* GATC -AAAA CTAG -TTTT-b PCR A/w GATC AAAA CTAG <= -TTTT-b

-99- m• -r 7x-7£fi#tf z>o A-7# #cmi\ z\- F ^ x 7 • 7 7 h7x7W@^6Eff'>7f-A(7)ilSl£E7T<7)ffi Off 7, a. /\-M7x7S/77A## (D 3>bf3.-^i>xyA##^j:U:'e^6(D4;v E7-7fE (2) publicDB^ 6©T“^A?, b. 7 7E7x7i/77A## (D &77bx-^^77ATOy7'y h^-A(D#% © © M^oem:B#;sb&v7h(Z)ffGE

(3) © 7V7h(D#^ © x — y^< —xixst © j;V^7> - 7- < >^-7x -7## c. a- k7xT/y 7 wx.7mmzintcz>m%$kftmm(DmiL

(3) mm • mm a. b bum h —^;i/RNA B (CLONTECH #64020-1,Lot #8040076), S (CLONTECH #64090-1,Lot #6120263) > /hS (CLONTECH #64039-1,Lot #7080714), 'L'E (CLONTECH #64025-1,Lot #8040067) A# (BioChain #061003,Lot #A20804l) > (BioChain #061005,Lot #A212066) WE (BioChain #061008,Lot #A20604l), EE (BioChain #061009,Lot #A205002) B A# (BioChain CDP0610125,Lot #A21102l) b. mm HepG2(A B *SIE)> Katom (A H *EB)> lESk h jfi{£6

h-^;i/RNAte, &mm • m^rco ie#mm^6(Dh-y;ERNA(D#m^^&i:j;oe (1) A<77l/-yv h • 5>7 AA## a. yXfAi^ ATAC-PCR il 43 , A )V W] X' 0 7* > 7" )!/ M #T £ A f 7 )V — 7° y MET %>tc*b\Z ABI3700£SA Lfco f£^CDABI377(PEAf t'>7f AX) Tft. f JUlfe^f; D ©^1) ^48th>y;i/T, lB$,^:0l92 th>7°;i/SS 7°;i/

C fU: AO A T, 7°7f 7 —f:II-^PCR^F<7)17>7°;^ilSH::96A70ly — ^2AlmT£&c^:\ 7°7f 96ae^MfiTff 7>

-101- )\< Xlb — ~f«j Mfc£it©Ac.

ATAC-PCRX ffl © •$. FAM$t6 *5 J: W'HEXfi )l£tt©7'7'(''7 —S^tl^FtlfiSSr^xT ABI3700X#mUT#*#&#l%U/cb C6. Y-©»SCtt*L LAc#6U>:yt3, 0~1.5fmol/MlT$>3 Ac. ABI377Tt3, 0~8fmol/« IT* o Ac C £ *' B , ABI3700©®S* i»V^C©-y->^’JF©#R^S©^W*5£>S t7j:o /to PCRS#t3300l68 gC#R7^^#Ai&0, Ztli^-KL. *Ac. yX IzU-I/COUTtt, SXOT'y'f' 7 —£fflWtfE6£L AcPCRSteS-SiigttCBKS^X

E4©j; 3 epcRBttoaatafttttf Jtw-f *>£«£ l Tf# BtlAc. ATAC-PCR C35 (7 5PCRS#it 5: ABI3700$ffl tAT #§g U/c->X 7Ax##xe&c £*mmxe/c. ;ffly7ri»*iii, ^A**i8#*$R**£© *RHJf?ECJ;OfTofc. b. UX7A©K#$# #eL/t->Xy-A©W«j3Itta*©7'f77'J -^BK?lJS#fc288»e?(96iiC?x3) iT-7^-7 CSSSnTV>-5gitti*fi^l92afiT(96)te^x2)©^lf'480®ei LC7)HT#J$$ff oAc. #J;£CJS©AcRNAl3, b MFFIB, b h*Mj3 <£(/#>CM® UAcBB • *BJI§£2S

7"7 U -mmb LT28gmef SrfiXU ©^960+h>7';b C7n>T©##f £ff o Ac £ C 5. ##A(7 —XAl^m^Allbt© (387277 > ©lb (90.8%) T& o 7c. £ ©»J® ©iBi@33 i Ab'iteSAb B , $r '> X 7 A A*ATAC-PCR £/W Xlb-7y bxfr3 ±X##X&& £ £*l#m^ Al/c. WAc, R%*e?©»A^B#t:. 7# b-yX»*ief, tlb - 77T XlbMg*Gf . yfD-X7y X^-. 7M hAK >Act*©192iig :f SrS/yl?, Rl t* C%SE$ $81$ L Ac. S2p««ic7J>m«6a/uT#i$u7^$s*, *m#Kit^#^©mo*e?i:3^Tx& 3 Ac. #•< ©iieT»7 77 >fittT%$gLTUAj:©7b, JgiiSAl'>A.t ©Ac© C, 07-Ji7 AcAc©T&%. «2FiBE$iE$ff*b UTR$©8J66S$*Ac7b\ |HHSiC5%@$©%$E$tt09;£*T&o Ac. £©i|S*A'B, ATAC-PCRXIE#%^#& e 3 #&!:«. «fif©S*fi«^gfpMit -SCtCSStt^JlIgStlAc. c. mef#mw^x7)t,©i:LT@

£dnrH5>7)i. »e^#SW^X'r7-(3E?iJ^i:CSSI7-r%£'B^*0. HtSCVX b $■ L 7c = ® #mWXy<7-©###M£ UT(3, cDNAE5iJ©3'*>8C-#;fiV>M®gN!l ©«Wr8M4/oB3'*s6$TS:»9 tirlU, 7?X?-A73 7-tfflTml©g^!4tKrtT S>-5lf©4«a*«Ac7J;3AiX3'f 7-4fbJ*UAc. CfflB, 7 7 7-£» gf ?< 7-A<$@3m%xy< 7-&#^76A^#©MAmB&4*Be(7.a f*.6A'7)#g0t,©S«#(LAc. © $l©8i®B W (Mbo- I )T©XT' 7 —»i|$StT£Aj:7b>3 Acig o', Hap-II, Bfa- I ©KMC

-102- [03]

FAM.HEX^-fV-COiSSjr^^S

s6 1 r'

m 4 n ♦ FAM lM 9 ■ HEX "2 it*♦ ■ » n I V - o 1 :l 3 S$ (fmol/yl)

[04]

pCRj$tt»jt®jfe8

« 50 ♦ H 40 JS 30 ♦ 20 ♦ # 10 « 0 ( 5 10 15 20 pcRSitofi(ii$jt)

-103- ffilft-bB-f diT, JWffT'S 6-fd 8#£tT. Mbo- I©*T t>m^S:#:©l5%l$i:S6UZc^,M®S*e30Cie^1-d«h T. 1259«E^=K 7ilEf ££(0, 4#®l«KTt lx.5dti!t#t. ® fE$SW©|6j±©fcl6, iSSc©E?iJlcoHT®-}6ffia$nIlgCL, 290, SJfgys g'jc«$stt(iy i/- h96#E) dticsatT,^7'f T-®-K^tb*r?>=t7iCLfco d. x-7M ® ATAC-PCRx— 9 COffifff V 7 h ATAC-PCR E =k 0 #f>4l-5»|gx — 7(1, - eme?©pcR##+i-ixi:*iSLtmm (##) - %# a®#) T&6. W/c, dtie.4. PCR##tM' X&#M. £/c##x-7 (% 1/7 hD7xD^7A) (025r#!l), d ©i»x-7 © + ;!P E> tf-7 £ *&

s&*efdii;dnsiwtL, *#tt©-K&#afb&.

© ATAC-PCRx —7##f* a## ©TMELfcVX h$:*IC, •AS7Xft“. BS7X^f-©fflffl©mi;Ef-7IC#L/-C(D&#fTf&. • A/BS

• a/bS5

£©ofc. #&7=-7&#/:#AA'6%##**%^?©-a©#f$&Web±©##?e X.5#i*$HSU£:.

(2) ATAC-PCRd d £t; J: 9 , ATAC-PCR ©IE#tt &E"5" d £ £ B 64

£ l/t5. £#?Wgefl5ilg?Cd>V»T©SI»*aT©#«St K T$>-5 >1E® £S« L fc. t>

V < tSBODYMAPJ; 9%91)8$7!>W>iie®$8A,;<£<,Ef4E^ 6©%S#cDNa7 d-> ^6taoRF7li900bpK±©7 n — >&MRLfco © ATAC-PCR lC o £. 2 8E©)lfiT

-104 "XT' ~f 7 tf* — z.t\ t> (d '4l\t\^ (D j — 1f > 7° □ y hcox —*? ^ 2 y 7 — ATAC-PCR(7)T-7 C j; 0 > ATAC-PCRUct 3 X% ZtlftT — ? 0 ff $114 Northern blotch 15 EIGHTS) 6ch#A6^l6. M5(:, ^ ft ATAC-PCRT' % m M & & f# ft 26SIM (D [S ## IS 7b SB PI H 'O tb T fc Northern blot £ ff 3^chC5, C41611, mm(7)mmT%#^#^C(h^T #75^3^^^), ^^bb^ATAC-PCRchb(:#(-761:M675^3^. a±$r^(h^6(h^l(D j; 3 l:7^6o ^ 6 t:RT-PCRT^3##(7)7]: y 7 9 MlIORF^^U&fsTlZ^ WTt)20 @#@^Northern blot^ff ^:< S/7'7;i/^#6^l7^^3^o

(3) oe* a. m, aM&j^Katom

$lj;g##^Mbo-I&#^c LTm W6 7 ch t L Mbo-IT iS V) fc g£ |f fii T # & tb S □* 14, Hap- II . Bfa-lSrffl lAfco (D ATAC-PCROUff ^^ii^^uziawmmchT'E,. mm-mmii, ±m, /j\m, HepG2 Katoin, 6@mo7y7°f-&mvx m2i:^$^^7yy^

© K&sm### U 7^-73 #im#mo.9a±(DKT& o ^#^lC3l^T(D^7'—7(D%#^ffVX A$ 7 77^—45 j;Z7B$ 7 77"-^—

tIJ aE L ft In ^c & # # 7 — 7

8U(DmgG##(D##7k^^fT3Tm2^ai#(7)mij^ L ft o b. 7-7(7)%##:^ g##j: 0 @S^b&7 ItTh, 1259m^f COUTmij^^eiX 694*^7 c. ATAC-PCR(DT-^#m$(:3UTli. Mbo-I^m^mm^fC3WT11. 6%^ ^1:45WT1461%T&3^. Cfl2% #-*#'E##3j;^B'aA#'Katoni(7)&3m^ ^chT^6ch70%, 65%T&o/h. ATAC-PCRl4i#1#Mbo-I(7)^&mv^Tfft)4l6Cch^^ W7 *E2cT'ldb Hap-II , Bfa-I

-105- **r-?ffl®MSS*3(:St.

(4) ttfgfSttfflcDNAx-^-T, »i*5T-?^-XV7 hi:LToracle8£SfflU 7 'S-X 0 £©l§*. E?i)©aft76*D. E?U©V7 htoftfW ffiaotfl'3ft5*!St*SI*tl*iTS-70T-?'X-XtLTi*t5Ci# *4ic, f ©x-7-©-ae^T. a. %#mtmei:#3(7%EMx-7## it

6.4 *# a. 7XxA##^f ©#f### *7n-7x7 Hi. #»©k Fi#ef ©*##%## ^ATAC-PCRt:Z OR# L. x- C 0 . ^©SlSBt UT. ATAC-PCR©/\< 77F-7'v Hb&eo%. RyXfATIl, 10afefeD967X7'ly- MnoitT- 7;b©8c«l^oIffitC7S7

©7-©B6-e-eExT2S-f7#6-r-5 tttlH, lB*/c t)6B«ra©f6a*S^100»

sa^y'ft; y 7 lz>-70#$SB^5>, it >7p;F©**3f©*jg(i£#-5 £ trfT #. g 6t:-*©#m#m&96mef&#eaLT%*mic#au. surf-sr tAmm

*fc. #>»®7'f75u-a>6E9iJs#fcae?-c-7©T©*»^s, ABI3700* ffl (A T w# b ft ATAC-PCR v 7, Xh, A<^)t B 0 ## L T © -5 b Wiffi7? # It. lA'L, l'<7i'©M,«MX4nt. SBltctt.

#i%xm©*ef ^skso a* »5t, =P-p U 7>-->a > • *-7AH^t*T#A£©oJ|gttASt USAcfc. ##rA«T#fc: C© j:7%#eti. giJtofflttSStFStti: LAcO , StofSBiSS

-107- #1 Northern blot(7)f$l|l

Northern blofC ATAC-PCRi: ir— signal£1$tti 5ESti:6<-»

BtoiSE^ 19/(25) 17 89.5%

6/(26) N.D. N.D. EWE^iJ

2 B • HI* • ITHiroSte B • BlfcJ* • Katom©SJ$ 6«an©is a; Bt- Tyyf —i m@##(xi)rt«5«P(xlO) m@B#m(xi) m*@#(xs) rw-2 IF (xl) ITl»(x3) «(xl) KatoDI (x3) # (xl) T5V5—3 ^E(x3) m^##(x3) 6A#(x3) HIB(xl) 4 H*(xl) ma##(x3) 6A#(xl) m*a#(x3) ifii(xi) TfVf —5 fff*(xl) W(x3) KatoDI (xl) «(x3) B(xl) 6 m@B##(xi) rtw*(xio) mem#(xi)ma#m(xio) 6 A^(xl) 7 (^m) KatoDI (xl)

IBS Mbo-I Hap-II,Bfa-I m# mmmm. m# 541 473 241 253 ^j'ii 131 113 46 50 HepG2 331 306 220 224 KatoDI 379 316 228 259 site 138 108 60 63 20 18 8 8 25 23 12 12 85 85 41 41 WAS 89 89 31 31 -n It 1739 1531 887 941

-108- 1 %*: pu= dWO (W 5|:Z%#glS& U) S5KOV<0 - Oiv: WS Sti0 (C) : o) m*i^a*R(oa.$wa)pu»e\»^:duo «> (?*.? : -^H (s) u) i? 660 660 loqifl i to to 83 to to t **08 63 8t08 938000V 860 660 IldeH i to to 98 31 ** 60 9 11 8 *tl S38000V *60 *60 loqi/M i *8 0 6* 80 80 80 S68t 6* 6381 tt8000V 860 001 IoqiAl i 30 *0 13 0 61 61 389 16* 361 608000V 960 860 i°qv\i i 86 to 99 81 63 t'3 96 11 *8 8 111 608000V 660 001 i°qi/M i t *0 *8 81 St 60 t88t 8*91 681 908000V 001 860 IoqiAl i tt *0 60 61 61 *0 8*0 1 88 not 666000V 001 001 i°qi/M i t* 0 0 91 61 81 6S*t 681 838 1 866000V 001 001 Ioq^| i 30 30 88 81 *0 60 886 68 006 966000V 001 660 1°t m M dtiO dtiO dtiO l0ift#dHO:?8*W$rg$aF [17$] ? u^#$z)'6, ma*#E###%mr82#©aGf cud&m&cia^Tti. &me? <0 6*att$zttt-enicsnE«$FS6i5smffla^a:-r5ctct/c. Sfc. ifi«S2?'J©IEiit2 t>MH£&§. dsw^-fT-otaitt. (DaPMSif'f h^ewmr^pcREttsicsio*^u^> =j m&tfihZZ.t. /lE»ieB£&oT#6S6U£yXT-AT(ltiSittlT£Z$©;: £#i#x 611 Kmmgf ATAC-PCRT (13' * S #1 © K ?IJ £ 1'J ffl "f -5 © (211 L, C©@KA©828lj^f — ^ —^ —X±T ttiE«Ttt*:VioIEttt>S£Ti)S;V». »#. 3'5S4SIM(i#®jIR«*£ LT1SSJtill H:itJ55fcl!)Ti65. T©*, EfWEHiffliife^COlita, f ©ATAC-PCRT© 6^6, B2?Wffi*©x-5"X-X±C«l;*ll.T©;tfc© <£ o tmme*i@©TA©i)©£#x6ne. b. SSESEt1—j'fiS E#efE*©»€TA^0%##g^mwtt©fm$

n/t. # >tti8£ffl©fcffesmE|B|-©yn h3-;i/T©#Jg-ett, *#%©*%**! 20~30%SgT$>ofc„ f:T, cDNAH >-/E©!®® £ ± If TSIST 5 & £ ©«I1 tc J; 0 60~70%a*IC$?|6]±d-ti:5C X;l/«$©$ jE C11? y hfc: - X $ ft 5 ® S ^ 7 f If - y a > ® 86 ¥ & £ £ # S "t" 5 £- $;)! $> o fc. ATAC-PCR©ffSttCOUTH, R14WB21iJ»e?T, FSS1gip©aM@S^o.9K±£ ^%*#&KH/c£C5. ffgtt$ 1>X> £*IJ$rSlifeX-X (175.3%T$o ft. £11(119 -ftfr©»l5tHlfc£3l0j»D fitfc«^,3lHl£fe|B|Ca*Sf#5 ££»lTSfc|i|eT*0. »*6©*i&&£HRStt£%£&©fflCT©3„ E»wsa#«e^©5!SttaiiftTH. **ai«tioT^si«sit©^m**ifi< &^> TzbOWh-itc* /hE^*E$S»aE£ £Htl. @»ti8$ &. /M»f>*»©8SSg£ft £619668$£ LT#$ U 7h £ £ tc J: 5 6asra©%sE*©#ft&3a«-ro©2y;i/-yfcti'HTSi$Ltc a*. H-;W3amT(i%ajtAmift3!i&icW'&bs-f. gii©3*mT(iitAi*ftT#% V>£©-5-y-X»^611fc. £©7t©, 3m^R(T(l70%©mGf (2 3©T%mit*l#A6 H-5©C, 6a*S£tl-5£61%£f6V^W¥Cti'5*^^*ofc. TIUCH. «SE*©fi ©ae?##(A%:&#o@L©m%ic%M!7 6-f*mE*©*;Es&©#e#&ac£, #$©iae$©eftsic j:5E8ij©a©^6,^ffl£mtceei2a*£-x C5>?£ T/fX£LTmi)H. £ £%£*l#A611&. *Xn y x X HI, ATAC-PCR£ h N ©^ftOjtgEICjSffl LfclJgiTCX —XT&D . SS^= ? X X T © ® * © ATAC-PCR © SI ® T (1 jl ?> ll & fr o tc 9! #! 01 fi £ -5 £ £ fc * 0 # -5 it* £> ■5. ATAC-PCRTI1. #'J A#EM&&oagf T$H(1. mRNAJEW© t>©T%K86)l2te fflSH42tl:ft50-e, XX7f y > X£g(1-5HM©RNAT&7 T fc, f-©B2fiJ;X'E Un«381H:S*5Cil:a5. SEC. 82 e>^»i liSPCRjSttft £(!*# < « n^7Mx'©emcm*£-xj;7 7k*#6a*tmsn^. ^n-icourn, t h © (@ ft * (2 ■?> cDNA£ s£ 0# 12 ft U $ 82 8'J © j* © & £ # X

-110- ABI3700 frZO&tjT—PfrZ ATAC-PCRW #rV7 hZgWj&JlzMMZftZftZ _t|d

OTntcD^&^x^Lfco MS cmtAacDNAom&m^LT, aswa,

^(7, c >>&#&#, ®E

>y;i/>>mn4x$3Tv^^a#A6 nz>m&\z\tm*W)*'u~Dtio ^n^osMScio, Srfc2nfcse f #llA^m#lCkoT^lO%l:&/u/:o c. ATAC-PCR x~ 89.5 %Z'$>Z> Z hifit)ii''0 tCo C(7)M^ j: 0 , ATAC-PCR(D ff S # (1 Northern blot M # T $>%t%Z_t>nZ> o (m±B##E#E^)(:OWT^)M#^Northern blot^ff oftifi. mca f-fcfb, ATAC-PCRlCcko T!fcJgit£8U£‘f 5 Cl £/0*eJ^T'&o tz Z £ft 6, ATAC-PCR IZ^Mmom^-yy hTt)^#ittl, Off Sit ^Northern blot t 1*1 m > ^ )V z$>z>&ntzj]&z$>z>tmm'£fttc. d. X— SllEl^^3lHlgBS(Dl259^ D —XlELT, ORF# #T <7) M ^ ORFS>360bp& 0 , -£-<£> 5 *>5 ’UTRS880bp (D t)(Dlt 855^ □ — >T'&o fc„ 5’UTR ATAC-PCR T ft fz.~b (D\Z5W 7 U — y (59.8% )T&o feo —27, ORFS<240bp, ORFS<360bp0cDNAB?m ^ *1 ^ tl 131, 372 ^ D— >$> 0 , ATAC-PCRT%#S^^r#^%(D^^tl^tl42^7 D —> (32.1%), 133^7 P - >(35.8%) T'Z£>&, ATAC-PCR

6. 5

ATAC-PCR^/\%xA##i:ko, 7° □ 7 7 -f ;u £ 1 0 * fc 0 100®^^ {I O H T^it £ £ £ # pjm t fc -D fro

&(D]Elfef3LfeM

— 111 — #W, D^titi¥Sr[B]±d1i5C -h^T'SZc, $7t, 3@I@C«S$XxZcafiTS **»©m@##&$m L7c*fuxu--y 3>-*-xxMi. f -X©»&%#f & C tic$oT, 4-0#)*lfcS8*«T-?^-Xtt*SttiiSHt) ©t&O fc„ BfWX n —> COtATIl, E9l|##©E#t 6 t ICATAC-PCRIC $ -5 #J £ $ fr tA, 1739 «fi;?ci#j$^6 94iaG^iconT, 7mmim(o%myD77i';i/c6^Te/2. *#yo? «0g*WfflcDNA % — — X&##L7i. h h%8yn ~7 7 'f A-5r$l-(t;U^:±T^:MICiKSU75:'T — 5” ^-Xtl, , C(Dft*tt©i«Gx-^^-Xtt*fTSaGG(D«®*#TC$»fc 0. eUl'Mc&o T fcSidTWfflT&S 4-»©BBi:UTH, *5»*T04 tW^StlfcVX h$r t— ^91 6H-&. f**5-fe» (DATAC-PCRlC^HTffiM-r 5 ffl I® » SI i: K tt L 75: 7 7 -fx-, ©ORFtftJr, ®T5 /$E*iJ©tt6, 7<£f©#jn1f«^sa?iJ«®. BS^a 7 r -f ;i'£#tcSE£L7i:x — — X©WIST35-So ATAC-PCRtl, t#x&. $75;, RNA©S*S$8 ^^»as*©i6S6*x%^, cn^e.©#x hgv atsstt-bs# &##&#&7h A ©##%&#?&%. LfrL. WSXn S n-;^©<*$|d 75:5, 2 tlT IA 5 t ftH*.fctA©tf8mT<& -5. *E5ET 11, ATAC-PCR&^Stf 6R%#7)m £« me Slit*?? i5154yXfAOftH«-C SZi^-3 4-#H, ATAC-PCR©fWMfbt#W1-&;&K#:&& 6H&. g*ag©«nZci$«T$5ATAC-PCR$:ffiUfc#6g#StfrfflcDNA©-#'<-Xi$, f © «Stt. 4i$, E ^JttSmfftiil^^gtogiSitGGCXiUTfc, ATAC-PCRlC j=5f — LT ff x d £ T, *#9n?##L75:f — —X&#Met. $ fcKWC 'b^EH^-HT© < £

6. 6 SlfflSltt

(1) Maruyama,K.,Sugano,S.: Gene,138,171-174 (1994) (2) Carninci,P. et al.: Genomics,37,327-336 (1996) (3) Ohara,0.,Nagase,T.,Ishikawa,K.,Nakajima,D.,M.,Seki,N.,Nomura,N.: DNA Res., 4,53-59 (1997) (4) Kato, K.: Nucleic acid Res. 25, 4694-4696 (1997) (5) Matoba, R., el al.: Gene ,24, 1125-131 (2000) (6) Kawamoto S.,Onishi,T.,Kita,H.,Chisaka,0.,Okubo,K.: Genome Res.,9,1305-1312 (1999)

-112- f 7f /<-X(D4#m

7.1 ##

iil-iiJgU i*41ISE?iJll!f#^*i$jxoofc5[l]o

5o ±E^ y ASSSfl^lJtf S £ am LTX ifi^cDNAI^I^II^ofc^o cDNAfi.

yy Agg?'j#4:-c\ ^scdna^ &im\ mamcmm LT##B##f&#imc?T 9 c o, A&cDNAiimm# #ft^^T$)66#XL6ft6o AAcDNA(Dm##@#-C&9, Zft^-Cfj: cDNA(D»f^@E^j^^m#(ciR#i-62«k^fT^^iL-c#ybo n%)\07j mt mfe £ ft TV'5 aft ^*DNAE3W--y ^<-;*-efc5GenBankft-f-XM::l£e £ ft T l' £ A Fb1 (D ^^cDNA(D BS (1^] 8,000-C h 6 o -A, Se^ftM^-T'SB#^!!) LT ^6#ef(D#(±io,ooommm^^#%.6fti:v^o %^-c, mf-mmmBrrn^cDNA ^^-e#ft(f. A&cDNA^m%A^ ihMftjJDiSi-5 r do cDNAfiWft^V'Tft, ^ !i y 9 *$ DNA

f-^lTit^fcDNAoitf^ii^ftoo^^o :ft^iit©Msa^j^^git^ < r £ aft 6 A $fttl'5o

-ft6k hA#cDNAe##ft±M, Bit^xy^y^fAX,

r, 5,001 y n — >l a 0 e r (Di*i2,5oi y n - y (DSS Am#A#cDNAT-y^-y%#^m#i-6<&:#ft, A# cDNA(D#imTyy-y 3 cDNAE?iJ&teirt — y 7st a0 &*>, y ^ y * cDNA(D##^^ 0 y y yef%^f^lTVft 0 (Dm&E^j^##ri-6o $ A,

7. 2 A#cDNA(DS&E9i&e

(i) DNA##%m^wc^e • stfffflyy h y

-113- a. 79X3 PI-100 E 7 7^7 (##$%# (*)) b. (Ei) 1029-40-2AW ( □ H > X) c. unmw>mw (02) GeneAmp PCR system 9700 (PE Biosystems) d. b° 7 U —DNA'>"X >th— (03) Prism 3700 (PE Biosystems /HITACHI) e. vx h7%7 Sequencher™ Ver. 3.1.1 ( 0 £ V 7 h X x 7 X > X~ 7 U > 7*$c^£ft) f. 77<7-m#fmV7h7%7 Oligo™ Primer Analysis Software Ver.6 (SSjato^^-til)

(2) ## cDNA^jf Abfc77X 5. L ZcTcJiB^ (DH5 a ) 25007 □ — > (D 7* 1Hz P-/I/X h y 7£,

(3) a. f###m ESE7J^S(7)ElB§^E4iI^1-o ##:%##7;i/^ U fcfc: £ 0 ifltti L fc 7°7 X a H DNAiC, &A%77< 7 —, ATPCR^ffVX 7* > 7“ & \Z £ 0 DNA£#fi T£o 7 XTV 7XG5O0XA7# xA&m ViT, j: Dm&L, M$#^7-7%77— b. yxxa MDNA#m#m (D ##:#^#$r3mL(DLB%%4:T, 37%: • 15#^ $E

Utco 7^fn —X77l/S^^1c$jilti*¥7 77 U >77 7°(D$<$j1£@ (Mupid-2 : 7Xt /W^-(m) m#(D/%77-&^^/:^mm#^?^5,000m&##T^6C a&#aL, 7^O-X7;i/#^^0.8%aL^:. 77X^MDNAa#(:DNAT-

vr

-114- c. dnau tu'&'tnu (D PCR^j^ 7°7 X ^ KDNAlSOng, 7°7 7 7 — 3.2pmol, 2mM^ft X X7 '> 7 A , 40mMh'J7ii pH9.0, X7^v7X2jLtL^llD^_##^10RLAL, 96"CTl^^%#, 96"C 10#, 50"C 5#, eor 4#(DMi&&25[R]moigL&m, xi/^^xx^m^Lx (X > ABI Prism BigDye™ Terminator Cycle Sequencing Ready Reaction Kit (BDT) -§: 572 Wc= —ABI Prism dGTP BigDye™ Terminator Ready Reaction Kit (dGDT), &l^iXABI Prism dRhodamine Terminator Cycle Sequencing Ready Reaction Kit (dRT, PE Applied Biosystems) $: A ft o (2) PCR##(D## 967\(D7;i/7X7U —> (MAHVN4550$U^7(#)) i2t7 7%XXG50 (17-0042- 042777^A77;F777(*)) —7%JI/^/c045//L^#L, ##7K300RL^AOX., l600gT5^^^f-5 2:(l:[2j:0, #mu/:DNAm d. 7—7%>X>X ±izG###(DM#eME%##7^;i/A7^ 7-X77- — ABI prism 3700£ AT E^’J %'fefeLfto '> — 7 X > +h — CO + T tf 7 U — gte50cmch L , (2 POP-6 5: IT 7. — if — 7 7 zl 7 T CD fl >T?7K Ifr ^ f#1 T 7 — 77777X&1TO mu, ^7tf7u-g;(D^^&m#^#(D5or^64or(2^ML, Isfto e. X7<7-a2# ±izGL/hV7h'7%7&^AT, 7°7<7 —^(Dt;>777 —Jl/, 7"7<7 —l^KDll/ — C Jl£ pME18ST 7 &—fib ft&J8 A ^Forward (FW) 7° 7 7 7— 4oct 2/Reverse (RV) A L T teF00,R00CD (2 FX : GCCTGTACGGAAGTGTTACT, FY : GTGGTGCAAATCAAAGAACTG, FZ : AACTGCTCCTCAGTGGATGTT, RX : TTCAGGTTCAGGGGGAGG, RY : TnATGTTTCAGGnCAGGG, RZ : ATGAGTnGGACAAACCACAAC cfT :TTTTTTTTTTTTTTTTTTA, TTTTTTTTTTTTTTTTTTTG, TTTTTTTTTTTTTTTTTTTC# £ "□ L T71 A

-117- (4) a. 3 9. -m(:450^^)550m*(D##^ojmT$,6. C(D^SM> 3200#* #(D^n->(:oWTkL

b. j;&#m@E?iJ ##(D BDTTf J t£ fc ”GT-rich” $> Z> lA te ”G-rich” (7)E^-^> b \Z < £ tAtedRTSrffl b£ b t \Z&0 . ffltfr&'M&fto

C. VaM^SME^ #M(D^UA(7)]a#TH"slippage"0^:A,

120- FOOti X IF ROO y 7 — T (1 )W tif 15 H & ^ d — > (ItllO^n — y) |;o^Tlt ^ v ypy^^ F# fT(D#f (CjoV^T, Ztl 6(7)10^ P —yiCoVX5-10#(7)P Pzz —F,DNA&f5#Hii-6 C^(CZK). 2/5X)^5/5(D#g-e^%(DE^##^^^-e^6Z^^^A^/2. P — Z ^ DNA^^-T^v^ L/: t)(D ^ #y F)^,6o

(5) Mmm#2:fLyc2500f P -X7)[fXl, ^ 9 y ^ & -Bit f\ (0.2%), —zz y ym^T-TX 7 T ^ F 6 t (D, ^#@d^Jlc j: 9 ihL/bt,(Dyi^23^D —> (0.88%) ^

A 5 cDN A y 4 7"7 V “©f ^ It ^ p '— V ft [Z. o V ' X (± ft # ? p “ X- ^^T##$r#y)mmm2500^ p -^(7)#^^ B m L^o HI 1^6^ (±^#(7)^#^#^ (D#AX 7^(l##f^(7)37:^±(f("B$^^#4'L, ^#^^##f(i8^y)^6^#Ly2o 9 ^ X)X(±^^^ L/c Z 9 F:^ 4^#7$^400^ p —X7)^ —2^ ^(^(192%^^ #TL;X ^#2501 ^p-X7)T-^^^9y ypy^:^ ^,^^p-y^g3z^yc^.

#Uj ^ P — ># d: #^c*%

#m 0 #mm M#T 3mm $#(- #RW H11.9 468 19 449 449 18% HI 1.10 300 13 287 736 29% HI 1.1 1 406 5 401 1137 45% HI 1.12 490 1 489 1626 65% H12.1 376 0 376 2002 80% H 12.2 296 0 296 2298 92% 203 0 203 2501 100% ^{d-^P —^#(:2524 (^#^ifih#23)

7. 3 ilcDNAf—M-7y7fA0^fiE

(i) y^yA(Dj%^ m(zc/xyA(7)a^x,w% ^-7E^^y:Ammii^(:#Li:,

^#cDNA(±m^y(/;m^A#{:)f(cm:^W)nTy)5z^7)^m^^aTjoi9, Z(Dfr)Ufk(Z .^.y 67)'6T^6, %oT, cDNAgd^^M^^y ^ y

-122 7L, L^L, ^ [al^f 0 , 7 7 &ff 7#g#C& ^>. -E-^T, cDNAE^#R%$r["]±^#^^:^^E^j(zMLT^c^-69

^^iCckoT, bft o i>#DB (DDBJ) i>#DB##x7xA^j:3Tlf3o *7o XJ:7 j:9^:A#(D±#cDNA@a^J(Di>*DB^(7)@a^j##^, ^6T, -ecT, %\ ±#(7)^:#cDNA@2^J(7)^#DB^(7)E^m#^@^. tb t£ '> 7 X A £ $t IS b 7c o

VX 3:&cDNAkL *fRf6bT(D##^^T^# LTW6(DT, ^(D##&##T-5C^(:j;3T. ^cr^iMTkL cDNAE^c*LTmmoj#g^#APR(D##T^e m/:, ^emfmoAmcDNACMT&^^m&^T-o- 3M,T^<(D^#$^#V^T, ^=Pf@(D^:i#cDNA(7)

cDNA@B^J^±#cDNAX —7^—7 mTTn, 1) ±m#x-7<-7^#(Dm^, 2) 3) m^x-7<-7(D##. m±T^^L/:S/7xA0

(2) ±m#x-7<-7^#(DM^ a. x' “ 7 tu 50; 11 '> 7 x A CD ?M % *y%xAT^, cDNAEm#^&n%±2t!-'5/:&w:»

(cLT^^o lEsi^xL/t^TCDMfiT&^Mi~

AATGprf-ct Zffltfr, 2) #^^^l/c33(DORF±(:^^6, DNASB7J 7E^ 6L^(Dm#T&63-x/>^^x>x-v;i/##. 3) 77/^7# X-7^-7 D7-#m7ci 77 Ablastx^transqU^^T'-^^ff 7. C^l. 6(D##$gm^3 7 Ix-A(DORF±C77 7 y #;H:^L, t 6:(:gB^J%7-(D or#Me(D&6^^(7)mm^a^ojm(:T6. blastx*transq^, ^6DNA*7$ym@E^JM:^7D77A (blastx) B]yf'#TM^b^@B^J%7-(: (transq) C^UlckoT, DNAE^'J DNAEfua 7 ^ y mE7'j(D7 7 &/:, 3-xy>7^x>7^;i/#tfT(:^^T, 3-xy> ^T#x>x^;F^7$ym7^-A#^:DNA#^[:t)^3Tm^^fTA^,

-123- ^)im(Drn otlHiot, X7-(D#^€:ff 7 j:7 7^ - KXy 2 b. l^DBl#'/^fA©|^ ^^DBlIyXfAH DNA DataBank of Japan (^T%# : DDBJ) ^ h h©cDNAT m### (77 7-7a >## • &SK7J##) (DXtl • SIE •

^)o (D m^rnm ^77 7A(Dm*mf#^aTOm^T&6. 1) 777-7a777<^ (7yE^U0777h) (DO-M't-7 2) SXE7J7 T -f ;l/ (PASTA# 73 OD-H • t“7 3) 7“‘7 7 pf — 7 y h 7 x y 7 7“J1/ rParserJ CDSSS) 4) I"ParserJ \t, DDBJT'E# $ tlT lA 5 #ff7 5) &7—70HS 6) 7$/mm^## OKyoh^7 7 A^rfr'ir (DlEfflS^El 5: IHiolc:77o 1) ^^##HS77 7 K 7 #%^#^l^)7<7M7. 7:c'y^'7-;i/ (Parser) 2) jSXE7J##HS7 7 7 K7 z:z.i:\t, m*E7U7-70^K - 7. 3) x7hU-####7<7M7 X7 h U-##7-7 (x7MJ-£) • IES7f770 4) #m##W#777K7 ^ilJSS##7-7©^^ • urn^ff 7o ±X7 h U-T# I^7^-?t“ ( □ 7 — 7 g 7, ^t'J 7 7l't“R^7t’J 7 7l't“©i^t^ &) (Dca^7"7. cc(:A^L^##^ex>hv-i:gK)%H ii)jp^n^>o 5) 7/-7^-##m#5<7H7 7^-7-V-##7-7 (7xr-7t-^%U:o7-73>##) (DIM- mMzftOo 6) 7^U77<7-##mm777M7 CCTd, 7^U77<7-##7-7 (7^-U77<7-^%Z77^U77<7-(D #) vmw • ii^ifx O *777A(D#6 *777AT^J}7T(D##6^#^L^. l) U77 777##^

-125- 0 1 y K^##0

-127- h Lt. 3) SKgKlWCx-^A-XICSfl&Bjifg&X*-Vy hTS#^?-^$^B6iiiigcLfc„ c:ne>o«S6cXxA©#t$ *yXfAtt, 011l:©-fJ: 5 t:. *-7 hV-XACSS-fS+l—A*-AXxf7> h fr B#J$S fl-S. DBit — A — Til, ificDNAfcSlTSSSftyXfACiotJiJ’ ■?•©)$# IS* £ t b IC^rftcDNAx — 9 A — X ICSSS ft-5. $ tz.. wwwth —A* —6 ©Etia >1t-A*-ffli|^e,©B5) hTIJ, WWWIt —/•?—CWLT^SSrSIfT s-s, > 7.7 A 1C

&MVrmmtzmi2iz*-t£v t. orfxxx -/3>yXrA, 7-j'^-X1t-fLCS73'a >>XxA, 7-;t!> Xryf-yaA'/Xfi, *£*S»XxAfr B#J*£n-5. &##r##6l:43©T, * ©»a«igs*su7c„ if, hu-^#©#m:%y#, «®WTtn 75 yKE?ij6<@»rv, ^©e*SDBi:asti>. *fc. sssnfc#ffiB*ic*rr5

#mf--5 7aL, &#x-7a-x©fiiw$bbl, #&#©% ft T ©ft ©SB## £ X--3.7>S^T©#|gf5J:tf#IEe=J66^ Lfc. © 0RF77f— >3 >'>XtAOBS *©4@m©@g7u##&*ef 1) I»te3 H>tW1ff : Mten h* >©tig^MSrff 5. 2) t7-7#m : 7>A7K©7 5yK©E7iJ^B, ##&?#!-f-5. #1:. *©2#S ©X— 7A— X S. • Prosite • Pfam 3) 7>/w*©sftsef#i: 7>a-7»©75 y ke^b, -eft#** jaw©t*©s

4) 7>A7*©g$e«8Mi^#J : 7>A*7«©75 yg!E?iJ^6, RSI'N'JyJ^f SJ -f 5. cn6©«ff^$ttT-7^-xcsssn, c:nB©E?iJ-$>. © x — 7 A — x it—7IZjtzJ< #167 y 7— > 3 > >X x A 0i5t:S7=tx C. SScDNaE?') Affimtt©$.4SEE7Jx-7©7y x-As >1* SSrBff't&A At:=fcoT, ±ScDNA©#t67y x->3 >5:175. #167 y x-> a XRffTfl, -f >/\7Xx-7 A LT#mf&A#cDNAE7ll:*-f 6. E?m—It • % A#, E?i]Sl(Htt£«iSS*:ft A LT, Rg-f %A#cDNAEfiJ©##&m©-f6. Sfc-t

-128- J55 cDt I A- DB 3 Search Svc-tem Menu - Netscape 3sE #* ■:i hstanl Message ffig ffl| fiWd ®J y>/\ —X Integrated Sequence Analysis / Integrated Analysis cDNA-DB Select AA Sequence Hit No. Search System r Analyze Motif (Prostte / Pfam) Hit No. Menu r P re die t Protein Localisation S ttes _(Psort) Hit No. [**** Predict transmemhrane Protein (SOSIJ1? Hit No.

3. Annotation Analysis Analyze Motif if-lAtl Add Prosite (PS) 08 Condition Motif }HIH mRAi F AM A H (Average hydrophobicity) Integrated Analysis P Value 1 E FO Clustering Result Quality P High (H) P Mic Very Low(VL) HTH A RAC FAMILY 1, F’ATTERN Bacterial regulRt: ry prot /yi) ZINC,FINGER C2H2; PATTERN Zinc finger, C2H2 type.doi ZINC.PROTEASE, PATTERN. Neutral zinc metallopeptidas-= 2FE2S_FERREDOXIN; PATTERN. 2Fe~2S ferredoxins, iron- Add Pfam (PF) DB Condition 4FE4S„FERREDOXIN; PATTERN. 4Fe~4S ferredoxins, iron- Domain |AhpC/TSA family AA_TRANSFER,CLASS.1; PATTERN Aminotransferases cl E Value [0 AA_TRANSFER,CLASS„2; PATTERN Aminotransferases cl _AddJ AA_TRANSFER_CLASS.5; PATTERN. Aminotransferases cl AA_TRNA_LIGASE.I, PATTERN. Aminoacyl-transfer RNA s AA_TRNA_LIG ASE.II.2, PATTERN. Ami noacyl-transfer RNA And / Or Condition

No. A p No.B IF " A And B" Add | * A Or B* Add Integrated Analysis - Netscape EMEl Oz-OKF) mmC(G) Communicator^)

Result of Integrated Analysis

F DB Search & Sequence F Expression Profile r Mapping AA sequence code : a-bb / a~bbn a:0-0riginal / R-Revised(by "Check System") ,tf-7 bb:F0,F1 ,F2,R0,R1 ,R2(6 Frame by FrameMaker), FAn,RAn(ATGpr Forward/Reverse No.n),FX(Fixed) Seap^ffce RetrietW^Ad Score Sequence Motif Name ATGpr Psort Sosui ' Pros it.g Pfam 0-FA1 0-FA1I O-FO 0-F1 0-FA1 0-FA11 / 0-RA3 C-HEMBA1 000001 0-FA1 0-FX none 0-FA1 L pattern ^§2.599998%:nuclear 1 Domain > 0.810000

1 3.0RF7/f-y3>yXfA(l*lt5

130 (0i6). 1) ammaeux h 0 Swiss-prot 0 ft—V— K$?#T

fit aIi L ft Swiss-prot 0 ft “ *7 — K1W ft -3 T > H ill ft -S Swiss-prot 0 X > h U — ft B# %0i#i^< >AftXft-^0AmcDNAE^J0X> h U-^-^^^ft^,o #Swss-prot0EC No.^?t/f fa^lftcEC No. ft Swiss-prot0 X > h U — ft## L, BiMO^lW >A^f- X 0A#cDNA@g7'J0X> h • Swiss-prot 0 ft S; ft ft X h ## m^L7cft#0ftftX Sft^^Swiss-prot0Xft h U-ft##L, B##0i%Wft ft A ftXft-ft 0£6cDNAE7lJ0Xft h U -ft-KSftft5o # Swiss-prot 0 ft fa^L/7cft%@ft#^0$)^) Swiss-prot 0 X ft h U — ft ## L» Bitift 0 ?6 ft ft 7N ft Xft^ft 0±#cDNA@g^J0Xft h U -ft^K^ftft6. #Owl0ftSftftX h ffltft f^^L^cft#0ftftX h ft^a tfOw!0Xft h U -ft##L, B##0i^VW ft/\ftXft “ ft 0 ft: HcDNAE7J <7)X > h #Owl0ft#@^B## j#%L/cft#@l:M#0&6Owl0Xft h U -ft##L, B##0^^ft ftAftXft- ft 0 ft 6 cDNAE 70 0 X ft h #mRNA(h(7)|p|—1±##

fa£ LfcrnRHA^Xj-ft'S IrJ—ftftTX — M ft## L, # ^ T 6 ft >A^J“ X0ft6cDNAE7O 0x ft h u £o #mRNA^0^ft###T m^L^mRNA(:^ft-6^ft#0^ftft^T3- Mft##L, m^ft-^ftftAftX^- ft 0 ft 6cDNAE 70 0 X ft b U “ft-H^ftf £o

^-###^^ft### j: 0#6^6ft ft/\ftXx^ft0ft&cDNAE7iJ0Xft h U - 0-Kfti$cftU 2) M#k<;i/(7)m-# c e-mi, ^toi/-#m(7)^m#6a6miRi#(7)X3 7^e)#m$^^Mmix<;i/(7) mW1]fe\Z.-O^TmmtZ>o NCBI (National Center for Biotechnology Information) L /cblastxft fOffl ~T %> <, d ^ 0 o'^Oii'Tc ##ET0ftft ftAft%07a 7 Swiss-prot#0ft ftAftM0^-ft^- X (FASTAX;*- —VyH ^-0^^(hbT, ft^-xm0XfthU-##^. BM0fB4# 3 ft-fc >thxs> Identity 0^ft t, @

-132- >gi|l 169338 |sp|P2191 2|DHSB_HUMAN SUCCINATE DEHYDROGENASE

(UBIQUINONE) IRON-SULFUR PROTQN PRECURSOR (IP) (IP

SUBUNIT OF COMPLEX II)

Length = 280

Score = 581 bits (1481), Expect = e-166

Identities = 278/281 (98%). Positives = 7tK)/281 (98%)

Query 134 MWVALSLRf^PATTLGGACLC^SRGAQTAAATAPRIKKFAlYRWDPDKAGDKPHMQT 313

MAAWALSLRRRLPATTLGGACLQAS^GAQTAAATAPRIKKFAIYRWDPDKAGDKPHMQT

Subject: 1 MAAVMALSLRRRLPAXTLGGACLQASRGAQTAAATAPRIKKFAIYRWDPDKAGDKPHMQT 60

P-V alue = logl O(Expectfi) I dentity ex.) logl0(e-166)

Decision Tree: id <= 53 : | id <= 47 : | | pval <= -41.4202 : | | | pval <= -44.6383 : c (15.0) | | | pval > -44.6383 : | | | | id <= 44 : c (4.0) | | | | id > 44 : b (2.0) | | pval > -41.4202 : | | | pval <= -0.0409586 : c & 14.0/5.0) | | | pval > -0.0409586 :

0i 8. y :iu x'a

-134- 3) ^ScDNAE^L RtXSwiss-protOX> h U—+ —7—

— OCD^ScDNABB^HC^ B LT, t^U'J — &f&0j^t#f>nfcSwiss-prot©X > h V-, m^U'OV, &£V^£(DX.> h U-^^'ti'^-7-^-7- H era CIM % y f L, lx/<;i/^4e(D4:-7 - El9(DS^'f# 6 4lS = T&:b*> , ^rficDNABB^J HE — 7— -5 o CD f# 6 ^l/c. i^ScDNASB^J

7^T'cf-5o 7c C-XXX3, 7>'C-XXX40^:§ cDNABB^J^#^T#6. EC No.##, Swiss-prot l:ML tt), 4) IH—EStirTMt, mRNABB^J B:0|1I— 14^£^T ^^7— M ZVefe L, cdna 8b?ij h v -

H-E-S^TEEn- HB:te, zh^E

IDh, IDm, IDK NID04P ^)VX^ U T lA 5 „ # £ IDh |1| —14 iW < > NID11 IrI —14 ^

5) ^AE## ^AE##"C11, mRNABB^iJ B:(D^^t4^^1-^E3- F cDNABB £0 mRNABB^JCMLT#^ (D^scDNABB^jT^^nv-jw^^nsv, (#m zom#-#-6 7;i/37; XAE j;D^^E^^T^E3-^±E$:^T^E3- MB: lilt FL, NF, NL, HP, PF, PN(D6@ET5'#J, 3’#J^ tl^tUZ 7 lA T tu±&h'j-B®7 V ^T X y MW Hx /4> 6 full = nasq - nast£r H tB L , full## A CD X > h U — Id O VVC, 3’fill (D^±E(D^#11, by MX>MU-B:(D77 MW#^6full = naeq - naet£Sffi Lfull#^#/J\(DX> M U -l;iO^T^n^n®220M0£*K$:5£'r£o NS#Ilk y M X > MU (Dcds= (NS, NE) 1WEJ: 9 $#T5 = 6) mmT/T--'>3 El23t~ 7KlTcfc O (C, —• 7 ECU X h > 4i # # V X M, 1$ A A xEX M AA£tiMt"5Xc#) M7£S^l"5o • Swiss-protE —7 — M ## • Swiss-prot EC No.##t • Swiss-prot^ #@## • Swiss-protES A 41 X M M$X • Owl±####f

-135- / \SPX> MJ 4=©*- r ^ScDNA 82?l| K-1 1 K-2\ K-3 K-4 K-n

C-XXX< POO; A B C-XXX2 C : D i C D

C-XXX< J(B> : D D A

C-XXX< , D A

C-XXX5 D C B C

C-XXX6 D D

C-XXX7 B C [ C \ C-XXX8 D D C

C-XXX9 B \ i D C-XXX1 0 \ / B c ■' • ' ** V' •: * *• * "■ " • •

0 1 9.ific DNAlB^lJi ^r— V~ KOH#

Ic / Is >= 90%

Tzt^ytyV ^ >= 300bp Ic / Is < 50% Identity > 95%

NIDdH-'ttfcU

X Ic : zq>tZ>-yX^. Is : Query R?lJ£Subjectfi£?lj0*sU fiO> K?'JH

0 2 0. IrI—I4S*^1"5>E=-- K 3’ 5

nasq

query

sbject

02 1. ^gcDNASB^J (hmRNA&U^ EST(CO^T H##

nast

NF Bn . .

hpH

NS

$ t h_2 2_G (20)

PF $th_21_G (5)

$th 11 G $th 12 G nasq (5) (20)

El 2 2 — K

-137- • OwiESt-^X bmtir

• Swiss-prot 41—9 — K 4^-9- M«k^#1x^71/a^a. • Swiss-prot EC No.###E EC No.tmfa U'Ol%Mtir4kfrt'?Z>o • Swiss-prot

• Swiss-protES "X* ^ X h E^x^X H2:M#1x<;i/$:##^EaTa.

• OwlESx^X h###E E#x4^X • f7^JH'

MB!

hy h L^^^cDna SH^J cdx > ^Ta.

023^^f \z, OT^a. (1) ##C43WT#mL^^E(D#^ (2) ##(D*@m#Sa&^#cDNAE9J(DX> h U

(3) ^ScDNAE^J(DX> h U — M T a Swiss-prot ^ 7c ^ / ^3 j: ^ Owl (D # #

Swiss-prot (C D T$> a o >X •^srapRX ^x^tc^tta^^Dv—v >x<> f/J'tDy —th—f O h77pby h (7) X > h U — ## Acc (X/h'J — ^0 'J >Xf4)> EC No. 0%EC No.(h4^—9 — M(7)M#^#^(7) U > X #) > DeffinitioiK Keyword OK 4^—9 —K X #)

^K±b v EC No.^3i:r;+“9- H £ >bl;, M#k^7l/A, B (ab), C

(ABC) 1C 43 O'a ^ Swiss-protX > h U “-ffc £:I^ thTT a o $ 6 (C , <7) S tk 7^ 6 > *€“ u >XTa. • ^EDy-X-fO h'y9°b'y h<7779-fX> H43 j;^M#lx^7l/## lx ^\)l/ (R-revel) b, 4r4l$r et#^ a^:)6(7)^-@/i9X'—Xo

-138- Owlicilf 0

t D $/ — th — f 7 h © x > h u — Accx EC No.x Definition> Keyword by'fhy h©7v-f/>

m24lzmmr / x-va LTx +-y-Fl:fgLMftf pJ|gT$>^o C(D=k3(:x ^yXfAlc^^m @B?m-e - E^TS^ScDNAge^lJOlWISM^^ # L Xio

(3) a. < >/\9XT#m$#l^cDNA(Z)±&d3 6^6. a-c^x mA

-5yu y fy -a-ij, *ist ^Sd^Jlf > F 7-^7 chWX^o #&x European Bioinformatics Institute (EBl) Ttl te/W Z? U y H 7 — t) — £rx“7 ^ — L x JbU l/T®§o $ ft x National Center for Biotechnology Information (NCBl) Tllx CT(h©##%0^1\^ LT^o 0 x 4^#®z&i#L:)o&a. iix v M7-#-©#am&#@##

T^t)^x EBI^NCBI©X-7e##76 C 61: j;oTx ##%0±©%##/\< 7*U y H v-73-i-^SUT^^dbEST-^dbSTS0gB^J1WW5:5j<6b^ ^ $ 5KX ^nt>/\7x©@Bfm#&m6 #:©#@%0i:7 (025)0 tf:, £siw$£Miw#fymu >

(026)0 ±sb©xo©u (0 25x 026)x < >/\7%©@B7J##^^a##a u >7^#^ c a^oj#c^6o < >A^7XtCd3V^TE#L^cDNAOBB^JtC^ST^^fe#:±(7Mir fl1W#(DS^^x ^C1:U >/7T6^m##om^^B%aT%0 #l:x ^@##6L/Tx #m%0^ ^^x m^TZ'f >/\7x©x-7£j# ###©^±^0-5o $ bl:x LT^Xx v bf >y$a6<>AC7X^:#cDNA6:^m##^^^ormCT6o 027CT v H>^##r©^66©x-7^U >^©##^^"7o C fl 6 ©##(:: j;oTx

^:#cDNA(:^T & < o X-lf-A ^(DfmV^btHZ^LTx Smt~5^ScDNA£x Ty ¥>?<&&$& tmMffimtt&lZ 028CTvbf>y##(D##^m:A7jim@(De!ie^fo C(Dm@^6genernap

-139- ~>r

15) WB §iiaWP 'if >cn»-x Integrated Annotation Analysis / Modify Query cDNA-DB -9-K A^J Search System Menu ECNoA* W Select leeclem CSP : Series-prel) SP Keyword ifetottA* SP EC No. SP Ortansm Taxocu SP Text f Or R-Le«el » A B 7'M-r A^ AND/OR Lofk«l Owety

ECNo

0 2 3.EtET/T-'>3

8tX£#cDNA

(P30G41

i-23460 P44411 P5697S (PI 8476 P21762

P36700 006B*r\P3S704 P35705 P48822 P52552 P80002 P91883 0 04120 0 77834 PI 8476 P21762 P26830 P30041 P34227 P48537 i

02 4.mHZ7 /-T-v3 jr-9-nc#g L/"cEEttf-7'>

-140- mm a £ile Edit yiew Qo Qpnmumcatw bate esse Mapping Analysis

Select Sequence

Pand is RHMap SG3 (cR) r RHMap GB4 (cR) DB Sdect P dbSTS P dbEST P GenBankEST f~ GenBank mRNA P GenBank genomic f~ GenBank unfinished genomic Blast ______E Value Range fo . (To Method Sdect Sequence by Name Sequence Name jc-HEHBAiOOOOOl

I* Sdect Sequence by Chromosome Number and Distance Chromosome Number 121 (SG3:0-1630cR GB4:0-266.94cR) Distance from top (cR) |l0290 o

Map li$t< dbSTS,dbEST ^ST,genomic ) mm Otoe No. Sequence Name tor SG3

1 C-HEMBAl 000001 21 44.3-49.9 1308 P0.7 genomic AF064859. 1 5 polymorphism 13_

2 C-HEMBA1000001 21 34.7-35.4 888 F genomicAP000044. 1 5 polymorphism 1

3 C-HEMBAl 000001 "21 34.7-35.4 888 F genomic JC604 59.1 5 polymorphism 1

4 C-HEMBAl 000001 21 34.7-35.4 891 PO.O genomic AP000044.1 5 polymorphism 1

5 C-HEMBA1000001 21 34.7-35.4 891 PO.O genomic JC60459.1 5 polymorphism 1

lamtHeT «hw ' 112250 Bone dysplasia with medullary fibrosarcoma 147660 Interferon, alpha, deficiency i 238300 Hyperglytanerraa, nonkitotic, type I ; 247640 Leukemia, acute lymphoblastic

> K^(

02 8

-143- /itilcDNA© >J X h ^i28^ C^to &±#cDNAt:#LTkL (028). b. iieOyXfAi^^T, £#cDNAgB?iJ

#, A#cDNAE^(D^m##a^^^C(b^Te6. cn&$UmLT, XvtfXX#^ T#^yc^#cDNAE^(D^m##<&#**^ XXXA&im^L/:. 028(:i-(D#f f. ^#cDNAE^J(:*LT^m##m&^^T6.

(4) iWa-T-f <-X(Z)## ^±E%L/tStff'>XT-A^E^UTfiJfflT#^'>X7LA^MBU^ (029). 03O(: 0RF7/f-y 3 >> f'"57/<-X'tt-f l"I^'< SlbT / 7 — XgX, Tvkf>^7/T-y3>(D"e^l^^U:^LT##^m:$:j@^L"r, 7fl6^± T^fef^ficDNAga^J^U X h oJt^T'fe^o

(5) f^^s/x^Ac ±#cDNA###(:MT5 7/ 7-X a X&# cc-rn, £t\ #6^i^±#cDNAga^j(D#m^ ##f L/c. ^6gB^J#(D^$^, A U y X XEX □- >£j 4000@B^J(:3^T^A^$§^^[g3i(:^T. gB^Jiooo#*#^ 63000#*# (c*(c^#L (FL)= it 3& (D #) H , ^#^-X<-XT&6 UniGene (7) ^ # mRNA (UniGene) > S H 2 tl tz H X V

^DNAE^u^tDfiEcDNA (KIAA) <£> # £ £ 7 L . Z.(D£o\Z, * X U X x X h TS 6 nfc±#cDNAte> £ il £ T CD 2 £2" £ & X D X x X h T# 6*l7;:£#cDNA

7. 4 emaatf^&omaRB

(1) $§# M5$

-144- frfejj&tLTURLrzo ^n->©n>^^^©rajs ^$>o^^x 2^^AT2298Xn->(D^ SE^IJ^S^^TL, 3^*IttSt)201^n->jiJPU, ^H-2499^7 D-yOllr^Tl fto b. ^S^iacDNAf“^^“XyXfA(Dii ® IT, "T —^IiJ&LIIvX'tAx B :> X 7 A x ###3 j; Uf$a;#(Z)Mi A X7A^^^Lyc« f“^iy®lyXfA(5;l t^x Hz/UMTa&mmaWx evx caawm%i/x^A&##BL;t. ffijT y f“'> a yyXfAiITti, DDBJ^O^HcDNAx — X SBffltiD'fr — XfEfi£5: ##7X7AALTx 0RF7/7-7g>7X7Ax 7-X<-X

It — 7 (C S < SlbT yf —ya yyXtA, X y H > X 7 / 7 — v 3 yyXtA$:l^ U 7tl 6 (D^-o ###7 7X 7 A 1/yho © LTx ±iax v ^>^7/7-7a >XX7A(7)^m(:x ^m7-X<-X(D##$:m^^t)1t6 C6 la-at, ^Sx^iMyXrA^iiLfco ® 5,OOOX n-XD^r^cDNA^ttf £^£-x — x^ — X £ IT, _LEjW#r-T-X

(2) ^#^lx ^^Utc^ScDNABB^iJ-tWIgAX LTx

c

M##x %#y/A##g;&x $6cm^Lyh7-x<-x7X7A^M%LTv^

W. ^ tl 'b o

7. 5 am##

[1] Wadman, M., Human genome deadline cut by two years, Nature, 395, 207, 1998. [2] Maruyama, K., Sugano, S., Oligo-capping: a simple method to replace the cap structure of eukaryotic mRNAs with oligoribonucleotides, Gene, 138, 171, 1994. [3] Ota, T., et al., "Full-Length cDNA Project toward a High Throughput Functional Analysis", Microbial & Comparative Genomics, 2, 204, 1997. [4] Nishikawa, T., et al., Database and Analysis System of cDNA Clones obtained from Full- length enriched cDNA Library, Genome Mapping, Sequencing & Biology, 160, 1998.

-146- m##^cDNA7-7<-X>X7A0##&^7L^. *7o>%7 /\7XH/-7v hE^J##>X7A&##*^a#(C, cDNAO#m7#j&fT3^^)(7)&

a. £ScDNA£>BE?IJ&5£ 96*7 7 &^711/7 4r t U -DNA> —7 —

01igo> IE 3\ CO 7iz 7 7 )V \Z (XSequencher^r ffi 17c 7° 7 'f ^ — 7 ^ — 7" > 7>£ ^ ^ SBE ^0 7D-XD3>7^##0^# fo'Dfctf, Lfro in 722987 n — y(Dlk sie^j^^^^7i, 3n \z\%mniou7 u-ymnh, 3-^24997 o->o##f #557 u

tC o b. 7^#^C-acDNA7-7^-X>X7A(T)## (D 7%#7 — 7 -7 — L T, 7^# 7 — 7 > — 7 # #>X7A, m#43j;U:^mX7A^M%L/c. 7-7#^@>X7AE:L

ffvx i>#DB##(D/c&!)(Dg #77 7->3>>X7AE:LTkL DDBJ^(D±#cDNA7-7##mX7AE:LT, ORF777->3>>X7A, 7-7^-X 7" — 7^&z7<##7 77 —'>3 >yX7A> *7 y My 7' 7 7 7—7 3 >7X777111% L, 7^l6(Dm^##&^>X7A7M%L^. (B ^mBM##^m^?@E^ja7(D#M###7ism^:#mT5##7777->3 >>X7A(D^mC, ^m7-7^-X(D##7m^-ot,#^CE: CckoTX ^m^M##>X7A7M%L/c. (D 5,0007 n —>(D^#cDNA(:M7'5^^7—7^-X E: LT, ±SES#r7 —7 7E£- M(c###7T^)^o'7—7< —X7##L^:o

(1) & L (2) 7:# ^ L (3)

(4) (7>X%^#) 0 *E^|fr^2000¥2^ 19 0 m77##2200#^^M (/W777 7o>-me, *A#co#M:{Um±)

147- #58* £ficDNA®*SE9J3fc$©BFa:

m a6tt 0 tL v -r x v 7 . •> t. x a x

s. i *m

t h^V AtoSEtt, 1!t5FMiB$T2003iF£BSibT:ie® BtlTIAS. eETttBS Sitfifl Li 0 , $i5r3:Ttt;3:tA;M$S®ST£ig!§1"5>73:.h*X fcf- H SriSATHS. C© toSli/H trJ n->*-®#Sf*Sllgt t>M«f -5*1. y / A#i@b*ef t, * iDNAga?ij©BA#tltsisntnj SNPs^flfflf^fcAICfc, % A: 73: gBt'l 7)1^51:73-5. i©is6X J A ®j*£^7)tyy Agfi?ii$ifei66fc6©5> hv-9 bUTasstirn-e.. $i. ctie. *6f©#m&B9B*4:t"%Abt:j:0, &##;#©#*, B*S®M%, SBCttSrS #®im*%

#*#A730#-5%©T&^. C®»,S^6/t-f ^-f-y y a U y X %#%§T*#®%6#cDNA50009 D —>®S$gE?i]$:gt$t"5>ZcS)C, (#) Hi MftBu (WBu-tMiyXy^fAX, 3iBS(#0. B ##*« (#) ®4tt^ 6/«£ 5 E%#7$J£t8$LTAU y X XE^fffSW© t b SSrftcDNAtogBtiJgWSX X - h SMi

7C o *S«ffl%Cj3tj'-$»(#) Hi-9-i x>X AXiAX® BSli, A >J y XXEtE0T7mi Li3g±#cDNA©y< X'y U - ® ft, 500X n->®ifi*gB?ij$ »S"T5 ^ b T$>5„ — #*%DNA®##r*&b LT, -> a y b tf > S A y I1 X- —i * — X > X'te**® B tl TiA-5, W#}1. ttRMfti&IM X'®DNAC, f LTftttt, gckbp® % ® KiS L T V> 5. iti'iibt'ScDNA® ¥ii31M X*tt2.5 kbpT$>5 C tfri, (Sffiffii; LTX’y'l'"?- X * — * > XSSrffl U5 C b C Li. A ©iSti, $t*cDNA©*#5008$8#©gB?iJ Sftgl, RgE?iJT*Ein,iXy1'v-T*®5Kl500tt*®E?iJ$E*- S*$69ICg*fE ®*#*f*«t5bO-i?^5. ^OffiyP-JBlCSt. y-i»I>-»y4f- x7-fe>7';v^Xy'r7-»n-^Xy'fx-^s&

®lSI5fflli, A*7kXy^—iy b±TtHB£A3Ti-5. 7X> h>** 3> ^XX>7:*- b'*3 4) 73^Tttm#©#< > b £M*LTIa-5. Lt»>Lt3 ^B. f¥*ffl73I5fl:*ft^ h y X;btt*©l«7Kt;jy>;3 tA. Litly T. HS«SX*;l/7 y X"f-5ICtt. 6 WIT C n B 7- X SrSaf 5 C i: ;»$!&$ i: 73 5. *H%T*t!5000HH± © Xyl 1 * b7»if78Stl, #m©e#f-XS#&Cb^T#6. A C 6T\ AIBiMcy LT500X n — > ©jWtfrilSIlti^ICTKt' ± 7 tt^TA'J y X

-149- Xff%0rT-«6i©SBSStj-fc. *1*. 5fl 4="mcSBi[Eg©X n->Ett£$l:hW#r£X 5'-h3-a-fc. aoro»D->tl8fl. lOfl ICE#dn/t„ MteMrobROffl 1$. x> f F- h®OTS^5‘Ra:3’sS«roSB^E?il (FOORtfROO) ®MS®fc**fe755&< , 8fl =fc V $□ &bh)sbfz. BllcSt" =k 5 IC3fl ST" 1:5007 n — >©($#? ISSS-B fiS Lfc. f$8e-ett:®5007 D->©SffS$Ry:y7'f

8. 2 **##&#%&

(1) jg^ftcDNAX D — >5) 500H®cDNA©6ife*5 7 D — >|$S:glteSt"= NT-2* JET 1/ f J -i >###©# f*Tig»«j©SJPT, f bTHgffl* (MAMMA1) dllffrtS (VESEN1) 7 D — > T £ o fco C tl 6 *I®B 7 □ — >®^7 S' —ti;lA"ftt1bpME18SFL3T$>o?c„ 7 D — > It, X y ^ > f- a — fTB* Blgi LTEttSfr. -sor

(2) T->f f- F©8@M xFfF-FicLTfJSB^rofxXS KSfflLfc. 7,7X3ftt, LLF® 4; 5 letttti LTJ 61:7# n-x*S»*jl:J: 0 7" >-9-- h®SS»eai$U7k = (a) 7/57. 3 F'ttffi f 7X5 H®#aieltT,7X5 Fttiti# (PI-1002 : 7 7f7K : SrflH* fc. iwseii, ws•ss• ‘f’fnfciimae •««• if. *JMB©B&tt»£3ml©LB«il6l:*BL, leTlSStraSj: 5#*f 7 Ff 5i, *JBBIi»i'#gl:i0liSfftT7JW U • SDSffi CiOM^ns. OUT-, 7xy-;H|©|it*Ci 0*#B®DNA^a6*75:£©7F *#7)5##LTm75^TfxX= F$7"@-U±#755#6f&. *%#, f 7X 5 F'Bf 7 f

□ /v-;n: Jit) «*«•!*;* ft. i*y-jHe «fcOjfcgsns. x7y-JF£*3SliSS l fc&. TE«»«ief7X 5 F*5»«LttmxS7i5^Tf •&. : 5til6tt7 7 S ■y&roRBf x hSSl't. ft 43, fxX5 F* ©##!$, -20t:Tff o fc. (b) MSRlX-f >-9— F©i6Sfi®tl6 SBKL-fcf 7X5 HOM*Rt7'f >-9— F©g®S$r*VD"f"7U >7# f OJ§t«lK* (•Rf "felH92PlM : /Ht7 7 FttK) T7# D-xyjH»# ITM L fc. fxxs Fii^xx-&A&5k±m*A7)5?i%5,ooom*m#T*^ca7)7 6 7yfD- XfR*gS-0.8%i Lft. f 7 X 5 F* t# ICDNAtH X 7 — * — £ :ki6 L fcl£, X FyaAfn7d FT*6L-CK#m*TTe©**/5X->&#"-9"- F©SS*

-150- b -X A b A A A <=^ H b -A bi □ -X 1 rx ate 4g A IK m 88 X 8S CM ,U ^ M K ^ c t ^ X 'x SS <0 b

,U ^ A 1J^<—□£ 1 5 0 0 ^D —>(7)6*

MAM MAI t humb « 1

1 NT2RM2 nt -2*. y y NT2RM4 m± 3 3

NT2RP2 NT-2*, y > >### 9 2 NT2RP3 m± 8 5 NT2RP4 m± 8 9 NT2RP5 m± 9 OVARC1 t 5 0 PLACE1 t 8 5 THYROl t t'EPttBtt*® 4 8 VESEN1 t h jflvgrtt&meE® t-7 -f v v -tg*«)!s 1 Y79AA1 h h retinoblastoma InSl^tBSS Y79 6 500

TSIIT

dBDT

03

-152- (3) 7—7X7777 (a) 7-7X7777 tt, Dye-terminator# £SEffl L/to ^(CPE-ABI%#®Big Dye Terminator Cycle Sequencing Kit (J7TBDT b B§ 7) L- fz0 ® #, dGTP Big Dye Terminator Cycle Sequencing Kit (J^TdBDT b fl§ 7) ^ 7 7 7 -V A GO Thermo Saquenase II Terminator Cycle Sequencing Kit (J7TTSIIT b Pl§*7) RC^DYEnamic ET terminator cycle Sequencing Kit (17TEST b "7) 5: V h |E3(C7jKt*o BDTii> Poly T/A£1£ tf BEIX7—7*7 X$E Mtftltl&Z. htffo&o CtKC^LTTSIITUx |IJE^SBOTH til Poly T/A

cnmwz. LT®5mme##®#%^^B (bdt: TaqDNApolymearse, TSIIT: Sequenase) & 7h , W ^ 'y h # (CGC-rich7t#%Tr It, cm:#LT:kL dGBDT^M^bfCo ESTtt, TSIIT® S&%>Energy-Transfer 1: <7 0 A &tzb(D7\ BDTT^Stfc+|->70;i/ch|^^iinFf2iC^^^'>-XX>+h— \Z7 7° 7 7 7

WtftteZlT

— MB*. &t5pmol/l b L tz. o 7> *C IrI 7P1/—b 7 )V'*)'7 7 7— (GenAmp PCR system 9600M : PE-ABltfc®0 IIt'>M/T, i^#)70n 7*7 A LfclM 7 ll/RS® £

U fc = 7i7Sephadex G-50 ( 7 7 11/ 7 y7ftS) MultiScreen-96- well-Filtration Plate ($ U #7%#) (CR22FM : 8 31%#%#) T950Xg,5^®^ff:T^6LZho 7 ® T# S nfcSjM2:^i>X/3# Iz — 7 (CE1DM : 8 37%#%#) 1:j;0 H777^7°L7^0 (b) 7 — 7 X 7+1 — 7-7 X7t)-—It, ^ ^ 7 7 7° ® 377DNA-Sequencer (PE-ABI%# : tf 7 U-77 7°®3700DNA-Sequencer (PE-ABI%# : tiCE^fSTL) 57a 'y H^7#T#e,#Zc1t77 0-7 fiJffiL7c0 377® 7-7X77^^11/It, 48cm&m®2##7X#®fWH:4%7 7 U Jl/7 ^ (PE-ABiti:® 7°n h o;H;:fi£l>IJIE) £}fALTf7S£Lfc= &&, 7 —7# It ^1:64

-153- 2 ( 1) Big Dye Terminator

SJfrfflfiE KfS&ft :r > 7° 1/ — h 2.5 fxl D : 96t: 10 # 7°7<7- 0.5 nl A : 50t: 5 # BDT 1.0 Hi E : 60t: 4 ft ### 1.0 h ! ME* 5.0 Hi 2

( 2 ) Thermo Sequnase II Terminator

KfSEfiK KfS^fr 7 7 7° 1/ ” b 3.0 h ! H : 96t: 1 ft 7°7<7— 2.0 Hi D : 96°C 30 # TSIIT 1.0 Hi A : 45t: 15 # ME* 5.0 Hi E : 60°C 4 ft 3 oii-<^;i/ H : Hold D : Denature A : Annealing E : Extension

#s 1) 7*77 (DSSS 19^^2 2 bp

1 3 ‘dim | 5 (kcal/mol).£lT I Aldim | 5 (kcal/mol)J7.T Hairpin 5 or^T 3)Tm{6 5 6fab 64V 4)GC^* 4 Ofab 60%

K2.3,4:f&ftlffi#

04

-154- (4) 7 ^ 7 i! > 7 JHi > Sequencher (Gene Codes Corporation,Inc : B AL V 7 Mt: 16 %) V 7 h ^©^77 h^-AilT^PCyXjA (MAC-PMG3> 21-CRT,MO)

79 nix%X o^v^co^A^ x^7> aLT, fm^&awaT''^ 7-(DRmm-6: L/z. 7 7-r;i/i^^E^ESU7cSB^jx-^^7-t>y;i/r^o mmtr

(5) ^7<7-mm- '>-7X>'>>^7°7-r 7“ Ote, Tmffi> 7° 7 < 7 — ^ (DT& #m&%7GC%^ ^(D#%T T"7< "7-(D2^:#m^y7< dimmer & Hairpin# igT & 0, PCR£S£EWT5o Ctl6(D# ig£Sfi£T5^S^ te> dimmer & 17hairpin (D # ig (D ^ % T # % $ tl T V ^ 6. Dimmer

3 ’ ^ $!]2$l £X iit $ $1 jg 5: $ (DI bIH (3’dimmer2$c£/Al dimmer) @ A X4V1/f - (kcal/mol) T\ £Zt, hairpin(1, ^BS^B#nT V^c X ^>7*- H±#i>7S/> h LT, #3(:^Tj;7 \z^

T9 7 7 (D 1& Stti > Oligo (Natinal Bioscience,Inc. ; $ @ jg 16 jE ) V 7 h L /to ■€-CD7°7 '7 h^ — A £ l/t^PCyXrA (MAC-PMG3> 21-CRT,MO) &#f8 L7c. M 7 7Mt Uftmmfr6707-i’?~-&w$:mn?z>t&m&)\z±.mmm$:m%'Tz>o fit ^3(:#^"T6 t)(D$r#%L7h.

(6) 7°7< 7 — 77^7-ft SSE^J^m^X-;i/HTB5:H-SJ||-th-HX (%) EL7co

s. 3

(1) 7-^(D##^&(D## (a) T9 7 7— 7^* — 7 cDifl^Tf mcmtf (iXfiF{iijc0 7°7< 7-5:aiBe^EEt-*7^ t)0T$>^)o B&%\ F%U:R#J(D^^E^J#(:^(D^fo]i:# ^T/hA(D79 7<7-(D^^^m-T6t)(DT^6. 77<7

*W^T^>X(C7Ff j: 7 7^7°7<7 —##7 b°— H (D^(AA&2r#

-155- StSCifcLfc. (b) xx77-#@6& X9 7x-©###t7&#MtcLT. tf'7>tS:8tx-X 2 6(:XD->#©##&lE#f *4tcS-f «©X57 v-ggg^ffggLZc, f LT*5K#T±##a*t:^\ V y XXW9SSrff5£y 7-7 lb*' f>Xn->€ifl#^ (34X7 x>X®ll#^ : S0001-S0500) $3 k!-&x-x ht5;tfi'D->#<0^7'f7-SiSl:?D->«fttffa#^SAATt?l±5 CLfc. X9 7T-gTtG©##@*»t7 >-9-- HMX>6!Kgy7'f-?-»£t|-g U ^lT7’7-f7-0SiS8£t5Cit^7'f7-0#^ftlt^lfe. X977- SfcO 0«E#*8ctt$fcf-6 7c*, «fflt7BecR$#SS:RBLTSlELfc. X57 ?-©f-?iLTIt SSgB9J^SX#©ttlcX7X v-©#tt(e$8BAUfc. *it, ##K7#. cDNA©fi#ftS:tm7B (1868) SAAtilit, *5 04M1*

>n->«, ^x x-6, *sb. cDNAigs5& i; # s 2 n a ± ? c u t. (c) f-J'otasa 05lc»tfr*g*©a®«S; (ffiil'MI-y XXX *-■?••)/S) &[X###©f-X'gm* s$)K-r. x □->&, cdnaexs , »&b, xt©##A)6#a2 nto^ct, #5»'e.3 h-&a-x M,t. w t, xx7 -7-©EHRy:ss3S©T

-X;

(2) 500X D —>©cDNA&U=#MX97 X —©## 500X O — X©cDNA©## 6Wt, SflrlCttfflUitX^x- —©!NH4£ X57T-wa*©x-x&sic#if ut„ (a) #WSS*IC#-3*<500X □ — >©#tt cDNA©##ft tGC%lktf'( >•*)■— b ©poly A©^^ £ 0 6 CSt. £ fc, SbtC^tl f fl©T*l*&#:#:&TX9 7 XX U cDNA©ffi$Sli, */Mi/^638bp, # ±@#6178bp?f f%2438bpT&3 it. &jo, 7 XX — b ©f %@H2462bpT&o t. $£:, GC % It - *X30% , #*69%T¥i§46%T$>o 7t. 7 >X — b ©polyAtii, #/b8bp, # ±125bpT?#24bpT&o t. 97X9U —XD — > $ lb & WMAMA1. NT2RM2, VESEN1 & XS Y79AA1 £■ Kt < t, cDNAtottX SRtA GC*l:^('t77^7 lJ-#OWt^l5ftt. NT2, THYROl, PLACE1 f Lt OVARCl©STcDNA©¥i9S*ft7)t@ < , 2 6 ICNT2iE8ST(i 1/9^ / 7 cDNA©¥J9$*ft*tft< 7j;5>f*[p)lc$. 0 , $ Scm##©%##M^&<%6 (57 X 9 U-6(:#WTW&#*©##^A#Ai&) lcaoT¥i9SSft^*i8inLTV>-5. ORE Iz^y 7 >8©SSCJ:0#@MflSlc#Hb-r-5)7c6?)lcJ;0* S&X >AX»7'o*£6j$;2;hTW-5,b3|'X 6ft3„ $7c, ¥«3GC%ti, cDNA©?^#% ftt(JjSCOVARCl*25 R%AS* * < . OWT'THROl. 14X2 R tX PLACE 1 © (E C X 2 < & o T Wit. NT2MflS"Cli 1/^77 >#M##CGC%*i(@i!mN|p]IC&6.

-156- 4-a, 7'f7'7'J-«©#*at*tKEM©##S 6I:

(b) SEffly^-fv-CRtt 500 7 n->oMl;Sffl Lfc^7-f7- (BI1-L;t77 7'T-Tt><#1ff (2E1TS/3 fr-otzbc DtESU) (35773# T. 17 P — > S 7t 0 ¥0111 7 7 7 T — Sr&g £ b t, Ztl

^77^7 © 3kF (5: fiS 7 $> -6dimmer^t/hairpin^iaSfi6(7)fs^ ( I 3 ’dim I , I Aldim I % ^Hairpin). Tm«&.U:&$Bj$£ia 7 IC5S"T. B t hj E © 7 7 7 7-(3*312^7 Eg# 0ti®SSSl/TO5i!!, «)9 7 5 85 # E W (2 <£ 9 ■'?> tl £ f# 73 ’dim,AldimS !>*GC % $: E flfcE'TEH'LZj;ttnif/j;6Zj:/6'o Ttfcro^'-en-e'tlZdZ, 1969&7378#& 0 , i'fftfc^7'f'7-tLTtEBT8Tir>fccti)'6, Eg#©#(2*7#3: 9lt < iftSiiWofc,

(3) BEX n —> D X h 5007 n - >£(5t A/£*7777-7;t-*>7ffiTF.S!)8&<*#rTSfc7)b SffiOB b©RifMBE7 O —7*^36#it. £©*(:: (3, 77 7 7 —7 7 — =£ ZT'ftTttStfrT Jfya*; h # >#£ tB^bti-TStff bit fc © t> $> 9 , 77 7 7-7 * -* >7S© ffi iO/li^EfFito'SS® 9 Cftofc. *St7D->©«ttt-e©SHRtXW*®*7tcSf. $fc, 77 77 9 -#©B*7 □ — >©ffiig¥TSCSt. BEXO — >©tti8S(3, ¥i§ 0.072 ( = 36/500) T£> 7 □ > g(©SV‘NTRM4. NTRP2. NT2RP3- NT2RP4, OVARC1, PLACElS. 7 THTROlTitiS^Srftlitf-5 t, NTP2©=fc 7 tC BE 7 □ - > b fifr o 7t 7 7 7 7 U — t>a6£75*. NT2RP37>0VARC1© © fc £-5. 77 7U -#©S

*H©S t)VJxS © 11, $/t7 n->S©BBS$6S6<)lcSaLTUti: Wci6, 777 7 9-#T©BEX D->©#m&#Sx.6C t^TS^oit. fct, iR7o-> ®iiSJ£i»i:85 r ££ttt3bt.

(4) ##fB#K©#»3 «0iU>7D->Ttt, EM-bTt*mTS%^ot:777T-. BD56ti7j:i£S 7-76h*^ fc ©*1#< ft5#(n]C$o it. £fc. 777T-=5;t9 WETS i5ifi#S&:fcBEXP->T(3fi

#f77 7T-*/#m77 7 7-») K.0577 7 V-3t 9 WETSSSSSICOIaT# 13 bit,

08K:777t-©*| s@* 9&tf777T-3;t9©WEttl£» (= (7 >7- h©E *») Z (SI7777-S-2)) ©J3%&4f7. 0^2K±. S*,, 3t5tl(il:««777 T-g:©2fg«±£Eff US(3flB» Sai'pfctifflli. 327 D ->T$>oit„ -75. 777 T-3/t 0 ©WEffi$Scli¥^366bpT, 300bPyT(:®St" & £RW# X o->(342#T&oit. fCT. -etl-e'ft©X n->£ 9 X h7y7LTS6 ICGC%^'60J7± (307 □->) © U X h ttfclSbT, S«7 -5 7 □ - > » * M ^ it. f ©i(S*S:®9lC*-r. BE 7 D->©32# (377 7 T-ijig $ 9 7>t2.b(-h© t> ©(2, 580©

-157- Kit* T.lshfoashi 9d->8: NT2RP4001207 **No S0001 : PME18SFL3 xv^u-h-y-'fXiis? 1 **$ > 0 »S9f>1 4649 (Max 5.5 kb) f 9X7^7-* (TtK) 9 Input No. > 2 Contig size 4395 cDNAit4’X:*5(nt) M7X7<7-* F* 9 Insert It'fX 4375 4358 HTJy'f?-* R« 9 Kit? fia<0^7-f7 tt?E*l 7 -f 7 - 0 tttt fcMH; 990620 -t** (YES:1NO:( (X7-f7-No.) e m Size Position 3’dim< Alldim Hairf Tm (A+l (G+C) Memo S0001F00 1 S0001F01 S0001F01 CAGGGCTCATCACCAACTTC BDT 20 453 0 3.1 0 62 45 55 K7H: 990831 1 SOOOIFOIr S0001R09 ATCTGGTTTATTGGTTCATTG BDT 21 472 1.9 1.9 0 56 67 33 S0001R00 1 S0001R01 S0001R01 CACCCCT GCACT GAGGAAC BDT 19 431 1.9 6.9 32 62 37 63 HighAWi=6.9 ilKttiS 1 ioo 1 SOOOIROIr S0001F09 TTACCCTTCTCAGGTTCCTC BDT 20 443 0 4.7 31 60 50 50 htMX(kb) 4649 S0001F01 1 S0001F02 S0001F02 CTTTTG GACTTCTTG GATCAG BDT 21 519 1.6 4.7 0 60 57 43 f»K:tX7<7-KK 18 1 S0001F02r S0001R08 ATCCAAGAAGTCCAAAAGCAC BDT 21 516 1.3 3.1 0 60 57 43 K:t^7<7-R 18 S0001R01 1 S0001R02 S0001R02 CTTGGTAATGTGTATTTGTCAC BDT 22 340 3.2 34 1 60 64 36 Note:(-b7UA17^IBA) 1 S0001R02r S0001F08 GTGACAAATACACATTACCAAG BDT 22 340 0 34 1 60 64 36 S0001F02 1 S0001F03 S0001F03 GATGCTGTGTATAATGCTGTTG BDT 22 496 0 3.5 0 62 59 41 (ex. No poly A 1 S0001F03r S0001R07 CAACAGCATTATACACAGCAT C BDT 22 496 0 3.5 0 62 59 41 Irr.Sjunc.....) S0001R02 1 S0001R03 S0001R03 GGCAGGCTGGGTTACTCAG BDT 19 420 3.5 3.5 9 62 37 63 1 S0001R03r S0001F07 CTGCCGTTTGTATGTGAGAG BDT 20 405 1.6 3.6 0 60 50 50 @KA*tl S0001F03 1 S0001F04 S0001F04 AAAGAGTCAACATGCAGATAC BDT 21 374 1.3 69 0 58 62 38 HighAldi=6.9 (nawHtmicAa 1 S0001F04r S0001R06 TATGGTCGTATCTGCATGTTG BDT 21 381 1.9 6.9 1 60 57 43 HigtiALdi=6.9 (2) K7X7'f7-*ktt, S0001R03 1 S0001R04 S0001R04 TTCAGGCTGGGTAATGTTGTC BDT 21 433 0 35 0 62 52 48 %0**2:LTf#IKAt7 1 S0001R04r S0001F06 TGGGTTGATCGAATGGACAAC BDT 21 448 5.1 68 42 62 52 48 HighAldi=6 8 (3) ^6^7^7-17)** S0001F04 1 S0001F05 S0001F05 ACATACGCAGTAGGTCTATG BDT 20 379 44 4.4 24 58 55 45 ffi»(T)X7'l'7-A't,(7)-b7U 1 S0001F05r S0001R05 AGACCTACTGCGTATGTCTG BDT 20 376 0 4.5 25 60 50 50 7-5-^3-tfLT. #4 S0001R04 1 S0001R05 S0001R05 l^-Xhtt.WSLT 1 SOOOIROSr S0001F05 #rLl'i<7>£AtJ S0001F05 1 S0001F06 S0001F06 (4) M"*^y2*J6l±<0 1 S0001F06r S0001R04 X7>f7-(7)»ft. S0001R05 1 S0001R06 S0001R06 «*£mtr 1 S0001 R06r S0001F04 AtJ„**7BMH*«IE S0001F06 1 S0001F07 S0001F07 1 S0001F07r S0001R03 S0001R06 1 S0001R07 S0001R07 1 S0O0IRO7r S0001F03 S0001F07 1 S0001F08 S0001F08 1 SOOOIFOSr 0 S0001R07 1 S0001R08 S0001R08 1 SOOOIROSr 0 0 1 0 0 1 0 0 0 1 0 0 1 0 0 0 1 0 0 1 0 0 0 1 0 0 1 0 0

1 >> NG-Primer IU Date I Putative primer! Subsequence riPrimeT sequence ISCt^jsize I Position 13 dim j Alldiml HanlTm l(A+ll(G+C)lMe S0001R01 AGGAACCTGAGAAGGGTAAG 20 444 1.6 4.7 31 60 50 50 85 stteaa wmHo. &&&>% 87B #&(nt) Note Insert(nt) S0001 100 NT2RP4001207 PME18SFL3 990831 4358 4375 4649 S0002 100 NT2RP4001210 PME18SFL3 990831 2940 2958 3007 S0003 100 NT2RP4001213 PME18SFL3 000131 2732 2749 2904 S0004 100 NT2RP4001219 PME18SFL3 990831 2761 2789 2803 S0005 100 NT2RP4001228 PME18SFL3 991029 2678 2697 2904 S0006 100 NT2RP4001235 PME18SFL3 991029 2603 2625 2704 S0007 100 NT2RP4001256 PME18SFL3 991029 2617 2635 2607 S0008 100 NT2RP4001260 PME18SFL3 991029 2899 2911 3112 S0009 100 NT2RP4001274 PME18SFL3 990831 4384 4404 4782 S0010 100 NT2RP4001276 PME18SFL3 990831 2491 2518 2416 soon 100 NT2RP4001313 PME18SFL3 990930 2646 2669 2704 S0012 100 NT2RP4001315 PME18SFL3 990831 3303 3336 3549 S0013 100 NT2RP4001336 PME18SFL3 991227 2019 No-HelixROO 2088 2233 S0014 100 NT2RP4001339 PME18SFL3 990831 2715 2732 2904 S0015 100 NT2RP4001343 PME18SFL3 991029 2647 NotHelixROO 2714 2904 S0016 100 NT2RP4001345 PME18SFL3 991029 2681 2721 2607 S0017 100 NT2RP4001351 PME18SFL3 990831 3558 3575 3549 S0018 100 NT2RP4001353 PME18SFL3 991029 2922 2938 2803 S0019 100 NT2RP4001372 PME18SFL3 991029 2308 2324 2233 S0020 100 NT2RP4001373 PME18SFL3 990831 2913 2928 2904 %f5Y7-

S0001 ££K?lJiMrXh 05 o ^□vx^h^wyu SQQQ1 FIN------jsnofti C-NT2RP4001207.seq Contig[0001] V J

SOOOl CHR C-NT2RP4001207.fasta > t*xk 7t>ou SOOOl TXT J.

E95 "T—^##7^—T'Vh 200

X 1 D

501- 1001- 1501- 2001- 2501- 3001- 3501- 4001- 4501- 5001- 5501- 6000- 1000 1500 2000 2500 3000 3500 4000 4500 5000 5500 6000 6500

SSS(bp)

GC%

me GC%^U^ X I D

polyACOi^SS(bp)

#6 &%7'(?:7,J-&(DcDNARlS'f>+t— h

(bp) cDNA 0-9 —b polv A AT GC £i* 500 2438 2462 24 54 46

MAMMA1 1 1714 1746 32 51 49 NT2 309 2623 2647 24 54 46 # SR. NT2RM2 1 2956 2974 18 61 39 NT2RM4 33 2260 2282 22 56 44 NT2RP2 92 2400 2425 25 53 47 NT2RP3 85 2693 2714 21 54 46 NT2RP4 89 2846 2874 28 54 46 NT2RP5 9 3314 3339 25 52 48 OVARC1 50 1964 1993 29 51 49 PLACE1 85 2044 2066 22 55 45 THYR01 48 2433 2454 21 53 47 Y79AA1 6 2510 2527 17 49 51 VESEN1 1 3273 3290 17 45 55

-161- 3 dim | (kcal/mol) TmlB

GC%

| A1 dim | (kcal/mol)

AT%, GC% I

12)7

1-5 6-10 11- 21- 26- 31- 36- 41- 46-

Hairpin (°C)

-162- g 7 1 0 V No tit 1 S0028 NT2RP4001447 F02,F04,R02,3,4')AB^A(N#L) fales-priming siteA^lx 2 S0049 NT2RP4001644 fales-priming siteA^lx ##%Primeri^RR 3 S0065 NT2RP4001849 F01,F03\R01,R03&ff£*A(N£L) fales-priming siteA*£lx #^kPrimerf$RR 4 S0084 NT2RP4002083 F06\7’,R06”&fi^A(N£L) fales-priming siteA*£Ux #^kPrimerf$RK 5 S0096 NT2RP5003512 14SI»:11 6 S0100 OVARC1000001 Fm,2,R01*B?&(N$L) fales-priming siteA*£lx #^(Primer{$RK 7 SO110 OVARC1000109 F00 >6BE£ ■r>3#’U—KEn1 8 SO112 OVARC 1000114 F04,4'.F05’;6B^6(N#L) fales-priming siteA*£lx £f&PrimerfEfi£ 9 S0128 OVARC 1000408 F00 &BE£ f^^L-hEo p—> 10 S0132 OVARC 1000440 F02,RC2'^B?A(N$L) fales-priming siteA*£lx SHtPrimer^fijt 11 S0160 PLACE1006037 F02.F03,R01,R03’;&B:FA(N£L) fales-priming siteA^Lx ##Primer{$R% 12 S0174 PLACE 1006248 R01,F02;AB^A(N$L) fales-priming siteA^Lx Primer 13 S0180 PLACE1006414 F01,1'.F03;A#^A(N#L) fales-priming siteA^Lx ##&Primer{$R% 14 S0215 PLACE1007112 F02,3,4,R03,4,5’;j6B^M(N^L) fales-priming siteA^^Lx ##%Primerf$!$ 15 S0225 NT2RP3000439 F02,3.5,R02.3.5';AB^A(N$L) fales-priming siteA^0Lx ##%Primerf$Rt 16 S0228 NT2RP3000487 F1,R05,6;AB^A(N^L) fales-priming siteA*£lx ^%Primer{$RK 17 S0229 NT2RP3000526 F03,F04;6^^a(N^U) fales-priming siteA^Lx #^%PrimeH$R)c 18 S0237 NT2RP3000603 F01,F02’;6^^6(N^L) GC-rich,fales-priming siteA^Lx £i$tPrimerf£fdt 19 S0245 NT2RP3000739 R00:Vectorok>;j£ff2Ec3 20 S0249 NT2RP3000826 R02.R03AR05';AB^A(N#L) fales-priming siteA^L' ^H%Primerf#^c 21 S0261 NT2RP3004670 F01;6BSS(N£l').®jX-ti"f GC-richT'iSf+ilig/K ^AP^L Shotogun 22 S0354 PLACE1010761 F01,R03,R04’;£ff2*A(N£L) fales-priming siteA^Lx ##%Primerf#R)c 23 S0359 PLACE1010833 F01,F03,R01’;6ff^A(N£L) fales-priming siteA^Lx £8fcPrimerfEfi£ 24 S0399 NT2RM4002504 R00:ployTflti&;&ff2E£ Singlclony$)A&L> Shotogun 25 S0408 NT2RP3000418 R03:plovTffi&M£E£ Singlclony^tiA&L,. Shotogun 26 S0412 NT2RP3000512 F04':G-richHlST;j6BSTOP BDT,TS$g&L= dBDT-Comp. dBDT-G^60°C(7)SI^^ 27 S0417 NT2RP3000628 F02:GC-rich$B#-e#BSTOP BDTTSS&A&U dBDT-Comp. dBDT-G^60oC(7)J5JS& 28 S0429 NT2RP3001383 2230-N)NU E5i]ttS? TS&l/dBDT&i:<7)j&j££ 29 S0447 THYR01000092 F03,R03.4;j$ff^A(N£L) fales-priming siteA^Ux #-#kPrimer^1%E 30 S0459 THYR01000625 R00-R0lRg##?KM fales-priming siteTSSIt^W Shotogun 31 S0479 THYR01001426 R04,;£B:FA(N£L) fales-priming siteA^Lx ##%Primeri$f%K 32 S0481 THYR01001480 R01-T-rich#%±#-C^BStop TS^VdBDT^"^U—hES/Jx ^^Primer^J?)c,x>^b-hSX2 36 S0503 NT2RM4000634 5'11, 3~3(Dtl7-3 U fc0 322 a — > \ZX>^X$£L < M,Z> t, 7° yXX'-^ft 0 (D¥l]M&M$Ctf300MTTfrX>GC%60&±fiH2 D- >, 7° 7 X X — ^ ft 0 , GC%60R±^37 □ — > T\ flTiMtl c$i60^mi(3mm7D->n, ix-no ^^T&637D — >$:##, 7°7 C:3.0-3.4, D:3.5-3.9&Z7'E:4.O-4.40E^T' U X h 7 7 7° L Tc t Z. 5 ^80 <£ 7 \Z fo o fc <, #E^#1:A:2, B:2.5> C:3, D:3.5&^E:4,& PLACED THYR01, OVARC1 0 IE \Z /h £ < fc 7 <> C 0 It ^ 6 IIE 7 U — >0S#ril^$ll> # M % 1: NT2RP3 t PLACE1 7^' iW < > *7 0 X THY01&S E 0 4> & 0 MAMM1, NT2RM2 , NT2RP5, Y79AA1&IX VESENl£P£S, 7 7 7 U — £ 0 %> t, cDNA0 s$s^>Gc%ii^ii$ o tm&t£<. om^b(D\t, L & 0 t 11 pg fc 0 £ o ^ 0 0 31 (A & 0 11 NT2RM4, NT2RP3 R XS THYR0lT\ %^%I:##0BL07 D—>^^<^&^TT0^a#A6o

(5) Mm2u-XDffitirM MWt2 u->MZTIBH^-To (a) 7 □ — >£ : OVARClOOOOOl A 7 D — > 11, false-primming site0 X — 7 T$> Z> o 010-llC^f «t 7 KlSOlOORO4’0g& m*&^0K:f6j;b£>t\ UthjE^&tUyfCo C0ma&#ML^(hC6, Vector 1:# momw&zrcfttftfryfzo xzx\ ±mzmr^vectoret m l /t -m 0 & 0

(b) 2 □ — >£ : THYR01001480 *7 □ — >!!> ElO-20$S-T —7 7 IdS0481R01 0SE^fj0T-richE $S ■PWanfc^r-XTJbS. ll£#mT5 Z.b.tfX%t£frytZo XZX\ 2,yXX-&T-nch%U&xmmLXi/-2-> |gl:^l-j:7 l:m#0SL^l$r#<'#-C6:^T#^. (c) 7 □ — > £ : NT2RP3000418 *7 n->ll, ®io-3£^f

< V-^EIf-T^T, '> 3 '7 h #>&(;:<£ D #e>tl/z+h7#7 □->0E^iJ^rfiJffl UZco #

(d) 7 □—£ : NT2RM4002504 A 7 □ — >!!, 12110-4 7 Upoly T/A0lu^'r^S^S7l V MffiX t Ufr y tzX

-164- 450

1.0-1.4 1.6-1.! 2.0-2.4 2.6-2.! 3.0-3.4 3.5-3.! 4.0-4.4

200 250 300 350 400 450 500

ms &

2>:32 300> :42

>60:30

G99

-165- 8 V p — y V 7s h A B C D E 2.0—2.4 2.5-2.9 3.0-3.4 3.5-39 4.0-4.4 ^7 a —^ NT2RM4002504 MAMMA1002094 NT2RP3000418 OVARC1000001 NT2RP3000526 NT2RP3000439 NT2RP3000487 NT2RP3004670 PLACE1006248 NT2RP3000603 NT2RP3000512 NT2RP3004670 THYRO1001480 NT2RP3000826 NT2RP4001849 OVARC1000440 NT2RP4001447 OVARC1000114 PLACE1006037 NT2RP4001644 PLACE1006414 PLACE1010833 NT2RP4002083 PLACE1010761 THYR01001426 PLACE1007112 THYR01000625 THYRO1000092 Y79AA1000346 Y79AA 1000802

: A:2, B:2.5, C:3, D:3.5, E:4 ktS k 7 'i~f=7 y — £7): (1)NT2RP3: 18 (2) PLACE 1 :17 (3) THYR01: 11.5 (4) OVARCl:9 (5) NT2RP4: 8.5 (6)Y79AA1:4.5 (7)MAMMA1: 2.5 (8)NT2RM4: 2

$9

7— >70— AT% GC% cDNA WEtiESSt MAMMA1 1 51 49 1714 2.9 249 NT2 309 54 46 2623 1.2 365 f*3SR: NT2RM2 1 61 39 2956 1.1 249 NT2RM4 33 56 44 2260 1.3 339 NT2RP2 92 53 47 2400 1.1 368 NT2RP3 85 54 46 2693 1.3 353 NT2RP4 89 54 46 2846 1.2 380 NT2RP5 9 52 48 3314 1.1 388 OVARC1 50 51 49 1964 1.2 368 PLACE1 85 55 45 2044 1.2 376 THYRO1 48 53 47 2433 1.3 362 Y79AA1 6 49 51 2510 1.6 329 VESEN1 1 45 55 3273 1.1 387

-166- T>7"lx-h0^&^#0E7iJ0## (poly-A^A-rich#^a(7)2^:#m#^) m#TkL Ittf, 0~f n% poly ^'S fco T0fco £ 7° 1/ — h 0^F & ##T#S/3 v 0t)0emi/4e0E^J^7-%'7 7 L/c a 6, C poly T/A0i^7?0Si0fife^^^lfco cKDSH^JT'7-tr > 7*JI/L T£#:gB?iJ V 7^0 (e) 7 □ —>£: NT2RP5003512 if 7 □ — >!!, ® 10-5 0 ±I£ 027> 0 $ ^ X — 7 £o iDA^T^g/^o T#^T# %:^3^7-7T&6. ^0#^. (aBDT^EST^'yh/^^T^Ut: o — JV&ffilM't&'fc £) Isfztf, MfcVfeffiZ.fcfr'ofco Ctili. t->7°1/“ N^f-Act TTigmmmu^y >7;i/3 o-c-ic^ggL^^mmof- >7° u— b (5f@) TROO&$JS L tco T0ISM. ^M5SE?lJ0##TA0peak0 fe0^'27 D->. T0t0^37D->a^o/c. ^CT. T^lfaiD3V^TWm#^^ffo^. T0ISM. Tm02m^0##^-7lD^Tj;7(D@E7'J^23lD8tJ^i^. ^\Uv7xef!%mf CD ¥U $fr T . C CD # % ifi open reading frame 7^ 6 ki iT tl T 0 <5 C t. Ik. XS IS CD EST

(Expressed Sequence Tags) ®TJt*"TT^ T “a ” T $) <5 7. b. ib £>. ACDlSM^##^ tlTho (f) T/P-y^g : NT2RM4000634 *7D->kt. ^io-6C^Td;7l:5^j;D < DmLE^J^900bpm^^#y7<7- amm-T^%^D--7T&a. 3'%j;D7'7<7-7^-7^m^.

(6) mm^o-xD^sa^m aigncDj;D ##7 o->0 i! B T # ^ 0 t) 0 d. false-primming site (237 D—7). E[3 % 7° V T 7— 0 fj£ St @E ^'J

77^'f#6 tlfc 0 7* “ 7 T& £o 3 V^TgE^J## (57 □ — >) T. NT2RM40025047 □ “ >0ckD KlgE^JklJ; D 2^:#)S^#^LT#ai##^^I^^^TD--XT&6o f btr >7°l/— h0^F& (47 □ —>), GC-rich (37 □" >). RZ>'60 < 0 jELgB^J (l7 □ — >) (DMT&5. GC-richVlg&b 2frffij£I&W’C$> QMZ&]&Wa%:6o "€: tl T' tl 0IIB ID L T . ^ 0 7 ID HE L o false-primming sitekt. 7 7 T 7— |i§ ^##LTAe^^^^-7^#6^l6^Ty7<7-0m#f-^#D)gL^. C0#^.

#0#3% DftfotfMfe-3/to GC-rich0 7* - X T ti2^^iS ^7jc #IS^T&6^^. iDDMSO^rSjja L fco TtlT t)#M^7^0#(l. dBDT^^Jffi LTto £ Dc. NT2RM4002504 0 j: 7 gE ^J ^ (D polyA^ polyT^ ^6#^0 2^#)g|#$T teDMSO 0 S M 7^' & tD 2^#B^#*IS'o T#^:$ t%T0 7ta#AD:. d

-170- 7 O —7 —L/to (c) ^ □ —>(D#^Tti¥^SSS^2438bp> GC%^46%x ^ >+h~ h THR01, NT2&//PLACE 1

/to (e) 36i@£>EEft D->(Di!SS£^WU 7 C (b^##L/to NT2RM4, NT2RP3& //THYROl T, +8*%C##(DE L U— >&&< ^ tffftfr-D fro (f) E#ft D— 2/

E?iJ&, 2 6 (4ft n->) <&#mT6ft^LTAT#mLfto

(2) (a) a a ^ false-primming site & # # % # # "#" -5« (w E#f

8 . 5 #3fXM

1) #j@% 7X hl/'ff-y HN o .3J (1996.6) 2) rpcR&##m-*#^#^(1997 .6) 3) ”www.wsu.edu/Adnaguy/primer.desin.html ’ 4) ”genome- www.stanford.edu/ Aspellman/information.html 5) Y.Suzuki et al. ,Gene, 200,pl49 (1997)

-172- * 9 * ^#cDNA©«SE$ijge$

9.1 *3

££*©&?> ft$a#7)lDNAb A 5t:^0 3o&&. aft. a© “^©igw-ia ” T&syyA»ey© SSK?m. fTlc*#H(l)^»#(2). 81jl(3)&£T-S:EW9;$Stt, hhy/A (4) t>2ooo¥#icii9o%© K77 s->-y>7i'*a«sn5. yy Ai*eft©#*E?iJtt ©#m##^fr?yyA##©%m«. f©##&«m ttitfojmtts.v, >/iy Miit!)T*?i). -efflfc*.

L A'L & *16 , yyA*e^©95%« “5>\>V” DNAT& D . »i3, y 3/19K&3 - h* %e#±l:K*LTV^&. H#*^y y A*ef ©&*E 9ljz)<±T #RgBTt. ^©iee#lgz)i3- Hf >A9K©#E##r(cliS#SU?5;H„ S5>t. aifgoh hyyAEma&< it-fe K77 h->-y>xT$> 9 . mmo^E^T»& nJSB»t$5©T, ?>zi7*«3-Ht5fi)«ffliEllftSiE«®)l#ri!i!1l4*SSI #«ot< yyAEM©##zmafm$. ±g2©@scj: ycDNAizag^sw 93 3* 5= cDNAtift*E*gic#syi3< yKicgmsti-syy asse^ijt^-s, f AT. m^6cDNA©m*E9'l##*iet»BTU&z)1. f©$%©8B^-E^©»©# #?yf'L'T&5. ioT, ©##e&#eT'5 A 6^?nl#T$ 9. yVA me? ©*%E^t©Mm&#g4f & At & @#T&3. -#. cDNA©7t£&*#E?'J£ft;t'T-5 A £«, *ftf ##B##©#imi:»?SI^T $>-5tc4)SBt> 6"f. #l:^6A©cDNAlC3DTIl. -e©$tf#®K*69&BilS ©75:6 fc anrv>-e.©75tattT$.o/t„ t haftTisio~i477®ESftUTv>5ystinTv>-5y. %sc#ffi»i

i9944u=#ea*Liizm%L/:%i-v d'ft v 'y yffi(4)n«*± < ^Scdnasmt @6^BT%< . cDNA»5f ©T, *6fStt. SP^^>Af &. A©##mi:-#v-h-tsen *M*¥ES¥E5$0ij3 j:tX(tt)AU y <-xemSrTti: b h©«■»«*' e.«T3«CcDNA7'f yy u-SrffBKLru-e.. ft#, b H»@f t:M^&E%a*©*#^#*©##Mb&moT#*M%M#7©mYb yftSJdnru-a. A©5S£fi:cDNA

-173- © 5 %REttz)$iW© 500i' n — £75 C ££Bg£ UT{ffSe£$iS66/i„ ->-y>X|g*©[6U:£E5Ci: tilSilt. M#6®C ti 3 8110,000 bp/dayT* o felSff S6^7-S:35,000 bp/day C 31 £ :iCLfc.

9 . 2

(1) DNA+b>7'Jb©l@R a. -9->y;b©§«

-trn-;i/X b -y X©ST(#)a'J y

^xe%0r^6SfiVZt. cDNA1b>X;b®Sfit|B|SfC, 6-X d->©* iSg25iJ©X □ x h 771'Jl'fefftTSai/:. b. ^7X3 b#®& . y-y>XMlSI:#fg'X'9Xa bDNA©M#*e#ML/:. ##©y XXlb SSXtBS. -9->X;H@SI;tiX fcf- btffiOEttySijiSttS. l»iL&y e, ->-y> |B|ll#lC-!t>7';b©iS«R -err, sttffiflj©Li-coRas->--y x-tb-rfl ns/t osox d - «S<8yr1"^> tU5*#ESTT*®-9->y;bH8S*?iSl±6t^WVfc„ $£, SttTti, in* T+ry >a®QIA Prep Spin ColumnTSI® L/tlb > 7,1b (TE^ft®) o, aM§gt-*©y7y>a®i8B£'y bt©it

® X7£'XaB8«!'/7X5 • 96-9->y;HiW$3® • am-fb#@@l: jc^M® © +7y>a« OIAwell 96 Turbo Miniprep Killed; -5198 • 96+b>y;bM#m® • ®5l8iMT©l@8 ® +77>ttB QIA8 Turbo Miniprep Kit 1C =fc&3@® • 81b>7';HB|S#i@8 • at' • BSItoifS SAiTSIB ® y 7 y >a® QIA Prep Spin ColumnfC yy rtroias • at'-rm® c. ©7X5 b*«@B b.fflHSttWtttt^WSSJ: DSWl^7cy7y>a®QIA Prep Spin Column Srffl Vi T X" 7X3 h’WHSrffofco («We*tt9. 3. (!)&##) ® 2 ml©LBl8titiC8s$SL/cy V -bn —;bX b -y X $:6^5TAiS® 0 , #B7-5„ © 37rTlb#tm. Stt®OIA Prep Spin Column^ -y b tX5 X 3 HDNASrl?®® 5..

174- @ f#6nfc7°7X5 Kd NA£1 u 1ED, 1%7 th >7X1/0

&43> ##b^7°7X$ MDNA^TE/iv7 7 —-ZO^CT## L/c.

(2) X"7<7 —0^^43j;U(## *yoy%/7hT^, ^±&cDNA(Dm&R^j##^f77n^^o, mm:*am0m# 6y7<7-X^-^>Xto(:T#^r^ff 7 L7h. Xx<7-x^-4:>Xto(:j; SBSgE^^t&STtex * )M£iJ£ fiWf 5 0 H >& S fc ^ X X — gfi^ij ^0 7°7X 7 — 2® H (Forward4o <=£ (^Reverse) X## my7<7-. xn->c«toE^ji>#g77<7--masa%6) ^T^pME18SFL3ffi7°7-T 7—

(to) a IJ v X X#f^Br j; 0 ^#L7h^±&cDNA0(3 pME18SFL^ X X — ^ X □ —X>X£tlT^£#X C(D

£ 0 7° 7 X 7 — , tu ^ScDNA 500X Cl — >±T0^%gE^'J^##T6 0(3^ ^ 6 fiaca&s, x-xxxM^5oo[Ei^a±(D47#^^#(h7^6. N-7°7<7- ^ t>nz> „ -ecT, pME18SFL3^X X-7°7-T 7-&, < ^#^^to0^6HPLCT##L^7°7 -f7-$^lt:^^ • ##-X6C(h(:L^. J^TtC^XX —X 7^7 — 0 gE^J^r^To

Forward Primer (774FW) : 5’ CTT CTG CTC TAA AAG CTG CG 3’ 20 mer Reverse Primer (1250RV) : 5’ TGT GGG AGG TTT TTT CTC TA 3’ 20 mer b. #XXA77<7-0E# tl X X A X7< 7 —^la.0pME18SFL 7°7-r 7-0M^^l^a, 77<7-E^J03'#c%& pm%Bm0G3;/: teCtC L> GCgBX'J £ U 6 & Vi ti> 7 >+h— H6¥^7^2.5kbpx##my7<7-^E#f - &0LTm&E^&&^T5 7^-5o C0X ^-—4P >X##m7°7< 7 —(1, < xth— h#(C j; LI-CORttSX — lXo-Z/&&06-10:/7<7-di#m

-175- te7°7^f 7 — Ltztf'O T, 500^7 h — ^ ;i/T'3000~ 5000<£> 7° 7 7 — —^ >"th—iCi&t'S 7° 7< 7

HPLCffiSI&T^ffl V^> 6 tlT#/Co HPLC7##$tl/c7°7< 7 —L n^icf hTH±!g(7)j;7i:^#(7)^^-4:>^##m7'7<7-^^^(:^67)^ OHPLCC -©8f»jW!&SBi2#l£ft5£LTH< iagT?©&&£&:£ Z.ttf^m-£tltc0

£flg3xfZ>s<< 'A0W D&ftS: Lfco (D ffi%tJ?AT(Dmm&

%^(DHPLCT(D###TH, jM* 7 A£m>T^fc(7)T\ ti - h U 7 v ?

(B /^777-(T)## 7AT(D#m(:#mT6/^77 7-#(D##$r^(7) j:7 (ceoTh. 9ut, (7)^#{C'^#A'y77-$:##L, #(:i&m#i&(7)#m&ffo;t. ##(:mv^X'y7 r — te, ^ u %f7 —(7)HPLC##CA < ^ 7 >tcr> A (PH10.0) 7tf“h (TEAA, PH 7.0) (D2UM <7) £ £ lA tc0 $7c, ####(:& 1J d,7-(7)HPLC#IE^S< fflV^n^)7-fe hr: h 1J ;i/Zk*M<7)B$£lO, 12, 15, 20, 30, 40%^T^^Ty —y>X(7)7^-< y-^(:-^A6j^#(7)##$:ff3/h.

E^*7AT7l:j%5%9l#

bfco <####&(7)7°D h 7—;i/>

(D ®MT37A^7t hx: h U;i/T^fe^-r^o ® f #

(Z) 7t h~ h U;!/&##Tm%##77<7-&BmL/[34XTa (i/U )

(3) mm@E^j(7)^^

^±mcDNA(7)mmE^^-^m^T##(:^m^m#6A7mmj;o, 7°7-r7-

(D ##A& : 7°7<7 —j; 0 , ^AcDNA(7)W$M^#^C##T6o (D '>“^ >+>* —: LI-COttSi4200, PE-ABlttSi310

-176- (3) 7 $ X h 'J “ : 7 —V^/tDye Primerl£(LI-COR) t: y 7* 7W 7 “ 5. 7 “ 37 — ri (PE-ABI) ({^iffl^'y h : Amersham Pharmacia# Thermo Sequenase Kit $ /c feL Epicentre^Sequi Thermo Kit) 0 X — 7 IBS : Sequencher fcf\ MtU0 2. aTM^fcpMElSSFOffl 7° 7 7 7 — £ ^ T^^;EcDNA(D^^E^J7 7<;i/ tit&L T7 □— >

;in£|gD}SLT^£6cDNA< 0 # 6nfctESE?iJ<7)7 - 7 ^ISE^J ©7yt> 7^1/ 7 7 h 7 x 7 7 $.6 Sequencher^ T L, (#)/\ U '^7i^0T^S©7t“77 hT77i';^f7^tfCo W

caa, m&ese7j5 c

£ ^ fX <7 ITt“^©7Ip$7x "J 7 L/t0

^GCE^J^long poly aE?'J & £ Ol«E?iJ £$#77 3 «£* \Z «, 771/M £ M A 7 ^ ^^^^##7CDNA(D^^:#ae##76#^DMSO#(7)##^m!;0 L £fcSequi Thermo Kit b Tfr 7 , 7 -7">y 7;i/^#f]m^^msothermal&T<7)##T^fTVX mmE^(D$#f(C% 2W:, #^E^J"M1, 7*^XhU-(D^ (7<7"7<7-&%U: tf'y 7 7 7 7 ^ 7 7 — &)

9. 3

(1) DNAit>7°;l/0|^M a. 7"77$

> #6n7cDNA^^M^: LT'>-7->7^SU7c7->7;i/^LI-C0Rti:E'> A'y7777>M^i@K, L7cc ii, mmu/cDNA^^ 6 c##Ta#gat& o ,

7X7AiC 15^Ffp]^ ^^iJEL/to DMA#

7x7 hT$#f7^500#(7DNA^$#r76#^, ©(Djjfefz t, 507* > 7 )V / 0 TiS "7^«h, :&'7777!/&m#Ta(Dl:ioB##W'5#i-#6%0. L^L, (^%-e

-177- tizLfZo 5002 □ - >£> ■5 t>30^ n —>(coES^Sv^ SfcteBEX u — yX'h^Tz^,, +1-> 7°;i/£> E S^2-3(HlfT oTiSJbaL^o

5007 □ — >(D^^:S cDNA 5:ES Lfc^X 17 D —> (N1056: NT2RP4000787) 0^# #rLW^O->(:#L#A^ (N1501: NT2RP2006554) <,

(2) a. pME18SFL3ffi 7° 7 pME18SFL3ffl 7°7-T7 — \t\u mo\2 2 — JI/T'^J&L, HPLC^ffl ^TflS Ifeo cDNAlt>y;|/^#M(hLT, 7 —7">X##^fTVX y7<"7 —0##^, C^^±7o->(D*^gg^#^ Cffll^/co

^^rScDNA 5007 o — >&##T &#>(:, 4459*;t-—^>7*ffl 7° 7 ^f V

*7" (S2)o 17 D — ydbfc 0 ¥*^8.9*C7)<7 t'-Jr >7*ffl7°77 7“^^S^©T, ##fM 7^7< T —2*&jmA^:10.9*^l7 □ —>(DS^M'Sfc'> —7>X

EH* • £-/&Lfc7077v —(7)7-7 > > — X >XSJ£^ -5 £ < \zUi2f& L ft *#:^^*(Dl%lX^n:#^ 3 fc0T\ #$ j; <7°7<'7-(7)EH-' £•/& £ 77 X fc £ W® T#6.

fco MMte> ®tt&tfTmttfi*-7‘>XEJfi£fT^ *Slo»«ttSIBl/fc.

&> *U :*V-tt«ffli££#gBffl*a»J£#2mlB#)XX *lf §2 S <7) it i& Srfro /io Walters# E <7) % 7 - ^7c, A'y77"77>K^i#^o7c. etU:#LT% ^ HPLcmmisfc72'r?-t\5mm(D/'<> HPLCT##L^y7<-7—chit^f 6a, 7"7+h- Hg5m::te7^6<(7)T\ L#. < , 7 -X >XStfrEM^ /t(7)T\ cfcD^X h^C^6^#mimlT(7)$g

178- —6AI—

-a xoo£2iz$$W£*zi$<* 'an?*si °^*Ws«'?^ai!a-^-vjfta(a)Oi!Sit

"^ushii ? Ea$v»®iBito '<5»-v-v UKfR? ^9*H$(ztnC$a«^/:'X|H|% ’Ug<3= 0 ¥3^toSst*MSUita©<-n x •(IB '9g) 4-^J«SSSSJffl?iM3i» ,?T9S:iMS?B(*?IWS '¥$ '21 MI Ktoxz? zr- riv(#)5W9j»S®a.13s3U3nb3s21 ? 2 '«^a*#0vxao*^% °(S8) ¥^$a.%zoo -$i*YSn* °-^¥#n*a<-a xa>» '<5?^^a^^¥^J#»S<-nXOi©±y3 '^csK-nz? ®S •¥41W3##©<-a xoos <-a l2ia.$ BuyE^nas* SM^yaJMSeSf •« 3%©Ka*m (£)

°-VU9 #h3 ,n??^4-W1 A#a-SS#Z/:* a 4 £»$$«©/: * ¥XFR«»*m '.i©^arBn*sci®'$’4-#«>?¥^#BWsO!S!?@©/i'r'^xtR» ¥4##30a©2 XZ^##©mgoi ' q rv > w A

•(*•») yc£mM,xm~xti%Q)±rct 'av¥a?*s«i0TH

,a.©2vc$a-*B^ffl#®a-ai6®$ia./:':-¥xnR# '¥$ °¥4 Mitt? s-g?i(5is»ti 4 3@2l#/,2 zi'OHi3%#«2l©¥9-#3*%2/n3$ '0%?-(¥+±^mW?9%A $es '%c.# '>?'W-i£/;fii (#/ X^-aHT) ?9¥>#3E% '11MU2 °¥c ? 2 5 r/ > £¥*?¥i«'Ka£-A>^? (#//: zf-n H£<) ©irsiES '*^¥nM#3E###©/:i'¥x^* - q;#*©a.3m#ew

r, 4 -4 ¥Z '?12iSfflSa-x. >xxWiS*« '2-c»

°(£ *) 2^u9*hCi*9*y¥*t©±V3 n 4=4 ¥X2|^ '¥!** 'TttStt?‘5-t'ianif 2|#w#w^¥© (OOlHd) VO = &-^XG)Sfel2-9i9q¥?-X.>X/;^'l®W2-2!31dHya- (OOIHd) YXYKXiSM °¥4 MbW? 2 9:I¥2I#3$ 1 >Hi-.X4©X< ¥-<-Xni3?I2) (O'OlHd) YXiYKZ^tf 'f(-i (0£ Hd ‘W31) 4 -¥¥Z< & / 1f¥T f, 4 '*§*¥43*3 ”2¥c-a5M»©-x X x.\/9-©H2)#^ • 4>»7b©K»¥ M$to-x x zr.y © 1 'fyxS.YnW&omt

0 © © © ##*# 48 96 8 1 n-' y h me#m 48 */3 h 96 */ 3h 96 * / 3h 24/ 1.5 h mm © X A A 96*1518# s i*3t ffiMSKv' ffissev' 3 ^ (D oT# 3 y ^ i CD Bjffi *9 o ttfc9„ mm* X A o ©

(2) : ^rT>f y%tM QIAwell 96 Turbo Miniprep Kit (" <£ & (3) : ^TtryttM QIA8 Turbo Miniprep Kit ® : ^rT Vy^tM QIA Prep Spin Column A J; &MM

2 ffr

20 mer 22 mer 23 mer *# 4298 * 160 * 1 * % 96.39% 3.59% 0.02% 4459* 1 f n — 8.9*

^3 T±b~hV

r-t? h- h y /v(%) IR# • tr'sT.j t — 'y wm 10 i|X*^/>^< , ^fy —y^snv' A 12 O 15 A K;4S*#V' 20 #m%Tyyic-C#*6$^^6 • #* 30 U ->—^ryxESicfflV'5 r t ;4>tb5fcft X 40 vv

—180 — b. ###& -

£(Tl:5007 D — 7©SS)W#t 7°7<7 —#[ 4 7 ^ 7 7 7 s >E#cK:f£^0&ofc7 H“> : 197 D — 7 (3.8%) c. -f 77* — ##7 D — 7-ot£e NT2RP fi*(D7D — 7^264 70—7 (52.8%) £¥#:&&#)%. (#7)o e. 7 — 7* 7 X cc%o^ v^7 D-7^^U A(D#W#^7 n-7&##f6#^, 7-7*77 77'MfE;^^l:i#i^ ££>-£< DMSO£^JnLT£j&£frtA, ^61:, 5*)\/&Wi\Z *)!/ A 7 ^ H L X

false-priming I:

LI-COR&# 7 — 7 7 7* “ T ^ %. £ % Tz *t £ C 11, PE%#ABI377T t)##

w#0SWBM£S#)T£:BE?ij£&£-r5c ct, can, ##§1: S X HJ —(LI-CORTll#T^##7°7< X —ABlTllBig-Dye 7“^ 4;-7-i&l:l:6#^^m^6 c a) I:l:6t)(7)(b#x6a6. f. 7-X7X|g7j(Dfp]± ABI3107-777*-$r#AL, ±E(Dl:71:y;i/(Dm^, 7-7*777^£f&, ffl 1110,000 bp/dayT'&o fzMtft 1^7^35,000 bp/dayl: £ TfUSW 1C foj _h L fz„ 7 — X7X##(D#)gC7l3TW^:7^-4:7 7##Tm7'7<'7-(D^^^, #mxf*'7 7^1: a^TCa^T(D6fgJ^±om^T^@T#6f0###%&#i[L, mm^7-77X# #0EgEm:#'f6Mi&&#(DXU%-73 7&# 3iLT, ##(Dm^ll^(9Tn<#^^$)^m%l:m#T6$^

-181- 4

^7 — h y y ^ %!####& (t!) ## : 1ml/* — h y J/ '7 ifSryt^w (phio.o) 12%7ir h- Ml yix*## J®§ 1 Koj/7/# (®§|j£) SttiiS if LttiU&

L#x.y n—y y 7s b

^ n —y No. * n —y id S6 N1017 NT2RP4000263 •Yy-tl-- M: 523 bp b^'tzibo N1056 NT2RP4000787 N1098 NT2RP4001159 '>-*-y7^-5T 0 -<7 3 —y) Nil 04 PLACE1004103 y n —y©3

Nil 64 PLACE 1005066 SiSSB^zF-it o-y. N1196 PLACE 1005630 n-y„ N1199 PLACE 1005698 -y-^yy^ * n —ycon

N1210 PLACE 1005834 ^ 685 bp

-182- 6

18 18 pi ¥f£ ll ¥ 6^ 30 30 0 0 7^ 70 100 22 22 8^ 70 170 63 85 9 8 70 240 87 172 10 M 60 300 33 205 11 8 60 360 65 270 12 R 60 420 66 336 ¥J& 12 ¥ 1 8 20 440 69 405 2 8 60 500 71 476

3 8 — 24 500

500

450

400

350

300 A 1 n — yWc/8 250 n <\ 200* % 150

100

50

0 nq nr^ nq nq nq nq nc^ irq rrc; I'lOtDNOOOOHWHMCO

m i Mmmmfevtu

-183- 00 o

Jl S3 03 o ^ J

1 ^ 4 o —

— 1 .. ISO o

D ySc — O o 1 — ■

| CO o n

J1 S3

| 9 O) o r

1 ^ o ID

4^ 3> 3

______1 ( to o

M

']*■ - ______□ - - ______3 3° ]*■ | o

<999 >6500 1000-1499 1500-1999 3000-3499 3500-3999 2000-2499 2500-2999 4000-4499 4500-4999 5000-5499 5500-5999 6000-6499

cr {0 )# ^ ^

7 n — 9. 4

^7°n vx X hOiKTiSC^A'J'y X xeF^0rt£W 500 X

zsc, m-m^%;(D^3.5fg(D#^7vy(:3^^3^. coca £*K '>-'r>zmtl

tj&mumLWz.? u~-y'\ 0 &tz,

*yo^%/7 fr&a*, i^ttii#l:toTiJ6THWi:^D, Sfc, 1&#£$fc*ofcj£l£ iRS^a^nTv^cDT, j:^ t. fa. 'b o

9 . 5 simam

(1) Blattner F.R., et al. “The complete genome sequence of Escherichia coli K-12.”, Science

277, 1453-1474 (1997)

(2) Mewes, et al. “Overview of the yeast genome. ”, Nature 387S, 7-65 (1997) (3) The C. elegans sequencing consortium “Geneome sequence of the nematode C. elegans: A

platform for investigating biology” , Science 282, 2012-2018 (1998)

(4) Collins, et al. “New goals for the U.S. human genome project. 1998-2003 ”, Science 282, 682-689 (1998) (5) Maruyama, K. and Sugano, S. Gene 138, 171-174 (1994)

-185 *10$ £&cDNA0tt£iE$IJ2*£. cDNA#e*$j(D#ak

10. 1 *$

b h 7 y A 7° □ 7 x 7 h ^'tUl^nTyv L cDNAt:: E "7 5 to & 0 S 7° □ 7 x 7

Ml f$ 57 6B7WCDEST (Expressed Sequence Tag) ‘^P^tATiLtS-h/OM T tA 7l o bo #(Z 6cDNA7< 77 u m^^y 7/\7^^gb?i , gwcamkZiTZ:##^ h h cDNA^mo^^

0 , ®&TiS vtfii^T^±5cDNA;^7 □ — x:>7*an5@$1to7x^?£T$> 6. ¥^!0¥SNEDO^3)k1iIE^STfTt)tl/t r#%%b hcDNAy y 7 □ '7x7 Ml±|B^&(CT#6nycb h^^ScDNAOSSEyiJ^S/O^

(D a U V 7 XE^L0r^T$lf#$nyc5,OOOi@(7)cDNA7 O->0^, 1,500# (D 7 □ —Cbo (D 5,000##cDNA7 $#&&#(%## b bo (D 0iy7hyX7X>y-7iJ>y(tt)BO*^T, biEcDNA7n->£DNAxM 7 □7 7-r4)-5)>6)’7)t;:T3b&0 7 lyl'±K:*£££ft3cDNA®r>t£PCR&8)’9)'10)(:%#*§

T, t0T$,^)o -eoS*^l5V^T> 7n-7o#m#m bDNA7< 7 07 —7T&6 bV^A^o WRtpO E ^-/Dyx7h^I«6§Bbtt, B*£tt h77A7oyx7 Me* € < SiSKT'# 67-V 7X^^A6Hbbm#LTW6o b(7)7°D>?x7 ^ ocdna^ch tb&gxJn ^ n& & 0 b ft t; t ^ & 0

10. 2 ***m*3

(i) A#cDNA(Dm&Eym% mmb LT77<7-&edtLT#x h 7 > < y7<7- 7^ —^ >77£tCl;o T^ScDNA^SSgByiJ^:#^ Ifco

-186- a. cDNA7° 7X5 A'J '7 □ —>#: (^UtD-Jl/Xh';^) \t2mW 77-)!/##:%# (L-broth-77lf^U>) L, h ^ ^ 0 7 7 X 5 H ## 11 Wizard Plus SV Minipreps DNA Purification System (7 P 7 X7£t)

S£U i&SE7J&)MIMDNA

-DNA^^#:MerMade (/i-f^'^-h7-y3>) (02) (03) (04) - 96A7lx — h%## : SX^hAhV—A^7<79 >%/X^|x-7- (AU 7 ^77 7/f yf) (05) (7^7h <7^) • ^$DNAJMSJE28% 7 >^7 T/k (a±, ^^yfk#)

- fr^^m<7kM^#7p 7^m>^7 7 H>/xmA - -&i&;am<7t^io7^m>:f i- 7 if - - ^8%;amA7=:y< k -^^^mG7$y< k -^^^#<^^>C7^7< h "&^^mT7^yf-ULSS CPG500A (J£*±, CTGen) - - fr^g;^%#m>77i/p'>xf7 (^^mib) - -&i&a§K<#7tmm^m>77°;i/ffiS>2mlW967X7V —7°7° lx— h (J£l±, P—— >7) • ^^§sMPCR'> —;l/ 77<7-DNA(D^^. f-ULSS CPG 500A £&7X40nmol^B b/t967X^S7°lx — h #LX:. Je0f£'p‘$cfi7oP h P;H-^^DNA'a^c^:ffp ft0 ^^^7#

j%m##(D2ml^967t7/ -77k- P >7lx 7 7-TEiN§^^/c^ m$rixj:7 k% hU-AM7<77>%/i^lx-7-(:j;085r(:mLTmmL, ff O fto %##02ml#967tf^ — 7 7 lx — h d 200 ZU(7) E® zK £ XL 105XS ® A $ £: SJ £ L ^ $ L fc0

-187- MerMade(:ck5^1#(7)^TkWl&T#%W^l#DNA#SX^#Z@X#i/% b Ac c. 7 — ^ — h i> — 7 x > +P — lc j; -5 ^ # BE (7)^ 0 a.T#6flTccDNA^'at;7°7 7a Ho.5% g 7x>7 7°7<7 — 2.5pmolS:^BigDye7 — 3. ^“7“FS^ "j h (PE/H^yXfAXft) AT, +7 F (C#&'flTW'5 7°[] b o;i/[C%Vx +h77;i/y —7X77^fb^ffo^:o ^^#(7)th>X;b^t7bUi:'y77H7^7A (7u>7b>-t:/^y-y3>%) 1: ckoTy;i/6]§, 7;i/xb T(7)7^,(7)m^m^X-b 7-7X7th-l:T##r Lfto X — b 7X71b —^7 7 77)1/7 7 X(7)ABI377M, 77 X(7)ABI3700 m (f^M^^PE7y7

7" — h 7"7x>it — TToTf#biiAtll$BB7J(7)^:r—^tilEbbiTKirAII/rSiST-';/

b7 — 7b:ToTl§'£\ H S A tl 7c o 7(7)AA(7)S#tV7 h 17 SEQUENCHER (GeneCode %) (gl7) (EI8) ^^^^i^cDNA(Dm*@g^^Moxxx7(:E#$^, ^\Uv77#^mri:mo, X — 7 (7) ppS^iHf-x y 7 £: S (7 A <,

(2) (/\Uv7 7#f) ^^:&cDNAx (7) {& IS ^#cDNA(7)m*BB^(:^xt,, ^\Uv77#S%Brj:0 7o-> (±j#m) 195,700## -80"CT##L/c. CH^/\Uv77#^m^6(7)m^l:%bTmmBB^^ 7 &%^BB^L^:. BBiWl b 77 77 7#^T0^##l-TfT^^:. b . # # - # IS 7° 7 7 $ b (7) iW # ^\ U v 7 7^^pfr^ 6#-^^l%^:7 O — >(D 7 t,, 5,000@#(7)cDNAX7 7 $ b Wc±##7UtO —)b7bv7$:12ml(7)^#:igm (L-broth-77kf7V7) lOOmlHA 7 77 7 7" (2 T 37°C T19 200# f^l ±g * L A „ T (7)#, JS X 5b $§ll£ (2 T M If \%l > Wizard Plus Midipreps DNA Purification System (70 7^%) ^mWTX77aKDNA^##L^.

tt^tlTcDNA^lOMl-f 74^(7)77 7 Ufa“ 7" (WSl.5ml) T 5b £ L , cDNA 7 D- 7(7)^#(7)A-3^7^)1/&M0, -80T(:Tl^#L/c.

(3) (^\Uv77#) ^&cDNA7<7*7U-(7)m^f5m##f -DNA7< 7 0 7 1/7

M'-k, SSBB^J A7£ ^ A tl 7c ^ U b 7 7 W^lFTt (7)cDNA7 d — 7&DNAT7 7 0 7 1/7 fb75^)b. BB^l##^b7°77T —^r^gtL, PCR%l:TcDNA(7) —gg5b^#4#L^. U ABB 7J£'a ^fcUS’non coding region b # m (7 7 1: M T 19 500 - 800^

-192- Contig Sequencher™ “T1441_FIN”

T1441 R06 <------< T1441 F C-MAMMA1003026.seq F-MAMMA1 003026 >------> T1441 R05 T1441 F01 T1441 R04-A T1441 R04 T1441 R03-A T1441 F02 T1441 R03 T1441 R02 T1441 F03 T1441 F04 T1441 R01 T1441 F05 >------> T1441 F05-A T1441 R T1441 F06 R-MAMMA1003026 < ------< T1441 F07 > >

1 312 456 628 990 1,178 1,530 1,784 2,064 2,276 2,608 4-4H III II llllll mil...... Til I III —1+1 I I III r—Ilf 11T I I II liT II ir ■-...... ' FFe-.... "• ? inT f ii iirr-^-i-if# ■f "Tin r Ff~nrri * Fi —f - n r n n n fiiru 1n~ i8 nun i p i if-ap

□ Hole in contig Diagram Keu ESI Single fragment Bumps on HH Multiple fragments same direction fragments Both strands show motifs, Both strands plus hollow r r- j .y ‘ | - Start codon frame 1 rectangles =T4^Stop codon frame 2 show features

h SEQUENCHER led:-5 — >T 1441

-194 ScDE^Sril/u/fo (7777-£jGfcl:MLTlilO. 2 (l)b.M) PCR^Ste, 17 □ —>C*f lth„8ml0;KS*WE

^^©^i^y^n-XSMitlTfx'^^n, 09!:** J: 7!:, QIAquick96> QIAVac96 (*77>#) ^m^T#i(i@DNA^##L^.

*SE 2 ft fcPCRitifi$11296:7 71/ — h #^t£S#SPECTRAFLUORPLUS (77 >#) 1: <2 D L, 0.5mg/mlJ^±(D#^(:^^ck 3 !:#$0 L, 967^7 7 7 □ 77 7 —7°

Iz- h±l:^B2ft, DNA7 7 7 □ 7 1/7 0*W £ LT 0 il V 7 h X > 7 x 7 U > 7

io. 3

(1) £5cDNACDS$ge?lJ&S A U y 7 XE2u0r^l&f# lfc5,OOO@E0cDNA7 □ — >0 5 ft 1,5007 D— >0^$gB ^#^^$ft6gB^m^^M^#i!:**o >7-1:7##$ ft *xv7$ft^#0#m^c,

Sf££7^oTVVc5T>0##M7 □->!;* 7O->05'*%a3'*%0W^l: U kf- U 77 7 7 —<7*-*>77£712gB3W#t:67$fcV}ft07&-3/:o (010) AUv7X#^mf6:0%m07X, 8U07n->!:mL#A^#0^L@$:ffVX #1,5007 D — >(7)##$r#7 L/h. A7□ 7x7 h0*3 !:7777-7^-*>7mi: j;6mmgB7J0##(: 12 7

7 7 7 —1 0 1:192*0* U 7DNA£ o' $c 7 cf &DNA d $g tH

“Mer Made” ©f ffiCi 0 t U 7dNA©4I©3X h ^Tlf6 C ^$$ft ^DNAl2E%L^ < 7 ft70 ££ >—7 x > > > 7# 0 7 7 7 7 — #Mi*6PCRMlb!:ft^7ft±< ^:o Mer MadeDNA-q^#!:*^-^^ @#0HPLd: * 6 7 D7 h/x°7 —>5:Eiil:^7o 7 7 A liDEAE-2SW£ffl tAfco 28.83# 0 t d 5 C ^ lA tf — 7 ifi ft ft ft ft, Cftftl*# (ACa&^LTWa.

(2) (AU V 7X#f%Br) ^^:ScDNA 77 77 U-0ite*%§!##

-cDNA7 7 77

a u y 7x#^pfr j; o#m*:a;tm,7oo@m0cDNA7 D-7#&mm@B^j##f^-A l:iiS6BS^fToZcBe(7)7 □ ->gB7fNm£S2l:**o #4,9777 n- >0gB7U£fTo 5,0007 n->!:#^^w012A U y 7 XSf^pfr£ D it $ gB Tfi $ ft 7c 7 □ - > tffo Z> ^ ft 7 ft £ o ffil:fcjSl:7 n->0^L#x> gB^#^e^gB7'J^^^M#l:mf77^ j;7 C^Tl L fco 7 o->gB%6MNr!:cDNA0l%#, M£0fc& !:gB7J## £fr 7 5,0007 n->0 7 7X^ 12ml0i§#X7-;i/7&5#7 20 U g~ 100 U g0 7 7 X ^ H DNA^t# ft ft » 4*07

-195- flMShfcPCRStt

VtVISi'Ar 25»

09. QIAquick96, QIAVac96

El ±ScDNA 1500^0-

Stitia WWf east Mt (%> 99.8.27 128 57 71 71 4.7 99.9.29 56 0 56 127 8.5 99.10.14 68 0 68 195 13 99.10.28 144 15 129 324 21.6 99.11.29 278 4 273 597 39.8 99.12.29 201 0 201 798 53.2 00.1.28 308 8 300 1098 73.2 00.2.28 240 4 235 1333 88.9 00.3.30 141 7 134 1467 97.8 00.4.30 33 - 0 33 1500 100

-196- GGCCTACTGG AGCTCTACAC CCCACCCTCC GATGTCCTGG TGGGCACAGT GC:CCCYGCG TCTGCTGAGC ACAGGTGCTC^ GCCATTTGCG CATCTGGGGA ATCCATCTGT GAGATTTCTC TGGGCACAGT GCTCCCTGAG TCTGCTGGAC ACAGGTGCTC CACTGTCTTC CCCAGTCTAA CTCCTTCTGA TATTCTCTTT TGGGCACGGT GCTCTCTGCG TCTGCTGAGC ACAGGTGCTC TGTCCAGCTG GGGTGATGAT GGTCCCTCAG AGGAAAACCA TGGGCACAGT GCTCCCTGAG TCTGCTGGAC ACAGGTGCTC GAGATGGATT CGAAGAACAT TTCGGCCTGA ACTACCTAGG TGGGCAYGGT GCTCCCTGCR TCTGCTGGGY ASAGGTGCTC GCACTTCCTG CTGACCAACC TTCTCTTGGA TACGCTGAAA TGGGCACAGT GCTCCCTGAG TCTGCTGGAC ACAGGTGCTC GAGTCTGGGT CCCCTGGCCA CAGTGCGAGG GTGGTCACCG TGGGCACGGT GCTCCCTGAG TCTGCTGGGT ACAGGTGCTC TCTCCTCTGC CACCCATTAC GTCGCTGAGC TGAACATGGA TGGGCACGGT GCTCCCTGCT TCTGCTGGGT AGCGGTGCTC TGACCTTCAG AGCAGGTAGG TGCACCCTGT GAATAATCAT TGGGCACAGT GCTACCTGTA TCTGCTGGGC ACATGTGCTC AACAGCATCT CAGGTGGGTT AAAGGTTATT CATCTCCCTC JGGAAATG6T GCTGCCTGCA TCTGCCGGGC ACAGGTGCTC^ TGTCTGTGCG GTGTGGCGCT CCCTGCATCT GCTGGACGCT CTGGCGCTGT GTTGTGGGAT ATGTACATCC CCAGCCATTT ' GGTGCTCTGG GAACAGTGCT CCCTGCGTCT GCCGAGCACT" GCTGTCCTTG GGCAACTTAG CAATGTGCGA TGGTATTGTT GGTGCTTTGG GGACAGTGCT CCCTGCATCT GCCGGACACT ACTTTATTTT TTAGAAGGAA CAAGAAACAC AGAGGTGAGA GGTGCTCTGG GGACAGTGCT CCCTGCATCT GCCGAGCACT CATAGAAAAG GAAAACGGAT CTATTTAAAC CTTCTCTTGT GGTGCTCTGG GGACAGTGCT CACTGCGTCT GCCGAGCACT GCCCCCATGG GAATGTCAGG CACTGCATCT GCAGGTGGCA GGTGCTCTGG GAACAGTGCT TCCTGCGTCT GCCGAGCACT TTGGTGTGGA CTCCTGTCGA CACGCAATGT ACTGTCTGTC GGTGCTCTGG GAACAGTGCT CCCTGCATCT GCTGTACACT CATCAGCAGG TCTCTCTTGT GAATTCCTCC TAACCCCGCC GGTGCTCTGG GGACAGTGCT CCCTGCATCT GCCGAGCTCT TTAAGGATAA AAACGATTCC TCTGTGGGAA TAATCATCTT GGTGCTCTGG GGACAGTGCT CCCTGTGTCT GCCGAGCATT AGTTTGTTTA AGCTGATATA AGAAAAAATC CAGCAGTCCG GGTGCTCTGG GGACGGTGCT CCCTGTGTCT GCTGTACACC GGTGTGGTGG CTCACGCCTG TCATCCCAGC ACTTTGGGAG GGTGCTCTGA GGACAGTGCT CCCTGCGTCT GCTGGACACC GCTGAGGCAG GTA:GATCAC GAGGTCAGGA GATCGAGACC GGTGCTCTGG GGACAGTGCT CCCTGCGTCT GCCAAGCACT ATCCTGACCA ACATGGTGAA ACCCCATCTC TATTAAAAAT GGTGCTCTGG GGACAGTGCT CCCTGCATCT GCTGTACACT ACAAAATTAG CCAGGCGTGG TGGTGGACGC CTGTAATCCC GGTGCTCTGG GTACAGTGCT CCCTGTGTCT GCCGTACACT AGCTACTCGG GAGGCTGAGG CAGGAGAATC GCTTGAACCC GGTGCTCTGG GTACAGTGCT CCCTGCGTCT GCTGAGCACT GGGAGGCGGA GGTTGTGGTG AGCCAAGATC ACGCCACTGC GGTGCTCTGG GGACAGTGCT CCCTGCGTCT GCTGGACACT ACTCCAGCCG GAGTAACAAC AGCACAATTC CGTCTCAAAA GGTGCTCTGG GGACAGTGCT CCCTGCGTCT GCTGAGCACT AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AGGCCACATG GGTGCTCTGG GGACAGTGCT CCCTGCGTCT GCTGGATACT T GGTGCTCTGG GGACAGTGCT CCCTGTGTCT GCTGGACACT GGTGCTCTGG GGACAGTGCT CCCTGCGTCT GCTGAGCACT GGTGCTTCCT GCATCTGCTG GGCACAGGTG CTCTGGCTAC N GGTGCTCCCT GGAGTCTGCT GGGCATGGGT GCTCTGGGCA, CAGTGCCCCC GCATTTGCTG CACATTGGTG CCCGGCATAG TGCTCCCTGC ATCTGCTGGG CATGGGTGCT CTGAGAACGG TGCTCCTTGC ATCTCCTGAG CATTGTGCTC TGGGCAC:GG TGCTCCCTGT GTCTGCTGAG CACAG:GTGC TCTGGGCAC: GGTACTCCCT GTGTCTGCTG GGCACAGGTG TTCTGzGGCA CACTGSTGCT CTGK6KMYAS TGSKCSCTRY GTGYCCTGGA CAMTGGTGCT CYSKGGATCW GCTGSKCMCW GCRTSTKCTG RGCACTGRTG CTYCCTGCAT CTGCTGGGCA CWGGTSCTCT GGSWACRGTG CTCCCTGGAG TCTGCTGGGC ATGGGTGCTC

@10. UbT-KE^J ( [ ] (DlttoMtimWiZU-xom

8

0 ll.MerMade DV

-197- ■i 7 ei^zl —7*1;:#*15 6 ft, fom\zfoz>o 1999^11^ j; <9 t%ifeffltfr(D7t£>\Zi&%££ tlfz? D->t?:i3t:^to

-dnav-t ^ny i/^r fFE^#j DNA7< ^7 D 7 l/< SS±tIDNA^|g'B‘$ tLX, ify^v

(C ck^)DNAT< 7 0 7 lx< ^(Dtc&\zs\ U y 7 X #*® % L 7z5,OOOft E:ftfccDNA7 7 7*7 > h^flEL/to DNA7< 7 UT lx< l,000#:^^##76(7)(C^g^DNA##*$930/Zg6f /:o HE(C7°rr v-^Ert-L> PCRfcfcZmifLfct d6i.8ml0^jSM^^DNA0^^ iSS5:ET^30// g^±(DcDNA7 7 7*7 > h#*#6^16(D(1A#:(D77%T^'D 71 0 (D23%^30^ g(:#/z^V^###T$,o 7^#*, £ n £ (D 7 U - > £ mZ$YCR%. £ #10 @ CO P C C5 A#:(D94%^B#(D30A6g^#X.^DNAT$,o /Zo U, 71 0 (D6%#NPCRKf&&fTo'"C 7 □ — >-?& 0 , :9Vgf:7n- ^3^6 <^%a^6DNA(D2^ mmznz.

E2

1999$ 6fl 68 78 88 9fl 10fl ns 12fi Baawm 0 889 911 0 0 638 33 2471 B3fctM'x>X 250 0 0 119 0 128 4 501 B»«6 250 0 119 0 0 131 3 503 saa 0 750 356 0 0 388 8 1502 £ft 500 1639 1386 119 0 1285 48 4977

E3 ##E##(0y=A(DcDNA/7O->#i|l&% # 1999$ CM o'" gnc 11fi 12fl 28 38 a# 8 29 42 22 17 118

-198- io. 4

(1) ±ScDNAC7)SSSB^IJ^S /\u 7Xx#s%mA^(D^±mcDNAmmi,5ooxn->(Dmm c a^Te/c. v-kmi7,ooo*T& o

& "tf7£ 6 > W^cDNA(DSNP (Sinngle Nucleotide Polmorphism) IT<5 iFf til A 7° D i?

6o DNA^^t:^ ^Tfe^M&6T96A7 lx- F ±T?0DNA^^ £ UfrfzW. &G^, 3XFA7^—7>X, V^Ta^)#&T#^)IxA;F^]#LTV^.

MadeDNA&G%#(Dck 5 \ZO. X A'-+f-;F7j 7 A VA 967tX< X-7°lx- F±T<£>£-/&

oai Ps&px fy'yyizfc'Dfco &GSXX--;i/&j;0/J\3

(2) (AU 7XX#f) ^±^cDNA7< 77 U ^###T -cDNA#^A#(D## F L^Dyx^ F (D@g^J^^^%A(DcDNA#m^, 7U -bn-;FX FvXCD^^mO,

^^(:cDNA##^fxo/25,000X n —6##L^y7X $ FteX X 1x7 —if 7 V — O^HT-SOr^T^^SnTV^o mticDNAX D-XDgBTiJIWffi^H >X-7y F± TJE:A^iE£toTVio£:l#'S\ B*fc7tT'& A^cDSA/^ttiT < &

Jf:, A#EAFqTll77X a t & 0 . Z.tl% #m^cDNA^#AA% (c##A 5 fc #> T% $> 5 „ FL^Dyx^h^tl^Sl, b F^^:SST^cDNA»t|f-3^ES#f %T$)6o ^n6(DXD->^ B A/W 7 T U 5 fztfT: b b FXVA7pDvxX FC^W

A^Dyi7 F #c, AJUB^mi^xx ^ f t LT&ft-£nrzcDNA\z^f&fefcb(Db$>y), mtz ^l^XX l/7-tf(D#(A*j##y7X^ Fb LTt)^#TA#T&6. DNA#^t)Te-5^:^l#;U^mT-80r L ^myoi/xX Fe##(:cDNA##, #@L IM0J©li7°nyi^ F lx A jFTOjf iI£:Mfro

-199- (3) -DNAT-r 7 O 7 V'iftW&M 7 0 7 1/7 C^^$#^cDNA7 7 7* X > h $;PCR&C T(DATX V v h#&a.

• pcr7°7 7x—£g6cxif7 >T^>cacj; ^, ^ioTmt^-sc&&cfc 3cm

• iStelE^/^^U A £: |^ V^ /t 3’non coding region^ 61#jiilillfa7CS‘iE> 500~800^$T' $> 0 , CmiDNAT< 7 07 1/7 #SCW*iJfc^$T$>o fco T^if ^6 , )^^^##Cj:0mRNA^6-'*mcDNA&^^'#-'5l^, #^^#T7A;|/-76X)C C 0

L^L, C$nfco O >Ca.-7 V7 H: j;CcDNA(DE^J##^6 (DPCRm 7°77'7 —±#:(D6%@(D7 P->fiiHll^il< 'PUfry fc» iiDNA©2^iife^#LTt^^fein^:I/^^ =b 3-3(D*S^EB

3j?UAjfi<©3,fliomsBa^i«A, tu y7&ge?ij^< , 7°7 7 fc» PCRj^SBecDTr:- U > 7*<7){S~F^PCRjS fc H ©i 3 3’fi!l©cDNAWf>t©mtiC Zfzt) ^'f, & 3 ^ c Gfia. ^my°u vx7 hc^^^DNAv^ 7 0 7 c<{$mcmLT#&±^%fmmA#e(7) f##X 7" 7a. — j|/ C & o Xio ®S> 777071/7 $rf##"76#^(17 l/7fbT6TAi:(DcDNA##^#6^lT^ £#J47 £l/^ L> cDNAQgg7U##^#6tl^: t (Dll, 2000¥2£ <£>5fcT A 1J '7 7 XE^0r^W(D5,OOO7 4,6007 D — >ST$)0 /bo

^o Tto DNA7 C7 (D##(D^#^ 6, ^fl&X^y x-r >71, 1,000%(DDNA7 7 7 07 1/7 $r2000#3^ ^TC^776^:&7C11, 1999^ ll£ (D^PgTf AT(DcDNA7 D-XDSXgB^iJStfr^^7LTV^n^aS^)T$)o/z = 7 D71/7^B

10. 5 gima#

1) Adams, M.D., Kelley, J.M., Gocayne, J.D., Dubnick, M., Polymeropoulos, H., Xiao, H., Merril, C.R., Wu, A., Olde, B., Moreno, R.F., et al. : Science, 252, 1651-1656 (1991). 2) Tllii^O^, ###^ : 11 : 2491, 1993 3) ^Ulj^a* : #r#^%#^#7°0 h 3-;l/, pp 179, 1993 4) Lipshutz,R.J., Morris,D., Chee,M., Hubblell,E., Kozal,M.J., Shan,N., Shen,N., Yang,R. and

-200- Fodor,S.P.A. : Bio Techniques, 19, 3, 442-447 (1995) 5) numlz, Vol.43 No.13 2004-2011 (1998) 6) —Vol.48 8^ (1999) 582-585 7) Vol.4 No.l (2000) 106-112 8) Saiki, R., Scharf, S., Fallona, F., Mullis, K., Horn, G., Erlich, H., Arnheim, N. : Am. J. Hum. Genet., 37, 172 (1985) 9) Saiki, R., Scharf, S., Fallona, F., Mullis, K., Horn, G., Erlich, H., Arnheim, N. : Science, 230, 1350-1354 (1985) 10) Mullis, K., Fallona, F. : in Methods in Enzymology (ed. Wu, R.), p.155, Academic Press, San Diego (1987) sms DNA7-C?o 7

02V7 h»X7X>-Vx7 U

11. 1 «»

h HcDNAtoWFEtt, -MiSISWTSflST./t «$i'0j;5 ^7)i#@6AoTj3 0 , *»©««£- «x7771/ rfe (DDft) " < >X7 XX@ (MlS) 21 *5)£7>gc»lS.< 75: 0 . mtORfcmftlzmffltffrfr&fz® ±®.mts.%.m^mmtitt)mMT»-otc» cmV^®@^nJig75:?SSae?*ffffi©Mfg*i*5T

<5 o DNA7 -f J n 7 H’SSIJ, X x< H # 7 X "?> y V 3 >*ffi±C*M©DNA:»-T£i® $-$icsm. 0${fcs-it. &RNA^6m#L^7-y v hDNA&/\< vu -->3 >5*. /W VJ 7 h'»6$©36$£$St;:UT. S-*C?(D5SS1f «£tltUT51 W7*axx V-x>X'$&T&a" 86. ©ISM#!** e.ttT£ D, ^f$©B6B®r3S'x©J$ffl»tXimSftTV>5. $fc. cDNAH6#:#igc^y:o

^#K6©^f 5C ^-tilcHi-fn -X7 (^^ftcDNA) ^e.ytA,«. fCTWJJISCiHT, /\U 7 XXE%9rCT#6tl/cb hcDNATH Vx V ©T ^e>. 5c4:ft^ttSS tlZt5,000fl©cDNAX □ —ySISWC. S4ftcDNA?'f7n71/ 'f66£j£«©ffll ii. 7-f707H lcJ:5%81W#f6#£®$S5Ry:S6B£ i«a >KJS*fr£ * bl, *ef«a*#fssiit5±t, s-asaeicML. tie©j:57^6* 1% 5:$3B¥ffi$-e©raCff V). 6b8L7c7 ?5:iSS*£ffi L 15 £ t £i*|gL7c». *S©7T X nxi/'f 6fi86$$5li6t'5>. SfrW7j:$!0i@*I:LT. (l)DNAX'f X □ 7 VT (2) AT 7'J^-ff- ->a>Rs*fr«w(3)$stt. #8#©w*(4)meT*8##©$E&ex.

-202- SSII Super Script II Reverse Transcriptase (18064-014: Gibco BRL) DTT Dithiothreitol PLL Poly-L-Lysine dATP 7^4^77f 7 7>hU7^X7x-h dCTP y>h V7X 7x — h dGTP 7>hU7^X7x-h dTTP y>h U7 7 X 7 x —h dUTP 7y ^ U y > h 'J 7 t X 7 x “ h Cy3 Cyanine-3-Tyramide Cy5 Cyanine-5-Tyramide EDTA Ethylenediamine tetra acetic acid Tris-HCl 2-Amino-2-hy doxy methyl- 1,3-propanediol + hydrochloric acid TE buffer lOmM 2-Amino-2-hydoxymethyl-1,3-propanediol + ImM EDTA DW sterilized distilled water depc ^Mdw Diethyl Pyrocarbonate^SDW SDS sodium dodecyl sulfate SSC Trisodium citrate dihydrate + sodium chloride

11. 2

11. 2. 1 DNA7 7 X □ 7 1/7

(1) DNAm&fb&m:## X77 X77 M7f7X0^M.

^±#<^#'7^). ^CTX77 M^7X%i;DNA@^m7-x^ o/b. ±#(D'77Xn7k7&fM'76/:afK ^#^3^ M^7X^A^W#T&6 7. (D iM(DX7 7 M7f7X(:PLL7- h (2) ?^)PLL7—h$7lTr#^^^lT^/5X77M^7X (2@E) d) f^)77>7-h$^iT$^^^TW^X77 M^f7X @ ?0POLYSINE:]—M^7X

& ;7?£ £ TI2 H tkTo (a) PLL 7— h Elicit" cL 7 lz, X 77 K#7X CPLL7— h^lto (b) 7 7 7 fflDNA : 4$ E O DNA ^ PCR7: ±i ifig fi, 7J1/7— TEH '1&M L 7 7 7 J&T £ » ##^H^T0.5// g/X/iaT'-S).

-203- (c) X “ tf "j h DNAPI $=£> A X* U y "f *il — V 3 > : 4@#(D 3 ^2##(DDNA^, PCRJSJST'if (fit* § 1$£, Cy5-dUTP (PA53022 : 77yt AXy^XvX) (1^2). &Cy5x^;i/DNA&A?SE43^2jUgTo A< XU y<&ff X. X~ X> hDNA*Mtel5/zl 4X HSS4XSSC> 0.5%SDST\ A< X >J XW if—'> 3 >^overnight (l4hr)> 65*C T'ff X o t) A— X X X fct24mmX 24mm(D t CD £: •?> o (d) : 2XSSC, 0.1 % SDS SB 20^ 2® 0.2XSSC, 0.1%SDS SB 20ft 2® 0.2XSSC, 0.1 % SDS 60'C 20tt 2® 0.2XSSC, 0.1 % SDS SB 20# 2® 0.2XSSC> 0.05 X SSCT U > X

(2) 7l/<(D^m#^m^(D## xxx H^7x±t-@s$nDcDNAH> x

<£> <5 o

::Ttt fMtx^62in> 2*eisk 5y^raiisjaufcrwsr, xuxx-tf-xax^^^ev^ (^3), 7i/<(D^#m^^##^ §0

11. 2. 2 A^XU^tf-va >5J6*#M*W

(1) X—y V hDNAi&#(D%G%(D#M A< xu x*x y-xa x^^cm^ax —yx i-dna^ecde^^ov^t,

#*7 a. ^TC^t3li0a^tA^^uyt-y3 >£fiAx

Condition I Condition II Condition III 2 X Denhardt’s solution 2 X Denhardt’s solution 4XSSC 4XSSC 4XSSC 0.3% SDS 0.2% SDS 0.2% SDS Prehybridization:65 ‘C lhr Prehybridization:65 ‘C lhr

(2) X —y v hDNA##(D#^(D## y-yvhDNA^^(D#^(:3WT, x-y^hDNAa (D/\

205- cam> SSII (18064-014 : GibcoBRLh SS Buffer (SSlI##) DTT (SSII##) dTTP, dATP, dGTP, dCTP (27-2035-01 : 77ytA77)Ky7) RNasin (TOYOBO) FluoroLink Cy3-dUTP (PA53022 : 7 V 7A 7 7 ll/V'> 7) FluoroLink Cy5-dUTP (PA55022 : 77y t A77^7y7) Oligo dT(12~18)primer (18064-014 : GibcoBRL) 0.5M EDTA (pH8.0), IN NaOH> 1M Tris-HCl (pH7.5) TE buffer. DW (OTH^zKA DEPC DW 10% SDS» 20x SSC, Microcon-30 (42410 : Amicon)

E2. RNA^#^6(D#^7^U (a) tA <5

-206- (D (SSH) ^m^TmRNA ^6##cDNA i. (i##7<7n7i/

mRNA 2—5Mg 0.5 M g/M 1 oligo dT primer 1 M 1 Solution A (7 >yAY7< "7 —(D# lMg/M 1 & 1m 1 ) DEPC ^IDW II

(o'at 7 m l ) 2. 70°CT' 5min 'i > + 3.^ — h 7 <5 o 3. Solution A 5 M 1 SXfcfo Buffer 4 M 1 dNTPmixture (2mM dTTP > 5mM dATP, dGTP, dCTP) 2 M 1 lOOmM DTT 2 M 1 40U RNasin 2.5 Ml ImM FluoriLink dUTP 2 M 1 ('□If 19.5 [i l) 4. ssn 1 Ml (200U) 42*C T 30—40min + 5. WS SSII 1 Ml (200U) 42 TE £SDX> 10-20M1 (:##76 (8000rpmX $

2-3 BUD jSLT7S1$<7) dNTP

^ SI £: fiB SSlT <5 <>) $ < (AoTWfUi, Micerocon TEl0#f: 7/ ;i/^ —1: Cy3 t) L < 11 Cy5

T7 ^ 11/^-

El 2. RNA^#^6(D#7^^^U>y^^ (b)7"nt:7

-207- 20 X Denhardt’s solution (50 X Denhardt’s solution ! D-2532 ! SIGMA 4XSSC, 0.2%SDS, 10%SDS, 2XSSC-0.1%SDSY§^ 0.2XSSC-0.1%SDS##L 0.2XSSC, 0.05XSSC

9 — ¥ "J b DNA (2 20 X Denhardt’s solution 2.5 /j. k 20 X SSC 5^13: jbnx_> 2 X Denhardt’s solution, 4 XSSC, 0.2%SDS, b - 9 ;k 24.5

10%SDS 0.5 ju 1 %1]Px. 0.2%SDS Kf 6

7%

60~65‘C“C& 14

'i y j. ^ — b

o .i%sds 'MMizm 6

2 X SSC-0.1%SDS 20 5^1, 0.2X SSC-0.1 %SDS rSETSim 20

0.2XSSC, 0.05XSSC >X L, 600rpmX20sec

Bl3.

-208- XCX#* ©#%#&#&/: A. -E-n-e'tlZUg. 5tig*(DmRNA^6s9B Ufc^-y-y h DNAiM^HSU 5>„

(3) A-f7'U^-f->3 /KtBBlrelro^W HM&/W 7'U 7'7-f-7a >RtBmffl£'iwt>fzisb< fii-ffti4S#r«flRtf4omPBl|Bl- AftT'^JSt-tiZiSiRS, JtiRtSm-fSo

(4) /W 7"'J 7*-f +- -> a >£)£)*, + KJS©7- + 'y hDNA**^*W&f fi 7 »s. 7 X 5 7 > K y l'X»t,«< &ri, ->XXX h'W«gij75iBlgC*5. $fc, a, 7- + 7 h DNAiS#

Condition I Condition II Condition III 2 X SSC, 0.1% SDS 20min RT +0.2 X SSC, 0.1%SDS lOmin +0.2 X SSC, 0.1%SDS lOmin 0.2 X SSC, 0.1%SDS 20min RT 55T: 55°C twice ii. 2. 3 sse.

DNA7-f ?D7HO*ffllt0taltLT, — effofcBic. ei-7iemAt!#6nsxt ($$tt, Hint) 7bKJ*£mx fg*® AIM £ WAX-5. ffffittt. AVEDEVfi (T-7±#ro¥i9flfcMX5^n^-'tl©7- 7©*w@m©T%3©#) spiuifiTt-fsan. *®ssT«sn5.

IzX - X

11. 2. 4 iS^^^iEMt/fOUE

hDNA&/\

Human PolyA RNA (Human Brain whole) (6516-1 : t7 □ > 7 "J 9 ) Human PolyA RNA (Human Liver) (6510-1 : 9 U y zr y ?7 )

-209- 11. 3

11. 3. 1 DNAV'f X P7 \s4fti$L$kQr1&t*

(1)

&X 7< H ft 7XC7 Iz-f ffiDNA5r@S{(: L, Cy5T*HI$ L fz X — 7 y hDNA£:/W X 7

V 7 h XxXtCTfiWlXzSS^ PLLH— M&7/PLL3 — hlllOX XX M X 7 XT#

#^xxx;i/(D#m^mm$ti^. ^/z, ^ti^tim^^T^m^ffo/z^, PLLo-s tick 0 , PLL3 — Z £ t Lfto ck ^) (b , ^S@B7JS(DS V^cDNA;Sr7 XX X S S nXd:PLL3 — h , %g X U XE^'J $r 7 XX X ^ ^ 7 X X 3 - h L T W ^

lfccDNA©¥£Jj^8E7J6ti800T$> 0 , XX£ftb: 43 (aT t> [rJII£>jg^ t fc 3 fco

(2) f^^#, ##L ft (E6)0 Signal/Noisetb(D^^HT&(Dck X 0, 7 XX CDEffl ME^^27 £ liXT'$> S ca^mm^ti^.

2week 2month 5month

S/N 1.00 0.78 0.35

* 2weekT(DS/N(D#^lB:L^:

11. 3. 2 /\XXUXXt!-Xg

(1) X-yyhDNA%M(Z)ma%(Z)## 3@#(Dm^(DX —7\y hDNA#%^mWTAXXU XXtf—Xa >#, 4E-tlf HE£ltgeU/z (07). Condition iX^-fl/NX XU y F®M$/Tlfco

(2) X-7ykDNA##(D#^(D#2t ^tl^niflg, 5/XgMilbfc7-y>> hDNA/SE£:ffl WCAX XU XX t: —X3X$rffo^: (08). AXXUXXt: —X3XH03(D^^(:Tffo^z. XXXJk ^ SR 77 Signal/Noise kliTi ©

-211- 2Ug 5Ug Signal^* (?5) 1 1.18

S/N (f 1 0.18

* 2ttgT®Vi9ttS:li: Ufc

(3) /W 7*U y^ff-'/a >£iSHfP»l©tlW

^n^enumm, 40mm a y^n-otc (09 ). t*© i$5M=t: 0 , l4£?ffi®£jS#r=W#8T'*5 £ttB£tl.fc.

14HR 40HR Signal^S C5?#?) 1 1.13

S/N (¥i^) 1 0.08

* 14HRT®¥^ttS:li: Lfc

(4) *#*#©#* W|-*ftTVW 7U >1*. 3#m®**im^Ro^: (010). T*®#8$ j: 0 . Condition IC»Xgl)iiat*l.tIi3nfe.

Condition I Condition II Condition III

S/N (W) 1 0.36 0.19

* Condition IT® Wtt£l

11. 3. 3 s$tt. nattrosfli

ffiSt X'f-E-i'a >L, f®@*® 4>^6. 40 ® X tit y h (XD->) Sttlllia, #fSctt£ffiBLfc (Ell). Cy3t» f ^Cy5®±$®«6oe%#M T#® X CT. W«ICti£- j'my-Stesaef clone crta, #©»> o xniajt-yy hDNA®a*#m# ^©StCiSfe© 40©XiJty h?)i, ¥i9*6*i6S3.4©iBjRT$)eLTIo 5XtiO. 7 l/'ffMRUi/W XV Xl-"!?--> 3 RattSSLit x

CLONE A CLONEB CLONEC CLONED

AVEDEV 2.76 1.74 7.06 2.15

-213- li. 3. 4 iS€^%$llW«rroStE

^ L^n/k^C^LT, 2@ 0 7 — 7 '.y h DNA (Human Liver, Human Brain) T^/\77U7"7t:-7a>:H7W (Bll2), CCT#, e^0#^#0H:^#777 (^1207 p(5) 'Tf'7K$tlTV^-^)0 Human BrainT'0-T^IE L T iUfsT^ Human LiverTI'0 "7%$E l> T (A 5 8697 ^%7W^T^mLT0 5 86f ii. 4 n. 4. i $am

(1) DNA77 7 □ 7 1x7 f7^^#0#]L l%$>^(]5,OOO7^vh^##L^:71x70f#^ (%@:77 7M^770O-7^>7\ DNA^&, 7lx7f^^, 0^###, 320-350 7l/^moim^#J^ (#mom7k7/ff ^7 lx< ) \Z¥i%93% t?£otz (l[Hl0{7ST48%{7fi£)o Cft £ <£ 0 , #%0#0DNA7 7 7 0 7 77 &3:Mlz{^j$T;5&'i$TM$gffffl£AL£tifc0

(2) DNA77 7 n 7 lx7 &1R 7^867%###7&0#mE *#^M^(:T^^L^777o777T,

(Rm%:ioo%). &/cd)c^^T, 5,0000m 6f & 7 77

II. 4. 2

(1) DNA0^#fb *#5%M%0DNA7 7 7 n 7 1x7 £ l,000%{7fi£T 5 f 0DNAC3 7T ^30^gT3#m L ;M# SV)If >:/;!/&^#W$M0m(Atf-> 7 7lx7f#^1:^g^:DNA^^a#0DNAT%

(2) #7%##

215- (3) v 7 s o % 7

11. 5 SlfflKitt

1) P.Liang & A.B.Pardee (1992) Science, 257:967-971. 2) Kato,K. RNA fingerprinting by molecular indexing. Nucleic Acids Res. 24(1996) 394 395 3) DeRiesil JL, et al: Science (1997) 278: 680-686 4) Lashkari DA, et al: Proc Natl Acad Sci USA (1997) 94: 13057-13062

5) xmw , roNA7< m a&n:#, Vol.18. No7(1999) 1050-1056

-217- *12* ft*cDNA©**E5lja*Stf«lllg*#flCWr SttagffiRg

ttssttT'f->y • aatamam

12. i tes

t h£tKL''x9 h»s, g6S%C#i§sn Tjso, is©#*, StiT©5„ 4-#«yy *e^©«*«¥MC ©#«ib*sic^oT< 5. r®±5ft. LT©t#xm##*#©#ai:i«] It. *E%Ttt. DNAx-5'^-X©{t3ilC?ey;#< h ScDNA©^ftS#E?lJ©ft£t cDNA©«EStf©tc6©SSS«©W%l*%e*S6Ltc. **69CH. ^■fdDNAE^Sf =t OftiSnStltch HfrSftEcDNA^ D —>618©, f ns©**E9ij©i*)£s>t>

9E5fc ©RS Srft V>, ## KCKl4000#S©ESE©ft$* (5kbH±) 9D —-etl6f©SB»M #SE?'JS:$6U. aft^©[*J*9l200y D —> <1998*81211 40 $61172:? n —>. -6-H-# 6Mb«*») ©^seE^ij^ftsu. ^n?.©th^M±®ftB, b»SLTHji. Ctilt. SSt6WT*±SSE?iJA^St6dtitc b h S^ScDNA© l$E l3%&a#T©a. WC, E*a°pfflM^IC075:*S5Bj#6tt*tK©tg)bn55kbK±©ft EcDNAf □ — > t®tl(i^-©it^(i50%ie< Ctl B © :? D — >© * 1C It S S. «*#j*t<£t'ig*±ffifeTSifflM«tt*5iS^tst3n?>ife©A5^inT^5sns. *E%Ttt. d©J: A IC*EcDNAT16#©5fe#$ff < A>rSDNAE%9f©b bSrSft EcDNA;? D->S:1SU. f ©m%E9i|&*%L. #|g)@1ff©tc«e)©®*fi*t LTftE cDNAto^-y y-fbKMt"%SrSSE$E%L, t hcDNA©eSmWx4flJffl, SE*SW*^®iISl;tit5bOt«5.

*ef%cTmeLtcflf%i*isi$*©aoT$% ©SJSttE^J^t^ 7»'"f SDNA6f%0T j: 0 #fc2000H©SEcDNAt7 D — > (?^5kb) ©SB^tE^JSftS U £: = b. SSSE^JttSrotcebto-tty^ d —- a.©®#E5Ui@*?!bB. #Mi^«$nsftEcDNA^ n->i20«£jSSftU 6#*E LT. DNA*$-fi<-r^-tty^ □-->y$ffotc. c. SEcDNA^ n —>©SS*E?J©#t$ n->©S$E5i|SSSrSI® L, ^©<6$^ BTttofiEcDNA:? n->©£$$ Vtc.

B. SEcDNA©6 v yyt©fc©©&riSES8*©R% fSBtltccDNASffl©. ^1®. DNA^-y T'l; LT*aA'©*»(C*W

-218- *'f 2 D N Ai7<^7 V - ##(7) Hy 7° £ It < t KSIIcDNA'7-1' ^‘"7 V —

r;SIIcDAM ££-\

M0CDNA (D^y 7°1t(7) (2000? a-» tc sb

l •§ "j yit t Hfr^iSIScDNA 7p — > d207n-» 1 y i mmm^ L. t

m -1 t himssicDNA(7)ME5ij(D)*^:^

-219- 3. bri'b&ai£, SeSSffiW'f yj LTUVfcDNAJBgt^SSH. @£

@ffi±C®£oTtt#bTi5D,

HCi^n-f x*U AA-feT-va >/^biE^&£©F=mrt5*.-5„ PCRAg<7-®1t fliJIC*®S6F*$:SAt"Sfrflll5kSSSii t), AMcDNA (5kb) CM bT liif ♦§5(C>41

CSX7y7®*%&%0&tl ’t® 5IEmmc®-t&#mtl:t))^bT*6. M5ttTIH§% b/eH*m#eg#®7A;i/®mt3&»gi$S®m b/e#^$-$®®##R#rii, t® =k

3 7k#^®DNACMbT&$®##A^fi#E?&6. #C. It (MASS® Tj.® gig?#®

*MigT$.o, atidi'io&taffjscti's, *®%*@®mms#A-c#Am

®DNA®*®#toC58bT® 5 t^gSMS (02##). f CT, C®igr)g®t#$S-

*Stt6fPS#i®W?£E%C, 4-0toSEcDNA®^-y Xfi®$S&« t bT$ 0 »/etc.

£ 6C, *®#6ffSb^#@cDNA$0fB*HC0$fcb, C ® @ $(t b 7t 6 EcDNAC

?&m#b^yo-ADNA&/\<^uy-fxst, aix<

«C®GEAAc (03##). a. #EcDNA®#&# —^®##K#®#%

£ a * E ?l) £ ft )£ b te *) -f £ DNA®f 9S m ® ft E cDN A *' «8 m * « £ S 5 tc tb ® 7 ■;> X ftC£.B&B*a*ibT, ftWcDNACSbfc*®Sto®SrSSE$E%M%bte. b. *S#fil|rcDNA®0tl'X®@$ftft«

#&#-**£## b^AEDNAS. Xf 7X&t'0fi±C3S0C0$1-5&#S£@e

'fbb^m#DNA^6®**^y±;b&#aif-5&#&M*b^.

12. 2 fi*cDNA©£ttSE$U®3t£

12. 2. 1 AEcDNAA □ - >®ffl«E?IJro95$:

(1) *g**m*5J ;«:*»

SBy-E^lftSffltoftEcDNAi? d — > i bt, h h ®Z>R®4~5kb® AE:ftiB©Br1t£

'<.i'i' — CSS If £9 p —>£»> A £ DNASES J: 0 2000® A # b £:. £©2000®® X □

— >®z:aEDNAS96X'l’ — XXlJPXb — b£ffl®T, 7)l'*U8CJ;9#titibA:.

SBilfcDNAttXAXXTt/t —X> — b (5 U # 7 ttMAFBNOB) £ffl®T*##b£.

CBEDNAS ffl ® tc > — A > > > X te. Xp —At bT5’®l(iM13P7FS3’#HiM13P8R £ffl ©. BigDye Terminator^ — S>X)b7— A 7 /'I A) C =fc £ X X A IP * > — X>>>X£R®, > — tX>t-ABI PRIZM 977-96Sffl ® T#$E?IJ ® 0 £

Rote. (04#.#)

(2) j=#*#

2000®® A n —>ro5’*Sfii|t3 ’*SflJT¥i9E*$ D ftd500bProsy-#SEti]SE

*8t5 £ t^TS/t. fib, 34X p —>12 2)® T tilt*®*® ay-Et'lb to Eire ttb'T

£&»'o fc. #C3’*Sfli|HpolyA-tail7)t#6LTi5 0 . £ ft*)* > — A > X R ft £ ItS

Bctco. E^mnti®fc®7)saiT^tc. hs&gcvv?-m&ts.ewir&t&t.

-220- 5’ 3* 5------3' HHcD N A (-SEDN A)

Klenow fragment##^ J: 3 VdATP<7)m&

1 ) M6T'S 5 D N A, l^l U%$T##A^W#T&^. 2) imnmmwiktimbT. 3 ) #69ttz —*ED N A, (DN Af 7/) 1CSUTV'50 4) P C R fctffi3ld$M6T'jl 6 tl 5 5 X v -y ^(TDSStD® U

m c D N A

m-2 Hdcm&m'tDNA#

-221- 7 AMU

0

Tj--7^ if—

:HcDNA DNA77 U —

#cDNA(Df v -fimwm&m - 3

-222- IrJti \z 7 — ¥ > X fc £ £LfJK 0

(3) vx y- 7^-7 ^$jm LT#m##7 o-7i2o®^m&¥^7c

12. 2. 2 SEcDNA7 □ - >

(1) ^^E^J'f-7^6#gUL^:120®(7)h: h^im&mcDNA^D->$:mVX 7 a '7 7&1:j;0AmmE^(7)^^^ffo7c. m%]^mEcDNA7o-7(7) <>th-m##TDNA^M-^mM-{bL^. Bftit bfcDNA$?#-£:, — ¥EDNA£ ?# & 7c $ ^ 7 7 — <7)M13mpl8RF<7)SmaIlM MC+h 7'7n“7b/c0 fe t % — ¥E DNASrtitti U> 7 — 7 7 7 7 7* >K S (C ¥V Vc o 7°D —y a LTM13P7F^¥ VX BigDye Terminator^ (|£ j£^fr/\° — tr >XJl/7-7 if

/\°7) tC £ & 1M 7 ;l/ • 7 — 7* > 7 7 7*£It ^ 7—7"X 7th—ABI PRIZM 977-96£rffl 0 /c. # 5a^96#(7)mmK^f-^ V 7 E 0 Sequencher(B3fV7hX77x7U7y) &mWT7t777l/L, x v 7 ^ffV^S 15^0(7)1%■7-® 0 5 7 <7) #E JE £: fr o 7t o

(2) 120® (7)70 — 7^3# 96® (7) th 7" 7 7 — 7* > 7 > 7* £ ff 1A 7 ir 7 77E L7c^^, 06K:96®(7)th77O —7(7)7t77;i/$&#(7) — M£r¥T = 7-fc 77*;i/£:htcE£|Jtit 07Efx —7 —7 7 E &&0, ^(7)j:7^#^, m*E^JO##x-7^B#1:=k(97x'y7L, m&E^J(D# Gcuv7m#^^mm@g^j(7)@mi:j;0'U-y7[]-7 m*E^j(7)m^^mL^7--7t&o^. ##%&# 7-7-777y&#^e3T^^L7h.

(3) ^m^j;^#(7)##m# #^i#fz;f(7)##7 O — 7 T <$> 120® <7)§i cDNA 7 O — 7(7)^:^%E7J(7)^:%&^ 73vh^7&^j;(9^7$#6C«E^T#^:o 7-7-7777"(:^^T. TC^Tf- f&sm&mfxDmmmus.i (##) thy 7 0-7^^ 0 (Df ¥^500bp 70 — 7^tch)(7)7iz7 77ELtc1t7"7n — 7E ¥^91.8® 7 O — 7^/c 0 (7)±%&E7J(7)#$ ¥#)4509bp S1ES 500bpx91.8/(4509bpx2)=5.1

-224- t

t

216 364 512 662 826 976 1,152 1,366 1,592 1,802 2,006 2,214 2,424 2,656 2,904 3,106 3,338 3,580 3,790 4,010 4,319 i» gf, r K =F=F I ■ I "I- I I I - I II 11' ’ H-l- -I—-I- F==RR -»------1------1- =F=$ 1 ■ I -I------h-

! □ Hole in con tig Piaqraiw Keu | EZ3 Single fragment Bumps on ! I fill Multiple fragments same direction fragments {^3 Both strands show motifs, | | ■ Both strands plus hollow j ■ * | iii Start codon frame 1 rectangles | I =5=F*^Stop codon frame 2 show features;

W 6 V3 7

-226- i2. 3 recdna ^v

12. 3. 1

(i) h*Edna S/& M13mpl8 RF DNA5:MESISSnaB ITili^ \Z L fcDNA$r tf* t, T(D7°DNA%frM'(DM'#j * ^60SS 0 E ?IJ £ 760-mer<7) * U 3*7 7 l/:t7 F £ 0.48mM ATP- 7 S (Roche) > 30mM BE F *J X (pH7.2) , 10/2g©DNA, 21.5mM BET T"^77A<5^tr^^#5:37'CT30^^^mT'5C:«>Teo^. Mfb#200/2 lC, 0.5% (W/Vol) SDS, 0.7mg/ml7° □ 7 ^ 7 “ if K (Roche) 5rtin7, 37"C T30^^^mT6C

)]/ • 7 D □ 7 :* ;i/Afii £ 1 [Hh 7 □ o 7 :* )F 5: 1[eH t o DNAOX7 / —)!/

RI##77 F77 F WcDNAOSS

M13 mpl8 RF DNA^r E ®B SSnaB ITSE$ C: L fcDNAEJt DNA®T> t* 37TT30^^^M"76 C 6(: j;0, T"(D#, 7%7-;F'7337^;FA#m 5:1®, 7 O D 7 * ;i/A$tii£llElfTo tc'lk, DNA0X7 7 — f;« DNA^tit 5:10.5/2 KD#^zK(C#^ L/:#, lOmM Tris-HCl pH7.5, 5mM MgC12, 7.5mM dithiothreitol, 4unit Klenow fragment, 0.02mM a -32P dCTP, 0.02mM dGTP, 0.02mM dATP 0.02mM dTTP7 T, 37

ImM EDTA) 5:10/2 ljtox, G25X H >7 7 ATiMJfc[ a-32P]dCTP5:P££ L 7

»6&m>y;KD^j*£f2®Lfc. To#, y;i/5:6m(D±(:m#Ty;i/%:##§T%m$ '>7#7;i/

To-TDNA(7)#ma, X77 F^^X'XOSM H*#DNA#^5:#7T#6^l/:DNA^:#5:100/2l(D#^zK(:#^ L/:#, lOmM Tris- HCl pH7.5, 5mM MgC12, 7.5mM dithiothreitol, 4unit Klenow fragment, 0.04mM dCTP, 0.04mM dGTP, 0.04mM [6-7 ^ J ^7 '>iI/]dATP (Gibco BRL) 0.04mM dTTP^T, 37*CT30# £ t dck 0 , 7° □ — 7*DNA<7) 7 ^ /to X7 / '-)l?fc$£\Z j; 0 , DNASS$£SE(2l5 jMfodNTPs^BS^fTV} , DNA5:10/2 KDTEEEE (2

-227- ## (DDNAX V 7" 'J a“'>3 > (Array IT mivro-spotting Solution) y'J LX h X *7 X M/c0

PCR(:j:^X-X'7 hDNA(D#ma, X7X MX^X/\(D@#fb MFSSISSnaB ITBE^^ LfcM13 mpl8 RF DNA?:iiiLT, X-CDiB*5ffi f©7°D h H )]/ lZ$£ ■o X'if o tz» PCR

^ £ ljUg/zzKDSS^Zj:^) j; -7 (CTEEWE 7§ A V Tz o © DNA X y T -f > 7* 7 U a — V 3 > (ArraylT mivro-spotting Solution) y'J l/^f FX 7^( FX'yX^X^y hl/c0

X —X V MDNA(D#7i6## X — >f v MDNA(DCy3#7^##^, 5 U g3>MfE; XD-XDNA^X^ v ML/hXxX KXxX(D##m&mx 2##DNACDXXX MX XX±TCD-^Efb^fTofc 0 X xX MX7 X(D#m#^, 0.2% SDSTr^T^f^® ioot'T2#fwmm@L *sib %MX#X M V X A## (18.75ml phosphate buffer saline, 0.065g Na2BH4, 6.25ml Et-OH) T ^^T5^r01l®fgM, 0.2% SDST^fMTl^^3[Bl##^#, ^ UQzKTg#Tl#fW2lBli#

AX XU XX if-X 3 XMfWl. Cy3 mmLfr?-¥y MDNA6AX XU XX if-'>3 X## (Unihyb hybridization Solution : BM##§) ^lilCDlFJ^T'llcif'nfoiiA X" (DlO U l£: XP“XDNA^X#y F^tm^I^diTL, X*0± Xj A* — XX X TSo Tc0 A X 7 U iJ "fc X h ( Hybridization Cassette : BM # §§ ) 1C A tl % /\ X U f t X /( — (Hybridization Chamber : BM#§§) £ffllvC> 6513 Tl5P#fl9 AX XU XXif—X 3 X£S £: fr x tc o AXXUXXif-Xs X^fS#(DXxX MXXX(D^#lj:, iiA-XxX$:##^&^ (DXXX H XxX£2X SSC + 0.2% SDS^T##^## A—XxX&#^>A C#iJAL/^:o 2 6 KX 2X SSC + 0.2% SDS#Tl5#fW!l@###c#K 2X SSC^T5^^1[Bl##^#^fT VA XXXMXXX&a%$#Zh. s/X±;i/(D#m Axxuxxif-xs >#(DX-Xv MDNA^6

-228- (2) DNA(Djf 07 (B) 11, RISB7 7 1st?- HtrffiV^cDNACDSli^fTO lx—>lll,

07 (A) Id tk l~oligo A (7 U 7 7 7 lx 7 7 F @2 £*i # ^ 1) £ l^TEZ^SMSfi£2ll, lx->3ll, 07 (A) IC^foligoB (^U7'77lx^7FE^J#^2)

777^-(D^m$r^'l. k->4ll, mm@##EcoRlTM lx—7M11, DNA1KX7 —1; —$r^To (c) 11 (A) ^|p|C7^c]'-7y;i/(D^A#%&^T. Ix->22:lx->4l:^Tj;7l:, fH^H#mDNA(DH-#*aM#(D @E^J&^32@(D^-U 7'7 7 1x^7- MA(kB$rm 3/cC: (h6, f flSOzf u3-77 ix^-^DNA#M-^#m$aTv^c 'b o m&mm 5'-agaggctttg aggactaaag actttttcat gaggaagttt ccattaaacg ggtaaaatac-3' nmm^2 5'-tgttttagtg tattctttcg cctctttcgt tttaggttgg tgccttcgta gtggcattac-3'

SliStl^DNAEcDlRlS M13mpl8RF DNA£ E P6SF SHinc IITSS1#M~ l fcDNAllr jf £ Vi T > 08 (A) <£7&oligo c (t U 77 7 1/^7 HE^iJS^-3) l(:^to lx —>2 11, 7 U 77 7 VirJ- F £ £P 7. & !a T'IrI o |g JH £ tk T o lx —>m!1, DNAlt < X7 —^7 —&^To k->il:^Tl:7(:, #m^a^#ll, 2*#DNA(DM-mm(D^T& -5. 3;%:, #^$H^,$M11, 08 (A) l:^fj;7(:, 2#:#DNA(D3'^%#J#T&6 C

E^J#^3 5'-ggaaacagct atgaccatga ttacgaattc gagctcggta cccggggatc ctctagagtc-3'

#^L^:DNA&70 — 7 2: U^DNA^y 7(DfF^6, /\7 7U y<-tz —ixs X: ck-5# /E 09 (A) 11, s E 7° n - 7DNAC7) Cy3SISti7 — 7 7 F DNATA77U 777-73 >L, 09 (B) 11, PCR&CZ c T##S L/:7n —7DNA(DIMM^(7)|g^fb&ff 7, Cy3##7—7 y F

-229- DNAT/WytJ XHt:-'>E >U = SE7°n-7*DNA$rH- 0$7.2kbp) £> X7^ H ^'7Xg® !:@ffif^ #, ^-(DX9 H*E^Tf#^L7cDNAA7 X&, (PCR

##&) \z z.

(3) y Z J L^SE7°D-7,DNASfrM'^, 7J1/Tk M##AX 7X K X 7 X 0 A = X 7< M X* 7 X ± <7) 7° n — 7*DNAte:> 7 hdna t.}\y ~y u x#(e. /\3 > ^fr7 6:, >/XA;i/^^^-^B(D/\3 XDch# A (7$940%#TLT^ rzo zn\t, /w xu yy XLrcDNA^m^mTm^tmiz, x^7 k,7cdna^x7 ^ FX7X^b^nfcfc&T&5

12. 3. 2 ^%##cDNA(D@#/\(D@m

§ScDNA

(i) @ ^ f b A & (^ ^ 0 M)

DNA#^0%#^^<*/c66, ^jbns^7xxcD^tcxn^o ^(Dtzvmmmvmfem&soro&tiz'Tzt, x#7 X-(Dkf>^lE^lC±TL^<^0, #%T msxizmfeLTytfvyj >x^eo^. m^x#7 h# ^20x^7 hs a sox #7 h BTH7)Vj: X^7 hS6 7>S&b(D ti20~40X # 7 h S ii. 7? & & o

(2) A7X&M7- He A 6 X ^ 7 hS^cDSV^ M2flTW&&%(DDNAA7XmX7< M^7XieDNA#^eX^7 hb, ##$: ^#TaDNA(D7%#em#SL;t. M#T^-^6(eX^7hXXX^#oT C

-233- 0-13 Gtt Silylateda —tj's? (*/R v hffJtt. t&liboO

235- 0-14 Cti;

I

0-15 G% *huyh7tfv>=l —7>f (H-f vVB#)

-236- 6 : ^IhI

Ell Stt 3D-LINK ( F — 7 y 7^&ck£/g$M) El2 G% Silanated3 — y-4 7 7 (X^y HE, E13 G% Silylated3-7^77 (X#yh##b #^^63 <) E14 C% 7^77773-7^77 E15 G% Xb77b7b773-7^77(M-77'7B^)

(3) X73#X)m3i3 (^%(DmmmaT^TiRic#^c%oT8% LW)„ X^7hb7€»mVA:j;6X^7b#(D^63#, X^vh(DM#Cj:6^63^, X^7h'7^DNA(D#^(:j;6^63#^&6. DNA 7 v 7l: j;6#ai, ##7°o h 3—j; 0 X#v7^ 77#^(Dfo]±^^W^T&6(b#^'5. & : 7E(D##^lDNA$:7 U:i —7g7Ay7T —

m767^7;i/(D#m (1) ^titiSS(7)S*^^^:El6> SE0E^^S^El7^TKt'o (2) mm&sm&ckim# #8#L Cy3, Cy5(D^-^l^^K7)A#(:i3^T H7< L, #mm^^Av^(D$r^Lyc. A'ck Ao /rE^n*tD-ti-tiCy3Tti JtU : BMNSPG500S B)]&y 4 )V& — : 7 7 b b'— JVyt^ SPCF640 77 b b' — )VJt¥ SPCF550 5% #ai7^;i/7— : 77bb —JI/76# SPCF630 % Mt 7 M589/^7 MAX Cy5Tte ytM : TSRH180P lg7A^“ : 7*7 klf-J^ HM 77b H —;i/%# SPCF650 3% ##7/ Jl/f- : 77b ti — )Vyt¥ HM 77b H —;>76# SPCF750 Kenko R68 2% T&o^bo Cy3, Cy5H#^^# 7 X 77 "7 7 MX)^±# <

-237 T, #i:4'BI0 j; -eCT, ISy7h b° □—;i/MfeST'Ex=533.5nm, Em=6O6.Onm0 WtiG l -3 fc„

(3) Xl7h LfcDNA^EBS : 200ng/X-f X u U y h )]/ X^y hDNA^El : X#y HE : 0.28mm (Xtfy ^114; DMJ) X#7Mi: 0.282/4 X3.14=0.06mm2 X#y : 0.014 (/WX t°- F0X XB&S4: OSS) X# V h#:# : 8.5X10^< X D U y h 91/ (## : X#yX — 00XD X16tilXiO"3X -f 7 □ 'J 7 Hi/) X^7 h U/cDNA* : 8.5 X10 < X 200ng=170pg^# o T W &

6S &aw,fl low High @19

Cy3 & »9 2# 24,642 42,496 17,854 26,781 Cy5 & *9 5# 23,065 32,972 9,907 5,944

g#^#m^^3#Tio,ooo^mm±T&6. ^m#t@L^6$Cy3(C^^TH, BH£_L[HlSM£MT£fco U h , #0A$ Cy5£43^Tte, BS043<£JE-6O% X U 7 — ^ & o T Vi 5 „ 7l£M0S0tliJn^7,£MM0 4: 0, $ 6 ^4{e^<^0SSio]±^B)A^^0T\ B #^X U TT^Bm^&oTV^. 3##m0#m#^mmxr#^c6T, /\

$ 6 (^41^14^/4 ^#0 6# 6^ ? ) 0[n]±0/cJ5t:tt, A'y XX"7> HX

i2. 4 gimam

1) Lindsley, J. E. & Cox, M. M. Dissociation pathway for recA nucleoprotein filaments formed on linear duplex DNA. J. Mol. Biol. 205, 695-711 (1989). 2) Umlauf, S. W., Cox, M. M. & Inman R. B. Triple-helical DNA pairing intermediates formed by recA protein. J. Biol. Chem. 265, 16898-16912 (1990). 3) Rao, B. J., Dutreix, M. & Radding, C. M. Stable three-stranded DNA made by Rec A protein. Proc. Natl Acad. Sci. USA 88, 2984-8 (1991). 4) Muller, B., Burdett, I. & West, S. C. Unusual stability of recombination intermediates made by

-240- Escherichia coli RecA protein. EMBO J. 7, 2685-2693 (1992). 5) Bianco, P. R. & Weinstock, G. M. Interaction of the RecA protein of Escherichia coli with single-stranded oligodeoxyribonucleotides. Nucleic Acids Res. 24, 4933-4939 (1996). 6) Golub, E. I., Glazer, P. M., Ward, D. C. & Radding, C. M. RecA protein-mediated irreversible fixation of an oligodeoxyribonucleotide to specific site in DMA. Mutat. Res. 351, 117-124 (1996).

-241- $13* aacDNAo#%##cMf

13. 1 #8*

a. k NcDNA©**WWS&l»JfflCM'r«W^IH58T*S. IftWCttA' r$DNAE^0i(D^@^^Sy/^f#T, y§W3ftjft©S*cDNA©*$WW$#Jffl®ffl^W

^r^DNAE^mti. 1996¥^£ DitbTk MKcDNA^ ^7'J - j; ^mcDNA^n->o^m$:ev^ #em:#Hooo##(D8;%L ao#m (5kbJ£A_h) yn — -en (7)^^1000 (1998^12^40^^1172^ □“>, ^ |f-^J6MbSS^) hhmM^lL :tmiat^TM*SE^J ^&/££nfck h5t£;gcDNA£Bi ft&^ C7) it * te50% ifi < Ulta.

(*6f) 6(7)/? o-xD^^6#$j:M^Tac a^m%o^mmmT&a. *yo^%^hT^##cDNA(D*faf##, ^(C#6a^0^A##Om^f $DNAE^E#mcDNA7<7^ U “£

U r^[:^itTi^ta0

EST : Expressed Sequence Tag ftp : file transfer protocol GDB : Genome DataBank NCBI : National Center for Biotechnology Information MIT : Massachusetts Institute of Technology OMIM : Online Mendelian Inheritance in Man PDB : Protein Data Bank PIR : Protein Information Resource RH : Radiation Hybrid RHDB : Radiation Hybrid DataBase STS : Sequence Tagged Site SNPs : Single Nucleocid Polimolphism

-242- 13. 2 aGfmaoaaff'fz

13. 2. 1 d) m^Mm^x-x^-x### 0MIM(1), GenBank(2)> Swiss-PROT, PDB> RHDBft E CD X > X — X "j N ±T'^^ $ ft jtfc'lf $g£ MX XLION##(DX\X XX X X X X x A T $> 5 bioSCOUT ^0yXfA!t SGI## 07-7Xt“'>3 X 3 X H x - X Origin200± T M £ tiXc o

(2) NCBI> XX#—ftXX—> MIT&^T&M^ftTlDSfc hXV A6fi?iJ1t$8£LinuxXX xA±(DX 7X;i/^m7XU X-Xa >ncftp$;m^T@mm#L^. fftSOXV AK D #3 £ J; 3 KX bioSCOUT(C##iA/)Oft6#^(D# XX 7X ff VX bioSCOUTCDx — X ^ — X yXf AC|pftbf;o

(3) v UniGene& tXGeneMapCD 1W16 <§: SNCBKDftpft— /X (ftp://ncbi.nlm.nih.gov ) ^ 6 7 7 "f /Mfcit77°U X-'>3 >ncftp£mvTML#0 7 vX###mmPerlXX U X TUniGene(D##^6 7'y @E^J##m#PerlXX U X GeneMap (D'X — #—XO#^)(DBLAST#^MEMX—X^\ —X $:## L#o UniGene tGeneMap(D X'yX##^#a'{bL#o'Xv7°##X—X^\ —X^##L/Co

(4) In Silico^6#:X y t°>^yXrA

wx, (3)T##L#BLAST#mmE^Jx-x<-x\zml

U mmm(Dm&fo’T'y tfXX'MXDPerlXX U X M^fX^L/c.

13. 2. 2 i) m^##m^x-x<-xcD## bioSCOUftft X XX —$ V h±(DftpX —2 flTW &##&%# L, ^(D^#

(:^6#Tm^(:X-X^-X#X6#m&#XT^6o GenBank^ (D # * E f'J X - X tix SBr-f U-7'yXr - M E: bTx-X£ ftTft D , Swiss-PROT& £ blltb^o bioSCOUT^, ftpft-A'StC j:3 ft-A#(Dx-X(DM#mmc cn ftftMl~5@

-243- — # T (D Human Chromosome , k / A BB -£lJ 1W $z > ^’y'/ISISf HbioSCOUT(DS*1itk^iStD L/t & (7)T&50 bioSCOUTT %# b ^ , ^SBB^J (D## A L T HGenBank, EMBL#^$> 0 , X ^ U — 7 '7 7° x — h tLX EMBLNEW > GenBankNEW# ^ 3 7b. 4; 3 CUT, SBS nr Wc=

TREMBL, GENPEP#0x — X 0 ft 2 llT $g £ $ H 3 > h° a. — X Mffi £ 3# PROSITE J^> BLOCKS# EST, STSCD^^g &3X V £?t, UniGene& (Hoeiem#6ck9(:^oTv^, b OMIMb&B£tlT^fZo £fz, ®SSiL^^iix — ^^“XPDB#t)X — XA — X Cft b h 7 / ABB7J##, v V 7°'M# TbioSCOUT0 7’—X ^< —X t:-7c#@Ta c: j:3

2) bioSCOUTlC 4; & BB7J## bioSCOUTteX:# < ^Pj"CA30##g^#L, — oteBByjy —X £St;:y-y ^-X (:

=fo^^o^T-^^-xic^ur^-v- Ffemuziz'ftommr&Zo u

#m§^cDNA3 > V — A 7 A PN T E ^ £ ft /b BB7J 7 — X 43f@ 5: WCbioSCOUTTfil#T %'ft-itzo bioSCOUTOBB^iJStfrtc hTx-r v X t — F ^-77f^ Xt — ^--hyy^vXt-MTffo/c. bioSCOUTOXV U — £01& 4;r/02lC^f o ^ y — X ^ — X <7) Human Chromosome ICSS^cftlTV^V'y 7° 1W $8 0 7 — X Kl b 'y h ^07'yX##A^$a, In Silico^A#:7'yb>X^fT^6CA^^^3 ft o fcfcb =G(DT$)o ^Co feSH©-7> bioSCOUTO^fg^In Silico^fe^X y fcf >7 0fctf)

In Silico^fi#:y y b° > X 3 tztf)\Z, In Silico^fe#: 7 y If > 7 X X 7 A ^04:37^## 7^6 0##^^ 04; 3 (:7-X<-X{b7^)0^&W^0##^, bioSCOUT

&6#7ytf b H(Dm^#^my-y<-xT&^oMiM(D^m(i^^#^v^ ^0y-x^-x^e> 0##0 9iemw:km&#x> K###{e#jT&ac c0##^#3/:7-x<-xa LTHUniGene^$>6. ibibb UniGeneC## $ tlT 0 ^ 7- X 0 P*q , M## o GeneMapte. 7y 7°##b LT07-X<-XT$) 0 . ±T07-x^7y x##^#-x6 7)b pyr, 8U07-x<-x^6m#7^^^^& 3 tz. o UniGene

-244- *1 aerngmaf-f

Indexing Data Bank EntryOfi# Group Date

EMBL 3952878 16-Sep- 1999 Sequence EMBLNEW 486931 16-Sep-1999 Sequence GENBANK 4028171 16-Sep-1999 Sequence GENBANKNEW 348411 16-Sep-1999 Sequence SWISSPROT 80000 29-Jul-1999 Sequence SWISSNEW 2012 28-Jul-1999 Sequence PIR 122810 16-Sep-1 999 Sequence SPTREMBL 201082 29-Jul-1999 Sequence REMTREMBL 46785 29-Jul-1999 Sequence TREMBLNEW 49087 28-Jul-1999 Sequence GENPEPT 412246 16-Sep-1999 Sequence GENPEPTNEW 12768 16-Sep- 1999 Sequence TREMBL 411045 28-Jul-1999 Sequence SPTREMBLNEW 57302 29-Jul-1999 Sequence PROSITE 1374 18-Sep-1999 SeqRelated PROSITEDOC 1035 18-Sep-1999 SeqRelated BLOCKS 4034 16-Sep-1999 SeqRelated DOMO 8877 16-Sep-1999 SeqRelated PFAM 41681 18-Sep-1999 SeqRelated PFAMA 1664 18-Sep-1999 SeqRelated PFAMB 40017 18-Sep-1999 SeqRelated SWISSPFAM 206718 18-Sep-1999 SeqRelated PFAMHMM 1664 18-Sep-1999 SeqRelated PFAMSEED 1664 18-Sep-1999 SeqRelated ENZYME 3704 18-Sep-1999 SeqRelated DBEST 2523230 30-Jul-1999 SeqRelated DBESTNEW 244296 30-Jul-1999 SeqRelated UNIGENE 83148 18-Sep-1999 SeqRelated RNUNIGENE 27138 18-Sep-1999 SeqRelated MMUNIGENE 25423 18-Sep-1999 SeqRelated UNISEQ 76468 16-Sep-1999 SeqRelated RNUNISEQ 26494 16-Sep-1999 SeqRelated

-245 #1 aemamaT-f (#)

Indexing Data Bank Entry#)## Group Date

MMUNISEQ 23765 16-Sep-1999 SeqRelated UNIEST 83148 18-Sep-1999 SeqRelated RNUNIEST 27138 18-Sep-1999 SeqRelated MMUNIEST 25423 18-Sep-1999 SeqRelated UNILIB 948 18-Sep-1999 SeqRelated MMUNILIB 263 18-Sep-1999 SeqRelated RNUNILIB 108 18-Sep-1999 SeqRelated NRDB 390775 15-Sep-1999 SeqRelated NREST 2739207 15-Sep-1999 SeqRelated NRNEE 1465467 15-Sep-1999 SeqRelated NRDBUP 67 29-Jul-1999 SeqRelated NRESTUP 194 29-JUI-1999 SeqRelated NRNEEUP 23 29-JUI-1999 SeqRelated FASTAGEN 104137 18-Sep-1999 SeqRelated PDB 10253 06-Jul-1999 ProteinSDStruct PDBFINDER 10688 14-Sep-1999 ProteinSDStruct RHDB 105173 28-Jul-1999 Mapping RHEXP 158 28-Jul-1999 Mapping RHMAP 23 18-Sep-1999 Mapping RHPANEL 13 28-Jul-1999 Mapping OMIM 11155 18-Sep-1999 Mutations PATHWAY 3057 18-Sep-1999 Metabolic Pathways LENZYME 3705 18-Sep-1999 Metabolic Pathways LCOMPOUND 5635 18-Sep-1999 Metabolic Pathways REBASE 3662 18-Sep-1999 Others SRSFAQ 10 16-Sep-1999 Others Human_Chromosome 12327 16-Sep-1999 Sequence

-246- BE 2 UniGene

UniGene GeneMap

V "j -j\fiOSI — nfl (0 y (#HSM*|j!0) E^nia & u % L ■etoteoma smatena /<£ U >77?

kw«t omim #>&£*■ ait® #»&**.

UniGene, GeneMap&

• V“7 tf>^ blast 16v ^bvyyiwa^tiiii. ea

@3 In Silico&6#7ytf>f(D#B ;$1) GeneMap(DV'yyiWWlCli, ■*wrv5/^1«gO^|E*-&nTl'«. (K9J1W ;±2) ttins^@iwwttomxnkmmmz££#>£*& mmn

-248- 3) In Silicony tf > ^ bioSCOUTT£>&f*£B£x, 03(:^T#j@{:%VX $£B§ WcDNAH > V - '> 7 A T §1 fg-0, WT. In Silico^feftTy tf>^£fr ofc.

^•0^^?:^3H7KTo &£u ^-0(7)^T'$DNAW^0r0X-^«ESTtf 0 , & ^7 □->(DW^cDEfrE^J'IWW(DET^^40005:E LTV^^\ ti2000T^^) = UniGene^{t(7)i##{Cck 6ln Silicony if > lfRlii 0 ^7 D — > {C ”2 0 T 0 y K0ft@£#£T5 Z. {colttint's :: {COltT{1^6#:##-{!##(:C

^ GeneMap(D1W# (REV y 7°) 5 <> ^n^»-ooiw#5:E'S-{bLztE^vvy##y-y^-xemi^6 ctiao, R±0^a —>{COWr> #@%{&# (RHV y •/) (D#^aa F0f6B Mt${£B^#ST£fct>0 0^< \t, #O4o^z:O0#6#:/i

^Vy j; D77 tf UniGene^m

;C0#Ftm, Ay^^l@^m^6 j;4 {c^oT43 C, t;1^fasta^0x-9 £ffi Vt TS&Or-^^-SUjUlTtSo #13 0^^0^-^ ^40{C^{tT^tl4ftl##& 'n-Dfztz-z, i2i$raefl^£T0fh#tfite7Lfc.

13. 2. 4 ^^#3 ckZ/4%0M%m@

E^J##&m^T3 j; (DmA#:±(D^m^#%T6ln Silico m&fo'Ty H>y0S*$&¥&MiLTe/fo 4*00^ y tf > yfiOMIM^JB 0 TSIitOHUtSi^SOt^iCiM/O K1*«£*§£fc«>0*7y My>?~Tzh 0 , C07y hf>^«RHV'y tf>^ {lit ^m^t)OTa5^>o £0 £ 0E^f£ £&gE*fZ> & #> \Z \t£ D*B*EtiVj:V y tf y^m=f x£RHTy tf > 7 &ft o Z> E 4"? y {1, GeneMap0RHVy yitlg S 4 £ ttT& 0 > 4{:^L^a^0RHTy z&MmzMizmmtvmmmf&ifofc® izit, xffiffiwz^zmmizM^z&grf&Zo m#^^#^0^(c%i5o0ym##^ emu, vy y’mm&tkgtti'tftMzif'Dfco rh-?? hfy'fztirzmfc^tomm&z IHitfc^©g iMfcPerlX^ )J yh^^Pyi^ h M£Uu {Cf£$ b fc fe 0£*£> £ 7^\ o^in smcornA^Ty if ^ 'J-/h$:ffl^T®itt^iT'f:{j:5oT{^ci:^o 4#, d 0^®j£03 £ t b \Z, V y y##^ j; 0 ^#^i±6ca{:ck0, Ty t° >y £tt5SJ4£M5S£ B

&actabtia, Kittit, GDB^0y-y £jrsl, #&, 375 ^Ex^-r-y ^#Zto ^nb®f-^(D^^e>l0[|^f“^ (-O0%& #=/^> ^0) fc{t£&4iti U UE t:1^7 y 7°tf £Sx-y ^-x izmm £

-249- 83 In Si I ico$fefe<*7y £>if

Heil ix9f mm DNAffi

1st 2nd 4th 1st 2nd 4th 1st 2nd 4th 895 538 1523 296 590 870 4000 7 56

UniGenelCckSHtfr /t> KttEOKS 55 25 87 24 22 76 843 0 1 396 217 518 118 234 346 745 4 16 GeneNaplC 513 276 793 171 323 582 2105 4 4

A> Kt6H<0#£ 323 151 422 87 183 314 1507 2 3 315 138 491 93 162 291 677 2 1 72 46 101 27 48 63 151 1 1

»6ff/<>Kro

as) fr1*$DNAE<01 stl*ESTlt#T?tey, — >»

4 asvysuico$&#?y e>yrotss

-250 ;hT^5 £ft> GDBi: Gene Card ft £\ fife 0 ^lllf $8 tS X-ft ^ - X t> & & 0 T, t) tfcftKUptTS tlfctis. t (D^l^< #&L. Whitehead© 1W$ ft UniGene©x-ftkLTkL ft7Xft##^&(9, ^ — A'—7 v y-f BA©® ©ESTE^T'I^ 7 v 7°##x -ft^-xtcmm-ftaxot, ft7xft&3:&X“ ft it$J34MB©X — ft ^ -X CMf ^BLAST##©1000feiftX©m##^^#Lft. X 0 A # ft ^ © x~ft kfta k#Xft###fm&gT5 cDNAE7'J©7 kftft^ffft ft&ft(:HcDNA(:m^%6^l6T$,5 ft 6 C5©##ft^^^#m L^© ft ABB^ij##^ #%^?#jT^#m^bioscouT(:#mL, cDNAcm&msfi

In Silico^6#:X'y kftft^X 0#$A<, £ D *1 S & < It ft ft & © XAk bTfTo TlA&ftft bioSCOUTft #7C^ < ©Etg££6#)> 2 6 (:#% Lft# SgS^&T&*ftSfflft ##X6*15, In Silico#6#:Xv kftft^ff VA OMIM^ft:## #ftxcxo, ^mk(D#mft^^$aftm6xc3^Tca6(D#m^m^T##-ft6 kk^ic^(Dm6xoMm(:#%T6#Tftftm{axt)^^i:###WT6ck(cxo, x DSH^sssffiBjt^ftaiKXosm^fTx.^xft tcft^>^(7)kSbn^)o

4"#^, In Silico^fi&X y H ft ft*©*SS|n]± £ E 5 k k & Cft t^T£DNAE2 l0T ft*X — 7°X##^Xl6 ft O —ft ^P#X4C#l%%cDNA7 ftV — ftTftpNOft D — ftft^ 68SS

13. 3

13. 3. 1

(1) mmtm -f ft 7t-AH7ftXft h^#6^ixv^ x#^$#6 5^^20thft ft ADNA(D#ai(C^ Wfto

(2) Ar£ AEMX DPhenol&ClX ft XV ADNA^ltiffiU ®ftS£SJ/£l£> 0.7%T #U-X ftJK'l!M5:iIlfto Xft, — gK©DNA+h ft 7ftl/ £ AffymetrixtizSlGeneChip HuSNP Mapping Assay yXr^5:ffl^T|] 1500@#©SNPs## ^ L ft o

-251- is. 3. 2 xnesi5=ttxw#

1) DNAt)->y;V©g@E srttBL. y-yeydSifessL/t (*s). *c, -f ^T©SM-y->7';Mao.7%©7y □ ■oit. -e©$s$, ttmuerox x i/7--t'co:?.y-#^#ia WfcJSKrt C JiSWBr&i'y / Adna^.®SL © 7'X --7 tB £ tift*' o 7c (04). $fc. tttti L/t-y y A DNAiS *«©**©»*>)' 6 *>*>/■(£»& £*©iBA(i Ht Art' B©e.nZj:^o/t (05). m±©ctj:0. tt#ti-S)*©aSIB|EA^©W«®iliE*E i'6lUilfcy y ADNAli, 4-S©SKT(5Effl75Ca6fcoT#cr=1@tt/j;V> AWHrU fc. L *■> L 7j:*1 6 , DNA-y-AT")!/®* • C ttfiHM11 &•&©#*&BSI1 £SS L T * 0. 4-«. +«-assatoAttnti»e>)5:n.

2) SNPsSff AffymetrixttGeneChip HuSNP Mapping Assaytt06C5tf A"t < 105ng® V J ADNA7)1& a«mMMTmi500#M©SNPs*1##nZm-e& 0 , 4-0. aE5S®KE (Al) ©y y A DNA&##f LZc. -e©^*. 85.9%©SNPs»1StfrPJ6ET6oyE;. ^0M Vi fcHuSNP array T22*6T©%%6* AX%6*±©SNPs (#6) it'WtrT#, 13##6* ©**© — * &*7C*-r. -e-(6. ftifiBictMiSft)', 7^r x a-y-xx'f

13. 3. 3 S8i6;l5=ktf-%«©BtS£8B

#e#^.#©###&©-o-e&&mmisiE###&e? a >x*-a h 3 At A h©#BtlTV^5)5S*20^#:IC3Viryy ADNASrittttiL, — SEtotP > 7,)HC ft iATSNPs(@#r£**LAc. 4-m. fta«0)aic + ^ass&n. 7< X ayyx-f s

13. 4 SlffliK

(1) Online Mendelian Inheritance in Man, OMIM(TH). (1997) Center for Medical Genetics, Johns Hopkins University (Baltimore, MD) and National Center for Biotechnology Information, National Library of Medicine (Bethesda, MD). (2) Benson, D. A., el. al. (1998) GenBank. Nucleic Acids Res. 26, 1-7. (3) Gamier, J., et. al. (1978) Analysys of the accuracy and implications of simple methods for predicting the secondary structure of globular proteins. J. Mol. Bio. 120, 97-120

-252- 3E5 2 0*9->y;i/(DDNA*S

No. 14SIJ $M

A2 F 0.448

A3 F 0. 358 A4 M 0. 356

J1 M 0. 329 J 2 F 0. 308

J 3 F 0. 332 J4 U 0. 341

LI M 0. 295

L2 F 0.406

L3 M 0. 323 L4 F 0.412

PI M 0. 380

P2 F 0. 309

P3 F 0. 334 P4 F 0. 321

*1 M 0. 257

12 F 0. 308

13 M 0. 327 14 M 0.438 i =y: 2=# 3, 4

B H3

A1 A2 A3 A4

04 yv AD NACOSY® iflLifWtll^j; OPhenol^^lljUT#,T,L/:y V ADNA(200ng/lanef) 7iJv~Z \ *.-- > ('4c^ A: A1 = 3£, A2 = f#, A3 • A4 = WM11 iJlltixf) —253 A1 A3

Print a ♦ * * Exit Z*en ZooeOut Tract Function Auteacala Annotate print Print at** Exit Zoo* ZooaOut Tract Function flutoscale annotate Print Functions: Scan Function*: Scan

260.0 0.1358 pt 260.0 0.1160 pt 280.0 0.0798 pt 280.0 0.0713 pt 320.0 -0.0027 pt 320.0 -0.0032 pt

260/280=1.68 260/280=1.60

Havelwith mi.e vavetanoth

260.0 0.1458 Pt 260.0 0.1159 pt 280.0 0.0894 pt 280.0 0.0659 pt 320.0 - 0.0036 pt 320.0 -0.0027 pt

IMS) 260/280=1.61 260/280=1.73

0.448 lag/At I 0.356 n g//al

238.1 vavoimeti

05 m#tA\Z'D\/'>T

genomic DNA 105ng

4 Multiplex PCR

primer : 5 ’ ICT3. nWMifittmtStlfc. 6- SNP %Qt&tZ>tz£>

1000 (SNP-speci f ic primer (D&^2T(fa) 4 Labeling PCR

primer : 5’ Id biotin T3. T7 primer template : Mutiplex PCR Product 1000 ‘fSSIRU/tt)

Microcon-10 (Mi 11 ipore). Igenome 25pi I x 21 Poo I = 525^1 £ 60^,1^ 4 hybridization for Array

hybridization Idli 30^1 0 concentrated labeled DNA 4 44*C. 40-50 rpitu ~16hr Wash. Stain Fluidics station 4 washing 30m in xbuyh7b #v>-7>fnxuxutz*?-xbfitxhuyh 71:v> jnft 4 Scanning

4 lArray ~8min W

•^primer Pool Z£SNP & LTjg#rT£Zz SNP ®L¥*] 5£

136 Gene Chip SNPs W5\7 - h HuSNP Array : Array ±ld 1494 SNP(22 -&±T0%%&.fr£ tztz{s$m\ZMtt'QZ&0\t:t:0olE>0 80-90% (1200~1344SNP)

-255- 6 HuSNPs Array ±(7) SNPs 0#$ 7 13 A1 (%g) (D SNPs

Chromosome HuSNP Array Label Call Chromosome EstGenDist 1 110 WIAF-1057 AB 13 2 85 WIAF-1318 A 13 72.16 WIAF-1353 AB 13 48.29 3 87 WIAF-1544 AB 13 48.29 4 61 WIAF-1858 A 13 5 40 WIAF-2062 A 13 88.26 6 74 WIAF-2138 A 13 42.37 7 64 WIAF-2207 AB 13 8 89 WIAF-2385 A 13 9 60 WIAF-2555 B 13 10 53 WIAF-2706 B 13 11 71 WIAF-2713 AB 13 12 46 WIAF-2754 A 13 13 30 WIAF-2781 AB 13 37 14 43 WIAF-2866 A 13 11.88 WIAF-3070 B 13 50.33 15 45 WIAF-3084 No Signal 13 71.43 16 40 WIAF-3268 A 13 17 53 WIAF-3340 A 13 18 24 WIAF-3367 B 13 19 37 WIAF-3384 B 13 69.68 20 22 WIAF-3386 No Signal 13 21 21 WIAF-3543 No Signal 13 22 23 WIAF-3594 A 13 11 X 13 WIAF-3756 A 13 not-mapped 303 WIAF-4575 A 13 total 1494 WIAF-505 B 13 WIAF-648 AB 13 WIAF-911 B 13 WIAF-973 No Signal 13 13.38

-256- *14* ******

14. 1 ***** [ft********]

SSlBftiG* ****** (*•><'^-7) ¥1*11^5^180 (ft^/FI/IX h5JK, #in#60«

/H XHaiCiig7!>1M5-LX$¥T*S»b ^@3*#IEro*7"D 7x 7 hX»:25t6H tv>7B«^*gojsnfe. %m®##i:#Rx#& a wc, »w SMSEtcfcW-S-L, **W®t;*rt'LX]StfcL. 2. *ft*¥E#¥E5E0f 30 X ffi IS X *> 5. -*. $a*iceEsti-E.toio7i7D->@t hcdna , rtti^thoiefii, a SKs.OOOri^ESttXLSCy^ft®, xn$SEU7bHb®S*li@:3KXt)#SC36 ioW54t, 7,n7x7 hi: LX® 9iEA/TU< ®ta-e©->-X7»t&ofcB*75:t:t'T$. So *FL7d 7x7 XlttSBXtfttSnfccDNAWx 7 &—B* 4*W5'J-KLXH 7 77A#gl:7UX®*m KPfli 3>V—>Ti <7; ®7@Wf

7X>h (ifSffi) : »Bltn>7-77AICJ:SM^. -#&E1iaW:#&b*x£. Kf93 : SEeiu(2*fmLX WccDNA7)lfl.7»^Zx#*, i‘©J;5CS)fctli®4'. @S3 : toSWSSdcXSntf, #B1##&*A. #*AfSoim*®#S7 o->®# iSML«UXttttS5. 37 > h (IWBK) : #$ICot<#x.6nXLSo aetoC#$LXie/t®„ 3>V-

->y att@a®£»x-&<, se®fc*xs>s. Meta-enicia AXSi£#X6S. 5. *aS$Sti-H®«tB [BSUSS] E#tffy-A4,oooe®tt*SB?iJft5E. ####®70-&EM. 2kb®±®7c*#cDNAt$ h 7 >X#'/>6X, ■7n«Tti77777y-¥>7ttT E?iJ£St6X6.

-257- [**«*] 7" 7 ^ "7 ft * — * > if&hzi 0 200ft n — >/fl ©A—XTSSBB^JSri&S L , 8 WjS ffV7 hCitJSSJflDMr. E#fN&Rft. [?=%] 3kbKT©aSX D — ftSrft'x'I'Aft* —*>/?£, ■^-tlHAOftJIft □ — > €• 7 > ft' A@TBB^*%-f&. 1000K-t£BtBf. [3#1*«]

MT-^x-x^si'tse*. #a - E#y-ftA-x&##L, e^e^-a ® ###*, Kffll ($W5te£) : Genbankt$ffi©IC< ©7/ Tttfflf -5SB-5 fr. ®«1 : yy AE^^ScDNAEM&#mf [3--T7 9 ] ATAC (Adaptor TAgged Competitive) -PCRi£ © 1% tifl Q S656S*a:©#|B|E%T%a*#SJtK->xyA#® L«$E66. io8H»FLyn-xxx hE##y;i/-7'4,oooxo->©*a##&%mf ny> h : (®SiZ*BK) £;:©K$»a Ea&h h©^K. X D->©atf#T*iSgS7^fc>7T< 3©T", [0*] 7^xwy-A^5,ooo4>2,5oo©BB?0ftstif««asaa. [HS+Mx/x] + 0 60/8 ¥i9T500X O —>##?%. [Hi* IS] 8aTSfc&8*&SALJ$ffiB#rifl£@$gU, 60/8 Wesooft D->*lrfS. [SBffi] l,500©S$SB5llft$©ti!, ^ U 7 X 2Sff-A©cDNAftSa#©JiDNAV't' 7P71/-f MRS:®®"#" 5. Kttlgft'x'f x-6-jftE (l96*/40*]-) T107b*c:&t"PCR#R »l^-M8t5yXfA, XD->gB*-tliaxt.5TfflS©6toSSeSAS. »© SfttC&o tc7 u — -iftsat?.. ny > h : (A.1J xftXE/EMK) fS#rttai:©A7>XAis>ffi-H^H> ■ 3 XtX] 75'fSE© t hSEcDNA2,000^ S 120£S9J L, E*BB^&*%f 6. $7c, mmm DNA'X'f 7D7K®8IS#tlT, -5.

3X> h : 5kbH±©*H$g8iJUT@:L©„

[#-#*] ga#cDNA7<**57 7icftw-f 370*20

-258- [a>J y?xm] U 7' + -V V LtiST] X □ — >*' tL, T#V7 h$*iJfflUS 6 IC^ 0 &»6,040&mR*.

USSS*#e) ¥fiEi2^3fl23B %-;!/), SimSiSP^

MH91 : E?iJi*S £> 0 *M2 : grSE^JdS^Sft^lAdtt&ifAA 0§2 : SftRNA^yy AdfiA^&D, ktt#igT$E?iJ k L TiSiRSttT k $ 7 7kid 1C,

s^RNA^yyAdSAdSiis-Titfeok *r=l3 : E?iJffi#£±(f-5 7kiddlA«:kT^-5;(p„ B§3 : 7I/-A-/7 HX7-7)SA-SC x 6ff y

Rffll : r%^gcDNAE^*%&kK#ef ##d#m?gi|^xyAj ttd7ttffli>Jig;K BSl : 3fl*TfeSJffl1iJ„ *192 : C®yXfAtta«t?.i'. E§2 : knicMkT*f£*&kfc-&»TA'-f «&i#KLkiT'St!i5. [A#K%] *r-1l : U (E5flftSS#dM1S®) ttk'd@*7K ESI : S^totcliA^'f •7-7*-*>yzS;dT26eS7k*7iik k "^gene-specific^ 7 74 7 — {itRd k 0 tCtJ3'^'4fp 7?$> -5 7 o l^=.m *»ic$"dSEI9. [1C-T7?] Mffll : (A 1J -7 7 XE/88HK) RNA-^cell line&##tC#fR kTd&*fd y h NO^ #$ Mft-Stl^dlSgSkd£3 IctogkTdSdriA @$1 : *&J* 6 D y kN07)fg%&@@iildRNA TZ/k d7?|B| k P y hNOd&d£k:< d A/8F#LT-ktl$ti'EVTS$6#U7k. *W2 : (»vf £DNAR/'MW5te£) 7 > h □ -;f-dlo : 3 : 1+ -v 'J Ak — 7 a >* — 7 e> ti- n tc * tt e 7 * m k r n 3 0 E@2 : ft ddTti-T LT [0ut'fX>XyXfAX] *Ml : (iiSW/ftEK) 377

-259- 1Ate. ^LT«M&x-XM:377T® DitlX/*#K) 3700tt^¥m9ilceAL. SKrafift — X ^ # S # g4 L X © S. Kffl2 : (/W 4-E^Ztta'JK) MUX 500Xn->X36 a©»J9T-MKXn->»:aiT< 5tLt, X tr- K£#x/i£ e©*fJS££© ct 5 tc-rs©*'. @S2•• E#t57D->tsagaj;o&s-< se, |B|B#tC«SEX □->©*#? fci*©£tAi#XS. 3X> h : (WMS) MBEX St©X. **&*>©%&3 tt-f.

[BMtt] XRtcjcOmm. [Bit] Kffli: (KSF5fc£) *t0;t7f*afeTt5J:5i:l/TUUi. @Sl : $ C©ST©S:ftE?illcy LTS:T©x?xff xYiftffH, s#f^$*e,a»®E?iJs^sf SEoT, 6T©AAEm:#f a5 C t KW2 : -Xy tr>XCti, £ ©if Jf 1f « !C*t t T £ © -> X toTI'SK 0^2 : Sftttff 7TI>41>. XVAf-?5©J;t)BaaT-X5®?)Aft, rt'Ox- XMf #&##L^*l6vX7A&##LTH<^#Xil$ 0 , 4"#©##. Rffl3 : GenBank©* Ct>x —XUS^Jf *1#H©T. GenBankSrttffl L T V> < ^Cti, T — X©X U —=>X'##AiW'. 0#3 : ^©ej30fc*e#x5„ *ra4 : (/W ThES/ilSJK) *f-X^-X©4-#©Bm&a&m«Hx*x&A^

—*, a©*AX##L/ht©X&S^G, &#©#j#©^:©l:&ra s. EZ.X, t'©J:xt»IWtT^<*'tt, A.IJ y xxw, *jf*Mft, NEDOjStffi SSI'S. 3x> h (NEDO//WHS): a©K^i?SB6LZc=b©7S»6, ara^Bi'j. 4>•)* cra LTtiSffiSS. [ss®] [0i£V7hXi7x>-:;7iJ>X] Krai : *:/n 5/x X M?tt. 6S £j#x£DNASffioTtAS;5X ft £ $#x fc*0r tt£ O fcA'o 0Sl : A4 X'USS^#^K*mr, S£$#lilxTiA/j:iA7 W J: 0 t>, i4-*ft~FT 6V>ie*7)5aiTHS. *ra2 : 3« g ©DNATfltt©7 M- #i J$ =J IE/»'. 0#2 : 3w gTl,OOOX#-y hftCltffS. Kras : GITESE ITU SB iftt©* It'd!->*«*]#;£ L/j:^o£©;»'. 0§3 : l,000*©*mi:#xSS7y 7&MTSTT. */SE%l $> S A > 41 v ©

-260- [Tl'y/OXtXff*S] RMl : (##%&) !:#«##% tc#-5DMA? t). = *SS b£ fc J: Hr fc * S iti * © m A tt ol I6 o ®gl : tffWS^M^fcDNACX'tf [$-#*] *01 : F«#C ±J7)i tf 7F3e#Tf2*tA7)'. RH7 y 7#* s:to ic*x.rn-5*x 0^1 : S$6#A"> h*ro S-Xf ttXfxftLT RHTy tf >^*£®!|Sia»Vy tf>^^*^ST*a6 0 . 3C#ttif $ fclSt * 5 t U IL.Tl'5. ##7 y 7f1t#x-5'^-XC(jRH'7y 7© tf # 4 £ ttT V' 5 „ EC, -@TiJ*5iiiW^OX7 yfflfao&itfib'bff 9TH5.

[8958*#]

$i®895sa# Vfi8n^4fl6B n-ftfx y n^-M*a«®f?S«^©^C, fB**li*gSSIfttt5lf7«4' 6*5 rtSiSI^J 6S*U, *yni?x^7 S®84¥6<)#Se05, ©$-0 *7'nvx^ hro$*|Ha©BE.&y:#®»S j: UTttWMeftSrBW £ b It 3 > V - -> 7 A IS it $ it* 5 £ £ l: 31>T#* L &„

WSiMA (*tt*¥Ef4¥E%9r WtSg) ««$$ (ttSSttAU y ^XE%9r BS)

/J'S HSc SdnaE%0t SB) a ati (BftsssistiMisa Ee#%*) (#S:=aBitSff0f -fe>^-fi) HW#BJ «S^tt7yy>-3XtXS(?ES

asm* (A'f tr? s bb) m2@i$958il# f All#6H30B (KAW *«#). ^ o->^E$^5%m*E^*s^M#tiefim#KK(D#e6^#(D?sc3UTmL

*C"DV>TI**lLfc.

$3089588# ¥)$11^9fll6B (R/Hta&), #SD#17« &7-A©iS SMASHES, 3>y-y7A©S@J!:3i>Tffl*«Ii2tBSOi!iSBi,

-261- L - * & M £ M Mi f -5 8# HI ^ -DViTSISUfc

84@t$$B$ad ¥/*n¥nfl25B #JD*i9« ^?-A©a#tt««eA''S2000S®x.-5^

$50&«j$JlS ¥fi£l2¥lfl250 (S/W^ES), *)ira#20«

t&mmu. 0* dnat-?a'>j (ddbj)

14. 2

¥J*ll¥4fl2lB SDNABf%Rlf) #%P#7$ gtCA'fSEKTSeMUTV^ ragg mmmftmJBHSj ©!?□->©#«, 4#Bfm BtottiS, M^SStottSSSiMAfc/Se, #Ktob hcDNA-y'/ASffli$«E%©3 >V-'>7 A&mftZtztb ©##t

¥J*ll¥6Sl40 (R*#S*GEN»%Sf. «ft) #J)D#4« A-^r>itABi3700©ESttH. ? n->ES, £%##©. # Wffls©y;«£»sufc.

¥(*llf7^140 (R$*Ef4E) #jJP=#10« WSfSMftttCAEfl-E^-A©* >A'-*i*$ 9 , ^ n->E*#H. ##S$. x

-5'©<'*Ut-^ CBUtaaifc.

¥fiEll¥7fl210 (RSfiifi. «*SS») #M7=6 ss®©^D->®atts. ifi*E5im«tt«£smu sscv-fimyngcB ITlSbfc,

¥6Kll¥7fl29E (»*««*, e,S) *ip#5« WSSMftJ; 0 Eftiff-AO-li L T*t8®*©fiS £«iS l<. OSS E?iJSftSL/i9 D->l:oHTMP v-»W£MfeU W b L @©m#KR#eAl&o^.

-262- -898 -

^

'$w®#m###®?#xx x

’$±'$■4-^285 .M Xa X >x 1 ti$Ws

°-Vc--@’lSI3Mi'-",rA:--XX ®»*g ‘:WS®.MXciX>x nv, 1 4 A A 2? g 'Eg$ »Z#0(# (^W4: >,V«) B0iyz*zi^7b #fct«S4HslOlig

'@9 4-:!Gf-@*a.qM MS*y x i 5:SH4IEI6 c a q5|2.tic:(S¥®2S#r/69#tf to-3- ' 0 %-XgWiV

'3W2l*$:«il9.£)QTcf:,£ ' fi XA>A '(ilis?tM0§B4H/ 'X

at>*7tb -SB -EB$ 'mxx ar,xz ~sse '##a »£!#!%# B£^Z*ZI39* ¥««*4Hsl6B

:it£gxij##(##M2i®o3ii!s in 'itia^5B®f0^<^-Ai:j/4Zi0l¥i*®*

»OI#OW (W9IS«) Biyzi^ny*

°^n#mxnw 9SH ATA-aAW»id '-a ■SlSi'5-4-M:i«6t*6*t'7XA--A< c 3£i#D(# Bizyoi^xi®* 5Fti#3!l4HsU5e

°7vn wWXnMZMf-TA-X-xrzailaS^ 4 DA® .M X n X >XVNQ 1 Q $W.^##8 ll®iftX X 4 (W ' 4 X A 9 g 'fij g?ox#D?# M/-m B0£yz*n$i* M. 3 mwaamis**

i4. 3. i i

m) B VlSS0t(DBJIIW*^> ¥l&ll¥9;ii8B-24B7y U^THSLTto

98 18-21 B TIGR^-7y>^ (7<7^) 9^220 7# CeleraM.# (7'>> h >DC) 98 22 0E# TIGRH¥ (7 > > h >DC) 9^23 0 NCBliCT#^## (7 >> h >DC) 98 240 E# Lexicon## (ka — 7 h>) 9^ 24 0 4P# Baylor E#A#7V A>-7>>>^t>f-## (bo.-X

(#An#) TIGR5 — 7y s? : 172000^ Cetera, TIGR, NCBI, Lexicon, Baylor U Pc A# W T$\t, DNA#f > 70c|5lfT$ti:T Wc^c Wco /H NEDO (/J7HK, S#E), M, 3 >

(#g) M*iIE##7PU >x> h r#^%k hcDNA^y CD —'81 7 f nth INTERNATIONAL GENOME SEQUENCING AND ANALYSIS CONFERENCE J 1:#A0 L , 7yA#^(D##tR#&f7o^:. Ifc, DNA#^^##^#(D#j##lCck 0 , ^ >7#(CCelera, TIGR, NCBL Lexicon, Baylor B#A#Y ? A > - 7 > > > ?±. > 7 - 0 £ [W0f $ it T tA fc ft < Cl £ #

(1) TIGR^ — 4 s?

/w ?? Mi, mmnm\zrnf^%^ntzsB?fsb?n^.(D±?j

UlOOOASSo Primary Session &3H81Z. ft \~f X ffl M° Concurrent Sessional) Bioinformatics,

2) Technology, 3) Genomics \Z jrf 7 T 2 IB 7~D (PHIL # A 9 “ \Z 18 A/fi X 7—4z v > 3 >#&%, 95 tf) vendor/^ 117j\ L T 43 Q , Parracel, Incyte, PangeaCD *tz ^ A — Ifco -en^E'n, EST0 7 A > 7>l/ (D > X 7 A {fc £ f7 o T 43 0 , 71l > 7>l/> X 7 A £>S

'>a7 7 a 7 A > 3 y h7>>“7X>>>7 (iz > 7) 1407##&##>3 7>3 7/^x7/A^e>3oo^#(D>3 v ht;>>-7x>x&e

- 264— Ia, »g|*yn?7i ("tl/7 • 7-t'y^y-) *IW%5fc© * IJ 7 4-A-X7 A¥>~ b' >tS±©7-Ai!>lff I$U/t6?ijBAC7 n-> i: -> 3 y h # > 7-t>7’;l©*£HT£)b:3 >xx 7*® CiSfflStlfco t !/7t0i7/Ay 3 y l-^Oy-XXXyAJ'MIJ, 2kb ©M*aE9i|&*%LTue. 7-tr>7ft*fT-5. XOISftS©}# 7fcifn->Sftlf5;tM-f>hT (S68J±io%), X©U b- 6 6. -fc b 7 • 7-t >7y- «. SIC* T ±5 tc, 1) §t»u b-b©7X7, 2) ^--/t-y y7©Mtti. 3) +)77t>7U- T& -5 unitig ©# HI. 4) unitig £ 7 Cl — T##L XiScaffold©#^. 5) 6) *e.B, n ffS© 15 fx#2tx SkLJxyiSfiSitoXIcPSB^cizSyLl, — -> a XSfrotl'S. H C < #2 IcHStoESrTFf^. ESbS&Rl C <1*1# 5, tlT 5. Scaffold ALT, lMbpKUy© Scaffold ^21 flU (total=104Mbp), lOOKbpW. ± IMbpK T © Scaffold* 5 2613 - S X Scaffold*! 491 H 5# ?> tl 7b „ BDGP © BACs ( 95BACs (22Mbp)) , 99.97 % T — & U T IA Ac . b h © ilfi #t it 6 T 110# © * ^ 0 T- (E-20 cutoff) ©KSit^AciiS;*, ##. 0 ifflS25. 52lC)C*LT73{B©lh :En 7* *!M.7*pyAc. Atlll, •>3'77a7rtxy/A)!i!kl7yAffl ;ETjl*t 4#17^@%TT-7-&M©?&. ±gite4©4f . -> —7x>7tx(3:, 47)5. 40 A (7-fc >7'J — 15A) &BL. -> a 4 7 a 7 A' xy;AB1.3%fflD4X0At7-f- *7 4 >7ftX>-!fB©JiBo AliBf k < . K# A©#m©m$Aift*. 4#. b b-y */ a sadists (+)->7>»64oo75+i-> 74)

1) *%#7o 7 y< 4*1: j:&$%##itef ©###&. Rtto a-try 77, h->* tC J; ^)f@5f#MitlSf©ln]^ (D.Eisenberg,UCLA) 1) 6-5#5E*toite^V Ac*ito#7'n7 7 -rA-©E®tttc±ot, meii»±#y/A*m+#EE ?m;tshT£9, *ft©«S6tt^tt@TS5ei®„ cti»'e.s4,i:y/AE?iJ©ft$» *!i@5n46t:7fllT. HECStXiT7SrTi:UTl#=rEi:JBt>n5.„ 2) *«y/#©yyA©%g,#e?f-rM7 4 >A F*4 5>^n-e*tl©i*fiTltEE^ffl-T5

*SfiXPft-A*« (x*7C««.*D)AK*4#r©eiHb) b bDNAf y 71:4:6^#©## (T1GR) 19,200©cDNA7 4 7 o 7 b -1: J: o T, SSJBESe# l: *5 ti 5 84%g ESJWtfrSAx -3 Ac. @K##7>b7 4Aag#Bt7bbf 4©t4y4 bXcfe*. ©*m*%i#Gf»*£fT. sse

-265- b h ESTCD 7 "t X X V “ 1C cb 6 cDNA t — XA— X (incyte) 986 (763 + ^^223) 47075 (36075 + &&11075) ©EST^E^J ^^Lx B^cD^Bb^XtXXV-E^^#^ (^1#^).

(2) +a7'/i> (CuraGen) X X i9B^f^^au y xx -Earns x gem, DNA#e$f&a#c^cL9^%>%(DBj% S775/t“S (Michael McKenna) ^<'76ca j; Vx ffofCo C(DA^T250)7ESTs^EM^^ Lx X 'K-X A L

XV -&%#. 120,000SNP£Ir1So C CD 7 "L 7 5 / Mibft# 7 =L CD 1112% T & V x *£ ©^T'lC^ ft (^1#M). TIGRS-X + XX

^ss2-ax xv v MmT»mm6,ooo*^f 7 (m< i^ibT^gft), #l&%i:l#X;i/-X'7 h^#^T6##e##LTV^6o CCD A T X]#X%ftMillmiumtl#T lift W

(3) ill/7 ft (Celera) X — X X X — S x MI, X" X X ft—11 ABB 700 ft 300 V x 20ATSE= #KU:##LTW6##Tx 140,000,000# &/B(D^m#Aft& 6o tfj#Q:@llPCRft200-n\ Genetixft (D X □ — X t° y ft X X □ X y h (96^X24) ft3o" & 7##LTWft. ^7bmcPUlll200X W 7 C CX'ftficDXV AX XX - £J±$J L TID 6 0 Aeft@m^ft^7x ##Nrl:ll%##ft##Lx f##ftf##Lft^l:7^m$llT^6o CUft 6x khy-X>'»^l^t§o ftMcDVenterftft X ^DtlTffiftt 6

(4) XXX— (TIGR) ft#^ xxx-ft#*m##et>x-(DW:#?& om&#x/ 6.0 X —X XX —llx ABB77^38^x 3700^10"^##. ffltft t£ tl t L X 11X 1/ 7 ft \Z A ^$>11 6ftft#ft$>6. # ft CD AM 11220 A (l*!lRPh.D.30Ax X > 7 + 7 X X 7 X X 60 A), #WXXXAftLoft7LT^7x 3A(D#W#ft#Af^#M^27^^#WLft#x #<7SS/\Ejfi1"6= 9E ft TIGR(D ft g X X A - T $> 6 J.Quachenbushft tfi 3g ft CD ft #) x # WS^CDftft [7cA^#gm] ft*fj£0 TIGRllX-XXX-(Dm#(Dftft 5 ftx #ft#X

XAl:#ftLT^< C#A61l6o Lft Lx TIGR

(5) S i /( X 1 f 7 7 y — 1# # X X X — ( NCBI ; Natinal Center of Biotechnology

-266- Information) ESfo! Greg Shuler^3 Lipmann^0^fco >A' — ft 3g >7 T & o fctzlsb, C t, 6^## £ ivf 7> 7 ~~ 7 ^— A # (PROW> LocusLinkx RefSeq* Blasts GenBank^ UniGene) CD S ^

60^:0 NCBlTit, Blast^(D#%//;i/-yilit^%^^ (±#0f

^m1210A#$T\ (180A) (teLI/^;na#fc«>TiftVO T\ ^JA«\ RefSeqii. GenBankCD

^{t^T^DNA@a^&i5)^L^(D7y t--'>3 6^-7^-AT&^AA A#:(D##l:A<^<>7^-77y 77A#^# - ^##^7-

8AAT/t--'>3 >£{7 D ±if ^ A^(DgMb bT^#!#(Dt#±^^#3 7 7 7< 7- ^7-8^a^m-15^T%Dm/uTW6a(DC(hT&6. NCBKD^<(D#^#il, #A#A (D lz;l/T”Bio” 7* 1C 11 MegaBlast m LT#3 0 , 1CPUT'4BT'^Tt"-So

(6) l/^y3 > • yx^f-f 7 ^ Att (Lexcicon Genetics) tftfnl i995¥^E5:$n/t^>7 7-jhE= AM(if«byciteAMT<7AEM£> itfc f (D####V-;i/'r&^OmniBank6 LT, «Afr£7“/y h 17155^ L T ^ 5 „ f T 1:65,000# (SS^rl^V^cESS^AEch LTH20,000#SS)

V^o C0 7 ^40% il%^0ESTb7 y 7^ LTV^W LTV^6 ({& b^r^Tilfa1a)0 %#Li:< (D@B^J^Amc#3f-7<-Aa 7 yAEyfj^msDAto

1/7 7 — <5b#X.£>ll<5o Lexicon, Curagen^ 7L 7.— 7 & At£ -SriK^ L T, SS CO-^3 o^m^%i:m#LT7-7<-A{bL, #T(D5 (^HC3tt0lt^^m^^-r)o b't>bk, b hy y A@B^J^m%T-5 c C2-3 ^^C0#0^-7^-A0##bb6T$>^ b

(7) 7 - #£ 150 A £ Richard Gibs^$WT, b h TV A SBEJ^^CD 10% 1:^4T^ < ffll 7 7 Aii^^J 15 A T, 8Ail7°D /7 7 —2AllUNIX^##T&^)o 27(D7P7-l:iilS, '> — 7 >7"-^, /^<^<>7^-77-y7A^, ###^, A7v7^^&^)o $5 1117 n 0 , A'-f/l' >7y — 77y 7 A illoAff

-267- ft >ft-ABB700^23^, MD^3#, ABB77^10#C^, ^ - X 3 — )V \Z tephred, 71I> 7 7-(C tephrap, HS(C texgap£Effl LT V^Q rf-SSti, h - 7 ;H4CPUo rfS/W

^5o 96Xg(Z)m'5#, h(D$^DNA^^l?:giiiti^lTi/^o t, j; 3

btlfZa

7<7a(Da—fty>7^6il, ##(DftyAftDy;x7h(Dft^$r6^LT#^:, i" LTCft^ 6 =b&^ft-5T$>6 3 ft C ^ftyAya'yhftftC^^^^l^^ScDy/XftAM, C^l^T(7)±m#ft>7-a ft/X(D#7#, ####ftn7x7hdmeLTW < (D *e fr t ^3 7 Ji & & o ft (#&#y y AtcN l/T#TTk:%# Loo& *)o $6(c#c^cad, '75C

£$>tfTtA5^ttft)fr’Dtc0 ft@, MfiUE^ftnvxy h r#p§%L hcDNA^y

Afoft&wmmj \z&ttz>m^m(D-mtlt, ^-%-y /ft\ hft< h (incyte) ft, ^ra7V x> (CuraGen) ft, &£/lx7'>3 > • yi^T^'y^X (Lexcicon Genetics) ft COcDNA i:7-ft'y t^oynyx^htiilTUAfl

(D±#@a^j^fg-^(c#^6c 6^m^c^3T< 5 t & y> 3 = y-jut LTEST7ft > ft)!/ '/X 7 A ©ft a^i&c&sc6:^Te, ca^ft^xy#;!/ 7-y ^(7)^-7 7-7^-X(Drft^6(7)^E

-268- 1 cDNAx — 9 -X— X

Normalization Library EST %. 7-fe c SNP # m8 j, DB >7 7t urn >7 U&

Incyte Normalized Library 986 470 75" 13 y 99 5t DB ^ 20 tt LifeSeq-Gold cDNA Library (763 + (360 75" %% LifeSeq-PD fa zt Public + ^A 1.7 7? LifSeq-Zoo c=t o . 223) 110 75) LifeSeq-MB £< 0it

% Curagen mm#### 350 75 28 75 7 ¥ 12 75 COR, SeqCalling Rfftf-057B7 □ — >£ (250 75 40% Roche 7t>7'J □ — “ > if C ^75: < y + &A coding Glaxo DB, SNPDB =ta — y > y 100 12%7S Wellcome GeneCalling Normalization 75) DuPont, a C0O+, Pioneer geneDB 1/3 ArQule, PathCalling Genentech 5236 drug Biogen DB (Yeast target (Lhy/ two-hybrid) 1306 A 7 HitCalling secreted+ 4,M/u"CVi CureTools plasma a) membrane Lexicon y / A DNA Kl«i; 40 75 7 75" 4 75" ? ¥¥. 0* OMNI BANK 'XCDjf Aif (99 C 3 (v7Xiie 07>^Ai Vi 5# DB f %##) AC,t aG# 01?. A %%) HTST C ft HTST 7i(t by (Human

m

2

V >500bp / read at 98% quality trimming. V >550bp / read with some reads having tails at < 98% (<1%) V >70% of reads in pairs. V >67% of reads in pairs. V Insert length normally distributed with ±3% var. V Normal to 1st order with± 10% var. V >1% false positive rate on pairs. V Preliminary indication is that rate is < 0.16% V >15X map-coverage in BAG pairs V Estimate 12.5X coverage given 130Mbp genome size V >l-to-4 long-tcrshort ratio V Her 1.86 ratio

-269- mi tl/77t>^7-07D-

Mask repeats Sc reener

Find all overlaps <^40bp allowing 6% mismatch (1000X Blast)

ASSEMBLER CORE: • Compute all consistent sub-assemblies= unitigs • identify those that cover unique DNA=U-unitigs • Scaafford U-unitigs with confinned shorts & longs • Then with BAG end • Fill repeat gaps with: I .Doubly anchored mates. II.O-path confinned singly-anchored mates m.Greedy path completion usingQVs

Bayesian “ SNR ” consensus using quality

-270- 14. 3. 2 II

*anacd(«o¥j&n¥9m70-2607xvathbl Ao

mm^c) llth International Genome Sequencing and Analysis Conference HfHKift : Fontainebleau Hilton, Miami Beach, Florida The Institute for Genomic Research (TIGR) ; Rockville, MD Celera Genomics Corporation ; Rockville, MD National Center for Biotechnology Information (NCBl) ; Rockville Pike, Bethesda, MD Baylor College of Medicine ; Houston, TX LexiconGenetics, Inc. ; The Woodlands, TX

(«g)

A#, 7n7x7K 3>7-y7A^0S FL7°□ vx 7 h 69 dS# t" 5£HS6 U

llth International Genome Sequencing and Analysis Conferenced#^ $92000 A ^#90^7 IMO)tW&5EIE£:IiT\ A69dil#>T Vt5Celera> Sanger Centre > Washington A# fill# < Ofg^£> o fz, 7 7' #$L#0 yyAyo7x7 h, The Institute for Genomic Research (TIGR) d> ^[H]07^7$T'©3>7xl/>X$:i ti LfcBH9T\ 7UNIH©MTIGRC7)CEOCraig VenterE^92¥ElL= ## 1: h h%HiSfc70^DNASSE*iJ (EST) O7-7 7-X>fHSE L T O 5 Q Celera Genomics CorporationdPerkin-Elmer#

LexiconGenetics, Inc.d)KEBaylor College of Medicine^ C^T6x-77-XO

National Center for Biotechnology Information (NCBl) dBioinformatics^^##OZ^ 7 7

[Celera#07 7 7y krd EE^BiTOfM, Celera#^#0777^ H7X ^^Td|Bf = ^-[HlOCelera#O^ST h k '7 7 X d 7 a 77 3 7 7X0^7 X 5 •? „ $ A C#d7 7#77Ad^d^#^A#^:hkv7XTt)&%. Celera# d 4^#, k h OSSEXO^^ d A # < 7 7 h t" 5 = (Hflu) 73 7737AX (Drosophila) khO±7XAk^O##P$^,

-271- Hflu Drosophila Human

Genome Size (Mbp) 1.8M MOM 3500M Sequences 26K 3M 64M Months in sequencing 4 4 12-18 Sequencing Staff 24 40 50 Assenbly group staff 1 10 15

& : t h J A3500MbpX lOcoverage/ 550 bp/read = 64M reactions

BDGP (Berkeley Drosophila Genome Project) Or — 4 t Celerafch(D t — £r ch "tir n CelerafiKTT-k^y^ y ^ 9 T y - (D#^y—y y^^-y 9 Tt&K

T< 6o CelerattcDj&jH^oV'Tfi, f T j t m 0 (Baylor^^Tmm fcV'3„ CelerattiSftffl U ^ W LTV>5 /' — K/v(trEv^0 y 3 ? y 3 r>/

Venters 4" : #%fi#^(Dmaa-em9o v^

-272- Celerati:

ABI 3700 300 o' ^ U jr. — fc° y y 5-10A#/ 0 y —y- y y 10-2077read/ 0 5OOOA-l#4OOOAbp/0

Prodoctral Staff 50A Support Staff 50A

yy/ —Aft 16B#f?H %2'y y h

384X 168 (#) =64,512 Ztlft, Ay — Ayy£$C (300AA y7 A A IB 10-20 75 read A 0

Design Constraints 7 / o y 7 ^ —) (Gene Myers) -KISS keep it simple stupid (##6"CAA"Ct»"C# 5 C

- Quality and throughput have equal voice (aa H A t A t) - Write and Follow SOPs (###%## ) I t) SOP (Standard Operating Procedure; # # # #I#)

Genetixli (D y n — y t° y A — {t„ #%3 o' ((^2"o ##. 1 o' iH ^JP L fHA t & 5 A > 15^ 2 AT; Ay o-yy-y-y y (D Ay y T y y "1. ry^U- ffiv^aT/K K ^ y v a. —y g y, ^ 3ry yy#E (%>:[## n # y b

3 y t°oL-y s^cmT

- 2500il (D y — y- y y it $ £ l A l b# fi9 "C fr 9 0 - y ^ u mid:4oy y/<-r b IBM, Sun^#WLA^A AConpaqA^ffl LAo A y y T y y eSM£ixB = t“hs y ^e —y A LTV^6o

[Baylor A<7)T y A T t'r^l Sequencing Mammalian DNA At The Baylor College Of Medichine Human Genome Sequencing Center (R.A. Gibbs) BCM-HGSC (Baylor College of Medicine Human Genome Sequencing Center) T'li t h A

-273- 7 A £9 {& N "7 (7 y ^ '>a ^ v 3 ^ cDNA^ # #f # i& (i y 3 y b A y A^C, BAC£E 9 o 99^68 10!J7y/3> (l/^y-^ry^Cffi^) 2000^4/1 {d

k b (D SS ^IJ ^ /£ (d (i „ (Multi-laboratory international consortium)

Baylor* bl 15 fL1! LAI^K 17 n T£ri|fix(£>I;$:&' Itl'fco ft? #f $C 7) 1# 5(1 (D ^ £ (2 , M

VrifetJ

Zfrjjk&mi-Z) ^ k tiVZfz o

Baylor AO^@, AMtiM

ABI 377 16* ABI 3700 17* 7 — y 1* 150 A F^phD 7 A

homw, Aii^

ABI 377 250* ABI 3700 230* 7 #/<—y 110* Licor 70* People 1100A Read 35MJ^_h

500^ — y be #) -5 L 7) A lread

-274- [Lexicon GeneticsCO T y ~r -i t"t -i ]

Lexicon Geneticsl3:. — o — o (D jfHH-p H & j]\l z_ fz. X It ¥ tf} L . (7) |p ^ t — ?'<'-7s\k'tZ> r OmniBankJ

¥¥2£ frb ft #>&&$. T 65,000 it fcTA ¥¥1073 3: t 75,000ilfc¥. 45Ay-¥y LTVd0 y - A “ ft it (- ^ilbt 5 r A %U

¥zcAT#&a#%mc#;t6cDNAdsy

#t5o il$\ ^ftT^T^-O'd

¥Ai£TUy A. £ £>;£$£## fljT fc d 0 S* O^^ftcDNAyn y ;r y h A tttfc L^ftSrES-td A > ^ T <7) i§ 9 T £> 6o

y y Apj ^^ScDNAPJ Lexicon Human Gene Trap

ft&. y y A A#: iHIST1 (mRNA) iHiiS-p (mRNA) 7XL^- X?E£ y n — y (79$tf# ^ pj ■sr S>iiM — m *

¥^AT(3$)d^. ESTj:i9t)(j:6^4c#^@a^j(Dm#^T#, ^m¥dZA(D#^# ^ficoV'T t A ft#lc##T# dAdS#$iJT&do t L° - K^#^Z AT. A'(D:My !) v L^$)d^4c¥¥&<7)##^^^^TV^d A#t do mm, A##^

ABI 3700 2i3 ABI 377 6 p SUN Enterprize Ultratec %## 115A. phD 22A

-275- T7X, EX>7©#i% : T7X(:3WTill8^, 8000#, 90^E$i^t/tvf>f^5. K/O^CO^T ft 6 b 3 M£&©A£h /^ — A-t^nU-r^ (#©TK 6) (TTSS L Sf*I H A S '> X^A#i##LTW5.

M0^Ic DNA3>V-y7A] Sequencing and Analysis of Full Length cDNAs in the Course of the German Genome Project (Stefan Wiemann) teifiOn a : 6^:63 >y —3000& 6 4000©#T#cDNA^ @ W t f £ = F*38megabase FI — 7" 7 X $: # T o 7“ 7“ > 7 7 7 F^1997^9D b §fl£q U T F7 <5 0 x —

^gm^#©tExA£FL7°n7:r7 h(h©it# :

(2000#5^ #&TrFl http://mips.biochem.mpg.de/ FT j## ci ^IT -&) ©FL7°n 7x 7 h

ts ca&#mc#smLTw & ca&##RLTw;t. — $!&•£: IT, V^lNIHCD^TS tlTV^

Wik\zmfa&}'V fo Z> (DT:\ g*©FL7 0D'7x7 h©EM£ffm LT3 7V-7TA£B LT&c#6

[Genome Sequencing and Analysis Conference ©_bEB.l7^© h H %y 7 X ] O ;ET^4ll-yD y f 7tXt (Arabidopsis thaliana) Xiaoying Lin, TIGR Michael W. Sevan, John Innes Centre, UK T: :r LTfilltlTT'fclT'* / 3tXt (Arabidopsis thaliana) ffi $£&&%£< ED ±Ff £>nT©£o TIGR§#©£ti Lfc^-7 FT&T57 7 A-'> 3 7/f FtT'& < GeneBank^ 6 ©7" 7 £ 7 7 T"- 7 3 >t>£(T©I0iJSlT^5.

Lab Seq TIGR Annotation (k)

TIGR 22.3 21.4 Others 57.7 4.7 Total 80.1 26.1

-276- O ;£T^4ll->/3,)y3 ,)AiI (Drosophila melanogaster) Michael Ashburner, Cambridge^, Berkeley Drosophhila Genome Project

(Drosophila) <5 o 233# (11 (7) tRNAch 222 protein coding) (7)3 — M LXbo S fiR(7)3— H til3.1 kb (Cl#(7)Sll □"T'$> 0 , $5#Ef (transposable elements) \Z~D ^-nil71kbl:l#(7)#j^T&6.

O Phylogenomics : ilHb (7) fE6 *f 7 A## Jonathan Eisen, TIGR David Eisenberg, UCLA Evolution (7)7*6 ^ T0R7X-f ^*D v — ^@(7)g Protein functions t. „ D h — & 0 , 6000

(W 0 3L#ffMW#±###k ¥J&12¥l^50~¥/&12^1fll507X 'J*T ^56 L Tzo

H$r^5fe : Pacific Symposium on Biocomputing (PSB2000) > MYRIAD GENETICSo

: Pacific Symposium on Biocomputing (PSB2000)

(#mr) rx u zh/;i/;i/

(A#(7)##(L#^B%)

psb2ooo «, ftfc>tiZ>0 £S8tetA < 7riu7)-fe y '> 3 XzftfrfiT# 0 , ^InHi EsNPOt v ^3 >#8rL< ##(7)#^%mi:3^T(7)mg^ff

(#%#) USA, 3 — Dy/W 0 *, '> —;i/^^ 6 ^200A =

& t Aj o fz0

(ir v is 3 > (7)EW) ^#(7)PSBlllXT(7)10#(7)tv73>#^T&o/z. ^-7##) tE#^Mfv7XT$,o/:^tv73>aLTll^<^CI,

-277 SESSION: PROTEIN EVOLUTION AND STRUCTURAL GENOMICS

High-throughput functional annotation of novel gene products using document clustering (A. Renner et. al.)

cDNA6B?'J£#F#rf 5 cDNAE ?iJ £ M ^ E ?iJ E L T E M SB m $1 V — )V T' <&> 'b BLAST £ V ^ T E £0 7 — 7 ^ — X H — 7 fr 3 o

6(177-7-7-7^^0^0, ####, # EOj;7^^^Tr, 7^7L/^E^J^$,o^7#(:##7X7-7g> $> 6^ C#)7—X7 —XO H^a^>h$^7X^U >^IT^ *#^0^61^$)^. T-f< — X(DK=t:iX>h&X9Xf U>;X LTimn, M»fc7£77n7£^ttfc*g£;^6ummtmmryf-ya M4:^X>h&X7X^U>y76^^C, M7^ X > h^(Dgg#&^#L^o ?%:£>%. X-ATx^7-AX 7XXk(:%0^6#$$r

N(fx,Ty) TyWCk Ty*Tx PCk(Tx) = NTot(Tx)

7LT, 7-AX7X^kO M73.X>hdd3lf^7y7X37 max PC* (7^) z/ 7 ED Match(k,D) = 0 z/ 7^D :ft/^ 6 20 0 K7a.X > hDiEDjtelOl 11,

Dist(Di,Dj) = Nmatch 2 (Match(k,Di) - Match(k,Dj)) k — 1 E^#T6. j; o , 7A, M7o. X > h

SESSION: MOLECULAR NETWORK MODELING AND DATA ANALYSIS

Algorithms for Inferring Qualitative Models of Biological Networks (T. Akutsu, S. Miyano, and S.Kuhara) mx:h hXV 052#. ^&#OM^###^(lEOt'y7g MLT& 0 , E0 5>S?TX1RES^#^VE 7 7 X 07 l/7 7-7^70^##0^^^J 7-X^6^#%m&77'y ^)o X-U7>^vh9-XC^#%(LX

-279- c0#^0Begn, 5/^a.i%-yg> /x(D%#6g^4;^ h7-/ C C (DT, L&#o%\ ccD^^^yyo-^^nTmc^ ^). 1/-M^0A#^K (indegree) &#mTaC

Hybrid Petri Net Representation of Gene Regulatory Network (H. Matsuno, A. Doi, M. Nagasaki, and S. Miyano) (dm/#, 0%^. @m[nI%07^^1/-y3>Tii^# h V 4; v h a tf 7 A^7 y-^0##^5>$ a. I/->/g >T

0

#7

SESSION: DATA MINING AND DISCOVERY IN MOLECULAR DATABASES

Mining the Quantitative Trait Loci Associated with Oral Glucose Tolerance in the OLETF Rat (A. Nakaya, H. Hishigaki, and S. Morishita) ^/;B##a±wmmGEN#(7)^^. &< 2 A,0E#7y

L&y-y&mcLT, cmw £&&% (%&{£±(di±ls. S)

trait: genotype at j-th marker locus in the i-th individual Individuals k 1 i=l,... N *2 j=l,... M .. N_ maximize inter-class variance (ICV) = maximize LOD score

mi : m#07cc%o/:f-y#m

280- Stochastic Segmrnt Models (SSM) of Eukaroyotic Promoter Regions (U. Ohler, G. Stommer, and S. Harbeck)

^o&afx AE^TkL (ssM) #<&e>%i6:a

6) a^GGZfiTW&aA (ssM) 2fia. ssM^y>^;i/^j (##) Team tJ&b'DTl'Z

MigUlXf bT0jMte, *r

h Lfzt £ a, fco D. melanogaster t fc h >^;V(Dm*s^^^E^si:T^xa{±E^5:AnTv^a 0

SESSON: IDENTIFICATION OF COORDIANTED GENE EXPRESSION AND REGULATORY SEQUENCES

The Use of EST Databases as Multi-Conditional Gene Expression Datasets (R.M. Ewing and J-M. Claverie )

ty btcx, EST5=-f m&f(7)^7%^um§§(7)/7^x^u a casern, x-f ^urvxxtntz.^m(Dm\:^:n^m(D9-^ (#% ^x-7) titnii, 077x7 u >^mf:bfiT^a#A >yA(: a 6^lTT-^^-X(:#m$^l/hEST^-^<-X & ffl^T^afcfti:> :H6£&ag^;£JMLT

T^a&^7a4i:b&s%v). v^

281- 1 2 ► clustering, gene dendrogram N

f U >^$:m^?(:3WTeVX ^MUX9X^i:AoTW6C (h&m#8LT^a. LA^u,

SESSION: NATURAL LANGUAGE PROCESSING FOR BIOLOGY

Knowledge Representation and Indexing Using the Unified Medical Language System (K. Baclawski, J. Cigna, M.M. Kokar, P. Mager, and B. Indurkhya)

(umls ) £mvc*-*y

hzw >-rx %Htl

cumls^v^o ^, W£&'>x^aslv^„ 50#(Dm#(DM# (IS-AM#, 7000#(D* T-rf U Tel(D U > ^7 > 475000# 100A#(D#^T '7 7% 607}# (Z) * 7^ 3* U > 37772f#(D#^fm(DV A is a B, B disrupts C. C is connected to D. D. is a part of E. & £:

-282- information source

Domain-Specific a set of discrete Scanning , Notation elementary units (lexemes) I Tokenization Vocabulary, converts lexemes Conceptualization, into conceptual units, determine the part of speech resolution of v Categorization y references Relationships, produce a parse tree , Semantic Links , Representing r \ Knowledge Knowledge convert to a knowledge Representation RulesJ representation

Knowlege Representation < Knowledge Model patented indexing technology X______/

m:

disrupts Immunologic Factor Immunologic Factor antibodies

connected to

Fc-receptors NK cells

-283- : MYRIAD GENETICS

B U : 2000^1^ 13B

: A. Oliphant (##7/ ^ X X|S^gij%#), ffflProNet^ffi ^ ^JIO^S.

: MYRIAD GENETICStttiX (D JM 0 tl b Ttz'tf J CNS#?%\ ^3

fi6*6?7X &/c, MB i3j:U:^#3:##%6(DX h^x^'yX - 7 7<7>X, 43j:U:^#^®LT,

3 7)I/X p — “ y h 7 — Pffitft ProNet) ^ MTS3) £%H> TU^:gB7J#t^ b T B6 „ $ 6 ±TffiSf^ffl*rS*f L ViM+feOatfe'f £, ProNet^:® ltMoitt^-5. ProNet ^6/u^AT(Db: h&/Wf< # (80,000) (Of TTwo-Hybrid^(C 4; 0##% txE^fcMK-SdMa^ffl^x-XB-XtbLfc'BXxAT&So x'-X £ 3 > tf

60

Iflnpif-HX : itfc^xX h BRACAnalysis, CardiaRisk, ProNet (TM) EfcBSi 6 = BRACAnalysisteX 20©lfef (BRCA1& & I/BRCA2) (D &&&] &DNA'> “ 7 > X ##r j:U:#m%(Dm^%M^(DxX ht^50 CardiaRisk^ > Z> fc $> (D HkW P>

BS±fimxre6*7 -hxx :&6 = ProNet^1, ±fE

C^Bi, $6MIE/^X h&#oT& 63. ProNetBXxA^#JMT6 7, t® 0 O x~7 <-x&##T#, %!BW&(:^u:3

-284-

14. 3. 4 &*M93IV

¥^12^2^50-¥^12^2^ 1307 / USSLfco

G2K Back to Science : Advances in Genome Biology and Technology I Feb. 5-8, 2000 at Marco Island, Florida *B 7 a V y MARCO IslandT*58#$ tlfcgSB^Sfc:*#] Ufc, £CELERA£@@ £. L ta.f-f^>?tPJHTl>tOfc,)>f, f ®J=3%*KW:lmb/u,K:%< , e#4»'D®#A

%*

3 >, /W X)l/ —7> b&jStif (EI6flW. SNPs, ARRAY) ** + — 7— K. HTG2KT ®S«rtSi:ontlSt5.

y;»H-T=-s)®K@*i»ofc. t hyyA^Dyx? KDiifTtts ?S e>tX(CY^xy y A®ttS45 j:Oi, t hi7,)Xt®'rVAy'!*T®*;EDy-|tS ®ISMo High throughput® 6 SMb U fc SB /£ ® IS ill» Prokaryote T®^ J h,~? Dyi^t h®ttEICOHT®B@TaB5.

• b by/Aj3j:tK'7r>xyy A®E5i]ftS t h®-yv AE?iJttS®ieff^'Y'yx®ttssiinttLTv>5S6- b K T®##&7c C vy X ® -y* y A E M 8t $ C tin @ $ »t»z)> 9 2003 ¥ C ti: K?x b ©^BSifflttft! Tfc-5. -T-® b h iTbrX t®ratr®3flV^^®7bgB&T*®Jtfel2

5.» a w n s n -a t # a . s n y y a tt e a * m t jb t> n s.

Mouse Physical Mapping and Sequencing at the Washington University Genome Sequencing Center (John D.McPherson,The Genome Sequencing Center,Washington Univ. School of Medicine) f STHlO0t >^-t7^X^y A7”nyx^ Genome Sequensing Center*? 11 h h (? 3 V^*? 11283287 BAC^7 □ — >/^ 677003 >7~ J ^/^"?#*? 43 (3, 50613>f-///^/7DtV-Al:7tK>$HTW'5, 3>^/^$rT%7.7;E"?^ IE (edit) LT^SiOiltT, 2000^2^ 03® Ulledit^T

285- X C? X I: OW T IlRPCl-23 C57BL/6JS6 30000SS ®BACX □ — > £ fE 9 > physical mapping^ A IS3 XfSB S it A o T tA <5 <>

Human-Mouse Comparative Mapping and Sequencing of Human Chromosome? (Eric D.Green :National Human Genome Research Institute,NIH) NIHT'Shuman chromosome? (75%draft) X X 7 AO human chromosome? tS IrI Hi ££ 13 7> lA T SB M $£ S £ fr o T 1A -5 0 BAC CD physical map £: human-mouse CD orthologlW IB £: 7H ^ fF L/z0 $8 6 ortholog CD "7° D — X (>940) ~3800BAC&##, 112BAC3 >T-^r X'(AT-fe>X;i/0 C ® 3 ^,36% (61MB) Shuman chromosome? (h # MI % ^ o f:»

Human and Mouse Genome Sequencing at the GTC Sequencing Center (Lynn Doucette-Stamm: Genome Therapeutics Corporation (GTC)) GTC sequencing centerT I1ABI377T 6Mb/day, 3700T 4Mb/day® # ® @B f'! S f# # & ff o T tA <5 o (MegaBase-10001) o ) shotgun □ *b X 3> \A T13. 31 D X—tf — U — Ax V X> X°7^X-Er|-$T^:TS$J

• Automated sequencing### British Columbia A % Genome sequencing Center (Washington Univ. School of MadicineK Center for Genome Research (Whitehead Inst.MIH) > Stanford A CD 4# ^ 6%#^$)O^Co (Af fit) 3 ox — (^7-X) MvVfrh's — X >X AiiiOAXfA© #So £fl^fflTSJjlATlA&o y>7;l/%S/XAAT#&AA:/Xdi0#4E-3T&O, (HCCTt)AX

Technology Development at University of British Columbia (Andre Marziali) The Completion of the Stanford Automated DNA Sequencing System (Thomas D.Willis) 96 A small volume thermal cycler ® #1 H ,^U 96Aplate6$C — $ tCPCRJASA <5 o (300nl vol ® S ® M I X $klt)j t. ® Pel IS § Sfj~C life 1A t. ,Sf)fl -S o Stanford A T' (314000 samples/day^X )V'— ~f v h U 5. y ho ABI3700 x 8,MegaBase x 3^ U ~600 lanes/dayo Reagent I $1.13 (M13) $0.46 /sample

+ | ®3X h disposables I $1.48 (pUC) $0.43 xl/2 dilution xl/4 dilution (enzyme)

-286- POP57^ —Snx b\ZjtM&$>%> t\Z\tX9 y 78A£'go

Componens of a Successful High-Throughput Genome Sequencing Center (E.R.Mardis Washington Univ. School of Medicine, Genome Seqencing Center) C C0XXXA#3 37007T8 cycle/day (2 X 384plate BigDye terminator jRJ^)

3700 7X384 /day X ? ###. MegaBase * * tf 7 U “ 13###„ 37000 h X Zf)V '> 3. “ x 4 >X&WebTi>M LT W5 t Gigabase Plasmid Pipeline (Kevin J.McKernan:Center for Genome Research,Whitehead Institute,MIT) -I(DIg?:7 0 V— h n

• prokaryote §t[)!l 7?&Tripanosoma,Salmonella^ o Jto

SNPs##, micro arrayl: j; SNPslCM

High Throughput Identification,cloning and Charecterization of Novel Genes (Audrey D,Goddard:Departmant of Molecular Biology,Genentech,Inc) Genentechti: 0 % o $: /i[ $ tl X J A SH^'J ^ h xfr U v — j: L^Wet## 1C j; 0 ^

^5# 6 , ^ IhJ MU i® 1? 1=1 £: Mffc A &<> 0#### 1C llS.cerevisiaeT0^^^ signal sequence trap (SST) ##, T V y Md 7> T 0 2fc ^ & < g m 0 perl script £&oT^5pJtEtt&9), Wet## (microarray (C RT-PCR, CHO£ tevaculo virusT0^#) U E&'U^^r — j? x; — X OX T D/B b. IT) £:fF o Till tfc L T <5 „ (SPIDI:Signal Peptide Discovery — Initiative)

Xh^ □ 'X —#S^SE^'J^dbEST> INCYTE Lifeseq D/B, &£EXh TG (^0SS&^ ^J D titi L) T$>5 = lTtt^00^re«f 7 7 ^ >J -4l, fr£ TNF family 17?, fr^TNFR family 4i0^W^i§ $ ftfco ^;EDy “0^^Soit

The Human Open Reading Frame Project (David L. Baker) The Design of Genome Derived Microarrays of Human Open Reading Frames (Dvid K.HanzekMolecuIar Dynamics) Molecular Dynamics# 0 fg ESTX' — X ^ 6 fz it T & < , XV A T — X 6 ORF^

-287- #£jf£bTdttlC|llbTarray£{F/&U %###, SNPs&S £# o T b $ & 5 5 cDNA(D^o- - S^fcS. EXON^tidlC #GraiL Genefinder> DicTion#)33#)##V7 h&#oT(b&. ±20765exon (lgenetCO V^Tlexon : —#±^(7)^0 ? £jl#bTV^5) $:4arraylC(7)#T ##o 10l©Ii, cell type(COV)T/\< 7U . 7284^gE^l6^&, 57447^K£0d 0 , 7728#S#f#T&o &«, Array## lCj:#l5#T#0RF#) 3 ^50% 15 Z < b

Haploinsufficiency PfofilingrA Genomic Approach To Drug Target Identification (G GiaeverrDep of Biochemistry,Stanford Univ. School of Medicine) Yeast#) dffr#: !C &tbTH'A#)jfi,E;T?L:£::AAb tc heterozygote £: V^T#^!j/\#)#^##) #AD(C5: 'MteT-t LfrbOo Gene dosagedtft* Cl b. & 5ll b T lb •?> o S.cerevisiae deletion project (C 5; -o T fF b tl 7b > A A oR 1C Tag sequence^]# A ^ 5l/b# o (competitive cluture) „ SH6*J 1C Tag BE A" ^ array IT # tb o TunicamycineW 14 lCOV^T2^3#)#iRf^:#Hj b T lb Tbo fair z>0 ESTT 15±EST©30%^^#. Array^^d$,t)#2^l##)dioxin^#)fE;#^#m bT #@ M 1C 'f IS 1C D b T toxity related gene expression information D/B 5: web (DEG:databade of gene expression) T'^i bTb§0 M$Jtarget^

A High-Throughput Yeast Two-Hybrid System for Rapid Dissection of Protein Interaction Networks (Robert Hollingsworth: Glaxo Wellcome) RbboticslC 0 yeast two hybrid system & Fl b, He 0 R t§ 5 fF #] D/B £: fF -5 3 t. b T V^o ia^/u&K A^A(7)matinglcj;6%fg#:#^&^Jm b^:# li; d 15 iS o T 96 well plate(C [target protein+yeast DNA binding domain] fusionifiis"? & A bfcyeast^rA* yJ-'Vftfe b ^ d 5l 1C [Cdna + yeast transcription activation domain] (Dy ^7'J$lAt^o 300-500 cDNA clone/plate X 1200plate 1/9 V $rfFl^o ^tBb zb positive cDNA#)## & 6 SMfclo Unigene, IncyteD/B, Genbank, Genpept, NR (peptide) 1C ft bTBLASTb77f-'>3 >#15 o

• SNPsH@ SNPs^ tB @ Wiit fb L fcSNPs&ttio NTP£l@B T AOX, DNA03’^^NTP^IS'S-U/r U U — X $ tl ppi ^ sulfurylase V^TATPtC! U th "f X )V 0 ruciferase^T^ L tct)^ -X>XT#^, 2^#iB(0^#

384well^ 'i y y jl y h 7 U y 9 caldiovascular screrosis^: x JliZo 115A(Dm#(D^#TSNPs#m.

Method correct ambiguity incorrect SNPS scorering RFLP 113 1 1 manual Cycle sequencing 88 16 11 manual Pyrosequencing 114 1 0 automatic

>99% correct SNP detection and scoring:$0.6-1.0 384well plate x4?£: 1C##0 5000sample/8hrs Pyrosequencing:RecentDeveropments and Future Prospects (Mostafa Ronaghi:KTH) Nonsynchronyzed extention:NTP£:?is hU'T <5 [el Z2 b. CD l^S# tlTS Ai^S^lOObase^S ^ "C L 7^ V"> 0 #^89 1C ^piezo electric sprayer (printer^#) & ® o T array ± 1: DNA^ T &. StoOOM mflSRlTX # V hTf^o

Single nucleotide Polymorphism Analysis By Multiplexed Flow Cytometry (Michael P.WeinerrGlxo Wellcome) 5’yki^tIX.- — XE^lJ (ZIP-code) — (capture probe) TPCR 1# te Is fc genomic DNA \Z A X U 0 Z. tl \Z SNPs & 0$ O dye-labeled oligo nucleotide £: A "f X U $ iir > ligation, ^ fz fctdye-labeld-ddTTP,ddCTP \Z <£ <5 dye termination Mo ZIP-code#FACS##. SNPs&mT'^S. -mi:100#(D ol#o 96 or 384 well plate# J?J o

• technology Hi! Lysis matrix:A New Approach To Plasmid Isolation (Life Technologies,Inc)

M ^poster Room Temperature Archiving of Plasmid Clones in an Automatable 96-well Format (Life Technologies Inc.) FTA h U V X X

-289- &2>. (96 or 384well format^ 0 * A0 7^7 7 h L Ttube© & ^ T# o T =£> /f Utn-;i/X T(D#^#,L (d j; -& Micro plateT 300 colony COTE## (DNAM^F^) (f> 2mm<7)punchT61500 colony DNA$:EilX^'y h pjfgo DNA0mm±#^#&a LT#*$)6o

< Computational Biology > b h^VAO H77h^l$|l(|jS(:b, 7/A##(D^m^Web^-7T7.-if-^ J;7 (dT6 77xA(DM@^^^o^*

Visualisation and Publication of DNA Microarray Data on the World Wide Web (Christian A. ReesrDept. of Genetics Stanfod Univ. School of Medicine) Botstein )V~~ Zf * IEi^^S.^ A 7 7i$l|i|T CO DNA 3 b— fcfc(D jS lA 5rcDNA microarray o /i Iff T L ( array based comparative genome hybridization:aCGH 7 DtV — A A* 7 □ — 7*) > Stanford# Microarray D/B (SMD) £fFfi£* ~1800#Gf tdO(AT84(D##(Dx—7 A' 6 ^5. Array± (090007 P — >

Expression Profile Viewer (EXPROVIEW) A Universal Tool for Transcriptome Analysis (Mgnus LarssonzDept of Biotechnology, KTH) microarray Id HI L TN JA < 7$:^^69(d#7T# %> U 7 7 * A) cf cf Virtual chip ALT" array spot Id x — 7

Unigene,Refseq,Locuslink:Expanded NCBI Resources for Comparative Genomics (Lynn SchrimlzNCBl) comparetive genomics (d 100base(DiES£T CD b y Md O tA T 7/fn| T y 7 (querry A £: Aft# *T*)b 7 NTS) A-7io]7y 7 (An#x^)Ab'7 NL&17 Id THt T ortholog £: IB #o MteNCBICDhome page LocusLink,RefseqM *

ENSEMBLE:A Database of Reference Genome Sequences with Consistent,Automatic Annotation (Tim HubbardzSanger Centre,Wellcome Trust Genome Campus) Sanger Centre AEBKD^ R VH* Sanger Centre, EBI, EMBLCDhuman genome drafts 6real

-290- time T exon 7: # ^ L transcripts,genes IT ^ U autoanotation £ O (X /i: X — X ^ — X Humembl'SrfF -5) '/XfAo 24hrsTnew rough draftE^J £: #T T' cf <5 „ Exon^M tT GeneScan£ EL> f# 6 tl Dc predicted peptide(T 3 T# 6 D/B^BLAST# i l/T -€■ CD transcript £: IS 8§ *7 -So Pfam x — X ^ — X 7 IT automatic functional annotation 5: It D 0 Transcript assemble, Pfam modeling CD X X "J "7° tT Wise2 package ^X o

Interpreting Working Draft Human Genome sequences Using DOUBLETWIST (Pouliot,Y: DoubleTwist) http://www.doubletwist.com Online genetic research portal 5rHl^7-V —» Hfj $ 7l T CD •£> human draft sequenceIT 3 CD T h b J^XL cDE^J^SIIE^J^: 7X X £ b ftD/B£ j# 770

u xt -zue ^jx -x ^ g t> 7 CD BAG clone L /cx~* X £ & o TlD WTWeb<-XT^t D WTcomputational^####^##'#-^ C 6: t "1#. SrlA> bxX b^m^/hl7(DeME^(Dt%X-^Ax> b(D7 7-

Biological Applications of Quantum Dot Nanocrystals (A.Watson Quantum Dot Corp.) Quantum Dot Nanocrystals 13 #T L Molecular scale LED (###) 5nmd>SS

Non photo-bleaching:^^ *7 0 120secl7,-h

JA tDbB X G In situ hybri,array,reporter,sequencing^ Bf IT f lj T cf -i5 <, 7-A-3- b t ltf7 C ch (X)

-291- T X U tJ % S ^ % Washington D.C / AISliiSifeS^tfrlnlLtCo Bioinformatics <0 m*i;(0Ae$^##$tt6ii^. bF^y/AtoF^x F^ffidfMT'&D. VtfXfc-enUMI&fiU Z'MMciLT 5 o T#T H 6. w^A^^

<#Ao

CELERA Gemomicsti: 3l##T (CELERA1) b4pgS T (CELERA2) 02OC|ft^^5, %##450^ CLERAl^v — X y'yy^m. CELERA2^##f, (H —A —JR|fi)„ bFtt>X7FbLTd^%5A(0th>y;F (A#) ^6yP->m#LTV^^m&d jA^A@T9A^^6##. 3 >t >tfXK^J(0#^!5!;lXSNPs#m&ff oTW5. E.coliSRES&dSO cm x 30cmtf)S Aplate^ 6GENETIXti: Q-bot (384wellX24) X4i? T' b 7 7 T 'y 7 o Deep well plateTT%# (bar code##) o v — 7" > It— (1 ib T ABI3700 (384plate) T8run/day/machine<0 X ;F — 7” 7 Fo 3700(120 <0 7 U 7 b ^rgRTSOOit $> 5 »

1 % of human genome /day <0 6E 7J tifc /b X b — F <> c:ne>y-t>7ig0#< 1 y;f-'>3>®ffiai^T^3

> F □ —;i/S^ 6$iJli$iiTV^0 n > F n—;i/S dec« tt— A C0400M Hz^Pir yitx5<&M^J*?'>>#*$:-h$L ^^-T — y (196TB b FXV AE^IJ'>-y >X(l2000¥4^ (Cd^T l/c= 6 7 7X (2000^2 £ OB#^T(lhuman genome 090% £SE70&^ b , human genes (097% /^CELERA D/B(C#0 £tlT Wc = ¥Ab (iy y ABEyj^^^T^o NCBI(: t,reposit!" 6 6 L W. y (COWTdSwissprot&Ef/xOiFtPi/-##, 7 7 X b COit^, &!:££$ § 3 >0(lTlo3o C (O 0^ ^ 2001/March^ 6 Web (virtual network 7# #^) T Publish!" <5 „ (BE7J (13 >*b>ttXSE^J) fctcU ##o M40 0MS£Sb3.-tf-$miTV>50 7 y (11^4003.-tf-b^ ? (M.#6%(0^m3 6L^?) Double Twist% b M # (0 b ^ ^ X^SilTF^. X# (lproteosome,gene expression,gene finding,SNPs (I & '>7 F L T $ < <> 6 million human SNPs^rt&titi LTI/^0

Informax^t (World fusion#:#* 0 NCBIX F-7F, GeneMaxdSE^TGeneMax plus&7 7F< >X F

— )Vo (-?ril ^bllenterprizelE b LTx 7 7 F 7 — 7 £: 7F L T tt IrIfFH £:ft A 3 client/server y < 7(D &(0 &&&„) GeneMax (1CELERA7T TV^ b <0 b b„

-292- VTNIllESK^J, 7 5 7 ^ E (CrustalW) X > Mfe , ^ 7 ^ — EH*(D simulation h. abr>-y>it-o^x- X /(P bdrag &drop7T^ — X 3 — Jl/, contigT 'yfe/7*'J GeneMaxllEy'J7' “X77 ~>X7 7 ho x —X##, ORF##, □ 7 —OBLAST ##, 7yx-5>a>3ii, hxo##ii^x>/^X(7)#m##T, ^mam^mox- X##7l2:(7)##b$:^7)o ISIS ffi3# ;B O SJ & fr X = Array data mining## 7 a. — jh iim%a&o all-in-one(7)V7 hX J:7T$,^(7)^^jAT&67)A h$l:ll#l:t b b j;W#m 77h^&K), #(:B#Tl/^ll#C61l7l^o

NCBI (NIH)

NCBlll4A/^ b TS6250A= Library center <7)8_10l$g = 4 A (7) 0 7 % 0 — A Dr.Dvid 1C. Overview^ |& B# <£: 11 » Human genome drafts Hi 11^7 0 t ZL -5 2000^31 ^ LV^A 7“^®^ 7^Ut-^—3>h □-JK:O^To NCBI £ A Til registration £ tile X“ 7 > h n-;i/^EL v^o SNPsli: t <7) ff y AlW#A#*tA<5 t(DZL to Promoter#^ 0SNPsll

TIGER institute ±< 0##tlj@#:T^ OCELERAb0###lllC^o TIGR1 bTIGR2<7)7 7> 02lM

T(D##^ CELERAlIit^ tfrfc 0 11/L£ (a#A h h ICEG 1A y(#m^#i:3^Tii TIGR homepage# M#) shotgun clone tf) 7 D d—if y T > X*llGenetix0Q-BOT- J#o Template^ M 11 fc £ 0 £< ll£Tb&7-A Utfy h-«T¥S*frWK:fTbTlAS. 377x44#,3700x20#, & :> —7 >11 —G3-MacTTtlT'tllMo 3run /day? X ;h— 7" y h 11 0 (I £ b T % ifi b 0 TU hj t b (A X & (A t 0 t t „ CELERAll t \Z 11 > XxTxy 7 XT11%:VA

GeneLogicti: # 1=3 (7) tissue source 1Z J; -§> 3W t3, commertial 1C 3# # o $ 7c, C0D/B$:#bT drug discovery and development (seed find;! T) ^ o# 11 #. b o X— X A — X GeneExpressllBioExpress, ToxExpress, PharmExpress03/^— h A f£ 0 ,

zL-*nz±x%z> o tvmz%#&<£>&&%&ta *x X AD/BfM<# 0 , cail##7.-lf 0^1: A^^#KT#7l Vb BioExpress : human,rat,mouse0IE^‘/2W#:&B.$$ 1 ft llcell lineT0 X > 0 is 00x —X ^ —X.

-293- ToxExpress : RatEIS^ rat, human primary cel Is ^ fy)W

PharmExpress : therapeutic compounds GO human,mouse,rat E^-^cell line^-(7)^ # ^ 5 0 ?h

Expression analysis data%#(7)&^:## : 1. El&x ^fflASrcDtranscripted mRNA£ ZfrlZ * y A' LX t (7)iiV*£1t

titix = ”READ”>y x m, 0 (&tfj li S t-atv ~ ^ 7 ?) 2 . GeneChip (Affymetrix:human,mouse,rat — jo Z (7# iJ <7)chip:3D flow through chip

r iL^-e(D2¥^-eil ( t 9 C #22) (7) drug target x 2502

7 general accrual sites,8 academic/disease-specific collaborations.#* £>4000^7._h

$fc#- L T H1S;4[eI] "C # 6 o High throughput,low cost G

i4. 3. 5 snuasv

^-7"^ y D ¥/&12^2^5 0 ~¥h£l2 ¥2fll30Ty U#T'ilLf:0

# #y"c Washington D C. & Marco Island, Florida

it)^ H12.02.05~H12.02.13 Washington D C. & Marco Island, Florida

^®x ;r>a>77- (JTBifix (7)2 K y/-b-efTj7jl,#:yyA^##^ (G2K; Back to Science) — #A0i-5$#*T#/b(7)-ex 6 o !# fol j£:H (7) Tf; — -A ■/\ — v • T K 1/ y -Sr gd L/ccO'CPI^cIb&lR]# (3: x ^ ;h, 5r # BS £il /c V * 0

1 . Washington D.C.i&#(7)/

Washington D.C&^(l#&x ^'iz-y-^vy h L T/*y T y dkS<7)Sli#*S #iCi@AyT6%%-C^6o (Washington D.Cx > y-7> K'lH (MDh /<-v-T m (VA)o /Wd-emH-RBiUfx yy-7yK'J1M$Xf&#A

-294- ■CV'5)0 tT'II Washington D.C. ji iZ2 (d tt 12,183 (O/n % % 7 , gtlZis }) =1 > s< 1/- (11,930%) £±06##£ ftoTV'So U y^/<-7-) (d^x (bnri'So 80%{tJ%l#m^ LT#7:%7 #£, ^^TlOOSrEX, 7 V — 7 y KM tz. It 7;'6o &5 b (D1 b (##$ "C(d, ::ts9Wet-Lab^t)Of^;p^- tttt, y=^/<-7-^ ES'j-ra#^-cw*7, • % y^^- 7-£fl*tf, ^(7)SC(i^ 6 (CjtXT^^-e500^EXd)o ^tEJim L^^(DE^jkS5% t, r 9 Ltz.±m(D^\z.h 9 SjS^WoT^^dkS (tfcfiW^oofcSdfeli) -efcSo < 6t>(tic(ie^^v^%(c$)5@9o b * -e t>, it (Information Technology) 2 ^ % 7"-^ y % f ^#X6o z 9 L^A - A - b £ V ^ y b V ^ ('T 'y ~%: a-s< — 7 —) 0^^® (--e# 6$^m#L^V\

(1) CELERA Genomics% (0f%ftti : Rockville,MD. WEB : www.celera.com )

CELERA1 (3|^ST) CELERA 1^ y —%y yy XSfflfcfU CELERA2^#^f 3 ^oTV^6o %##(^450^@ So DR.H.O.Smith (1978^7-^/1/#^:##) ^ TIGR £ *9 31 t tk 7^%#% b M# "C &> -5 HI PE Biosystemsti; b HI Compaq Computer%/(p b ^tt^Cift/DNAy — 7 y %

*9 . ^ (O #[ , B IS- 300 t? £ <9 ! 7 “ 7 7 t “ 11 ^ t t lr t W ABI3700 (384plate) *C\ 8run/day/o ‘

< E x. 6 ) o S^^TI^t-7 96TB (77/<% h = lOOOX'X'/M h) td &^o ^%I^B^(dfc/do7:'^S#^^S(-j: 5 £ Human genomeSB^J v — % y 7 (t2000¥6^ (d ^m(d(iT/%-y3y^#^^i:#Ti-5T^o Z(O^mij:2001^3^^^, Web^^-CXy/^- (%-7(Oy^TA - 9 h&fc----) b(D~ bo CELERA Genomics%tt#%, 4^(DMM±M b U % —7 ^ ##L-c^5^, ^(di&^igidmf 6 c ^ ;Sri66tlTV>l-6<}:Sx:5/d7)9 0 human genes (d £> It 'b 07% (ONIH b (O 1^% Hl]{£ (d o V' T (OH^id^t LT rcELERAfiJ7^NlH^^% /d r b 5 • ■ • • J b

-295- (2) Aptagentt (0TStiti Reston,Virginia. WEB : www.aptagen.com )

digits *#tl##S28A(D Colin Higbie&. DNAE^J##^7cC, f u wo /ca^ 60 ^tt CD %j ffj & , Apt ( = jj§ *2) ) -gene$: D o L-fc toSyeast (the Pichia fafforw) ^ (D If < >%: (codon frequency(D7vy) LTCfia. T

NT# LB&6 A 3 < 6/uTW6. £j&bA7>Ay &FDA(Ds8aE&9X#T# 6 & @# L, ^43^'T^>^)^[1I®'77“7 >A' — 4^13:, AWlcIflT'^/Kf fa cf £> h o tz.o

(3) NCBI (NIH) (^rffctit! : Rockville, MD . WEB : www.ncbi.nlm.nih.gov )

JA A&NIH CD S[ till — :& > Library center(D8-10$glZ & £<, NCBltil988¥4A^ 6#! £oT^6250A@S = 4A(D#J%#(7)3 ^0^ADr.Dvidl:#g^#%a(DTopics(Dm%^^: ttfco #&M^^(D##fWEBy A7A(7)Tt$8^^^^/Zo DNA*

— // %>%##, Gene Map$^te& t A 0 , (D3D$j££T&%

LTL^^@T&-5. C3 L&Of-*

(4) TIGR ; The Institute for Genomic Research ( 0T fE i& : Rockville, MD . WEB : www.tigr.org )

mm£TM.¥%tfAn&te/vr'gfcfcMr\ mmTo^u#L^$ijwT, fe, %#(C# AT W^:. TIGER1 6 TIGER2 CD T T (D 2 Pg # T(Dm#^6^D, A^WL, yXfAW^t-hZ-yaXb^T tlSB/uT& W(DT\ Craig Venterffi^^fr £ $AL L fcCELERA t \tit £ T b & WG

*(D^^CD#T7'yE^-A^m#T&o^:. &X^'y7(D#^7<^7$r%DA T>A9-T#$7'y7&#oTWam&&aJ;5T&5. human(:^6f, #m# yy A(Diy-T>5y >y^#A^fTV^o

-296- (5) Gene Logictt (dliM : Gaithersburg, MD . WEB : www.genelogic.com )

S'J1:V7 jAm@(Dm*a(D7yy7k7&#S, ##J3- -If—(D^- —y —±^<33(D^-yvh (C ^ tlT T > The BioExpress database > The ToxExpress database > The PharmExpress database & 6 % 5 . — X (human,rat,mouse • • * •) £: jvf 7 V 7° IT (7) ^ —t) — & £ \Z fH 7 «. m^i:^^o^##m^%E#Zareh&^B oT WT, &£ acDc a. 6 7iy7^\(D cDNAiH $g ^ U £ \t H it T < ti & ^ o ft • • • • \t V Z. © £> fc 0 ifi A ^ <£> / 7 A 7 ^ • • ••?)„ 7 v 7ll ^ T Affimetorix* ^ ^ ^ ^ , #g'J#$9##TA#. ?£ESb te7 k'y - ##$iTT(Aa. x^'y7/ T^3 0 , BS(DS#:@^{bAii6 l(A0 Wj:t)©m 3D flow through chip£ 3&7cMMT2a?<7t)&a77/: (@/j\77H7U-^*KC#/uT-%(D7'y 7°£^J5£bfc b

G2K Back to Science : Advances in Genome Biology and Technology I Feb. 5-8, 2000 at Marco Island, Florida

Washington D.C.&##7/ M7 □ U ^T^37< 7 > KTl*#$fl/:G2K (Y2K^Etl5l o /(MtT, G2K^Si b Pf/u T V> fc) IZb^MLTto #

I*b h • 7/AO H77 hSE^iJ (90%) #5Ji 5 £ T £«&It # 5 «> £#* o fc J b T?eiS®ilSM%5§£:fifr-3> /tyjtB National Human Genome Research Instituted M &> Francis CollinsE= CELERA Genomicsti:(D$£}B 0 & 5 ^ < <7) % S T'$> 5 #7 ^ 6 (C M & tt«tB*#rffl3ft©mgft£3ft#IU b h^^ScDNAOif^^ff LS^d ItB (2000^7 1:275000#&7 7

£(DZ£ o HK30^T\ 59i£S©fS^£6O0)3ffX^ — n^V^TfiJEL^V^o ^im^m-imB(DM#T&6^^#t)Marco IslandT E(tTff 77Sb(D^bo Marco Island^ 7 D U T'O® m&lZ & 0 , V V- H6b LT\ 6 LW.

-297- *15* J$*©*lStt3i is. 1

77,677^7 a —>©h MMB& S T/IcBSie*® V a'+t 7^7 'f7,7lJ -zi'BiKSSnyccDNA^ n-> j: 0 , £ $E® l TgrST^oS: ftT* -E>nJ|gtt©fi ©8,143:7 n —>S:*tkU, A U y 9 7,flfji&:7 D — >i©SSS:iS 6$, 4,957:7 a —>S:S:ftE?i]ft$1C#Lfc, —A±#TB@©A 4,000:7 n->©^:ftE5iJ&s^ e.i'i:t5 c ±ftE^j©##r^#^6. » 3,0008# ©$r Sit GTS 3 — Kf 5 tStitl-ScDNA^ p — >T& <0 , *EfS3- Kf %t)®t#SS4ifc„ «5?**l8ji6l|fiT ,A^f-5'<-7l;S8S flT©fc±6#ifi»iBfl e> A'75: b h cDNA^Kl 10,0008# 1C b iltfc/io ft C ©ii#£ u y-x©isg*o*«ofcc ttcj; o, yyAy-7r>x^6M6T)4:$4i&iig*%m©*ef&msuL, -etiB ©#6g&«ff vfcoestitcsffl lti-k zt6©M%w%©x b- ic Joanns feffltlftTSS.

1) #¥F ©#M ttiEB : ¥fiKl2¥3fl7B, HIES# : #62000-061464 [#8fmsn ttiEB : ¥f6l2¥3fl7B- KE## : #62000-60548 • mmmmf (2#. 6«e?> • MBSItt (4*fif)

-298- 15. 2

xmwmmastt

1,000* SStWlt, ±¥MEtc*5V^Tti, D> S XS§§PE-ABI3700^HH¥5> L^EfflTS*f> ft#BPE-ABI377£ffl C

* U — > (D@k1fc(DT%m&. One-pass '> —* X>X £ CD ^ ^ H X> tD T UMlM £# i$ $ ft tc. W o

I^SOig^-i^jioTt^yy X • 'MfcTZWZfrizTZ z. .

h h cd ^cd BE • fit, ##%(:

w$tmm • its, *«t, L

-299- is. 3 a±acDNABM*m

**3H®CMESnTH5*W?SM^©169IJ. U -y £ > ?'j£ (2 =fc 3 t h cDNA 7 < y7U-7)>6a#S:#BSh/t«m9n->#j;0 1^M I: 1000#?*tit ± © cDNA^ D —CtlSr&o TtrV A(B#rOT35ft-ST©fc&©»W liSfctCXf 5 CtT*$5. W#lcW LTtti05.7%©a(S;^)*#lclS LTIi. %t r#mM£ hcDNAtr / AtSffljMMSj E^I*&^-AA#:T5,153. 3 tE# Sf-g^ —A V-8T2.129 (2000¥2fl * 0 86) ©E^JSrESLZc, L #. DDBJ12 ft L T. B5^JS$gL-AS#T2,212. 5%^rS9i/t:9-340©BEW-)iS£iE2?n;t (2000¥ 3fl23aatt). a±c±o. »w»*{bc«'t‘6ctfc+»a«a^;^nfct#ji.6n5.

[*%** - mm. £»;. #*^©#R] #12% L

-300 is. 4

= #««M96#jt61±

*E%E%tC*3VTt$, # 17-X## 6 Wo/:%*©#& 15 AHA, SOSui %^©#L©#@S^A. 5ci:ftcDNAlcSU^SI!)«tlr->XxAm#SSrffo7h„ *-> XxAtttt5OOO*©cDNAE5lJS-jgUTl~2 0?SaTAaC)BDT-r5 X ttfnUfeT~$> 0 , CftoTtAS.

(i) «*©#@ic• saHromeg • #m?#i cDNA6E7JS BLAST, Repeat^ (iifc if. © IS BE BE ?'J )S #T7 n X y A T fftfr L ORF© tttti. testcodelC J:5 0RFW«S«TT 5 / BiE?lllC#EU/t„ 7 5 /$ BE ?'J ©»#"?« Motifs, ProfileScan, HTHScan, SPScan, CoilScanSUK Lfc, 6 7‘D 7 7 A ©Sfi1#

/I 7 X — 7 • StfT$SXjtS©'l'>r5'7i — XS#b£L, XSSALT ##Sff o A „

(2) »rL,©¥ffiS®0AtlA->XxA8?« Jg[*a®«^ill7Dy5AT$5SOSuiS5lSV, -&©«¥#?#* A ±?2fiE*i£©*Wt5 J; -5 IS * t #f IB--f 5 5 A15 A o T * #f £* S »*' 0 ■*> t < * * T # A„

(3) 4000K±©7cS:fl:cDNASig@$T®#r *->X7ATtt4CPU©7 —XX7—> (Sun Enterprise) SffiUT, 515lft®cDNA SS#rT5>®IC40B#E28i)-^^ofc,

[»%%s • sm. 3twt. #m=#©«K] #I5&L

-301- 15. 5 -K-Xtfrj*

i —— T "J C7

#%(D B# I : ATAC-PCR(D/N>r y h'>^T^(D«S^BSTfcdo ^ikt> yy 6. m^$k (Dm#, Pharmacogenomics^ cosmeticgenomics^i? (Cl jo V' T CL (D#^Hj:#j#'t~ 6 Z ^ ?"#J "C # 6 o

■7^^dT 1%/^DNAf- 5/ y#(D##(D#%m&mTt) :w:i 6. m#^-C^yeast^ yX^X'iT^-0'5i\ y yf v yy-tf-ya

Tag-man&te^Scitfc^fiWB iftt- (til£ &l' = r B#-C^6^ATAC-PCR&(D##(j:W:##*-e$)6(D-e4"#]E^#^#M$Am(DW:#, ^

ESDI : ATAC-PCR

^fifp]± ltv^5 ir WBxz 6o yy;y(D#j^^#(±[o]± L^o ^-c^6. >r y-<-y a xt^fi

• mm, xm, ftfFm

(2) #rFttiE^£ - RNA#(Dit##m^&, m*ge ■. ¥^12^4^260, mm#^-: ##2000-125309

• rmtextmmi

-302- 15. 6

B#m2,5oo(D^#cDNAmm@a^m^^^Tu A(D##e^TL^o ^yoy^y hoiftoiaar% /N-r y bSByj##yy y cDNAcDB^i^SJ^tf 9 tz.ti)(D& Mm?mtit&ffi£mmLtz0 a^n%n^KBL/-;:0 6iL/Co a. ^^cDNAcDgd^J^:^ 96* y-f -f(D 6 Wift'&fflt, t°y V “DNA'>“-^yt- 0iA(:j: D .

^^7.;v--fy bM$t& yyy ltz0 yyy -r-ei+t- Oligo N SB ^J (D T i? y y zl/ (C (3: Sequencher^ \'tz.~f y X v’ — y d— ^r y y ^ i^SSB^J &/£yfi£

(D yc%#y — y ^—y ££ Ilf y> fin i: lt\ y — y Su^syy y a x y^#y — y ^—y # #yyyA, ##joZU(^m(Z)%^^^yyyAeM^L^o y-y#^myyyA^L xn, Kyy^* 6#@(Z)#^, my BBym^^^±^-i±6 ^

®jt / y — y a yyyyA ^ ttii, DDBj^cD^ScDNAy' — y

#y —L^:. ^ffyyy^i Lt, orft/ y-y a yyyy a, f-y^y y-y(c&^<##T/y-y3 yyyyA, ^^c'yyyyy-yH yyyy^^M# L, ^^6(Z)m^##^^yyyA^^#Lybo

(D t x:^>tkmt lt, ±ib^ ^^yyyyy-yg y yy y A(Z)i^^K, ##y-y/<-y (DfWe&a^^bita z ^ d-o-c, ^m#^#^fyyyA^M%L^. © 5,000y n — y(Z)^^cDNA^^fysipf^y^-y dr it, ±tB)Wffy'-y

3CRL

(1)

(2) ^(Z)#(Dy>^ (yyy##^) 0*#^#^2000* 2^ 190, iteyttW22ooi4 :yy>^ (/

-303- U) r- SScDNAOttSEMatSroBfSE X n V A A A IP P A K G p u u p 1 1 1 1 # It AJ AJ A) E p 4! P 0 IP X & tSE m IK £ E W 4 A AJ mi AJ iK A IP G A H i IK $ Y\ 'A V 'A # m # iife t G U P P Q z < o # IK -*K g # $ -H feC % <4 R 51 ■Hi BK A m AJ R P N 4 G A A A u Q Z < n o A # E AJ « ft U # £ % « IK UK -ftu A 4 P K» G G P G 4 n o IK m % # 3< $ X S fen # JS G • S: 4^ A) y 1 15. 8 £5cDNA(9ifiSiE9]fc$

&U^3Ul0,000 bp/dayT$»o^i/-7->X###^^35,000 bp/day£T'3l €±tf £ £ £T'£>o fto ^%^l^(:^^#cDNA5007 O —

##^evx &BM%35,000bp/day(:&T[%±2#a T, ^£1:0 A7°0 7x 7 ^7°Dyx^ h^^Ii)l/;gS[iRTOil:#^btl^.o £t\ ^^#cDNA0^#BB

#ggT^I#0AW#^f^oTmT$)O fco $ 6 T“0fgfMM&£*t3i:Lfc0T\ 7°7^ zttf-cgfzo y,7't-7-ffimtfMtfr$:m®z>mmr'$)'z>rc cDT\ -AT\ Sr/t tl 7-7 > th- £zgAl/c0T\ A - FBT0#£AtW±U &'>-7* L fc$F e&MCAT, Lrc.zttfBMmi%(DgmT$>z>t%7Lt>tiz>o

A7uvx7 h©hfftcj:db^e>$nZccdnae$en&, c0##^^mu^g

• m\LLfcMmmikfe

& \Z f£ L

305- 15. 9 ±ficDNA

V^lO^SNEDOSa^MlEi^STffbn^ r#0§%t: hcDNAy/A^m^ Wlffl

® A U y ^XEFL0iiCT^#$tl^:5,OOOfi(7)cDNA(D-5 *> > l,500f@(7)^S E?iJ£&£T50

(D 5,000m#>t^PCR&^l:#i|#L##T 6.

^94%(D#^#T$>o^. ®^ioo%pcR^s^0E*^eu ^(Dm^’Dtz.Z.t (±fo

ftlZfcL

—306 — 15. 10 DNAV^f » □ 7 H- 1C

aav7 h»xziyy-7 u

w^cTMitv-f jD7i/'f». Ki-^aKTisnsatts^vfctott. S3B atmoasg&tfKisiewmsiftciD. ae^sssw^ft^wattn

$sr t^sisns.

ms (%)

5,OOOX # V b/7 lz-f X 1,000% 100

fm* wma#7 k

*att 7 M1 rt©*S*ltK 100

[#%** • »#. Xtt. #*#©KR] 4# i:a u

-307- is. ii a*cDNA

McDNAtoSSE^JmStST'tt. t H>R(7)2000®©fiiflcDNA©SBASd?l|S9fl $T ICSSf^gglC^L. 97? C© SS?iJx-^ia^:©®fffi«jS^ D->OMffl?;6(D, Wffl&x-? tat 0 l20H©SrSi8 U-'s&MRT-gtz* $7c, 120a©#%##)7 0->©A#E9il©;*^Tia. 3 7? $

TC5c7Sti-5B8K*tL. -> 3 -y h # >i£ £7S V> «*37? 5f7T5S 7 T * tz, r ©E^'Jt1 -^ian>y->7A/iiT, *Tm*e7©##e##f©7hA©#m;a##t:7a&ame?

ft$cDNA©^'.y y?t©ac46©trag*a«T(a. #*$#©#&©# 8 IC#L. ##a#60-80%©$ —-5 C

7c. *fc, 0ffl±©s*->i7a-;i/ftma«©i*it©B8t»L. ^9 ccd*2(7i?? d M » £ ffl © fc S * ft ffi H w- m £ 9S sS s -ta 5 C

*•«», xto. ###©*«] #K% L

—308 — 15. 12 &*cDNA

afs-stamaatt

1. bioscouT&mwT, ot, yy aiwe , flTt§ ck 3 fcfc'ofz o bioSCOUT(D#*##T^MfBT#% —±y — yv A BE #1 It $8117) bioSCOUTO ^X^T X £ ff 7 g>

E^##f-^^-X1:*LTBLAST##$:eVx V X##T 7°##&&#mT;t#)(Z)Periy y U X h £©'> !9¥Ecoy n-XC#V^T, In Silico^feffc V "j ¥ > £f IZ £ 0 tfe&fcA > F&m&ttfc-r&ct&tB&fco 2. ^1, ¥/&DNA^W>7°Jl'tf&g^pJ y* — & Hn >*fe> h##e>flTW55Wc#20±h>7;H:OV)T4%#&ff(Ay y ADNA# — S(Dth >"/;!/(C#V^TSNPs^tff 5:^156 L/to a±omo,

[wm£% • m, xe,

-309- i ]

S c DNAy/

¥/*12¥3J!

*r#«Stoh hcDNAy / AjSffl&iffM^jSStcTE^jgtSLfc? □ ->112212(0 (?R% 1 1 ^%$##E) "Cfc 6. f ## (MO) <7>B 1 »lcEEV/c„ MOIcEELfc -aew i ic/pf. DDBJKSE Vfcy n —ycotf Sl±, DDBJ, GenBank, EMBL^toBISto^y- 5'^- StLTV'do DDBJSftB#l;ii#7"D y h"e)iWcy a — totoicSrfcic accession fetLT$SI5tv5„ ##: (MO) B 2 Hicgstt U M0lcEiBVfc--6liSr«2 lC77t-j-„DDBJ*»6.fl.fcii5y:Mffl#roMS:S# j: LTM 3frb «5 r-js^-rs.

## (MO) KEiiLfcT-* b *

l* hcDNAy / SSlcrtiM L/ttfiSEW-y : 2212EM (<2>ME?IJ 2212.doc) 2012^-^

2IE (i)i#^_^D — >£#)£*.xls) 95^ —

B Zfc

«*J1 hcDNAy/SSKTiKM L/ciSSE^T-? H2 ^ y a — y^#fC* M3 <&ME?iJ»M «4 &MEM(oM iftif-A M 5 £ME#IJm --!/ .y

-310- m i w-z >AK000001 gtgacttcagtttttcgtctttcaacttcagttgctcatctgcagtcaccagctgggatt tgaacccagccaatctgactcctgagctcctctttcctcccacttagtcttttcccctga gtccaacagcaccttagagcatgtctaatgcacgtgcttactgccttcatctcattgcag cctgtccctgccagctgtgtcaccttcatcctgctgccagcccctggtgactctcccgtt acagatcagagaccagactccaggggtggagagtaggggttctggaccagccgctctctg ctcctagagctcttgtctacacaactgcctggtgctcaggggatgacgaggcgcctgcct ctctcagtaggttgtgtgtcctggcagcaggggtgggacctgtcctctttcatagctcca gcacatctgtgctggatgtcacctggctccatccagtgacggattcatattctctgtttc gacatcttcagttgctggacaactaaatgccatcttcaatacctttaatgccatcttttt ttttcttcttttgcagcaatggatggtgtgccttttatgatttcagagaagttttcttgt gttccagaaagtatgcagccctttgatctcctgggaatcaccatcaaatctctagcagaa atcgaaaaagagtacgagtacagcttccgcacagagcagagcgcagccgccaggctcccg cccagccccaccaggtgtcagcagatcccgcagtcctgaggagccagcggccacctgcgg ggagaccaccgccgcccagactactgacggcaggggctgctgcccccgcctcctcctgcc gcctccgccagccctccctcccacactgccccagcagggctggcccggagactgggcagc taagtgggcgcatcctgtcttcagctggcctgactgacaccccggcctccctggggacat tgttcataaccatgactaatctgtgtgtgctgtagtgaccagcggctcctaacgtgtctg tgtgatgaccagttgtcctccaaaaacctcactcaccacggagtcaccctgagggccccg ggcagttgccctggccgcccagtgacgcccaccggctccctccgctccctcctgtcacct accagggcagaccccactaagccgttgagtctcttcctggaagaccctgagggccggcag ctttgcctaggactctccagggtcgctgatcctccccggcccagggctgccaggaacaag ttcctctctgcaggatctcacagcagccctgagaagacatgggaggaagaagccagtggc cctggggtggaccctgccctttggacatcccatggacttgtcacccagtcgcctggaagt cggggagcagcacccctgcccccggtggaggggttgttgctttctggggaggaggacggt tgacggccctgcttggatccaccaccaggacgctgtgctcggccatggcgcgcctccccc aagggctcatctgtgcaacaaccacatgccaacaatagctccaggtgcctcagccccgtg ccctgacccggggaccacagggacaccctctgagacgtgctggccccaggaggatacagg cacacgggcacccagcccctctcagagccaggcgtgtcttgacggtatctgctgacggga acaagtcattctgaaaggtccctctgacccaggtcaccaaatttgggccagaactggcac ttccgagtgcccactggtgctggcgtcctcaaagtgccgggcgccaagttcagccccagg tgcgtttgcctaggcagtggtggctgccatccctcggggcccaggccctcacccgaaggg ctgcctcactctgtggtgtcaggcactgggctgtgtcctgattcctcagagtatggggac ctgacatctttgggtccccaagagttggctcttggatgtggaggcttgatgctgcggttg ggggctgcactacctgcgtccgtggaagcctctgcctcagcggggacggcctcctgctgc aggtcctgggcagcccacagtagtgctagctgaggcccagaggaacgggaggaaagggag gcagcgccggacggcgcagctggccaagaggcgggcggggccagaaatacctcatgcccc tgcaccagcccctccgtctccaggccccaggcctctttgtggacccatcttcctcctctg ccaccccgatcttcactcggatacctcttgatttctgtggataagggaagcggccacttc

-311- tccctccctgagacccgaggatgcccctggtgggggtggccttctcaggactcctgggtg gcacagcagcaggaccttgtcatgggactgatgctgagcagggagggggctgccaggctg gctgccgatgctccgttcacataagccagtgtggttctggggacctgaggagccctgtgg cacccacagggggcacctatgtctgccgtggctcctcgggtggtgccctgtgtgacaaag cccaacagcaagggtgactcctggcaggtgggggcagcaggagggcagagggcagagctc tggccacttctgcccacttcattagggtttgtgaactttgtccttcacctcttttcgtgc cctggttgtgagattgcctctaacaggtaatgccaggggcccttcactccgcccccatga ctgggaagaggcctgtggcagcgccgctggaccctaggaggctcagaggcagtggtgtgg gagccctgtctgcaaggacgcagaataagcagtgagggcggctgcaggagaggaaggggc tcccacagcccccactgatgccgctgcaggcccctgtcgagctggggtcccagccaggtg cccccgcatgccctcctgcagttgctggatggatagggacaccaggaagaggacaaactg catggactcaagcgagctggagccatcttctccatagcattacggacttagcataagagt aaatgactgtgaacgttgtagtaaacggcagcttaagataagtaagcagagacagtgtta aggacgagttggtgtctgtggtagcttttaggctgctcctaaccccccatttattgcctt ctgagaggtgggtgaggacaagcatgtgcctgtgtgtgtgtgtacgcacatgcgtgtgta taagcccacctgagtggggctcgtgcaggagaactgaggcatgaaactctggctcaaacc taggaattgagagcgtttctgtcttttgggagagtacttttctccacgagccctctggcc actttgggagggaaggacaagggttcccttggaaatgtgaagggtcttggcctcatccct caggtccccccacagcacttcccactactgcttctgtccctgctggcagcctctgtccct ccagaacggctaaccagagcacactgtccccaccgcctcccctttctctctgaaagttga agtatctccaaaggccttggaaatggcacaaaggtgataaggagcaggtgctttgctgca gtctcccttgcaaatgtataattaaggcctttcttcccaccccaagtccaagaacaaatg ccagccacgtcctccgccacttggagagatgagaacccagtggggtcacgtaaaggaatt gcaggtcggtgagaggacaagagggactcccatgttctaagcacctgttcctggccaggc tctaggccaggctctctaagcacatttctcctttcattccccctaaaaacagagtgacct ggaagtagatgttctttgctccttgtcagagttgaagaggctgacttggcccactgctaa gcggcagaggcagggccagccatcctgtcgcaagcccgtgctggggctgccctttctgtt tccagtccagttacggacttcccggccgccactgggccctgccggtcaccaggccactgt gcagtgggcgcagagcatggtcaggagtggcctgcccgtactcctccacccagatgaggg ccctccagagcctgcaggcatctgtggggaatcccagcctgcaggttcttggagaagcag gtgaacctaaggatgaaagcaaaggagggccttgaggaagcagcccccaggcctggcagc cacgcagcggctgagctcatgaacttggttcgcagcctgccttgcccctggaggccacgc caggcgctcacccctgagcccacagcccctgcttgggctgcctggcaccctcagggtggc ccggcctcctcctgccactctgagcacatgtccgggggttgccaccagagacggctttgt tctcccagctaaggccgtggagctgctgtgtgactgtgtcaggcctggacaaggaagacc cttagggatgacgtccccgctgcatatttattcaaggtgactcttgtacttggcaaggga agtccactgtgtgattgtctgtattcttaatataatttgttaaataaacgtttgttttaa cctc

-312- m2 □->£©**£$

FLJ##, 9 □ — >£> T/ 7—’>3> E^JS AK000001 Homo sapiens mRNA for FLJOOOOl protein, clone asOOOOl, 4504

AK000002 Homo sapiens mRNA for FLJ00002 protein, clone as00002, 5023

AK000003 Homo sapiens mRNA for FLJ00003 protein, clone as00003, 4701

AK000004 Homo sapiens mRNA for FLJ00004 protein, clone as00004, 4670

AK000005 Homo sapiens mRNA for FLJ00005 protein, clone asOOOOS, 4706

AK000006 Homo sapiens mRNA for FLJ00006 protein, clone asOOOOB, 4219

AK000007 Homo sapiens mRNA for FLJ00007 protein, clone as00007, 4303

AK000008 Homo sapiens cDNA FLJ20001 fis, clone ADKAO1273. 1962

AK000009 Homo sapiens cDNA FLJ20002 fis, clone ADKA01577. 462

AKOOOOIO Homo sapiens cDNA FLJ20003 fis, clone ADKA01794. 1387

AKOOOOl1 Homo sapiens cDNA FLJ20004 fis, clone ADKA02391. 1826

AK000012 Homo sapiens cDNA FLJ20005 fis, clone ADKAO2526. 1487

AKOOOOl3 Homo sapiens cDNA FLJ20006 fis, clone ADKA02694. 1940

AKOOOOl4 Homo sapiens cDNA FLJ20007 fis, clone ADKA02871, highly 1547 similar to AF049611 Homo sapiens huntingtin interacting protein HYPE mRNA.

AKOOOOl5 Homo sapiens cDNA FLJ20008 fis, clone ADKA03052. 478

AKOOOOl6 Homo sapiens cDNA FLJ20009 fis, clone ADKA03183. 850

AKOOOOl7 Homo sapiens cDNA FLJ20010 fis, clone ADKA03268. 1466

AKOOOOl8 Homo sapiens cDNA FLJ20011 fis, clone ADKA03432. 1759

AKOOOOl9 Homo sapiens cDNA FLJ20012 fis, clone ADKA03438. 1344

AK000020 Homo sapiens cDNA FLJ20013 fis, clone ADKA03455. 1523

AK000021 Homo sapiens cDNA FLJ20014 fis, clone ADSE00113. 1758

AK000022 Homo sapiens cDNA FLJ20015 fis, clone ADSE00237. 1457

AK000023 Homo sapiens cDNA FLJ20016 fis, clone ADSE00529. 1294

AK000024 Homo sapiens cDNA FLJ20017 fis, clone ADSE00552. 547

AK000025 Homo sapiens cDNA FLJ20018 fis, clone ADSE00909. 2011

AK000026 Homo sapiens cDNA FLJ20019 fis, clone ADSE00947. 1295

-313- A3 'jkmmm

LOCUS AK000001 4504 bp mRNA HUM 29-FEB-2000

DEFINITION Homo sapiens mRNA for FLJ00001 protein, partial cds.

ACCESSION AK000001

VERSION AK000001.1

KEYWORDS FLJ00001 protein; oligo capping; fis (full insert sequence).

SOURCE Homo sapiens adult spleen cDNA to mRNA, clone:asOOOOl.

ORGANISM Homo sapiens

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia;

Eutheria; ; Catarrhini; Hominidae; Homo.

REFERENCE 1 (bases 1 to 4504)

AUTHORS Ohara,0., Kikuno,R., Nagase,T. and Okumura,K.

TITLE Direct Submission

JOURNAL Submitted (14-FEB-2000) to the DDBJ/EMBL/GenBank databases. Osamu

Ohara, Kazusa DNA Research Institute, Department of Human Gene

Research; 1532-3 Yana, Kisarazu, Chiba 292-0812, Japan

(E-mail:[email protected], URL:http://www.kazusa.or.jp,

Tel: 81-438-52-3913, Fax:81-438-52-3914)

REFERENCE 2 (bases 1 to 4504)

AUTHORS Ohara,0., Kikuno,R., Nagase,T. and Okumura,K.

TITLE The nucleotide sequence of a long cDNA clone isolated from human

spleen

JOURNAL Published Only in DataBase(2000)

STANDARD full staff_review

COMMENT NEDO human cDNA sequencing project supported by Ministry of

International Trade and Industry of Japan; cDNA full insert

sequencing: Research Association for Biotechnology; cDNA library

construction, 51 - & 31-end one pass sequencing and clone

selection: Kazusa DNA Research Institute.

FEATURES Location/Qualifiers

source 1..4504

/clone="asOOOOl"

/dev_s tage="adult"

/note="vector: pBluescriptll SK plus"

/organism="Homo sapiens"

/sequenced_mol="cDNA to mRNA"

/tissue_type="spleen"

CDS <976..1443

/codon_start=l

/gene="FLJ00001" -314- /note="CDS was assigned by GeneMark analysis."

/note="Start codon is not identified."

/product="FLJ00001 protein"

/protein_id="BAA92226.1"

/transl_table=l

/translation="LICVCCSDQRLLTCLCDDQLSSKNLTHHGVTLRAPGSCPGRPVT

PTGSLRSLLSPTRADPTKPLSLFLEDPEGRQLCLGLSRVADPPRPRAARNKFLSAGSH

SSPEKTWEEEASGPGVDPALWTSHGLVTQSPGSRGAAPLPPVEGLLLSGEEDG"

polyA_signal 4483..4488

BASE COUNT 865 < a 1379 c 1277 g 983 t 0 others

ORIGIN

1 gtgacttcag tttttcgtct ttcaacttca gttgctcatc tgcagtcacc agctgggatt

61 tgaacccagc caatctgact cctgagctcc tctttcctcc cacttagtct tttcccctga

121 gtccaacagc accttagagc atgtctaatg cacgtgctta ctgccttcat ctcattgcag

181 cctgtccctg ccagctgtgt caccttcatc ctgctgccag cccctggtga ctctcccgtt

241 acagatcaga gaccagactc caggggtgga gagtaggggt tctggaccag ccgctctctg

301 ctcctagagc tcttgtctac acaactgcct ggtgctcagg ggatgacgag gcgcctgcct

361 ctctcagtag gttgtgtgtc ctggcagcag gggtgggacc tgtcctcttt catagctcca

421 gcacatctgt gctggatgtc acctggctcc atccagtgac ggattcatat tctctgtttc

481 gacatcttca gttgctggac aactaaatgc catcttcaat acctttaatg ccatcttttt

541 ttttcttctt ttgcagcaat ggatggtgtg ccttttatga tttcagagaa gttttcttgt

601 gttccagaaa gtatgcagcc ctttgatctc ctgggaatca ccatcaaatc tctagcagaa

661 atcgaaaaag agtacgagta cagcttccgc acagagcaga gcgcagccgc caggctcccg

721 cccagcccca ccaggtgtca gcagatcccg cagtcctgag gagccagcgg ccacctgcgg

781 ggagaccacc gccgcccaga ctactgacgg caggggctgc tgcccccgcc tcctcctgcc

841 gcctccgcca gccctccctc ccacactgcc ccagcagggc tggcccggag actgggcagc

901 taagtgggcg catcctgtct tcagctggcc tgactgacac cccggcctcc ctggggacat

961 tgttcataac catgactaat ctgtgtgtgc tgtagtgacc agcggctcct aacgtgtctg

1021 tgtgatgacc agttgtcctc caaaaacctc actcaccacg gagtcaccct gagggccccg

1081 ggcagttgcc ctggccgccc agtgacgccc accggctccc tccgctccct cctgtcacct

1141 accagggcag accccactaa gccgttgagt ctcttcctgg aagaccctga gggccggcag

1201 ctttgcctag gactctccag ggtcgctgat cctccccggc ccagggctgc caggaacaag

1261 ttcctctctg caggatctca cagcagccct gagaagacat gggaggaaga agccagtggc

1321 cctggggtgg accctgccct ttggacatcc catggacttg tcacccagtc gcctggaagt

1381 cggggagcag cacccctgcc cccggtggag gggttgttgc tttctgggga ggaggacggt

1441 tgacggccct gcttggatcc accaccagga cgctgtgctc ggccatggcg cgcctccccc

1501 aagggctcat ctgtgcaaca accacatgcc aacaatagct ccaggtgcct cagccccgtg

1561 ccctgacccg gggaccacag ggacaccctc tgagacgtgc tggccccagg aggatacagg

1621 cacacgggca cccagcccct ctcagagcca ggcgtgtctt gacggtatct gctgacggga

1681 acaagtcatt ctgaaaggtc cctctgaccc aggtcaccaa atttgggcca gaactggcac -315- 1741 ttccgagtgc ccactggtgc tggcgtcctc aaagtgccgg gcgccaagtt cagccccagg

1801 tgcgtttgcc taggcagtgg tggctgccat ccctcggggc ccaggccctc acccgaaggg

1861 ctgcctcact ctgtggtgtc aggcactggg ctgtgtcctg attcctcaga gtatggggac

1921 ctgacatctt tgggtcccca agagttggct cttggatgtg gaggcttgat gctgcggttg

1981 ggggctgcac tacctgcgtc cgtggaagcc tctgcctcag cggggacggc ctcctgctgc

2041 aggtcctggg cagcccacag tagtgctagc tgaggcccag aggaacggga ggaaagggag

2101 gcagcgccgg acggcgcagc tggccaagag gcgggcgggg ccagaaatac ctcatgcccc

2161 tgcaccagcc cctccgtctc caggccccag gcctctttgt ggacccatct tcctcctctg

2221 ccaccccgat cttcactcgg atacctcttg atttctgtgg ataagggaag cggccacttc

2281 tccctccctg agacccgagg atgcccctgg tgggggtggc cttctcagga ctcctgggtg

2341 gcacagcagc aggaccttgt catgggactg atgctgagca gggagggggc tgccaggctg

2401 gctgccgatg ctccgttcac ataagccagt gtggttctgg ggacctgagg agccctgtgg

2461 cacccacagg gggcacctat gtctgccgtg gctcctcggg tggtgccctg tgtgacaaag

2521 cccaacagca agggtgactc ctggcaggtg ggggcagcag gagggcagag ggcagagctc

2581 tggccacttc tgcccacttc attagggttt gtgaactttg tccttcacct cttttcgtgc

2641 cctggttgtg agattgcctc taacaggtaa tgccaggggc ccttcactcc gcccccatga

2701 ctgggaagag gcctgtggca gcgccgctgg accctaggag gctcagaggc agtggtgtgg

2761 gagccctgtc tgcaaggacg cagaataagc agtgagggcg gctgcaggag aggaaggggc

2821 tcccacagcc cccactgatg ccgctgcagg cccctgtcga gctggggtcc cagccaggtg

2881 cccccgcatg ccctcctgca gttgctggat ggatagggac accaggaaga ggacaaactg

2941 catggactca agcgagctgg agccatcttc tccatagcat tacggactta gcataagagt

3001 aaatgactgt gaacgttgta gtaaacggca gcttaagata agtaagcaga gacagtgtta

3061 aggacgagtt ggtgtctgtg gtagctttta ggctgctcct aaccccccat ttattgcctt

3121 ctgagaggtg ggtgaggaca agcatgtgcc tgtgtgtgtg tgtacgcaca tgcgtgtgta

3181 taagcccacc tgagtggggc tcgtgcagga gaactgaggc atgaaactct ggctcaaacc

3241 taggaattga gagcgtttct gtcttttggg agagtacttt tctccacgag ccctctggcc

3301 actttgggag ggaaggacaa gggttccctt ggaaatgtga agggtcttgg cctcatccct

3361 caggtccccc cacagcactt cccactactg cttctgtccc tgctggcagc ctctgtccct

3421 ccagaacggc taaccagagc acactgtccc caccgcctcc cctttctctc tgaaagttga

3481 agtatctcca aaggccttgg aaatggcaca aaggtgataa ggagcaggtg ctttgctgca

3541 gtctcccttg caaatgtata attaaggcct ttcttcccac cccaagtcca agaacaaatg

3601 ccagccacgt cctccgccac ttggagagat gagaacccag tggggtcacg taaaggaatt

3661 gcaggtcggt gagaggacaa gagggactcc catgttctaa gcacctgttc ctggccaggc

3721 tctaggccag gctctctaag cacatttctc ctttcattcc ccctaaaaac agagtgacct

3781 ggaagtagat gttctttgct ccttgtcaga gttgaagagg ctgacttggc ccactgctaa

3841 gcggcagagg cagggccagc catcctgtcg caagcccgtg ctggggctgc cctttctgtt

3901 tccagtccag ttacggactt cccggccgcc actgggccct gccggtcacc aggccactgt

3961 gcagtgggcg cagagcatgg tcaggagtgg cctgcccgta ctcctccacc cagatgaggg

4021 ccctccagag cctgcaggca tctgtgggga atcccagcct gcaggttctt ggagaagcag

4081 gtgaacctaa ggatgaaagc aaaggagggc cttgaggaag cagcccccag gcctggcagc -316- 4141 cacgcagcgg ctgagctcat gaacttggtt cgcagcctgc cttgcccctg gaggccacgc

4201 caggcgctca cccctgagcc cacagcccct gcttgggctg cctggcaccc tcagggtggc

4261 ccggcctcct cctgccactc tgagcacatg tccgggggtt gccaccagag acggctttgt

4321 tctcccagct aaggccgtgg agctgctgtg tgactgtgtc aggcctggac aaggaagacc

4381 cttagggatg acgtccccgc tgcatattta ttcaaggtga ctcttgtact tggcaaggga

4441 agtccactgt gtgattgtct gtattcttaa tataatttgt taaataaacg tttgttttaa

4501 cctc

//

-317- ffl4 &

LOCUS AK000008 1962 bp mRNA HUM 22-FEB-2000 DEFINITION Homo sapiens cDNA FLJ20001 fis, clone ADKA01273.

ACCESSION AK000008 VERSION AK000008.1 KEYWORDS oligo capping; fis (full insert sequence). SOURCE Homo sapiens adipose tissue cDNA to mRNA, clone_lib:ad clone:ADKAO1273. ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1962) AUTHORS Sugano,S., Suzuki,Y., Ota,T., Obayashi,M., Nishi,T., Isogai,T., Shibahara,T., Tanaka,T. and Nakamura,Y. TITLE Direct Submission JOURNAL Submitted (15-FEB-2000) to the DDBJ/EMBL/GenBank databases. Sumio Sugano, Institute of Medical Science, University of Tokyo, Deptment of Virology; Shirokane-dai, 4-6-1, Minato-ku, Tokyo 108-8639, Japan (E-mail:[email protected], Tel:81-3-5449-5286, Fax:81-3-5449-5416) REFERENCE 2 (sites) AUTHORS Tanigami,A. , Fujiwara,T., Ono,T., Yamada,K., Fujii,Y., Ozaki,K., Hirao,M., Ohmori,Y., Ota,T., Suzuki,Y., Obayashi,M., Nishi,T., Shibahara,T., Tanaka,T., Nakamura,Y., Isogai,T. and Sugano,S. TITLE NEDO human cDNA sequencing project JOURNAL Unpublished (2000) STANDARD full staff_review

COMMENT NEDO human cDNA sequencing project supported by Ministry of International Trade and Industry of Japan; cDNA full insert sequencing: Research Association for Biotechnology; cDNA library construction, 5'- & 3'-end one pass sequencing: Departent of Virology and Human Genome Center, Institute of Medical Science, University of Tokyo (partly supported by Science and Technology Agency).

FEATURES Location/Qualifiers source 1..1962

/clone="ADKA01273" /clone_lib="ad"

/note="cloning vector pME18SFL3" /organism="Homo sapiens" -318- /sequenced_mol="cDNA to mRNA"

/tissue_type="adipose tissue"

CDS 152..1063 /codon_start=l

/protein_id="BAA90880.1"

/translation="ME FLRAGSNVMQTFT FSAS EDNMES KWEDVNAAACDLAREVAGK GDALVAGGICQTSIYKYQKDEARIKKLFRQQLEVFAWKNVDFLIAEYFEHVEEAVWAV

EVLKESDRPVAVTMCIGPEGDMHDITPGECAVRLVKAGASIVGVNCRFGPDTSLKTME LMKEGLEWAGLKAHLMVQPLGFHAPDCGKEGFVDLPEYPFGLESRVATRWDIQKYARE AYNLGVRYIGGCCGFEPYHIRAIAEELAPERGFLPPASEKHGSWGSGLDMHTKPWIRA RARREYWENLLPASGRPFCPSLSKPDF"

BASE COUNT 562 a 421 c 533 g 446 t 0 others ORIGIN 1 gaaggggatt ttggagcgcc tggagagtgg ggaggttgtg attggagatg gcagctttct 61 cattactctg gagaagagag gctatgtgaa ggctgggctc tggactccag aggcagtgat 121 agaacaccca gatgcagttc gtcaacttca catggaattc ttgagagcag gatcaaatgt 181 catgcagact tttacctttt ctgccagtga ggacaatatg gaaagcaagt gggaagatgt 241 aaatgctgct gcctgtgacc tcgccaggga agtggctggc aaaggtgatg ctttggtagc 301 aggggggatc tgccagacat caatatacaa ataccagaag gatgaagcta gaattaaaaa 361 actttttcga caacagctag aagtttttgc ctggaaaaat gtggacttct tgattgcaga 421 gtattttgag cacgttgaag aagctgtgtg ggctgtggaa gtcttaaaag aatcagatag 481 acccgtggca gttaccatgt gcataggccc agagggagac atgcatgata taacccccgg 541 agaatgtgct gtgaggctgg tgaaggcagg ggcttccatc gttggcgtga actgccgctt 601 tgggcccgac accagcttga agacgatgga gctcatgaag gagggtcttg agtgggcagg 661 gctgaaagcg cacctcatgg tgcagcctct ggggttccac gcgcctgact gtggcaaaga 721 ggggtttgtg gatctcccag aatatccctt tggactggag tccagagttg ccaccagatg 781 ggatattcaa aaatacgcca gagaggccta caacctgggg gtcaggtaca ttggcgggtg 841 ctgtggattt gagccctacc acatcagggc aattgcagag gagctggccc cagaaagggg 901 ctttttgcca ccagcttcag aaaaacacgg cagctgggga agtggtttgg acatgcacac 961 caaaccctgg attagagcaa gggctcgaag ggagtattgg gagaatctgc tgccagcttc 1021 aggcagacct ttctgtcctt cgctgtcaaa gccagacttc taaggagtag tgaaagaaaa 1081 ccctgaaata atcgaacagg aaaaagttgc cctcaagcct gacctggaac cgttcctcac 1141 cttcatcctc accatgccct gctatctcca gctgctgagc agctgaggtg ctgcagcccc 1201 ttcccttcca gcccacaagt gtgtgcatat tgagctcctg ctgtggttaa gcactgcaac 1261 agactctacc agagatgcaa agagaagcga gagaggcacc ttgttctcca agaacttact 1321 gtccaatcat gctgtggtgc attcctttga agatcatgaa gaatagccaa acttgtcttt

1381 gagggtgaat ttgacacttt aaaataatca gaagtcactg agacccaaag ctagtgaata 1441 attcaagtga ctaagctaga tacttgagga tgatagagca acactaaact aaagcaatga

1501 gtctgctttt tttaatgatt tatccaatga ctccaaaggc aatcccaaag aaaagatgta

1561 aagaatgttt tgagcaatgg tagcattatt gaaataaatg tttaactatc caaagtgatt 1621 actttcaaag atgattttta cttaaagata ttaggtctgg ccaggcgaag tgactcacac

1681 ctgcaatccc agcactttgg gaggccgagg cgggcggatc acctgaggtc agcagttcga 1741 gaccagcctg accaacatgg tgaaaccctg tctctaccaa aatacaaaaa ttagccgggc 1801 atgatggcgg gtgcctgtaa tcccagctac tcgggaggct gaggcaggag aatcgcttga 1861 acccaggagg cggagcttgc agtgagccga gatcaggcca ttgcacttca gcctgggtga 1921 cagagtgaga ctccatctca aaaaaaaaaa aaaaaaaaaa aa

-320- ms '&mmm au ^x ^-a

LOCUS AK000863 1753 bp mRNA HUM 22-FEB-2000

DEFINITION Homo sapiens cDNA FLJ10001 fis, clone HEMBA1000030.

ACCESSION AK000863

VERSION AK000863.1

KEYWORDS oligo capping; fis (full insert sequence).

SOURCE Homo sapiens embryo, 10 weeks whole embryo, mainly head cDNA to

mRNA, clone_lib:HEMBA1 clone:HEMBA1000030.

ORGANISM Homo sapiens

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia;

Eutheria; Primates; Catarrhini; Hominidae; Homo.

REFERENCE 1 (bases 1 to 1753)

AUTHORS Isogai.T. and Otsuki.T.

TITLE Direct Submission

JOURNAL Submitted (16-FEB-2000) to the DDBJ/EMBL/GenBank databases. Takao

Isogai, Helix Research Institute, Genomics Laboratory; 1532-3

Yana, Kisarazu, Chiba 292-0812, Japan (E-mail:[email protected],

Tel:81-438-52-3951, Fax:81-438-52-3952)

REFERENCE 2 (sites)

AUTHORS Isogai.T., Ota,T., Hayashi,K., Sugiyama,T., Otsuki.T., Suzuki,Y.,

Nlshikawa.T., Nagai.K., Sugano,S., Takahashi-Fujll.A., Hara.H.,

Tanase,T., Nomura,Y., Togiya,S., Komai,F., Kara,R., Takeuchi,K.,

Arita.M., Nabekura,T., Ishii.S., Kawal.Y., Salto,K., Yamamoto,J.,

Wakamatsu.A., Nakamura,Y., Nagahari,K., Masuho,Y. and Oshima.A.

TITLE NEDO human cDNA sequencing project

JOURNAL Unpublished (2000)

STANDARD full staff_review

COMMENT NEDO human cDNA sequencing project supported by Ministry of

International Trade and Industry of Japan; cDNA full insert

sequencing: Research Association for Biotechnology; cDNA library

construction, 51 - & 31-end one pass sequencing and clone

selection: Helix Research Institute (supported by Japan Key

Technology Center etc.) and Department of Virology, Institute of

Medical Science, University of Tokyo.

FEATURES Location/Qualifiers

source 1..1753

/clone="HEMBA1000030"

/clone_lib="HEMBA1"

/dev_stage="embryo, 10 weeks"

/note="cloning vector: pME18SFL3" -321- /organ!sm="Homo sapiens"

/sequenced_mol="cDNA to mRNA"

/tissue_type="whole embryo, mainly head"

BASE COUNT 337 a 575 c 490 g 351 t 0 others

ORIGIN

1 tttcccgcca cggcagtgga gccgacagtg actatgagaa cacgcaaagt ggggacccac

61 tgctggggct ggaagggaag aggtttctag agctgggcaa agaggaagac ttccacccag

121 agctggaaag cctggatgga gacctagatc ctgggcttcc cagcacagag gatgtcatct

181 tgaagacaga gcaggtcacc aagaacattc aggaactgtt gcgggcagcc caggagttca

241 agcatgacag cttcgtgccc tgctcagaga agatccattt ggctgtgacc gagatggcct

301 ccctcttccc aaagaggcca gccctggagc cagtgcggag ctcactgcgg ctgctcaacg

361 ccagcgccta ccggctgcag agtgagtgcc ggaagacagt gcccccagag cccggcgccc

421 cagtggactt ccagctgctg actcagcagg tgatccagtg cgcctatgac atcgccaagg

481 ctgccaagca gctggtcacc atcaccaccc gagagaagaa gcagtgacct ctctccccac

541 accctcacct gcaccctagg acctcactgg ccataggagc tgggccactc cagacattaa

601 tccccacccc aacagagcca ctggcacaag tgcccttagt gctgccacac tccctggcag

661 ccaggtgccc tggtgcccac ccctgtcgag cccctaagga tggggaggtg ggggggcagg

721 agcttctgtc ccccacattc catgcacctc ccctctgtat atagcatctc ccccctccta

781 gtgagcaggg gcctgcaagg catcactccc agcccctcgc cttctagggc accctcagca

841 aaggggcagg tggggacact ccaagtgggg cagctctccg tacatgcgcc ccacccccat

901 gagccagttc agccctactg ggggctgagc gggggcatcc cctcctttgt acatagtctc

961 catggatgtc cctgccctgt agccaccagc cccttgctgc tctcccttta atgccatatg

1021 gcccctgcct agggcacagg ccccaacctg tgtgctgggg tccccagcag caaacactgg

1081 aaagtctgtt tttttttttt ctttcttctt ccccacccct taattttaac tttgtggtaa

1141 ctgagtgccc ccgcgtgcct gcgtgttgag tgtgtgggcg gcagtgccgt tccggaggcc

1201 tggtccatct ggagttttga ggggtgaggg gaccagagca gtgggaccag catggggatc

1261 agcttccctt ccccacctgg agccagggac tgtccgggta gccagttttg gtcctgccag

1321 ctgcctccct gatccctccc cactctcgcc ccttctctat gaacttaaat caaaaaccac

1381 ttccctccat ctcctcctgc tcctgcgtgg agggggaatg tgtgctggct agggtggagg

1441 actgagcacc tgagcctggg gctggctccc cggggtcccc gactcagctg gtggctgtgg

1501 agctgagtcc cctccccgta acctctgcaa ggccagcacc caccatcact acctgcacct

1561 gctgtggtcc caccctctgg aggcctggga acctggctgc agcctgggaa ggctggagag

1621 gcagacggtg ggacccacca gctctctccc catcccgctt cttccctggg ggccaggccc

1681 tacctgtgtg gtggtgggtg ggctgtcaag acgtgtcatg tacatttgta tcaaaaataa

1741 agaagtgacc atg

-322- [##] ¥ J&l 0^ a IB 3 ft Ji JE T »

em&Kjti h c DNAy/Aft)iia«M

¥fi8l2¥3^l

*r#|g%k hcDNAfty (MO) lc£ifcfc:0 HT^to g ftS:## i: LT^i".

g ft

e. iORB4

B l m r«»§wt hcDNAfty -k.iSfflft«M%j*SlcT«#fL7tffi$ga?iJT:"-^ (AB29U.doc) 10852^-t? 2* 5S^fl: c DN AE?IJft/ERO!ite^toe#lto«|g^a'J

B i« t^ofccDNAy D-^»5'*asa?ij»«tfffe*-^j (fitoSS 1 .xls) 774^-iy B2IE r#y P-^rosaSip-tJ (MMiSW2.xls) 79^-y

S31* rf||W].|4

B47E v->?ji =2 - K-raae^wte^sferoRT-ocRisic xz,&:. (#%K# 4.xls) 4'<-S?

-323-