Enzymatic Encoding Methods for Efficient Synthesis of Large Libraries
Total Page:16
File Type:pdf, Size:1020Kb
(19) TZZ ¥__Z_T (11) EP 2 341 140 A1 (12) EUROPEAN PATENT APPLICATION (43) Date of publication: (51) Int Cl.: 06.07.2011 Bulletin 2011/27 C12N 15/10 (2006.01) C12Q 1/68 (2006.01) C40B 40/06 (2006.01) C40B 50/06 (2006.01) (21) Application number: 10192716.8 (22) Date of filing: 01.12.2006 (84) Designated Contracting States: • Husemoen, Birgitte Nystrup AT BE BG CH CY CZ DE DK EE ES FI FR GB GR 2500 Valby (DK) HU IE IS IT LI LT LU LV MC NL PL PT RO SE SI • Dolberg, Johannes SK TR 1674 Copenhagen V (DK) Designated Extension States: • Jensen, Kim Birkebæk AL BA HR MK RS 2610 Rødovre (DK) • Petersen, Lene (30) Priority: 01.12.2005 DK 200501704 2100 Copenhagen Ø (DK) 02.12.2005 US 741490 P • Nørregaard-Madsen, Mads 3460 Birkerød (DK) (62) Document number(s) of the earlier application(s) in • Godskesen, Michael Anders accordance with Art. 76 EPC: 2950 Vedbæk (DK) 06818144.5 / 1 957 644 • Glad, Sanne Schrøder 2750 Ballerup (DK) (71) Applicant: Nuevolution A/S • Neve, Søren 2100 Copenhagen 0 (DK) 2800 Lyngby (DK) • Thisted, Thomas (72) Inventors: 3600 Frederikssund (DK) • Franch, Thomas • Kronberg, Tine Titilola Akinleminu 3070 Snekkersten (DK) 3500 Værløse (DK) • Lundorf, Mikkel Dybro • Sams, Christian 4000 Roskilde (DK) 3500 Værløse (DK) • Jakobsen, Søren • Felding, Jakob 2000 Frederiksberg (DK) 2920 Charlottenlund (DK) • Olsen, Eva Kampmann • Freskgård, Per-Ola 2730 Herlev (DK) 603 79 Norrkörping (SE) • Andersen, Anne Lee • Gouliaev, Alex Haahr 4100 Ringsted (DK) 3670 Veksø Sjælland (DK) • Holtmann, Anette • Pedersen, Henrik 2750 Ballerup (DK) 2880 Bagsværd (DK) • Hansen, Anders Holm 2400 Copenhagen NV (DK) (74) Representative: Aamand, Jesper L. et al • Sørensen, Anders Malling Hjerrild & Levin ApS 2860 Søborg (DK) Tuborg Boulevard 12 • Goldbech, Anne 2900 Hellerup (DK) 2200 Copenhagen N (DK) • De Leon, Daen Remarks: 2300 Copenhagen S (DK) This application was filed on 26-11-2010 as a • Kaldor, Ditte Kivsmose divisional application to the application mentioned 2880 Bagsvaerd (DK) under INID code 62. • Sløk, Frank Abildgaard 3450 Allerød (DK) (54) Enzymatic encoding methods for efficient synthesis of large libraries (57) Disclosed is a method for obtaining a bifunction- stranded identifier oligonucleotide, wherein a nascent bi- al complex comprising a molecule linked to a single functional complex comprising a chemical reaction site EP 2 341 140 A1 Printed by Jouve, 75001 PARIS (FR) (Cont. next page) EP 2 341 140 A1 and a priming site for enzymatic addition of a tag is a) reactants, and b) reacted enzymatically at the priming reacted at the chemical reaction site with one or more site with one or more tag(s) identifying the reactant(s). 2 EP 2 341 140 A1 Description [0001] This application is a non-provisional of U.S. provisional application Serial No. 60/741,490 filed on December 2, 2005, which is hereby incorporated by reference in its entirety. All patent and non-patent references cited in U.S. 5 provisional application Serial No. 60/741,490, or in the present application, are also hereby incorporated by reference in their entirety. Technical Field of the Invention 10 [0002] The present invention relates to bifunctional complexes and methods for synthesising such complexes, as well as to methods for split-and-mix synthesis of different molecules each linked to a single stranded identifier oligonucleotide comprising a plurality of tags identifying the molecule and/or the chemical entities having participated in the synthesis of the molecule. The invention also relates to a method for generating a library of different bifunctional complexes and methods for selecting molecules and/or identifying molecules having a desirable property, such as affinity for a target 15 compound. Background of the Invention [0003] Synthesis methods known as split-and-mix, or split-and-recombine, are known and have been used for the 20 synthesis of different molecules. Split-and-mix methods for synthesising polypeptides and other biochemical polymers have been disclosed e.g. in US 5,723,598 directed to the generation of a library of bifunctional complexes comprising a polypeptide and an identifier oligonucleotide comprising tags in the form of a sequence of nucleotides identifying the amino acids which participated in the formation of the polypeptide. The methods are directed to chemical linkage of tags and do not disclose enzymatic linkage, such as ligation, of the nucleotide tags making up the identifier oligonucleotide. 25 [0004] WO 00/23458 discloses a split-and-mix method in which nucleic acid tags are involved in both molecule syn- thesis and molecule identification. [0005] WO 2004/039825 and WO 2005/058479 disclose split-and-mix methods wherein tags in the form of identifier oligonucleotides are linked enzymatically. The prior art methods do not disclose ligation of a double-stranded oligonu- cleotide substrate comprising a plurality of tags and complementary anti-tags at least partly hybridised to each other, 30 wherein said ligation results in the formation of an identifier oligonucleotide comprising a plurality of consecutive nucle- otides in the form of covalently linked tags, whereas the anti-tags of the double-stranded oligonucleotide substrate are not affected by the action of the ligase, i.e. no anti-tags become covalently linked as a result of the enzymatic ligation of the tag part of the double-stranded oligonucleotide substrate. [0006] Reference is also made to WO2006/053571 disclosing methods for molecule synthesis. 35 Summary of the Invention [0007] There is a need for improved methods for split-and-mix synthesis of libraries of small molecules for e.g. phar- maceutical and other purposes. The small molecules can initially be synthesised as part of a bifunctional complex further 40 comprising an identifier oligonucleotide identifying the reactants which have participated in the synthesis of the small molecule. [0008] The methods of the present invention employs a ligation step wherein the substrate for the ligase is in a double stranded form and wherein the substrate comprises a plurality of tags and at least one or more anti-tags wherein tags and anti-tag(s) is/are at least partly hybridised to each other. The tags are covalently linked as a result of the action of 45 an enzyme comprising a ligase activity on the double stranded substrate, but no anti-tags are covalently linked as a result of said ligase action. [0009] The method facilitates separation of ligated tags and discrete, non-ligated anti-tags because of the size and molecular weight difference between (i) the afore-mentioned single stranded identifier oligonucleotide comprising a plurality of covalently linked tags and (ii) discrete, non-ligated anti-tags. The identifier oligonucleotide comprising the 50 ligated tags will typically have a length at least about 3 times the length of the individual anti-tags. [0010] In one embodiment, the tags comprise a 5’ phosphate, or a variant ligatable reactive group, whereas anti-tags do not. Therefore, the growing bifunctional complex will comprise a covalently linked "top"- strand to which a number, such as one or more, of shorter and non-ligated anti-tag(s) is/are hybridised/annealed. This enables removal of anti-tag(s), e.g. after all tag additions have been performed (Fig. 1) or after each tag addition has been performed (Fig. 6). 55 [0011] Removal of anti-tags generally increases fidelity and allows extension after purification of the single-stranded oligonucleotide identifier comprising a plurality of ligated tags. Said extension makes possible the use of selection-specific sequences, which improves robustness towards contamination, which is a well-known phenomena when e.g. PCR amplifications are performed. Also, diversification-sequences can be used which diversifies otherwise un-distinguishable 3 EP 2 341 140 A1 tag combinations. This makes it possible to identify tag combination sequences which may arise during PCR from a single tag combination. It is advantageous to identify such sequences as they may otherwise be interpreted as arising from several molecular entities containing the same tag combination, thus indicating that the specific tag combination corresponds to a small molecule with relatively high ability to be retained during subsequent selection procedures. 5 [0012] Also, because of the design of the tags, cross-hybridization between single stranded tags can be reduced or essentially eliminated. This greatly improves the purification process of the bifunctional complexes comprising single stranded identifier oligonucleotides following the synthesis reactions. One problem associated with purification of bi- functional complexes comprising double stranded identifier oligonucleotides is that such identifiers are prone to illegiti- mate hybridization when renaturing conditions are resumed following a purification process under denaturing conditions. 10 [0013] In order to achieve a minimum degree of cross-hybridization between tags in an identifier oligonucleotide, the tags can be designed using a computer algorithm to maximize or optimize the number of mismatches between any oligo pair, e.g. resulting in a minimum of seven mismatches between any two tag tags. This maximizes or optimize fidelity of hybridization and enables use of sorting methods, such as the ones disclosed e.g. in WO 00/23458 (Harbury). Moreover, it increases robustness of decoding by sequencing, e.g. the tag information can be decoded even if sequencing errors 15 occur. [0014] It is also possible to perform a quenching reaction with the reactants and tags. After reactant reactions resulting in the synthesis of the molecule, excess reactants are rendered unreactive, e.g., by adding a quencher in the form e.g. of a reactant which reacts with excess reactants, e.g. by raising pH, etc, or by removal of a reactant which is critical for the reaction, etc. This can be done e.g. prior to, after or in parallel with deprotection of reacted reactants.