Aspects of Oligonucleotide Synthesis, Enzymatic Assembly, Sequence Verification and Error Correction

Total Page:16

File Type:pdf, Size:1020Kb

Aspects of Oligonucleotide Synthesis, Enzymatic Assembly, Sequence Verification and Error Correction Молекулярная и клеточная биология Вавиловский журнал генетики и селекции. 2018;22(5):498­506 ОБЗОР / REVIEW DOI 10.18699/VJ18.387 Modern approaches to artificial gene synthesis: aspects of oligonucleotide synthesis, enzymatic assembly, sequence verification and error correction G.Y. Shevelev1, 2 , D.V. Pyshnyi1, 2 1 Institute of Chemical Biology and Fundamental Medicine, SB RAS, Novosibirsk, Russia 2 Novosibirsk State University, Novosibirsk, Russia Synthetic biology is a rapidly developing field aimed at Современные подходы engineering of biological systems with predictable properties. к синтезу генов: аспекты Synthetic biology accumulates the achievements of modern biological sciences, programming and computational model­ синтеза олигонуклеотидов, ing as well as engineering technologies for creation of biologi­ ферментативной сборки, cal objects with user­defined properties. Evolution of synthetic проверки последовательностей biology has been marked by a number of technological de­ velopments in each of the mentioned fields. Thus, significant и коррекции ошибок reduction in cost of DNA sequencing has provided an easy access to large amounts of data on the genetic sequences Г.Ю. Шевелев1, 2 , Д.В. Пышный1, 2 of various organisms, and decreased the price of the DNA sequence synthesis, which, analogous to Moore’s law, resulted 1 Институт химической биологии и фундаментальной медицины in an opportunity to create a lot of potential genes without the Сибирского отделения Российской академии наук, Новосибирск, time – consuming and labor – intensive traditional methods Россия 2 Новосибирский государственный университет, Новосибирск, Россия of molecular biology. Development of system biology has allowed forming a deeper understanding of the functions and relationship of natural biological models, as well as of the Синтетическая биология – быстро развивающаяся отрасль computational models describing processes at the cell and науки, нацеленная на создание биологических систем с system levels. Combination of these factors has created an op­ предсказанными свойствами. При этом она использует portunity for conscious changes of natural biological systems. достижения современной биологии, программирования In this review the modern approaches to oligonucleotide gene и компьютерного моделирования, а также инженерной assembly synthesis are discussed, including such aspects as отрасли для создания биологических объектов, обладаю­ protocols for gene assembly, sequence verification, error cor­ щих набором заранее заданных пользовательских свойств. rection and further applications of synthesized genes. Развитие синтетической биологии было обусловлено мно­ Key words: oligonucleotide synthesis; artificial gene synthesis; жеством технологических разработок в каждой из упомя­ enzymatic gene assembly; polymerase cycling assembly (PCA); нутых отраслей. Так, значительное снижение стоимости ligase cycling assembly (LCA); error correction; sequence технологии секвенирования ДНК привело к наработке verification. больших объемов данных о генетических последователь­ ностях различных организмов. Снижение стоимости син­ теза последовательностей ДНК в соответствии с законом Мура позволило создавать библиотеки синтетических генов, представляющие потенциальный интерес в работе генных инженеров без необходимости использования традиционных и трудоемких методов молекулярной био­ логии. Благодаря развитию системной биологии сфор­ мировано глубокое понимание взаимосвязей и функций природных биологических моделей, а также построены прогностические модели, описывающие молекулярные процессы на клеточном и системном уровнях. Комбина­ ция вышеперечисленных факторов создала возможность осознанного изменения природных биологических систем. В данном обзоре обсуждается современное состояние подходов к синтезу олигонуклеотидов для последующей HOW TO CITE THIS ARTICLE: сборки генных конструкций и к ферментативной сборке Shevelev G.Y., Pyshnyi D.V. Modern approaches to artificial gene synthesis: aspects of oligonucleotide synthesis, enzymatic assembly, генов. Освещены аспекты использования различного sequence verification and error correction. Vavilovskii Zhurnal Genetiki программного обеспечения для подбора олигонуклеоти­ i Selektsii = Vavilov Journal of Genetics and Breeding. 2018;22(5):498­ дов для последующей сборки генов, проверки точности 506. DOI 10.18699/VJ18.387 синтезированной последовательности генов, а также ис­ правления ошибок. Ключевые слова: синтез олигонуклеотидов; синтез искус­ УДК 577.2.08 ственных генов; ферментативная сборка генов; полиме­ Поступила в редакцию 02.05.2018 Принята к публикации 09.06.2018 разная циклическая сборка; лигазная циклическая сборка; © АВТОРЫ, 2018 коррекция ошибок; проверка последовательности. e­mail: [email protected] ince a first gene was synthesized from scratch in the factor for overcoming barrier over 100 nt during oligonucle- 70’s (Agarwal et al., 1970) and it’s in vivo activity was otide synthesis is stochastic depurination side reactions which Stested (Ryan et al., 1979) around 50 years has passed, accompany acid deprotection step (Hall et al., 2009). Vendors and the technology of artificial gene synthesis has made a such as Integrated DNA Technologies (IDT), however, declare significant leap beyond the approaches implemented in the possibility to synthesize high quality oligonucleotides up to classical works. Nowdays, it relies on the modern methods of 200 nt using a special support and synthetic protocol for low- oligonucleotide synthesis, the variants of Polymerase Chain yield and long-oligonucleotide synthesis. Reaction (PCR) (Saiki et al., 1985; Mullis et al., 1986) with As an alternative to chemical oligonucleotide synthesis, high fidelity DNA polymerases (Böhlke et al., 2000), and T4 RNA ligase-mediated solid-phase enzymatic oligonucle- the sequence verification step using the Sanger sequencing otide synthesis approach was proposed (Schmitz, Reetz, (Sanger, Coulson, 1975; Smith et al., 1986) or high-throughput 1999). The approach is based on coupling reaction between DNA sequencing (Church, 2006; Hall, 2007; Bosch, Grody, a solid-phase attached oligonucleotide primer and the mono- 2008; Schuster, 2008; Shendure, Ji, 2008; Tucker et al., 2009). nucleoside 3′,5′-biphosphate mediated by T4 RNA ligase in During the last decade modern approaches to assemble large a water solution. The terminal phosphate blocking group is (up to dozens and hundreds kb) DNA molecules (Gibson et al., removed enzymatically by alkaline phosphatase, and all ex- 2009; Gibson, 2011) and error correction within synthesized cess reagents are simply washed off from the resin. The next gene have been introduced (Ma et al., 2012b). nucleoside 3′,5′-biphosphate can then bind to the end of the chain. Following NH2OH-induced cleavage from the resin Oligonucleotide synthesis gives the desired elongation of the product. The drawback of All the synthesis approaches rely on usage of chemically syn- this approach is relatively low rate of elongation (48 h) due thesized oligonucleotides as building blocks for an enzyme- to the kinetic properties of the enzyme. However, possible mediated assembly (Ma et al., 2012b; Kosuri, Church, 2014). automation and molecular biological optimization of enzyme using directed evolution can make the enzymatic approach Solid-phase oligonucleotide synthesis a viable alternative to today’s chemical synthesis approach. The most widely used approach to oligonucleotide synthesis has been the solid-phase phosphoramidite method developed Parallel oligonucleotide synthesis using microchips in the 80’s (Beaucage, Caruthers, 1981; Matteucci, Caruthers, The oligonucleotide amounts necessary for gene synthesis ap- 1981). The DNA oligonucleotides are synthesized from the plication are relatively small (picomols) while modern DNA 3′ to 5′ end by consecutive coupling of activated building de- synthesizers can produce oligonucleotides in a higher excess oxynucleoside phosphoramidites to an initial deoxynucleoside (nanomolar and micromolar) making this approach redundant. attached to a solid support (usually the support is a controlled Significant improvements in performance and cost-reduction pore glass (CPG) or highly cross-linked polystyrene beads) may be achieved by parallelization and miniaturization of by its 3′-OH group (Ellington, Pollard, 2000). A solid support oli gonucleotide synthesis platforms. Thus, microchip-based matrix is placed into several or several hundred parallel flow- approaches previously used for DNA diagnostics and sequenc- through individual reaction columns, and while synthesis all ing have been adopted for multi-parallel synthesis of oligo- the reagents necessary for the synthesis cycle flow through the nucleotide arrays. Several constructive implementations such solid support column. A full cycle to add a single nucleotide as ink-jet DNA printing (Lausted et al., 2004), microfluidic consists of several stages: (1) deprotection: acid is used to devices (Zhou et al., 2004; Huang et al., 2009; Lee et al., 2010), remove a Dimetoxytrityl (DMT) group from the 5′-end of a light-directed (Richmond et al., 2004) and electrochemical growing oligonucleotide chain; (2) coupling: the solid-phase (Egeland, Southern, 2005; Chow et al., 2009) microarray syn - 5′-OH group of an attached oligonucleotide reacts with tet- thesis and LED-controlled capillaries (Blair et al., 2006) razole – activated nucleoside phosphoramidite producing the synthesis have been independently developed. A conditions elongated product of reaction;
Recommended publications
  • Gene Composer: Database Software for Protein Construct Design, Codon
    BMC Biotechnology BioMed Central Software Open Access Gene Composer: database software for protein construct design, codon engineering, and gene synthesis Don Lorimer1,2,3, Amy Raymond1,2,JohnWalchli3,4,MarkMixon3,4, Adrienne Barrow1,3, Ellen Wallace1,3,RenaGrice1,3,AlexBurgin1 and Lance Stewart*1,2,3 Address: 1deCODE biostructures, Inc 7869 NE Day Road West, Bainbridge Island, WA, 98110, USA, 2Seattle Structural Genomics Center for Infectious Disease, Bainbridge Island, WA, 98110, USA, 3Accelerated Technologies Center for Gene to 3D Structure, Bainbridge Island, WA, 98110, USA and 4Emerald BioSystems, Inc 7869 NE Day Road West, Bainbridge Island, WA, 98110, USA E-mail: Don Lorimer - [email protected]; Amy Raymond - [email protected]; John Walchli - [email protected]; Mark Mixon - [email protected]; Adrienne Barrow - [email protected]; Ellen Wallace - [email protected]; Rena Grice - [email protected]; Alex Burgin - [email protected]; Lance Stewart* - [email protected] *Corresponding author Published: 21 April 2009 Received: 16 October 2008 BMC Biotechnology 2009, 9:36 doi: 10.1186/1472-6750-9-36 Accepted: 21 April 2009 This article is available from: http://www.biomedcentral.com/1472-6750/9/36 © 2009 Lorimer et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Abstract Background: To improve efficiency in high throughput protein structure determination, we have developed a database software package, Gene Composer, which facilitates the information-rich design of protein constructs and their codon engineered synthetic gene sequences.
    [Show full text]
  • Paulo Miguel Da Silva Gaspar Optimizaç˜Ao De Genes Para
    Departamento de Universidade de Aveiro Electr´onica,Telecomunica¸c~oese Inform´atica, 2010 Paulo Miguel Optimiza¸c~aode genes para express~aoheter´ologa da Silva Gaspar Gene optimization for heterologous expression Departamento de Universidade de Aveiro Electr´onica,Telecomunica¸c~oese Inform´atica, 2010 Paulo Miguel Optimiza¸c~aode genes para express~aoheter´ologa da Silva Gaspar Gene optimization for heterologous expression Tese apresentada `aUniversidade de Aveiro para cumprimento dos requisitos necess´ariospara a obten¸c~aodo grau de Mestre em Engenharia de Computa- dores e Telem´atica,realizada sobre a orienta¸c~aoci^entificado Dr. Jos´eLu´ıs Oliveira (Professor Associado da Universidade de Aveiro e investigador no IEETA) e da Dr.a Gabriela Moura (Professora Auxiliar na Universidade de Aveiro e investigadora no CESAM). o j´uri/ the jury presidente / president Joaquim Arnaldo Carvalho Martins Professor Catedr´aticoda Universidade de Aveiro vogais / examiners committee Jos´eLu´ısOliveira Professor Associado da Universidade de Aveiro (orientador) Gabriela Moura Professora Auxiliar na Universidade de Aveiro (co-orientadora) Rui Pedro Lopes Professor Coordenador do departamento de Inform´aticae Comunica¸c~oes do Instituto Polit´ecnicode Bragan¸ca agradecimentos / Elaborar uma tese sobre um assunto algo complexo em cerca de cem p´aginas acknowledgements ´ede facto uma tarefa dif´ıcil. Compreender em alguns meros par´agrafoso reconhecimento e a gratifica¸c~aopor quem me apoiou, ´eirrealiz´avel. Agrade¸co em primeiro lugar ao meu professor e orientador Jos´eLu´ısOliveira, que me acompanhou, guiou e ajudou durante todo o progresso desta inves- tiga¸c~ao,mostrando-se sempre presente e dispon´ıvel, e cujos conselhos e apoio foram vitais.
    [Show full text]
  • Attenuated Viruses Useful for Vaccines Für Impfstoffe Geeignete Abgeschwächte Viren Virus Attenues Utiles Pour Des Vaccins
    (19) TZZ _¥__T (11) EP 2 139 515 B1 (12) EUROPEAN PATENT SPECIFICATION (45) Date of publication and mention (51) Int Cl.: of the grant of the patent: A61K 39/13 (2006.01) 13.06.2018 Bulletin 2018/24 (86) International application number: (21) Application number: 08799746.6 PCT/US2008/058952 (22) Date of filing: 31.03.2008 (87) International publication number: WO 2008/121992 (09.10.2008 Gazette 2008/41) (54) ATTENUATED VIRUSES USEFUL FOR VACCINES FÜR IMPFSTOFFE GEEIGNETE ABGESCHWÄCHTE VIREN VIRUS ATTENUES UTILES POUR DES VACCINS (84) Designated Contracting States: (74) Representative: Pohlman, Sandra M. et al AT BE BG CH CY CZ DE DK EE ES FI FR GB GR df-mp Dörries Frank-Molnia & Pohlman HR HU IE IS IT LI LT LU LV MC MT NL NO PL PT Patentanwälte Rechtsanwälte PartG mbB RO SE SI SK TR Theatinerstrasse 16 Designated Extension States: 80333 München (DE) AL BA MK RS (56) References cited: (30) Priority: 30.03.2007 US 909389 P WO-A2-02/095363 US-A1- 2004 209 241 07.03.2008 US 68666 US-A9- 2004 097 439 US-B1- 6 696 289 (43) Date of publication of application: • BURNS CARA CARTHEL ET AL: "Modulation of 06.01.2010 Bulletin 2010/01 poliovirus replicative fitness in HeLa cells by deoptimization of synonymous codon usage in (73) Proprietor: The Research Foundation of the State thecapsid region", JOURNAL OF VIROLOGY, vol. University of 80, no. 7, April 2006 (2006-04), pages 3259-3272, New York XP002633525, -& WO 2006/042156 A2 (US GOV Albany, NY 12207 (US) HEALTH & HUMAN SERV [US]; KEW OLEN M [US]; BURNS CARA C [US]; S) 20 April 2006 (72) Inventors: (2006-04-20) • WIMMER, Eckard • MUELLER STEFFEN ET AL: "Reduction of the East Setauket, NY 11733 (US) rate of poliovirus protein synthesis through • SKIENA, Steve large-scale codon deoptimization causes Setauket, NY 11733 (US) attenuation of viral virulence by lowering specific • MUELLER, Steffen infectivity.", JOURNAL OF VIROLOGY, vol.
    [Show full text]
  • Computational Tools for Synthetic Gene Optimization
    Computational Tools for Synthetic Gene Optimization Numaan Cheema - Computer Science, 21 Mentor: Dr. Dimitris Papamichail Introduction ● Last two decades have seen a rise of computational tools developed by software developers and biologists for synthetic biology ● Many types: ○ Optimizing constructs for manufacturability ○ Enabling design of genetic circuits ○ Redesign of protein-coding genes for expression ● My focus: To examine gene optimization computation tools and to analyze their efficacy in performing synthetic gene optimization Gene Editing Genome editing (also called gene editing) is a group of technologies that give scientists the ability to change an organism's DNA. These technologies allow genetic material to be added, removed, or altered at particular locations in the genome Figure 2. Top down view of why gene optimization (part of synthetic biology) is important. (Microbial Electrochemical and Fuel Cells, 2016) How does redesign work with tools? 1. The first step is to retrieve the natural DNA sequence of letters across a chromosome 2. Choose an appropriate tool that performs the wanted optimization features Examples of these include: a. Everytime it sees the letter series TAG change it to TAA b. Delete a class of genes referred to as tRNA genes from their positions c. Shuffle the DNA sequence according the researchers desired order d. Insert or Delete portions of DNA to enhance or minimize particular genes 3. Once the alterations and optimization is completed, then researchers can purchase DNA chunks from companies that can be easily manipulated in a lab Common Optimization Objectives 1. Codon Usage Bias optimization Altering rare codons in the target gene so they more closely reflect the codon usage of host without modifying the amino acid sequence 2.
    [Show full text]
  • Computational Tools and Algorithms for Designing Customized
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE REVIEW ARTICLE published: 06provided October by 2014 Frontiers - Publisher Connector BIOENGINEERING AND BIOTECHNOLOGY doi: 10.3389/fbioe.2014.00041 Computational tools and algorithms for designing customized synthetic genes Nathan Gould 1, Oliver Hendy 2 and Dimitris Papamichail 1* 1 Department of Computer Science, The College of New Jersey, Ewing, NJ, USA 2 Department of Biology, The College of New Jersey, Ewing, NJ, USA Edited by: Advances in DNA synthesis have enabled the construction of artificial genes, gene cir- Ilias Tagkopoulos, University of cuits, and genomes of bacterial scale. Freedom in de novo design of synthetic constructs California, Davis, USA provides significant power in studying the impact of mutations in sequence features, and Reviewed by: verifying hypotheses on the functional information that is encoded in nucleic and amino Yinjie Tang, Washington University, USA acids. To aid this goal, a large number of software tools of variable sophistication have M. Kalim Akhtar, University College been implemented, enabling the design of synthetic genes for sequence optimization London, UK based on rationally defined properties. The first generation of tools dealt predominantly *Correspondence: with singular objectives such as codon usage optimization and unique restriction site incor- Dimitris Papamichail, Department of Computer Science, The College of poration. Recent years have seen the emergence of sequence design tools that aim to New Jersey, 2000 Pennington Road, evolve sequences toward combinations of objectives.The design of optimal protein-coding Ewing, NJ 08628, USA sequences adhering to multiple objectives is computationally hard, and most tools rely on e-mail: [email protected] heuristics to sample the vast sequence design space.
    [Show full text]
  • (12) United States Patent (10) Patent No.: US 8,697.417 B2 Bakker Et Al
    USOO8697417B2 (12) United States Patent (10) Patent No.: US 8,697.417 B2 Bakker et al. (45) Date of Patent: Apr. 15, 2014 (54) BACULOVIRAL VECTORSCOMPRISING FOREIGN PATENT DOCUMENTS REPEATED CODING SEQUENCES WITH DIFFERENTIAL CODON BASES WO 9636712 11, 1996 WO WOO114539 A2 * 3, 2001 ............. C12N 15.00 WO WOO168888 A2 * 9, 2001 ........... C12N 15/864 (75) Inventors: Andrew Christian Bakker, Almere WO WO O3018820 A2 * 3, 2003 ........... C12N 15/864 (NL); Wilhelmus Theodorus Johannes WO WO O3061582 A2 * 7, 2003 Maria Christiaan Hermens, Almere WO WO 2005035,743 A2 * 4, 2005 ............... C12N 7/O1 WO 2007046703 A2 4/2007 (NL) WO 2007084773 A2 7/2007 (73) Assignee: Uniqure IP B.V., Amsterdam (NL) OTHER PUBLICATIONS (*) Notice: Subject to any disclaimer, the term of this Urabe et al. Insect Cells as a Factory to Produce Adeno-Associated patent is extended or adjusted under 35 Virus Type 2 Vectors. Human Gene Therapy 2002, vol. 13, pp. 1935 U.S.C. 154(b) by 725 days. 1943. Jayaraj et al. GeMS: an advanced software package for designing (21) Appl. No.: 12/670,780 synthetic genes. Nucleic Acids Research 2005, vol. 33, No. 9, pp. 3O11-3016. (22) PCT Filed: Jul. 25, 2008 Hunter et al. Colocalization of adeno-associated virus Rep and capsid proteins in the nuclei of infected cells. Journal of Virology 1992, vol. (86). PCT No.: PCT/NL2008/050512 66, No. 1, pp. 317-324.* Urabe et al., “Insect cells as a factory to produce adeno-associated S371 (c)(1), virus type 2 vectors.” Human Gene Therapy, 2002, 13:1935-1943.
    [Show full text]
  • All-In-One Molecular Cloning and Genetic Engineering Design, Simulation & Management Software for Complex Synthetic Biology and Metabolic Engineering Projects
    Author’s Accepted Manuscript Molecular Cloning Designer Simulator (MCDS): All-in-one Molecular Cloning and Genetic Engineering Design, Simulation & Management Software for Complex Synthetic Biology and Metabolic Engineering Projects Zhenyu Shi, Claudia E. Vickers www.elsevier.com/locate/mec PII: S2214-0301(16)30015-3 DOI: http://dx.doi.org/10.1016/j.meteno.2016.05.003 Reference: MEC39 To appear in: Metabolic Engineering Communications Received date: 26 December 2015 Revised date: 30 March 2016 Accepted date: 10 May 2016 Cite this article as: Zhenyu Shi and Claudia E. Vickers, Molecular Cloning Designer Simulator (MCDS): All-in-one Molecular Cloning and Genetic Engineering Design, Simulation & Management Software for Complex Synthetic Biology and Metabolic Engineering Projects, Metabolic Engineering Communications, http://dx.doi.org/10.1016/j.meteno.2016.05.003 This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting galley proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain. Molecular Cloning Designer Simulator (MCDS): All-in-one Molecular Cloning and Genetic Engineering Design, Simulation & Management Software for Complex Synthetic Biology and Metabolic Engineering Projects Zhenyu Shi1* & Claudia E. Vickers1 1Australian Institute for Bioengineering & Nanotechnology, The University of Queensland, St Lucia, QLD 4072, Australia. *Corresponding Author. Telephone: +61 7 3346-3958; Fax: +61 7 334 63973; Email: [email protected] Abstract Molecular Cloning Designer Simulator (MCDS) is a powerful new all-in-one cloning and genetic engineering design, simulation and management software platform developed for complex synthetic biology and metabolic engineering projects.
    [Show full text]
  • Computational Codon Optimization of Synthetic Gene for Protein Expression Bevan Kai-Sheng Chung1,2,3 and Dong-Yup Lee1,2,3*
    Chung and Lee BMC Systems Biology 2012, 6:134 http://www.biomedcentral.com/1752-0509/6/134 METHODOLOGY ARTICLE Open Access Computational codon optimization of synthetic gene for protein expression Bevan Kai-Sheng Chung1,2,3 and Dong-Yup Lee1,2,3* Abstract Background: The construction of customized nucleic acid sequences allows us to have greater flexibility in gene design for recombinant protein expression. Among the various parameters considered for such DNA sequence design, individual codon usage (ICU) has been implicated as one of the most crucial factors affecting mRNA translational efficiency. However, previous works have also reported the significant influence of codon pair usage, also known as codon context (CC), on the level of protein expression. Results: In this study, we have developed novel computational procedures for evaluating the relative importance of optimizing ICU and CC for enhancing protein expression. By formulating appropriate mathematical expressions to quantify the ICU and CC fitness of a coding sequence, optimization procedures based on genetic algorithm were employed to maximize its ICU and/or CC fitness. Surprisingly, the in silico validation of the resultant optimized DNA sequences for Escherichia coli, Lactococcus lactis, Pichia pastoris and Saccharomyces cerevisiae suggests that CC is a more relevant design criterion than the commonly considered ICU. Conclusions: The proposed CC optimization framework can complement and enhance the capabilities of current gene design tools, with potential applications to heterologous protein production and even vaccine development in synthetic biotechnology. Background expressed recombinant protein since the wild-type for- Recent developments in artificial gene synthesis have eign genes have not been evolved for optimum expres- enabled the construction of synthetic gene circuits [1] sion in the host.
    [Show full text]
  • Open-Source Web Application for Rare Codon Identification and Custom DNA Sequence Optimization Edward Daniel1, Goodluck U
    Daniel et al. BMC Bioinformatics (2015) 16:303 DOI 10.1186/s12859-015-0743-5 SOFTWARE Open Access ATGme: Open-source web application for rare codon identification and custom DNA sequence optimization Edward Daniel1, Goodluck U. Onwukwe1, Rik K. Wierenga1, Susan E. Quaggin2, Seppo J. Vainio3 and Mirja Krause3* Abstract Background: Codon usage plays a crucial role when recombinant proteins are expressed in different organisms. This is especially the case if the codon usage frequency of the organism of origin and the target host organism differ significantly, for example when a human gene is expressed in E. coli. Therefore, to enable or enhance efficient gene expression it is of great importance to identify rare codons in any given DNA sequence and subsequently mutate these to codons which are more frequently used in the expression host. Results: We describe an open-source web-based application, ATGme, which can in a first step identify rare and highly rare codons from most organisms, and secondly gives the user the possibility to optimize the sequence. Conclusions: This application provides a simple user-friendly interface utilizing three optimization strategies: 1. one-click optimization, 2. bulk optimization (by codon-type), 3. individualized custom (codon-by-codon) optimization. ATGme is an open-source application which is freely available at: http://atgme.org Keywords: Codon usage, Sequence optimization, Protein, Translation, DNA Background [4]. Studies have shown that the presence of rare codons Alternative synonymous codons are not used with equal influences gene expression levels [4, 5] and the solubility frequencies in one organism or also when different organ- and amount of the expressed protein [2].
    [Show full text]
  • 2008 Holiday Publication
    Proc. Natl. Acad. Sci. NorthPole (2008) 12:25-31 NorthPoleInc. Metabolic engineering of Picea abies for receptor mediated induction of fluorescence and olfactory signaling Santa Claes1*, Feliz Navidad1, Tomte Ness E.2, Ten N. Baum2, Hanukkah Harry1,2, Bada Din Sridhar2, and Jeremy Elf1 1University of NorthPole, Dept Bioengineering, Arctic Lane 1, Santa’s Secret Village, NorthPole 2University of NorthPole, Department of Cool, Cold Tundra Avenue 365, Santa’s Secret Village, NorthPole *Corresponding author: [email protected] Received 28 June 2008, received in revised form 25 October 2008, accepted 24 December 2008 Abstract Reoccurring seasonal harvesting of Picea abies (Norwegian spruce) and the subsequent attachment of light fixtures to the tree followed by composting after only 3 weeks of minimal use is not only a waste but also leads to diminishing spruce forests, release of carbon dioxide, global warming, increased energy consumption and a political burden to the ongoing diplomatic efforts of Santa Claus. We here present a synthetic biology solution to address this key concerns for winter celebrations – A song induced Christmas tree with endogenous light and odor emission. Advances in synthetic biology, tissue engineering and metabolic engineering have now provided new insights and techniques for controlling whole organism signaling and bio-compound production in situ, and recent efforts in synthetic biology have demonstrated that complex regulatory and metabolic networks can be designed and engineered in microorganisms and simple eukaryotes. Leading researchers at the NorthPole Bioscience Engineering Council here disclose how sonic-induced signals received through tissue engineered tympanal organs are incorporated into regulating the metabolic distribution of fluorescent and olfactory signals in Picea abies.
    [Show full text]
  • Views and Protocols
    Tuan-Anh et al. BMC Bioinformatics (2017) 18:100 DOI 10.1186/s12859-017-1517-z METHODOLOGY ARTICLE Open Access Novel methods to optimize gene and statistic test for evaluation – an application for Escherichia coli Tran Tuan-Anh1, Le Thi Ly2, Ngo Quoc Viet3 and Pham The Bao1* Abstract Background: Since the recombinant protein was discovered, it has become more popular in many aspects of life science. The value of global pharmaceutical market was $87 billion in 2008 and the sales for industrial enzyme exceeded $4 billion in 2012. This is strong evidence showing the great potential of recombinant protein. However, native genes introduced into a host can cause incompatibility of codon usage bias, GC content, repeat region, Shine-Dalgarno sequence with host’s expression system, so the yields can fall down significantly. Hence, we propose novel methods for gene optimization based on neural network, Bayesian theory, and Euclidian distance. Result: The correlation coefficients of our neural network are 0.86, 0.73, and 0.90 in training, validation, and testing process. In addition, genes optimized by our methods seem to associate with highly expressed genes and give reasonable codon adaptation index values. Furthermore, genes optimized by the proposed methods are highly matched with the previous experimental data. Conclusion: The proposed methods have high potential for gene optimization and further researches in gene expression. We built a demonstrative program using Matlab R2014a under Mac OS X. The program was published in both standalone executable program and Matlab function files. The developed program can be accessed from http://www.math.hcmus.edu.vn/~ptbao/paper_soft/GeneOptProg/.
    [Show full text]
  • ( 12 ) United States Patent
    US010023845B2 (12 ) United States Patent ( 10 ) Patent No. : US 10 , 023 ,845 B2 Wimmer et al . ( 45 ) Date of Patent: Jul. 17, 2018 ( 54 ) METHODS OF MAKING MODIFIED VIRAL ( 58 ) Field of Classification Search GENOMES None See application file for complete search history . (71 ) Applicant: The Research Foundation for The State University of New York , Albany , (56 ) References Cited NY (US ) U . S . PATENT DOCUMENTS ( 72 ) Inventors: Eckard Wimmer , East Setauket , NY 6 ,696 ,289 B1 2 /2004 Bae et al. (US ) ; Steve Skiena , Setauket , NY (US ) ; 2004 / 0097439 A95 /2004 Nicolas et al . Steffen Mueller , Kings Point, NY (US ) ; 2004 /0209241 A1 10 / 2004 Hermanson et al . Bruce Futcher , Stony Brook , NY (US ) ; 2008 /0118530 A1 5 / 2008 Kew et al . Dimitris Papamichail , Pennington , NJ (US ) ; John Robert Coleman , Blauvelt , FOREIGN PATENT DOCUMENTS NY (US ) ; Jeronimo Cello , Port WO 2002 /095363 A2 11 /2002 Jefferson , NY (US ) WO 2006 /042156 A2 4 / 2006 ( 73 ) Assignee : The Research Foundation for The OTHER PUBLICATIONS State University of New York , Albany , NY (US ) Cheng , L . et al. , “ Absence of Effect of Varying Thr- Leu Codon Pairs on Protein Synthesis in a T7 System ” , Biochem (2001 ) , vol . 40 , pp . 6102- 6106 ( * ) Notice : Subject to any disclaimer , the term of this Cohen , B . et al. , “ Natural Selection and Algorithmic Design of patent is extended or adjusted under 35 mRNA ” , B JCB (2003 ) , vol. 10 , pp . 3 - 4 . U . S . C . 154 ( b ) by 0 days . Doma, M . et al. , “ Endonucleolytic Cleave of Euraryotic mRNAS with Stalls and Translation Elongation ” , Nature ( 2006 ) , vol. 440 , pp .
    [Show full text]