Computational Approaches for Improving Treatment and Prevention of Viral Infections

Total Page:16

File Type:pdf, Size:1020Kb

Computational Approaches for Improving Treatment and Prevention of Viral Infections Computational Approaches for Improving Treatment and Prevention of Viral Infections Dissertation zur Erlangung des Grades des Doktors der Naturwissenschaften an der Fakultät für Mathematik und Informatik der Universität des Saarlandes von Matthias Döring Saarbrücken 2019 Tag des Kolloqiums 30.04.2019 Dekan Prof. Dr. Sebastian Hack Vorsitzender der Prüfungskommission Prof. Dr. Gerhard Weikum Erster Berichterstatter Prof. Dr. Nico Pfeifer Zweiter Berichterstatter Prof. Dr. Dr. Thomas Lengauer Dritter Berichterstatter Prof. Dr. Olga Kalinina Akademischer Mitarbeiter Dr. Peter Ebert Abstract The treatment of infections with HIV or HCV is challenging. Thus, novel drugs and new computational approaches that support the selection of therapies are required. This work presents methods that support therapy selection as well as methods that advance novel antiviral treatments. geno2pheno[ngs-freq] identifies drug resistance from HIV-1 or HCV samples that were subjected to next-generation sequencing by interpreting their sequences either via support vector machines or a rules-based approach. geno2pheno[coreceptor-hiv2] deter- mines the coreceptor that is used for viral cell entry by analyzing a segment of the HIV-2 surface protein with a support vector machine. openPrimeR is capable of finding optimal combinations of primers for multiplex polymerase chain reaction by solving a set cover prob- lem and accessing a new logistic regression model for determining amplification events arising from polymerase chain reaction. geno2pheno[ngs-freq] and geno2pheno[coreceptor- hiv2] enable the personalization of antiviral treatments and support clinical decision making. The application of openPrimeR on human immunoglobulin sequences has resulted in novel primer sets that improve the isolation of broadly neutralizing antibodies against HIV-1. The methods that were developed in this work thus constitute important contributions towards improving the prevention and treatment of viral infectious diseases. 4 Zusammenfassung Die Behandlung von HIV- oder HCV-Infektionen ist herausfordernd. Daher werden neue Wirkstoffe, sowie neue computerbasierte Ver- fahren benötigt, welche die Therapie verbessern. In dieser Arbeit wurden Methoden zur Unterstützung der Therapieauswahl entwick- elt, aber auch solche, welche neuartige Therapien vorantreiben. geno2pheno[ngs-freq] bestimmt, ob Resistenzen gegen Medika- mente vorliegen, indem es Hochdurchsatzsequenzierungsdaten von HIV-1 oder HCV Proben mittels Support Vector Machines oder einem regelbasierten Ansatz interpretiert. geno2pheno[coreceptor- hiv2] bestimmt den HIV-2 Korezeptorgebrauch dadurch, dass es einen Abschnitt des viralen Oberflächenproteins mit einer Support Vector Machine analysiert. openPrimeR kann optimale Kombi- nationen von Primern für die Multiplex-Polymerasekettenreaktion finden, indem es ein Mengenüberdeckungsproblem löst und auf ein neues logistisches Regressionsmodell für die Vorhersage von Amplifizierungsereignissen zurückgreift. geno2pheno[ngs-freq] und geno2pheno[coreceptor- hiv2] ermöglichen die Personalisierung antiviraler Therapien und unterstützen die klinische Entscheidungsfindung. Durch den Ein- satz von openPrimeR auf humanen Immunoglobulinsequenzen konnten Primersätze generiert werden, welche die Isolierung von breit neutralisierenden Antikörpern gegen HIV-1 verbessern. Die in dieser Arbeit entwickelten Methoden leisten somit einen wichti- gen Beitrag zur Verbesserung der Prävention und Therapie viraler Infektionskrankheiten. Acknowledgements This dissertation would not have been possible without the guidance and support from several people that I would like to acknowledge in the following paragraphs. I am grateful to Nico Pfeifer and Thomas Lengauer for the opportunity of pursuing a PhD at the Max Planck Institute for Informatics. It was a privilege to work in an environ- ment that allowed me to fully immerse myself in research, free from other concerns. The expert guidance and support provided by Nico and Thomas were integral to shaping my doctoral research. I am par- ticularly thankful for Nico’s kind mentorship and continual support. I would like to express my appreciation for many people at the Max Planck Institute for Informatics. For providing helpful advice before and during my time as a PhD student, I would like to thank Bastian Beggel, Glenn Lawyer, and Markus List. For the fruitful and stimulating collaboration on the geno2pheno web service, I would like to express my gratitude to Joachim Büch and Georg Friedrich. For their great technical support, I am indebted to Achim, Georg, and everyone working in the Information Services and Technol- ogy group. I would like to thank Alejandro Pironti and Prabhav Kalaghatgi for their contributions to my work; geno2pheno[ngs-freq] would not have been possible without the groundwork that was laid by Alejandro. For her administrative work, I am thankful to Ruth Schneppen-Christmann who always helped me with organizing my conference trips. I am grateful to a multitude of people outside Max Planck Society. I am indebted to Rolf Kaiser for his openness, inventiveness, and the organization of platforms such as the AREVIR meeting and the Rettenstein symposium. For their input to my scientific work, I would like to highlight Eva Heger, Elena Knops, Florian Klein, and Christoph Kreer. Additionally, I would like to acknowledge the contributions from Pedro Borrego, Ricardo Camacho, Martin Däumer, Josef Eberle, Meryem Seda Ercanoglu, Nathalie Lehnen, Andreia Martins, Martin Obermeier, Philipp Schommers, Saleta Sierra-Aragon, Simone Susser, Alexander Thielen, and Nuno Taveira. I would also like to thank the following organizations that funded my doctoral research: Max Planck Society, the German Ministry of Health (MASTER-HIV/HEP), the European HIV Coreceptor Study Panel (EucoHIV), and the German Center for Infection Research (HCV Treatment optimization). Without governmental funding, this work would not have been possible. Heartful thanks go to all my peers at the Center for Bioinformatics in Saarbrücken who always were a source of inspiration, motivation and joyfulness; most notably Peter Ebert, Anna Hake, Lisa Handl, 6 Sivarajan Karunanithi, Tim Kehl, Fabian Müller, Sarvesh Nikumbh, Alejandro Pironti, Florian Schmidt, Lara Schneider, Nora Katharina Speicher, and Thorsten Will. I would like to highlight Peter Ebert, Anna Hake, Eva Heger, Edith Heiter, Elena Knops, and Nora Katha- rina Speicher for providing feedback on this dissertation. For their companionship during my studies I am deeply thankful to Miriam Bah, Max Fischer, Alexander Junge, Sebastian Keller, and Andreas Mohr. Above all, I would like to thank my family. I am indebted to my parents for their continual support and to my brother for spark- ing my interest in computers. Finally, I would like to thank Edith Heiter for always being there for me. I cannot imagine anything better than coming home and seeing your smiling face. Contents List of Figures 13 List of Tables 17 List of Algorithms 19 1 Introduction 21 IBackground 27 2 Virological and Immunological Foundations 29 2.1 Viruses and Viral Pathogenesis 30 2.2 Defense Mechanisms against Viruses 32 2.2.1 Components of the Immune System...... 32 2.2.2 Overview of the Adaptive Immune System.... 33 2.2.3 Cells of the Adaptive Immune System...... 33 2.2.4 Antibody Structure and Function....... 35 2.2.5 Neutralizing Antibodies........... 36 2.3 Human Immunodeficiency Virus 37 2.3.1 Introduction to HIV............ 38 2.3.2 Structure and Genome Organization...... 38 2.3.3 Life Cycle................ 40 2.3.4 Coreceptor Usage............. 42 8 2.3.5 Transmission and Course of Infection...... 45 2.3.6 Treatment of HIV Infection.......... 46 2.3.7 Neutralizing Antibodies and Treatment..... 51 2.4 Hepatitis C Virus 54 2.4.1 Introduction to HCV............ 54 2.4.2 Structure and Genome Organization...... 55 2.4.3 Life Cycle................ 55 2.4.4 Transmission and Course of Infection...... 57 2.4.5 Treatment of HCV Infection......... 58 2.5 Molecular Techniques 60 2.5.1 Phenotypic HIV Coreceptor Testing...... 60 2.5.2 Phenotypic Resistance Testing......... 62 2.5.3 Sequencing of Viral Genomes......... 66 2.5.4 Polymerase Chain Reaction.......... 69 3 Methodological Foundations 73 3.1 Overview of Machine Learning 73 3.2 Supervised Learning 74 3.2.1 Preliminaries............... 74 3.2.2 Supervised Learning as Function Estimation... 75 3.2.3 The Bias-Variance Decomposition....... 76 3.2.4 Training and Test Errors........... 77 3.2.5 Estimating Model Errors........... 78 3.2.6 Limiting Model Complexity via Regularization.. 81 3.2.7 Feature Selection............. 81 3.3 Measures of Predictive Performance 83 3.3.1 The Confusion Matrix............ 83 3.3.2 Performance Measures for Non-Scoring Classifiers. 84 3.3.3 Performance Measures for Scoring Classifiers... 85 3.4 Models for Supervised Learning 86 3.4.1 Logistic Regression............. 86 3.4.2 Support Vector Machines.......... 87 3.5 Clustering 93 3.5.1 K-Means Clustering............ 93 3.5.2 Hierarchical Clustering........... 95 3.6 Statistical Significance Tests 96 3.6.1 Introduction to Hypothesis Testing....... 97 3.6.2 McNemar’s Test.............. 98 3.6.3 Fisher’s Exact Test............. 98 9 3.6.4 Wilcoxon Rank-Sum Test.......... 99 3.6.5 Multiple Hypothesis Testing......... 99 3.7 Optimization with Linear Programs 101 3.7.1 Linear and Integer Linear Programming..... 101 3.7.2 Branch and Bound............. 102 3.7.3 The Set Cover Problem........... 102 II Contributions 105 4 Interpreting Drug Resistance
Recommended publications
  • Department of Energy Office of Health and Environmental Research SEQUENCING the HUMAN GENOME Summary Report of the Santa Fe Workshop March 3-4, 1986
    Department of Energy Office of Health and Environmental Research SEQUENCING THE HUMAN GENOME Summary Report of the Santa Fe Workshop March 3-4, 1986 Los Alamos National Laboratory Los Alamos Los Alamos, New Mexico 87545 Los Alamos National Laboratory is operated by the University of California for the United States Department of Energy under contract W-7405-ENG-36. DEPARTMENT OF ENERGY OFFICE OF HEALTH AND ENVIRONMENTAL RESEARCH SEQUENCING THE HUMAN GENOME SUMMARY REPORT ON THE SANTA FE WORKSHOP (MARCH 3-4, 1986) Executive Summary. The following is a summary of the Santa Fe Workshop held on March 3 and 4, 1986. The workshop was sponsored by the Office of Health and Environmental Research (OHER) and Los Alamos National Laboratory (LANL) and dedicated to examining the feasibility, advisability, and approaches to sequencing the human genome. The workshop considered four principal topics: I. Technologies to be employed. II. Expected benefits. III. Architecture of the enterprise. IV. Participants and funding. I . Technology The participants of the workshop foresaw extraordinary and continuing progress in the efficiency and accuracy of mapping, ordering , and sequencing technologies. They suggested that a coordinated analysis of the human genome begin with the task of ordering overlapping recombinant DNA fragments obtained from purified human chromosomes that would provide an infrastructure for sequencing activity. At the same time, they support in-depth evaluation of current and developing strategies for sequencing including possible applications of automation and robotics that would minimize the time and cost of sequencing. II. Benefits The socio-political and health benefits, and the benefit:cost ratio were seen as highly favorable not only for human health, but in addition for the development of new diagnostic, preventative and therapeutic tools, jobs, and industries.
    [Show full text]
  • Mapping Our Genes—Genome Projects: How Big? How Fast?
    Mapping Our Genes—Genome Projects: How Big? How Fast? April 1988 NTIS order #PB88-212402 Recommended Citation: U.S. Congress, Office of Technology Assessment, Mapping Our Genes-The Genmne Projects.’ How Big, How Fast? OTA-BA-373 (Washington, DC: U.S. Government Printing Office, April 1988). Library of Congress Catalog Card Number 87-619898 For sale by the Superintendent of Documents U.S. Government Printing Office, Washington, DC 20402-9325 (order form can be found in the back of this report) Foreword For the past 2 years, scientific and technical journals in biology and medicine have extensively covered a debate about whether and how to determine the function and order of human genes on human chromosomes and when to determine the sequence of molecular building blocks that comprise DNA in those chromosomes. In 1987, these issues rose to become part of the public agenda. The debate involves science, technol- ogy, and politics. Congress is responsible for ‘(writing the rules” of what various Federal agencies do and for funding their work. This report surveys the points made so far in the debate, focusing on those that most directly influence the policy options facing the U.S. Congress, The House Committee on Energy and Commerce requested that OTA undertake the project. The House Committee on Science, Space, and Technology, the Senate Com- mittee on Labor and Human Resources, and the Senate Committee on Energy and Natu- ral Resources also asked OTA to address specific points of concern to them. Congres- sional interest focused on several issues: ● how to assess the rationales for conducting human genome projects, ● how to fund human genome projects (at what level and through which mech- anisms), ● how to coordinate the scientific and technical programs of the several Federal agencies and private interests already supporting various genome projects, and ● how to strike a balance regarding the impact of genome projects on international scientific cooperation and international economic competition in biotechnology.
    [Show full text]
  • Characterizing the Dna-Binding Site Specificities of Cis2his2 Zinc Fingers
    MQP-ID-DH-UM1 C H A R A C T E RI Z IN G T H E DN A-BINDIN G SI T E SPE C I F I C I T I ES O F C IS2H IS2 Z IN C F IN G E RS A Major Qualifying Project Report Submitted to the Faculty of the WORCESTER POLYTECHNIC INSTITUTE in partial fulfillment of the requirements for the Degrees of Bachelor of Science in Biochemistry and Biology and Biotechnology by _________________________ Heather Bell April 26, 2012 APPROVED: ____________________ ____________________ ____________________ Scot Wolfe, PhD Destin Heilman, PhD David Adams, PhD Gene Function and Exp. Biochemistry Biology and Biotech UMass Medical School WPI Project Advisor WPI Project Advisor MAJOR ADVISOR A BST R A C T The ability to modularly assemble Zinc Finger Proteins (ZFPs) as well as the wide variety of DNA sequences they can recognize, make ZFPs an ideal framework to design novel DNA-binding proteins. However, due to the complexity of the interactions between residues in the ZF recognition helix and the DNA-binding site there is currently no comprehensive recognition code that would allow for the accurate prediction of the DNA ZFP binding motifs or the design of novel ZFPs for a desired target site. Through the analysis of the DNA-binding site specificities of 98 ZFP clones, determined through a bacterial one-hybrid selection system, a predictive model was created that can accurately predict the binding site motifs of novel ZFPs. 2 T A B L E O F C O N T E N TS Signature Page ««««««««««««««««««««««««««« $EVWUDFW«««««««««««««««««««««««««««««« 7DEOHRI&RQWHQWV«««««««««««««««««««««««««« $FNQRZOHGJHPHQWV««««««««««««««««««««««««« %DFNJURXQG«««««««««««««««««««««««««««« Project Purpose «««««««««««««««««««««««««««15 0HWKRGV««««««««««««««««««««««««««««««16 5HVXOWV««««««««««««««««««««««««««««««21 'LVFXVVLRQ«««««««««««««««««««««««««««««28 Bibliograph\«««««««««««««««««««««««««««« 6XSSOHPHQWDO««««««««««««««««««««««««««« 3 A C K N O W L E D G E M E N TS I would like to thank Dr.
    [Show full text]
  • 8Th IAS Conference on HIV Pathogenesis, Treatment &
    ONLINE ONLINE OPEN PEER-REVIEWD PEER-REVIEWD ONLINE A PEER-REVIEWD OPEN ACCESS OPEN HIV/AIDS JOURNAL HIV/AIDS JOURNAL CCESS ONLINEONLINE CCESS A OPEN OPEN PEER-REVIEWD ONLINE HIV/AIDS JOURNAL HIV/AIDS HIV/AIDS JOURNAL PEER-REVIEWD PEER-REVIEWD OPEN ACCESSONLINEPEER-REVIEWD A ONLINEPEER-REVIEWD CCESS HIV/AIDS JOURNALPEER-REVIEWD OPEN ACCESS CCESS A OPEN ONLINE ONLINE HIV/AIDS JOURNAL ONLINE PEER-REVIEWDHIV/AIDS JOURNALONLINEONLINE PEER-REVIEWDOPEN ACCESS PEER-REVIEWD ONLINE ONLINE OPEN ACCESSONLINE OPEN HIV/AIDS JOURNAL CCESS OPEN ACCESS HIV/AIDS JOURNAL PEER-REVIEWD PEER-REVIEWD PEER-REVIEWD HIV/AIDS JOURNAL PEER-REVIEWD A HIV/AIDS JOURNAL OPEN ACCESS PEER-REVIEWD PEER-REVIEWD PEER-REVIEWDONLINE HIV/AIDS JOURNAL HIV/AIDS JOURNAL OPEN ACCESS ONLINE ONLINEONLINE HIV/AIDS JOURNAL HIV/AIDS PEER-REVIEWD A PEER-REVIEWD ONLINE ONLINE HIV/AIDS JOURNAL CCESS HIV/AIDS JOURNALOPEN ONLINE PEER-REVIEWDONLINE PEER-REVIEWD HIV/AIDS JOURNAL OPEN PEER-REVIEWDJOURNAL HIV/AIDS HIV/AIDS JOURNAL HIV/AIDS JOURNAL CCESS ONLINE A A ONLINE OPEN ACCESS HIV/AIDS JOURNAL ONLINE HIV/AIDS JOURNAL CCESS PEER-REVIEWD OPEN ACCESS HIV/AIDS JOURNAL OPEN ACCESS ONLINE HIV/AIDS JOURNAL HIV/AIDS ONLINE OPEN OPEN ACCESS HIV/AIDS JOURNAL PEER-REVIEWD OPEN HIV/AIDS JOURNALPEER-REVIEWD PEER-REVIEWD OPEN ACCESS PEER-REVIEWD CCESS A OPEN PEER-REVIEWDONLINE PEER-REVIEWD PEER-REVIEWD HIV/AIDS JOURNALAbstractHIV/AIDS JOURNAL SupplementHIV/AIDS JOURNAL PEER-REVIEWD OPEN ACCESS CCESS PEER-REVIEWD A PEER-REVIEWD A 8th IAS Conference on HIV Pathogenesis, Treatment & Prevention CCESS OPEN ACCESS 19–22 July 2015, Vancouver, Canada OPEN HIV/AIDS JOURNAL Volume 18, Supplement 4 Scan this QR code with your mobile device to view July 2015 the special issue online 8th IAS Conference on HIV Pathogenesis, Treatment and Prevention (IAS 2015) Journal of the International AIDS Society 2015, 18 (Suppl 4) http://www.jiasociety.org/index.php/jias/article/view/20479 ORAL ABSTRACTS Cg-PrkdcscidIl2rgtm1Wjl/SzJ (NSG) mice resulted in viral amplification MO Á MONDAY in the mouse.
    [Show full text]
  • ISMB 99 August 6 – 10, 1999 Heidelberg, Germany the Seventh
    ______________________________________ Welcome to ISMB 99 August 6 – 10, 1999 Heidelberg, Germany The Seventh International Conference on Intelligent Systems for Molecular Biology ______________________________________ Final Program and Detailed Schedule Friday, August 6, 1999 Tutorial Day The tutorials will take place in the following rooms: 8:30 – 12:30 (Coffee break around 10:30) Tutorial #1 Trübnersaal Piere Baldi Probabilistic graphical models Tutorial #2 Robert-Schumann-Zimmer Douglas L. Brutlag Bioinformatics and Molecular Biology Tutorial #3 Ballsaal Martin Reese The challenge of annotating a complete eukaryotic genome: A case study in Drosophila melanogaster Tutorial #4 Gustav-Mahler-Zimmer Tandy Warnow Computational and statistical Junhyong Kim challenges involved in reconstructing evolutionary trees Tutorial #5 Sebastian-Münster-Saal Thomas Werner The biology and bioinformatics of regulatory regions in genomes Lunch (on this day served in "Grosser Saal" on the ground floor) 13:30 – 17:30 (Coffee break around 15:30) Tutorial #6 Sebastian-Münster-Saal Rob Miller EST Clustering Alan Christoffels Winston Hide Tutorial #7 Trübnersaal Kevin Karplus Getting the most out of hidden Markov Melissa Cline models Christian Barrett Tutorial #8 Robert-Schumann-Zimmer Arthur Lesk Sequence-structure relationships and evolutionary structure changes in proteins Tutorial #9 Gustav-Mahler-Zimmer David States PERL abstractions for databases and Brian Dunford distributed computing Shore Tutorial # 10 Ballsaal Zoltan Szallasi Genetic network analysis
    [Show full text]
  • STOP HIV/AIDS Backgrounder
    TREATMENT AS PREVENTION STOP HIV/AIDS STOP HIV/AIDS In 2008, the BC-CfE developed a mathematical On February 4, 2010, then-Minister of Health “TREATMENT AS model and a cost-effectiveness evaluation Kevin Falcon announced a four-year $48 million that suggested that the expansion of HAART pilot aimed to evaluate the impact of the HIV PREVENTION” IN coverage would be highly cost-effective, as it Treatment as Prevention strategy in BC. The would prevent AIDS morbidity and mortality. strategy is a pilot project called STOP HIV/AIDS ACTION However, when the impact of the expansion of (Seek and Treat for Optimal Prevention of HIV HAART coverage on HIV transmission was taken and AIDS). Extending HAART via Treatment as into account, the strategy was found to have the The STOP HIV/AIDS pilot aims to expand access Prevention saves lives, prevents potential of becoming cost-averting, as it would to HIV/AIDS treatment and medication among HIV transmission and reduces health costs virtually eliminate vertical transmission of HIV, hard-to-reach and vulnerable populations in and dramatically reduce HIV transmission by all hile an outright cure or a preventive Vancouver’s Downtown Eastside and Prince other routes. Wvaccine for HIV/AIDS remains elusive, George. These regions had been identified as remarkable advances in HIV treatment have Most recently, HPTN 052—a randomized trial of priority sites for the pilot project because they been achieved over the past two decades. HIV sero-discordant (primarily heterosexual) were disproportionately affected by HIV/AIDS. Most significant among these advances is the couples—provided definitive proof of the The pilot project specifically serves individuals development of highly active antiretroviral efficacy of HIV Treatment as Prevention.
    [Show full text]
  • Killam Prizes | Prix Killam
    Killam Prizes | Prix Killam Year | Winners | University | Discipline Année Gagnants Université 2021 Michel Bouvier Université de Montréal Health Sciences | sciences de la santé Stephen R. Gill York University Social Sciences | sciences sociales Gilbert Laporte HEC Montréal Engineering | génie Arthur Ripstein University of Toronto Humanities | sciences humaines Douglas Stephan University of Toronto Natural Sciences | sciences de la nature 2020 Cecilia Benoit University of Victoria Social Sciences | sciences sociales Sarah Carter University of Alberta Humanities | sciences humaines Alan Evans Montreal Neurological Institute Health Sciences | sciences de la santé Ted Sargent University of Toronto Engineering | génie Barbara Sherwood Lollar University of Toronto Natural Sciences | sciences de la nature 2019 Yoshua Bengio Université de Montréal Natural Sciences | sciences de la nature André Blais Université de Montréal Social Sciences | sciences sociales Keith W. Hipel University of Waterloo Engineering | génie Stephen W. Scherer University of Toronto Health Sciences | sciences de la santé Lynne Viola University of Toronto Humanities | sciences humaines 2018 André Gaudreault Université de Montréal Humanities | sciences humaines Vladimir Hachinski Western University Health Sciences | sciences de la santé Walter Herzog University of Calgary Engineering | génie James Pinfold University of Alberta Natural Sciences | sciences de la nature Janet Werker University of British Columbia Social Sciences | sciences sociales Canada Council for the Arts
    [Show full text]
  • Download File
    Topics in Signal Processing: applications in genomics and genetics Abdulkadir Elmas Submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in the Graduate School of Arts and Sciences COLUMBIA UNIVERSITY 2016 c 2016 Abdulkadir Elmas All Rights Reserved ABSTRACT Topics in Signal Processing: applications in genomics and genetics Abdulkadir Elmas The information in genomic or genetic data is influenced by various complex processes and appropriate mathematical modeling is required for studying the underlying processes and the data. This dissertation focuses on the formulation of mathematical models for certain problems in genomics and genetics studies and the development of algorithms for proposing efficient solutions. A Bayesian approach for the transcription factor (TF) motif discovery is examined and the extensions are proposed to deal with many interdependent parameters of the TF-DNA binding. The problem is described by statistical terms and a sequential Monte Carlo sampling method is employed for the estimation of unknown param- eters. In particular, a class-based resampling approach is applied for the accurate estimation of a set of intrinsic properties of the DNA binding sites. Through statistical analysis of the gene expressions, a motif-based computational approach is developed for the inference of novel regulatory networks in a given bacterial genome. To deal with high false-discovery rates in the genome-wide TF binding predictions, the discriminative learning approaches are examined in the context of sequence classification, and a novel mathematical model is introduced to the family of kernel-based Support Vector Machines classifiers. Furthermore, the problem of haplotype phasing is examined based on the genetic data obtained from cost-effective genotyping technologies.
    [Show full text]
  • Baldi Bioinformatics
    Vol. 15 no. 11 1999 BIOINFORMATICS Pages 865–866 A.M. Shmatkov, A.A. Melikyan, F.L. Chernousko and Editorial M. Borodovsky’s paper, ‘Finding prokaryotic genes by the “frame-by-frame” algorithm: targeting gene starts THE SECOND GEORGIA TECH INTERNA- and overlapping genes’, addresses the important practical TIONAL CONFERENCE ON BIOINFORMATICS: problem of gene recognition in prokaryotic genomes, SEQUENCE, STRUCTURE AND FUNCTION of which over a dozen have already been completely (NOVEMBER 11–14, 1999, ATLANTA, GEORGIA, sequenced. The authors suggest an approach to one of USA) the few remaining open problems in prokaryotic gene finding: accurate prediction of gene starts allowing for Steering & Program Committee: Pierre Baldi, Mark the possibility of overlapping protein-coding regions, a Borodovsky, Soren Brunak, Chris Burge, Jim Fickett, relatively common occurrence in prokaryotic genomes Steven Henikoff, Eugene Koonin, Andrej Sali, Chris which seems to be rare in eukaryotes. Their algorithm Sander, Gary Stormo involves application of a hidden Markov model of gene structure to each of the six global reading frames (three on This issue of Bioinformatics contains reports on selected each strand) of a genome separately, followed by a simple papers presented at the international conference in At- post-processing step to remove completely overlapping lanta. The conference was held at one of the midtown genes which rarely occur in nature. Promising results are hotels offering a magnificent bird’s-eye view to the obtained in identifying gene starts, and the possibility of cosmopolitan capital of the Southeast of the USA. The systematic biases in the annotation of several bacterial conference agenda included keynote lectures by Russell genomes is raised.
    [Show full text]
  • Gene Regulatory Network Inference Using Machine Learning Techniques
    GENE REGULATORY NETWORK INFERENCE USING MACHINE LEARNING TECHNIQUES Stephanie Kamgnia Wonkap A Thesis in the department of Computer Science and Software Engineering Presented in Partial Fulfillment of the Requirements For the Degree of Doctor of Philosophy(Computer Science) Concordia University Montreal,´ Quebec,´ Canada August 26 2020 c Stephanie Kamgnia Wonkap, 2020 Concordia University School of Graduate Studies This is to certify that the thesis prepared By: Miss. Stephanie Kamgnia Wonkap Entitled: Gene Regulatory Network Inference using Machine Learning Techniques and submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy (Computer Science) complies with the regulations of this University and meets the accepted standards with respect to originality and quality. Signed by the final examining committee: Chair Dr. Liangzhu Wang External Examiner Dr. Mathieu Blanchette Examiner Dr. Leila Kosseim Examiner Dr. Malcolm Whiteway Examiner Dr. Volker Haarslev Supervisor Dr. Gregory Butler Approved Dr. Leila Kosseim, Graduate Program Director August 26th, 2020 Dr. Amir Asif, Dean Date of Defence Faculty of Engineering and Computer Science Abstract Gene Regulatory Network Inference using Machine Learning Techniques Stephanie Kamgnia Wonkap, Ph.D. Concordia University, 2020 Systems Biology is a field that models complex biological systems in order to better understand the working of cells and organisms. One of the systems modeled is the gene regulatory network that plays the critical role of controlling an organism's response to changes in its environment. Ideally, we would like a model of the complete gene regulatory network. In recent years, several advances in technology have permitted the collection of an unprecedented amount and variety of data such as genomes, gene expression data, time-series data, and perturbation data.
    [Show full text]
  • Highlights (PDF)
    GENETICS A PERIODICAL RECORD OF INVESTIGATIONS BEARING ON HEREDITY AND VARIATION Founded in 1916 and published by The Genetics Society of America VOLUME 179, MAY–AUGUST 2008 GENETICS VOLUME 179, MAY–AUGUST 2008 EDITORIAL BOARD Elizabeth W. Jones, Editor-in-Chief Carnegie Mellon University Mark Johnston, Acting Editor-in-Chief Washington University School of Medicine Montserrat Aguade´ Kent Golic Rasmus Nielsen Universitat de Barcelona University of Utah University of Copenhagen, Centre for Bioinformatics Eric E. Alani Susan Gottesman Cornell University National Institutes of Health-NCI Michael Nonet Washington University School of Medicine Kathryn V. Anderson David I. Greenstein Sloan-Kettering Institute University of Minnesota Magnus Nordborg University of Southern California Brenda J. Andrews David Jonah Grunwald University of Utah University of Toronto Peter J. Oefner hris aley Robert R. H. Anholt C H Stanford University Roslin Institute (Edinburgh) North Carolina State University Andrew Paterson ichael ampsey Elja Arjas M H University of Georgia University of Helsinki Robert Wood Johnson Medical School-UMDNJ David Rand orman rnheim Brown University N A awrence arshman University of Southern California L G. H University of Nebraska, Lincoln Eric J. Richards onnie artel Washington University B B ancy ollingsworth Rice University N H Stony Brook University Mark D. Rose David Begun afri umayun Princeton University University of California, Davis M. Z H UMDNJ-New Jersey Medical School Paul Russell ames irchler J A. B ancy enkins The Scripps Research Institute University of Missouri N A. J National Cancer Institute-FCRDC Matthew S. Sachs arl roman K W. B homas aufman Texas A&M University University of Wisconsin, Madison T C.
    [Show full text]
  • Dr. Julio Montaner Honoured As Groundbreaking Physician by Canada Post
    Journal of the BC Centre for Excellence in HIV/AIDS St. Paul’s Hospital, forecast SEPT 2020 Vancouver, B.C. Dr. Julio Montaner Honoured as Groundbreaking Physician by Canada Post his September, the BC-CfE’s Executive Director & Eric Harris, the board chair at Providence Health Care, TasP ® then became the cornerstone of UNAIDS’ 90-90- Physician-in-Chief Dr. Julio Montaner was praised Dr. Montaner, and said, “He is brilliantly fearless, 90 strategy in 2014, and recently helped inspire another honoured by Canada Post as part of their Medical to the point of being breathtaking on occasion, but it’s game-changing initiative, U=U. The U=U campaign Groundbreakers stamp series. always backed up by organization, discipline, and informs the public, healthcare professionals and policy- T ® clinical work.” makers about the tenets of TasP , that those living with The new stamps honour groundbreaking contributions HIV who are successfully on HAART treatment and have to medicine and include six Canadian physicians and Those organizational skills were on full display as an undetectable viral load are untransmittable. researchers who revolutionized their fields of health care. Dr. Montaner worked as a guiding force behind the The continued success of TasP ® in reducing new 1996 International AIDS Society Conference, where he Alongside Dr. Montaner, those chosen by Canada Post diagnoses led St. Paul’s Hospital to repurpose their introduced HAART as a new global standard of care for include Drs. Bruce Chown, Balfour Mount, M. Vera AIDS Ward in 2014 and Dr. Montaner to declare the end HIV. The 1996 conference is widely seen as a turning Peters, James Till and Ernest McCulloch.
    [Show full text]