Concepts, Historical Milestones and the Central Place of Bioinformatics in Modern Biology: a European Perspective
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Applied Category Theory for Genomics – an Initiative
Applied Category Theory for Genomics { An Initiative Yanying Wu1,2 1Centre for Neural Circuits and Behaviour, University of Oxford, UK 2Department of Physiology, Anatomy and Genetics, University of Oxford, UK 06 Sept, 2020 Abstract The ultimate secret of all lives on earth is hidden in their genomes { a totality of DNA sequences. We currently know the whole genome sequence of many organisms, while our understanding of the genome architecture on a systematic level remains rudimentary. Applied category theory opens a promising way to integrate the humongous amount of heterogeneous informations in genomics, to advance our knowledge regarding genome organization, and to provide us with a deep and holistic view of our own genomes. In this work we explain why applied category theory carries such a hope, and we move on to show how it could actually do so, albeit in baby steps. The manuscript intends to be readable to both mathematicians and biologists, therefore no prior knowledge is required from either side. arXiv:2009.02822v1 [q-bio.GN] 6 Sep 2020 1 Introduction DNA, the genetic material of all living beings on this planet, holds the secret of life. The complete set of DNA sequences in an organism constitutes its genome { the blueprint and instruction manual of that organism, be it a human or fly [1]. Therefore, genomics, which studies the contents and meaning of genomes, has been standing in the central stage of scientific research since its birth. The twentieth century witnessed three milestones of genomics research [1]. It began with the discovery of Mendel's laws of inheritance [2], sparked a climax in the middle with the reveal of DNA double helix structure [3], and ended with the accomplishment of a first draft of complete human genome sequences [4]. -
Gene Prediction: the End of the Beginning Comment Colin Semple
View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by PubMed Central http://genomebiology.com/2000/1/2/reports/4012.1 Meeting report Gene prediction: the end of the beginning comment Colin Semple Address: Department of Medical Sciences, Molecular Medicine Centre, Western General Hospital, Crewe Road, Edinburgh EH4 2XU, UK. E-mail: [email protected] Published: 28 July 2000 reviews Genome Biology 2000, 1(2):reports4012.1–4012.3 The electronic version of this article is the complete one and can be found online at http://genomebiology.com/2000/1/2/reports/4012 © GenomeBiology.com (Print ISSN 1465-6906; Online ISSN 1465-6914) Reducing genomes to genes reports A report from the conference entitled Genome Based Gene All ab initio gene prediction programs have to balance sensi- Structure Determination, Hinxton, UK, 1-2 June, 2000, tivity against accuracy. It is often only possible to detect all organised by the European Bioinformatics Institute (EBI). the real exons present in a sequence at the expense of detect- ing many false ones. Alternatively, one may accept only pre- dictions scoring above a more stringent threshold but lose The draft sequence of the human genome will become avail- those real exons that have lower scores. The trick is to try and able later this year. For some time now it has been accepted increase accuracy without any large loss of sensitivity; this deposited research that this will mark a beginning rather than an end. A vast can be done by comparing the prediction with additional, amount of work will remain to be done, from detailing independent evidence. -
Meeting Review: Bioinformatics and Medicine – from Molecules To
Comparative and Functional Genomics Comp Funct Genom 2002; 3: 270–276. Published online 9 May 2002 in Wiley InterScience (www.interscience.wiley.com). DOI: 10.1002/cfg.178 Feature Meeting Review: Bioinformatics And Medicine – From molecules to humans, virtual and real Hinxton Hall Conference Centre, Genome Campus, Hinxton, Cambridge, UK – April 5th–7th Roslin Russell* MRC UK HGMP Resource Centre, Genome Campus, Hinxton, Cambridge CB10 1SB, UK *Correspondence to: Abstract MRC UK HGMP Resource Centre, Genome Campus, The Industrialization Workshop Series aims to promote and discuss integration, automa- Hinxton, Cambridge CB10 1SB, tion, simulation, quality, availability and standards in the high-throughput life sciences. UK. The main issues addressed being the transformation of bioinformatics and bioinformatics- based drug design into a robust discipline in industry, the government, research institutes and academia. The latest workshop emphasized the influence of the post-genomic era on medicine and healthcare with reference to advanced biological systems modeling and simulation, protein structure research, protein-protein interactions, metabolism and physiology. Speakers included Michael Ashburner, Kenneth Buetow, Francois Cambien, Cyrus Chothia, Jean Garnier, Francois Iris, Matthias Mann, Maya Natarajan, Peter Murray-Rust, Richard Mushlin, Barry Robson, David Rubin, Kosta Steliou, John Todd, Janet Thornton, Pim van der Eijk, Michael Vieth and Richard Ward. Copyright # 2002 John Wiley & Sons, Ltd. Received: 22 April 2002 Keywords: bioinformatics; -
Toolboxes for a Standardised and Systematic Study of Glycans
Campbell et al. BMC Bioinformatics 2014, 15(Suppl 1):S9 http://www.biomedcentral.com/1471-2105/15/S1/S9 RESEARCH Open Access Toolboxes for a standardised and systematic study of glycans Matthew P Campbell1, René Ranzinger2, Thomas Lütteke3, Julien Mariethoz4, Catherine A Hayes5, Jingyu Zhang1, Yukie Akune6, Kiyoko F Aoki-Kinoshita6, David Damerell7,11, Giorgio Carta8, Will S York2, Stuart M Haslam7, Hisashi Narimatsu9, Pauline M Rudd8, Niclas G Karlsson4, Nicolle H Packer1, Frédérique Lisacek4,10* From Integrated Bio-Search: 12th International Workshop on Network Tools and Applications in Biology (NETTAB 2012) Como, Italy. 14-16 November 2012 Abstract Background: Recent progress in method development for characterising the branched structures of complex carbohydrates has now enabled higher throughput technology. Automation of structure analysis then calls for software development since adding meaning to large data collections in reasonable time requires corresponding bioinformatics methods and tools. Current glycobioinformatics resources do cover information on the structure and function of glycans, their interaction with proteins or their enzymatic synthesis. However, this information is partial, scattered and often difficult to find to for non-glycobiologists. Methods: Following our diagnosis of the causes of the slow development of glycobioinformatics, we review the “objective” difficulties encountered in defining adequate formats for representing complex entities and developing efficient analysis software. Results: Various solutions already implemented and strategies defined to bridge glycobiology with different fields and integrate the heterogeneous glyco-related information are presented. Conclusions: Despite the initial stage of our integrative efforts, this paper highlights the rapid expansion of glycomics, the validity of existing resources and the bright future of glycobioinformatics. -
Life with 6000 Genes Author(S): A. Goffeau, B. G. Barrell, H. Bussey, R
Life with 6000 Genes Author(s): A. Goffeau, B. G. Barrell, H. Bussey, R. W. Davis, B. Dujon, H. Feldmann, F. Galibert, J. D. Hoheisel, C. Jacq, M. Johnston, E. J. Louis, H. W. Mewes, Y. Murakami, P. Philippsen, H. Tettelin and S. G. Oliver Source: Science, Vol. 274, No. 5287, Genome Issue (Oct. 25, 1996), pp. 546+563-567 Published by: American Association for the Advancement of Science Stable URL: https://www.jstor.org/stable/2899628 Accessed: 28-08-2019 13:31 UTC REFERENCES Linked references are available on JSTOR for this article: https://www.jstor.org/stable/2899628?seq=1&cid=pdf-reference#references_tab_contents You may need to log in to JSTOR to access the linked references. JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms of scholarship. For more information about JSTOR, please contact [email protected]. Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at https://about.jstor.org/terms American Association for the Advancement of Science is collaborating with JSTOR to digitize, preserve and extend access to Science This content downloaded from 152.3.43.41 on Wed, 28 Aug 2019 13:31:29 UTC All use subject to https://about.jstor.org/terms by FISH across the whole genome. A subset of the 37. G. D. Billingsleyet al., Am. J. Hum. Genet. 52, 343 (1994); L. -
ELIXIR Poster Numbers: P El001 - 037 Application Posters: P El034 - 037
POSTER LIST ORDERED ALPHABETICALLY BY POSTER TITLE GROUPED BY THEME/TRACK THEME/TRACK: ELIXIR Poster numbers: P_El001 - 037 Application posters: P_El034 - 037 Poster EasyChair Presenting Author list Title Abstract Theme/track Topics number number author P_El001 714 Joan Segura, Daniel Joan Segura 3DBIONOTES: Unifying molecular biology With the advent of next generation sequencing methods, the amount of proteomic and genomic information is growing faster than ever. Several projects have been undertaken to annotate the ELIXIR poster ELIXIR Tabas Madrid, Ruben information genomes of most important organisms, including human. For example, the GENECODE project seeks to enhance all human genes including protein-coding loci with alternatively splices Sanchez-Garcia, Jesús variants, non-coding loci and pseudogenes. Another example is the 1000 genomes, a repository of human genetic variations, including SNPs and structural variants, and their haplotype Cuenca, Carlos Oscar contexts. These projects feed most relevant biological databases as UNIPROT and ENSEMBL, extending the amount of available annotation for genes and proteins.Genomic and proteomic Sánchez Sorzano, Ardan annotations are a valuable contribution in the study of protein and gene functions. However, structural information is an essential key for a deeper understanding of the molecular properties Patwardhan and Jose that allow proteins and genes to perform specific tasks. Therefore, depicting genomic and proteomic information over structural data would offer a very complete picture in order to understand Maria Carazo how proteins and genes behave in the different cellular processes.In this work we present the second version of a web platform -3DBIONOTES- that aims to merge the different levels of molecular biology information, including genomics, proteomics and interactomics data into a unique graphical environment. -
Comprehensive Analysis of CRISPR/Cas9-Mediated Mutagenesis in Arabidopsis Thaliana by Genome-Wide Sequencing
International Journal of Molecular Sciences Article Comprehensive Analysis of CRISPR/Cas9-Mediated Mutagenesis in Arabidopsis thaliana by Genome-Wide Sequencing Wenjie Xu 1,2 , Wei Fu 2, Pengyu Zhu 2, Zhihong Li 1, Chenguang Wang 2, Chaonan Wang 1,2, Yongjiang Zhang 2 and Shuifang Zhu 1,2,* 1 College of Plant Protection, China Agricultural University, Beijing 100193 China 2 Institute of Plant Quarantine, Chinese Academy of Inspection and Quarantine, Beijing 100176, China * Correspondence: [email protected] Received: 9 July 2019; Accepted: 21 August 2019; Published: 23 August 2019 Abstract: The clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated protein (Cas) system has been widely applied in functional genomics research and plant breeding. In contrast to the off-target studies of mammalian cells, there is little evidence for the common occurrence of off-target sites in plants and a great need exists for accurate detection of editing sites. Here, we summarized the precision of CRISPR/Cas9-mediated mutations for 281 targets and found that there is a preference for single nucleotide deletions/insertions and longer deletions starting from 40 nt upstream or ending at 30 nt downstream of the cleavage site, which suggested the candidate sequences for editing sites detection by whole-genome sequencing (WGS). We analyzed the on-/off-target sites of 6 CRISPR/Cas9-mediated Arabidopsis plants by the optimized method. The results showed that the on-target editing frequency ranged from 38.1% to 100%, and one off target at a frequency of 9.8%–97.3% cannot be prevented by increasing the specificity or reducing the expression level of the Cas9 enzyme. -
The Ethos and Effects of Data-Sharing Rules: Examining The
Informed consent for: "The ethos and effects of data-sharing rules: Examining the history of the 'Bermuda principles' and their effects on 21 st century science" University of Adelaide Duke University Researchers at the University of Adelaide, Australia, and the IGSP Center for Genome Ethics, Law & Policy, Duke University, are engaged in research on the Bermuda Principles for sharing DNA sequence data from high-volume sequencing centers. You have been selected for an interview because we believe that the recollections you may have of your experiences with the International Strategy Meetings for Human Genome Sequencing (1996-1998) will be interesting and helpful for our project. We expect that interviews will last from 30 minutes to much longer, but you may stop your interview at any time. Your participation is strictly voluntary, and you do not have to answer every question asked. Your interview is being recorded and we may take written notes during the interview. After your interview, we may prepare a typed transcript of the interview. If we prepare a transcript, you will have an opportunity to review it and to make deletions and corrections. Unless you indicate otherwise, the information that you provide in this interview will be "on the record"-that is, it can be attributed to you in the various articles and chapters that we plan to write, and thus could become public through these channels. Jf, however, at some point in the interview you want to provide us with information that might be useful for us to know, but which you do not want to have attributed to you, you should tell us that you wish to go "off the record" and we will stop the recording. -
Introduction to Bioinformatics (Elective) – SBB1609
SCHOOL OF BIO AND CHEMICAL ENGINEERING DEPARTMENT OF BIOTECHNOLOGY Unit 1 – Introduction to Bioinformatics (Elective) – SBB1609 1 I HISTORY OF BIOINFORMATICS Bioinformatics is an interdisciplinary field that develops methods and software tools for understanding biologicaldata. As an interdisciplinary field of science, bioinformatics combines computer science, statistics, mathematics, and engineering to analyze and interpret biological data. Bioinformatics has been used for in silico analyses of biological queries using mathematical and statistical techniques. Bioinformatics derives knowledge from computer analysis of biological data. These can consist of the information stored in the genetic code, but also experimental results from various sources, patient statistics, and scientific literature. Research in bioinformatics includes method development for storage, retrieval, and analysis of the data. Bioinformatics is a rapidly developing branch of biology and is highly interdisciplinary, using techniques and concepts from informatics, statistics, mathematics, chemistry, biochemistry, physics, and linguistics. It has many practical applications in different areas of biology and medicine. Bioinformatics: Research, development, or application of computational tools and approaches for expanding the use of biological, medical, behavioral or health data, including those to acquire, store, organize, archive, analyze, or visualize such data. Computational Biology: The development and application of data-analytical and theoretical methods, mathematical modeling and computational simulation techniques to the study of biological, behavioral, and social systems. "Classical" bioinformatics: "The mathematical, statistical and computing methods that aim to solve biological problems using DNA and amino acid sequences and related information.” The National Center for Biotechnology Information (NCBI 2001) defines bioinformatics as: "Bioinformatics is the field of science in which biology, computer science, and information technology merge into a single discipline. -
Embnet Conference 2020 Programme
EMBnet Conference 2020 Bioinformatics Approaches to Precision Research 23-24 September 2020 Zoom Platform - Time 3 pm to 8 pm (CEST) Registration: https://us02web.zoom.us/meeting/register/tZEvd-mupjwiE911qtAnYln4TrNwQlxDMYm4 Programme 1st Day: 23 September 9, 2020 15:00-15:10 Welcome & Programme Presentation Domenica D’Elia & Erik Bongcam-Rudloff 15:10-15:30 Exosomes in breast milk; a beneficial genetic trojan horse from mother to child Dimitrios Vlachakis, Aspasia Efthimiadou, Flora Bacopoulou, Elias Eliopoulos, George P. Chrousos 15:30-15:50 Precision dairy farming – A Phenomenal opportunity Tomas Klingström 15:50-16:10 Genome regulation by long non-coding RNAs Katerina Pierouli, George N. Goulielmos, Elias Eliopoulos, Dimitrios Vlachakis European Molecular Biology Network (EMBnet) and Global Bioinformatics Network since 1988 16:10-16:30 Precision Epidemiology of Multi-drug resistance bacteria: bioinformatics tools J.Donato, L. Lugo, H. Perez, H. Ballen, D. Talero, S. Prada, F.Brion, V. Rincon, L. Falquet, M.T. Reguero, E. Barreto-Hernandez 16:30-16:50 Mechanisms of epigenetic inheritance in children following exposure to abuse E. Damaskopoulou, G.P. Chrousos, E. Eliopoulos, D. Vlachakis 16:50-17:00 Break 17:00-17:20 Genome-wide association studies (GWAS) in an effort to provide insights into the complex interplay of nuclear receptor transcriptional networks and their contribution to the maintenance of homeostasis Thanassis Mitsis, G.P. Chrousos, E. Eliopoulos, D.Vlachakis 17:20-17:40 Computer aided drug design and pharmacophore modelling towards the discovery of novel anti-ebola agents Kalliopi Io Diakou, G.P. Chrousos, E. Eliopoulos, D. Vlachakis 17:40-18:00 3′-Tag RNA-sequencing Andreas Gisel 18:00-18:20 Expression profiling of non-coding RNA in coronaviruses provides clues for virus RNA interference with immune system response Arianna Consiglio, V.F. -
The Uniprot Knowledgebase
Bringing bioinformatics into the classroom The UniProtIntroducing Knowledgebase UniProtKB Computer-Aided Drug Design A PRACTICAL GUIDE 1 Version: 30 November 2020 A Practical Guide to Computer-Aided Drug Design Designing tomorrow’s drugs Overview This Practical Guide outlines basic computational approaches used in drug discovery. It highlights how bioinformatics can be harnessed to design drug candidates, to predict their affinity for their targets, their fate inside the body, their toxicity and possible side-effects. Teaching Goals & Learning Outcomes This Guide introduces bioinformatics tools for designing candidate drug molecules, and for predicting their likely target protein(s) and their drug-like properties. On reading the Guide and completing the exercises, you will be able to: • design drug-candidate molecules using the structures of known drugs as templates, and dock them to known protein targets; • compare the protein target-binding strengths of drug candidates with those of known drugs; • calculate properties of drug candidates and infer whether they need chemical modification to make them more drug-like; • predict the protein(s) that a drug candidate is likely to target; • create molecular fingerprints for known drugs, and use these to quantify their similarity. simple computational methodologies to conceive and evaluate mol- 13 1 Introduction ecules for their potential to become drugs . Although macromolec- ular entities (e.g., like antibodies) can act as therapeutic agents, here we consider drugs as small organic molecules (less than ~100 Over the past century, the design and production of drugs has had atoms) that activate or inhibit the functions of proteins, ultimately a beneficial impact on life expectancy and quality1,2. -
Contcenter for Genomic Regul
CONTCENTER FOR GENOMIC REGUL CRG SCIENTIFIC STRUCTURE . 4 CRG MANAGEMENT STRUCTURE . 6 CRG SCIENTIFIC ADVISORY BOARD (SAB) . 8 CRG BUSINESS BOARD . 9 YEAR RETROSPECT BY THE DIRECTOR OF THE CRG: MIGUEL BEATO . 10 GENE REGULATION. 14 p Chromatin and gene expression .....................16 p Transcriptional regulation and chromatin remodelling .....19 p Regulation of alternative pre-mRNA splicing during cell . 22 differentiation, development and disease p RNA interference and chromatin regulation . 26 p RNA-protein interactions and regulation . 30 p Regulation of protein synthesis in eukaryotes . 33 p Translational control of gene expression . 36 DIFFERENTIATION AND CANCER ...........................40 p Hematopoietic differentiation and stem cell biology..........42 p Myogenesis.....................................46 p Epigenetics events in cancer.......................49 p Epithelial homeostasis and cancer ...................52 ENTSATION ANNUAL REPORT 2006 GENES AND DISEASE .................................56 p Genetic causes of disease .............................58 p Gene therapy ......................................63 p Murine models of disease .............................66 p Neurobehavioral phenotyping of mouse models of disease .....68 p Gene function ......................................73 p Associated Core Facility: Genotyping Unit..................76 BIOINFORMATICS AND GENOMICS ..........................80 p Bioinformatics and genomics ...........................82 p Genomic analysis of development and disease ..............86