Bibliography

Cover Page

The handle http://hdl.handle.net/1887/87513 holds various files of this Leiden University dissertation.

Author: Khachatryan, L.
Title: Metagenomics: beyond the horizon of current implementations and methods
Issue Date: 2020-04-28

[1] WB Whitman, DC Coleman, and WJ Wiebe. Prokaryotes: the unseen majority. Proceedings of the National Academy of Sciences, 95(12):6578–6583, 1998.
[2] AA Juwarkar, SK Yadav, PR Thawale, P Kumar, SK Singh, and T Chakrabarti. Developmental strategies for sustainable ecosystem on mine spoil dumps: a case of study. Environmental monitoring and assessment, 157(1-4):471–481, 2009.
[3] C Pedros-Alio. Dipping into the rare biosphere. Science, 5809:192, 2007.
[4] C Pedros-Alio. The rare bacterial biosphere. Annual review of marine science, 4:449–466, 2012.
[5] GT Pecl, MB Araujo, JD Bell, J Blanchard, TC Bonebrake, I-C Chen, TD Clark, RK Colwell, F Danielsen, B Evengaard, et al. Biodiversity redistribution under climate change: Impacts on ecosystems and human well-being. Science, 355(6332):eaai9214, 2017.
[6] K Todar. Bacteria and archaea and the cycles of elements in the environment. Retrieved June, 7:2014, 2012.
[7] M Paumann, G Regelsberger, C Obinger, and GA Peschek. The bioenergetic role of dioxygen and the terminal oxidase(s) in cyanobacteria. Biochimica et Biophysica Acta (BBA)-Bioenergetics, 1707(2-3):231–253, 2005.
[8] PG Falkowski and LV Godfrey. Electrons, life and the evolution of earth’s oxygen cycle. Philosophical Transactions of the Royal Society of London B: Biological Sciences, 363(1504):2705–2716, 2008.
[9] C Gougoulias, JM Clark, and LJ Shaw. The role of soil microbes in the global carbon cycle: tracking the below-ground microbial processing of plant-derived carbon for manipulating carbon dynamics in agricultural systems. Journal of the Science of Food and Agriculture, 94(12):2362–2371, 2014.
[10] MG Klotz, DA Bryant, and TE Hanson. The microbial sulfur cycle. Frontiers in microbiology, 2:241, 2011.
[11] J Chen, Y Li, Y Tian, C Huang, D Li, Q Zhong, and X Ma. Interaction between microbes and host intestinal health: modulation by dietary nutrients and gut-brain-endocrine-immune axis. Current Protein and Peptide Science, 16(7):592–603, 2015.
[12] SE Erdman and T Poutahidis. Microbes and oxytocin: benefits for host physiology and behavior. In International review of neurobiology, volume 131, pages 91–126. Elsevier, 2016.
[13] E Patterson, JF Cryan, GF Fitzgerald, RP Ross, TG Dinan, and C Stanton. Gut microbiota, the pharmabiotics they produce and host health. Proceedings of the Nutrition Society, 73(4):477–489, 2014.
[14] LK Ursell, W van Treuren, JL Metcalf, M Pirrung, A Gewirtz, and R Knight. Replenishing our defensive microbes. Bioessays, 35(9):810–817, 2013.
[15] TA Van der Meulen, HJM Harmsen, H Bootsma, FKL Spijkervet, FGM Kroese, and A Vissink. The microbiome–systemic diseases connection. Oral diseases, 22(8):719–734, 2016.
[16] GJM Christensen and H Bruggemann. Bacterial skin commensals and their role as host guardians. Beneficial microbes, 5(2):201–215, 2013.
[17] Y Belkaid and S Tamoutounour. The influence of skin microorganisms on cutaneous immunity. Nature Reviews Immunology, 16(6):353, 2016.
[18] PC Calder. Feeding the immune system. Proceedings of the Nutrition Society, 72(3):299–309, 2013.
[19] DA Cowan, J-B Ramond, TP Makhalanyane, and P de Maayer. Metagenomics of extreme environments. Current opinion in microbiology, 25:97–102, 2015.
[20] P Blum. Archaea: Ancient Microbes, Extreme Environments, and the Origin of Life, volume 50. Gulf Professional Publishing, 2001.
[21] L Tazi, DP Breakwell, AR Harker, and KA Crandall. Life in extreme environments: microbial diversity in great salt lake, utah. Extremophiles, 18(3):525–535, 2014.
[22] C Bang, T Dagan, P Deines, N Dubilier, WJ Duschl, S Fraune, U Hentschel, H Hirt, N Hulter, T Lachnit, et al. Metaorganisms in extreme environments: do microbes play a role in organismal adaptation? Zoology, 2018.
[23] J Rifkin. The biotech century. Sonoma County Earth First/Biotech Last, 1998.
[24] S Farmer. Probiotic, lactic acid-producing bacteria and uses thereof, October 8 2002. US Patent 6,461,607.
[25] JR Postgate. Economic importance of sulphur bacteria. Phil. Trans. R. Soc. Lond. B, 298(1093):583–600, 1982.
[26] M Fernandez, B Del Rio, DM Linares, MC Martin, and MA Alvarez. Real-time polymerase chain reaction for quantitative detection of histamine-producing bacteria: use in cheese production. Journal of dairy science, 89(10):3763–3769, 2006.
[27] A Mayra-Makinen and M Bigret. Industrial use and production of lactic acid bacteria. Food Science and Technology, 139:175–198, 2004.
[28] BS Dien, MA Cotta, and TW Jeffries. Bacteria engineered for fuel ethanol production: current status. Applied microbiology and biotechnology, 63(3):258–266, 2003.
[29] SF Bender, C Wagg, and MGA van der Heijden. An underground revolution: biodiversity and soil ecological engineering for agricultural sustainability. Trends in Ecology & Evolution, 31(6):440–452, 2016.
[30] M-N Xing, X-Z Zhang, and H Huang. Application of metagenomic techniques in mining enzymes from microbial communities for biofuel synthesis. Biotechnology advances, 30(4):920–929, 2012.
[31] M Wainwright, J Lederberg, and J Lederberg. History of microbiology. Encyclopedia of microbiology, 2:419–437, 1992.
[32] Y Stanier, M Doudoroff, EA Adelberg, et al. General microbiology. General microbiology., 1958.
[33] N Hall. Advanced sequencing technologies and their wider impact in microbiology. Journal of Experimental Biology, 210(9):1518–1525, 2007.
[34] SE Hasnain. Impact of human genome sequencing on microbiology. Indian journal of medical microbiology, 19(3):114, 2001.
[35] JT Staley and A Konopka. Measurement of in situ activities of nonphotosynthetic microorganisms in aquatic and terrestrial habitats. Annual Reviews in Microbiology, 39(1):321–346, 1985.
[36] A Escobar-Zepeda, A Vera-Ponce de Leon, and A Sanchez-Flores. The road to metagenomics: from microbiology to dna sequencing technologies and bioinformatics. Frontiers in genetics, 6:348, 2015.
[37] National Research Council et al. The new science of metagenomics: revealing the secrets of our microbial planet. National Academies Press, 2007.
[38] C Simon and R Daniel. Achievements and new knowledge unraveled by metagenomic approaches. Applied microbiology and biotechnology, 85(2):265–276, 2009.
[39] R Knight, A Vrbanac, B C Taylor, A Aksenov, C Callewaert, J Debelius, A Gonzalez, T Kosciolek, L-I McCall, D McDonald, et al. Best practices for analysing microbiomes. Nature Reviews Microbiology, page 1, 2018.
[40] S Junemann, N Kleinbolting, S Jaenicke, C Henke, J Hassa, J Nelkner, Y Stolze, S P Albaum, A Schluter, A Goesmann, et al. Bioinformatics for ngs-based metagenomics and the application to biogas research. Journal of biotechnology, 261:10–23, 2017.
[41] DJ Lane, B Pace, GJ Olsen, DA Stahl, ML Sogin, and NR Pace. Rapid determination of 16s ribosomal rna sequences for phylogenetic analyses. Proceedings of the National Academy of Sciences, 82(20):6955–6959, 1985.
[42] TM Schmidt, EF DeLong, and NR Pace. Analysis of a marine picoplankton community by 16s rrna gene cloning and sequencing. Journal of bacteriology, 173(14):4371–4378, 1991.
[43] PJ Turnbaugh, RE Ley, M Hamady, CM Fraser-Liggett, R Knight, and JI Gordon. The human microbiome project. Nature, 449(7164):804, 2007.
[44] HMP Integrative. The integrative human microbiome project: dynamic analysis of microbiome-host omics profiles during periods of human health and disease. Cell host & microbe, 16(3):276, 2014.
[45] V Robles-Alonso and F Guarner. From basic to applied research: lessons from the human microbiome projects. Journal of clinical gastroenterology, 48:S3–S4, 2014.
[46] SM Bakhtiar, JG LeBlanc, E Salvucci, A Ali, R Martin, P Langella, J-M Chatel, A Miyoshi, LG Bermudez-Humaran, and V Azevedo. Implications of the human microbiome in inflammatory bowel diseases. FEMS microbiology letters, 342(1):10–17, 2013.
[47] J Lloyd-Price, G Abu-Ali, and C Huttenhower. The healthy human microbiome. Genome medicine, 8(1):51, 2016.
[48] XC Morgan, N Segata, and C Huttenhower. Biodiversity and functional genomics in the human microbiome. Trends in genetics, 29(1):51–58, 2013.
[49] X C Morgan, T L Tickle, H Sokol, D Gevers, K L Devaney, D V Ward, J A Reyes, S A Shah, N LeLeiko, S B Snapper, et al. Dysfunction of the intestinal microbiome in inflammatory bowel disease and treatment. Genome biology, 13(9):R79, 2012.
[50] D Gevers, S Kugathasan, L A Denson, Y Vazquez-Baeza, W Van Treuren, B Ren, E Schwager, D Knights, S J Song, M Yassour, et al. The treatment-naive microbiome in new-onset crohn’s disease. Cell host & microbe, 15(3):382–392, 2014.
[51] C Pedros-Alio. Genomics and marine microbial ecology. International Microbiology, 9(3):191–197, 2006.
[52] PG Falkowski, RT Barber, and V Smetacek. Biogeochemical controls and feedbacks on ocean primary production. Science, 281(5374):200–206,
[58] JL Metcalf, ZZ Xu, A Bouslimani, P Dorrestein, DO Carter, and R Knight. Microbiome tools for forensic science. Trends in biotechnology, 35(9):814–823, 2017.
[59] SE Schmedes, AE Woerner, NMM Novroski, FR Wendt, JL King, KM Stephens, and B Budowle. Targeted sequencing of clade-specific markers from skin microbiomes for forensic human identification. Forensic Science International: Genetics, 32:50–61, 2018.
[60] EN Hanssen, E Avershina, K Rudi, P Gill, and L Snipen. Body fluid prediction from microbial patterns for forensic application.
Recommended publications
  • Absent from DNA and Protein: Genomic Characterization Of
    Georgakopoulos-Soares et al. Genome Biology (2021) 22:245 https://doi.org/10.1186/s13059-021-02459-z RESEARCH Open Access Absent from DNA and protein: genomic characterization of nullomers and nullpeptides across functional categories and evolution Ilias Georgakopoulos-Soares1,2†, Ofer Yizhar-Barnea1,2†, Ioannis Mouratidis3, Martin Hemberg4,5* and Nadav Ahituv1,2*
    * Correspondence: mhemberg@bwh.harvard.edu; nadav.ahituv@ucsf.edu † Ilias Georgakopoulos-Soares and Ofer Yizhar Barnea contributed equally to this work. 4Evergrande Center for Immunologic Diseases, Harvard Medical School and Brigham and Women’s Hospital, Boston, MA, USA 1Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA Full list of author information is available at the end of the article
    Abstract: Nullomers and nullpeptides are short DNA or amino acid sequences that are absent from a genome or proteome, respectively. One potential cause for their absence could be their having a detrimental impact on an organism. Results: Here, we identify all possible nullomers and nullpeptides in the genomes and proteomes of thirty eukaryotes and demonstrate that a significant proportion of these sequences are under negative selection. We also identify nullomers that are unique to specific functional categories: coding sequences, exons, introns, 5′UTR, 3′UTR, promoters, and show that coding sequence and promoter nullomers are most likely to be selected against. By analyzing all protein sequences across the tree of life, we further identify 36,081 peptides up to six amino acids in length that do not exist in any known organism, termed primes. We next characterize all possible single base pair mutations that can lead to the appearance of a nullomer in the human genome, observing a significantly higher number of mutations than expected by chance for specific nullomer sequences in transposable elements, likely due to their suppression.
  • COVIPENDIUM May12.Pdf
    COVIPENDIUM Information available to support the development of medical countermeasures and interventions against COVID-19 Cite as: Martine DENIS, Valerie VANDEWEERD, Rein VERBEEKE, Anne LAUDISOIT, & Diane VAN DER VLIET. (2020). COVIPENDIUM: information available to support the development of medical countermeasures and interventions against COVID-19 (Version 2020-12-05). Transdisciplinary Insights. This document is conceived as a living document, updated on a weekly basis. You can find its latest version at: https://rega.kuleuven.be/if/corona_covid-19. The COVIPENDIUM is based on open-access publications (scientific journals and preprint databases, communications by WHO and OIE, health authorities and companies) in English language. Please note that the present version has not been submitted to any peer-review process. Any comment / addition that can help improve the contents of this review will be most welcome. For navigation through the various sections, please click on the headings of the table of contents and follow the links marked in blue in the document. Authors: Martine Denis, Valerie Vandeweerd, Rein Verbeke, Anne Laudisoit, Laure Wynants, Diane Van der Vliet COVIPENDIUM version: 12 MAY 2020
  • Absent Sequences: Nullomers and Primes
    Pacific Symposium on Biocomputing 12:355-366 (2007) ABSENT SEQUENCES: NULLOMERS AND PRIMES GREG HAMPIKIAN Biology, Boise State University, 1910 N University Drive Boise, Idaho 83725, USA TIM ANDERSEN Computer Science, Boise State University, 1910 N University Drive Boise, Idaho 83725, USA We describe a new publicly available algorithm for identifying absent sequences, and demonstrate its use by listing the smallest oligomers not found in the human genome (human “nullomers”), and those not found in any reported genome or GenBank sequence (“primes”). These absent sequences define the maximum set of potentially lethal oligomers. They also provide a rational basis for choosing artificial DNA sequences for molecular barcodes, show promise for species identification and environmental characterization based on absence, and identify potential targets for therapeutic intervention and suicide markers. 1. Introduction As large scale DNA sequencing becomes routine, the universal questions that can be addressed become more interesting. Our work focuses on identifying and characterizing absent sequences in publicly available databases. Through this we are attempting to discover the constraints on natural DNA and protein sequences, and to develop new tools for identification and analysis of populations. We term the short sequences that do not occur in a particular species “nullomers,” and those that have not been found in nature at all “primes.” The primes are the smallest members of the potential artificial DNA lexicon. This paper reports the results of our initial efforts to determine and map sets of nullomer and prime sequences in order to demonstrate the algorithm, and explore the utility of absent sequence analysis. It is well known that the number of possible DNA sequences is an exponentially increasing function of sequence length, and is equal to 4^n, where n is the sequence length.
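    The idea of a nullomer described in this excerpt can be illustrated with a small brute-force sketch. This is not the authors' published algorithm; it is a minimal illustration that assumes a single-stranded sequence over the alphabet A, C, G, T, ignores the reverse complement, and uses hypothetical function names (`kmers_present`, `shortest_absent`). It enumerates all 4^k words of length k and reports those that never occur.

```python
from itertools import product

ALPHABET = "ACGT"


def kmers_present(seq, k):
    """Return the set of length-k words over ACGT that occur in seq.

    Windows containing any other symbol (for example N) are skipped.
    """
    seq = seq.upper()
    allowed = set(ALPHABET)
    return {
        seq[i:i + k]
        for i in range(len(seq) - k + 1)
        if set(seq[i:i + k]) <= allowed
    }


def shortest_absent(seq, max_k=8):
    """Find the smallest k that has absent words and list them.

    Returns (k, sorted list of absent k-mers), or None if every word
    up to length max_k occurs. Illustrative only: memory grows as 4^k.
    """
    for k in range(1, max_k + 1):
        present = kmers_present(seq, k)
        if len(present) < len(ALPHABET) ** k:  # fewer than 4^k words seen
            absent = sorted(
                "".join(p)
                for p in product(ALPHABET, repeat=k)
                if "".join(p) not in present
            )
            return k, absent
    return None
```

    Because the number of candidate words grows as 4^k, a sketch like this is only practical for small k; genome-scale tools rely on more compact index structures.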
  • Efficient Computation of Absent Words in Genomic Sequences
    BMC Bioinformatics. This Provisional PDF corresponds to the article as it appeared upon acceptance. Fully formatted PDF and full text (HTML) versions will be made available soon. Efficient computation of absent words in genomic sequences. BMC Bioinformatics 2008, 9:167 doi:10.1186/1471-2105-9-167. Julia Herold, Stefan Kurtz, Robert Giegerich. ISSN 1471-2105. Article type: Methodology article. Submission date: 8 November 2007. Acceptance date: 26 March 2008. Publication date: 26 March 2008. Article URL: http://www.biomedcentral.com/1471-2105/9/167 Like all articles in BMC journals, this peer-reviewed article was published immediately upon acceptance. It can be downloaded, printed and distributed freely for any purposes (see copyright notice below). Articles in BMC journals are listed in PubMed and archived at PubMed Central. For information about publishing your research in BMC journals or any BioMed Central journal, go to http://www.biomedcentral.com/info/authors/ © 2008 Herold et al., licensee BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Julia Herold1, Stefan Kurtz2 and Robert Giegerich∗1 1Center of Biotechnology, Bielefeld University, Postfach 10 01 31, 33501 Bielefeld, Germany 2Center for Bioinformatics, University of Hamburg, Bundesstraße 43, 20146 Hamburg, Germany ∗Corresponding author Abstract Background: Analysis of sequence composition is a routine task in genome research.
  • Hierarchical Clustering of DNA K-Mer Counts in Rnaseq Fastq Files Identifies Sample Heterogeneities
    International Journal of Molecular Sciences Article Hierarchical Clustering of DNA k-mer Counts in RNAseq Fastq Files Identifies Sample Heterogeneities Wolfgang Kaisers 1,2,*, Holger Schwender 3 and Heiner Schaal 2 1 Department of Anaesthesiology, HELIOS University Hospital Wuppertal, University of Witten/Herdecke, Heusnerstr. 40, 42283 Wuppertal, Germany 2 Institut für Virologie, University Hospital Düsseldorf, Heinrich Heine University Düsseldorf, 40225 Düsseldorf, Germany 3 Mathematisches Institut, Heinrich-Heine-Universität Düsseldorf, 40225 Düsseldorf, Germany * Correspondence: Tel.: +49-202-896-1641 Received: 5 November 2018; Accepted: 15 November 2018; Published: 21 November 2018 Abstract: We apply hierarchical clustering (HC) of DNA k-mer counts on multiple Fastq files. The tree structures produced by HC may reflect experimental groups and thereby indicate experimental effects, but clustering of preparation groups indicates the presence of batch effects. Hence, HC of DNA k-mer counts may serve as a diagnostic device. In order to provide a simple applicable tool we implemented sequential analysis of Fastq reads with low memory usage in an R package (seqTools) available on Bioconductor. The approach is validated by analysis of Fastq file batches containing RNAseq data. Analysis of three Fastq batches downloaded from ArrayExpress indicated experimental effects. Analysis of RNAseq data from two cell types (dermal fibroblasts and Jurkat cells) sequenced in our facility indicates the presence of batch effects. The observed batch effects were also present in reads mapped to the human genome and also in reads filtered for high quality (Phred > 30). We propose that hierarchical clustering of DNA k-mer counts provides an unspecific diagnostic tool for RNAseq experiments.
  • Nullomers and High Order Nullomers in Genomic Sequences
    RESEARCH ARTICLE Nullomers and High Order Nullomers in Genomic Sequences Davide Vergni1, Daniele Santoni2* 1 Istituto per le Applicazioni del Calcolo “Mauro Picone” - CNR, Via dei Taurini 19, 00185, Rome, Italy, 2 Istituto di Analisi dei Sistemi ed Informatica “Antonio Ruberti” - CNR, Via dei Taurini 19, 00185, Rome, Italy
    OPEN ACCESS Citation: Vergni D, Santoni D (2016) Nullomers and High Order Nullomers in Genomic Sequences. PLoS ONE 11(12): e0164540. doi:10.1371/journal.pone.0164540 Editor: Jürgen Brosius, University of Münster, GERMANY Received: January 25, 2016 Accepted: September 27, 2016 Published: December 1, 2016
    Abstract A nullomer is an oligomer that does not occur as a subsequence in a given DNA sequence, i.e. it is an absent word of that sequence. The importance of nullomers in several applications, from drug discovery to forensic practice, is now debated in the literature. Here, we investigated the nature of nullomers, whether their absence in genomes has just a statistical explanation or it is a peculiar feature of genomic sequences. We introduced an extension of the notion of nullomer, namely high order nullomers, which are nullomers whose mutated sequences are still nullomers. We studied different aspects of them: comparison with nullomers of random sequences, CpG distribution and mean helical rise. In agreement with previous results we found that the number of nullomers in the human genome is much larger than expected by chance. Nevertheless antithetical results were found when considering a random DNA sequence preserving dinucleotide frequencies. The analysis of CpG frequencies in nullomers and high order nullomers revealed, as expected, a high CpG content but it also highlighted a strong dependence of CpG frequencies on the dinucleotide position, suggesting that nullomers have their own peculiar structure and are not simply sequences whose CpG frequency is biased.
  • Mutation Rate Variations in the Human Genome Are Encoded in DNA Shape
    bioRxiv preprint doi: https://doi.org/10.1101/2021.01.15.426837; this version posted June 14, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license. Mutation Rate Variations in the Human Genome are Encoded in DNA Shape Authors List Zian Liu1, Md. Abul Hassan Samee1* 1: Department of Molecular Physiology and Biophysics, Baylor College of Medicine, Houston, TX, 77030, USA Abstract Single nucleotide mutation rates have critical implications for human evolution and genetic diseases. Accurate modeling of these mutation rates has long remained an open problem since the rates vary substantially across the human genome. A recent model, however, explained much of the variation by considering higher order nucleotide interactions in the local (7-mer) sequence context around mutated nucleotides. Despite this model’s predictive value, we still lack a biophysically-grounded understanding of genome-wide mutation rate variations. DNA shape features are geometric measurements of DNA structural properties, such as helical twist and tilt, and are known to capture information on interactions between neighboring nucleotides within a local context.
  • Amino Acid - Wikipedia, the Free Encyclopedia
    1/29/2016 Amino acid - Wikipedia, the free encyclopedia Amino acid From Wikipedia, the free encyclopedia This article is about the class of chemicals. For the structures and properties of the standard proteinogenic amino acids, see Proteinogenic amino acid. Amino acids are biologically important organic compounds containing amine (-NH2) and carboxylic acid (-COOH) functional groups, usually along with a side-chain specific to each amino acid.[1][2][3] The key elements of an amino acid are carbon, hydrogen, oxygen, and nitrogen, though other elements are found in the side-chains of certain amino acids. About 500 amino acids are known and can be classified in many ways.[4] They can be classified according to the core structural functional groups' locations as alpha- (α-), beta- (β-), gamma- (γ-) or delta- (δ-) amino acids; other categories relate to polarity, pH level, and side-chain group type (aliphatic, acyclic, aromatic, containing hydroxyl or sulfur, etc.). In the form of proteins, amino acids comprise the second-largest component (water is the largest) of human muscles, cells and other tissues.[5] Outside proteins, amino acids perform critical roles in processes such as neurotransmitter transport and biosynthesis. [Figure: The structure of an alpha amino acid in its un-ionized form] In biochemistry, amino acids having both the amine and the carboxylic acid groups attached to the first (alpha-) carbon atom have particular importance. They are known as 2-, alpha-, or α-amino acids (generic formula H2NCHRCOOH in most cases,[6] where R is an organic substituent
  • COVID Review May5.Pdf
    COVIPENDIUM Information available to support the development of medical countermeasures and interventions against COVID-19 Cite as: Martine DENIS, Valerie VANDEWEERD, Rein VERBEEKE, Anne LAUDISOIT, & Diane Van der Vliet. (2020). COVIPENDIUM: information available to support the development of medical countermeasures and interventions against COVID-19 (Version 2020-05-05). Transdisciplinary Insights. This document is conceived as a living document, updated on a weekly basis. You can find its latest version at: https://rega.kuleuven.be/if/corona_covid-19. The COVIPENDIUM is based on open-access publications (scientific journals and preprint databases, communications by WHO and OIE, health authorities and companies) in English language. Please note that the present version has not been submitted to any peer-review process. Any comment / addition that can help improve the contents of this review will be most welcome. For navigation through the various sections, please click on the headings of the table of contents and follow the links marked in blue in the document. Authors: Martine Denis, Valerie Vandeweerd, Rein Verbeke, Anne Laudisoit, Laure Wynants, Diane Van der Vliet COVIPENDIUM version: 05 MAY 2020
  • SBIA1201 – Sequence Analysis
    SCHOOL OF BIO AND CHEMICAL ENGINEERING DEPARTMENT OF BIOINFORMATICS UNIT – 1- SBIA1201 – Sequence Analysis
    Sequence analysis: In bioinformatics, sequence analysis is the process of subjecting a DNA, RNA or peptide sequence to any of a wide range of analytical methods to understand its features, function, structure, or evolution. Methodologies used include sequence alignment, searches against biological databases, and others. Since the development of methods of high-throughput production of gene and protein sequences, the rate of addition of new sequences to the databases increased exponentially. Such a collection of sequences does not, by itself, increase the scientist's understanding of the biology of organisms. However, comparing these new sequences to those with known functions is a key way of understanding the biology of an organism from which the new sequence comes. Thus, sequence analysis can be used to assign function to genes and proteins by the study of the similarities between the compared sequences. Nowadays, there are many tools and techniques that provide the sequence comparisons (sequence alignment) and analyze the alignment product to understand its biology.
    Sequence analysis in molecular biology includes a very wide range of relevant topics: (1) the comparison of sequences in order to find similarity, often to infer if they are related (homologous); (2) identification of intrinsic features of the sequence such as active sites, post translational modification sites, gene-structures, reading frames, distributions of introns and exons and regulatory elements; (3) identification of sequence differences and variations such as point mutations and single nucleotide polymorphism (SNP) in order to get the genetic marker; (4) revealing the evolution and genetic diversity of sequences and organisms; (5) identification of molecular structure from sequence alone.
    In chemistry, sequence analysis comprises techniques used to determine the sequence of a polymer formed of several monomers (see Sequence analysis of synthetic polymers).
  • Some Unexpected Results on the Distribution of Genomic DNA K-Mers
    TEL-AVIV UNIVERSITY RAYMOND AND BEVERLY SACKLER FACULTY OF EXACT SCIENCES Common, Rare, and Surprising Words: Some Unexpected Results on the Distribution of Genomic DNA k-mers Thesis submitted in partial fulfillment of the requirements for the degree of M.Sc. in Tel-Aviv University The Department of Computer Science by Yaron Levy The research work for this thesis has been carried out at Tel-Aviv University under the supervision of Prof. Benny Chor and Prof. David Horn January 2008
    Abstract The probability distribution of DNA k-mers in whole genome sequences provides an interesting perspective of the complexity of these systems. Whereas previous research concentrated on missing k-mers, we study the overall k-mer distribution of more than 100 species from archaea, bacteria, and eukarya (including mammals). We focus on low order Markov models, which capture short range correlations between nucleotides in the DNA sequence. In particular, they enable us to decide if rare, missing, and common k-mers are surprising or not. We show that various local and global properties of DNA k-mers can be modeled fairly well by low order chains. While exploring these empirical k-mer distributions, we discovered that a few species, including all mammals, have multi-modal histograms, while most species exhibit unimodal distributions. From an evolutionary perspective, these multi-modal distributions are exactly the tetrapods. These distributions are characterized by specific values of C+G contents and CpG dinucleotide suppression, but not by any one of these factors alone. Again, we provide an explanation for this phenomenon, using low order Markov models.
  • Significant Non-Existence of Sequences in Genomes and Proteomes
    bioRxiv preprint doi: https://doi.org/10.1101/2020.06.25.170431; this version posted June 26, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license. Significant non-existence of sequences in genomes and proteomes Grigorios Koulouras1 # and Martin C. Frith1, 2, 3 * 1Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology (AIST), 2-3-26 Aomi, Koto-ku, Tokyo 135-0064, Japan. 2Graduate School of Frontier Sciences, University of Tokyo, Kashiwa, Chiba, Japan. 3Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), AIST, Shinjuku-ku, Tokyo, Japan. # Present Address: Cancer Research UK Beatson Institute, Glasgow G61 1BD, UK Abstract Nullomers are minimal-length oligomers absent from a genome or proteome. Although research has shown that artificially synthesized nullomers have deleterious effects, there is still a lack of a strategy for the prioritisation and classification of non-occurring sequences as potentially malicious or benign. In this work, by using Markovian models with multiple-testing correction, we reveal significant absent oligomers which are statistically expected to exist.
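    To make the notion of an absent word that is "statistically expected to exist" concrete, here is a minimal sketch. It is an assumption-laden illustration, not the method of the preprint: it uses the classical maximal-order Markov estimate, in which the expected count of a word w of length k is N(prefix) × N(suffix) / N(core), with prefix and suffix the two overlapping (k-1)-mers and core the shared (k-2)-mer. The function names are hypothetical, and no multiple-testing correction is applied.

```python
from collections import Counter


def count_kmers(seq, k):
    """Count every overlapping length-k window of seq."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def expected_count(seq, word):
    """Expected occurrences of word under a maximal-order Markov model.

    E(w) = N(w[:-1]) * N(w[1:]) / N(w[1:-1]), defined here for len(word) >= 3.
    Returns 0.0 when the shared core never occurs in seq.
    """
    k = len(word)
    if k < 3:
        raise ValueError("word must have length >= 3")
    sub = count_kmers(seq, k - 1)   # counts of (k-1)-mers
    core = count_kmers(seq, k - 2)  # counts of (k-2)-mers
    denom = core[word[1:-1]]
    if denom == 0:
        return 0.0
    return sub[word[:-1]] * sub[word[1:]] / denom
```

    An absent word with a large expected count is a candidate for a "surprising" absence; turning that into a significance statement would additionally require a probability model for the counts and a correction for the number of words tested, which this sketch leaves out.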