Chapter One: General Introduction 1
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Long Overlapping Genes in Pseudomonas Aeruginosa
bioRxiv preprint doi: https://doi.org/10.1101/2021.02.09.430400; this version posted February 10, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. 1 Shadow ORFs illuminated: long overlapping genes in Pseudomonas 2 aeruginosa are translated and under purifying selection 3 4 Michaela Kreitmeier1, Zachary Ardern1*, Miriam Abele2, Christina Ludwig2, Siegfried Scherer1, 5 Klaus Neuhaus3* 6 7 1Chair for Microbial Ecology, TUM School of Life Sciences, Technische Universität München 8 Weihenstephaner Berg 3, 85354 Freising, Germany. 9 2Bavarian Center for Biomolecular Mass Spectrometry (BayBioMS), TUM School of Life 10 Sciences, Technische Universität München, Gregor-Mendel-Strasse 4, 85354 Freising, 11 Germany. 12 3Core Facility Microbiome, ZIEL – Institute for Food & Health, Technische Universität München 13 Weihenstephaner Berg 3, 85354 Freising, Germany. 14 15 *email: [email protected]; [email protected] 16 17 Abstract 18 The existence of overlapping genes (OLGs) with significant coding overlaps revolutionises our 19 understanding of genomic complexity. We report two exceptionally long (957 nt and 1536 nt), 20 evolutionarily novel, translated antisense open reading frames (ORFs) embedded within 21 annotated genes in the medically important Gram-negative bacterium Pseudomonas 22 aeruginosa. Both OLG pairs show sequence features consistent with being genes and 23 transcriptional signals in RNA sequencing data. Translation of both OLGs was confirmed by 24 ribosome profiling and mass spectrometry. Quantitative proteomics of samples taken during 25 different phases of growth revealed regulation of protein abundances, implying biological 26 functionality. -
A Novel Short L-Arginine Responsive Protein-Coding Gene (Laob)
Erschienen in: BMC evolutionary biology ; 18 (2018), 1. - 21 https://dx.doi.org/10.1186/s12862-018-1134-0 Hücker et al. BMC Evolutionary Biology (2018) 18:21 https://doi.org/10.1186/s12862-018-1134-0 RESEARCH ARTICLE Open Access A novel short L-arginine responsive protein-coding gene (laoB) antiparallel overlapping to a CadC-like transcriptional regulator in Escherichia coli O157:H7 Sakai originated by overprinting Sarah M. Hücker1,5, Sonja Vanderhaeghen1, Isabel Abellan-Schneyder1,4, Romy Wecko1, Svenja Simon2, Siegfried Scherer1,3 and Klaus Neuhaus1,4* Abstract Background: Due to the DNA triplet code, it is possible that the sequences of two or more protein-coding genes overlap to a large degree. However, such non-trivial overlaps are usually excluded by genome annotation pipelines and, thus, only a few overlapping gene pairs have been described in bacteria. In contrast, transcriptome and translatome sequencing reveals many signals originated from the antisense strand of annotated genes, of which we analyzed an example gene pair in more detail. Results: A small open reading frame of Escherichia coli O157:H7 strain Sakai (EHEC), designated laoB (L-arginine responsive overlapping gene), is embedded in reading frame −2 in the antisense strand of ECs5115, encoding a CadC-like transcriptional regulator. This overlapping gene shows evidence of transcription and translation in Luria-Bertani (LB) and brain-heart infusion (BHI) medium based on RNA sequencing (RNAseq) and ribosomal- footprint sequencing (RIBOseq). The transcriptional start site is 289 base pairs (bp) upstream of the start codon and transcription termination is 155 bp downstream of the stop codon. -
1. a 6-Frame Translation Map of a Segment of DNA Is Shown, with Three Open Reading Frames (A, B, and C). Orfs a and B Are Known to Be in Separate Genes
1. A 6-frame translation map of a segment of DNA is shown, with three open reading frames (A, B, and C). ORFs A and B are known to be in separate genes. 1a. Two transcription bubbles are shown, one in ORF A and one in ORF B. In the transcription bubble diagram, mark the following: • the location of RNA polymerase on the appropriate strand in each bubble • the RNA transcripts to show the relative lengths of RNA made by those two polymerases 1b. Are the promoters for ORFs A and/or B present in the DNA region shown in this diagram? Promoter for A: Present? Yes No (circle one) If present, mark its location and label it. Promoter for B: Present? Yes No (circle one) If present, mark its location and label it. 1c. Electron microscopy experiments failed to show RNA polymerases over the ORF "C" region of DNA. State whether each of the three explanations listed below is valid or not, explaining as necessary: Explanation If valid, just write “Valid.” If invalid, BRIEFLY explain why. _______________________________________________________________________ ORFs B and C are exons of the same gene and a splicing error causes ORF C to be left out. Splicing occurs after transcription. Incorrect splicing doesn't explain why transcription didn't happen. ORF C has a mutation in its start (ATG) codon, preventing transcription. The start codon is used for translation, not transcription... whether or not the start codon is intact, transcription could still happen. ORF C has a promoter mutation preventing transcription. VALID. (A promoter mutation is consistent with failure to transcribe the gene.) 2. -
JEWISH REVIEW of BOOKS Volume 5, Number 1 Spring 2014 $7.95
The New Balaboosta, Khazar DNA & Agnon’s Lost Satire JEWISH REVIEW OF BOOKS Volume 5, Number 1 Spring 2014 $7.95 The Screenwriter & the Hoodlums Ben Hecht with Stuart Schoffman Ivan Marcus Rashi with Chumash Elliott Abrams Israel’s Journalist-Prophet Steven Aschheim The Memory Man Amy Newman Smith Expulsion Chick-Lit Gavriel D. Rosenfeld George Clooney, Historian NEW AT THE Editor CENTER FOR JEWISH HISTORY Abraham Socher Senior Contributing Editor Allan Arkush Art Director Betsy Klarfeld Associate Editor Amy Newman Smith Administrative Assistant Rebecca Weiss Editorial Board Robert Alter Shlomo Avineri Leora Batnitzky Ruth Gavison Moshe Halbertal Hillel Halkin Jon D. Levenson Anita Shapira Michael Walzer J. H.H. Weiler Leon Wieseltier Ruth R. Wisse Steven J. Zipperstein Publisher Eric Cohen Associate Publisher & Director of Marketing Lori Dorr NEW SPACE The Jewish Review of Books (Print ISSN 2153-1978, The David Berg Rare Book Room is a state-of- Online ISSN 2153-1994) is a quarterly publication the-art exhibition space preserving and dis- of ideas and criticism published in Spring, Summer, playing the written word, illuminating Jewish Fall, and Winter, by Bee.Ideas, LLC., 165 East 56th Street, 4th Floor, New York, NY 10022. history over time and place. For all subscriptions, please visit www.jewishreviewofbooks.com or send $29.95 UPCOMING EXHIBITION ($39.95 outside of the U.S.) to Jewish Review of Books, Opening Sunday, March 16: By Dawn’s Early PO Box 3000, Denville, NJ 07834. Please send notifi- cations of address changes to the same address or to Light: From Subjects to Citizens (presented by the [email protected]. -
1 Research Article the Missing Link of Jewish European Ancestry: Contrasting the Rhineland and the Khazarian Hypotheses Eran
Research Article The Missing Link of Jewish European Ancestry: Contrasting the Rhineland and the Khazarian Hypotheses Eran Israeli-Elhaik1,2 1 Department of Mental Health, Johns Hopkins University Bloomberg School of Public Health, Baltimore, MD, USA, 21208. 2 McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA, 21208. Running head: The Missing Link of Jewish European Ancestry Keywords: Jewish genome, Khazars, Rhineland, Ashkenazi Jews, population isolate, Population structure Please address all correspondence to Eran Elhaik at [email protected] Phone: 410-502-5740. Fax: 410-502-7544. 1 Abstract The question of Jewish ancestry has been the subject of controversy for over two centuries and has yet to be resolved. The “Rhineland Hypothesis” proposes that Eastern European Jews emerged from a small group of German Jews who migrated eastward and expanded rapidly. Alternatively, the “Khazarian Hypothesis” suggests that Eastern European descended from Judean tribes who joined the Khazars, an amalgam of Turkic clans that settled the Caucasus in the early centuries CE and converted to Judaism in the 8th century. The Judaized Empire was continuously reinforced with Mesopotamian and Greco-Roman Jews until the 13th century. Following the collapse of their empire, the Judeo-Khazars fled to Eastern Europe. The rise of European Jewry is therefore explained by the contribution of the Judeo-Khazars. Thus far, however, their contribution has been estimated only empirically; the absence of genome-wide data from Caucasus populations precluded testing the Khazarian Hypothesis. Recent sequencing of modern Caucasus populations prompted us to revisit the Khazarian Hypothesis and compare it with the Rhineland Hypothesis. -
The Design of Maximally Compressed Coding Sequences
Two Proteins for the Price of One: The Design of Maximally Compressed Coding Sequences Bei Wang1, Dimitris Papamichail2, Steffen Mueller3, and Steven Skiena2 1 Dept. of Computer Science, Duke University, Durham, NC 27708 [email protected] 2 Dept. of Computer Science, State University of New York, Stony Brook, NY 11794 {dimitris, skiena}@cs.sunysb.edu 3 Dept. of Microbiology, State University of New York, Stony Brook, NY 11794 [email protected] Abstract. The emerging field of synthetic biology moves beyond con- ventional genetic manipulation to construct novel life forms which do not originate in nature. We explore the problem of designing the provably shortest genomic sequence to encode a given set of genes by exploit- ing alternate reading frames. We present an algorithm for designing the shortest DNA sequence simultaneously encoding two given amino acid sequences. We show that the coding sequence of naturally occurring pairs of overlapping genes approach maximum compression. We also investi- gate the impact of alternate coding matrices on overlapping sequence de- sign. Finally, we discuss an interesting application for overlapping gene design, namely the interleaving of an antibiotic resistance gene into a target gene inserted into a virus or plasmid for amplification. 1 Introduction The emerging field of synthetic biology moves beyond conventional genetic ma- nipulation to construct novel life forms which do not originate in nature. The synthesis of poliovirus from off-to-shelf components [1] attracted worldwide at- tention when announced in July 2002. Subsequently, the bacteriophage PhiX174 was synthesized using different techniques in only three weeks [2], and Kodu- mal, et al. -
1 the Genochip: a New Tool for Genetic Anthropology Eran Elhaik
The GenoChip: A New Tool for Genetic Anthropology Eran Elhaik1, Elliott Greenspan2, Sean Staats2, Thomas Krahn2, Chris Tyler-Smith3, Yali Xue3, Sergio Tofanelli4, Paolo Francalacci5, Francesco Cucca6, Luca Pagani3,7, Li Jin8, Hui Li8, Theodore G. Schurr9, Bennett Greenspan2, R. Spencer Wells10,* and the Genographic Consortium 1 Department of Mental Health, Johns Hopkins University Bloomberg School of Public Health, 615 N. Wolfe Street, Baltimore, MD 21205, USA 2 Family Tree DNA, Houston, TX 77008, USA 3 The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, UK 4 Department of Biology, University of Pisa, Italy 5 Department of Natural and Environmental Science, Evolutionary Genetics Lab, University of Sassari, Italy 6 National Research Council, Monserrato, Italy 7 Division of Biological Anthropology, University of Cambridge, UK 8 Fudan University, Shanghai, China 9 University of Pennsylvania, Philadelphia, PA 10 National Geographic Society, Washington DC, USA *Please address all correspondence to Spencer Wells at [email protected] 1 Abstract The Genographic Project is an international effort using genetic data to chart human migratory history. The project is non-profit and non-medical, and through its Legacy Fund supports locally led efforts to preserve indigenous and traditional cultures. While the first phase of the project was focused primarily on uniparentally-inherited markers on the Y-chromosome and mitochondrial DNA, the next is focusing on markers from across the entire genome to obtain a more complete understanding of human genetic variation. In this regard, genomic admixture is one of the most crucial tools that will help us to analyze the genetic makeup and shared history of human populations. -
Structural Modeling and Biochemical Characterization of Recombinant KPN 02809, a Zinc-Dependent Metalloprotease from Klebsiella Pneumoniae MGH 78578
Int. J. Mol. Sci. 2012 , 13 , 901-917; doi:10.3390/ijms13010901 OPEN ACCESS International Journal of Molecular Sciences ISSN 1422-0067 www.mdpi.com/journal/ijms Article Structural Modeling and Biochemical Characterization of Recombinant KPN_02809, a Zinc-Dependent Metalloprotease from Klebsiella pneumoniae MGH 78578 Mun Teng Wong 1, Sy Bing Choi 2, Chee Sian Kuan 1, Siang Ling Chua 1, Chiat Han Chang 1, Yahaya Mohd Normi 3, Wei Cun See Too 1, Habibah A. Wahab 2,* and Ling Ling Few 1,* 1 School of Health Sciences, Health Campus, Universiti Sains Malaysia, Kubang Kerian 16150, Kelantan, Malaysia; E-Mails: [email protected] (M.T.W.); [email protected] (C.S.K.); [email protected] (S.L.C.); [email protected] (C.H.C.); [email protected] (W.C.S.T.) 2 Pharmaceutical Design and Simulation (PhDS) Laboratory, School of Pharmaceutical Sciences, Universiti Sains Malaysia, Minden 11800, Pulau Pinang, Malaysia; E-Mail: [email protected] 3 Department of Cell and Molecular Biology, Faculty of Biotechnology and Biomolecular Sciences, Universiti Putra Malaysia, Serdang 43400, Selangor, Malaysia; E-Mail: [email protected] * Authors to whom correspondence should be addressed; E-Mails: [email protected] (H.A.W.); [email protected] (L.L.F.); Tel.: +604-6532238 (H.A.W.); +609-7677536 (L.L.F.); Fax: +604-6570017 (H.A.W.); +609-7677515 (L.L.F.). Received: 27 October 2011; in revised form: 29 December 2011 / Accepted: 9 January 2012 / Published: 16 January 2012 Abstract: Klebsiella pneumoniae is a Gram-negative, cylindrical rod shaped opportunistic pathogen that is found in the environment as well as existing as a normal flora in mammalian mucosal surfaces such as the mouth, skin, and intestines. -
Classification and Function of Small Open-Reading Frames Abstract
View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Sussex Research Online 1 Classification and function of small open-reading frames Juan-Pablo Couso1,2* and Pedro Patraquim2 1Centro Andaluz de Biologia del Desarrollo, CSIC-UPO, Sevilla, Spain and 2Brighton and Sussex Medical School, University of Sussex, Brighton, United Kingdom. *Author for correspondence: [email protected] Abstract Small open-reading frames (smORFs or sORFs) of 100 codons or less are usually - if arbitrarily - excluded from canonical proteome annotations. Despite this, the genomes of a wide range of metazoans, including humans, contain hundreds of smORFs, some of which fulfil key physiological functions. Recently, ribosomal profiling has been employed to show that the transcriptome of the model organism Drosophila melanogaster contains thousands of smORFs of different classes actively undergoing translation which produces peptides of mostly unknown function. Here we present a comprehensive analysis of the smORF repertoire in flies, mice and humans. We propose the existence of several classes of smORFs with different functions, from inert DNA sequences to transcribed and translated cis- regulators of translation, and finally to expression of functional peptides with a propensity to act as regulators of canonical membrane-associated proteins, or as components of ancestral protein complexes in the cytoplasm. We suggest that the different smORF classes could represent steps during the evolution of novel peptide and protein sequences. Our analysis introduces a distinction between different peptide-coding classes in animal genomes, and highlights the role of Drosophila melanogaster as a model organism for the study of small peptide biology in the context of development, physiology and human disease. -
Comparative Testing of DNA Segmentation Algorithms Using
Comparative Testing of DNA Segmentation Algorithms Using Benchmark Simulations Eran Elhaik,*,1 Dan Graur,1 and Kresˇimir Josic´2 1Department of Biology & Biochemistry, University of Houston 2Department of Mathematics, University of Houston *Corresponding author: E-mail: [email protected]. Associate editor: Hideki Innan Abstract Numerous segmentation methods for the detection of compositionally homogeneous domains within genomic sequences have been proposed. Unfortunately, these methods yield inconsistent results. Here, we present a benchmark consisting of Research article two sets of simulated genomic sequences for testing the performances of segmentation algorithms. Sequences in the first set are composed of fixed-sized homogeneous domains, distinct in their between-domain guanine and cytosine (GC) content variability. The sequences in the second set are composed of a mosaic of many short domains and a few long ones, distinguished by sharp GC content boundaries between neighboring domains. We use these sets to test the performance of seven segmentation algorithms in the literature. Our results show that recursive segmentation algorithms based on the Jensen–Shannon divergence outperform all other algorithms. However, even these algorithms perform poorly in certain instances because of the arbitrary choice of a segmentation-stopping criterion. Key words: isochores, GC content, segmentation algorithms, Jensen–Shannon divergence statistic, entropy, genome composition, benchmark simulations. Introduction into compositionally homogeneous domains according to predefined criteria. Segmentation methods include non- In 1976, Bernardi and colleagues (Macaya et al. 1976) put overlapping, sliding window methods (Bernardi 2001; Clay forward a theory concerning the compositional makeup of et al. 2001; International Human Genome Sequencing Con- vertebrate genomes. This description, later christened ‘‘the ´ isochore theory’’ (Cuny et al. -
On the Unfounded Enthusiasm for Soft Selective Sweeps III: the Supervised Machine Learning Algorithm That Isn’T
G C A T T A C G G C A T genes Article On the Unfounded Enthusiasm for Soft Selective Sweeps III: The Supervised Machine Learning Algorithm That Isn’t Eran Elhaik 1,* and Dan Graur 2 1 Department of Biology, Lund University, Sölvegatan 35, 22362 Lund, Sweden 2 Department of Biology & Biochemistry, University of Houston, Science & Research Building 2, Suite #342, 3455 Cullen Bldv., Houston, TX 77204-5001, USA; [email protected] * Correspondence: [email protected] Abstract: In the last 15 years or so, soft selective sweep mechanisms have been catapulted from a curiosity of little evolutionary importance to a ubiquitous mechanism claimed to explain most adaptive evolution and, in some cases, most evolution. This transformation was aided by a series of articles by Daniel Schrider and Andrew Kern. Within this series, a paper entitled “Soft sweeps are the dominant mode of adaptation in the human genome” (Schrider and Kern, Mol. Biol. Evolut. 2017, 34(8), 1863–1877) attracted a great deal of attention, in particular in conjunction with another paper (Kern and Hahn, Mol. Biol. Evolut. 2018, 35(6), 1366–1371), for purporting to discredit the Neutral Theory of Molecular Evolution (Kimura 1968). Here, we address an alleged novelty in Schrider and Kern’s paper, i.e., the claim that their study involved an artificial intelligence technique called supervised machine learning (SML). SML is predicated upon the existence of a training dataset in which the correspondence between the input and output is known empirically to be true. Curiously, Schrider and Kern did not possess a training dataset of genomic segments known a priori to have evolved either neutrally or through soft or hard selective sweeps. -
A Comprehensive Two-Hybrid Analysis to Explore the Yeast Protein Interactome
A comprehensive two-hybrid analysis to explore the yeast protein interactome Takashi Ito*†, Tomoko Chiba*, Ritsuko Ozawa‡, Mikio Yoshida§, Masahira Hattori‡, and Yoshiyuki Sakaki‡¶ *Division of Genome Biology, Cancer Research Institute, Kanazawa University, Kanazawa 920-0934, Japan; ‡Human Genome Research Group, RIKEN Genomic Sciences Center, Yokohama 230-0045, Japan; §INTEC Web and Genome Informatics Corporation, Tokyo 136-0075, Japan; and ¶Human Genome Center, Institute of Medical Science, University of Tokyo, Tokyo 108-8639, Japan Communicated by Satoshi Omura, The Kitasato Institute, Tokyo, Japan, January 22, 2001 (received for review December 4, 2000) Protein–protein interactions play crucial roles in the execution of DNA chip technologies (7). Accumulation of gene expression various biological functions. Accordingly, their comprehensive de- data under various conditions has allowed one to classify genes scription would contribute considerably to the functional interpre- into distinct classes, each of which shares a unique expression tation of fully sequenced genomes, which are flooded with novel profile and is presumably under the same regulatory mechanism. genes of unpredictable functions. We previously developed a The functions of well-characterized genes would give insight into system to examine two-hybrid interactions in all possible combi- those of novel ones in the same cluster. nations between the Ϸ6,000 proteins of the budding yeast Sac- Note that, albeit its power, expression profiling is essentially an charomyces cerevisiae. Here we have completed the comprehen- indirect measure for biological process. Much finer information sive analysis using this system to identify 4,549 two-hybrid would be obtained by the analysis of proteins per se, which interactions among 3,278 proteins.