JVI Accepts, published online ahead of print on 29 July 2009 J. Virol. doi:10.1128/JVI.00595-09 Copyright © 2009, American Society for Microbiology and/or the Listed Authors/Institutions. All Rights Reserved. 1 Overlapping genes produce proteins with unusual sequence properties and 2 offer insight into de novo protein creation 3 4 Corinne Rancurel1, Mahvash Khosravi2, Keith A. Dunker2, Pedro R. Romero2*, and David Karlin3* 5 6 1 Architecture et Fonction des Macromolécules Biologiques, Case 932, Campus de Luminy, 13288 Marseille 7 Cedex 9, France 8 2 Center for Computational Biology and Bioinformatics, 410 West 10th Street, Suite 5000, Indiana University 9 - Purdue University, Indianapolis, IN 46202-5122, USA, Downloaded from 10 3 25, rue de Casssis, 13008 Marseille, FRANCE 11 12 * Corresponding authors:
[email protected],
[email protected] 13 14 Running Title : Overlapping proteins have unusual sequence properties http://jvi.asm.org/ 15 Keywords 16 de novo gene creation; de novo protein creation; novel proteins; new proteins; orphan proteins; orphan 17 genes; ORFans; overlapping genes; overlapping reading frames; overprinting; unstructured proteins; 18 disordered proteins; intrinsic disorder; structural disorder; disorder prediction; profile-profile comparison; on April 22, 2020 by guest 19 PFAM; viral genomics; viral bioinformatics; viral structural genomics. 20 21 Abbreviations 22 Only abbreviations used in the main text are listed here; others are given in the figure captions. 23 30K, conserved domain of the 30K family of movement proteins; aa, amino acid; dsRNA, double-stranded 24 RNA; GT, guanylyltransferase; MP, movement protein; MT, methyltransferase; N, nucleoprotein; NSs, non 25 structural protein of orthobunyaviruses; ORF, open reading frame; PDB, protein databank (database of 26 protein structures); PFAM, Protein families (database of families of protein sequences); ssRNA, single- 27 stranded RNA; TGB, triple gene block; RE, relative (compositional) entropy; tm, transmembrane segment.