Self-Analysis of Repeat Proteins Reveals Evolutionarily Conserved
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Genomic Correlates of Relationship QTL Involved in Fore- Versus Hind Limb Divergence in Mice
Loyola University Chicago Loyola eCommons Biology: Faculty Publications and Other Works Faculty Publications 2013 Genomic Correlates of Relationship QTL Involved in Fore- Versus Hind Limb Divergence in Mice Mihaela Palicev Gunter P. Wagner James P. Noonan Benedikt Hallgrimsson James M. Cheverud Loyola University Chicago, [email protected] Follow this and additional works at: https://ecommons.luc.edu/biology_facpubs Part of the Biology Commons Recommended Citation Palicev, M, GP Wagner, JP Noonan, B Hallgrimsson, and JM Cheverud. "Genomic Correlates of Relationship QTL Involved in Fore- Versus Hind Limb Divergence in Mice." Genome Biology and Evolution 5(10), 2013. This Article is brought to you for free and open access by the Faculty Publications at Loyola eCommons. It has been accepted for inclusion in Biology: Faculty Publications and Other Works by an authorized administrator of Loyola eCommons. For more information, please contact [email protected]. This work is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 3.0 License. © Palicev et al., 2013. GBE Genomic Correlates of Relationship QTL Involved in Fore- versus Hind Limb Divergence in Mice Mihaela Pavlicev1,2,*, Gu¨ nter P. Wagner3, James P. Noonan4, Benedikt Hallgrı´msson5,and James M. Cheverud6 1Konrad Lorenz Institute for Evolution and Cognition Research, Altenberg, Austria 2Department of Pediatrics, Cincinnati Children‘s Hospital Medical Center, Cincinnati, Ohio 3Yale Systems Biology Institute and Department of Ecology and Evolutionary Biology, Yale University 4Department of Genetics, Yale University School of Medicine 5Department of Cell Biology and Anatomy, The McCaig Institute for Bone and Joint Health and the Alberta Children’s Hospital Research Institute for Child and Maternal Health, University of Calgary, Calgary, Canada 6Department of Anatomy and Neurobiology, Washington University *Corresponding author: E-mail: [email protected]. -
Protein Interaction Network of Alternatively Spliced Isoforms from Brain Links Genetic Risk Factors for Autism
ARTICLE Received 24 Aug 2013 | Accepted 14 Mar 2014 | Published 11 Apr 2014 DOI: 10.1038/ncomms4650 OPEN Protein interaction network of alternatively spliced isoforms from brain links genetic risk factors for autism Roser Corominas1,*, Xinping Yang2,3,*, Guan Ning Lin1,*, Shuli Kang1,*, Yun Shen2,3, Lila Ghamsari2,3,w, Martin Broly2,3, Maria Rodriguez2,3, Stanley Tam2,3, Shelly A. Trigg2,3,w, Changyu Fan2,3, Song Yi2,3, Murat Tasan4, Irma Lemmens5, Xingyan Kuang6, Nan Zhao6, Dheeraj Malhotra7, Jacob J. Michaelson7,w, Vladimir Vacic8, Michael A. Calderwood2,3, Frederick P. Roth2,3,4, Jan Tavernier5, Steve Horvath9, Kourosh Salehi-Ashtiani2,3,w, Dmitry Korkin6, Jonathan Sebat7, David E. Hill2,3, Tong Hao2,3, Marc Vidal2,3 & Lilia M. Iakoucheva1 Increased risk for autism spectrum disorders (ASD) is attributed to hundreds of genetic loci. The convergence of ASD variants have been investigated using various approaches, including protein interactions extracted from the published literature. However, these datasets are frequently incomplete, carry biases and are limited to interactions of a single splicing isoform, which may not be expressed in the disease-relevant tissue. Here we introduce a new interactome mapping approach by experimentally identifying interactions between brain-expressed alternatively spliced variants of ASD risk factors. The Autism Spliceform Interaction Network reveals that almost half of the detected interactions and about 30% of the newly identified interacting partners represent contribution from splicing variants, emphasizing the importance of isoform networks. Isoform interactions greatly contribute to establishing direct physical connections between proteins from the de novo autism CNVs. Our findings demonstrate the critical role of spliceform networks for translating genetic knowledge into a better understanding of human diseases. -
Supplementary Information Integrative Analyses of Splicing in the Aging Brain: Role in Susceptibility to Alzheimer’S Disease
Supplementary Information Integrative analyses of splicing in the aging brain: role in susceptibility to Alzheimer’s Disease Contents 1. Supplementary Notes 1.1. Religious Orders Study and Memory and Aging Project 1.2. Mount Sinai Brain Bank Alzheimer’s Disease 1.3. CommonMind Consortium 1.4. Data Availability 2. Supplementary Tables 3. Supplementary Figures Note: Supplementary Tables are provided as separate Excel files. 1. Supplementary Notes 1.1. Religious Orders Study and Memory and Aging Project Gene expression data1. Gene expression data were generated using RNA- sequencing from Dorsolateral Prefrontal Cortex (DLPFC) of 540 individuals, at an average sequence depth of 90M reads. Detailed description of data generation and processing was previously described2 (Mostafavi, Gaiteri et al., under review). Samples were submitted to the Broad Institute’s Genomics Platform for transcriptome analysis following the dUTP protocol with Poly(A) selection developed by Levin and colleagues3. All samples were chosen to pass two initial quality filters: RNA integrity (RIN) score >5 and quantity threshold of 5 ug (and were selected from a larger set of 724 samples). Sequencing was performed on the Illumina HiSeq with 101bp paired-end reads and achieved coverage of 150M reads of the first 12 samples. These 12 samples will serve as a deep coverage reference and included 2 males and 2 females of nonimpaired, mild cognitive impaired, and Alzheimer's cases. The remaining samples were sequenced with target coverage of 50M reads; the mean coverage for the samples passing QC is 95 million reads (median 90 million reads). The libraries were constructed and pooled according to the RIN scores such that similar RIN scores would be pooled together. -
PPR Proteins – Orchestrators of Organelle RNA Metabolism Aleix
1 PPR proteins – orchestrators of organelle RNA metabolism 2 Aleix Gorchs Rovira and Alison G Smith* 3 Department of Plant Sciences, University of Cambridge, Downing Street, Cambridge CB2 3EA UK 4 Correspondence 5 *Corresponding author, 6 e-mail: [email protected] 7 8 Pentatricopeptide repeat (PPR) proteins are important RNA regulators in chloroplasts and mitochondria, 9 aiding in RNA editing, maturation, stabilisation or intron splicing, and in transcription and translation 10 of organellar genes. In this review, we summarise all PPR proteins documented so far in plants and the 11 green alga Chlamydomonas. By further analysis of the known target RNAs from Arabidopsis thaliana 12 PPR proteins, we find that all organellar-encoded complexes are regulated by these proteins, although 13 to differing extents. In particular, the orthologous complexes of NADH dehydrogenase (Complex I) in 14 the mitochondria and NADH dehydrogenase-like (NDH) complex in the chloroplast were the most 15 regulated, with respectively 60 and 28% of all characterised A. thaliana PPR proteins targeting their 16 genes. 17 Abbreviations – CMS, Cytoplasm Male Sterility; PMT, Post-transcriptional modification; T/P/OPR, 18 tetra-, penta-, octo-tricopeptide repeat. 19 20 The need to regulate organelle genetic expression 21 Photosynthesis and respiration, which arose initially in prokaryotic organisms, requires the assembly of 22 many different components to form functional protein complexes (Vermaas 2001) and networks for 23 effective electron transfer (Anraku 1988). After the endosymbiotic events in which a free-living 24 cyanobacterium and an alpha-proteobacterium gave rise respectively to the chloroplasts and 25 mitochondria found in eukaryotic cells, there was migration of essential genes from the “new organelle” 26 to the nucleus. -
Title: Therapeutic Potential of HSP90 Inhibition for Neurofibromatosis Type 2
Author Manuscript Published OnlineFirst on May 28, 2013; DOI: 10.1158/1078-0432.CCR-12-3167 Author manuscripts have been peer reviewed and accepted for publication but have not yet been edited. Title: Therapeutic Potential of HSP90 Inhibition for Neurofibromatosis type 2 Karo Tanaka1, Ascia Eskin3, Fabrice Chareyre1, Walter J. Jessen4, Jan Manent5, Michiko Niwa-Kawakita6, Ruihong Chen7, Cory H. White2, Jeremie Vitte1, Zahara M. Jaffer1, Stanley F. Nelson3, Allan E. Rubenstein8, Marco Giovannini1,9§. Authors’ affiliations: House Research Institute, 1Center for Neural Tumor Research and 2Section on Genetics of Hereditary Ear Disorders, Los Angeles, CA; 3Department of Human Genetics, University of California, Los Angeles, CA; 4Informatics, Covance Inc., Princeton, NJ; 5Peter MacCallum Cancer Institute, Melbourne, Australia; 6Inserm U944, CNRS U7212, Université Paris, Institut Universitaire d'Hématologie, Paris, France; 7NexGenix Pharmaceuticals, Burlingame, CA; and 8New York University Langone Medical Center, New York, NY; and Department of Cell and Neurobiology, University of Southern California, Keck School of Medicine, Los Angeles, CA Running title: HSP90 Inhibition for NF2 Keywords: NF2, HSP90 inhibitors, Transcriptome Financial support: This work was supported by a Drug Discovery Initiative Award, Children’s Tumor Foundation, to M.G., and by the House Research Institute. Corresponding author: Marco Giovannini, House Research Institute, Center for Neural Tumor Research, 2100 West 3rd street, Los Angeles, CA90057. Phone: +1-213-989-6708; Fax: +1-213-989-6778; E-mail: [email protected] 1 Downloaded from clincancerres.aacrjournals.org on September 30, 2021. © 2013 American Association for Cancer Research. Author Manuscript Published OnlineFirst on May 28, 2013; DOI: 10.1158/1078-0432.CCR-12-3167 Author manuscripts have been peer reviewed and accepted for publication but have not yet been edited. -
TAL Effectors Are Remote Controls for Gene Activation Heidi Scholze and Jens Boch
Available online at www.sciencedirect.com TAL effectors are remote controls for gene activation Heidi Scholze and Jens Boch TAL (transcription activator-like) effectors constitute a novel typically 34 amino acids (aa) long, but also repeat types of class of DNA-binding proteins with predictable specificity. They 30–42 aa can be found ([2], Figure 2). The last repeat is are employed by Gram-negative plant-pathogenic bacteria of only a half repeat. Repeat-to-repeat variations are limited the genus Xanthomonas which translocate a cocktail of to a few aa positions including two hypervariable residues different effector proteins via a type III secretion system (T3SS) at positions 12 and 13 per repeat. Typically, TALs differ into plant cells where they serve as virulence determinants. in their number of repeats while most contain 15.5–19.5 Inside the plant cell, TALs localize to the nucleus, bind to target repeats [2]. The repeat domain determines the specificity promoters, and induce expression of plant genes. DNA-binding of TALs which is mediated by selective DNA binding specificity of TALs is determined by a central domain of tandem [7,8]. TAL repeats constitute a novel type of DNA- repeats. Each repeat confers recognition of one base pair (bp) binding domain [7] distinct from classical zinc finger, in the DNA. Rearrangement of repeat modules allows design of helix–turn–helix, and leucine zipper motifs. proteins with desired DNA-binding specificities. Here, we summarize how TAL specificity is encoded, first structural data The recent understanding of the DNA-recognition speci- and first data on site-specific TAL nucleases. -
CDC23 (NM 004661) Human Tagged ORF Clone Product Data
OriGene Technologies, Inc. 9620 Medical Center Drive, Ste 200 Rockville, MD 20850, US Phone: +1-888-267-4436 [email protected] EU: [email protected] CN: [email protected] Product datasheet for RC224973L3 CDC23 (NM_004661) Human Tagged ORF Clone Product data: Product Type: Expression Plasmids Product Name: CDC23 (NM_004661) Human Tagged ORF Clone Tag: Myc-DDK Symbol: CDC23 Synonyms: ANAPC8; APC8; CUT23 Vector: pLenti-C-Myc-DDK-P2A-Puro (PS100092) E. coli Selection: Chloramphenicol (34 ug/mL) Cell Selection: Puromycin ORF Nucleotide The ORF insert of this clone is exactly the same as(RC224973). Sequence: Restriction Sites: SgfI-MluI Cloning Scheme: ACCN: NM_004661 ORF Size: 1791 bp This product is to be used for laboratory only. Not for diagnostic or therapeutic use. View online » ©2021 OriGene Technologies, Inc., 9620 Medical Center Drive, Ste 200, Rockville, MD 20850, US 1 / 2 CDC23 (NM_004661) Human Tagged ORF Clone – RC224973L3 OTI Disclaimer: The molecular sequence of this clone aligns with the gene accession number as a point of reference only. However, individual transcript sequences of the same gene can differ through naturally occurring variations (e.g. polymorphisms), each with its own valid existence. This clone is substantially in agreement with the reference, but a complete review of all prevailing variants is recommended prior to use. More info OTI Annotation: This clone was engineered to express the complete ORF with an expression tag. Expression varies depending on the nature of the gene. RefSeq: NM_004661.3 RefSeq Size: 3169 bp RefSeq ORF: 1794 bp Locus ID: 8697 UniProt ID: Q9UJX2 Domains: TPR, APC8 Protein Families: Druggable Genome Protein Pathways: Cell cycle, Oocyte meiosis, Progesterone-mediated oocyte maturation, Ubiquitin mediated proteolysis MW: 68.7 kDa Gene Summary: The protein encoded by this gene shares strong similarity with Saccharomyces cerevisiae Cdc23, a protein essential for cell cycle progression through the G2/M transition. -
Eukaryotic Genome Annotation
Comparative Features of Multicellular Eukaryotic Genomes (2017) (First three statistics from www.ensembl.org; other from original papers) C. elegans A. thaliana D. melanogaster M. musculus H. sapiens Species name Nematode Thale Cress Fruit Fly Mouse Human Size (Mb) 103 136 143 3,482 3,555 # Protein-coding genes 20,362 27,655 13,918 22,598 20,338 (25,498 (13,601 original (30,000 (30,000 original est.) original est.) original est.) est.) Transcripts 58,941 55,157 34,749 131,195 200,310 Gene density (#/kb) 1/5 1/4.5 1/8.8 1/83 1/97 LINE/SINE (%) 0.4 0.5 0.7 27.4 33.6 LTR (%) 0.0 4.8 1.5 9.9 8.6 DNA Elements 5.3 5.1 0.7 0.9 3.1 Total repeats 6.5 10.5 3.1 38.6 46.4 Exons % genome size 27 28.8 24.0 per gene 4.0 5.4 4.1 8.4 8.7 average size (bp) 250 506 Introns % genome size 15.6 average size (bp) 168 Arabidopsis Chromosome Structures Sorghum Whole Genome Details Characterizing the Proteome The Protein World • Sequencing has defined o Many, many proteins • How can we use this data to: o Define genes in new genomes o Look for evolutionarily related genes o Follow evolution of genes ▪ Mixing of domains to create new proteins o Uncover important subsets of genes that ▪ That deep phylogenies • Plants vs. animals • Placental vs. non-placental animals • Monocots vs. dicots plants • Common nomenclature needed o Ensure consistency of interpretations InterPro (http://www.ebi.ac.uk/interpro/) Classification of Protein Families • Intergrated documentation resource for protein super families, families, domains and functional sites o Mitchell AL, Attwood TK, Babbitt PC, et al. -
Modern Trends in Plant Genome Editing: an Inclusive Review of the CRISPR/Cas9 Toolbox
International Journal of Molecular Sciences Review Modern Trends in Plant Genome Editing: An Inclusive Review of the CRISPR/Cas9 Toolbox Ali Razzaq 1 , Fozia Saleem 1, Mehak Kanwal 2, Ghulam Mustafa 1, Sumaira Yousaf 2, Hafiz Muhammad Imran Arshad 2, Muhammad Khalid Hameed 3, Muhammad Sarwar Khan 1 and Faiz Ahmad Joyia 1,* 1 Centre of Agricultural Biochemistry and Biotechnology (CABB), University of Agriculture, Faisalabad 38040, Pakistan 2 Nuclear Institute for Agriculture and Biology (NIAB), P.O. Box 128, Faisalabad 38000, Pakistan 3 School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai 200240, China * Correspondence: [email protected] Received: 14 June 2019; Accepted: 15 August 2019; Published: 19 August 2019 Abstract: Increasing agricultural productivity via modern breeding strategies is of prime interest to attain global food security. An array of biotic and abiotic stressors affect productivity as well as the quality of crop plants, and it is a primary need to develop crops with improved adaptability, high productivity, and resilience against these biotic/abiotic stressors. Conventional approaches to genetic engineering involve tedious procedures. State-of-the-art OMICS approaches reinforced with next-generation sequencing and the latest developments in genome editing tools have paved the way for targeted mutagenesis, opening new horizons for precise genome engineering. Various genome editing tools such as transcription activator-like effector nucleases (TALENs), zinc-finger nucleases (ZFNs), and meganucleases (MNs) have enabled plant scientists to manipulate desired genes in crop plants. However, these approaches are expensive and laborious involving complex procedures for successful editing. Conversely, CRISPR/Cas9 is an entrancing, easy-to-design, cost-effective, and versatile tool for precise and efficient plant genome editing. -
Identification and Characterisation of Human Cytomegalovirus-Mediated Degradation of Helicase-Like Transcription Factor
Identification and Characterisation of Human Cytomegalovirus-Mediated Degradation of Helicase-Like Transcription Factor Kai-Min Lin Department of Medicine Cambridge Institute for Medical Research University of Cambridge This dissertation is submitted for the degree of Doctor of Philosophy Fitzwilliam College September 2020 Declaration I hereby declare, that except where specific reference is made to the work of others, the contents of this dissertation are original and have not been submitted in whole or in part for consideration for any other degree of qualification in this, or any other university. This dissertation is the result of my own work and includes nothing which is the outcome of work done in collaboration except as where specified in the text and acknowledgments. This dissertation does not exceed the specified word limit of 60,000 words as defined by the Degree Committee, excluding figures, photographs, tables, appendices and bibliography. Kai-Min Lin September, 2020 I Summary Identification and characterisation of human cytomegalovirus-mediated degradation of helicase-like transcription factor Kai-Min Lin Viruses are known to degrade host factors that are important in innate antiviral immunity in order to infect successfully. To systematically identify host proteins targeted for early degradation by human cytomegalovirus (HCMV), the lab developed orthogonal screens using high resolution multiplexed mass spectrometry. Taking advantage of broad and selective proteasome and lysosome inhibitors, proteasomal degradation was found to be heavily exploited by HCMV. Several known antiviral restriction factors, including components of cellular promyelocytic leukemia (PML) were enriched in a shortlist of proteasomally degraded proteins during infection. A particularly robust novel ‘hit’ was helicase-like transcription factor (HLTF), a DNA repair protein that participates in error-free repair of stalled replication forks. -
The Genome of Schmidtea Mediterranea and the Evolution Of
OPEN ArtICLE doi:10.1038/nature25473 The genome of Schmidtea mediterranea and the evolution of core cellular mechanisms Markus Alexander Grohme1*, Siegfried Schloissnig2*, Andrei Rozanski1, Martin Pippel2, George Robert Young3, Sylke Winkler1, Holger Brandl1, Ian Henry1, Andreas Dahl4, Sean Powell2, Michael Hiller1,5, Eugene Myers1 & Jochen Christian Rink1 The planarian Schmidtea mediterranea is an important model for stem cell research and regeneration, but adequate genome resources for this species have been lacking. Here we report a highly contiguous genome assembly of S. mediterranea, using long-read sequencing and a de novo assembler (MARVEL) enhanced for low-complexity reads. The S. mediterranea genome is highly polymorphic and repetitive, and harbours a novel class of giant retroelements. Furthermore, the genome assembly lacks a number of highly conserved genes, including critical components of the mitotic spindle assembly checkpoint, but planarians maintain checkpoint function. Our genome assembly provides a key model system resource that will be useful for studying regeneration and the evolutionary plasticity of core cell biological mechanisms. Rapid regeneration from tiny pieces of tissue makes planarians a prime De novo long read assembly of the planarian genome model system for regeneration. Abundant adult pluripotent stem cells, In preparation for genome sequencing, we inbred the sexual strain termed neoblasts, power regeneration and the continuous turnover of S. mediterranea (Fig. 1a) for more than 17 successive sib- mating of all cell types1–3, and transplantation of a single neoblast can rescue generations in the hope of decreasing heterozygosity. We also developed a lethally irradiated animal4. Planarians therefore also constitute a a new DNA isolation protocol that meets the purity and high molecular prime model system for stem cell pluripotency and its evolutionary weight requirements of PacBio long-read sequencing12 (Extended Data underpinnings5. -
A New Census of Protein Tandem Repeats and Their Relationship with Intrinsic Disorder
G C A T T A C G G C A T genes Article A New Census of Protein Tandem Repeats and Their Relationship with Intrinsic Disorder Matteo Delucchi 1,2 , Elke Schaper 1,2,† , Oxana Sachenkova 3,‡, Arne Elofsson 3 and Maria Anisimova 1,2,* 1 ZHAW Life Sciences und Facility Management, Applied Computational Genomics, 8820 Wädenswil, Switzerland; [email protected] 2 Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland 3 Science of Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, 106 91 Stockholm, Sweden * Correspondence: [email protected]; Tel.: +41-(0)58-934-5882 † Present address: Carbon Delta AG, 8002 Zürich, Switzerland. ‡ Present address: Vildly AB, 385 31 Kalmar, Sweden. Received: 9 March 2020; Accepted: 1 April 2020; Published: 9 April 2020 Abstract: Protein tandem repeats (TRs) are often associated with immunity-related functions and diseases. Since that last census of protein TRs in 1999, the number of curated proteins increased more than seven-fold and new TR prediction methods were published. TRs appear to be enriched with intrinsic disorder and vice versa. The significance and the biological reasons for this association are unknown. Here, we characterize protein TRs across all kingdoms of life and their overlap with intrinsic disorder in unprecedented detail. Using state-of-the-art prediction methods, we estimate that 50.9% of proteins contain at least one TR, often located at the sequence flanks. Positive linear correlation between the proportion of TRs and the protein length was observed universally, with Eukaryotes in general having more TRs, but when the difference in length is taken into account the difference is quite small.