Plant Systems Biology Comes of Age
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Arabidopsis Thaliana
Downloaded from genome.cshlp.org on September 28, 2021 - Published by Cold Spring Harbor Laboratory Press RESEARCH A Physical Map of Chromosome 2 of Arabidopsis thaliana Eve Ann Zachgo, 2,4 Ming Li Wang, 1'2'4 Julia Dewdney, 1'2 David Bouchez, 3 Christine Carnilleri, 3 Stephen Belmonte, 2 Lu Huang, 2 Maureen Dolan, 2 and Howard M. Goodman 1'2'5 1Department of Genetics, Harvard Medical School and 2Department of Molecular Biology, Massachusetts General Hospital, Boston, Massachusetts 02114; 3Laboratoire de Biologie Cellulaire, Institut National de la Recherche Agronomique (INRA), 78026 Versailles CEDEX, France A yeast artificial chromosome (YAC] physical map of chromosome 2 of Arabidopsis thaliana has been constructed by hybridization of 69 DNA markers and 61 YAC end probes to gridded arrays of YAC clones. Thirty-four YACs in four contigs define the chromosome. Complete closure of the map was not attained because some regions of the chromosome were repetitive or were not represented in the YAC library. Based on the sizes of the YACs and their coverage of the chromosome, the length of chromosome 2 is estimated to be at least 18 Mb. These data provide the means for immediately identifying the YACs containing a genetic locus mapped on Arabidopsis chromosome 2. The small flowering plant Arabidopsis thaliana is ters (Maluszynska and Heslop-Harrison 1991; A1- an excellent model system for metabolic, genetic, bini 1994; Copenhaver et al. 1995). We present and developmental studies in plants. Its haploid here a YAC contig physical map of chromosome nuclear genome is small (-100 Mb), consisting of 2 of A. -
To Find Information About Arabidopsis Genes Leonore Reiser1, Shabari
UNIT 1.11 Using The Arabidopsis Information Resource (TAIR) to Find Information About Arabidopsis Genes Leonore Reiser1, Shabari Subramaniam1, Donghui Li1, and Eva Huala1 1Phoenix Bioinformatics, Redwood City, CA USA ABSTRACT The Arabidopsis Information Resource (TAIR; http://arabidopsis.org) is a comprehensive Web resource of Arabidopsis biology for plant scientists. TAIR curates and integrates information about genes, proteins, gene function, orthologs gene expression, mutant phenotypes, biological materials such as clones and seed stocks, genetic markers, genetic and physical maps, genome organization, images of mutant plants, protein sub-cellular localizations, publications, and the research community. The various data types are extensively interconnected and can be accessed through a variety of Web-based search and display tools. This unit primarily focuses on some basic methods for searching, browsing, visualizing, and analyzing information about Arabidopsis genes and genome, Additionally we describe how members of the community can share data using TAIR’s Online Annotation Submission Tool (TOAST), in order to make their published research more accessible and visible. Keywords: Arabidopsis ● databases ● bioinformatics ● data mining ● genomics INTRODUCTION The Arabidopsis Information Resource (TAIR; http://arabidopsis.org) is a comprehensive Web resource for the biology of Arabidopsis thaliana (Huala et al., 2001; Garcia-Hernandez et al., 2002; Rhee et al., 2003; Weems et al., 2004; Swarbreck et al., 2008, Lamesch, et al., 2010, Berardini et al., 2016). The TAIR database contains information about genes, proteins, gene expression, mutant phenotypes, germplasms, clones, genetic markers, genetic and physical maps, genome organization, publications, and the research community. In addition, seed and DNA stocks from the Arabidopsis Biological Resource Center (ABRC; Scholl et al., 2003) are integrated with genomic data, and can be ordered through TAIR. -
In Silico Tools for Splicing Defect Prediction: a Survey from the Viewpoint of End Users
© American College of Medical Genetics and Genomics REVIEW In silico tools for splicing defect prediction: a survey from the viewpoint of end users Xueqiu Jian, MPH1, Eric Boerwinkle, PhD1,2 and Xiaoming Liu, PhD1 RNA splicing is the process during which introns are excised and informaticians in relevant areas who are working on huge data sets exons are spliced. The precise recognition of splicing signals is critical may also benefit from this review. Specifically, we focus on those tools to this process, and mutations affecting splicing comprise a consider- whose primary goal is to predict the impact of mutations within the able proportion of genetic disease etiology. Analysis of RNA samples 5′ and 3′ splicing consensus regions: the algorithms used by different from the patient is the most straightforward and reliable method to tools as well as their major advantages and disadvantages are briefly detect splicing defects. However, currently, the technical limitation introduced; the formats of their input and output are summarized; prohibits its use in routine clinical practice. In silico tools that predict and the interpretation, evaluation, and prospection are also discussed. potential consequences of splicing mutations may be useful in daily Genet Med advance online publication 21 November 2013 diagnostic activities. In this review, we provide medical geneticists with some basic insights into some of the most popular in silico tools Key Words: bioinformatics; end user; in silico prediction tool; for splicing defect prediction, from the viewpoint of end users. Bio- medical genetics; splicing consensus region; splicing mutation INTRODUCTION TO PRE-mRNA SPLICING AND small nuclear ribonucleoproteins and more than 150 proteins, MUTATIONS AFFECTING SPLICING serine/arginine-rich (SR) proteins, heterogeneous nuclear ribo- Sixty years ago, the milestone discovery of the double-helix nucleoproteins, and the regulatory complex (Figure 1). -
Comprehensive Analysis of CRISPR/Cas9-Mediated Mutagenesis in Arabidopsis Thaliana by Genome-Wide Sequencing
International Journal of Molecular Sciences Article Comprehensive Analysis of CRISPR/Cas9-Mediated Mutagenesis in Arabidopsis thaliana by Genome-Wide Sequencing Wenjie Xu 1,2 , Wei Fu 2, Pengyu Zhu 2, Zhihong Li 1, Chenguang Wang 2, Chaonan Wang 1,2, Yongjiang Zhang 2 and Shuifang Zhu 1,2,* 1 College of Plant Protection, China Agricultural University, Beijing 100193 China 2 Institute of Plant Quarantine, Chinese Academy of Inspection and Quarantine, Beijing 100176, China * Correspondence: [email protected] Received: 9 July 2019; Accepted: 21 August 2019; Published: 23 August 2019 Abstract: The clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated protein (Cas) system has been widely applied in functional genomics research and plant breeding. In contrast to the off-target studies of mammalian cells, there is little evidence for the common occurrence of off-target sites in plants and a great need exists for accurate detection of editing sites. Here, we summarized the precision of CRISPR/Cas9-mediated mutations for 281 targets and found that there is a preference for single nucleotide deletions/insertions and longer deletions starting from 40 nt upstream or ending at 30 nt downstream of the cleavage site, which suggested the candidate sequences for editing sites detection by whole-genome sequencing (WGS). We analyzed the on-/off-target sites of 6 CRISPR/Cas9-mediated Arabidopsis plants by the optimized method. The results showed that the on-target editing frequency ranged from 38.1% to 100%, and one off target at a frequency of 9.8%–97.3% cannot be prevented by increasing the specificity or reducing the expression level of the Cas9 enzyme. -
Locdb: Experimental Annotations of Localization for Homo Sapiens and Arabidopsis Thaliana Shruti Rastogi1,* and Burkhard Rost1,2,3
D230–D234 Nucleic Acids Research, 2011, Vol. 39, Database issue Published online 11 November 2010 doi:10.1093/nar/gkq927 LocDB: experimental annotations of localization for Homo sapiens and Arabidopsis thaliana Shruti Rastogi1,* and Burkhard Rost1,2,3 1Department of Biochemistry and Molecular Biophysics, Columbia University, 701 West, 168th Street, New York, NY 10032, USA, 2Technical University Munich, Bioinformatics, Department of Computer Science and Institute of Advanced Studies (IAS) and 3New York Consortium on Membrane Protein Structure (NYCOMPS), TUM Bioinformatics, Boltzmannstr. 3, 85748 Garching, Germany Downloaded from Received August 15, 2010; Revised September 24, 2010; Accepted September 27, 2010 ABSTRACT stepped up to the challenge to increase the amount of annotations (6–15). These data sets capture aspects of LocDB is a manually curated database with protein function and, more generally, of global cellular nar.oxfordjournals.org experimental annotations for the subcellular processes. localizations of proteins in Homo sapiens (HS, UniProt (release 2010_07) (16) constitutes the most human) and Arabidopsis thaliana (AT, thale cress). comprehensive and, arguably, the most accurate Currently, it contains entries for 19 604 UniProt resource with experimental annotations of subcellular proteins (HS: 13 342; AT: 6262). Each database localization. However, even this excellent resource entry contains the experimentally derived locali- remains incomplete for the proteomes from Homo sapiens (HS) and Arabidopsis thaliana (AT): of the zation in Gene Ontology (GO) terminology, the at Medical Center Library, Duke University on January 13, 2011 experimental annotation of localization, localization 20 282 human proteins in Swiss-Prot (17), 14 502 have predictions by state-of-the-art methods and, where annotations of localization (72%), but for only 3720 (18%) these annotations are experimental. -
A Bioinformatics Analysis of the Arabidopsis Thaliana Epigenome Ikhlak Ahmed
A bioinformatics analysis of the arabidopsis thaliana epigenome Ikhlak Ahmed To cite this version: Ikhlak Ahmed. A bioinformatics analysis of the arabidopsis thaliana epigenome. Agricultural sciences. Université Paris Sud - Paris XI, 2011. English. NNT : 2011PA112230. tel-00684391 HAL Id: tel-00684391 https://tel.archives-ouvertes.fr/tel-00684391 Submitted on 2 Apr 2012 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. 2011 PhD Thesis Ikhlak Ahmed Laboratoire: CNRS UMR8197 - INSERM U1024 Institut de Biologie de l'ENS(IBENS) Ecole Doctorale : Sciences Du Végétal A Bioinformatics analysis of the Arabidopsis thaliana Epigenome Supervisors: Dr. Chris Bowler and Dr. Vincent Colot Jury Prof. Dao-Xiu Zhou President Prof. Christian Fankhauser Rapporteur Dr. Raphaël Margueron Rapporteur Dr. Chris Bowler Examiner Dr. Vincent Colot Examiner Dr. Allison Mallory Examiner Dr. Michaël Weber Examiner Acknowledgement I express my deepest gratitude to my supervisors Dr. Chris Bowler and Dr. Vincent Colot for the support, advice and freedom that I enjoyed working with them. I am very thankful to Chris Bowler who trusted in me and gave me the opportunity to come here and work in the excellent possible conditions. I have no words to express my thankfulness to Vincent Colot for his enthusiastic supervision and the incredibly valuable sessions we spent together that shaped my understanding of DNA methylation. -
Mutator-Like Elements in Arabidopsis Thaliana: Structure, Diversity and Evolution
Copyright 2000 by the Genetics Society of America Mutator-like Elements in Arabidopsis thaliana: Structure, Diversity and Evolution Zhihui Yu, Stephen I. Wright and Thomas E. Bureau Department of Biology, McGill University, Montreal, Quebec, H3A 1B1 Canada Manuscript received May 18, 2000 Accepted for publication September 11, 2000 ABSTRACT While genome-wide surveys of abundance and diversity of mobile elements have been conducted for some class I transposable element families, little is known about the nature of class II transposable elements on this scale. In this report, we present the results from analysis of the sequence and structural diversity of Mutator-like elements (MULEs) in the genome of Arabidopsis thaliana (Columbia). Sequence similarity searches and subsequent characterization suggest that MULEs exhibit extreme structure, sequence, and size heterogeneity. Multiple alignments at the nucleotide and amino acid levels reveal conserved, potentially transposition-related sequence motifs. While many MULEs share common structural features to Mu ele- ments in maize, some groups lack characteristic long terminal inverted repeats. High sequence similarity and phylogenetic analyses based on nucleotide sequence alignments indicate that many of these elements with diverse structural features may remain transpositionally competent and that multiple MULE lineages may have been evolving independently over long time scales. Finally, there is evidence that MULEs are capable of the acquisition of host DNA segments, which may have implications for adaptive evolution, both at the element and host levels. HE Mutator (Mu) system is a diverse family of class ef®cient transposon-tagging tool for maize gene isola- TII transposable elements (TEs) found in maize. tion (Walbot 1992). -
The Embryonic Transcriptome of Arabidopsis Thaliana
bioRxiv preprint doi: https://doi.org/10.1101/479584; this version posted November 27, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. Title: The embryonic transcriptome of Arabidopsis thaliana 1,2 1,2 1 Authors: Falko Hofmann , Michael A. Schon and Michael D. Nodine 1 Affiliations: Gregor Mendel Institute (GMI), Austrian Academy of Sciences, Vienna Biocenter 2 (VBC), Dr. Bohr-Gasse 3, 1030 Vienna, Austria; These authors contributed equally to this work Corresponding author: Michael D. Nodine (email: [email protected]; phone: +43 1 79044-9821) Author ORCID IDs: Falko Hofmann (#0000-0002-5032-0411), Michael A. Schon (#0000-0002-4756-3906) and Michael D. Nodine (#0000-0002-6204-8857) 1 bioRxiv preprint doi: https://doi.org/10.1101/479584; this version posted November 27, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. Key Message Arabidopsis embryos possess unique transcriptomes relative to other plant tissues including somatic embryos, and can be partitioned into four transcriptional phases with characteristic biological processes. 2 bioRxiv preprint doi: https://doi.org/10.1101/479584; this version posted November 27, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. Abstract Cellular differentiation is associated with changes in transcript populations. Accurate quantification of transcriptomes during development can thus provide global insights into differentiation processes including the fundamental specification and differentiation events operating during plant embryogenesis. -
Evolutionary Systems Biology 12 – 14 February 2020 Wellcome
Evolutionary Systems Biology 12 – 14 February 2020 Wellcome Genome Campus, Hinxton, Cambridge, UK Draft Conference Programme Wednesday, 12 February 12:00-13:20 Registration 13:20-13:30 Welcome and introductions Mark Siegal, New York University, USA 13:30-14:30 Keynote lecture Chair: Trisha Wittkopp, University of Michigan, USA Robustness and precision within developmental networks Eileen Furlong EMBL, Germany 14:30-15:30 Session 1: Evolution of Fitness Landscapes Chair: Olivier Tenaillon, INSERM, France 14:30 Understanding genetic interactions and solving protein structures using deep mutagenesis Ben Lehner Centre for Genomic Regulation, Spain 15:00 Latent phenotypic complexity of adaptation in a single condition Dmitri Petrov Stanford University, USA 15:30 Pervasive modulation of a natural genotype-phenotype map by global epistatic modifiers José Aguilar-Rodríguez Stanford University, USA 15:45-16:15 Afternoon tea 16:15-17:15 Session 1b: The diversity of evolutionary systems biology 16:15 The evolution of fitness effects in a long-term experiment with bacteria Alejandro Couce Technical University of Madrid, Spain 16:30 Instraspecific evolution of QR.pax final position in Caenorhabditis elegans Clement Dubois IBENS, France 16:45 Functional optimization governs the rate of protein evolution in multicellular species Dinara Usmanova Columbia University, USA 17:00 Toward an ecology of cancer: Intra-tumor spatial heterogeneity based on multi-region mass spectrometry, single cell copy number sequencing and cyclic immunofluorescence in lung -
In Silico Protein Design: a Combinatorial and Global Optimization Approach by John L
From SIAM News, Volume 37, Number 1, January/February 2004 In Silico Protein Design: A Combinatorial and Global Optimization Approach By John L. Klepeis and Christodoulos A. Floudas The use of computational techniques to create peptide- and protein-based therapeutics is an important challenge in medicine. The ultimate goal, defined about two decades ago, is to use computer algorithms to identify amino acid sequences that not only adopt particular three-dimensional structures but also perform specific functions. To those familiar with the field of structural biology, it is certainly not surprising that this problem has been described as “inverse protein folding” [16]. That is, while the grand challenge of protein folding is to understand how a particular protein, defined by its amino acid sequence, finds its unique three-dimensional structure, protein design involves the discovery of sets of amino acid sequences that form functional proteins and fold into specific target structures. Experimental, computational, and hybrid approaches have all contributed to advances in protein design. Applying mutagenesis and rational design techniques, for example, experimentalists have created enzymes with altered functionalities and increased stability. The coverage of sequence space is highly restricted for these techniques, however [4]. An approach that samples more diverse sequences, called directed protein evolution, iteratively applies the techniques of genetic recombination and in vitro functional assays [1]. These methods, although they do a better job of sampling sequence space and generating functionally diverse proteins, are still restricted to the screening of 103 – 106 sequences [22]. Challenges of Generic Computational Protein Design The limitations of experimental techniques serve to highlight the importance of computational protein design. -
7 Systems Biotechnology: Combined in Silico and Omics Analyses for The
0195300815_0193-0231_ Ch-07.qxd 23/6/06 4:56 PM Page 193 7 Systems Biotechnology: Combined in Silico and Omics Analyses for the Improvement of Microorganisms for Industrial Applications Sang Yup Lee, Dong-Yup Lee, Tae Yong Kim, Byung Hun Kim, & Sang Jun Lee Biotechnology plays an increasingly important role in the healthcare, pharmaceutical, chemical, food, and agricultural industries. Microorganisms have been successfully employed for the production of recombinant proteins [1–4] and various primary and secondary metabolites [5–8]. As in other engineering disciplines, one of the ulti- mate goals of industrial biotechnology is to develop lower-cost and higher-yield processes. Toward this goal, fermentation and down- stream processes have been significantly improved thanks to the effort of biochemical engineers [9]. In addition to the effort of making these mid- to downstream processes more efficient, microbial strains have been improved by recombinant and other molecular biological methods, leading to the increase in microbial metabolic activities toward desired goals [10]. However, these conventional attempts have not always been successful owing to unexpected changes in the physiology and metabolism of host cells. Alternatively, rational meta- bolic and cellular engineering approaches have been tried to solve such problems in a number of cases, but they were also still limited to the manipulation of only a handful of genes encoding enzymes and regulatory proteins. In this regard, systematic approaches are indeed required not only for understanding the global context of the meta- bolic system but also for designing better metabolic engineering strategies. Recent advances in high-throughput experimental techniques have resulted in rapid accumulation of a wide range of biological data and information at different levels: genome, transcriptome, pro- teome, metabolome, and fluxome [11–17]. -
In Silico Prediction of Protein Flexibility with Local Structure Approach
Bioinformatics protein flexibility prediction Preprint – accepted for publication in Biochimie in silico prediction of protein flexibility with local structure approach. Tarun J. Narwani1,2,3,+, Catherine Etchebest1,2,3,+, Pierrick Craveur1,2,3,4, Sylvain Léonard1,2,3, Joseph Rebehmed1,2,3,5, Narayanaswamy Srinivasan6, Aurélie Bornot1,2,3, #, Jean-Christophe Gelly1,2,3 & Alexandre G. de Brevern1,2,3,4,* 1 INSERM, U 1134, DSIMB, Univ Paris, Univ de la Réunion, Univ des Antilles, F-75739 Paris, France. 2 Institut National de la Transfusion Sanguine (INTS), F-75739 Paris, France. 3 Laboratoire d'Excellence GR-Ex, F-75739 Paris, France. 4 Molecular Graphics Laboratory, Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA 92037, USA. 5 Department of Computer Science and Mathematics, Lebanese American University, Byblos 1h401 2010, Lebanon. 9 MBU, IISc, Bangalore, India # Present address: AstraZeneca, Discovery Sciences, Computational Biology, Alderley Park UK. Short title: bioinformatics protein flexibility prediction * Corresponding author: Mailing address: Dr. de Alexandre G. de Brevern, INSERM UMR_S 1134, DSIMB, Université Paris, Institut National de Transfusion Sanguine (INTS), 6, rue Alexandre Cabanel, 75739 Paris cedex 15, France e-mail : [email protected] 1 Bioinformatics protein flexibility prediction Abstract Flexibility is an intrinsic essential feature of protein structures, directly linked to their functions. To this day, most of the prediction methods use the crystallographic data (namely B-factors) as the only indicator of protein’s inner flexibility and predicts them as rigid or flexible. PredyFlexy stands differently from other approaches as it relies on the definition of protein flexibility (i) not only taken from crystallographic data, but also (ii) from Root Mean Square Fluctuation (RMSFs) observed in Molecular Dynamics simulations.