Digital Sequence Information on Genetic Resources: Concept, Scope and 2 Current Use
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Whole Genome Sequencing and Comparative Genomic Analysis Of
Li et al. BMC Genomics (2020) 21:181 https://doi.org/10.1186/s12864-020-6593-1 RESEARCH ARTICLE Open Access Whole genome sequencing and comparative genomic analysis of oleaginous red yeast Sporobolomyces pararoseus NGR identifies candidate genes for biotechnological potential and ballistospores-shooting Chun-Ji Li1,2, Die Zhao3, Bing-Xue Li1* , Ning Zhang4, Jian-Yu Yan1 and Hong-Tao Zou1 Abstract Background: Sporobolomyces pararoseus is regarded as an oleaginous red yeast, which synthesizes numerous valuable compounds with wide industrial usages. This species hold biotechnological interests in biodiesel, food and cosmetics industries. Moreover, the ballistospores-shooting promotes the colonizing of S. pararoseus in most terrestrial and marine ecosystems. However, very little is known about the basic genomic features of S. pararoseus. To assess the biotechnological potential and ballistospores-shooting mechanism of S. pararoseus on genome-scale, the whole genome sequencing was performed by next-generation sequencing technology. Results: Here, we used Illumina Hiseq platform to firstly assemble S. pararoseus genome into 20.9 Mb containing 54 scaffolds and 5963 predicted genes with a N50 length of 2,038,020 bp and GC content of 47.59%. Genome completeness (BUSCO alignment: 95.4%) and RNA-seq analysis (expressed genes: 98.68%) indicated the high-quality features of the current genome. Through the annotation information of the genome, we screened many key genes involved in carotenoids, lipids, carbohydrate metabolism and signal transduction pathways. A phylogenetic assessment suggested that the evolutionary trajectory of the order Sporidiobolales species was evolved from genus Sporobolomyces to Rhodotorula through the mediator Rhodosporidiobolus. Compared to the lacking ballistospores Rhodotorula toruloides and Saccharomyces cerevisiae, we found genes enriched for spore germination and sugar metabolism. -
Sequence-Selective Recognition of Double-Stranded RNA And
Downloaded from rnajournal.cshlp.org on October 8, 2021 - Published by Cold Spring Harbor Laboratory Press Hnedzko et al. Sequence-Selective Recognition of Double-Stranded RNA and Enhanced Cellular Uptake of Cationic Nucleobase and Backbone- Modified Peptide Nucleic Acids Dziyana Hnedzko1,*, Dennis W. McGee,2 Yannis A. Karamitas1 and Eriks Rozners1,* Departments of 1 Chemistry and 2 Biological Sciences, Binghamton University, The State University of New York, Binghamton, New York 13902, United States. * To whom correspondence should be addressed. Tel: +1-607-777-2441; Fax: +1-607-777- 4478; Email: [email protected] and [email protected] Running Tittle: RNA recognition and cellular uptake of PNA 1 Downloaded from rnajournal.cshlp.org on October 8, 2021 - Published by Cold Spring Harbor Laboratory Press Hnedzko et al. ABSTRACT Sequence-selective recognition of complex RNAs in live cells could find broad applications in biology, biomedical research and biotechnology. However, specific recognition of structured RNA is challenging and generally applicable and effective methods are lacking. Recently, we found that peptide nucleic acids (PNAs) were unusually well suited ligands for recognition of double-stranded RNAs. Herein, we report that 2-aminopyridine (M) modified PNAs and their conjugates with lysine and arginine tripeptides form strong (Ka = 9.4 to 17 × 107 M-1) and sequence-selective triple helices with RNA hairpins at physiological pH and salt concentration. The affinity of PNA-peptide conjugates for the matched RNA hairpins was unusually high compared to the much lower affinity for DNA hairpins of the same 7 -1 sequence (Ka = 0.05 to 0.11 × 10 M ). The binding of double-stranded RNA by M-modified 4 -1 -1 PNA-peptide conjugates was a relatively fast process (kon = 2.9 × 10 M s ) compared to 3 -1 -1 the notoriously slow triple helix formation by oligodeoxynucleotides (kon ~ 10 M s ). -
Sequence Assembly Demystified
REVIEWS COMPUTATIONAL TOOLS Sequence assembly demystified Niranjan Nagarajan1 and Mihai Pop2 Abstract | Advances in sequencing technologies and increased access to sequencing services have led to renewed interest in sequence and genome assembly. Concurrently, new applications for sequencing have emerged, including gene expression analysis, discovery of genomic variants and metagenomics, and each of these has different needs and challenges in terms of assembly. We survey the theoretical foundations that underlie modern assembly and highlight the options and practical trade-offs that need to be considered, focusing on how individual features address the needs of specific applications. We also review key software and the interplay between experimental design and efficacy of assembly. Paired-end data Previously the province of large genome centres, DNA between DNA sequences is then used to ‘stitch’ together Data from a pair of reads sequencing is now a key component of research carried the individual fragments into larger contiguous sequenced from ends of out in many laboratories. A number of new applica- sequences (contigs), thereby recovering the informa- the same DNA fragment. The tions of sequencing technologies have been spurred tion lost during the sequencing process. The assembly genomic distance between the reads is approximately by rapidly decreasing costs, including the study of process is complicated by the fact that, in many cases, known and is used to constrain microbial communities (metagenomics), the discovery this underlying assumption is incorrect. For example, assembly solutions. See also of structural variants in genomes and the analysis of genomic repeats — segments of DNA repeated in an ‘mate-pair read’. gene structure and expression. -
DNA Sequencing and Sorting: Identifying Genetic Variations
BioMath DNA Sequencing and Sorting: Identifying Genetic Variations Student Edition Funded by the National Science Foundation, Proposal No. ESI-06-28091 This material was prepared with the support of the National Science Foundation. However, any opinions, findings, conclusions, and/or recommendations herein are those of the authors and do not necessarily reflect the views of the NSF. At the time of publishing, all included URLs were checked and active. We make every effort to make sure all links stay active, but we cannot make any guaranties that they will remain so. If you find a URL that is inactive, please inform us at [email protected]. DIMACS Published by COMAP, Inc. in conjunction with DIMACS, Rutgers University. ©2015 COMAP, Inc. Printed in the U.S.A. COMAP, Inc. 175 Middlesex Turnpike, Suite 3B Bedford, MA 01730 www.comap.com ISBN: 1 933223 71 5 Front Cover Photograph: EPA GULF BREEZE LABORATORY, PATHO-BIOLOGY LAB. LINDA SHARP ASSISTANT This work is in the public domain in the United States because it is a work prepared by an officer or employee of the United States Government as part of that person’s official duties. DNA Sequencing and Sorting: Identifying Genetic Variations Overview Each of the cells in your body contains a copy of your genetic inheritance, your DNA which has been passed down to you, one half from your biological mother and one half from your biological father. This DNA determines physical features, like eye color and hair color, and can determine susceptibility to medical conditions like hypertension, heart disease, diabetes, and cancer. -
GALA, a Database for Genomic Sequence Alignments and Annotations
Resources GALA, a Database for Genomic Sequence Alignments and Annotations Belinda Giardine,1 Laura Elnitski,1,2 Cathy Riemer,1 Izabela Makalowska,4 Scott Schwartz,1 Webb Miller,1,3,4 and Ross C. Hardison2,4,5 Departments of 1Computer Science and Engineering, 2Biochemistry and Molecular Biology, 3Biology, and 4Huck Institute for Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA We have developed a relational database to contain whole genome sequence alignments between human and mouse with extensive annotations of the human sequence. Complex queries are supported on recorded features, both directlyand on proximityamong them. Searches can reveal a wide varietyof relationships, such as finding all genes expressed in a designated tissue that have a highlyconserved non coding sequence !Ј to the start site. Other examples are finding single nucleotide polymorphisms that occur in conserved noncoding regions upstream of genes and identifying CpG islands that overlap the !Ј ends of divergentlytranscribed genes. The database is available online at http://globin.cse.psu.edu/ and http://bio.cse.psu.edu/. The determination and annotation of complete genomic combined with sequence conservation can refine predictions DNA sequences provide the opportunity for unprecedented of functional sequences (Levy et al. 2001). One way to do this advances in our understanding of evolution, genetics, and is to record both extensive annotations and sequence align- physiology, but the amount and diversity of data pose daunt- ments in a database. ing challenges as well. Three excellent browsers provide access We have developed a database of genomic DNA se- to the sequence and annotations of the human genome, viz., quence alignments and annotations, called GALA, to search the human genome browser !"#$ at %&'& (Kent et al. -
ANSWER KEY Sybsc. Life Sciences- SEM
ANSWER KEY S.Y.B.Sc. Life Sciences- SEM III - Paper III Q.P.Code: 79543 Exam Date: 2nd November 2018 Marks : 100 Q. 1 Do as Directed: (20mks) .Q. 1. A) Define / Explain the following terms: (07) 1. TCP/IP- It is commonly known as TCP/IP because the foundational protocols in the suite are the Transmission Control Protocol (TCP) and the Internet Protocol (IP). It is a set of networking protocols that allows two or more computers to communicate. 2. WWW- The World Wide Web (WWW), also called the Web, is an information space where documents and other web resourcesare identified by Uniform Resource Locators (URLs), interlinked by hypertext links, and accessible via the Internet. Web pages are primarily text documents formatted and annotated with Hypertext Markup Language (HTML). In addition to formatted text, web pages may contain images, video, audio, and software components that are rendered in the user's web browser as coherent pages of multimedia content. 3. Proteomics- Proteomics is the large-scale study of proteomes. A proteome is a set of proteins produced in an organism, system, or biological context. 4. Human Genome Project-The Genome Project (HGP) was an international scientific research project with the goal of determining the sequence of nucleotide base pairs that make up human DNA, and of identifying and mapping all of the genes of the human genome from both a physical and a functional standpoint. 5. Forward reading frame-An open reading frame starts with an atg (Met) in most species and ends with a stop codon (taa, tag or tga). -
Nuclear and Mitochondrial DNA Sequences from Two Denisovan Individuals
Nuclear and mitochondrial DNA sequences from two Denisovan individuals Susanna Sawyera,1, Gabriel Renauda,1, Bence Violab,c,d, Jean-Jacques Hublinc, Marie-Theres Gansaugea, Michael V. Shunkovd,e, Anatoly P. Dereviankod,f, Kay Prüfera, Janet Kelsoa, and Svante Pääboa,2 aDepartment of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, D-04103 Leipzig, Germany; bDepartment of Anthropology, University of Toronto, Toronto, ON M5S 2S2, Canada; cDepartment of Human Evolution, Max Planck Institute for Evolutionary Anthropology, D-04103 Leipzig, Germany; dInstitute of Archaeology and Ethnography, Russian Academy of Sciences, Novosibirsk, RU-630090, Russia; eNovosibirsk National Research State University, Novosibirsk, RU-630090, Russia; and fAltai State University, Barnaul, RU-656049, Russia Contributed by Svante Pääbo, October 13, 2015 (sent for review April 16, 2015; reviewed by Hendrik N. Poinar, Fred H. Smith, and Chris B. Stringer) Denisovans, a sister group of Neandertals, have been described on DNA to the ancestors of present-day populations across Asia the basis of a nuclear genome sequence from a finger phalanx and Oceania suggests that in addition to the Altai Mountains, (Denisova 3) found in Denisova Cave in the Altai Mountains. The they may have lived in other parts of Asia. In addition to the only other Denisovan specimen described to date is a molar (Deni- finger phalanx, a molar (Denisova 4) was found in the cave in sova 4) found at the same site. This tooth carries a mtDNA se- 2000. Although less than 0.2% of the DNA in the tooth derives quence similar to that of Denisova 3. Here we present nuclear from a hominin source, the mtDNA was sequenced and differed DNA sequences from Denisova 4 and a morphological description, from the finger phalanx mtDNA at only two positions, suggesting as well as mitochondrial and nuclear DNA sequence data, from it too may be from a Denisovan (2, 3). -
The Bio Revolution: Innovations Transforming and Our Societies, Economies, Lives
The Bio Revolution: Innovations transforming economies, societies, and our lives economies, societies, our and transforming Innovations Revolution: Bio The Executive summary The Bio Revolution Innovations transforming economies, societies, and our lives May 2020 McKinsey Global Institute Since its founding in 1990, the McKinsey Global Institute (MGI) has sought to develop a deeper understanding of the evolving global economy. As the business and economics research arm of McKinsey & Company, MGI aims to help leaders in the commercial, public, and social sectors understand trends and forces shaping the global economy. MGI research combines the disciplines of economics and management, employing the analytical tools of economics with the insights of business leaders. Our “micro-to-macro” methodology examines microeconomic industry trends to better understand the broad macroeconomic forces affecting business strategy and public policy. MGI’s in-depth reports have covered more than 20 countries and 30 industries. Current research focuses on six themes: productivity and growth, natural resources, labor markets, the evolution of global financial markets, the economic impact of technology and innovation, and urbanization. Recent reports have assessed the digital economy, the impact of AI and automation on employment, physical climate risk, income inequal ity, the productivity puzzle, the economic benefits of tackling gender inequality, a new era of global competition, Chinese innovation, and digital and financial globalization. MGI is led by three McKinsey & Company senior partners: co-chairs James Manyika and Sven Smit, and director Jonathan Woetzel. Michael Chui, Susan Lund, Anu Madgavkar, Jan Mischke, Sree Ramaswamy, Jaana Remes, Jeongmin Seong, and Tilman Tacke are MGI partners, and Mekala Krishnan is an MGI senior fellow. -
Genomic Sequencing of Infectious Pathogens
Science, Technology Assessment, MARCH 2021 and Analytics WHY THIS MATTERS Genomic sequencing reveals the genetic code of an infectious disease pathogen, such as SARS-CoV-2, the SCIENCE & TECH SPOTLIGHT: virus that causes COVID-19. Newer, faster, and less costly sequencing can now be used to more quickly track GENOMIC SEQUENCING OF transmission, detect new variants, and develop vaccines and other countermeasures. However, challenges such INFECTIOUS PATHOGENS as high startup costs and privacy concerns remain. /// THE TECHNOLOGY How mature is it? First-generation sequencing is often used to confirm results of NGS. NGS is relatively new to the public health field, but is used What is it? Genomic sequencing technologies decode a pathogen’s to augment surveillance (i.e., data collection and analysis) for SARS- genetic material by identifying the order of chemical “letters” of its DNA CoV-2 in the U.S. and overseas. New technologies are allowing greater (or RNA, its chemical equivalent in some viruses). Each of four letters access to sequencing capabilities by making NGS portable, faster, and represents a chemical unit called a base. The sequence of the bases can more affordable (the cost of one sequence run is now one-millionth of reveal useful information for combatting disease. For example, sequencing what it was two decades ago). Genomic sequencing technologies enable of SARS-CoV-2 is used to track the spread of various strains, known as many different areas of infectious disease study. For example, they enable variants (see fig. 1). genomic epidemiology—the science of using pathogen genomic data to determine the distribution and spread of an infectious disease in a group How does it work? Technologies for sequencing genomes of pathogens of people or animals, and the application of this information to respond to use chemicals to break up the DNA or RNA into small fragments. -
The Economic Impact and Functional Applications of Human Genetics and Genomics
The Economic Impact and Functional Applications of Human Genetics and Genomics Commissioned by the American Society of Human Genetics Produced by TEConomy Partners, LLC. Report Authors: Simon Tripp and Martin Grueber May 2021 TEConomy Partners, LLC (TEConomy) endeavors at all times to produce work of the highest quality, consistent with our contract commitments. However, because of the research and/or experimental nature of this work, the client undertakes the sole responsibility for the consequence of any use or misuse of, or inability to use, any information or result obtained from TEConomy, and TEConomy, its partners, or employees have no legal liability for the accuracy, adequacy, or efficacy thereof. Acknowledgements ASHG and the project authors wish to thank the following organizations for their generous support of this study. Invitae Corporation, San Francisco, CA Regeneron Pharmaceuticals, Inc., Tarrytown, NY The project authors express their sincere appreciation to the following indi- viduals who provided their advice and input to this project. ASHG Government and Public Advocacy Committee Lynn B. Jorde, PhD ASHG Government and Public Advocacy Committee (GPAC) Chair, President (2011) Professor and Chair of Human Genetics George and Dolores Eccles Institute of Human Genetics University of Utah School of Medicine Katrina Goddard, PhD ASHG GPAC Incoming Chair, Board of Directors (2018-2020) Distinguished Investigator, Associate Director, Science Programs Kaiser Permanente Northwest Melinda Aldrich, PhD, MPH Associate Professor, Department of Medicine, Division of Genetic Medicine Vanderbilt University Medical Center Wendy Chung, MD, PhD Professor of Pediatrics in Medicine and Director, Clinical Cancer Genetics Columbia University Mira Irons, MD Chief Health and Science Officer American Medical Association Peng Jin, PhD Professor and Chair, Department of Human Genetics Emory University Allison McCague, PhD Science Policy Analyst, Policy and Program Analysis Branch National Human Genome Research Institute Rebecca Meyer-Schuman, MS Human Genetics Ph.D. -
DNA Sequencing
Contig Assembly ATCGATGCGTAGCAGACTACCGTTACGATGCCTT… TAGCTACGCATCGTCTGATGGCAATGCTACGGAA.. C T AG AGCAGA TAGCTACGCATCGT GT CTACCG GC TT AT CG GTTACGATGCCTT AT David Wishart, Ath 3-41 [email protected] DNA Sequencing 1 Principles of DNA Sequencing Primer DNA fragment Amp PBR322 Tet Ori Denature with Klenow + ddNTP heat to produce + dNTP + primers ssDNA The Secret to Sanger Sequencing 2 Principles of DNA Sequencing 5’ G C A T G C 3’ Template 5’ Primer dATP dATP dATP dATP dCTP dCTP dCTP dCTP dGTP dGTP dGTP dGTP dTTP dTTP dTTP dTTP ddCTP ddATP ddTTP ddGTP GddC GCddA GCAddT ddG GCATGddC GCATddG Principles of DNA Sequencing G T _ _ short C A G C A T G C + + long 3 Capillary Electrophoresis Separation by Electro-osmotic Flow Multiplexed CE with Fluorescent detection ABI 3700 96x700 bases 4 High Throughput DNA Sequencing Large Scale Sequencing • Goal is to determine the nucleic acid sequence of molecules ranging in size from a few hundred bp to >109 bp • The methodology requires an extensive computational analysis of raw data to yield the final sequence result 5 Shotgun Sequencing • High throughput sequencing method that employs automated sequencing of random DNA fragments • Automated DNA sequencing yields sequences of 500 to 1000 bp in length • To determine longer sequences you obtain fragmentary sequences and then join them together by overlapping • Overlapping is an alignment problem, but different from those we have discussed up to now Shotgun Sequencing Isolate ShearDNA Clone into Chromosome into Fragments Seq. Vectors Sequence 6 Shotgun Sequencing Sequence Send to Computer Assembled Chromatogram Sequence Analogy • You have 10 copies of a movie • The film has been cut into short pieces with about 240 frames per piece (10 seconds of film), at random • Reconstruct the film 7 Multi-alignment & Contig Assembly ATCGATGCGTAGCAGACTACCGTTACGATGCCTT… TAGCTACGCATCGTCTGATGGCAATGCTACGGAA. -
Hammerhead Ribozymes Against Virus and Viroid Rnas
Hammerhead Ribozymes Against Virus and Viroid RNAs Alberto Carbonell, Ricardo Flores, and Selma Gago Contents 1 A Historical Overview: Hammerhead Ribozymes in Their Natural Context ................................................................... 412 2 Manipulating Cis-Acting Hammerheads to Act in Trans ................................. 414 3 A Critical Issue: Colocalization of Ribozyme and Substrate . .. .. ... .. .. .. .. .. ... .. .. .. .. 416 4 An Unanticipated Participant: Interactions Between Peripheral Loops of Natural Hammerheads Greatly Increase Their Self-Cleavage Activity ........................... 417 5 A New Generation of Trans-Acting Hammerheads Operating In Vitro and In Vivo at Physiological Concentrations of Magnesium . ...... 419 6 Trans-Cleavage In Vitro of Short RNA Substrates by Discontinuous and Extended Hammerheads ........................................... 420 7 Trans-Cleavage In Vitro of a Highly Structured RNA by Discontinuous and Extended Hammerheads ........................................... 421 8 Trans-Cleavage In Vivo of a Viroid RNA by an Extended PLMVd-Derived Hammerhead ........................................... 422 9 Concluding Remarks and Outlooks ........................................................ 424 References ....................................................................................... 425 Abstract The hammerhead ribozyme, a small catalytic motif that promotes self- cleavage of the RNAs in which it is found naturally embedded, can be manipulated to recognize and cleave specifically