Genome Viewing

Total Page:16

File Type:pdf, Size:1020Kb

Load more

Exploring Genomes with Ensembl Prague June, 2014 Dr. Giulietta M. Spudich Ensembl EBI is an Outstation of the European Molecular Biology Laboratory. EMBL-EBI This Course Monday EBI Introduction (Giulietta) Ensembl (Giulietta) ArrayExpress (Karyn) Tuesday Phylogenies (Laura) Wednesday Proteomics (Sandra) www.ebi.ac.uk/training/Prague2014 EBI is an Outstation of the European Molecular Biology Laboratory. This Morning Ensembl Introduction Ensembl Browser Walkthrough Hands-on Exercises 11 AM Coffee BioMart 1 PM Lunch EBI is an Outstation of the European Molecular Biology Laboratory. Beginnings … 1995: 1st free-living organism: bacterium Haemophilus influenzae (1.8 million bp) 2001: First draft of the human sequence (3 gb) 2004: ‘Finished’ human sequence 2014: Polished human sequence with haplotypes (GRCh38) Today’s genomics - human 1000 Genomes Project ENCODE COURTESY OF NIH THOMAS POROSTOCKY; SOURCE: MEETINGZONE Today’s genomics – other species 6 of 24 Ensembl – Access to … 7 of 24 Ensembl Genomes - Expanding Species Bacteria, Protists, Plants, Fungi, (non-vertebrate) Metazoa 8 of 24 Ensembl Genomes – Examples http://ensemblgenomes.org Metazoa Protists Ixodes scapularis Leishmania major Schistosoma mansoni Plasmodium falciparum Anophelese gambiae Plants Arabidopsis thaliana Triticum aestivum Bacteria (10,000+) Fungi Acetobacter tropicalis Saccharomyces cerevisiae Escherichia coli Schizosaccharomyces_pombe 9 of 24 Raw sequence CGCCGGGAGAAGCGTGAGGGGACAGATTTGTGACCGGCGCGGTTTTTGTCAGCTTACTC CGGCCAAAAAAGAACTGCACCTCTGGAGCGGACTTATTTACCAAGCATTGGAGGAATATCG TAGGTAAAAATGCCTATTGGATCCAAAGAGAGGCCAACATTGCACTGCTGCGCCTCTGCTG CGCCTCGGGTGTCTTTTGCGGCGGTGGGTCGCCGCCGGGAGAAGCGTGAGGGGACAGA TTTGTGACCGGCGCGGTTTTTGTCAGCTTACTCCGGCCAAAAAAGAACTGCACCTCTGGA GCGGACTTATTTACCAAGCATTGGAGGAATATCGTAGGTAAAAATGCCTATTGGATCCAAAG AGAGGCCAACATTTTTTGAAATTTTTAAGACACGCTGCAACAAAGCAGATTTAGGACCAATA AGTCTTAATTGGTTTGAAGAACTTTCTTCAGAAGCTCCACCCTATAATTCTGAACCTGCAG ACTAAAATGGATCAAGCAGATGATGTTTCCTGTCCACTTCTAAATTCTTGTCTTAGAAGAATC TGAACATAAAAACAACAATTACGAACCAAACCTATTTAAAACTCCACAAAGGAAACCATCTTA TAATCAGCTGGCTTCAACTCCAATAATATTCAAAGAGCAAGGGCTGACTCTGCCGCTGTAC CAATCTCCTGTAAAAGAATTAGATAAATTCAAATTAGACTTAGGAAGGAATGTTCCCAATAGT AGACTAAAAGTCTTCGCACAGTGAAAT CGCCGGGAGAAGCGTGAGGGGACAGATTTGTGACCGGCGCGGTTTTTGTCAGCTTACTC CGGCCAAAAAAGAACTGCACCTCTGGAGCGGACTTATTTACCAAGCATTGGAGGAATATCG TAGGTAAAAATGCCTATTGGATCCAAAGAGAGGCCAACATTACTAAAATGGATCAAGCAGAT GATGTTTCCTGTCCACTTCTAAATTCTTGTCTTAG AATTGGTTTGAAGAACTTTCTTCAGAAGCTCCACCCTATAATTCTGAACCTGCAGTGAAAGT CCTGTTGTTCTACAATGTACACATGTAACACCACAAAGAGATAAGTCA Ensembl – unlocking the code Regulation Conserved sequence Gene Allele • Splice variants, proteins, non-coding RNA • Small and large scale sequence variation, phenotype associations • Whole genome alignments, protein trees • Potential promoters and enhancers, DNA methylation • User upload, custom data Figure adapted from the ENCODE project www.nature.com/nature/focus/encode/ 29 May 2014 11 This talk … Genome Sequencing and Browsers Ensembl Data Genes Variation Comparative Genomics Regulation Access Challenge: number of gene/protein sequences increases • UniProtKB/Swiss-Prot (e.g.Q8IU82) 542,258 • UniProtKB/TrEMBL 51,616,950 • NCBI RefSeq (e.g. NP_006570) 37,371,278 13 of 24 Is there a consensus? • Reaching a consensus coding sequence set for human and mouse. • Human 29,045 CCDS IDs -18,683 Ensembl Gene IDs (e74) • Mouse 23,093 CCDS IDs- 19,988 Ensembl GeneIDs (e72) The GENCODE set www.gencodegenes.org • Human and mouse • Not just the cds (coding sequence) • Ensembl contributes genes to GENCODE • Havana contributes genes to GENCODE • GENCODE is used by ENCODE, 1000 Genomes, and other projects. 15 of 24 This talk … Genome Sequencing and Browsers Ensembl Data Genes Variation Comparative Genomics Regulation Access Ensembl Variation Aims: • Collect, integrate and annotate all known variants • Provide tools for comparison to other genomic data • Provide a framework for access and to improve understanding Practical applications of variation Molecular and clinical medicine • Diagnosis, detection and treatment: – e.g. myotonic dystrophy, fragile X syndrome, inherited colon cancer, familial breast cancer • Pharmacogenomics "custom drugs" DNA forensics • Identification of suspects • catastrophe victims • endangered species Agriculture, livestock breeding • Disease-, insect-, and drought-resistant crops • Healthier, disease-resistant animals • Marker-assisted breeding • More nutritious produce • Reducing the costs of agriculture Anthropology, evolution, and human migration Variation Sources . dbSNP (1000 Genomes, ClinVar, etc) . ESP (Exome Sequencing Project) . UniProt . COSMIC . HGMD_Public . NHGRI-GWAS . & more … www.ensembl.org/info/genome/variation/sources_documentation Variation in the Browser Ensembl Variant Effect Predictor . Uses an Ensembl gene set to annotate: . SNPs . Indels New Interface! . Variants in regulatory regions . Structural variants REST API Web interface Perl script XM L . Publication: McLaren et al. 2010 (Bioinformatics) o v a p s o l o l s h a u c l g l n a _ t a s y i a g h t r r _ t g y s u s a t M g u s u a l e i _ S l l l o n a s M p i a a e i n n t _ r g a e o G M s a c y n d c a i o D n p l s r e n i C p o a o o a i s l r _ A h h s p p n a n o s i e y u h e c l u n p l u _ o i s a i s s P u h T s s e _ s i i r _ l _ l s e c o s _ u L p d o s a E _ h n c o c u c i s u n u i i x a c n o t d a y c i g A p lo r o v h o s m s i i r e h i o v d i i v _ e r n o r a o a n r d t _ r h e n s e i n i ig _ _ o o n o s l n s o s s t c m i m m _ e p i i p a a f t h e u u o n i s i l r u t r o t s c f i P p b a _ p i h d u l _ m c a r u a e i o h c o _ ac f t n a n n c o ra u m g _ r n e a r e _ e t u s E ic s l c r Te if ru f t n X ia O k o r a i O a h s in s a u r T op tipe S n n e h _la a o i s ip as c a r i X yzi e re i im Or P u x t uleatus te s _ a steus_ac ro _e a L Gastero p u ra us ro n M _ p e yo va a u Gadus_m orhua C tis m eu s Must ani _lu py s ela_p s_fa cif ru Da utor m i ug s nio_rerio Ailur ius_fu liar us opoda_m e ro is lanoleuca Felis_catus Petro caballus m yzo Equus_ n_m a rinus crofa Sus_s s uru us os Cio _ta cat c C na_ Bos un pa io sav _tr a_ na ign ps gn _in yi sio cu tes Tur Vi tin s s al ep lu s is c u u i n t r ri ic a Ca D p n e e e ro _ u g n s Ensembla c in s o o l n _ n p o s lu rh h t m l a T il u l i a a o a e e i b h g i s _ c c e t d m c a i r l e r t u S i e C s t O o d o d b a is la e t r n i _ i n a c _ i c r p o u e o a n c y t _ r g l l _ i r r _ s h e a l a s i s u i a g s O a _ a y v y s t r a te s p s h N o n r u g a m M m s m u y m c u _ r o o C i l _ i y r o T r a P d u s x m c i g i d u o t c c e _ c u • o e a G s c n h a I s j v b _ p m a s H c g o c i r u t e P o e e c c o a a o r D l a m r c i c u e n m _ _ l o l n o v _ h s a a o _ t m r _ i s _ b _ _ s s u t s i O c l u u g r a u i e a s e o t o p e l u l g t M a i M i r i a c i l e Whole Genome Alignmentst l o • n l R t o a d s a g y t e e n s y s Image obtained using Dendroscope (D.H. Huson and C Scornavacca, • Gene Trees Dendroscope 3: An interactive tool for rooted phylogenetic trees and networks, Syst ematic Biology, 2012 ) • Homologues Protein Families Comparative Genomics Whole genome alignments Homo sapiens ACGT-GG--CCAGCGCGGGCTTGTGGCGCGAGCTTCTGAAAC-TAGGCG-GCAGAGGCGGAGC--CGCTG-TGGC---ACTGCTGCGCCTCTG-CTGCGCCTCGGGTGTCTTTTGCGGCG Ancestral sequences ACGT-GG--CCAGCGCGGGCTTGTGGCGCGAGCTTCTGAAAC-TAGGCG-GCAGAGGCGGAGC--CGCTG-TGGC---TCTGCTGCGCCTCTG-CTGCGCCTCGGGTGTCTTTTGCGGCG Pan troglodytes ACGT-GG--CCAGCGCGGGCTTGTGGCGCGAGCTTCTGAAAC-TAGGCG-GCAGAAGCGGAGC--CGCTG-TGGC---TCTGCTGCGCCACTG-CTGCGCCTCGGGTGTCTTTTGCGGCG Ancestral sequences ACGT-GG--CCAGCGCGGGCTTGTGGCGCGAGCTTCTGAAAC-TAGGCG-GCAGAGGCGGAGC--CGCTG-TGGC---TCTGCTGCGCCTCTG-CTGCGCCTCGGGTCTCTTTTGCGGCG Gorilla gorilla gorilla ........................................................................................................................ Great apes Ancestral sequences ........................................................................................................................ Old world Pongo abelii ACGT-GG--CCAGCGCGGGCTTGTGGCGCGAGCTTCTGAAAC-TAGGCG-GCAGAGGCGGAGC--CGCTG-TGGC---TGTGCTGCACCTGTG-CTGCGCCTCGGGTCTCTTTTGCGGCG monkeys Ancestral sequences ACGT-GG--CCAGCGCGGGCTTGTGGCGCGAGCTTCTGAAAT-TAGGCG-GCAGAGGCGGAGC--TGCTG-TGGC--------------TCTG-CTGCGCCTCGGGTCTCTTTTGCGGCG Macaca mulatta ACGT-GG--CCAGCGCGGGCTTGTGGCGCGAGCTTCTGAAAT-CAGGCG-GCAGAGGTGGAAC--TGCTGCTGGC--------------TCTG-CTGCGCCTCGGGTCTCTTTTGCGGCG Primates Ancestral sequences ACGT-GG--CCAGCGCGGGCTTGTGGCGCGAGCGTCTGAAAT-GAGGCG-GCAGAGGCGGAGC--TGCTG-TGGC--------------TCTG-CCGCGCCTCGGGTCTTTTCTGCGGCG Callithrix jacchus ACGT-GG--TCAGCGCGGGCTTGTGGCGCGAGCGTCTGAAAT-GAGGCG-GCAGAGGCGGACC--TGCTG-TGTC--------------TCTG-CCGCGCCTCCGGTCTTTTCTGCGACG Ancestral sequences ACGT-GC--CGAGAGCGGGCTTTTGGCGCGAGCGTCTGAAAT-AAGGCG-GCGGAGGCGGAGC--TGCTG-CGGCT------------------CCGCGTCTCGGGTCTTTTCTGCGGCA Mus musculus ACGG-GC--AGAGCGCGGGCTTTTCGCGGGAGCGGGAGCCGT-G----------AGGCGTTGCCGTCAGT-CAGCT-----------------ACCGCTGC------------------- Rodents Ancestral sequences ACGG-GC--AGAGCGCGGGCTTTTCGCGGGAGCGTGAGAAGT-G----------AGGCGGTGCCGTCCGT-CAGCT-----------------ACCGCAAC------------------- Rattus norvegicus ACGGCGC--AGAGCGCGGGCTTTTCGCAGGAGCGTGAGAAGT-G----------AGGCGGCGCCGTCCGT-CAGCG-----------------GCCGCAAC------------------- Glires Ancestral sequences ACGT-GC--CGAGAGCGGGCTTTTGGCGCGAGCGTCTGAAAT-AAGGCG-GCGGAGGCGGAGC--TCCTT-CAGCT------------------CCGCGTCTCGGGTCTTTTCTGCGGCA Boroeutherian Oryctolagus cuniculus ACGT-GC--CCAGAGCGGGCTTTTGGCGCGAGCGTCTGAAAA-AAGGCT-ATGGAGGCGGAGC--TCCTT-CAGCT------------------CCGCGTCTGGGGTCTTGCCTAGGGCA Ancestral sequences ACGT-GC--CGAGAGCGGGCTTTTGGCGCGAGCGTCTGAAAT-AAGGCG-GCGGAGGCGGAGC--TGCTG-CGGCT------------------CCGCGTCTCGGGTCTTTTCTGCGGCA Bos taurus ACAT-ATCCCGAGAGCAGGCTTTTGGCGCGAGAATCTGAAAC-CCGGTGGGCGGAGGTGCGGC--TGCTG-AAGTTTG----------------C--TGTCTCGGGCGG-T--------- Laurasiatheria Ancestral sequences ACGT-GCTCCGAGAGCGGGCTTTTGGCGCGAGCGTCTGAAAT-AAGGCGAGCGGAGGCGGAGC--TGCTG-GGGCTCC----------------C--TGTCTCGGGTGG-TTCTGTGGCA Canis lupus familiaris .......................................................................................................................
Recommended publications
  • Genome Signature-Based Dissection of Human Gut Metagenomes to Extract Subliminal Viral Sequences

    Genome Signature-Based Dissection of Human Gut Metagenomes to Extract Subliminal Viral Sequences

    ARTICLE Received 16 Apr 2013 | Accepted 8 Aug 2013 | Published 16 Sep 2013 DOI: 10.1038/ncomms3420 OPEN Genome signature-based dissection of human gut metagenomes to extract subliminal viral sequences Lesley A. Ogilvie1, Lucas D. Bowler1, Jonathan Caplin2, Cinzia Dedi1, David Diston2,w, Elizabeth Cheek3, Huw Taylor2, James E. Ebdon2 & Brian V. Jones1 Bacterial viruses (bacteriophages) have a key role in shaping the development and functional outputs of host microbiomes. Although metagenomic approaches have greatly expanded our understanding of the prokaryotic virosphere, additional tools are required for the phage- oriented dissection of metagenomic data sets, and host-range affiliation of recovered sequences. Here we demonstrate the application of a genome signature-based approach to interrogate conventional whole-community metagenomes and access subliminal, phylogen- etically targeted, phage sequences present within. We describe a portion of the biological dark matter extant in the human gut virome, and bring to light a population of potentially gut- specific Bacteroidales-like phage, poorly represented in existing virus like particle-derived viral metagenomes. These predominantly temperate phage were shown to encode functions of direct relevance to human health in the form of antibiotic resistance genes, and provided evidence for the existence of putative ‘viral-enterotypes’ among this fraction of the human gut virome. 1 Centre for Biomedical and Health Science Research, School of Pharmacy and Biomolecular Sciences, University of Brighton, Brighton BN2 4GJ, UK. 2 School of Environment and Technology, University of Brighton, Brighton BN2 4GJ, UK. 3 School of Computing, Engineering and Mathematics, University of Brighton, Brighton BN2 4GJ, UK. w Present address: Mikrobiologische and Biotechnologische Risiken Bundesamt fu¨r Gesundheit BAG, 3003 Bern, Switzerland.
  • The 4D Nucleome Project

    The 4D Nucleome Project

    The 4D nucleome project The MIT Faculty has made this article openly available. Please share how this access benefits you. Your story matters. Citation Dekker, Job et al. “The 4D Nucleome Project.” Nature 549, no. 7671 (September 2017): 219–226 © 2017 Macmillan Publishers Limited, part of Springer Nature As Published http://dx.doi.org/10.1038/NATURE23884 Publisher Nature Publishing Group Version Author's final manuscript Citable link http://hdl.handle.net/1721.1/114838 Terms of Use Creative Commons Attribution-Noncommercial-Share Alike Detailed Terms http://creativecommons.org/licenses/by-nc-sa/4.0/ HHS Public Access Author manuscript Author ManuscriptAuthor Manuscript Author Nature. Manuscript Author Author manuscript; Manuscript Author available in PMC 2017 September 27. Published in final edited form as: Nature. 2017 September 13; 549(7671): 219–226. doi:10.1038/nature23884. The 4D Nucleome Project Job Dekker1, Andrew S. Belmont2, Mitchell Guttman3, Victor O. Leshyk4, John T. Lis5, Stavros Lomvardas6, Leonid A. Mirny7, Clodagh C. O’Shea8, Peter J. Park9, Bing Ren10, Joan C. Ritland Politz11, Jay Shendure12, Sheng Zhong4, and the 4D Nucleome Network13 1Program in Systems Biology, Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Howard Hughes Medical Institute, Worcester, MA 01605 2Department of Cell and Developmental Biology, University of Illinois, Urbana-Champaign, IL 61801 3Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125 4Department
  • The Contribution of Mitochondrial Metagenomics to Large-Scale Data

    The Contribution of Mitochondrial Metagenomics to Large-Scale Data

    Molecular Phylogenetics and Evolution 128 (2018) 1–11 Contents lists available at ScienceDirect Molecular Phylogenetics and Evolution journal homepage: www.elsevier.com/locate/ympev The contribution of mitochondrial metagenomics to large-scale data mining and phylogenetic analysis of Coleoptera T ⁎ Benjamin Linarda,i,j, , Alex Crampton-Platta,b,k, Jerome Morinierec, Martijn J.T.N. Timmermansa,d, Carmelo Andújara,b,l, Paula Arribasa,b,l, Kirsten E. Millera,b,m, Julia Lipeckia,n, Emeline Favreaua,o, Amie Huntera, Carola Gómez-Rodrígueze, Christopher Bartona,p, Ruie Niea,f, Conrad P.D.T. Gilletta,q, Thijmen Breeschotena,g,r, Ladislav Bocakh, Alfried P. Voglera,b a Department of Life Sciences, Natural History Museum, London SW7 5BD, United Kingdom b Department of Life Sciences, Silwood Park Campus, Imperial College London, Ascot SL5 7PY, United Kingdom c Bavarian State Collection of Zoology (SNSB – ZSM), Münchhausenstrasse 21, 81247 München, Germany d Department of Natural Sciences, Hendon Campus, Middlesex University, London NW4 4BT, United Kingdom e Departamento de Zoología, Facultad de Biología, Universidad de Santiago de Compostela, c/ Lope Gómez de Marzoa s/n, 15782 Santiago de Compostela, Spain f Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China g Naturalis Biodiversity Center, Darwinweg 2, 2333 CR Leiden, The Netherlands h Department of Zoology, Faculty of Science, Palacky University, 17. listopadu 50, 771 46 Olomouc, Czech Republic i LIRMM, Univ. Montpellier,
  • The International Human Epigenome Consortium (IHEC): a Blueprint for Scientific Collaboration and Discovery

    The International Human Epigenome Consortium (IHEC): a Blueprint for Scientific Collaboration and Discovery

    The International Human Epigenome Consortium (IHEC): A Blueprint for Scientific Collaboration and Discovery Hendrik G. Stunnenberg1#, Martin Hirst2,3,# 1Department of Molecular Biology, Faculties of Science and Medicine, Radboud University, Nijmegen, The Netherlands 2Department of Microbiology and Immunology, Michael Smith Laboratories, University of British Columbia, Vancouver, BC, Canada V6T 1Z4. 3Canada’s Michael Smith Genome Science Center, BC Cancer Agency, Vancouver, BC, Canada V5Z 4S6 #Corresponding authors [email protected] [email protected] Abstract The International Human Epigenome Consortium (IHEC) coordinates the generation of a catalogue of high-resolution reference epigenomes of major primary human cell types. The studies now presented (cell.com/XXXXXXX) highlight the coordinated achievements of IHEC teams to gather and interpret comprehensive epigenomic data sets to gain insights in the epigenetic control of cell states relevant for human health and disease. One of the great mysteries in developmental biology is how the same genome can be read by cellular machinery to generate the plethora of different cell types required for eukaryotic life. As appreciation grew for the central roles of transcriptional and epigenetic mechanisms in specification of cellular fates and functions, researchers around the world encouraged scientific funding agencies to develop an organized and standardized effort to exploit epigenomic assays to shed additional light on this process (Beck, Olek et al. 1999, Jones and Martienssen 2005, American Association for Cancer Research Human Epigenome Task and European Union 2008). In March 2009, leading scientists and international health research funding agency representatives were invited to a meeting in Bethesda (MD, USA) to gauge the level of interest in an international epigenomics project and to identify potential areas of focus.
  • Genome-Contric Metagenomic Investigation of 134 Samples

    Genome-Contric Metagenomic Investigation of 134 Samples

    GENOME-CONTRIC METAGENOMIC INVESTIGATION OF 134 SAMPLES COLLECTED FROM BIOGAS REACTORS REVEALED IMPORTANT FUNCTIONAL ROLES FOR MICROBIAL SPECIES BELONGING TO UNDEREXPLORED TAXA Campanaro, S.1, Luo, G.2, Treu, L.3 Kougias, P.G.4, Zhu X.3, Angelidaki, I.3 1 University of Padova, Department of Biology, Via U. Bassi, 58b, 35131 Padova (PD), Italy 2 Fudan University, Department of Environmental Science and Engineering, Shanghai Key Laboratory of Atmospheric Particle Pollution and Prevention (LAP3), 200433, Shanghai, China 3 Technical University of Denmark, Department of Environmental Engineering, Kgs. Lyngby DK-2800, Denmark 4 Institute of Animal Science, Hellenic Agricultural Organization DEMETER, Paralimni, Greece e-mail of the corresponding author: [email protected] Keywords: anaerobic digestion; metagenomics; metagenome-assembled genomes; taxonomy. INTRODUCTION In the past four years many samples collected from full-scale biogas plants and lab-scale reactors have been investigated using Illumina shotgun sequencing approaches and deposited in NCBI Sequence Read Archive database. In all these reactors, the Anaerobic Digestion (AD) process is performed by a plethora of different microorganisms organized in a complex functional network where different species have distinct roles in degradation of organic matter. Difficulties associated to cultivation of many prokaryotes present in natural and engineered ecosystems strongly limits the possibility to expand our knowledge regarding their physiology, genetics and function (Tringe and Rubin, 2005). Thanks to the bioinformatics approaches recently developed, which allow the reconstruction of microbial genomes starting from shotgun DNA sequences obtained from mixed cultures (Campanaro et al., 2016), it is now becoming possible to reveal the functional roles of microbial species resisting cultivation in axenic cultures.
  • Pharmacogene Regulatory Elements: from Discovery to Applications

    Pharmacogene Regulatory Elements: from Discovery to Applications

    Smith et al. Genome Medicine 2012, 4:45 http://genomemedicine.com/content/4/5/45 REVIEW Pharmacogene regulatory elements: from discovery to applications Robin P Smith1,2†, Ernest T Lam2,3†, Svetlana Markova1, Sook Wah Yee1* and Nadav Ahituv1,2* Pharmacogenomics and gene regulatory elements: Abstract an emerging picture Regulatory elements play an important role in the Most pharmacogenomics studies to date have focused on variability of individual responses to drug treatment. coding variants of pharmacologically important proteins. This has been established through studies on three However, well-supported examples of variants in regu la- classes of elements that regulate RNA and protein tory elements of genes involved in drug response, such as abundance: promoters, enhancers and microRNAs. drug metabolizing enzymes and transporters (see review Each of these elements, and genetic variants within by Georgitsi et al. [1]), show that variants in noncoding them, are being characterized at an exponential pace regulatory sequences are also likely to be important by next-generation sequencing (NGS) technologies. In (Table 1). Th ree classes of regulatory elements have been this review, we outline examples of how each class of studied in this context: promoters, enhancers and element aff ects drug response via regulation of drug microRNAs (miRNAs). Each of these has a direct impact targets, transporters and enzymes. We also discuss on the abundance of messenger RNA (mRNA) (in the case the impact of NGS technologies such as chromatin of promoters and enhancers) and protein (in the case of immunoprecipitation sequencing (ChIP-Seq) and RNA miRNAs). Genetic variation within each of these elements sequencing (RNA-Seq), and the ramifi cations of new has been linked to human disease as well as interindividual techniques such as high-throughput chromosome diff erences in drug response.
  • Molecular Complexity of Successive Bacterial Epidemics Deconvoluted by Comparative Pathogenomics

    Molecular Complexity of Successive Bacterial Epidemics Deconvoluted by Comparative Pathogenomics

    Molecular complexity of successive bacterial epidemics deconvoluted by comparative pathogenomics Stephen B. Beresa,b, Ronan K. Carrolla,b, Patrick R. Sheaa,b, Izabela Sitkiewicza,b, Juan Carlos Martinez-Gutierreza,b, Donald E. Lowc, Allison McGeerd, Barbara M. Willeyd, Karen Greend, Gregory J. Tyrrelld, Thomas D. Goldmanf, Michael Feldgardeng, Bruce W. Birreng, Yuriy Fofanovh, John Boosi, William D. Wheatoni, Christiane Honischf, and James M. Mussera,b,1 aCenter for Molecular and Translational Human Infectious Diseases Research, The Methodist Hospital Research Institute, and bDepartment of Pathology, The Methodist Hospital, Houston, TX 77030; cOntario Agency for Health Protection and Promotion, and University of Toronto, Toronto, ON M5G 1X5, Canada; dDepartment of Microbiology, Mount Sinai Hospital, Toronto, ON M5G 1X5, Canada; eDepartment of Laboratory medicine and Pathology, University of Alberta, Edmonton, AB T6G 2J2, Canada; fSequenom, Inc., San Diego, CA 92121; gBroad Institute of Massachusetts Institute of Technology and Harvard University, Cambridge, MA 02142; hDepartment of Computer Science and Department of Biology and Biochemistry, University of Houston, Houston, TX 77204; and iRTI International, Research Triangle Park, NC 27709 Edited* by Charles R. Cantor, Sequenom Inc., San Diego, CA, and approved December 14, 2009 (received for review September 30, 2009) Understanding the fine-structure molecular architecture of bacterial century that GAS has the capacity to cause epidemics charac- epidemics has been a long-sought goal of infectious disease research. terized by rapid increase in disease frequency and severity. For We used short-read-length DNA sequencing coupled with mass example, Weech (6) described an epidemic of septic scarlet fever spectroscopy analysis of SNPs to study the molecular pathogenomics in Yunnanfu, China, that killed 50,000 people, fully 25% of the of three successive epidemics of invasive infections involving 344 population of the province.
  • View: Latest Perspectives on Antiinflammatory Actions of Glucocorticoids

    View: Latest Perspectives on Antiinflammatory Actions of Glucocorticoids

    Fleuren et al. BioData Mining 2013, 6:2 http://www.biodatamining.org/content/6/1/2 BioData Mining RESEARCH Open Access Identification of new biomarker candidates for glucocorticoid induced insulin resistance using literature mining Wilco WM Fleuren1,2, Erik JM Toonen3, Stefan Verhoeven4, Raoul Frijters1,6, Tim Hulsen1,7, Ton Rullmann5, René van Schaik4, Jacob de Vlieg1,4 and Wynand Alkema1,8* * Correspondence: [email protected] Abstract 1Computational Drug Discovery (CDD), CMBI, NCMLS, Radboud Background: Glucocorticoids are potent anti-inflammatory agents used for the University Nijmegen Medical Centre, treatment of diseases such as rheumatoid arthritis, asthma, inflammatory bowel P.O. Box 91016500 HB, Nijmegen, disease and psoriasis. Unfortunately, usage is limited because of metabolic side- The Netherlands 8Present address: NIZO Food effects, e.g. insulin resistance, glucose intolerance and diabetes. To gain more insight Research BV, Ede, The Netherlands into the mechanisms behind glucocorticoid induced insulin resistance, it is important Full list of author information is to understand which genes play a role in the development of insulin resistance and available at the end of the article which genes are affected by glucocorticoids. Medline abstracts contain many studies about insulin resistance and the molecular effects of glucocorticoids and thus are a good resource to study these effects. Results: We developed CoPubGene a method to automatically identify gene-disease associations in Medline abstracts. We used this method to create a literature network of genes related to insulin resistance and to evaluate the importance of the genes in this network for glucocorticoid induced metabolic side effects and anti-inflammatory processes.
  • Integrative Functional Genomics Identifies an Enhancer Looping to the SOX9 Gene Disrupted by the 17Q24.3 Prostate Cancer Risk Locus

    Integrative Functional Genomics Identifies an Enhancer Looping to the SOX9 Gene Disrupted by the 17Q24.3 Prostate Cancer Risk Locus

    Downloaded from genome.cshlp.org on October 1, 2021 - Published by Cold Spring Harbor Laboratory Press Research Integrative functional genomics identifies an enhancer looping to the SOX9 gene disrupted by the 17q24.3 prostate cancer risk locus Xiaoyang Zhang,1,4 Richard Cowper-SalÁlari,1,2,4 Swneke D. Bailey,3 Jason H. Moore,1,2 and Mathieu Lupien1,3,5 1Department of Genetics and Institute for Quantitative Biomedical Sciences, Norris Cotton Cancer Center, Dartmouth Medical School, Lebanon, New Hampshire 03756, USA; 2Computational Genetics Laboratory and Department of Genetics, Norris Cotton Cancer Center, Dartmouth Medical School, Lebanon, New Hampshire 03756, USA; 3Ontario Cancer Institute, Princess Margaret Hospital– University Health Network, and the Department of Medical Biophysics, University of Toronto, Toronto, Ontario M5G 1L7, Canada Genome-wide association studies (GWAS) are identifying genetic predisposition to various diseases. The 17q24.3 locus harbors the single nucleotide polymorphism (SNP) rs1859962 that is statistically associated with prostate cancer (PCa). It defines a 130-kb linkage disequilibrium (LD) block that lies in an ~2-Mb gene desert area. The functional biology driving the risk associated with this LD block is unknown. Here, we integrate genome-wide chromatin landscape data sets, namely, epigenomes and chromatin openness from diverse cell types. This identifies a PCa-specific enhancer within the rs1859962 risk LD block that establishes a 1-Mb chromatin loop with the SOX9 gene. The rs8072254 and rs1859961 SNPs mapping to this enhancer impose allele-specific gene expression. The variant allele of rs8072254 facilitates androgen receptor (AR) binding driving increased enhancer activity. The variant allele of rs1859961 decreases FOXA1 binding while increasing AP-1 binding.
  • Swabs to Genomes: a Comprehensive Workflow

    Swabs to Genomes: a Comprehensive Workflow

    UC Davis UC Davis Previously Published Works Title Swabs to genomes: a comprehensive workflow. Permalink https://escholarship.org/uc/item/3817d8gj Journal PeerJ, 3(5) ISSN 2167-8359 Authors Dunitz, Madison I Lang, Jenna M Jospin, Guillaume et al. Publication Date 2015 DOI 10.7717/peerj.960 Peer reviewed eScholarship.org Powered by the California Digital Library University of California Swabs to genomes: a comprehensive workflow Madison I. Dunitz1,3 , Jenna M. Lang1,3 , Guillaume Jospin1, Aaron E. Darling2, Jonathan A. Eisen1 and David A. Coil1 1 UC Davis, Genome Center, USA 2 ithree institute, University of Technology Sydney, Australia 3 These authors contributed equally to this work. ABSTRACT The sequencing, assembly, and basic analysis of microbial genomes, once a painstaking and expensive undertaking, has become much easier for research labs with access to standard molecular biology and computational tools. However, there are a confusing variety of options available for DNA library preparation and sequencing, and inexperience with bioinformatics can pose a significant barrier to entry for many who may be interested in microbial genomics. The objective of the present study was to design, test, troubleshoot, and publish a simple, comprehensive workflow from the collection of an environmental sample (a swab) to a published microbial genome; empowering even a lab or classroom with limited resources and bioinformatics experience to perform it. Subjects Bioinformatics, Genomics, Microbiology Keywords Workflow, Microbial genomics, Genome sequencing, Genome assembly, Bioinformatics INTRODUCTION Thanks to decreases in cost and diYculty, sequencing the genome of a microorganism is becoming a relatively common activity in many research and educational institutions.
  • Ubin – a Manual Refining Tool for Metagenomic Bins Designed for Educational 2 Purposes 3 4 Till L.V

    Ubin – a Manual Refining Tool for Metagenomic Bins Designed for Educational 2 Purposes 3 4 Till L.V

    bioRxiv preprint doi: https://doi.org/10.1101/2020.07.15.204776; this version posted July 16, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. 1 uBin – a manual refining tool for metagenomic bins designed for educational 2 purposes 3 4 Till L.V. Bornemann1, Sarah P. Esser1, Tom L. Stach1, Tim Burg2, and Alexander J. Probst1 5 6 1: Institute for Environmental Microbiology and Biotechnology, Department of Chemistry, 7 University Duisburg-Essen, Germany 8 2: Tim Burg, Im Acker 59, 56072 Koblenz, Germany 9 10 To whom the correspondence should be addressed: 11 [email protected] 12 13 Abstract 14 15 Resolving bacterial and archaeal genomes from metagenomes has revolutionized our 16 understanding of Earth’s biomes, yet producing high quality genomes from assembled 17 fragments has been an ever-standing problem. While automated binning software and their 18 combination produce prokaryotic bins in high-throughput, their manual refinement has been 19 slow and sometimes difficult. Here, we present uBin, a GUI-based, standalone bin refiner that 20 runs on all major operating platforms and was specifically designed for educational purposes. 21 When applied to the public CAMI dataset, refinement of bins was able to improve 78.9% of 22 bins by decreasing their contamination. We also applied the bin refiner as a standalone binner 23 to public metagenomes from the International Space Station and demonstrate the recovery of 24 near-complete genomes, whose replication indices indicate active proliferation of microbes in 25 Earth’s lower orbit.
  • 'Omics' Approaches to Assess the Effects of Phytochemicals in Human

    'Omics' Approaches to Assess the Effects of Phytochemicals in Human

    Downloaded from https://www.cambridge.org/core British Journal of Nutrition (2008), 99, E-Suppl. 1, ES127–ES134 doi:10.1017/S0007114508965818 q The Authors 2008 High throughput ‘omics’ approaches to assess the effects of phytochemicals . IP address: in human health studies 170.106.40.219 Jaroslava Ovesna´1*, Ondrˇej Slaby´2, Olivier Toussaint3, Milan Kodı´cˇek4, Petr Marsˇ´ık5, Vladimı´ra Pouchova´1 and Toma´sˇ Vaneˇk5 1Crop Research Institute, Drnovska´ 507, 161 06 Prague 6, Ruzyne, Czech Republic , on 2Masaryk Memorial Cancer Institute, Zluty kopec 7, 656 53 Brno, Czech Republic 01 Oct 2021 at 22:26:40 3Department of Biology, Unit of Cellular Biochemistry and Biology, University of Namur (FUNDP), 5000 Namur, Belgium 4Institute of Chemical Technology, Technicka´ 3, Praha 6, 160 00 Prague 6, Czech Republic 5Institute of Experimental Botany, Suchodol, 161 06 Prague 6, Czech Republic , subject to the Cambridge Core terms of use, available at Human health is affected by many factors. Diet and inherited genes play an important role. Food constituents, including secondary metabolites of fruits and vegetables, may interact directly with DNA via methylation and changes in expression profiles (mRNA, proteins) which results in metabolite content changes. Many studies have shown that food constituents may affect human health and the exact knowledge of genotypes and food constituent interactions with both genes and proteins may delay or prevent the onset of diseases. Many high throughput methods have been employed to get some insight into the whole process and several examples of successful research, namely in the field of genomics and transcriptomics, exist. Studies on epigenetics and RNome significance have been launched.