Volume 12, Issue 2

Total Page:16

File Type:pdf, Size:1020Kb

Volume 12, Issue 2 THE spring/summer volume 12 PRIMER issue 2 Data Quality, Data Sets and New Directions: Plotting IMG’s Next 10 Years in this issue At the recent 10th Annual Genomics An example of Kyrpides’ efforts to A Decade Using IMG .......... 2 of Energy & Environment meeting systematically describe and classify Tapping Microbial Communities hosted by the U.S. Department of microbes in action can be seen in the in Colorado ................ 4 Energy Joint Genome Institute (DOE Integrated Microbial Genomes (IMG) Highlights from the Annual JGI), a DOE Office of Science User data management system that his DOE JGI Meeting ............ 6 Facility, Nikos Kyrpides (below), head program developed and maintains in Concerned about Melting of the DOE JGI Prokaryote Super partnership with the Biosciences Permafrost ................. 8 Program, received the van Niel Computing Group of Berkeley Lab’s International Prize in Bacterial Computational Research Division. Systematics. The Van Niel Prize was IMG is the leading data analysis established in 1986 in honor of system of the DOE JGI’s Prokaryote microbiologist Cornelis Van Niel’s Super Program, and Kyrpides has contribution to scholarship in the been pushing the developments as field of microbiology, and is awarded the scientific lead of the project from Growing the Interest every three years by the University of its first working prototype in 2005 to in Genomics Queensland in Australia on the its current incarnation. On the IMG recommendation of a panel of experts system’s 10th year anniversary, he With a capacity crowd in atten- of the International Committee on took time to reflect on the milestones dance, the DOE JGI hosted the 10th Systematics of Prokaryotes. Phil achieved thus far and future directions. Annual Genomics of Energy & Hugenholtz, Director of the Center Environment Meeting. To mark the for Ecogenomics at the University of What are the highlights of the last 10 occasion, instead of a single opening Queensland and a former DOE JGI years to you? keynote address, the DOE JGI invited colleague of Kyrpides, was on hand representatives from the three at the Meeting to present the award. In a period of 10 years, IMG has Bioenergy Research Centers to give a Watch the ceremony at broken several records and has been series of short talks that highlighted http://bit.ly/JGI15KyrpidesVanNiel. established as one of the premier their collaborations with the DOE data management systems in the JGI, and featured applications of the community for comparative analysis basic science provided by the of microbial genomes and metage- Institute. Blake Simmons from the nomes. Its data size has grown Joint Bioenergy Institute (JBEI), 70-fold in terms of number of data Shawn Kaeppler of the Great Lakes sets and 22,000-fold in number of Bioenergy Research Center (GLBRC), genes. We have currently almost and Jerry Tuskan of the Bioenergy 50,000 genomes in our system, Science Center (BESC) all spoke containing 90 million genes. It’s briefly, while the closing keynote was taken 20 years to sequence all of delivered by Ed DeLong of the those genomes; I anticipate we will University of Hawaii at Manoa. easily double that number in the next The themes of their talks echoed two years. We have 6,000 metage- in presentations given over the nome data sets, which contain 29 three-day meeting held March billion genes. As far as I know, this 24–26, 2015 in Walnut Creek, Calif. represents the largest publicly Videos of these keynote talks, and of available database of metagenomics other presentations from the annual genes and therefore this is one more meeting, can be viewed on the DOE of IMG’s records. We’ve grown from a JGI YouTube channel at http://bit.ly/ few hundred to about 12,000 JGIUM2015videos. Images from the registered users in more than 90 meeting are online at http://bit.ly/ countries. We continued on page 2 JGI15UMphotos. continued on page 6 THE PRIMER A Decade of IMG Data and new directions continued from page 1 provide an alternative source of data, on how the data analysis tools and efficiently provide a comparison of a particularly for metagenomes, and we workflows should be organized, and metagenome against other metage- add significant value through the the developers implemented exactly nomes. Given the size of the data integration of various data types, as what the biologists wanted. It’s clear involved, that would take weeks and well as with curation and annotation. there was a grand vision upfront to you can’t do this efficiently on a In terms of data integration, we’ve handle this much growth in the past production scale (i.e. on a weekly managed to integrate several different 10 years. We can continue another 10 basis) even with high performance data types including one of the largest years on this current system, although computing (HPC) right now. collections of curated metadata from we also need to start exploring new The National Energy Research the GOLD database, as well as several solutions for more efficient handling Scientific Computing Center (NERSC) omics types including transcriptomics, of the data deluge ahead. is a vital partner in succeeding in the metatranscriptomics, proteomics, and One more of our early choices that era of big data. We’re already operat- methylomics. In an effort to connect I believed proved to be critical both ing at the scale where processing of to our DNA synthesis program at the for the growth and the success of the our data requires a HPC environment JGI, we have integrated a large system was to offer only a single data and we are very fortunate that at the collection of known natural products processing option for all datasets JGI this is provided by NERSC. We and connected them to their biosyn- submitted into our system. We do the need a bigger database and bigger thetic gene clusters, creating one of annotation for the users, and we computer clusters to support the the largest resources in the field. We process the datasets the way we know growing community demand, but we are currently working towards the best. Maintaining a huge system such also need to have the right computa- integration of metabolomics and trans- as IMG gives you great power, and tional environment to run our pipelines. posomics data produced at the JGI. with great power comes great respon- Another big challenge is how to Adding all of these means a complete- sibility. I believe we’re obliged to support big data, without sacrificing ly different operation from the figure out and apply the best annota- data quality. For example, annotating straightforward comparison of genes tion practice at any time rather than the metadata in the Genomes OnLine and genomes. With transcriptomes, allowing users to figure out what to Database (GOLD) is heavily manual, for example, you’re now talking about use and which one choose as some but it adds tremendous value to the the expression of genes you already other systems do. Providing an sequence data. Manual annotation have, and expression levels vary under environment where all the data are certainly contradicts with scaling, but varying conditions. In transposomics, uniformly processed and annotated is the availability of metadata is critical you look at the genes that are essen- of paramount value and importance. information in order to interpret the tial or have different fitness under data we have. varying conditions. So the original Looking forward to the next 10 years, IMG’s three-dimensional model of what are some of the challenges the How do you see IMG integrating with genes, genomes and functions has IMG system will need to tackle? KBase? What are the challenges here? become more multidimensional as you add each of the different data types. Our data sets are thousands of The two systems have different terabytes in size and we’ll be going to scientific goals and overall mission What do you think has helped IMG petabytes soon. We need to scale at and because of that they also have grow over the past 10 years? the level of hundreds of thousands of fundamentally different design data sets and hundred of billions of commitments, and follow different One of the critical things is that it genes. Right now our user interface principles in data organization and was a joint development between a can support the comparison of a few user support. For example, while group of engineers under the leader- hundred datasets but what we need IMG’s focus is on the comparative ship of Victor Markowitz (http://bit.ly/ and what researchers are asking for is analysis of microbial genomes and LBNL-BCG), long experience in to compare thousands against thou- metagenomes with emphasis on the genomic data, and a group of biolo- sands. No one is doing something like interface between the two, KBase’s gists that had very strong genomics that now. Everyone is currently focus seems to be more on the isolate and bioinformatics backgrounds. comparing a metagenome against genome side and metabolic modeling, Biologists provided the requirements isolate genomes, but no system can at least for now. System integration 2 / spring/summer 2015 / volume 12 / issue 2 THE PRIMER A Decade of IMG nomes, worldwide. In terms of new directions, my expectation is that in the next decade, the biggest overhaul in the landscape of microbial genom- ics and metagenomics will be at the interface of the two, and therefore this is where a large part of future IMG developments will focus. In keeping with its goal of supporting the analysis of both the parts and the whole, I would like to see IMG playing a central role in enabling the identifica- tion and analysis of individual popula- tions from environmental communi- ties, as well as facilitating the elucidation of their role within the community.
Recommended publications
  • Insights Into the Dynamics Between Viruses and Their Hosts in a Hot Spring Microbial Mat
    The ISME Journal (2020) 14:2527–2541 https://doi.org/10.1038/s41396-020-0705-4 ARTICLE Insights into the dynamics between viruses and their hosts in a hot spring microbial mat 1,2,11 1,2 1,2 1,2 1,2 Jessica K. Jarett ● Mária Džunková ● Frederik Schulz ● Simon Roux ● David Paez-Espino ● 1,2 1,2 1,2 3 4 Emiley Eloe-Fadrosh ● Sean P. Jungbluth ● Natalia Ivanova ● John R. Spear ● Stephanie A. Carr ● 5 6 7 8,9 1,2 Christopher B. Trivedi ● Frank A. Corsetti ● Hope A. Johnson ● Eric Becraft ● Nikos Kyrpides ● 9 1,2,10 Ramunas Stepanauskas ● Tanja Woyke Received: 13 January 2020 / Revised: 3 June 2020 / Accepted: 11 June 2020 / Published online: 13 July 2020 © The Author(s) 2020. This article is published with open access Abstract Our current knowledge of host–virus interactions in biofilms is limited to computational predictions based on laboratory experiments with a small number of cultured bacteria. However, natural biofilms are diverse and chiefly composed of uncultured bacteria and archaea with no viral infection patterns and lifestyle predictions described to date. Herein, we predict the first DNA sequence-based host–virus interactions in a natural biofilm. Using single-cell genomics and metagenomics – 1234567890();,: 1234567890();,: applied to a hot spring mat of the Cone Pool in Mono County, California, we provide insights into virus host range, lifestyle and distribution across different mat layers. Thirty-four out of 130 single cells contained at least one viral contig (26%), which, together with the metagenome-assembled genomes, resulted in detection of 59 viruses linked to 34 host species.
    [Show full text]
  • GSC Workshop 10 Grant Finances Monthly Teleconference Calls
    Standards in Genomic Sciences (2010) 3:225-231 DOI:10.4056/sigs.1423520 Meeting Report from the Genomic Standards Consortium (GSC) Workshop 10 Elizabeth Glass1, Folker Meyer1,2, Jack A Gilbert1,3, Dawn Field4, Sarah Hunter5, Renzo Kottmann6, Nikos Kyrpides7, Susanna Sansone8, Lynn Schriml9, Peter Sterk10, Owen White9 and John Wooley11 1 Argonne National Laboratory, Argonne, IL USA 2 Computation Institute, University of Chicago, Chicago, IL USA 3 Department of Ecology and Evolution, University of Chicago, Chicago, IL USA, 4 Centre for Ecology & Hydrology, Oxfordshire, UK 5 European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Insti- tute (EBI), Hinxton, Cambridge UK 6 Microbial Genomics Group, Max Planck Institute for Marine Microbiology and Jacobs University Bremen, Bremen, Germany 7 DOE Joint Genome Institute, Walnut Creek, CA, USA 8 University of Oxford, Oxford e-Research Centre, Oxford, UK 9 Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD USA 10 Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge UK 11 University of California San Diego, La Jolla, CA USA *Corresponding Author: Dawn Field This report summarizes the proceedings of the 10th workshop of the Genomic Standards Consortium (GSC), held at Argonne National Laboratory, IL, USA. It was the second GSC workshop to have open registration and attracted over 60 participants who worked together to progress the full range of projects ongoing within the GSC. Overall, the primary focus of the workshop was on advancing the M5 platform for next-generation collaborative computa- tional infrastructures. Other key outcomes included the formation of a GSC working group focused on MIGS/MIMS/MIENS compliance using the ISA software suite and the formal launch of the GSC Developer Working Group.
    [Show full text]
  • The Minimum Information About a Genome Sequence (MIGS) Specification
    University of Nebraska - Lincoln DigitalCommons@University of Nebraska - Lincoln US Department of Energy Publications U.S. Department of Energy 2008 The minimum information about a genome sequence (MIGS) specification Dawn Field Natural Environmental Research Council Centre for Ecology and Hydrology, [email protected] George Garrity Michigan State University Tanya Gray Natural Environmental Research Council Centre for Ecology and Hydrology Norman Morrison FUnivollowersity this of and Manchester additional works at: https://digitalcommons.unl.edu/usdoepub Jer emyPart ofSelengut the Bior esource and Agricultural Engineering Commons J. Craig Venter Institute SeeField, next Dawn; page Garrity for additional, George; Grauthorsay, Tany a; Morrison, Norman; Selengut, Jeremy; Sterk, Peter; Tatusova, Tatiana; Thomson, Nicholas; Allen, Michael J.; Angiuoli, Samuel V.; Ashburner, Michael; Axelrod, Nelson; Baldauf, Sandra; Ballard, Stuart; Boore, Jeffrey; Cochrane, Guy; Cole, James; Dawyndt, Peter; De Vos, Paul; dePamphilis, Claude; Edwards, Robert; Faruque, Nadeem; Feldman, Robert; Gilbert, Jack; Gilna, Paul; Glöckner, Frank Oliver; Goldstein, Philip; Guralnick, Robert; Haft, Dan; Hancock, David; Hermjakob, Henning; Hertz-Fowler, Christiane; Hugenholtz, Phil; Joint, Ian; Kagan, Leonid; Kane, Matthew; Kennedy, Jessie; Kowalchuk, George; Kottmann, Renzo; Kolker, Eugene; Kravitz, Saul; Kyrpides, Nikos; Leebens-Mack, Jim; Lewis, Suzanna E.; Li, Kelvin; Lister, Allyson L.; Lord, Phillip; Maltsev, Natalia; Markowitz, Victor; Martiny, Jennifer; Methe, Barbara; Mizrachi, Ilene; Moxon, Richard; Nelson, Karen; Parkhill, Julian; Proctor, Lita; White, Owen; Sansone, Susanna-Assunta; Spiers, Andrew; Stevens, Robert; Swift, Paul; Taylor, Chris; Tateno, Yoshio; Tett, Adrian; Turner, Sarah; Ussery, David; Vaughan, Bob; Ward, Naomi; Whetzel, Trish; San Gil, Ingio; Wilson, Gareth; and Wipat, Anil, "The minimum information about a genome sequence (MIGS) specification" (2008). US Department of Energy Publications.
    [Show full text]
  • Meeting Report: “Metagenomics, Metadata and Meta-Analysis” (M3) Workshop at the Pacific Symposium on Biocomputing 2010
    Standards in Genomic Sciences (2010) 2:357-360 DOI:10.4056/sigs.802738 Meeting Report: “Metagenomics, Metadata and Meta-analysis” (M3) Workshop at the Pacific Symposium on Biocomputing 2010 Lynette Hirschman1, Peter Sterk2,3, Dawn Field2, John Wooley4, Guy Cochrane5, Jack Gilbert6, Eugene Kolker7, Nikos Kyrpides8, Folker Meyer9, Ilene Mizrachi10, Yasukazu Nakamura11, Susanna-Assunta Sansone5, Lynn Schriml12, Tatiana Tatusova10, Owen White12 and Pelin Yilmaz13 1 Information Technology Center, The MITRE Corporation, Bedford, MA, USA 2 NERC Center for Ecology and Hydrology, Oxford, OX1 3SR, UK 3 Wellcome Trust Sanger Institute, Hinxton, Cambridge, UK 4 University of California San Diego, La Jolla, CA, USA 5 European Molecular Biology Laboratory (EMBL), European Bioinformatics Institute (EBI), Hinxton, Cambridge, UK 6 Plymouth Marine Laboratory (PML), Plymouth, UK 7 Seattle Children’s Hospital, Seattle, WA, USA 8 Genome Biology Program, DOE Joint Genome Institute, Walnut Creek, California, USA 9 Argonne National Laboratory, 9700 South Cass Avenue, Argonne, IL 60439, USA 10 National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA 11 DNA Data Bank of Japan, National Institute of Genetics, Yata, Mishima, Japan 12 University of Maryland School of Medicine, Baltimore, MD, USA 13 Microbial Genomics Group, Max Planck Institute for Marine Microbiology & Jacobs University Bremen, Bremen, Germany This report summarizes the M3 Workshop held at the January 2010 Pacific Symposium on Biocomputing. The workshop, organized by Genomic Standards Consortium members, in- cluded five contributed talks, a series of short presentations from stakeholders in the genom- ics standards community, a poster session, and, in the evening, an open discussion session to review current projects and examine future directions for the GSC and its stakeholders.
    [Show full text]
  • The Positive Role of the Ecological Community in the Genomic Revolution
    The positive role of the ecological community in the genomic revolution Dawn Field, Barbara Methe, Karen Nelson, and Nikos Kyrpides, An increasing number of ecologists are turning to the use of genomic technologies to complement and extend traditional methods of characterizing microbial diversity and its biological consequences (Xu, 2006). The SCOPE meeting on Microbial Environmental Genomics (MicroEnGen-II, Shanghai, June 12-15, 2006) provided a rich set of examples of progress at this interface (van Veen, Kowalchuk, 2006), many of which are described in this special issue. The session of this meeting entitled ‘new frontiers’ included two presentations on informatics and data management. We (DF and NK) talked about projects we are involved in, namely, the Integrated Microbial Genomes Resource (IMG) (Markowitz et al., 2006), the Genomic Standards Consortium (GSC) (Field et al., et al, in review) and the NERC Environmental Bioinformatics Centre (NEBC) (Field et al., 2005). In this commentary we briefly describe these projects, provide updated information on their development and attempt to expand on the wider issue of the increasing need for ecologists to become involved in the development of these and similar genomic resources. These projects have all benefited from the direct and indirect contribution of ecologists. Further, each is working in its own way to help strengthen the position of researchers working in eco- and environmental genomics. The IMG system is an example of a new generation database project attempting to standardize access to the publicly available collection of genomes, through a comparative genomics approach (Markowitz et al., 2006). The GSC represents an international effort to produce consensus on the ways in which we collect, exchange and represent genomic metadata (Field et al., et al, in review).
    [Show full text]
  • F. Nucleatum Sub Spp Vincentii and Its Comparison with the Genome of F
    Letter Genome Analysis of F. nucleatum sub spp vincentii and Its Comparison With the Genome of F. nucleatum ATCC 25586 Vinayak Kapatral,1 Natalia Ivanova, Iain Anderson, Gary Reznik, Anamitra Bhattacharyya, Warren L. Gardner, Natalia Mikhailova, Alla Lapidus, Niels Larsen, Mark D’Souza, Theresa Walunas, Robert Haselkorn, Ross Overbeek, and Nikos Kyrpides Integrated Genomics Inc., Chicago, Illinois 60612, USA We present the draft genome sequence and its analysis for Fusobacterium nucleatum sub spp. vincentii (FNV), and compare that genome with F. nucleatum ATCC 25586 (FN). A total of 441 FNV open reading frames (ORFs) with no orthologs in FNhave been identified. Of these, 118 ORFs have no known fun ction and are unique to FNV, whereas 323 ORFs have functional orthologs in other organisms. In addition to the excretion of butyrate, H2S and ammonia-like FN, FNV has the additional capability to excrete lactate and aminobutyrate. Unlike FN, FNV is likely to incorporate galactopyranose, galacturonate, and sialic acid into its O-antigen. It appears to transport ferrous iron by an anaerobic ferrous transporter. Genes for eukaryotic type serine/threonine kinase and phosphatase, transpeptidase E-transglycosylase Pbp1A are found in FNV but not in FN. Unique ABC transporters, cryptic phages, and three types of restriction-modification systems have been identified in FNV. ORFs for ethanolamine utilization, thermostable carboxypeptidase, ␥ glutamyl-transpeptidase, and deblocking aminopeptidases are absent from FNV. FNV, like FN, lacks the classical catalase-peroxidase system, but thioredoxin/glutaredoxin enzymes might alleviate oxidative stress. Genes for resistance to antibiotics such as acriflavin, bacitracin, bleomycin, daunorubicin, florfenicol, and other general multidrug resistance are present.
    [Show full text]
  • High-Quality Permanent Draft Genome Sequence of Bradyrhizobium Sp Th.B2, a Microsymbiont of Amphicarpaea Bracteata Collected in Johnson City, New York
    Binghamton University The Open Repository @ Binghamton (The ORB) Biological Sciences Faculty Scholarship Biological Sciences 5-2015 High-quality permanent draft genome sequence of Bradyrhizobium sp Th.b2, a microsymbiont of Amphicarpaea bracteata collected in Johnson City, New York Rui Tian Matthew Parker Binghamton University--SUNY Rekha Seshadri T. B.K. Reddy Victor Markowitz See next page for additional authors Follow this and additional works at: https://orb.binghamton.edu/bio_fac Part of the Biology Commons Recommended Citation Tian, Rui; Parker, Matthew; Seshadri, Rekha; Reddy, T. B.K.; Markowitz, Victor; Ivanova, Natalia; Pati, Amrita; Woyke, Tanja; Baeshen, Mohammed; Baeshen, Nabih; Kyrpides, Nikos; and Reeve, Wayne, "High- quality permanent draft genome sequence of Bradyrhizobium sp Th.b2, a microsymbiont of Amphicarpaea bracteata collected in Johnson City, New York" (2015). Biological Sciences Faculty Scholarship. 10. https://orb.binghamton.edu/bio_fac/10 This Article is brought to you for free and open access by the Biological Sciences at The Open Repository @ Binghamton (The ORB). It has been accepted for inclusion in Biological Sciences Faculty Scholarship by an authorized administrator of The Open Repository @ Binghamton (The ORB). For more information, please contact [email protected]. Authors Rui Tian, Matthew Parker, Rekha Seshadri, T. B.K. Reddy, Victor Markowitz, Natalia Ivanova, Amrita Pati, Tanja Woyke, Mohammed Baeshen, Nabih Baeshen, Nikos Kyrpides, and Wayne Reeve This article is available at The Open Repository @ Binghamton (The ORB): https://orb.binghamton.edu/bio_fac/10 Tian et al. Standards in Genomic Sciences (2015) 10:24 DOI 10.1186/s40793-015-0008-y SHORT GENOME REPORT Open Access High-quality permanent draft genome sequence of Bradyrhizobium sp.
    [Show full text]
  • Third Annual DOE Joint Genome Institute User Meeting
    Third Annual DOE Joint Genome Institute User Meeting Sponsored By U.S. Department of Energy Office of Science March 26–28, 2008 Marriott Hotel Walnut Creek, California Contents Agenda............................................................................iv Speaker Presentations....................................................1 Poster Presentations.....................................................11 Attendees .......................................................................53 Author Index.................................................................59 iii Agenda All functions to be held at the Walnut Creek Marriott unless otherwise noted WEDNESDAY, March 26 Start Time End Time Subject Session Chair/Speaker 8:30 AM Registration Opens 9:00 AM 12:00 PM Workshops Eukaryotic Annotation Workshop IMG Workshop JGI 101 Workshop 12:00 PM 1:00 PM Lunch provided for WORKSHOP PARTICIPANTS ONLY on this day 1:00 PM 4:30 PM Session 1- Biomass Feedstocks Eddy Rubin, chair 1:00 PM 1:10 PM Introduction Eddy Rubin 1:10 PM 1:40 PM JGI Plant Portfolio Overview Jerry Tuskan 1:40 PM 2:10 PM The Sorghum bicolor Genome, the Diversification Andrew Paterson of Cereals, and the Productivity of Tropical Grasses 2:10 PM 2:40 PM Miscanthus Steve Long 2:40 PM 3:00 PM Break 3:00 PM 3:30 PM Poplar Metabolism Tim Tschaplinski 3:30 PM 4:00 PM Eucalyptus: Sequencing a Global Dario Grattapaglia Tree Genome for Energy, Fiber and Wood 4:00 PM 4:15 PM Short talk from poster abstracts Speaker TBD 4:15 PM 4:30 PM Short talk from poster abstracts Speaker TBD 5:30
    [Show full text]
  • The Future of Microbial Metagenomics (Or Is Ignorance Bliss?)
    The ISME Journal (2011) 5, 777–779 & 2011 International Society for Microbial Ecology All rights reserved 1751-7362/11 www.nature.com/ismej COMMENTARY The Future of microbial metagenomics (or is ignorance bliss?) Jack A Gilbert, Folker Meyer and Mark J Bailey The ISME Journal (2011) 5, 777–779; doi:10.1038/ microbial diversity studies. Although concerted ismej.2010.178; published online 25 November 2010 efforts are under way to improve the coverage of known genome-derived proteins from wet-lab derived biochemical annotations, the technology to A recent explosion in the number of studies taking link the small body of experimentally generated advantage of the power of next-generation sequen- evidence to large ‘families’ of similar proteins is still cing to explore metagenomic or 16S rRNA taxo- in flux. One major factor that hinders the use of the nomic diversity of microbial environments means existing annotations is the bias in the genome and that, we need to stop and think about how we best protein knowledge bases, for example, the vast interpret these data. majority of sequenced organisms originate from the Currently, 16S rRNA gene studies provide us with medical community. The Genomic Encyclopedia the most effective way of fingerprinting the species of Bacteria and Achaea (GEBA) project has provided richness of a community, with weak links to culture- a substantial increase to our understanding of derived functional relationships. However, the vast microbial genomics from the rest of the phylogenetic increase in the number of bacterial taxa known only matrix, highlighting the importance of whole from 16S rRNA sequence has started to sever this genomes in exploring evolution and the protein link, which can only be corrected through more universe.
    [Show full text]
  • GSC 6 and GSC 7 Meeting Reports Lent Speakers and Talks
    Standards in Genomic Sciences (2009) 1:68-71 DOI:10.4056/sigs.25165 Meeting Report from the Genomic Standards Consortium (GSC) Workshops 6 and 7 Dawn Field1*, Peter Sterk1,2, Nikos Kyrpides3, Renzo Kottmann4, Frank Oliver Glöckner4, Lynette Hirschman5, George M. Garrity6, John Wooley7 and Paul Gilna7 1 NERC Center for Ecology and Hydrology, Oxford, United Kingdom 2 The Pathogen Sequencing Unit, The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK 3 Genome Biology Program, DOE Joint Genome Institute, Walnut Creek, California, USA 3 European Molecular Biology Laboratory (EMBL) Outstation, European Bioinformatics Institute (EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, UK. 4 Microbial Genomics Group, Max Planck Institute for Marine Microbiology & Jacobs University Bremen, Bremen, Germany 5 Information Technology Center, The MITRE Corporation, Bedford, Massachusetts, USA 6 Department of Microbiology and Molecular Genetics, Michigan State University, East Lansing, Michigan, USA 7 University of California San Diego, La Jolla California Corresponding author: Dawn Field This report summarizes the proceedings of the 6th and 7th workshops of the Genomic Stan- dards Consortium (GSC), held back-to-back in 2008. GSC 6 focused on furthering the activi- ties of GSC working groups, GSC 7 focused on outreach to the wider community. GSC 6 was held October 10-14, 2008 at the European Bioinformatics Institute, Cambridge, United King- dom and included a two-day workshop focused on the refinement of the Genomic
    [Show full text]
  • The Genome Sequence of the Facultative Intracellular Pathogen Brucella Melitensis
    The genome sequence of the facultative intracellular pathogen Brucella melitensis Vito G. DelVecchio*†, Vinayak Kapatral‡, Rajendra J. Redkar*, Guy Patra*, Cesar Mujer*, Tamara Los‡, Natalia Ivanova‡, Iain Anderson‡, Anamitra Bhattacharyya‡, Athanasios Lykidis‡, Gary Reznik‡, Lynn Jablonski‡, Niels Larsen‡, Mark D’Souza‡, Axel Bernal‡, Mikhail Mazur‡, Eugene Goltsman‡, Eugene Selkov‡, Philip H. Elzer§, Sue Hagius§, David O’Callaghan¶, Jean-Jacques Letessonʈ, Robert Haselkorn‡, Nikos Kyrpides‡, and Ross Overbeek‡ *Institute of Molecular Biology and Medicine, University of Scranton, Scranton, PA 18510; ‡Integrated Genomics, Incorporated, 2201 West Campbell Park Drive, Chicago, IL 60612; §Department of Veterinary Science, Louisiana State University Ag Center, 111 Dalrymple Building, Baton Rouge, LA 70803; ¶Institut National de la Sante´et de la Recherche Me´dicale, U431, Faculte´deMe´ decine, Avenue Kennedy, 30900 Nıˆmes,France; and ʈUnite´de Recherche en Biologie Mole´culaire, Laboratoire d’Immunologie et de Microbiologie, Universite´of Namur, 61 Rue de Bruxelles, 5000 Namur, Belgium Contributed by Robert Haselkorn, October 29, 2001 Brucella melitensis is a facultative intracellular bacterial pathogen Brucella melitensis (15–17). However, each biovar exhibits a pref- that causes abortion in goats and sheep and Malta fever in humans. erence for specific hosts: B. melitensis infects goats and sheep, The genome of B. melitensis strain 16M was sequenced and found Brucella abortus causes abortion in cattle, and Brucella suis infects to contain 3,294,935 bp distributed over two circular chromosomes swine. Brucella canis, Brucella ovis, and Brucella neotomae are of 2,117,144 bp and 1,177,787 bp encoding 3,197 ORFs. By using the pathogenic to dogs, sheep, and desert wood rats, respectively.
    [Show full text]
  • 1 Meeting Report
    Meeting Report: The 2nd Annual Argonne Soils Workshop, Argonne National Laboratory, Chicago USA, October 6-8, 2010 Sarah L. O’Brien1*, Elizabeth M. Glass2,4, Jennifer M. Brulc1, Jack A. Gilbert1,3,4, Dionysios A. Antonopoulos1,3, Folker Meyer1,2,5 1Institute for Genomics and Systems Biology, 2Mathematics and Computer Science Division, 3Biosciences Division, Argonne National Laboratory, Argonne, IL 60439, 4Department of Ecology and Evolution, 5Computation Institute, University of Chicago, Chicago, IL 60637 *Corresponding author: Institute for Genomics and Systems Biology, Argonne National Laboratory, 9700 South Cass Avenue, Argonne IL, USA. Email: [email protected] Phone: +1 630-252-7854. The submitted manuscript has been created by UChicago Argonne, LLC, Operator of Argonne National Laboratory (“Argonne”). Argonne, a U.S. Department of Energy Office of Science laboratory, is operated under Contract No. DE‐AC02‐06CH11357. The U.S. Government retains for itself, and others acting on its behalf, a paid‐up nonexclusive, irrevocable worldwide license in said article to reproduce, prepare derivative works, distribute copies to the public, and perform publicly and display publicly, by or on behalf of the Government. 1 Abstract This report summarizes the proceedings of the 2nd Annual Argonne Soils Workshop held at Argonne National Laboratory October 6-8, 2010. The workshop assembled a diverse group of soil ecologists, microbiologists, molecular biologists, and computational scientists to discuss the challenges and opportunities related to implementation of metagenomics approaches in soil microbial ecology. The overarching theme of the workshop was “designing ecologically meaningful soil metagenomics research”, which encouraged presentations on both ecological and computational topics. The workshop fostered valuable cross-discipline communication and delivered the message that soil metagenomics research must be based on an iterative process between biological inquiry and bioinformatics tools.
    [Show full text]