Gotoolbox: Functional Analysis of Gene Datasets Based on Gene Ontology

Total Page:16

File Type:pdf, Size:1020Kb

Gotoolbox: Functional Analysis of Gene Datasets Based on Gene Ontology GOToolBox: functional analysis of gene datasets based on Gene Ontology. David Martin, Christine Brun, Elisabeth Remy, Pierre Mouren, Denis Thieffry, Bernard Jacq To cite this version: David Martin, Christine Brun, Elisabeth Remy, Pierre Mouren, Denis Thieffry, et al.. GOToolBox: functional analysis of gene datasets based on Gene Ontology.. Genome Biology, BioMed Central, 2004, 5, pp.R101. 10.1186/gb-2004-5-12-r101. inserm-00095249 HAL Id: inserm-00095249 https://www.hal.inserm.fr/inserm-00095249 Submitted on 15 Sep 2006 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Software2004MartinetVolume al. 5, Issue 12, Article R101 Open Access GOToolBox: functional analysis of gene datasets based on Gene comment Ontology David Martin*, Christine Brun*, Elisabeth Remy†, Pierre Mouren*, Denis Thieffry* and Bernard Jacq* Addresses: *Laboratoire de Génétique et Physiologie du Développement, IBDM, CNRS/INSERM/Université de la Méditerranée, Parc Scientifique de Luminy, case 907, 13288 Marseille, France. †Institut de Mathématiques de Luminy, Parc Scientifique de Luminy, 13288 Marseille, France. reviews Correspondence: David Martin. E-mail: [email protected] Published: 26 November 2004 Received: 13 April 2004 Revised: 31 August 2004 Genome Biology 2004, 5:R101 Accepted: 25 October 2004 The electronic version of this article is the complete one and can be found online at http://genomebiology.com/2004/5/12/R101 reports © 2004 Martin et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. GOToolBox:<p>Toolsto find genes are functionalwith presented similar analysis to annotations.</p> identify of geGenene datasets Ontology based terms on that Gene are Ontology over- or under-represented in a dataset, to cluster genes by function and deposited research Abstract We have developed methods and tools based on the Gene Ontology (GO) resource allowing the identification of statistically over- or under-represented terms in a gene dataset; the clustering of functionally related genes within a set; and the retrieval of genes sharing annotations with a query gene. GO annotations can also be constrained to a slim hierarchy or a given level of the ontology. The source codes are available upon request, and distributed under the GPL license. refereed research refereed Rationale the description of some aspects of gene function which are Since complete genome sequences have become available, the specific to few lineages only. Within each of these ontologies, amount of annotated genes has increased dramatically. These the terms are organized in a hierarchical way, according to advances have allowed the systematic comparison of the gene parent-child relationships in a directed acyclic graph (DAG). content of different organisms, leading to the conclusion that This allows a progressive functional description, matching organisms share the majority of their genes with only rela- the current level of experimental characterization of the cor- tively few species-specific genes. On this basis, one can responding gene product. The hierarchical organization of interactions develop strategies to infer gene annotations from model spe- the gene ontology is particularly well adapted to computa- cies to less experimentally tractable organisms. However, tional processing and is used for the functional annotations of such functional inferences require the definition of species- gene products of several model organisms such as budding independent annotation policies. yeast [2], Drosophila [3], mouse [4], nematode [5] and Ara- bidopsis [6]. More recently, GO annotations for human genes In this regard, the Gene Ontology consortium [1] has been have been proposed in the context of the GOA project [7]. created to develop a unified view of gene functional annota- tions for different model organisms. Three structured vocab- In parallel, the recent development of new high-throughput information ularies (or ontologies) have been proposed, which allow the methods has generated an enormous amount of functional description of molecular functions, biological processes and data and has motivated the development of dedicated analy- cellular locations of any gene product, respectively. Whereas sis tools. For instance, one might wonder whether genes the majority of GO terms are common to several organisms, detected as being coexpressed in a DNA chip experiment are some of them are specific to a few organisms only, enabling related in terms of molecular or cellular function. In practical Genome Biology 2004, 5:R101 R101.2 Genome Biology 2004, Volume 5, Issue 12, Article R101 Martin et al. http://genomebiology.com/2004/5/12/R101 available for download on the GOToolBox server for one Gene name/ID list Gene name/ID User input week. This file contains also the counts of terms within a ref- erence gene dataset (genome or user-defined), and can then Dataset GO- Program creation Family be used as an input for the GO-Stats and GO-Proxy programs Result described below. Associated terms Functionally and parents related genes Ontology options An optional tool, GO-Diet, allows either the reduction of the term dataset to a slim GO hierarchy (either one proposed by GO-Diet the GO consortium or a user-defined one) or the restriction of Terms sorted GO-Stats by relevance the considered terms to a chosen depth of the ontology. It is Slimmed GO also possible to filter terms based on the way these have been annotation set Genes clustered assigned to the gene products (evidence code). This tool is GO-Proxy by function useful to decrease the number of GO terms associated with a gene dataset, thereby facilitating the analysis of the results of FlowchartFigure 1 of the GOToolBox programs programs described below, particularly when the input gene Flowchart of the GOToolBox programs. list and/or the number of associated GO terms is large. Note that the GO-Diet program can generate a gene-term associa- tion file in the TLF format, allowing the use of GO terms as terms, we address here the following generic questions. First, gene labels with the TreeDyn tree drawing program [9]. The are there statistically over- or under-represented GO terms GO-Diet options are proposed in the Dataset-Creation form. associated with a given gene set, compared to the distribution of these terms among the annotations of the complete GO term statistics genome? Second, among a particular gene set, are there Frequencies of terms within the dataset are calculated and closely functionally related gene subsets? And third, are there compared with reference frequencies (for example with genes having GO similarities with a given probe gene? genomic frequencies or with the frequencies of these terms in the complete list of genes spotted on an array). This proce- To formulate such questions properly in a well defined math- dure allows the delineation of enrichments or depletions of ematical framework, we have developed a set of methods and specific terms in the dataset. The probability of obtaining by tools, collectively called GOToolBox, to process the GO anno- chance a number k of annotated genes for a given term among tations for any model organism for which they are available a dataset of size n, knowing that the reference dataset con- (Figure 1). tains m such annotated genes out of N genes, is then calcu- lated. This test follows the hypergeometric distribution All the programs are written in PERL and use the CGI and described in Equation 1: DBI modules. All the ontology data and the gene-GO terms associations are taken from the GO consortium website. m Nm− These data are structured in a PostGreSQL relational data- k nk− base, which is updated monthly. Statistics are calculated P(rX{}= k= 1) N using the R statistical programming environment. The web n implementation of the GOToolBox is accessible at [8]. where the random variable X represents the number of genes within a given gene subset, annotated with a given GO term. Features Implemented in the GO-Stats tool, this formula permits the In this section, we describe the five main functionalities of the automatic ranking of all annotation terms, as well as the eval- GOToolBox suite. Two of them (GO-Proxy and GO-Family) uation of the significance of their occurrences within the data- are not encompassed by any other GO-processing tool cur- set. An illustration of such an approach is given in 'Mining rently available (see also 'Comparison of the GOToolBox with biological data'. A typical GO-Stats output is presented in Fig- other GO-based analysis programs'). ure 2. Dataset creation GO-based gene clustering The first step in analyzing gene datasets consists in retrieving, The goal of the GO-Proxy tool is to group together function- for each individual gene of the dataset, all the corresponding ally related genes on the basis of their GO terms. The rationale GO terms and their parent terms using the Dataset creation sustaining our method is that the more genes have common program. The genomic frequency of each GO term associated GO terms, and the less they have specific associated terms, with genes present in the dataset is then calculated. The the more likely they are to be functionally related. For any two resulting information is structured and stored in a data file, genes of the gene set, the program calculates an annotation- Genome Biology 2004, 5:R101 http://genomebiology.com/2004/5/12/R101 Genome Biology 2004, Volume 5, Issue 12, Article R101 Martin et al.
Recommended publications
  • 1 Supporting Information for a Microrna Network Regulates
    Supporting Information for A microRNA Network Regulates Expression and Biosynthesis of CFTR and CFTR-ΔF508 Shyam Ramachandrana,b, Philip H. Karpc, Peng Jiangc, Lynda S. Ostedgaardc, Amy E. Walza, John T. Fishere, Shaf Keshavjeeh, Kim A. Lennoxi, Ashley M. Jacobii, Scott D. Rosei, Mark A. Behlkei, Michael J. Welshb,c,d,g, Yi Xingb,c,f, Paul B. McCray Jr.a,b,c Author Affiliations: Department of Pediatricsa, Interdisciplinary Program in Geneticsb, Departments of Internal Medicinec, Molecular Physiology and Biophysicsd, Anatomy and Cell Biologye, Biomedical Engineeringf, Howard Hughes Medical Instituteg, Carver College of Medicine, University of Iowa, Iowa City, IA-52242 Division of Thoracic Surgeryh, Toronto General Hospital, University Health Network, University of Toronto, Toronto, Canada-M5G 2C4 Integrated DNA Technologiesi, Coralville, IA-52241 To whom correspondence should be addressed: Email: [email protected] (M.J.W.); yi- [email protected] (Y.X.); Email: [email protected] (P.B.M.) This PDF file includes: Materials and Methods References Fig. S1. miR-138 regulates SIN3A in a dose-dependent and site-specific manner. Fig. S2. miR-138 regulates endogenous SIN3A protein expression. Fig. S3. miR-138 regulates endogenous CFTR protein expression in Calu-3 cells. Fig. S4. miR-138 regulates endogenous CFTR protein expression in primary human airway epithelia. Fig. S5. miR-138 regulates CFTR expression in HeLa cells. Fig. S6. miR-138 regulates CFTR expression in HEK293T cells. Fig. S7. HeLa cells exhibit CFTR channel activity. Fig. S8. miR-138 improves CFTR processing. Fig. S9. miR-138 improves CFTR-ΔF508 processing. Fig. S10. SIN3A inhibition yields partial rescue of Cl- transport in CF epithelia.
    [Show full text]
  • Supporting Information
    Supporting Information Barshis et al. 10.1073/pnas.1210224110 SI Materials and Methods and “ssu-parc.fasta” downloaded on September 14, 2011 from mRNA Extraction. Total RNA was extracted from each sample www.arb-silva.de), and potential Symbiodinium contamination using a modified TRIzol (GibcoBRL/Invitrogen) protocol. Ap- was removed based significant nucleotide similarity (BLASTN, proximately 150–200 mg of coral tissue and skeleton was placed ≥100 bp and ≥70% identity) to ESTs from Symbiodinium sp. in 1 mL of TRIzol and homogenized for 2 min by vortexing with KB8 (clade A) and Symbiodinium sp. MF1.04b (clade B) (6) ∼100 μL of 0.5-mm Zirconia/Silica Beads (BioSpec Products). and the related dinoflagellate Polarella glacialis. Finally, the re- Resulting tissue/TRIzol slurry was removed by centrifugation, sulting contigs were compared (via BLASTX) to the NCBI non- and the standard TRIzol extraction was performed according to redundant protein database (nr; downloaded on June 7, 2011 manufacturer’s specifications with the replacement of 250 μLof from www.ncbi.nlm.nih.gov). The nonredundant (nr) results were 100% (vol/vol) isopropanol with 250 μL of high-salt buffer (0.8 used to remove any additional sequences likely to be noncoral, M Na citrate, 1.2 M NaCl) during the final precipitation step. based on similarity to alveolates, fungi, bacteria, or Archaea, as Resulting RNA pellet was resuspended in 12 μL of diethylpyro- determined using the metagenome analyzer (MEGAN) Version 4 = = = carbonate (DEPC)-treated H2O. mRNA was isolated from total (min. support 1, min. score 200, top percent 20) (7). RNA using the Micro-FastTrack mRNA isolation kit (In- The remaining, putatively coral, contigs were then “meta- vitrogen) and an overnight precipitation at −80 °C.
    [Show full text]
  • Cytoplasmic Localized Ubiquitin Ligase Cullin 7 Binds to P53 and Promotes Cell Growth by Antagonizing P53 Function
    Oncogene (2006) 25, 4534–4548 & 2006 Nature Publishing Group All rights reserved 0950-9232/06 $30.00 www.nature.com/onc ORIGINAL ARTICLE Cytoplasmic localized ubiquitin ligase cullin 7 binds to p53 and promotes cell growth by antagonizing p53 function P Andrews1,2,YJHe1 and Y Xiong1,2,3 1Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, NC, USA; 2Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC, USA and 3Program in Molecular Biology and Biotechnology, University of North Carolina, Chapel Hill, NC, USA Cullins are a family of evolutionarily conserved proteins monomeric manner (monoubiquitination) to cause that bind to the small RING fingerprotein,ROC1, to conformational change to the substrate or in a constitute potentially a large number of distinct E3 polyubiquitin chain which signals the substrate to be ubiquitin ligases. CUL7 mediates an essential function degraded by the 26S proteasome. Both ubiquitin formouse embryodevelopment and has been linked with activating and conjugating enzymes are well character- cell transformation by its physical association with the ized and contain highly conserved functional domains. SV40 large T antigen. We report here that, like its closely The identity and mechanism of E3 ubiquitin ligases, on related homolog PARC, CUL7 is localized predominantly the other hand, has been elusive and long postulated as in the cytoplasm and binds directly to p53. In contrast to an activity responsible for both recognizing substrates PARC, however, CUL7, even when overexpressed, did not and for catalysing polyubiquitin chain formation sequesterp53 in the cytoplasm. We have identified a (Hershko and Ciechanover, 1998). sequence in the N-terminal region of CUL7 that is highly Currently, two major families of E3 ligases have been conserved in PARC and a sequence spanning the described.
    [Show full text]
  • Visual Data Mining : Background, Techniques, and Drug Discovery
    Visual Data Mining: Background, Techniques, and Drug Discovery Applications Mihael Ankerst The Boeing Company Georges Grinstein UMass Lowell and AnVil Inc. Daniel Keim AT&T Research and University of Konstanz A color version of the tutorial notes can be found via http://www.fmi.uni-konstanz.de/~keim KDD’2002 Conference Emails and URLs Data Exploration • Definition Mihael Ankerst – [email protected] Data Exploration is the process of searching and analyzing – http://www.visualclassification.com/ankerst databases to find implicit but potentially useful information Daniel A. Keim – [email protected] • more formally – [email protected] Data Exploration is the process of finding a – http://www.fmi.uni-konstanz.de/~keim • subset D‘ of the database D and George Grinstein – [email protected] • hypotheses Hu(D‘,C) – http://genome.uml.edu that a user U considers useful in an application context C – http://www.anvilinfo.com Mihael Ankerst, The Boeing Company -- Daniel A. Keim, AT&T and Univ. of Konstanz Mihael Ankerst, The Boeing Company -- Daniel A. Keim, AT&T and Univ. of Konstanz Georges Grinstein, UMass Lowell and AnVil Inc. 2 Georges Grinstein, UMass Lowell and AnVil Inc. 5 Overview Abilities of Humans and Computers Part I: Visualization Techniques 1. Introduction 2. Visual Data Exploration Techniques abilities of Data Storage 3. Distortion and Interaction Techniques the computer Numerical Computation 4. Visual Data Mining Systems Searching Part II: Specific Visual Data Mining Techniques 1. Association Rules Planning 2. Classification Diagnosis Logic 3. Clustering Prediction 4. Text Mining 5. Tightly Integrated Visualization Perception Part III: Drug Discovery Applications Creativity 1.
    [Show full text]
  • Biological Data Integration Using Semantic Web Technologies
    Biological data integration using Semantic Web technologies Pasquier C Phone: +33 492 07 6947 Fax: +33 492 07 6432 Email: [email protected] Institute of Signaling, Developmental Biology & Cancer CNRS - UMR 6543, University of Nice Sophia-Antipolis Parc Valrose, 06108 NICE cedex 2, France. Summary Current research in biology heavily depends on the availability and efficient use of information. In order to build new knowledge, various sources of biological data must often be combined. Semantic Web technologies, which provide a common framework allowing data to be shared and reused between applications, can be applied to the management of disseminated biological data. However, due to some specificities of biological data, the application of these technologies to life science constitutes a real challenge. Through a use case of biological data integration, we show in this paper that current Semantic Web technologies start to become mature and can be applied for the development of large applications. However, in order to get the best from these technologies, improvements are needed both at the level of tool performance and knowledge modeling. Keywords Data integration, Semantic Web, OWL, RDF, SPARQL, Knowledge Base System (KBS) Introduction Biology is now an information-intensive science and research in genomics, transcriptomics and proteomics heavily depend on the availability and the efficient use of information. When data were structured and organized as a collection of records in dedicated, self-sufficient databases, information was retrieved by performing queries on the database using a specialized query language; for example SQL (Structured Query Language) for relational databases or OQL (Object Query Language) for object databases.
    [Show full text]
  • Uman Enome News
    uman enome news ISSN: 1050-6101 Vol. 7, No.2, July-August 1995 Optical Mapping Offers Fast, Accurate Method for Generating Restriction Maps New Approach Eliminates Electrophoresis, Is Amenable to Automation evelopment of cheaper and faster technologies for large-scale Dgenome mapping has been a major priority in the first 5 years of the Human Genome Project. Although many efforts have focused on improving standard gel electrophoresis and hybridization methods, a new approach using optical detection of single DNA mole.cules shows great promise for rapid construction of ordered genome maps based on restriction endonuclease cutting sites. l -4 Restriction endonucleases-enzymes that cut DNA molecules at specific sites in the genome-have played a major role in allowing investigators to identify and characterize various loci on a DNA molecule. Unlike maps based on STSs (a sequence-based landmark), restriction maps provide the precise genomic distances that are essential for efficient sequencing and for determining the spatial relationships of specific loci. Compared with hybridization-based fingerprinting approaches, ordered restriction maps offer relatively unambiguous clone characterization, which is useful for determining overlapping areas in contig formation, establishing minimum tiling paths for sequencing (coverage of a region), and characterizing genetic lesions with respect to various structural alterations. Image of a human chromosome 11 YAC clone (425 kb) cleaved by restriction endonucleases, Despite the broad applications of restriction maps, however, associated stained with a fluorochrome, and visualized by techniques for their generation have changed little over the last 10 years fluorescence microscopy. (White bar at lower left because of their reliance on tedious electrophoresis methods.
    [Show full text]
  • Towards a Sustainable Funding Model for the Uniprot Use Case[Version 2
    F1000Research 2018, 6(ELIXIR):2051 Last updated: 25 OCT 2018 RESEARCH ARTICLE Funding knowledgebases: Towards a sustainable funding model for the UniProt use case [version 2; referees: 3 approved] Chiara Gabella , Christine Durinx , Ron Appel ELIXIR-Switzerland, SIB Swiss Institute of Bioinformatics, Lausanne, 1015, Switzerland First published: 27 Nov 2017, 6(ELIXIR):2051 (doi: Open Peer Review v2 10.12688/f1000research.12989.1) Latest published: 22 Mar 2018, 6(ELIXIR):2051 (doi: 10.12688/f1000research.12989.2) Referee Status: Abstract Invited Referees Millions of life scientists across the world rely on bioinformatics data resources 1 2 3 for their research projects. Data resources can be very expensive, especially those with a high added value as the expert-curated knowledgebases. Despite the increasing need for such highly accurate and reliable sources of scientific version 2 information, most of them do not have secured funding over the near future and published often depend on short-term grants that are much shorter than their planning 22 Mar 2018 horizon. Additionally, they are often evaluated as research projects rather than as research infrastructure components. version 1 In this work, twelve funding models for data resources are described and published report report report applied on the case study of the Universal Protein Resource (UniProt), a key 27 Nov 2017 resource for protein sequences and functional information knowledge. We show that most of the models present inconsistencies with open access or Helen Berman, Rutgers, The State equity policies, and that while some models do not allow to cover the total 1 costs, they could potentially be used as a complementary income source.
    [Show full text]
  • Trapping of the Transport-Segment DNA by the Atpase Domains of a Type II Topoisomerase
    ARTICLE DOI: 10.1038/s41467-018-05005-x OPEN Trapping of the transport-segment DNA by the ATPase domains of a type II topoisomerase Ivan Laponogov1,2,3, Xiao-Su Pan2, Dennis A. Veselkov1, Galyna B. Skamrova1, Trishant R. Umrekar 1,4, L. Mark Fisher 2 & Mark R. Sanderson 1 Type II topoisomerases alter DNA topology to control DNA supercoiling and chromosome segregation and are targets of clinically important anti-infective and anticancer therapeutics. 1234567890():,; They act as ATP-operated clamps to trap a DNA helix and transport it through a transient break in a second DNA. Here, we present the first X-ray crystal structure solved at 2.83 Å of a closed clamp complete with trapped T-segment DNA obtained by co-crystallizing the ATPase domain of S. pneumoniae topoisomerase IV with a nonhydrolyzable ATP analogue and 14-mer duplex DNA. The ATPase dimer forms a 22 Å protein hole occupied by the kinked DNA bound asymmetrically through positively charged residues lining the hole, and whose mutagenesis impacts the DNA decatenation, DNA relaxation and DNA-dependent ATPase activities of topo IV. These results and a side-bound DNA-ParE structure help explain how the T-segment DNA is captured and transported by a type II topoisomerase, and reveal a new enzyme–DNA interface for drug discovery. 1 Randall Centre for Cell and Molecular Biophysics, 3rd Floor New Hunt’s House, Faculty of Life Sciences and Medicine, King’s College London, London SE1 1UL, UK. 2 Molecular and Clinical Sciences Research Institute, St. George’s, University of London, Cranmer Terrace, London SW17 0RE, UK.
    [Show full text]
  • Mutation Discovery in Mice by Whole Exome Sequencing
    Fairfield et al. Genome Biology 2011, 12:R86 http://genomebiology.com/2011/12/9/R86 METHOD Open Access Mutation discovery in mice by whole exome sequencing Heather Fairfield1, Griffith J Gilbert1, Mary Barter1, Rebecca R Corrigan2, Michelle Curtain1, Yueming Ding3, Mark D’Ascenzo4, Daniel J Gerhardt4, Chao He5, Wenhui Huang6, Todd Richmond4, Lucy Rowe1, Frank J Probst2, David E Bergstrom1, Stephen A Murray1, Carol Bult1, Joel Richardson1, Benjamin T Kile7, Ivo Gut8, Jorg Hager8, Snaevar Sigurdsson9, Evan Mauceli9, Federica Di Palma9, Kerstin Lindblad-Toh9, Michael L Cunningham10, Timothy C Cox10, Monica J Justice2, Mona S Spector5, Scott W Lowe5, Thomas Albert4, Leah Rae Donahue1, Jeffrey Jeddeloh4, Jay Shendure10 and Laura G Reinholdt1* Abstract We report the development and optimization of reagents for in-solution, hybridization-based capture of the mouse exome. By validating this approach in a multiple inbred strains and in novel mutant strains, we show that whole exome sequencing is a robust approach for discovery of putative mutations, irrespective of strain background. We found strong candidate mutations for the majority of mutant exomes sequenced, including new models of orofacial clefting, urogenital dysmorphology, kyphosis and autoimmune hepatitis. Background burdensome and expensive for many laboratories. Targeted Phenotype-driven approaches in model organisms, includ- sequencing approaches are less expensive and the data are ing spontaneous mutation discovery, standard N-ethyl-N- accordingly more manageable, but this technique requires nitrosourea (ENU) mutagenesis screens, sensitized screens substantial genetic mapping and the design and purchase and modifier screens, are established approaches in func- of custom capture tools (that is, arrays or probe pools) [4].
    [Show full text]
  • Qt7xg6b543.Pdf
    UC Irvine UC Irvine Previously Published Works Title The chemokine and chemokine receptor superfamilies and their molecular evolution. Permalink https://escholarship.org/uc/item/7xg6b543 Journal Genome biology, 7(12) ISSN 1474-7596 Authors Zlotnik, Albert Yoshie, Osamu Nomiyama, Hisayuki Publication Date 2006 DOI 10.1186/gb-2006-7-12-243 License https://creativecommons.org/licenses/by/4.0/ 4.0 Peer reviewed eScholarship.org Powered by the California Digital Library University of California Review The chemokine and chemokine receptor superfamilies and their molecular evolution Albert Zlotnik*, Osamu Yoshie† and Hisayuki Nomiyama‡ Addresses: *Neurocrine Biosciences, Inc., Department of Molecular Medicine, 12790 El Camino Real, San Diego, CA 92130, USA. †Department of Microbiology, Kinki University School of Medicine, Osaka-Sayama, Osaka 589-8511, Japan. ‡Department of Biochemistry, Kumamoto University Medical School, Kumamoto 860-0811, Japan. Correspondence: Albert Zlotnik. Email: [email protected] Published: 29 December 2006 Genome Biology 2006, 7:243 (doi:10.1186/gb-2006-7-12-243) The electronic version of this article is the complete one and can be found online at http://genomebiology.com/2006/7/12/243 © 2006 BioMed Central Ltd Abstract The human chemokine superfamily currently includes at least 46 ligands, which bind to 18 functionally signaling G-protein-coupled receptors and two decoy or scavenger receptors. The chemokine ligands probably comprise one of the first completely known molecular superfamilies. The genomic organization of the chemokine ligand genes and a comparison of their sequences between species shows that tandem gene duplication has taken place independently in the mouse and human lineages of some chemokine families.
    [Show full text]
  • Mutations in the Gyrb, Parc, and Pare Genes of Quinolone-Resistant Isolates and Mutants of Edwardsiella Tarda
    J. Microbiol. Biotechnol. (2010), 20(12), 1735–1743 doi: 10.4014/jmb.1009.09008 First published online 3 December 2010 Mutations in the gyrB, parC, and parE Genes of Quinolone-Resistant Isolates and Mutants of Edwardsiella tarda Kim, Myoung Sug1, Lyu Jin Jun2, Soon Bum Shin3, Myoung Ae Park1, Sung Hee Jung1, Kwangil Kim4, Kyung Ho Moon5, and Hyun Do Jeong4* 1Pathology Division, National Fisheries Research and Development Institute, Busan 619-705, Korea 2Faculty of Applied Marine Science, College of Ocean Science, Jeju National University, Jeju-do 756, Korea 3Food and Safety Research Center, National Fisheries Research and Development Institute, Busan 619-705, Korea 4Department of Aquatic Life Medicine, Pukyong National University, Busan 608-737, Korea 5College of Pharmacy, Kyungsung University, Busan 608-736, Korea Received: September 6, 2010 / Revised: October 18, 2010 / Accepted: October 19, 2010 The full-length genes gyrB (2,415 bp), parC (2,277 bp), and and secondary targets, respectively, of quinolones in E. parE (1,896 bp) in Edwardsiella tarda were cloned by PCR tarda. with degenerate primers based on the sequence of the Keywords: Edwardsiella tarda, gyrB, parC, parE, quinolone, respective quinolone resistance-determining region (QRDR), in vitro followed by elongation of 5' and 3' ends using cassette ligation-mediated PCR (CLMP). Analysis of the cloned genes revealed open reading frames (ORFs) encoding proteins of 804 (GyrB), 758 (ParC), and 631 (ParE) amino Edwardsiella tarda, a Gram-negative bacterium of the acids with conserved gyrase/topoisomerase features and Enterobacteriaceae family known to be one of the most motifs important for enzymatic function. The ORFs were important fish pathogenic agents, has been demonstrated to preceded by putative promoters, ribosome binding sites, induce hemorrhagic septicemia (edwardsiellosis), resulting and inverted repeats with the potential to form cruciform in extensive economic losses to the aquaculture industry.
    [Show full text]
  • Functional Analysis of Gene Datasets Based on Gene Ontology. David Martin, C
    GOToolBox: functional analysis of gene datasets based on Gene Ontology. David Martin, C. Brun, Elisabeth Remy, Pierre Mouren, Bernard Jacq, Denis Thieffry To cite this version: David Martin, C. Brun, Elisabeth Remy, Pierre Mouren, Bernard Jacq, et al.. GOToolBox: func- tional analysis of gene datasets based on Gene Ontology.. Genome Biology, BioMed Central, 2004, 5, pp.R101. hal-00311020 HAL Id: hal-00311020 https://hal.archives-ouvertes.fr/hal-00311020 Submitted on 18 Apr 2018 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Open Access Software2004MartinetVolume al. 5, Issue 12, Article R101 GOToolBox: functional analysis of gene datasets based on Gene comment Ontology David Martin*, Christine Brun*, Elisabeth Remy†, Pierre Mouren*, Denis Thieffry* and Bernard Jacq* Addresses: *Laboratoire de Génétique et Physiologie du Développement, IBDM, CNRS/INSERM/Université de la Méditerranée, Parc Scientifique de Luminy, case 907, 13288 Marseille, France. †Institut de Mathématiques de Luminy, Parc Scientifique de Luminy, 13288 Marseille, France. reviews Correspondence: David Martin. E-mail: [email protected] Published: 26 November 2004 Received: 13 April 2004 Revised: 31 August 2004 Genome Biology 2004, 5:R101 Accepted: 25 October 2004 The electronic version of this article is the complete one and can be found online at http://genomebiology.com/2004/5/12/R101 reports © 2004 Martin et al; licensee BioMed Central Ltd.
    [Show full text]