Visual Data Mining : Background, Techniques, and Drug Discovery

Total Page:16

File Type:pdf, Size:1020Kb

Visual Data Mining : Background, Techniques, and Drug Discovery Visual Data Mining: Background, Techniques, and Drug Discovery Applications Mihael Ankerst The Boeing Company Georges Grinstein UMass Lowell and AnVil Inc. Daniel Keim AT&T Research and University of Konstanz A color version of the tutorial notes can be found via http://www.fmi.uni-konstanz.de/~keim KDD’2002 Conference Emails and URLs Data Exploration • Definition Mihael Ankerst – [email protected] Data Exploration is the process of searching and analyzing – http://www.visualclassification.com/ankerst databases to find implicit but potentially useful information Daniel A. Keim – [email protected] • more formally – [email protected] Data Exploration is the process of finding a – http://www.fmi.uni-konstanz.de/~keim • subset D‘ of the database D and George Grinstein – [email protected] • hypotheses Hu(D‘,C) – http://genome.uml.edu that a user U considers useful in an application context C – http://www.anvilinfo.com Mihael Ankerst, The Boeing Company -- Daniel A. Keim, AT&T and Univ. of Konstanz Mihael Ankerst, The Boeing Company -- Daniel A. Keim, AT&T and Univ. of Konstanz Georges Grinstein, UMass Lowell and AnVil Inc. 2 Georges Grinstein, UMass Lowell and AnVil Inc. 5 Overview Abilities of Humans and Computers Part I: Visualization Techniques 1. Introduction 2. Visual Data Exploration Techniques abilities of Data Storage 3. Distortion and Interaction Techniques the computer Numerical Computation 4. Visual Data Mining Systems Searching Part II: Specific Visual Data Mining Techniques 1. Association Rules Planning 2. Classification Diagnosis Logic 3. Clustering Prediction 4. Text Mining 5. Tightly Integrated Visualization Perception Part III: Drug Discovery Applications Creativity 1. Biology and Chemistry General Knowledge 2. Bioinformatics and Cheminformatics 3. Examples human abilities 4. Bioinformatics Packages 5. Cheminformatics Packages Mihael Ankerst, The Boeing Company -- Daniel A. Keim, AT&T and Univ. of Konstanz Mihael Ankerst, The Boeing Company -- Daniel A. Keim, AT&T and Univ. of Konstanz Georges Grinstein, UMass Lowell and AnVil Inc. 3 Georges Grinstein, UMass Lowell and AnVil Inc. 6 Goals of Visualization Techniques Brief Historical Overview of Exploratory Data Visualization Techniques (cf. [WB 95]) • Presentation • pioneering work of Tufte [Tuf 83, Tuf 90] and Bertin [Ber 81] – starting point: facts to be presented are fixed a priori focuses on – process: choice of appropriate presentation techniques – result: high-quality visualization of the data to present facts – visualization of data with inherent 2D-/3D-semantics • Confirmatory Analysis – general rules for layout, color composition, attribute mapping, etc. – starting point: hypotheses about the data • development of visualization techniques for different types – process: goal-oriented examination of the hypotheses of data with an underlying physical model – result: visualization of data to confirm or reject the hypotheses – geographic data, CAD data, flow data, image data, voxel data, etc. • Exploratory Analysis – starting point: no hypotheses about the data • development of visualization techniques for arbitrary – process: interactive, usually undirected search for structures, trends multidimensional data (without an underlying physical model) – result: visualization of data to lead to hypotheses about the data – applicable to databases and other information resources Mihael Ankerst, The Boeing Company -- Daniel A. Keim, AT&T and Univ. of Konstanz Mihael Ankerst, The Boeing Company -- Daniel A. Keim, AT&T and Univ. of Konstanz Georges Grinstein, UMass Lowell and AnVil Inc. 4 Georges Grinstein, UMass Lowell and AnVil Inc. 7 1 Data Preprocessing Techniques Overview • Techniques for Dimension Reduction Part I: Visualization Techniques (Set of d-dim Data Items -> Set of k-dim. Data Items; k<<d) 1. Introduction 2. Visual Data Exploration Techniques • Principal Component Analysis [DE 82] 3. Distortion and Interaction Techniques Determines a minimal set of principal components (linear combinations of the 4. Visual Data Mining Systems original dimensions) which explain the main variations of the data. Part II: Specific Visual Data Mining Techniques • Factor Analysis [Har 67] 1. Association Rules Determines a set of unobservable common factors which explain the main 2. Classification variations of the data. The original dimensions are linear combinations of the 3. Clustering common factors. 4. Text Mining • Multidimensional Scaling [SRN 72] 5. Tightly Integrated Visualization Uses the similarity (or dissimilarity) matrix of the data as defining coordinate Part III: Drug Discovery Applications axes in multidimensional space. The Euclidean distance in that space is a 1. Biology and Chemistry measure of the data items. 2. Bioinformatics and Cheminformatics • Fastmap [FL 95] 3. Examples Fastmap also operates on a given similarity matrix and iteratively reduces the 4. Bioinformatics Packages number of dimensions while preserving the distances as much as possible. 5. Cheminformatics Packages Mihael Ankerst, The Boeing Company -- Daniel A. Keim, AT&T and Univ. of Konstanz Mihael Ankerst, The Boeing Company -- Daniel A. Keim, AT&T and Univ. of Konstanz Georges Grinstein, UMass Lowell and AnVil Inc. 8 Georges Grinstein, UMass Lowell and AnVil Inc. 11 Data Preprocessing Techniques Visual Data Exploration Techniques • Subsetting Techniques (Set of Data Items -> Subset of Data Items) – Sampling (determines a representative subset of a database) – Querying (determines a certain, usually a-priori fixed subset of the • Standard 2D/3D Displays database • Segmentation Techniques • Geometric Transformations (Set of Data-Items -> Set of (Set of Data Items)) – Segmentation based upon attribute values or attribute ranges • Iconic Displays • Aggregation Techniques (Set of Data-Items -> Set of Aggregate Values) • Dense Pixel Displays – Aggregation (sum, count, min, max,...) based upon - attribute values • Stacked Displays - topological properties, etc. – Visualization of Aggregations: - Histograms - Pie Charts, Bar Charts, Line Graphs, etc. Mihael Ankerst, The Boeing Company -- Daniel A. Keim, AT&T and Univ. of Konstanz Mihael Ankerst, The Boeing Company -- Daniel A. Keim, AT&T and Univ. of Konstanz Georges Grinstein, UMass Lowell and AnVil Inc. 9 Georges Grinstein, UMass Lowell and AnVil Inc. 12 Classification Standard 2D/3D Displays Data Type to be Visualized Examples from the VisualInsights WebPage from the VisualInsights Examples 1. one-dimensional 2. two-dimensional Visualization Technique 3. mul ti -di mensional Stacked Display 4. text/web Dense Pixel Display Iconic Display 5. hierarchies/graphs Geometrically-transformed Display 6. algorithm/software Standard 2D/3D Display Standard Projection FilteringZoom Di st ort ion Link&Brush Interaction and Distortion Technique Mihael Ankerst, The Boeing Company -- Daniel A. Keim, AT&T and Univ. of Konstanz Mihael Ankerst, The Boeing Company -- Daniel A. Keim, AT&T and Univ. of Konstanz Georges Grinstein, UMass Lowell and AnVil Inc. 10 Georges Grinstein, UMass Lowell and AnVil Inc. 13 2 Geometric Transformations Geometric Transformations Basic Idea: Prosection Views [FB 94, STDS 95] n o d n Visualization of geometric transformations and o L e g e l l projections of the data o C l a i r e p m I , e c n • Scatterplot-Matrices [And 72, Cle 93] e p S . R f o n o • Landscapes [Wis 95] i s s i m r e p • Projection Pursuit Techniques [Hub 85] y b d e s used used by permission of R. Spence, Imperial London College (D techniques for finding meaningful projections of multidimensional data) u • Prosection Views [FB 94, STDS 95 schematic representation example • Hyperslice [WL 93] matrix of all orthogonal projections where the result of the selected multidimensional range is colored differently • Parallel Coordinates [Ins 85, ID 90] (combination of selections and projections) Mihael Ankerst, The Boeing Company -- Daniel A. Keim, AT&T and Univ. of Konstanz Mihael Ankerst, The Boeing Company -- Daniel A. Keim, AT&T and Univ. of Konstanz Georges Grinstein, UMass Lowell and AnVil Inc. 14 Georges Grinstein, UMass Lowell and AnVil Inc. 17 Geometric Transformations Geometric Transformations Hyperslice [ 93] Scatterplot-Matrices [Cle 93] matrix of scatterplots (x-y-diagrams) of the k-dim. data [total of (k2/2-k) scatterplots] ermission of Ward,M. Polytechnic Worcester Institute used used by permission of J. J. van Wijk Used byUsed matrix of k² slices through the k-dim. Data (the slices are determined interactively) Mihael Ankerst, The Boeing Company -- Daniel A. Keim, AT&T and Univ. of Konstanz Mihael Ankerst, The Boeing Company -- Daniel A. Keim, AT&T and Univ. of Konstanz Georges Grinstein, UMass Lowell and AnVil Inc. 15 Georges Grinstein, UMass Lowell and AnVil Inc. 18 Geometric Transformations Geometric Transformations Landscapes [Wis 95] Parallel Coordinates [Ins 85, ID 90] n equidistant axes which are parallel to one of the screen axes and correspond to the attributes the axes are scaled to the [minimum, maximum] - range of the news articles corresponding attribute visualized as a landscape every data item corresponds to a polygonal line which intersects each of the axes at the point which corresponds to the value for the attribute Used by permissionUsed of Wright,B. Decisions Visible Inc. • • • • visualization of the data as perspective landscape • the data needs to be transformed into a (possibly artificial) 2D spatial representation which preserves the characteristics Attr. 1 Attr. 2Attr. 3 Attr. k of the data Mihael
Recommended publications
  • 1 Supporting Information for a Microrna Network Regulates
    Supporting Information for A microRNA Network Regulates Expression and Biosynthesis of CFTR and CFTR-ΔF508 Shyam Ramachandrana,b, Philip H. Karpc, Peng Jiangc, Lynda S. Ostedgaardc, Amy E. Walza, John T. Fishere, Shaf Keshavjeeh, Kim A. Lennoxi, Ashley M. Jacobii, Scott D. Rosei, Mark A. Behlkei, Michael J. Welshb,c,d,g, Yi Xingb,c,f, Paul B. McCray Jr.a,b,c Author Affiliations: Department of Pediatricsa, Interdisciplinary Program in Geneticsb, Departments of Internal Medicinec, Molecular Physiology and Biophysicsd, Anatomy and Cell Biologye, Biomedical Engineeringf, Howard Hughes Medical Instituteg, Carver College of Medicine, University of Iowa, Iowa City, IA-52242 Division of Thoracic Surgeryh, Toronto General Hospital, University Health Network, University of Toronto, Toronto, Canada-M5G 2C4 Integrated DNA Technologiesi, Coralville, IA-52241 To whom correspondence should be addressed: Email: [email protected] (M.J.W.); yi- [email protected] (Y.X.); Email: [email protected] (P.B.M.) This PDF file includes: Materials and Methods References Fig. S1. miR-138 regulates SIN3A in a dose-dependent and site-specific manner. Fig. S2. miR-138 regulates endogenous SIN3A protein expression. Fig. S3. miR-138 regulates endogenous CFTR protein expression in Calu-3 cells. Fig. S4. miR-138 regulates endogenous CFTR protein expression in primary human airway epithelia. Fig. S5. miR-138 regulates CFTR expression in HeLa cells. Fig. S6. miR-138 regulates CFTR expression in HEK293T cells. Fig. S7. HeLa cells exhibit CFTR channel activity. Fig. S8. miR-138 improves CFTR processing. Fig. S9. miR-138 improves CFTR-ΔF508 processing. Fig. S10. SIN3A inhibition yields partial rescue of Cl- transport in CF epithelia.
    [Show full text]
  • Supporting Information
    Supporting Information Barshis et al. 10.1073/pnas.1210224110 SI Materials and Methods and “ssu-parc.fasta” downloaded on September 14, 2011 from mRNA Extraction. Total RNA was extracted from each sample www.arb-silva.de), and potential Symbiodinium contamination using a modified TRIzol (GibcoBRL/Invitrogen) protocol. Ap- was removed based significant nucleotide similarity (BLASTN, proximately 150–200 mg of coral tissue and skeleton was placed ≥100 bp and ≥70% identity) to ESTs from Symbiodinium sp. in 1 mL of TRIzol and homogenized for 2 min by vortexing with KB8 (clade A) and Symbiodinium sp. MF1.04b (clade B) (6) ∼100 μL of 0.5-mm Zirconia/Silica Beads (BioSpec Products). and the related dinoflagellate Polarella glacialis. Finally, the re- Resulting tissue/TRIzol slurry was removed by centrifugation, sulting contigs were compared (via BLASTX) to the NCBI non- and the standard TRIzol extraction was performed according to redundant protein database (nr; downloaded on June 7, 2011 manufacturer’s specifications with the replacement of 250 μLof from www.ncbi.nlm.nih.gov). The nonredundant (nr) results were 100% (vol/vol) isopropanol with 250 μL of high-salt buffer (0.8 used to remove any additional sequences likely to be noncoral, M Na citrate, 1.2 M NaCl) during the final precipitation step. based on similarity to alveolates, fungi, bacteria, or Archaea, as Resulting RNA pellet was resuspended in 12 μL of diethylpyro- determined using the metagenome analyzer (MEGAN) Version 4 = = = carbonate (DEPC)-treated H2O. mRNA was isolated from total (min. support 1, min. score 200, top percent 20) (7). RNA using the Micro-FastTrack mRNA isolation kit (In- The remaining, putatively coral, contigs were then “meta- vitrogen) and an overnight precipitation at −80 °C.
    [Show full text]
  • Cytoplasmic Localized Ubiquitin Ligase Cullin 7 Binds to P53 and Promotes Cell Growth by Antagonizing P53 Function
    Oncogene (2006) 25, 4534–4548 & 2006 Nature Publishing Group All rights reserved 0950-9232/06 $30.00 www.nature.com/onc ORIGINAL ARTICLE Cytoplasmic localized ubiquitin ligase cullin 7 binds to p53 and promotes cell growth by antagonizing p53 function P Andrews1,2,YJHe1 and Y Xiong1,2,3 1Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, NC, USA; 2Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC, USA and 3Program in Molecular Biology and Biotechnology, University of North Carolina, Chapel Hill, NC, USA Cullins are a family of evolutionarily conserved proteins monomeric manner (monoubiquitination) to cause that bind to the small RING fingerprotein,ROC1, to conformational change to the substrate or in a constitute potentially a large number of distinct E3 polyubiquitin chain which signals the substrate to be ubiquitin ligases. CUL7 mediates an essential function degraded by the 26S proteasome. Both ubiquitin formouse embryodevelopment and has been linked with activating and conjugating enzymes are well character- cell transformation by its physical association with the ized and contain highly conserved functional domains. SV40 large T antigen. We report here that, like its closely The identity and mechanism of E3 ubiquitin ligases, on related homolog PARC, CUL7 is localized predominantly the other hand, has been elusive and long postulated as in the cytoplasm and binds directly to p53. In contrast to an activity responsible for both recognizing substrates PARC, however, CUL7, even when overexpressed, did not and for catalysing polyubiquitin chain formation sequesterp53 in the cytoplasm. We have identified a (Hershko and Ciechanover, 1998). sequence in the N-terminal region of CUL7 that is highly Currently, two major families of E3 ligases have been conserved in PARC and a sequence spanning the described.
    [Show full text]
  • Biological Data Integration Using Semantic Web Technologies
    Biological data integration using Semantic Web technologies Pasquier C Phone: +33 492 07 6947 Fax: +33 492 07 6432 Email: [email protected] Institute of Signaling, Developmental Biology & Cancer CNRS - UMR 6543, University of Nice Sophia-Antipolis Parc Valrose, 06108 NICE cedex 2, France. Summary Current research in biology heavily depends on the availability and efficient use of information. In order to build new knowledge, various sources of biological data must often be combined. Semantic Web technologies, which provide a common framework allowing data to be shared and reused between applications, can be applied to the management of disseminated biological data. However, due to some specificities of biological data, the application of these technologies to life science constitutes a real challenge. Through a use case of biological data integration, we show in this paper that current Semantic Web technologies start to become mature and can be applied for the development of large applications. However, in order to get the best from these technologies, improvements are needed both at the level of tool performance and knowledge modeling. Keywords Data integration, Semantic Web, OWL, RDF, SPARQL, Knowledge Base System (KBS) Introduction Biology is now an information-intensive science and research in genomics, transcriptomics and proteomics heavily depend on the availability and the efficient use of information. When data were structured and organized as a collection of records in dedicated, self-sufficient databases, information was retrieved by performing queries on the database using a specialized query language; for example SQL (Structured Query Language) for relational databases or OQL (Object Query Language) for object databases.
    [Show full text]
  • Uman Enome News
    uman enome news ISSN: 1050-6101 Vol. 7, No.2, July-August 1995 Optical Mapping Offers Fast, Accurate Method for Generating Restriction Maps New Approach Eliminates Electrophoresis, Is Amenable to Automation evelopment of cheaper and faster technologies for large-scale Dgenome mapping has been a major priority in the first 5 years of the Human Genome Project. Although many efforts have focused on improving standard gel electrophoresis and hybridization methods, a new approach using optical detection of single DNA mole.cules shows great promise for rapid construction of ordered genome maps based on restriction endonuclease cutting sites. l -4 Restriction endonucleases-enzymes that cut DNA molecules at specific sites in the genome-have played a major role in allowing investigators to identify and characterize various loci on a DNA molecule. Unlike maps based on STSs (a sequence-based landmark), restriction maps provide the precise genomic distances that are essential for efficient sequencing and for determining the spatial relationships of specific loci. Compared with hybridization-based fingerprinting approaches, ordered restriction maps offer relatively unambiguous clone characterization, which is useful for determining overlapping areas in contig formation, establishing minimum tiling paths for sequencing (coverage of a region), and characterizing genetic lesions with respect to various structural alterations. Image of a human chromosome 11 YAC clone (425 kb) cleaved by restriction endonucleases, Despite the broad applications of restriction maps, however, associated stained with a fluorochrome, and visualized by techniques for their generation have changed little over the last 10 years fluorescence microscopy. (White bar at lower left because of their reliance on tedious electrophoresis methods.
    [Show full text]
  • Towards a Sustainable Funding Model for the Uniprot Use Case[Version 2
    F1000Research 2018, 6(ELIXIR):2051 Last updated: 25 OCT 2018 RESEARCH ARTICLE Funding knowledgebases: Towards a sustainable funding model for the UniProt use case [version 2; referees: 3 approved] Chiara Gabella , Christine Durinx , Ron Appel ELIXIR-Switzerland, SIB Swiss Institute of Bioinformatics, Lausanne, 1015, Switzerland First published: 27 Nov 2017, 6(ELIXIR):2051 (doi: Open Peer Review v2 10.12688/f1000research.12989.1) Latest published: 22 Mar 2018, 6(ELIXIR):2051 (doi: 10.12688/f1000research.12989.2) Referee Status: Abstract Invited Referees Millions of life scientists across the world rely on bioinformatics data resources 1 2 3 for their research projects. Data resources can be very expensive, especially those with a high added value as the expert-curated knowledgebases. Despite the increasing need for such highly accurate and reliable sources of scientific version 2 information, most of them do not have secured funding over the near future and published often depend on short-term grants that are much shorter than their planning 22 Mar 2018 horizon. Additionally, they are often evaluated as research projects rather than as research infrastructure components. version 1 In this work, twelve funding models for data resources are described and published report report report applied on the case study of the Universal Protein Resource (UniProt), a key 27 Nov 2017 resource for protein sequences and functional information knowledge. We show that most of the models present inconsistencies with open access or Helen Berman, Rutgers, The State equity policies, and that while some models do not allow to cover the total 1 costs, they could potentially be used as a complementary income source.
    [Show full text]
  • Trapping of the Transport-Segment DNA by the Atpase Domains of a Type II Topoisomerase
    ARTICLE DOI: 10.1038/s41467-018-05005-x OPEN Trapping of the transport-segment DNA by the ATPase domains of a type II topoisomerase Ivan Laponogov1,2,3, Xiao-Su Pan2, Dennis A. Veselkov1, Galyna B. Skamrova1, Trishant R. Umrekar 1,4, L. Mark Fisher 2 & Mark R. Sanderson 1 Type II topoisomerases alter DNA topology to control DNA supercoiling and chromosome segregation and are targets of clinically important anti-infective and anticancer therapeutics. 1234567890():,; They act as ATP-operated clamps to trap a DNA helix and transport it through a transient break in a second DNA. Here, we present the first X-ray crystal structure solved at 2.83 Å of a closed clamp complete with trapped T-segment DNA obtained by co-crystallizing the ATPase domain of S. pneumoniae topoisomerase IV with a nonhydrolyzable ATP analogue and 14-mer duplex DNA. The ATPase dimer forms a 22 Å protein hole occupied by the kinked DNA bound asymmetrically through positively charged residues lining the hole, and whose mutagenesis impacts the DNA decatenation, DNA relaxation and DNA-dependent ATPase activities of topo IV. These results and a side-bound DNA-ParE structure help explain how the T-segment DNA is captured and transported by a type II topoisomerase, and reveal a new enzyme–DNA interface for drug discovery. 1 Randall Centre for Cell and Molecular Biophysics, 3rd Floor New Hunt’s House, Faculty of Life Sciences and Medicine, King’s College London, London SE1 1UL, UK. 2 Molecular and Clinical Sciences Research Institute, St. George’s, University of London, Cranmer Terrace, London SW17 0RE, UK.
    [Show full text]
  • Mutation Discovery in Mice by Whole Exome Sequencing
    Fairfield et al. Genome Biology 2011, 12:R86 http://genomebiology.com/2011/12/9/R86 METHOD Open Access Mutation discovery in mice by whole exome sequencing Heather Fairfield1, Griffith J Gilbert1, Mary Barter1, Rebecca R Corrigan2, Michelle Curtain1, Yueming Ding3, Mark D’Ascenzo4, Daniel J Gerhardt4, Chao He5, Wenhui Huang6, Todd Richmond4, Lucy Rowe1, Frank J Probst2, David E Bergstrom1, Stephen A Murray1, Carol Bult1, Joel Richardson1, Benjamin T Kile7, Ivo Gut8, Jorg Hager8, Snaevar Sigurdsson9, Evan Mauceli9, Federica Di Palma9, Kerstin Lindblad-Toh9, Michael L Cunningham10, Timothy C Cox10, Monica J Justice2, Mona S Spector5, Scott W Lowe5, Thomas Albert4, Leah Rae Donahue1, Jeffrey Jeddeloh4, Jay Shendure10 and Laura G Reinholdt1* Abstract We report the development and optimization of reagents for in-solution, hybridization-based capture of the mouse exome. By validating this approach in a multiple inbred strains and in novel mutant strains, we show that whole exome sequencing is a robust approach for discovery of putative mutations, irrespective of strain background. We found strong candidate mutations for the majority of mutant exomes sequenced, including new models of orofacial clefting, urogenital dysmorphology, kyphosis and autoimmune hepatitis. Background burdensome and expensive for many laboratories. Targeted Phenotype-driven approaches in model organisms, includ- sequencing approaches are less expensive and the data are ing spontaneous mutation discovery, standard N-ethyl-N- accordingly more manageable, but this technique requires nitrosourea (ENU) mutagenesis screens, sensitized screens substantial genetic mapping and the design and purchase and modifier screens, are established approaches in func- of custom capture tools (that is, arrays or probe pools) [4].
    [Show full text]
  • Qt7xg6b543.Pdf
    UC Irvine UC Irvine Previously Published Works Title The chemokine and chemokine receptor superfamilies and their molecular evolution. Permalink https://escholarship.org/uc/item/7xg6b543 Journal Genome biology, 7(12) ISSN 1474-7596 Authors Zlotnik, Albert Yoshie, Osamu Nomiyama, Hisayuki Publication Date 2006 DOI 10.1186/gb-2006-7-12-243 License https://creativecommons.org/licenses/by/4.0/ 4.0 Peer reviewed eScholarship.org Powered by the California Digital Library University of California Review The chemokine and chemokine receptor superfamilies and their molecular evolution Albert Zlotnik*, Osamu Yoshie† and Hisayuki Nomiyama‡ Addresses: *Neurocrine Biosciences, Inc., Department of Molecular Medicine, 12790 El Camino Real, San Diego, CA 92130, USA. †Department of Microbiology, Kinki University School of Medicine, Osaka-Sayama, Osaka 589-8511, Japan. ‡Department of Biochemistry, Kumamoto University Medical School, Kumamoto 860-0811, Japan. Correspondence: Albert Zlotnik. Email: [email protected] Published: 29 December 2006 Genome Biology 2006, 7:243 (doi:10.1186/gb-2006-7-12-243) The electronic version of this article is the complete one and can be found online at http://genomebiology.com/2006/7/12/243 © 2006 BioMed Central Ltd Abstract The human chemokine superfamily currently includes at least 46 ligands, which bind to 18 functionally signaling G-protein-coupled receptors and two decoy or scavenger receptors. The chemokine ligands probably comprise one of the first completely known molecular superfamilies. The genomic organization of the chemokine ligand genes and a comparison of their sequences between species shows that tandem gene duplication has taken place independently in the mouse and human lineages of some chemokine families.
    [Show full text]
  • Mutations in the Gyrb, Parc, and Pare Genes of Quinolone-Resistant Isolates and Mutants of Edwardsiella Tarda
    J. Microbiol. Biotechnol. (2010), 20(12), 1735–1743 doi: 10.4014/jmb.1009.09008 First published online 3 December 2010 Mutations in the gyrB, parC, and parE Genes of Quinolone-Resistant Isolates and Mutants of Edwardsiella tarda Kim, Myoung Sug1, Lyu Jin Jun2, Soon Bum Shin3, Myoung Ae Park1, Sung Hee Jung1, Kwangil Kim4, Kyung Ho Moon5, and Hyun Do Jeong4* 1Pathology Division, National Fisheries Research and Development Institute, Busan 619-705, Korea 2Faculty of Applied Marine Science, College of Ocean Science, Jeju National University, Jeju-do 756, Korea 3Food and Safety Research Center, National Fisheries Research and Development Institute, Busan 619-705, Korea 4Department of Aquatic Life Medicine, Pukyong National University, Busan 608-737, Korea 5College of Pharmacy, Kyungsung University, Busan 608-736, Korea Received: September 6, 2010 / Revised: October 18, 2010 / Accepted: October 19, 2010 The full-length genes gyrB (2,415 bp), parC (2,277 bp), and and secondary targets, respectively, of quinolones in E. parE (1,896 bp) in Edwardsiella tarda were cloned by PCR tarda. with degenerate primers based on the sequence of the Keywords: Edwardsiella tarda, gyrB, parC, parE, quinolone, respective quinolone resistance-determining region (QRDR), in vitro followed by elongation of 5' and 3' ends using cassette ligation-mediated PCR (CLMP). Analysis of the cloned genes revealed open reading frames (ORFs) encoding proteins of 804 (GyrB), 758 (ParC), and 631 (ParE) amino Edwardsiella tarda, a Gram-negative bacterium of the acids with conserved gyrase/topoisomerase features and Enterobacteriaceae family known to be one of the most motifs important for enzymatic function. The ORFs were important fish pathogenic agents, has been demonstrated to preceded by putative promoters, ribosome binding sites, induce hemorrhagic septicemia (edwardsiellosis), resulting and inverted repeats with the potential to form cruciform in extensive economic losses to the aquaculture industry.
    [Show full text]
  • Functional Analysis of Gene Datasets Based on Gene Ontology. David Martin, C
    GOToolBox: functional analysis of gene datasets based on Gene Ontology. David Martin, C. Brun, Elisabeth Remy, Pierre Mouren, Bernard Jacq, Denis Thieffry To cite this version: David Martin, C. Brun, Elisabeth Remy, Pierre Mouren, Bernard Jacq, et al.. GOToolBox: func- tional analysis of gene datasets based on Gene Ontology.. Genome Biology, BioMed Central, 2004, 5, pp.R101. hal-00311020 HAL Id: hal-00311020 https://hal.archives-ouvertes.fr/hal-00311020 Submitted on 18 Apr 2018 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Open Access Software2004MartinetVolume al. 5, Issue 12, Article R101 GOToolBox: functional analysis of gene datasets based on Gene comment Ontology David Martin*, Christine Brun*, Elisabeth Remy†, Pierre Mouren*, Denis Thieffry* and Bernard Jacq* Addresses: *Laboratoire de Génétique et Physiologie du Développement, IBDM, CNRS/INSERM/Université de la Méditerranée, Parc Scientifique de Luminy, case 907, 13288 Marseille, France. †Institut de Mathématiques de Luminy, Parc Scientifique de Luminy, 13288 Marseille, France. reviews Correspondence: David Martin. E-mail: [email protected] Published: 26 November 2004 Received: 13 April 2004 Revised: 31 August 2004 Genome Biology 2004, 5:R101 Accepted: 25 October 2004 The electronic version of this article is the complete one and can be found online at http://genomebiology.com/2004/5/12/R101 reports © 2004 Martin et al; licensee BioMed Central Ltd.
    [Show full text]
  • An Overview of in Vivo and in Vitro Models for Autosomal Dominant Polycystic Kidney Disease: a Journey from 3D-Cysts to Mini-Pigs
    International Journal of Molecular Sciences Review An Overview of In Vivo and In Vitro Models for Autosomal Dominant Polycystic Kidney Disease: A Journey from 3D-Cysts to Mini-Pigs Svenja Koslowski 1,2,* , Camille Latapy 2, Pierrïck Auvray 2, Marc Blondel 3,4 and Laurent Meijer 1,* 1 Perha Pharmaceuticals & ManRos Therapeutics, Centre de Perharidy, 29680 Roscoff, Bretagne, France 2 C.RIS Pharma, Parc Technopolitain, Atalante Saint Malo, 35 400 Saint-Malo, France; [email protected] (C.L.); [email protected] (P.A.) 3 Univ Brest, Inserm, EFS, UMR1078, F 29200 Brest, France; [email protected] 4 CHRU Brest, Service de Génétique clinique et de Biologie de la reproduction, F 29200 Brest, France * Correspondence: [email protected] (S.K.); [email protected] (L.M.) Received: 2 June 2020; Accepted: 18 June 2020; Published: 25 June 2020 Abstract: Autosomal dominant polycystic kidney disease (ADPKD) is the most common inheritable cause of end stage renal disease and, as of today, only a single moderately effective treatment is available for patients. Even though ADPKD research has made huge progress over the last decades, the precise disease mechanisms remain elusive. However, a wide variety of cellular and animal models have been developed to decipher the pathophysiological mechanisms and related pathways underlying the disease. As none of these models perfectly recapitulates the complexity of the human disease, the aim of this review is to give an overview of the main tools currently available to ADPKD researchers, as well as their main advantages and limitations. Keywords: autosomal dominant polycystic kidney disease; research tool; model organism; biomedical research 1.
    [Show full text]