Frontiers in Integrative Genomics and Translational Bioinformatics

BioMed Research International Frontiers in Integrative Genomics and Translational Bioinformatics Guest Editors: Zhongming Zhao, Victor X. Jin, Yufei Huang, Chittibabu Guda, and Jianhua Ruan Frontiers in Integrative Genomics and Translational Bioinformatics BioMed Research International Frontiers in Integrative Genomics and Translational Bioinformatics Guest Editors: Zhongming Zhao, Victor X. Jin, Yufei Huang, Chittibabu Guda, and Jianhua Ruan Copyright © òýÔ Hindawi Publishing Corporation. All rights reserved. is is a special issue published in “BioMed Research International.” All articles are open access articles distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Contents Frontiers in Integrative Genomics and Translational Bioinformatics, Zhongming Zhao, Victor X. Jin, Yufei Huang, Chittibabu Guda, and Jianhua Ruan Volume òýÔ , Article ID Þò ¥ÀÔ, ç pages Building Integrated Ontological Knowledge Structures with Ecient Approximation Algorithms, Yang Xiang and Sarath Chandra Janga Volume òýÔ , Article ID ýÔ ò, Ô¥ pages Predicting Drug-Target Interactions via Within-Score and Between-Score, Jian-Yu Shi, Zun Liu, Hui Yu, and Yong-Jun Li Volume òýÔ , Article ID ç ýÀç, À pages RNAseq by Total RNA Library Identies Additional RNAs Compared to Poly(A) RNA Library, Yan Guo, Shilin Zhao, Quanhu Sheng, Mingsheng Guo, Brian Lehmann, Jennifer Pietenpol, David C. Samuels, and Yu Shyr Volume òýÔ , Article ID âòÔçý, À pages Construction of Pancreatic Cancer Classier Based on SVM Optimized by Improved FOA, Huiyan Jiang, Di Zhao, Ruiping Zheng, and Xiaoqi Ma Volume òýÔ , Article ID ÞÔýòç, Ôò pages OperomeDB: A Database of Condition-Specic Transcription Units in Prokaryotic Genomes, Kashish Chetal and Sarath Chandra Janga Volume òýÔ , Article ID çÔòÔÞ, Ôý pages How to Choose In Vitro Systems to Predict In Vivo Drug Clearance: A System Pharmacology Perspective, Lei Wang, ChienWei Chiang, Hong Liang, Hengyi Wu, Weixing Feng, Sara K. Quinney, Jin Li, and Lang Li Volume òýÔ , Article ID ÞçòÞ, À pages A Comparison of Variant Calling Pipelines Using Genome in a Bottle as a Reference, Adam Cornish and Chittibabu Guda Volume òýÔ , Article ID ¥ â¥ÞÀ, ÔÔ pages Assessing Computational Steps for CLIP-Seq Data Analysis, Qi Liu, Xue Zhong, Blair B. Madison, Anil K. Rustgi, and Yu Shyr Volume òýÔ , Article ID ÔÀâýò, Ôý pages Classication of Cancer Primary Sites Using Machine Learning and Somatic Mutations, Yukun Chen, Jingchun Sun, Liang-Chin Huang, Hua Xu, and Zhongming Zhao Volume òýÔ , Article ID ¥ÀÔ ýò, À pages Coexpression Network Analysis of miRNA-Ô¥ò Overexpression in Neuronal Cells, Ishwor apa, Howard S. Fox, and Dhundy Bastola Volume òýÔ , Article ID ÀòÔ ÔÞ, À pages Hindawi Publishing Corporation BioMed Research International Volume 2015, Article ID 725491, 3 pages http://dx.doi.org/10.1155/2015/725491 Editorial Frontiers in Integrative Genomics and Translational Bioinformatics Zhongming Zhao,1,2,3 Victor X. Jin,4 Yufei Huang,5 Chittibabu Guda,6 and Jianhua Ruan7 1 Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, TN 37203, USA 2Department of Psychiatry, Vanderbilt University School of Medicine, Nashville, TN 37212, USA 3Department of Cancer Biology, Vanderbilt University School of Medicine, Nashville, TN 37232, USA 4Department of Molecular Medicine, The University of Texas Health Science Center at San Antonio, San Antonio, TX 78229, USA 5Department of Electrical and Computer Engineering, The University of Texas at San Antonio, San Antonio, TX 78249, USA 6Department Genetics, Cell Biology and Anatomy, University of Nebraska Medical Center, Omaha, NE 68198, USA 7Department of Computer Science, The University of Texas at San Antonio, San Antonio, TX 78249, USA Correspondence should be addressed to Zhongming Zhao; [email protected] and Jianhua Ruan; [email protected] Received 21 September 2015; Accepted 21 September 2015 Copyright © 2015 Zhongming Zhao et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. As we have officially entered the big data era, we are embrac- incomplete and inaccurate knowledgebase (e.g., reference ing numerous opportunities but also meeting strong demand networks and pathways), false discovery, lack of novel algo- on innovative approaches to managing and analyzing the rithms for data integration, computational efficiency, data data in the digital world. The speed of data generation is interpretation, and visualization. astonishing—from now until 2020, the volume of the digital Translational bioinformatics is an emerging field that data is expected to approximately double every two years, and focuses on applying informatics methodology to the increas- 90% of the data available to us today were generated in just ing amount of biomedical and genomic data in order to the last two years. Specifically in biological and biomedical generate knowledge for clinical applications. With the large research fields, we have witnessed the rapid advances in genomic data linked to phenotype and medical records, we biotechnologies, especially the next-generation sequencing now can not only discover interesting biological features and single cell technologies, enabling the investigators to and regulations using genomic approaches, but also translate create massive amounts of data for genomics and transla- some of the findings for clinical practice. For example, inves- tional research. While analysis of the data from single domain tigators have been interested in finding actionable mutations like mutations or gene expression is often the first choice that can be used for development of precision medicine in a project, integrative genomic approaches have more strategies from thousands of mutations or even more in advantages such as robustness in detecting the biological an individual genome. In addition to the challenges above, signals or biomarkers and low false discovery rates due to there are other topics that require immediate attention such the evidence from multiple domains. Therefore, we have as ownership and privacy of the findings, data sharing, steadily seen more integrative genomic studies during the efficient clinical decision support system, and design and past several years. Although these approaches are promising development of specific gene panel for fast patient screening, andpowerful,therearemanychallengesthatarerequiredto among others. be addressed by bioinformaticians, computational biologists, Therefore,welaunchedthisspecialissuetoaddress and other scientists. These challenges include, but are not the demand for integrative genomics and translational bio- limited to, data quality and process from different technology informatics.Weareinterestedinbothnewalgorithms/tools platforms, sample size and consistency, data missingness, and applications. The special issue welcomes the genomics, 2 BioMed Research International bioinformatics, and computational work in broad areas attempted to classify cancer primary sites using large-scale such as various omics technologies, multidimensional data somatic mutations observed in cancer genomes and machine integration, systems biology approaches, precision medicine learning method. Specifically, they examined the patterns of studies, single cell research, pharmacogenomics, machine 1,760,846 somatic mutations identified from 230,255 cancer learning, high performance computing, and visualization. patients covering 17 tumor sites using support vector machine Special call for papers went through The International (SVM). Through a multiclass classification experiment and Conference on Intelligent Biology and Medicine (ICIBM using gene symbol, somatic mutation, chromosome, and gene 2014, held on December 4–6, 2014, http://compgenomics.utsa functional pathway as predictors, the authors reported the .edu/icibm2014/) and BioMed Research International journal performance of the baseline using only gene features to be website. After rigorous peer review, articles were selected for 0.57 in accuracy, but it was improved to 0.62 when adding this special issue. We briefly describe the research projects the information of mutation and chromosome. Moreover, F- presented in these articles as follows. measure values could reach 0.70 in five primary sites with Three papers present the work to advance the next- the large intestine being 0.87. The study suggested that the generation sequencing technologies. In “A Comparison of somatic mutation information is useful for prediction of pri- Variant Calling Pipelines Using Genome in a Bottle as a mary tumor sites. In another machine learning paper, entitled Reference,” A. Cornish and C. Guda performed a systematic “Construction of Pancreatic Cancer Classifier Based on SVM evaluation of variant callers to determine which pipeline has Optimized by Improved FOA,” H. Jiang et al. introduced an the best performance in variant calling. They compared six improved quantum fruit fly optimal algorithm (FOA) based different aligners and five different variant callers—a totalof method. Specifically, the improved FOA was used to optimize 30 combinations—using the data generated by NIST Genome the parameters of SVM and a classifier was constructed in a Bottle Consortium. For single nucleotide variant call, basedontheoptimizedSVM.Theauthorsappliedtheir

Frontiers in Integrative Genomics and Translational Bioinformatics

Establishing the Pathogenicity of Novel Mitochondrial DNA Sequence Variations: a Cell and Molecular Biology Approach

The Molecular Karyotype of 25 Clinical-Grade Human Embryonic Stem Cell Lines Received: 07 August 2015 1 1 2 3,4 Accepted: 27 October 2015 Maurice A

Dual Proteome-Scale Networks Reveal Cell-Specific Remodeling of the Human Interactome

A Computational Approach for Defining a Signature of Β-Cell Golgi Stress in Diabetes Mellitus

Computational and Experimental Approaches for Evaluating the Genetic Basis of Mitochondrial Disorders

Aneuploidy: Using Genetic Instability to Preserve a Haploid Genome?

Whole Exome Sequencing in Families at High Risk for Hodgkin Lymphoma: Identification of a Predisposing Mutation in the KDR Gene

Looking for Missing Proteins in the Proteome Of

Detection of Convergent Genome-Wide Signals Of

Atlas of the Open Reading Frames in Human Diseases: Dark Matter of the Human Genome

Supplementary Information – Postema Et Al., the Genetics of Situs Inversus Totalis Without Primary Ciliary Dyskinesia

Supplemental Table 1