Development and Application of an Integrative Genomics Approach to Lung Cancer

Total Page:16

File Type:pdf, Size:1020Kb

Development and Application of an Integrative Genomics Approach to Lung Cancer DEVELOPMENT AND APPLICATION OF AN INTEGRATIVE GENOMICS APPROACH TO LUNG CANCER by RAJAGOPAL CHARI B.Sc., University of British Columbia, 2001 B.Sc., University of British Columbia, 2004 A THESIS SUBMITTED IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF DOCTOR OF PHILOSOPHY in THE FACULTY OF GRADUATE STUDIES (Pathology and Laboratory Medicine) THE UNIVERSITY OF BRITISH COLUMBIA (Vancouver) June 2010 © Rajagopal Chari, 2010 Abstract Lung cancer has the highest mortality rate amongst all diagnosed malignancies with adenocarcinoma (AC) being the most commonly diagnosed subtype of this disease in North America. The dismal survival statistics of lung cancer patients are largely due to the detection of the disease at an advanced stage and to a lesser extent, the limited efficacy of current front line treatments. Genomic approaches, namely gene expression analysis, have provided tremendous insight into lung cancer. While many gene expression changes have been identified, most changes are likely reactive to changes which have a primary role in cancer development. Moreover, one feature which can discern primary from reactive changes is the presence of concordant DNA level alteration. Many well known genes involved in cancer such as TP53 and CDKN2A have been shown to be affected by multiple mechanisms of alteration such as somatic mutation in or loss of DNA sequence. For a given gene, one tumor may be affected by one mechanism while another tumor may be affected by a different mechanism. Although this level of multi-dimensional analysis has been performed for specific genes, such analysis has not been done at the genome-wide level. This thesis highlights the development and application of a multi-dimensional genetic and epigenetic approach to identify frequently aberrant genes and pathways in lung AC. I present, first, the design and implementation of the system for integrative genomic multi-dimensional analysis of cancer genomes, epigenomes and transcriptomes (SIGMA2). Next, analyzing a multi-dimensional dataset generated from ten lung AC specimens with non-malignant controls, I identified novel genes and pathways that would have been missed if a non-integrative approach were used. Finally, examining genes involved with EGFR signaling, I identified a gene, signal receptor protein alpha (SIRPA), which had not been previously shown to be associated with lung cancer. Taken together, these findings demonstrate the power of a multi-dimensional approach to identify important genes and pathways in lung cancer. Moreover, identifying key genes using a multi-dimensional approach on a small sample set suggests the need of large datasets may be circumvented by using a more comprehensive approach on a smaller set of samples. ii Table of Contents Abstract ......................................................................................................................................... ii Table of Contents ......................................................................................................................... iii List of Tables ............................................................................................................................... vii List of Figures ............................................................................................................................. viii List of Abbreviations ...................................................................................................................... x Acknowledgements ..................................................................................................................... xii Dedication ................................................................................................................................... xiii Co-Authorship Statement ........................................................................................................... xiv Chapter 1: Introduction ................................................................................................................. 1 1.1 Lung cancer ......................................................................................................................... 2 1.2 Genomic profiling of lung cancer ......................................................................................... 3 1.2.1 Gene expression analysis ............................................................................................. 3 1.2.2 DNA copy number analysis .......................................................................................... 4 1.2.3 Loss of heterozygosity (LOH) and allelic imbalance ..................................................... 5 1.3 Somatic mutations in lung cancer ....................................................................................... 5 1.4 Epigenetic alterations in lung cancer ................................................................................... 6 1.4.1 DNA methylation ........................................................................................................... 6 1.5 Current level of integrative analysis .................................................................................... 7 1.6 Need for an integrative approach to study lung cancer ....................................................... 7 1.7 Bioinformatic tools for genomic analysis ............................................................................. 8 1.8 Thesis theme ....................................................................................................................... 9 1.9 Objectives and hypothesis .................................................................................................. 9 1.10 Specific aims and outline of thesis .................................................................................. 10 1.11 Description of high throughput data in this thesis ............................................................ 13 1.12 Other relevant contributions not included as chapters in this thesis ............................... 13 1.12.1 Development of tools for genomic analysis .............................................................. 14 1.12.2 Baseline gene expression in non-malignant lung tissue ........................................... 14 1.12.3 Differential gene expression analysis in lung cancer ................................................ 15 1.12.4 Integration of gene dosage and gene expression in lung cancer ............................. 16 1.13 References ...................................................................................................................... 18 iii Chapter 2: SIGMA2: A system for the integrative genomic multi-dimensional analysis of cancer genomes, epigenomes, and transcriptomes1 .............................................................................. 24 2.1 Introduction ........................................................................................................................ 25 2.2 Implementation .................................................................................................................. 26 2.3 Results and discussion ...................................................................................................... 26 2.3.1 Look and feel of SIGMA2 ............................................................................................ 26 2.3.2 Description of application scope and functionality ...................................................... 27 2.3.3 Approach to integration between array platforms and assays .................................... 27 2.3.4 Format requirements of input data .............................................................................. 27 2.3.5 Description of user interface ....................................................................................... 28 2.3.6 Analysis of data from a single assay type ................................................................... 29 2.3.7 Analysis of data from multiple assays in a given 'omics dimension ............................ 30 2.3.8 Combinatorial analysis of multiple 'omics dimensions - gene dosage and gene expression ........................................................................................................................... 30 2.3.9 Group comparison analysis - single ‘omics dimension ............................................... 31 2.3.10 Group comparison analysis - integrating multiple 'omics dimensions ....................... 31 2.3.11 Multi-dimensional analysis of a breast cancer genome ............................................ 31 2.3.12 Exporting data and results ........................................................................................ 32 2.4 Conclusions ....................................................................................................................... 32 2.5 Availability and requirements ............................................................................................ 33 2.6 References ........................................................................................................................ 46 Chapter 3: An integrative multi-dimensional genetic and epigenetic strategy to identify aberrant genes and pathways in cancer2 .................................................................................................. 48 3.1 Background ....................................................................................................................... 49 3.2 Methods ............................................................................................................................
Recommended publications
  • Systems and Chemical Biology Approaches to Study Cell Function and Response to Toxins
    Dissertation submitted to the Combined Faculties for the Natural Sciences and for Mathematics of the Ruperto-Carola University of Heidelberg, Germany for the degree of Doctor of Natural Sciences Presented by MSc. Yingying Jiang born in Shandong, China Oral-examination: Systems and chemical biology approaches to study cell function and response to toxins Referees: Prof. Dr. Rob Russell Prof. Dr. Stefan Wölfl CONTRIBUTIONS The chapter III of this thesis was submitted for publishing under the title “Drug mechanism predominates over toxicity mechanisms in drug induced gene expression” by Yingying Jiang, Tobias C. Fuchs, Kristina Erdeljan, Bojana Lazerevic, Philip Hewitt, Gordana Apic & Robert B. Russell. For chapter III, text phrases, selected tables, figures are based on this submitted manuscript that has been originally written by myself. i ABSTRACT Toxicity is one of the main causes of failure during drug discovery, and of withdrawal once drugs reached the market. Prediction of potential toxicities in the early stage of drug development has thus become of great interest to reduce such costly failures. Since toxicity results from chemical perturbation of biological systems, we combined biological and chemical strategies to help understand and ultimately predict drug toxicities. First, we proposed a systematic strategy to predict and understand the mechanistic interpretation of drug toxicities based on chemical fragments. Fragments frequently found in chemicals with certain toxicities were defined as structural alerts for use in prediction. Some of the predictions were supported with mechanistic interpretation by integrating fragment- chemical, chemical-protein, protein-protein interactions and gene expression data. Next, we systematically deciphered the mechanisms of drug actions and toxicities by analyzing the associations of drugs’ chemical features, biological features and their gene expression profiles from the TG-GATEs database.
    [Show full text]
  • Origins and Functional Impact of Copy Number Variation in the Human Genome
    doi:10.1038/nature08516 ARTICLES Origins and functional impact of copy number variation in the human genome Donald F. Conrad1*, Dalila Pinto2*, Richard Redon1,3, Lars Feuk2,4, Omer Gokcumen5, Yujun Zhang1, Jan Aerts1, T. Daniel Andrews1, Chris Barnes1, Peter Campbell1, Tomas Fitzgerald1, Min Hu1, Chun Hwa Ihm5, Kati Kristiansson1, Daniel G. MacArthur1, Jeffrey R. MacDonald2, Ifejinelo Onyiah1, Andy Wing Chun Pang2, Sam Robson1, Kathy Stirrups1, Armand Valsesia1, Klaudia Walter1, John Wei2, Wellcome Trust Case Control Consortium{, Chris Tyler-Smith1, Nigel P. Carter1, Charles Lee5, Stephen W. Scherer2,6 & Matthew E. Hurles1 Structural variations of DNA greater than 1 kilobase in size account for most bases that vary among human genomes, but are still relatively under-ascertained. Here we use tiling oligonucleotide microarrays, comprising 42 million probes, to generate a comprehensive map of 11,700 copy number variations (CNVs) greater than 443 base pairs, of which most (8,599) have been validated independently. For 4,978 of these CNVs, we generated reference genotypes from 450 individuals of European, African or East Asian ancestry. The predominant mutational mechanisms differ among CNV size classes. Retrotransposition has duplicated and inserted some coding and non-coding DNA segments randomly around the genome. Furthermore, by correlation with known trait-associated single nucleotide polymorphisms (SNPs), we identified 30 loci with CNVs that are candidates for influencing disease susceptibility. Despite this, having assessed the completeness of our map and the patterns of linkage disequilibrium between CNVs and SNPs, we conclude that, for complex traits, the heritability void left by genome-wide association studies will not be accounted for by common CNVs.
    [Show full text]
  • Investigation of the Underlying Hub Genes and Molexular Pathogensis in Gastric Cancer by Integrated Bioinformatic Analyses
    bioRxiv preprint doi: https://doi.org/10.1101/2020.12.20.423656; this version posted December 22, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. Investigation of the underlying hub genes and molexular pathogensis in gastric cancer by integrated bioinformatic analyses Basavaraj Vastrad1, Chanabasayya Vastrad*2 1. Department of Biochemistry, Basaveshwar College of Pharmacy, Gadag, Karnataka 582103, India. 2. Biostatistics and Bioinformatics, Chanabasava Nilaya, Bharthinagar, Dharwad 580001, Karanataka, India. * Chanabasayya Vastrad [email protected] Ph: +919480073398 Chanabasava Nilaya, Bharthinagar, Dharwad 580001 , Karanataka, India bioRxiv preprint doi: https://doi.org/10.1101/2020.12.20.423656; this version posted December 22, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. Abstract The high mortality rate of gastric cancer (GC) is in part due to the absence of initial disclosure of its biomarkers. The recognition of important genes associated in GC is therefore recommended to advance clinical prognosis, diagnosis and and treatment outcomes. The current investigation used the microarray dataset GSE113255 RNA seq data from the Gene Expression Omnibus database to diagnose differentially expressed genes (DEGs). Pathway and gene ontology enrichment analyses were performed, and a proteinprotein interaction network, modules, target genes - miRNA regulatory network and target genes - TF regulatory network were constructed and analyzed. Finally, validation of hub genes was performed. The 1008 DEGs identified consisted of 505 up regulated genes and 503 down regulated genes.
    [Show full text]
  • Transcriptomic and Proteomic Profiling Provides Insight Into
    BASIC RESEARCH www.jasn.org Transcriptomic and Proteomic Profiling Provides Insight into Mesangial Cell Function in IgA Nephropathy † † ‡ Peidi Liu,* Emelie Lassén,* Viji Nair, Celine C. Berthier, Miyuki Suguro, Carina Sihlbom,§ † | † Matthias Kretzler, Christer Betsholtz, ¶ Börje Haraldsson,* Wenjun Ju, Kerstin Ebefors,* and Jenny Nyström* *Department of Physiology, Institute of Neuroscience and Physiology, §Proteomics Core Facility at University of Gothenburg, University of Gothenburg, Gothenburg, Sweden; †Division of Nephrology, Department of Internal Medicine and Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan; ‡Division of Molecular Medicine, Aichi Cancer Center Research Institute, Nagoya, Japan; |Department of Immunology, Genetics and Pathology, Uppsala University, Uppsala, Sweden; and ¶Integrated Cardio Metabolic Centre, Karolinska Institutet Novum, Huddinge, Sweden ABSTRACT IgA nephropathy (IgAN), the most common GN worldwide, is characterized by circulating galactose-deficient IgA (gd-IgA) that forms immune complexes. The immune complexes are deposited in the glomerular mesangium, leading to inflammation and loss of renal function, but the complete pathophysiology of the disease is not understood. Using an integrated global transcriptomic and proteomic profiling approach, we investigated the role of the mesangium in the onset and progression of IgAN. Global gene expression was investigated by microarray analysis of the glomerular compartment of renal biopsy specimens from patients with IgAN (n=19) and controls (n=22). Using curated glomerular cell type–specific genes from the published literature, we found differential expression of a much higher percentage of mesangial cell–positive standard genes than podocyte-positive standard genes in IgAN. Principal coordinate analysis of expression data revealed clear separation of patient and control samples on the basis of mesangial but not podocyte cell–positive standard genes.
    [Show full text]
  • A Chromosome Level Genome of Astyanax Mexicanus Surface Fish for Comparing Population
    bioRxiv preprint doi: https://doi.org/10.1101/2020.07.06.189654; this version posted July 6, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. 1 Title 2 A chromosome level genome of Astyanax mexicanus surface fish for comparing population- 3 specific genetic differences contributing to trait evolution. 4 5 Authors 6 Wesley C. Warren1, Tyler E. Boggs2, Richard Borowsky3, Brian M. Carlson4, Estephany 7 Ferrufino5, Joshua B. Gross2, LaDeana Hillier6, Zhilian Hu7, Alex C. Keene8, Alexander Kenzior9, 8 Johanna E. Kowalko5, Chad Tomlinson10, Milinn Kremitzki10, Madeleine E. Lemieux11, Tina 9 Graves-Lindsay10, Suzanne E. McGaugh12, Jeff T. Miller12, Mathilda Mommersteeg7, Rachel L. 10 Moran12, Robert Peuß9, Edward Rice1, Misty R. Riddle13, Itzel Sifuentes-Romero5, Bethany A. 11 Stanhope5,8, Clifford J. Tabin13, Sunishka Thakur5, Yamamoto Yoshiyuki14, Nicolas Rohner9,15 12 13 Authors for correspondence: Wesley C. Warren ([email protected]), Nicolas Rohner 14 ([email protected]) 15 16 Affiliation 17 1Department of Animal Sciences, Department of Surgery, Institute for Data Science and 18 Informatics, University of Missouri, Bond Life Sciences Center, Columbia, MO 19 2 Department of Biological Sciences, University of Cincinnati, Cincinnati, OH 20 3 Department of Biology, New York University, New York, NY 21 4 Department of Biology, The College of Wooster, Wooster, OH 22 5 Harriet L. Wilkes Honors College, Florida Atlantic University, Jupiter FL 23 6 Department of Genome Sciences, University of Washington, Seattle, WA 1 bioRxiv preprint doi: https://doi.org/10.1101/2020.07.06.189654; this version posted July 6, 2020.
    [Show full text]
  • A Yeast Bifc-Seq Method for Genome-Wide Interactome Mapping
    bioRxiv preprint doi: https://doi.org/10.1101/2020.06.16.154146; this version posted June 17, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. 1 A yeast BiFC-seq method for genome-wide interactome mapping 2 3 Limin Shang1,a, Yuehui Zhang1,b, Yuchen Liu1,c, Chaozhi Jin1,d, Yanzhi Yuan1,e, 4 Chunyan Tian1,f, Ming Ni2,g, Xiaochen Bo2,h, Li Zhang3,i, Dong Li1,j, Fuchu He1,*,k & 5 Jian Wang1,*,l 6 1State Key Laboratory of Proteomics, Beijing Proteome Research Center, National 7 Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, 8 China; 2Department of Biotechnology, Beijing Institute of Radiation Medicine, 9 Beijing 100850, China; 3Department of Rehabilitation Medicine, Nan Lou; 10 Department of Key Laboratory of Wound Repair and Regeneration of PLA, College 11 of Life Sciences, Chinese PLA General Hospital, Beijing 100853, China; 12 Correspondence should be addressed to F.H. ([email protected]) and J.W. 13 ([email protected]). 14 * Corresponding authors. 15 E-mail:[email protected] (Fuchu He),[email protected] (Jian Wang) 16 a ORCID: 0000-0002-6371-1956. 17 b ORCID: 0000-0001-5257-1671 18 c ORCID: 0000-0003-4691-4951 19 d ORCID: 0000-0002-1477-0255 20 e ORCID: 0000-0002-6576-8112 21 f ORCID: 0000-0003-1589-293X 22 g ORCID: 0000-0001-9465-2787 23 h ORCID: 0000-0003-3490-5812 24 i ORCID: 0000-0002-3477-8860 25 j ORCID: 0000-0002-8680-0468 26 k ORCID: 0000-0002-8571-2351 27 l ORCID: 0000-0002-8116-7691 28 Total word counts:4398 (from “Introduction” to “Conclusions”) 29 Total Keywords: 5 bioRxiv preprint doi: https://doi.org/10.1101/2020.06.16.154146; this version posted June 17, 2020.
    [Show full text]
  • Molecular Dysexpression in Gastric Cancer Revealed by Integrated Analysis of Transcriptome Data
    ONCOLOGY LETTERS 13: 3177-3185, 2017 Molecular dysexpression in gastric cancer revealed by integrated analysis of transcriptome data XIAOMEI LI1,2*, WEIWEI DONG1,2*, XUELING QU1,3, HUIXIA ZHAO1, SHUO WANG1, YIXIN HAO1, QIUWEN LI1, JIANHUA ZHU1, MIN YE1 and WENHUA XIAO1 1Department of Oncology, The First Affiliated Hospital of PLA General Hospital, Beijing 100048;2 Department of Oncology, PLA General Hospital, Beijing 100853; 3Graduate School Department of Oncology, Medical College of Liaoning, Jinzhou, Liaoning 121001, P.R. China Received September 2, 2015; Accepted November 30, 2016 DOI: 10.3892/ol.2017.5798 Abstract. Gastric cancer (GC) is often diagnosed in the DEGs, which represents a robust pool of candidates for further advanced stages and is associated with a poor prognosis. evaluation of GC pathogenesis and treatment. Obtaining an in depth understanding of the molecular mecha- nisms of GC has lagged behind compared with other cancers. Introduction This study aimed to identify candidate biomarkers for GC. An integrated analysis of microarray datasets was performed to Gastric cancer (GC) is one of the most prevalent cancers and identify differentially expressed genes (DEGs) between GC the second most common cause of cancer-associated mortality and normal tissues. Gene ontology and Kyoto Encyclopedia of worldwide (1). Almost one million people are diagnosed with Genes and Genomes (KEGG) enrichment analyses were then GC each year. Despite decades of a steady decline in the inci- performed to identify the functions of the DEGs. Furthermore, dence of GC, the GC fatality rate remains paradoxically high a protein-protein interaction (PPI) network of the DEGs was in most countries, particularly in those of East Asia (2).
    [Show full text]
  • Supplementary Tables S1-S3
    Supplementary Table S1: Real time RT-PCR primers COX-2 Forward 5’- CCACTTCAAGGGAGTCTGGA -3’ Reverse 5’- AAGGGCCCTGGTGTAGTAGG -3’ Wnt5a Forward 5’- TGAATAACCCTGTTCAGATGTCA -3’ Reverse 5’- TGTACTGCATGTGGTCCTGA -3’ Spp1 Forward 5'- GACCCATCTCAGAAGCAGAA -3' Reverse 5'- TTCGTCAGATTCATCCGAGT -3' CUGBP2 Forward 5’- ATGCAACAGCTCAACACTGC -3’ Reverse 5’- CAGCGTTGCCAGATTCTGTA -3’ Supplementary Table S2: Genes synergistically regulated by oncogenic Ras and TGF-β AU-rich probe_id Gene Name Gene Symbol element Fold change RasV12 + TGF-β RasV12 TGF-β 1368519_at serine (or cysteine) peptidase inhibitor, clade E, member 1 Serpine1 ARE 42.22 5.53 75.28 1373000_at sushi-repeat-containing protein, X-linked 2 (predicted) Srpx2 19.24 25.59 73.63 1383486_at Transcribed locus --- ARE 5.93 27.94 52.85 1367581_a_at secreted phosphoprotein 1 Spp1 2.46 19.28 49.76 1368359_a_at VGF nerve growth factor inducible Vgf 3.11 4.61 48.10 1392618_at Transcribed locus --- ARE 3.48 24.30 45.76 1398302_at prolactin-like protein F Prlpf ARE 1.39 3.29 45.23 1392264_s_at serine (or cysteine) peptidase inhibitor, clade E, member 1 Serpine1 ARE 24.92 3.67 40.09 1391022_at laminin, beta 3 Lamb3 2.13 3.31 38.15 1384605_at Transcribed locus --- 2.94 14.57 37.91 1367973_at chemokine (C-C motif) ligand 2 Ccl2 ARE 5.47 17.28 37.90 1369249_at progressive ankylosis homolog (mouse) Ank ARE 3.12 8.33 33.58 1398479_at ryanodine receptor 3 Ryr3 ARE 1.42 9.28 29.65 1371194_at tumor necrosis factor alpha induced protein 6 Tnfaip6 ARE 2.95 7.90 29.24 1386344_at Progressive ankylosis homolog (mouse)
    [Show full text]
  • The Consensus Coding Sequences of Human Breast and Colorectal Cancers Tobias Sjöblom,1* Siân Jones,1* Laura D
    The Consensus Coding Sequences of Human Breast and Colorectal Cancers Tobias Sjöblom,1* Siân Jones,1* Laura D. Wood,1* D. Williams Parsons,1* Jimmy Lin,1 Thomas Barber,1 Diana Mandelker,1 Rebecca J. Leary,1 Janine Ptak,1 Natalie Silliman,1 Steve Szabo,1 Phillip Buckhaults,2 Christopher Farrell,2 Paul Meeh,2 Sanford D. Markowitz,3 Joseph Willis,4 Dawn Dawson,4 James K. V. Willson,5 Adi F. Gazdar,6 James Hartigan,7 Leo Wu,8 Changsheng Liu,8 Giovanni Parmigiani,9 Ben Ho Park,10 Kurtis E. Bachman,11 Nickolas Papadopoulos,1 Bert Vogelstein,1† Kenneth W. Kinzler,1† Victor E. Velculescu1† 1Ludwig Center and Howard Hughes Medical Institute, Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins, Baltimore, MD 21231, USA. 2Department of Pathology and Microbiology, Center for Colon Cancer Research, and South Carolina Cancer Center, Division of Basic Research, University of South Carolina School of Medicine, Columbia, SC 29229, USA. 3Department of Medicine, Ireland Cancer Center, and Howard Hughes Medical Institute, Case Western Reserve University and University Hospitals of Cleveland, Cleveland, OH 44106, USA. 4Department of Pathology and Ireland Cancer Center, Case Western Reserve University and University Hospitals of Cleveland, Cleveland, OH 44106, USA. 5Harold C. Simmons Comprehensive Cancer Center, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA. 6Hamon Center for Therapeutic Oncology Research and Department of Pathology, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA. 7Agencourt Bioscience Corporation, Beverly, MA 01915, USA. 8SoftGenetics LLC, State College, PA 16803, USA. 9Departments of Oncology, Biostatistics, and Pathology, Johns Hopkins Medical Institutions, Baltimore, MD 21205, USA.
    [Show full text]
  • Identification of Gene-Oriented Exon Orthology Between Human and Mouse
    Fu and Lin BMC Genomics 2012, 13(Suppl 1):S10 http://www.biomedcentral.com/1471-2164/13/S1/S10 PROCEEDINGS Open Access Identification of gene-oriented exon orthology between human and mouse Gloria C-L Fu1,2,3, Wen-chang Lin3* From The Tenth Asia Pacific Bioinformatics Conference (APBC 2012) Melbourne, Australia. 17-19 January 2012 Abstract Background: Gene orthology has been well studied in the evolutionary area and is thought to be an important implication to functional genome annotations. As the accumulation of transcriptomic data, alternative splicing is taken into account in the assignments of gene orthologs and the orthology is suggested to be further considered at transcript level. Whether gene or transcript orthology, exons are the basic units that represent the whole gene structure; however, there is no any reported study on how to build exon level orthology in a whole genome scale. Therefore, it is essential to establish a gene-oriented exon orthology dataset. Results: Using a customized pipeline, we first build exon orthologous relationships from assigned gene orthologs pairs in two well-annotated genomes: human and mouse. More than 92% of non-overlapping exons have at least one ortholog between human and mouse and only a small portion of them own more than one ortholog. The exons located in the coding region are more conserved in terms of finding their ortholog counterparts. Within the untranslated region, the 5’ UTR seems to have more diversity than the 3’ UTR according to exon orthology designations. Interestingly, most exons located in the coding region are also conserved in length but this conservation phenomenon dramatically drops down in untranslated regions.
    [Show full text]
  • Novel Genes Associated with Colorectal Cancer Are Revealed by High Resolution Cytogenetic Analysis in a Patient Specific Manner
    Novel Genes Associated with Colorectal Cancer Are Revealed by High Resolution Cytogenetic Analysis in a Patient Specific Manner Hisham Eldai1., Sathish Periyasamy1., Saeed Al Qarni2, Maha Al Rodayyan2, Sabeena Muhammed Mustafa1, Ahmad Deeb3, Ebthehal Al Sheikh2, Mohammed Afzal Khan4, Mishal Johani5, Zeyad Yousef6, Mohammad Azhar Aziz2*. 1 Bioinformatics, King Abdullah International Medical Research Center, Riyadh, Saudi Arabia, 2 Medical Biotechnology, King Abdullah International Medical Research Center, Riyadh, Saudi Arabia, 3 Research Office, King Abdullah International Medical Research Center, Riyadh, Saudi Arabia, 4 Anatomic pathology, National Guard Health Affairs, Riyadh, Saudi Arabia, 5 Endoscopy, National Guard Health Affairs, Riyadh, Saudi Arabia, 6 Surgery, National Guard Health Affairs, Riyadh, Saudi Arabia Abstract Genomic abnormalities leading to colorectal cancer (CRC) include somatic events causing copy number aberrations (CNAs) as well as copy neutral manifestations such as loss of heterozygosity (LOH) and uniparental disomy (UPD). We studied the causal effect of these events by analyzing high resolution cytogenetic microarray data of 15 tumor-normal paired samples. We detected 144 genes affected by CNAs. A subset of 91 genes are known to be CRC related yet high GISTIC scores indicate 24 genes on chromosomes 7, 8, 18 and 20 to be strongly relevant. Combining GISTIC ranking with functional analyses and degree of loss/gain we identify three genes in regions of significant loss (ATP8B1, NARS, and ATP5A1) and eight in regions of gain (CTCFL, SPO11, ZNF217, PLEKHA8, HOXA3, GPNMB, IGF2BP3 and PCAT1) as novel in their association with CRC. Pathway and target prediction analysis of CNA affected genes and microRNAs, respectively indicates TGF-b signaling pathway to be involved in causing CRC.
    [Show full text]
  • Download Special Issue
    Complexity Biomolecular Networks for Complex Diseases Guest Editors: Fang X. Wu, Jianxin Wang, Min Li, and Haiying Wang Biomolecular Networks for Complex Diseases Complexity Biomolecular Networks for Complex Diseases Guest Editors: Kazuo Toda, Jorge L. Zeredo, Sae Uchida, and Vitaly Napadow Copyright © 2018 Hindawi. All rights reserved. This is a special issue published in “Complexity.” All articles are open access articles distributed under the Creative Commons Attribu- tion License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Editorial Board José Ángel Acosta, Spain Mattia Frasca, Italy Daniela Paolotti, Italy Rodrigo Aldecoa, USA Lucia Valentina Gambuzza, Italy Luis M. Rocha, USA Juan A. Almendral, Spain Carlos Gershenson, Mexico Miguel Romance, Spain David Arroyo, Spain Peter Giesl, UK Matilde Santos, Spain Arturo Buscarino, Italy Sergio Gómez, Spain Hiroki Sayama, USA Guido Caldarelli, Italy Sigurdur F. Hafstein, Iceland Michele Scarpiniti, Italy Danilo Comminiello, Italy Giacomo Innocenti, Italy Enzo Pasquale Scilingo, Italy Manlio De Domenico, Italy Jeffrey H. Johnson, UK Samuel Stanton, USA Pietro De Lellis, Italy Vittorio Loreto, Italy Roberto Tonelli, Italy Albert Diaz-Guilera, Spain Didier Maquin, France Shahadat Uddin, Australia Jordi Duch, Spain Eulalia Martínez, Spain Gaetano Valenza, Italy Joshua Epstein, USA Ch. P. Monterola, Philippines Dimitri Volchenkov, USA Thierry Floquet, France Roberto Natella, Italy Christos Volos, Greece Contents Biomolecular
    [Show full text]