Disease Biomarker Identification from Gene Network Modules For

Total Page:16

File Type:pdf, Size:1020Kb

Disease Biomarker Identification from Gene Network Modules For www.nature.com/scientificreports OPEN Disease biomarker identifcation from gene network modules for metastasized breast cancer Received: 25 January 2017 Pooja Sharma1, Dhruba K. Bhattacharyya1 & Jugal Kalita2 Accepted: 21 March 2017 Published: xx xx xxxx Advancement in science has tended to improve treatment of fatal diseases such as cancer. A major concern in the area is the spread of cancerous cells, technically refered to as metastasis into other organs beyond the primary organ. Treatment in such a stage of cancer is extremely difcult and usually palliative only. In this study, we focus on fnding gene-gene network modules which are functionally similar in nature in the case of breast cancer. These modules extracted during the disease progression stages are analyzed using p-value and their associated pathways. We also explore interesting patterns associated with the causal genes, viz., SCGB1D2, MET, CYP1B1 and MMP9 in terms of expression similarity and pathway contexts. We analyze the genes involved in both the stages– non metastasis and metastatsis and change in their expression values, their associated pathways and roles as the disease progresses from one stage to another. We discover three additional pathways viz., Glycerophospholipid metablism, h-Efp pathway and CARM1 and Regulation of Estrogen Receptor, which can be related to the metastasis phase of breast cancer. These new pathways can be further explored to identify their relevance during the progression of the disease. A normal cell follows a well-known path of growth, divison and death although there are exceptions. Some cells do not die. Tey keep on dividing again and again, leading to abnormal growth. Such condition of cells is called cancerous, and it corresponds to a class of diseases known as cancer. Unrestricted growth of cells leads to the formation of lumps known as tumors, which can interfere in activities of bodily systems such as the nevous sys- tem or the circulatory system. Tese lumps of cells can be benign or malignant. Benign tumors do not disturb the normal functioning of the body. Tey are still usually controlled by genes controlling apoptosis. On the other hand, malignant tumor cells invade organs, thereby disrupting their normal functioning. Tey can also move throughout the body using the blood or the lymphatic system as the channel and invade other parts. Such spread of the disease is referred to as the metastatic stage. When cancerous cells metastasize, it becomes very difcult to treat them. Te current characterization of this crucial stage of the disease is incomplete and as a result, proper diagnosis is ofen not feasible. Certain studies such as1–3 have analyzed diferentially expressed genes for meta- static stages of lymphoma, lung cancer and leukaemia. Veer et al.4 and Wang et al.5 have identifed 70 genes that may be associated with the metastatic stage of breast cancer. Chuang et al.6 use a protein-protein interaction network to determine the markers in the metastasis stage. All these studies focus on extracting individual disease markers responsible for disease progression. However, studies have shown that these genes may be associated with a variety of functions in the body, and interplay among the genes may lead to disorders that fnally lead to cancer. Terefore, it would be benefcial to study the disease progression from the pathway point of view. Tis is because a biological pathway involves a number of molecules simultaneously, and these molecules have to work in coordination to perform normal cellular activities. A slight change in any of the molecules may lead others in the chain to behave abnormally, resulting in disease diagnosis. Treating cancer for the non-metastatic stage is possible by drugs or by chemotherapy, but if the disease has spread to other organs, it becomes very difcult to diagnose on time and accordingly provide treatment. In this paper, we use the biological similarity between genes to identify functionally related modules. Gene expression data only consider the preparatory information available during the experiments. Analyzing the modules using only a part of the laboratory information would not be justifable as genes are known to be functionally correlated in nature. Terefore, we use both topological and functional properties of genes to fnd biologically signifcant 1epur niersit omputer cience an nineerin Dept epur ssam 78408 Inia. Department of omputer cience niersit of oorao oorao prins nite tates. orresponence an reuests for materias sou e aresse to D... emai: teu.ernet.in) Scientific RepoRts | 7: 1072 | DOI:10.1038/s41598-017-00996-x 1 www.nature.com/scientificreports/ SST = 0.3 SST = 0.5 SST = 0.7 SST = 0.3 SST = 0.5 SST = 0.7 SST = 0.3 SST = 0.5 SST = 0.7 Modules CCT = 0.3 CCT = 0.3 CCT = 0.3 CCT = 0.5 CCT = 0.5 CCT = 0.5 CCT = 0.7 CCT = 0.7 CCT = 0.7 M1 5.41E-6 2.25E-5 5.41E-6 4.14E-6 5.41E-6 5.10E-7 5.41E-6 2.25E-5 5.41E-6 M2 3.41E-5 3.41E-5 4.14E-6 5.41E-6 3.41E-5 7.14E-7 3.41E-5 1.56E-5 3.41E-5 M3 6.25E-5 1.68E-4 3.41E-5 3.41E-5 2.25E-5 8.41E-6 1.68E-4 1.68E-4 2.25E-5 Table 1. p–value of top 3 modules obtained using diferent thresholds in metastasis stage. Figure 1. Chemokine signalling pathway (source-http://www.kegg.jp/kegg/kegg1.html). modules. Finding modules with high coherence is very essential in our case, as the main target of this work is to analyze the diferent characteristics of genes including the causal genes of breast cancer from diferent per- spective. Such an analysis incorporating evidence from published literature can be used by biologists and other researchers to carry forward their work on these genes and their associated features. Tis work provides a comprehensive study of biomarkers associated with breast cancer. In this paper, we make the following contributions. t We propose an efective gene-gene network module extraction technique from micro-array gene expression data. We use both the toplogical and functional characteristics of the gene networks to fnd highly enriched modules. t We analyze members of these modules from diferent perspectives, eg., expression patterns and pathways. t We identify relevant pathways by analyzing modules for both the stages. Such pathways provide an insight into the role of each gene in the body and can be further extended to identify its role during disease progression. t We discover certain interesting characteristics of causal genes such as MET and WARS. t We fnd three more pathways to be associated with the metastasis stage of the disease. Tese pathways can be further explored to fnd their role in progression of the disease. Experimental Results We implemented our network construction and module extraction method in MATLAB running on an HP Z 800 workstation with two 2.4 GHz Intel(R) Xeon(R) processors and 12 GB RAM, using the Windows7 operating system. The semantic similarity for a given pair of genes is found using the GOSemSim package7 in R. Parameter tuning for p-value computation. We carried out our experiments at various CCT and SST thresholds. CCT and SST are a clustering coefcient and a semantic similarity threshold respectively, set by the Scientific RepoRts | 7: 1072 | DOI:10.1038/s41598-017-00996-x 2 www.nature.com/scientificreports/ Non-metastasis modules Pathway associated Module No.: Members Pathways gene names p-value antigen processing and presentation CTSS purine metabolism PDE4B cytokine-cytokine receptor interaction, chemokine signalling pathway, NOD-like receptor signaling pathway, cytosolic DNA-sensing pathway, CCL5 toll-like receptor signaling pathway. cytokine-cytokine receptor interaction, chemokine signalling pathway, NOD-like receptor signaling pathway, cytosolic DNA-sensing pathway, CXCL10 1: SRGN, MX1, GBP1, PLEK, toll-like receptor signaling pathway, RIG-I-like receptor signaling pathway PDE4B, SLA, IL32, CXCL9, RUNX3, p53 signaling pathway, Wnt signaling pathway, focal adhesion, Jak-STAT IFI44L, CD52, LRMP, TRBV19, CCND2 signaling pathway, cyclins and cell cycle regulation 5.38E-6 CTSS, PDE4B, CCL5, CXCL10, CCND2, CYP1B1, WARS, MMP9, steroid hormone biosynthesis CYP1B1 PFKP, TAP1, ARHGAP4, SLC2A3 tryptophan metabolism, Aminoacyl-tRNA biosynthesis WARS leukocyte transendothelial migration, pathways in cancer MMP9 glycolysis/gluconeogenesis pentose phosphate pathway, fructose and PFKP mannose metabolism, galactose metabolism antigen processing and presentation TAP1 Rho cell motility signaling pathway ARHGAP4 facilitated glucose transporter SLC2A3 cytokine-cytokine receptor interaction, chemokine signalling pathway, NOD-like receptor signaling pathway, cytosolic DNA-sensing pathway, CCL5 toll-like receptor signaling pathway. 2: CCL5, CCND2, WARS, LAG3 3.08E-5 p53 signaling pathway, Wnt signaling pathway, focal adhesion, Jak-STAT CCND2 signaling pathway, cyclins and cell cycle regulation tryptophan metabolism, aminoacyl-tRNA biosynthesis WARS cytokine-cytokine receptor interaction, chemokine signalling pathway, NOD-like receptor signaling pathway, cytosolic DNA-sensing pathway, CCL5 toll-like receptor signaling pathway. 3: CCL5, CCND2, WARS, IGK@, 1.68E-4 IGLV5-45 p53 signaling pathway, Wnt signaling pathway, focal adhesion, Jak-STAT CCND2 signaling pathway, cyclins and cell cycle regulation tryptophan metabolism, aminoacyl-tRNA biosynthesis WARS cytokine-cytokine receptor interaction, chemokine signalling pathway, NOD-like receptor signaling pathway, cytosolic DNA-sensing pathway, CCL5 toll-like receptor signaling pathway. 4: CCL5, CCND2, WARS, IGLV3-19 1.68E-4 p53 signaling pathway, Wnt signaling pathway, focal
Recommended publications
  • Location Analysis of Estrogen Receptor Target Promoters Reveals That
    Location analysis of estrogen receptor ␣ target promoters reveals that FOXA1 defines a domain of the estrogen response Jose´ e Laganie` re*†, Genevie` ve Deblois*, Ce´ line Lefebvre*, Alain R. Bataille‡, Franc¸ois Robert‡, and Vincent Gigue` re*†§ *Molecular Oncology Group, Departments of Medicine and Oncology, McGill University Health Centre, Montreal, QC, Canada H3A 1A1; †Department of Biochemistry, McGill University, Montreal, QC, Canada H3G 1Y6; and ‡Laboratory of Chromatin and Genomic Expression, Institut de Recherches Cliniques de Montre´al, Montreal, QC, Canada H2W 1R7 Communicated by Ronald M. Evans, The Salk Institute for Biological Studies, La Jolla, CA, July 1, 2005 (received for review June 3, 2005) Nuclear receptors can activate diverse biological pathways within general absence of large scale functional data linking these putative a target cell in response to their cognate ligands, but how this binding sites with gene expression in specific cell types. compartmentalization is achieved at the level of gene regulation is Recently, chromatin immunoprecipitation (ChIP) has been used poorly understood. We used a genome-wide analysis of promoter in combination with promoter or genomic DNA microarrays to occupancy by the estrogen receptor ␣ (ER␣) in MCF-7 cells to identify loci recognized by transcription factors in a genome-wide investigate the molecular mechanisms underlying the action of manner in mammalian cells (20–24). This technology, termed 17␤-estradiol (E2) in controlling the growth of breast cancer cells. ChIP-on-chip or location analysis, can therefore be used to deter- We identified 153 promoters bound by ER␣ in the presence of E2. mine the global gene expression program that characterize the Motif-finding algorithms demonstrated that the estrogen re- action of a nuclear receptor in response to its natural ligand.
    [Show full text]
  • Functional Genomics Atlas of Synovial Fibroblasts Defining Rheumatoid Arthritis
    medRxiv preprint doi: https://doi.org/10.1101/2020.12.16.20248230; this version posted December 18, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission. Functional genomics atlas of synovial fibroblasts defining rheumatoid arthritis heritability Xiangyu Ge1*, Mojca Frank-Bertoncelj2*, Kerstin Klein2, Amanda Mcgovern1, Tadeja Kuret2,3, Miranda Houtman2, Blaž Burja2,3, Raphael Micheroli2, Miriam Marks4, Andrew Filer5,6, Christopher D. Buckley5,6,7, Gisela Orozco1, Oliver Distler2, Andrew P Morris1, Paul Martin1, Stephen Eyre1* & Caroline Ospelt2*,# 1Versus Arthritis Centre for Genetics and Genomics, School of Biological Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, Manchester, UK 2Department of Rheumatology, Center of Experimental Rheumatology, University Hospital Zurich, University of Zurich, Zurich, Switzerland 3Department of Rheumatology, University Medical Centre, Ljubljana, Slovenia 4Schulthess Klinik, Zurich, Switzerland 5Institute of Inflammation and Ageing, University of Birmingham, Birmingham, UK 6NIHR Birmingham Biomedical Research Centre, University Hospitals Birmingham NHS Foundation Trust, University of Birmingham, Birmingham, UK 7Kennedy Institute of Rheumatology, University of Oxford Roosevelt Drive Headington Oxford UK *These authors contributed equally #corresponding author: [email protected] NOTE: This preprint reports new research that has not been certified by peer review and should not be used to guide clinical practice. 1 medRxiv preprint doi: https://doi.org/10.1101/2020.12.16.20248230; this version posted December 18, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
    [Show full text]
  • Probabilistic Models for Collecting, Analyzing, and Modeling Expression Data
    Probabilistic Models for Collecting, Analyzing, and Modeling Expression Data Hai-Son Phuoc Le May 2013 CMU-ML-13-101 Probabilistic Models for Collecting, Analyzing, and Modeling Expression Data Hai-Son Phuoc Le May 2013 CMU-ML-13-101 Machine Learning Department School of Computer Science Carnegie Mellon University Thesis Committee Ziv Bar-Joseph, Chair Christopher Langmead Roni Rosenfeld Quaid Morris Submitted in partial fulfillment of the requirements for the Degree of Doctor of Philosophy. Copyright @ 2013 Hai-Son Le This research was sponsored by the National Institutes of Health under grant numbers 5U01HL108642 and 1R01GM085022, the National Science Foundation under grant num- bers DBI0448453 and DBI0965316, and the Pittsburgh Life Sciences Greenhouse. The views and conclusions contained in this document are those of the author and should not be interpreted as representing the official policies, either expressed or implied, of any sponsoring institution, the U.S. government or any other entity. Keywords: genomics, gene expression, gene regulation, microarray, RNA-Seq, transcriptomics, error correction, comparative genomics, regulatory networks, cross-species, expression database, Gene Expression Omnibus, GEO, orthologs, microRNA, target prediction, Dirichlet Process, Indian Buffet Process, hidden Markov model, immune response, cancer. To Mom and Dad. i Abstract Advances in genomics allow researchers to measure the complete set of transcripts in cells. These transcripts include messenger RNAs (which encode for proteins) and microRNAs, short RNAs that play an important regulatory role in cellular networks. While this data is a great resource for reconstructing the activity of networks in cells, it also presents several computational challenges. These challenges include the data collection stage which often results in incomplete and noisy measurement, developing methods to integrate several experiments within and across species, and designing methods that can use this data to map the interactions and networks that are activated in specific conditions.
    [Show full text]
  • Nazal Polipli Hastalarda Scgb1c1 (Lıgand-Bındıng Proteın Ryd5)
    NAZAL POLİPLİ HASTALARDA SCGB1C1 (LIGAND-BINDING PROTEIN RYD5) GENİNDEKİ MOLEKÜLER PATOLOJİLERİN ARAŞTIRILMASI INVESTIGATION OF MOLECULAR PATHOLOGIES IN SCGB1C1 (LIGAND-BINDING PROTEIN RYD5) GENE IN PATIENTS WITH NASAL POLYPOSIS SİBEL UNURLU ÖZDAŞ PROF.DR. AFİFE İZBIRAK Tez Danışmanı Hacettepe Üniversitesi Lisansüstü Eğitim – Öğretim ve Sınav Yönetmeliğinin Fen Bilimleri Enstitüsü, Biyoloji Anabilim Dalı İçin Öngördüğü DOKTORA TEZİ olarak hazırlanmıştır 2013 NAZAL POLİPLİ HASTALARDA SCGB1C1 (LIGAND-BINDING PROTEIN RYD5) GENİNDEKİ MOLEKÜLER PATOLOJİLERİN ARAŞTIRILMASI INVESTIGATION OF MOLECULAR PATHOLOGIES IN SCGB1C1 (LIGAND-BINDING PROTEIN RYD5) GENE IN PATIENTS WITH NASAL POLYPOSIS SİBEL UNURLU ÖZDAŞ PROF.DR. AFİFE İZBIRAK Tez Danışmanı Hacettepe Üniversitesi Lisansüstü Eğitim – Öğretim ve Sınav Yönetmeliğinin Fen Bilimleri Enstitüsü, Biyoloji Anabilim Dalı İçin Öngördüğü DOKTORA TEZİ olarak hazırlanmıştır 2013 Sibel UNURLU ÖZDAŞ’ın hazırladığı “Nazal Polipli Hastalarda SCGB1C1 (Ligand-Binding Protein RYD5) Genindeki Moleküler Patolojilerin Araştırılması” adlı bu çalışma aşağıdaki jüri tarafından BİYOLOJİ ANABİLİM DALI'nda DOKTORA TEZİ olarak kabul edilmiştir. Başkan (Prof. Dr., Erol AKSÖZ) …………………………….. Danışman (Prof. Dr., Afife İZBIRAK) …………………………….. Üye (Prof. Dr. Hatice MERGEN) …………………………….. Üye (Doç. Dr., Rıza Köksal ÖZGÜL) …………………………….. Üye (Doç. Dr., Selim Sermed ERBEK) …………………………….. Bu tez Hacettepe Üniversitesi Fen Bilimleri Enstitüsü tarafından DOKTORA TEZİ olarak onaylanmıştır. Prof. Dr., Fatma Sevin DÜZ Fen Bilimleri Enstitüsü
    [Show full text]
  • Integration of Text Mining with Systems Biology Provides New Insight Into the Pathogenesis of Diabetic Neuropathy
    INTEGRATION OF TEXT MINING WITH SYSTEMS BIOLOGY PROVIDES NEW INSIGHT INTO THE PATHOGENESIS OF DIABETIC NEUROPATHY by Junguk Hur A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy (Bioinformatics) in The University of Michigan 2010 Doctoral Committee: Professor Eva L. Feldman, Co-Chair Professor Hosagrahar V. Jagadish, Co-Chair Professor Matthias Kretzler Research Assistant Professor Maureen Sartor Professor David J. States, University of Texas Junguk Hur © 2010 All Rights Reserved DEDICATION To my family ii ACKNOWLEDGMENTS Over the past few years, I have been tremendously fortunate to have the company and mentorship of the most wonderful and smartest scientists I know. My advisor, Prof. Eva Feldman, guided me through my graduate studies with constant support, encouragement, enthusiasm and infinite patience. I would like to thank her for being a mom in my academic life and raising me to become a better scientist. I would also like to thank my co-advisors Prof. H. V. Jagadish and Prof. David States. They provided sound advice and inspiration that have been instrumental in my Ph.D. study. I am also very grateful to Prof. Matthias Kretzler and Prof. Maureen Sartor for being an active part of my committee as well as for their continuous encouragement and guidance of my work. I would also like to thank Prof. Daniel Burns, Prof. Margit Burmeister, Prof. Gil Omenn and Prof. Brain Athey from the Bioinformatics Program, who have been very generous with their support and advice about academic life. I am also very grateful to Sherry, Julia and Judy, who helped me through the various administrative processes with cheerful and encouraging dispositions.
    [Show full text]
  • Lipophilin B (SCGB1D2) (NM 006551) Human Tagged ORF Clone Lentiviral Particle Product Data
    OriGene Technologies, Inc. 9620 Medical Center Drive, Ste 200 Rockville, MD 20850, US Phone: +1-888-267-4436 [email protected] EU: [email protected] CN: [email protected] Product datasheet for RC220905L3V Lipophilin B (SCGB1D2) (NM_006551) Human Tagged ORF Clone Lentiviral Particle Product data: Product Type: Lentiviral Particles Product Name: Lipophilin B (SCGB1D2) (NM_006551) Human Tagged ORF Clone Lentiviral Particle Symbol: SCGB1D2 Synonyms: LIPB; LPHB; LPNB Vector: pLenti-C-Myc-DDK-P2A-Puro (PS100092) ACCN: NM_006551 ORF Size: 270 bp ORF Nucleotide The ORF insert of this clone is exactly the same as(RC220905). Sequence: OTI Disclaimer: The molecular sequence of this clone aligns with the gene accession number as a point of reference only. However, individual transcript sequences of the same gene can differ through naturally occurring variations (e.g. polymorphisms), each with its own valid existence. This clone is substantially in agreement with the reference, but a complete review of all prevailing variants is recommended prior to use. More info OTI Annotation: This clone was engineered to express the complete ORF with an expression tag. Expression varies depending on the nature of the gene. RefSeq: NM_006551.3 RefSeq Size: 454 bp RefSeq ORF: 273 bp Locus ID: 10647 UniProt ID: O95969 Protein Families: Secreted Protein MW: 9.9 kDa This product is to be used for laboratory only. Not for diagnostic or therapeutic use. View online » ©2021 OriGene Technologies, Inc., 9620 Medical Center Drive, Ste 200, Rockville, MD 20850, US 1 / 2 Lipophilin B (SCGB1D2) (NM_006551) Human Tagged ORF Clone Lentiviral Particle – RC220905L3V Gene Summary: The protein encoded by this gene is a member of the lipophilin subfamily, part of the uteroglobin superfamily, and is an ortholog of prostatein, the major secretory glycoprotein of the rat ventral prostate gland.
    [Show full text]
  • SCGB1D2 Antibody (Center) Affinity Purified Rabbit Polyclonal Antibody (Pab) Catalog # Ap13793c
    10320 Camino Santa Fe, Suite G San Diego, CA 92121 Tel: 858.875.1900 Fax: 858.622.0609 SCGB1D2 Antibody (Center) Affinity Purified Rabbit Polyclonal Antibody (Pab) Catalog # AP13793c Specification SCGB1D2 Antibody (Center) - Product Information Application WB,E Primary Accession O95969 Other Accession NP_006542.1 Reactivity Human Host Rabbit Clonality Polyclonal Isotype Rabbit Ig Calculated MW 9925 Antigen Region 22-50 SCGB1D2 Antibody (Center) - Additional Information SCGB1D2 Antibody (Center) (Cat. Gene ID 10647 #AP13793c) western blot analysis in K562 cell line lysates (35ug/lane).This Other Names Secretoglobin family 1D member 2, demonstrates the SCGB1D2 antibody Lipophilin-B, SCGB1D2, LIPHB, LPNB detected the SCGB1D2 protein (arrow). Target/Specificity This SCGB1D2 antibody is generated from SCGB1D2 Antibody (Center) - Background rabbits immunized with a KLH conjugated synthetic peptide between 22-50 amino The protein encoded by this gene is a member acids from the Central region of human of the SCGB1D2. lipophilin subfamily, part of the uteroglobin superfamily, and is Dilution an ortholog of prostatein, the major secretory WB~~1:1000 glycoprotein of the rat ventral prostate gland. Lipophilin gene Format products are widely Purified polyclonal antibody supplied in PBS expressed in normal tissues, especially in with 0.09% (W/V) sodium azide. This endocrine-responsive antibody is purified through a protein A organs. Assuming that human lipophilins are column, followed by peptide affinity the functional purification. counterparts of prostatein, they may be transcriptionally regulated Storage by steroid hormones, with the ability to bind Maintain refrigerated at 2-8°C for up to 2 androgens, other weeks. For long term storage store at -20°C steroids and possibly bind and concentrate in small aliquots to prevent freeze-thaw estramustine, a cycles.
    [Show full text]
  • Downloaded from Ensembl
    UCSF UC San Francisco Electronic Theses and Dissertations Title Detecting genetic similarity between complex human traits by exploring their common molecular mechanism Permalink https://escholarship.org/uc/item/1k40s443 Author Gu, Jialiang Publication Date 2019 Peer reviewed|Thesis/dissertation eScholarship.org Powered by the California Digital Library University of California by Submitted in partial satisfaction of the requirements for degree of in in the GRADUATE DIVISION of the UNIVERSITY OF CALIFORNIA, SAN FRANCISCO AND UNIVERSITY OF CALIFORNIA, BERKELEY Approved: ______________________________________________________________________________ Chair ______________________________________________________________________________ ______________________________________________________________________________ ______________________________________________________________________________ ______________________________________________________________________________ Committee Members ii Acknowledgement This project would not have been possible without Prof. Dr. Hao Li, Dr. Jiashun Zheng and Dr. Chris Fuller at the University of California, San Francisco (UCSF) and Caribou Bioscience. The Li lab grew into a multi-facet research group consist of both experimentalists and computational biologists covering three research areas including cellular/molecular mechanism of ageing, genetic determinants of complex human traits and structure, function, evolution of gene regulatory network. Labs like these are the pillar of global success and reputation
    [Show full text]
  • Genome-Scale Detection of Positive Selection in 9 Primates Predicts Human-Virus Evolutionary Conflicts
    bioRxiv preprint doi: https://doi.org/10.1101/131680; this version posted April 27, 2017. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-ND 4.0 International license. Genome-scale detection of positive selection in 9 primates predicts human-virus evolutionary conflicts Robin van der Lee1,*,^, Laurens Wiel1,2, Teunis J.P. van Dam1,#, Martijn A. Huynen1 1Centre for Molecular and Biomolecular Informatics, 2Department of Human Genetics, Radboud Institute for Molecular Life Sciences, Radboud university medical center, Nijmegen, The Netherlands *Correspondence: [email protected] ^Current address: Centre for Molecular Medicine and Therapeutics, Department of Medical Genetics, BC Children’s Hospital Research Institute, University of British Columbia, Vancouver, Canada #Current address: Theoretical Biology and Bioinformatics, Department of Biology, Faculty of Science, Utrecht University, The Netherlands bioRxiv preprint doi: https://doi.org/10.1101/131680; this version posted April 27, 2017. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-ND 4.0 International license. Abstract Hotspots of rapid genome evolution hold clues about human adaptation. Here, we present a comparative analysis of nine whole-genome sequenced primates to identify high-confidence targets of positive selection. We find strong statistical evidence for positive selection acting on 331 protein-coding genes (3%), pinpointing 934 adaptively evolving codons (0.014%).
    [Show full text]
  • Identification and Mechanistic Investigation of Recurrent Functional Genomic and Transcriptional Alterations in Advanced Prostat
    Identification and Mechanistic Investigation of Recurrent Functional Genomic and Transcriptional Alterations in Advanced Prostate Cancer Thomas A. White A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy University of Washington 2013 Reading Committee: Peter S. Nelson, Chair Janet L. Stanford Raymond J. Monnat Jay A. Shendure Program Authorized to Offer Degree: Molecular and Cellular Biology ©Copyright 2013 Thomas A. White University of Washington Abstract Identification and Mechanistic Investigation of Recurrent Functional Genomic and Transcriptional Alterations in Advanced Prostate Cancer Thomas A. White Chair of the Supervisory Committee: Peter S. Nelson, MD Novel functionally altered transcripts may be recurrent in prostate cancer (PCa) and may underlie lethal and advanced disease and the neuroendocrine small cell carcinoma (SCC) phenotype. We conducted an RNASeq survey of the LuCaP series of 24 PCa xenograft tumors from 19 men, and validated observations on metastatic tumors and PCa cell lines. Key findings include discovery and validation of 40 novel fusion transcripts including one recurrent chimera, many SCC-specific and castration resistance (CR) -specific novel splice isoforms, new observations on SCC-specific and TMPRSS2-ERG specific differential expression, the allele-specific expression of certain recurrent non- synonymous somatic single nucleotide variants (nsSNVs) previously discovered via exome sequencing of the same tumors, rgw ubiquitous A-to-I RNA editing of base excision repair (BER) gene NEIL1 as well as CDK13, a kinase involved in RNA splicing, and the SCC expression of a previously unannotated long noncoding RNA (lncRNA) at Chr6p22.2. Mechanistic investigation of the novel lncRNA indicates expression is regulated by derepression by master neuroendocrine regulator RE1-silencing transcription factor (REST), and may regulate some genes of axonogenesis and angiogenesis.
    [Show full text]
  • SCGB1D2 Antibody Cat
    SCGB1D2 Antibody Cat. No.: 57-276 SCGB1D2 Antibody Specifications HOST SPECIES: Rabbit SPECIES REACTIVITY: Human This SCGB1D2 antibody is generated from rabbits immunized with a KLH conjugated IMMUNOGEN: synthetic peptide between 22-50 amino acids from the Central region of human SCGB1D2. TESTED APPLICATIONS: WB APPLICATIONS: For WB starting dilution is: 1:1000 PREDICTED MOLECULAR 10 kDa WEIGHT: Properties This antibody is purified through a protein A column, followed by peptide affinity PURIFICATION: purification. CLONALITY: Polyclonal ISOTYPE: Rabbit Ig CONJUGATE: Unconjugated September 24, 2021 1 https://www.prosci-inc.com/scgb1d2-antibody-57-276.html PHYSICAL STATE: Liquid BUFFER: Supplied in PBS with 0.09% (W/V) sodium azide. CONCENTRATION: batch dependent Store at 4˚C for three months and -20˚C, stable for up to one year. As with all antibodies STORAGE CONDITIONS: care should be taken to avoid repeated freeze thaw cycles. Antibodies should not be exposed to prolonged high temperatures. Additional Info OFFICIAL SYMBOL: SCGB1D2 ALTERNATE NAMES: Secretoglobin family 1D member 2, Lipophilin-B, SCGB1D2, LIPHB, LPNB ACCESSION NO.: O95969 GENE ID: 10647 USER NOTE: Optimal dilutions for each application to be determined by the researcher. Background and References The protein encoded by this gene is a member of the lipophilin subfamily, part of the uteroglobin superfamily, and is an ortholog of prostatein, the major secretory glycoprotein of the rat ventral prostate gland. Lipophilin gene products are widely expressed in normal tissues, especially in endocrine-responsive organs. Assuming that human lipophilins are the functional counterparts of prostatein, they may be BACKGROUND: transcriptionally regulated by steroid hormones, with the ability to bind androgens, other steroids and possibly bind and concentrate estramustine, a chemotherapeutic agent widely used for prostate cancer.
    [Show full text]
  • Arxiv:2105.12807V2 [Q-Bio.GN] 4 Jul 2021 CE6T Odn Kand UK ∗ London, 6BT, WC1E 1 Nfr Aiodapoiainadpoeto (UMAP) Projection and Other and 2008] Data
    arXiv, 2021, pp. 1–12 Preprint XOmiVAE Problem Solving Protocol PROBLEM SOLVING PROTOCOL XOmiVAE: an interpretable deep learning model for cancer classification using high-dimensional omics data Eloise Withnell,1,2,† Xiaoyu Zhang,1,∗,† Kai Sun1 and Yike Guo1,3 1Data Science Institute, Imperial College London, SW7 2AZ, London, UK, 2Department of Health Informatics, University College London, WC1E 6BT, London, UK and 3Department of Computer Science, Hong Kong Baptist University, Hong Kong, China ∗Corresponding author. [email protected]†The first two authors contributed equally to this paper. Abstract The lack of explainability is one of the most prominent disadvantages of deep learning applications in omics. This “black box” problem can undermine the credibility and limit the practical implementation of biomedical deep learning models. Here we present XOmiVAE, a variational autoencoder (VAE) based interpretable deep learning model for cancer classification using high-dimensional omics data. XOmiVAE is capable of revealing the contribution of each gene and latent dimension for each classification prediction, and the correlation between each gene and each latent dimension. It is also demonstrated that XOmiVAE can explain not only the supervised classification but the unsupervised clustering results from the deep learning network. To the best of our knowledge, XOmiVAE is one of the first activation level-based interpretable deep learning models explaining novel clusters generated by VAE. The explainable results generated by XOmiVAE were validated by both the performance of downstream tasks and the biomedical knowledge. In our experiments, XOmiVAE explanations of deep learning based cancer classification and clustering aligned with current domain knowledge including biological annotation and academic literature, which shows great potential for novel biomedical knowledge discovery from deep learning models.
    [Show full text]