Improved Detection of Gene Fusions by Applying Statistical Methods Reveals Oncogenic RNA Cancer Drivers Roozbeh Dehghannasiria, Donald E

Total Page:16

File Type:pdf, Size:1020Kb

Improved Detection of Gene Fusions by Applying Statistical Methods Reveals Oncogenic RNA Cancer Drivers Roozbeh Dehghannasiria, Donald E Improved detection of gene fusions by applying statistical methods reveals oncogenic RNA cancer drivers Roozbeh Dehghannasiria, Donald E. Freemana,b, Milos Jordanskic, Gillian L. Hsieha, Ana Damljanovicd, Erik Lehnertd, and Julia Salzmana,b,e,1 aDepartment of Biochemistry, Stanford University, Stanford, CA 94305; bDepartment of Biomedical Data Science, Stanford University, Stanford, CA 94305; cDepartment of Computer Science, University of Belgrade, 11000 Belgrade, Serbia; dSeven Bridges Genomics Inc., Cambridge, MA 02142; and eStanford Cancer Institute, Stanford University, Stanford, CA 94305 Edited by Ali Torkamani, The Scripps Research Institute, La Jolla, CA, and accepted by Editorial Board Member Peter K. Vogt June 14, 2019 (received for review January 10, 2019) The extent to which gene fusions function as drivers of can- of multiple algorithms and filtering lists of fusions using manual cer remains a critical open question. Current algorithms do not approaches (13–15). These approaches lead to what third-party sufficiently identify false-positive fusions arising during library reviews agree is imprecise fusion discovery and bias against dis- preparation, sequencing, and alignment. Here, we introduce Data- covering novel oncogenes (15–17). This suboptimal performance Enriched Efficient PrEcise STatistical fusion detection (DEEPEST), becomes more problematic when fusion detection is deployed on an algorithm that uses statistical modeling to minimize false- large cancer sequencing datasets that contain thousands or tens positives while increasing the sensitivity of fusion detection. In of thousands of samples. In such scenarios, precise fusion detec- 9,946 tumor RNA-sequencing datasets from The Cancer Genome tion must overcome the problem of multiple hypothesis testing: Atlas (TCGA) across 33 tumor types, DEEPEST identifies 31,007 each algorithm is testing for fusions thousands of times, a regime fusions, 30% more than identified by other methods, while known to introduce FPs. To overcome these problems, the field calling 10-fold fewer false-positive fusions in nontransformed has turned to consensus-based approaches, where multiple algo- human tissues. We leverage the increased precision of DEEPEST rithms are run in parallel (10), and a metacaller allows “voting” to discover fundamental cancer biology. Namely, 888 candi- to produce the final list of fusions. This is also unsatisfactory, as date oncogenes are identified based on overrepresentation in it introduces FNs. DEEPEST calls, and 1,078 previously unreported fusions involv- Both shortcomings in the ascertainment of fusions by existing ing long intergenic noncoding RNAs, demonstrating a previously algorithms and using recurrence alone to assess fusions’ function unappreciated prevalence and potential for function. DEEPEST also reveals a high enrichment for fusions involving oncogenes Significance in cancers, including ovarian cancer, which has had minimal treat- ment advances in recent decades, finding that more than 50% of Gene fusions are tumor-specific genomic aberrations and are tumors harbor gene fusions predicted to be oncogenic. Specific among the most powerful biomarkers and drug targets in trans- protein domains are enriched in DEEPEST calls, indicating a global lational cancer biology. The advent of RNA-sequencing tech- selection for fusion functionality: kinase domains are nearly 2-fold nologies over the last decade has provided a unique opportu- more enriched in DEEPEST calls than expected by chance, as are nity for detecting novel fusions via deploying computational domains involved in (anaerobic) metabolism and DNA binding. algorithms on public sequencing databases. However, pre- The statistical algorithms, population-level analytic framework, cise fusion detection algorithms are still out of reach. We and the biological conclusions of DEEPEST call for increased atten- develop Data-Enriched Efficient PrEcise STatistical fusion detec- tion to gene fusions as drivers of cancer and for future research tion (DEEPEST), a highly specific and efficient statistical pipeline into using fusions for targeted therapy. specially designed for mining massive sequencing databases gene fusion j cancer genomics j bioinformatics j and apply it to all 33 tumor types and 10,500 samples in The pan-cancer analysis j TCGA Cancer Genome Atlas database. We systematically profile the landscape of detected fusions via classic statistical models and ene fusions are known to drive some cancers and can be identify several signatures of selection for fusions in tumors. Ghighly specific and personalized therapeutic targets; some Author contributions: R.D., D.E.F., and J.S. designed research; R.D., D.E.F., and J.S. of the most famous fusions are the BCR–ABL1 fusion in chronic performed research; R.D., D.E.F., M.J., G.L.H., A.D., E.L., and J.S. contributed new myelogenous leukemia (CML), the EML4–ALK fusion in non- reagents/analytic tools; R.D., D.E.F., and J.S. analyzed data; and R.D. and J.S. wrote the small lung cell carcinoma, TMPRSS2–ERG in prostate cancer, paper.y and FGFR3–TACC3 in a variety of cancers including glioblas- The authors declare no conflict of interest.y toma multiforme (1–4). Since fusions are generally absent in This article is a PNAS Direct Submission. A.T. is a guest editor invited by the Editorial healthy tissues, they are among the most clinically relevant events Board.y in cancer to direct targeted therapy and to be used as effective Published under the PNAS license.y diagnostic tools in early detection strategies using RNA or pro- Data deposition: DEEPEST workflow, with all needed softwares preinstalled, have been teins; moreover, as they are truly specific to cancer, they have deposited in GitHub, https://github.com/salzmanlab/DEEPEST-Fusion. Also, a publicly- promising potential as neo-antigens (5–7). available online tool with web interface is available for the DEEPEST algorithm on the Cancer Genomics Cloud platform, https://cgc.sbgenomics.com/public/apps#jordanski. Because of this, clinicians and large sequencing consortia have milos/deepest-fusion/deepest-fusion/. All custom scripts used to generate the figures made major efforts to identify fusions expressed in tumors via have been deposited in GitHub, https://github.com/salzmanlab/DEEPEST-Fusion/tree/ screening massive cancer sequencing datasets (8–12). However, master/custom scripts.y these attempts are limited by critical roadblocks: current algo- 1 To whom correspondence may be addressed. Email: [email protected] rithms suffer from high false-positive (FP) rates and unknown This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10. false-negative (FN) rates. Thus, ad hoc choices have been made 1073/pnas.1900391116/-/DCSupplemental.y in calling and analyzing fusions including taking the consensus Published online July 15, 2019. 15524–15533 j PNAS j July 30, 2019 j vol. 116 j no. 31 www.pnas.org/cgi/doi/10.1073/pnas.1900391116 Downloaded by guest on September 26, 2021 have limited the use of fusions to discover new cancer biology. As nature of fusion expression consistent with the existence of one of many examples, a recent study of more than 400 pancre- under-appreciated drivers of human cancer, including selection atic cancers found no recurrent gene fusions, raising the question for rare or private gene fusions with implications from basic of whether this is due to high FN rates or whether this means biology to the clinic. that fusions are not drivers in the disease (18). Recurrence of fusions is currently one of the only standards in the field used Results to assess the functionality of fusions, but the most frequently DEEPEST Is a Statistical Algorithm for Gene Fusion Discovery in expressed fusions may not be the most carcinogenic (19); on the Massive Public Databases. We engineered a statistical algorithm, other hand, there may still be many undiscovered gene fusions DEEPEST, to discover and estimate the prevalence of gene that drive cancer. fusions in massive numbers of datasets. Here, we have applied Thus, the critical question “Are gene fusions underappreci- DEEPEST to ∼10,000 datasets, but in principle, DEEPEST can ated drivers of cancer?” is still unanswered. In this paper, we be applied to 100,000, 1 million, or more samples. DEEPEST first provide an algorithm that has significant advance in preci- includes key innovations such as controlling FPs arising from sion for unbiased fusion detection at exon boundaries in mas- analysis of massive RNA-Seq datasets for fusion discovery, a sive genomics datasets. The algorithm, Data-Enriched Efficient problem conceptually analogous to multiple hypothesis testing PrEcise STatistical fusion detection (DEEPEST), is a second- via P values, which cannot be solved by direct application of com- generation fusion algorithm with significant computational and mon false-discovery rate (FDR)-controlling procedures, which algorithmic advance over our previously developed MACHETE rely on the assumption of a uniform distribution of P values (Mismatched Alignment Chimera Tracking Engine) algorithm under the null hypothesis. (20). A key innovation in DEEPEST is its statistical test of fusion The DEEPEST pipeline contains 2 main computational steps: prevalence across populations, which can identify FPs in a global 1) junction nomination component which is run on a subset of all unbiased manner. samples to be analyzed, called “the discovery set”; and 2) statis- The precision and efficient implementation of DEEPEST tical testing of nominated junctions
Recommended publications
  • Improved Detection of Gene Fusions by Applying Statistical Methods Reveals New Oncogenic RNA Cancer Drivers
    bioRxiv preprint doi: https://doi.org/10.1101/659078; this version posted June 3, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. Improved detection of gene fusions by applying statistical methods reveals new oncogenic RNA cancer drivers Roozbeh Dehghannasiri1, Donald Eric Freeman1,2, Milos Jordanski3, Gillian L. Hsieh1, Ana Damljanovic4, Erik Lehnert4, Julia Salzman1,2,5* Author affiliation 1Department of Biochemistry, Stanford University, Stanford, CA 94305 2Department of Biomedical Data Science, Stanford University, Stanford, CA 94305 3Department of Computer Science, University of Belgrade, Belgrade, Serbia 4Seven Bridges Genomics, Cambridge, MA 02142 5Stanford Cancer Institute, Stanford, CA 94305 *Corresponding author [email protected] Short Abstract: The extent to which gene fusions function as drivers of cancer remains a critical open question. Current algorithms do not sufficiently identify false-positive fusions arising during library preparation, sequencing, and alignment. Here, we introduce a new algorithm, DEEPEST, that uses statistical modeling to minimize false-positives while increasing the sensitivity of fusion detection. In 9,946 tumor RNA-sequencing datasets from The Cancer Genome Atlas (TCGA) across 33 tumor types, DEEPEST identifies 31,007 fusions, 30% more than identified by other methods, while calling ten-fold fewer false-positive fusions in non-transformed human tissues. We leverage the increased precision of DEEPEST to discover new cancer biology. For example, 888 new candidate oncogenes are identified based on over-representation in DEEPEST-Fusion calls, and 1,078 previously unreported fusions involving long intergenic noncoding RNAs partners, demonstrating a previously unappreciated prevalence and potential for function.
    [Show full text]
  • "The Genecards Suite: from Gene Data Mining to Disease Genome Sequence Analyses". In: Current Protocols in Bioinformat
    The GeneCards Suite: From Gene Data UNIT 1.30 Mining to Disease Genome Sequence Analyses Gil Stelzer,1,5 Naomi Rosen,1,5 Inbar Plaschkes,1,2 Shahar Zimmerman,1 Michal Twik,1 Simon Fishilevich,1 Tsippi Iny Stein,1 Ron Nudel,1 Iris Lieder,2 Yaron Mazor,2 Sergey Kaplan,2 Dvir Dahary,2,4 David Warshawsky,3 Yaron Guan-Golan,3 Asher Kohn,3 Noa Rappaport,1 Marilyn Safran,1 and Doron Lancet1,6 1Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, Israel 2LifeMap Sciences Ltd., Tel Aviv, Israel 3LifeMap Sciences Inc., Marshfield, Massachusetts 4Toldot Genetics Ltd., Hod Hasharon, Israel 5These authors contributed equally to the paper 6Corresponding author GeneCards, the human gene compendium, enables researchers to effectively navigate and inter-relate the wide universe of human genes, diseases, variants, proteins, cells, and biological pathways. Our recently launched Version 4 has a revamped infrastructure facilitating faster data updates, better-targeted data queries, and friendlier user experience. It also provides a stronger foundation for the GeneCards suite of companion databases and analysis tools. Improved data unification includes gene-disease links via MalaCards and merged biological pathways via PathCards, as well as drug information and proteome expression. VarElect, another suite member, is a phenotype prioritizer for next-generation sequencing, leveraging the GeneCards and MalaCards knowledgebase. It au- tomatically infers direct and indirect scored associations between hundreds or even thousands of variant-containing genes and disease phenotype terms. Var- Elect’s capabilities, either independently or within TGex, our comprehensive variant analysis pipeline, help prepare for the challenge of clinical projects that involve thousands of exome/genome NGS analyses.
    [Show full text]
  • Molecular Mechanisms in Muscular Dystrophy : a Gene Expression Profiling Study Turk, R
    Molecular mechanisms in muscular dystrophy : a gene expression profiling study Turk, R. Citation Turk, R. (2006, September 27). Molecular mechanisms in muscular dystrophy : a gene expression profiling study. Retrieved from https://hdl.handle.net/1887/4577 Version: Corrected Publisher’s Version Licence agreement concerning inclusion of doctoral License: thesis in the Institutional Repository of the University of Leiden Downloaded from: https://hdl.handle.net/1887/4577 Note: To cite this publication please use the final published version (if applicable). Molecular Mechanisms In Muscular Dystrophy A Gene Expression Profiling Study Molecular Mechanisms In Muscular Dystrophy A Gene Expression Profiling Study Proefschrift ter verkrijging van de graad van Doctor aan de Universiteit Leiden, op gezag van de Rector Magnificus Dr. D.D.Breimer, hoogleraar in de faculteit der Wiskunde en Natuurwetenschappen en die der Geneeskunde, volgens besluit van het College voor Promoties te verdedigen op woensdag 27 september 2006 klokke 15.00 uur door Rolf Turk Geboren te Leiden in 1975 Promotiecommissie Promotor Prof. Dr. G.J.B. van Ommen Co-promotores Dr. J.T. den Dunnen Dr. P.A.C. ‘t Hoen Referent Prof. Dr. R.M.W. Hofstra (Rijksuniversiteit Groningen) Overige leden Prof. Dr. M. Koenig (Université Louis Pasteur de Strasbourg) An experiment is a question which science poses to Nature, and a measurement is the recording of Nature’s answer. Max Planck Aan Maaike, Gerard en Annet Printed by: Drukkerij Duineveld ISBN-10: 90-9021042-3 ISBN-13: 978-90-9021042-1 Turk, Rolf Molecular mechanisms in muscular dystrophy. A gene expression profiling study. Thesis, Leiden University Medical Center September 27, 2006 © Rolf Turk No part of this thesis may be reproduced or transmitted in any form or by any means, without the written permission of the copyright owner Molecular Mechanisms In Muscular Dystrophies Preface 9 Chapter 1 Introduction 11 1.
    [Show full text]
  • ALK Gene ALK Receptor Tyrosine Kinase
    ALK gene ALK receptor tyrosine kinase Normal Function The ALK gene provides instructions for making a protein called ALK receptor tyrosine kinase, which is part of a family of proteins called receptor tyrosine kinases (RTKs). Receptor tyrosine kinases transmit signals from the cell surface into the cell through a process called signal transduction. The process begins when the kinase is stimulated at the cell surface and then attaches to a similar kinase (dimerizes). After dimerization, the kinase is tagged with a marker called a phosphate group (a cluster of oxygen and phosphorus atoms) in a process called phosphorylation. Phosphorylation turns on ( activates) the kinase. The activated kinase is able to transfer a phosphate group to another protein inside the cell, which is activated as a result. The activation continues through a series of proteins in a signaling pathway. These signaling pathways are important in many cellular processes such as cell growth and division (proliferation) or maturation (differentiation). Although the specific function of ALK receptor tyrosine kinase is unknown, it is thought to act early in development to help regulate the proliferation of nerve cells. Health Conditions Related to Genetic Changes Neuroblastoma At least 16 mutations in the ALK gene have been identified in some people with neuroblastoma, a type of cancerous tumor composed of immature nerve cells ( neuroblasts). Neuroblastoma and other cancers occur when a buildup of genetic mutations in critical genes—those that control cell proliferation or differentiation—allows cells to grow and divide uncontrollably to form a tumor. In most cases, these genetic changes are acquired during a person's lifetime and are called somatic mutations.
    [Show full text]
  • Crystal Structure of EML1 Reveals the Basis for Hsp90 Dependence of Oncogenic EML4-ALK by Disruption of an Atypical Β-Propeller Domain
    Crystal structure of EML1 reveals the basis for Hsp90 dependence of oncogenic EML4-ALK by disruption of an atypical β-propeller domain Mark W. Richardsa, Edward W. P. Lawb,La’Verne P. Rennallsc, Sara Busaccab, Laura O’Regana, Andrew M. Frya, Dean A. Fennellb, and Richard Baylissa,1 aDepartment of Biochemistry, University of Leicester, Leicester LE1 9HN, United Kingdom; bDepartment of Cancer Studies and Molecular Medicine, University of Leicester, Leicester LE1 9HN, United Kingdom; and cSection of Structural Biology, Institute of Cancer Research, London SW3 6JB, United Kingdom Edited by Charles David Stout, The Scripps Research Institute, La Jolla, CA, and accepted by the Editorial Board February 24, 2014 (received for review December 9, 2013) Proteins of the echinoderm microtubule-associated protein (EMAP)- variant 1 and regression in some EML4-ALK–positive tumor like (EML) family contribute to formation of the mitotic spindle and models (7, 9, 10). Furthermore, clinical efficacy of an Hsp90 in- interphase microtubule network. They contain a unique hydropho- hibitor in EML4-ALK NSCLC has been confirmed (11), and bic EML protein (HELP) motif and a variable number of WD40 clinical trials are ongoing. However, because neither ALK nor repeats. Recurrent gene rearrangements in nonsmall cell lung cancer EML4 are native Hsp90 clients, it was proposed that Hsp90 sen- fuse EML4 to anaplastic lymphoma kinase (ALK), causing expression sitivity of EML4-ALK fusions was due to their protein-folding of several fusion oncoprotein variants. We have determined a 2.6-Å properties, which might expose hydrophobic residues that lead to crystal structure of the representative ∼70-kDa core of EML1, re- Hsp90 recruitment (12).
    [Show full text]
  • S12967-021-02982-4.Pdf
    Xia et al. J Transl Med (2021) 19:308 https://doi.org/10.1186/s12967-021-02982-4 Journal of Translational Medicine RESEARCH Open Access Molecular characteristics and clinical outcomes of complex ALK rearrangements identifed by next-generation sequencing in non-small cell lung cancers Peiyi Xia1†, Lan Zhang1†, Pan Li1†, Enjie Liu1, Wencai Li1, Jianying Zhang2, Hui Li3, Xiaoxing Su3 and Guozhong Jiang1* Abstract Background: Complex kinase rearrangement, a mutational process involving one or two chromosomes with clustered rearrangement breakpoints, interferes with the accurate detection of kinase fusions by DNA-based next- generation sequencing (NGS). We investigated the characteristics of complex ALK rearrangements in non-small cell lung cancers using multiple molecular tests. Methods: Samples of non-small cell lung cancer patients were analyzed by targeted-capture DNA-based NGS with probes tilling the selected intronic regions of fusion partner genes, RNA-based NGS, RT-PCR, immunohistochemistry (IHC) and fuorescence in situ hybridization (FISH). Results: In a large cohort of 6576 non-small cell lung cancer patients, 343 (5.2%) cases harboring ALK rearrange- ments were identifed. Fourteen cases with complex ALK rearrangements were identifed by DNA-based NGS and classifed into three types by integrating various genomic features, including intergenic (n 3), intragenic (n 5) and “bridge joint” rearrangements (n 6). All thirteen cases with sufcient samples actually expressed= canonical EML4-ALK= fusion transcripts confrmed by RNA-based= NGS. Besides, positive ALK IHC was detected in 13 of 13 cases, and 9 of 11 cases were positive in FISH testing. Patients with complex ALK rearrangements who received ALK inhibitors treatment (n 6), showed no diference in progression-free survival (PFS) compared with patients with canonical ALK fusions n = 36, P 0.9291).
    [Show full text]
  • Cytogenetic and Molecular Characterization of the Macro- And
    University of Ulm Department of Human Genetics Prof. Dr. med. Walther Vogel Cytogenetic and Molecular Characterization of the Macro- and Micro-inversions, which Distinguish the Human and the Chimpanzee Karyotypes - from Speciation to Polymorphism Thesis Applying for the Degree of Doctor of Human Biology (Dr. hum. biol.) Faculty of Medicine University of Ulm Presented by Justyna Monika Szamalek from Wrze śnia in Poland 2006 Amtierender Dekan: Prof. Dr. Klaus-Michael Debatin 1. Berichterstatter: Prof. Dr. med. Horst Hameister 2. Berichterstatter: Prof. Dr. med. Konstanze Döhner Tag der Promotion: 28.07.2006 Content Content 1. Introduction ...................................................................................................................7 1.1. Primate phylogeny........................................................................................................7 1.2. Africa as the place of human origin and the living area of the present-day chimpanzee populations .................................................................9 1.3. Cytogenetic and molecular differences between human and chimpanzee genomes.............................................................................................10 1.4. Cytogenetic and molecular differences between common chimpanzee and bonobo genomes................................................................................17 1.5. Theory of speciation .....................................................................................................18 1.6. Theory of selection
    [Show full text]
  • Aggressive Differentiated Thyroid Cancer Due to Eml4e13-Alke20 Fusion: a Case Presentation and Review of the Literature
    Hindawi Case Reports in Endocrinology Volume 2021, Article ID 8837399, 7 pages https://doi.org/10.1155/2021/8837399 Case Report Aggressive Differentiated Thyroid Cancer due to EML4e13-ALKe20 Fusion: A Case Presentation and Review of the Literature Rodhan Khthir ,1 Zainab Shaheen ,1 Prasanna Santhanam,2 and Saroj Sigdel1 1Marshall University, School of Medicine, Huntington, WV, USA 2Johns Hopkins University, School of Medicine, Baltimore, MD, USA Correspondence should be addressed to Rodhan Khthir; [email protected] Received 5 August 2020; Revised 14 January 2021; Accepted 28 January 2021; Published 15 February 2021 Academic Editor: Toshihiro Kita Copyright © 2021 Rodhan Khthir et al. (is is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Background. Differentiated thyroid cancer (DTC) is an indolent malignancy. It rarely presents with aggressive local invasion and/or distant metastatic disease. Patient findings. We describe a case of a 30-year-old man with a locally aggressive form of papillary thyroid cancer with EML4e13-ALKe20 fusion (EML4: echinoderm microtubule-associated protein-like 4; ALK: anaplastic lymphoma kinase). He presented with right-side cervical lymphadenopathy with a highly suspicious right-side thyroid nodule. Total thyroidectomy and level IV lymph node resection showed extensive bilateral disease, with extra- thyroidal and extranodal extension. FDG-PET CT scan following surgery confirmed the presence of significant residual disease in the neck area. He underwent bilateral lateral lymph node dissection followed by radioactive iodine treatment. Somatic mutation testing showed EML4e13-ALKe20 fusion.
    [Show full text]
  • Mouse Eml4 Conditional Knockout Project (CRISPR/Cas9)
    https://www.alphaknockout.com Mouse Eml4 Conditional Knockout Project (CRISPR/Cas9) Objective: To create a Eml4 conditional knockout Mouse model (C57BL/6J) by CRISPR/Cas-mediated genome engineering. Strategy summary: The Eml4 gene (NCBI Reference Sequence: NM_001114361 ; Ensembl: ENSMUSG00000032624 ) is located on Mouse chromosome 17. 24 exons are identified, with the ATG start codon in exon 1 and the TAA stop codon in exon 24 (Transcript: ENSMUST00000096766). Exon 5~7 will be selected as conditional knockout region (cKO region). Deletion of this region should result in the loss of function of the Mouse Eml4 gene. To engineer the targeting vector, homologous arms and cKO region will be generated by PCR using BAC clone RP23-456J9 as template. Cas9, gRNA and targeting vector will be co-injected into fertilized eggs for cKO Mouse production. The pups will be genotyped by PCR followed by sequencing analysis. Note: Exon 5 starts from about 17.31% of the coding region. The knockout of Exon 5~7 will result in frameshift of the gene. The size of intron 4 for 5'-loxP site insertion: 1791 bp, and the size of intron 7 for 3'-loxP site insertion: 11782 bp. The size of effective cKO region: ~1526 bp. The cKO region does not have any other known gene. Page 1 of 8 https://www.alphaknockout.com Overview of the Targeting Strategy Wildtype allele 5' gRNA region gRNA region 3' 1 4 5 6 7 24 Targeting vector Targeted allele Constitutive KO allele (After Cre recombination) Legends Exon of mouse Eml4 Homology arm cKO region loxP site Page 2 of 8 https://www.alphaknockout.com Overview of the Dot Plot Window size: 10 bp Forward Reverse Complement Sequence 12 Note: The sequence of homologous arms and cKO region is aligned with itself to determine if there are tandem repeats.
    [Show full text]
  • Anaplastic Lymphoma Kinase (ALK): Structure, Oncogenic Activation, and Pharmacological Inhibition
    Pharmacological Research 68 (2013) 68–94 Contents lists available at SciVerse ScienceDirect Pharmacological Research jo urnal homepage: www.elsevier.com/locate/yphrs Invited review Anaplastic lymphoma kinase (ALK): Structure, oncogenic activation, and pharmacological inhibition ∗ Robert Roskoski Jr. Blue Ridge Institute for Medical Research, 3754 Brevard Road, Suite 116, Box 19, Horse Shoe, NC 28742, USA a r t i c l e i n f o a b s t r a c t Article history: Anaplastic lymphoma kinase was first described in 1994 as the NPM-ALK fusion protein that is expressed Received 14 November 2012 in the majority of anaplastic large-cell lymphomas. ALK is a receptor protein-tyrosine kinase that was Accepted 18 November 2012 more fully characterized in 1997. Physiological ALK participates in embryonic nervous system develop- ment, but its expression decreases after birth. ALK is a member of the insulin receptor superfamily and Keywords: is most closely related to leukocyte tyrosine kinase (Ltk), which is a receptor protein-tyrosine kinase. Crizotinib Twenty different ALK-fusion proteins have been described that result from various chromosomal rear- Drug discovery rangements, and they have been implicated in the pathogenesis of several diseases including anaplastic Non-small cell lung cancer large-cell lymphoma, diffuse large B-cell lymphoma, and inflammatory myofibroblastic tumors. The Protein kinase inhibitor EML4-ALK fusion protein and four other ALK-fusion proteins play a fundamental role in the development Targeted cancer therapy Acquired drug resistance in about 5% of non-small cell lung cancers. The formation of dimers by the amino-terminal portion of the ALK fusion proteins results in the activation of the ALK protein kinase domain that plays a key role in the tumorigenic process.
    [Show full text]
  • Genome-Wide Expression Profiling Establishes Novel Modulatory Roles
    Batra et al. BMC Genomics (2017) 18:252 DOI 10.1186/s12864-017-3635-4 RESEARCHARTICLE Open Access Genome-wide expression profiling establishes novel modulatory roles of vitamin C in THP-1 human monocytic cell line Sakshi Dhingra Batra, Malobi Nandi, Kriti Sikri and Jaya Sivaswami Tyagi* Abstract Background: Vitamin C (vit C) is an essential dietary nutrient, which is a potent antioxidant, a free radical scavenger and functions as a cofactor in many enzymatic reactions. Vit C is also considered to enhance the immune effector function of macrophages, which are regarded to be the first line of defence in response to any pathogen. The THP- 1 cell line is widely used for studying macrophage functions and for analyzing host cell-pathogen interactions. Results: We performed a genome-wide temporal gene expression and functional enrichment analysis of THP-1 cells treated with 100 μM of vit C, a physiologically relevant concentration of the vitamin. Modulatory effects of vitamin C on THP-1 cells were revealed by differential expression of genes starting from 8 h onwards. The number of differentially expressed genes peaked at the earliest time-point i.e. 8 h followed by temporal decline till 96 h. Further, functional enrichment analysis based on statistically stringent criteria revealed a gamut of functional responses, namely, ‘Regulation of gene expression’, ‘Signal transduction’, ‘Cell cycle’, ‘Immune system process’, ‘cAMP metabolic process’, ‘Cholesterol transport’ and ‘Ion homeostasis’. A comparative analysis of vit C-mediated modulation of gene expression data in THP-1cells and human skin fibroblasts disclosed an overlap in certain functional processes such as ‘Regulation of transcription’, ‘Cell cycle’ and ‘Extracellular matrix organization’, and THP-1 specific responses, namely, ‘Regulation of gene expression’ and ‘Ion homeostasis’.
    [Show full text]
  • Evolution and Diversity of Copy Number Variation in the Great Ape Lineage
    Downloaded from genome.cshlp.org on September 24, 2021 - Published by Cold Spring Harbor Laboratory Press Evolution and diversity of copy number variation in the great ape lineage Peter H. Sudmant1, John Huddleston1,7, Claudia R. Catacchio2, Maika Malig1, LaDeana W. Hillier3, Carl Baker1, Kiana Mohajeri1, Ivanela Kondova4, Ronald E. Bontrop4, Stephan Persengiev4, Francesca Antonacci2, Mario Ventura2, Javier Prado-Martinez5, Tomas 5,6 1,7 Marques-Bonet , and Evan E. Eichler 1. Department of Genome Sciences, University of Washington, Seattle, WA, USA 2. University of Bari, Bari, Italy 3. The Genome Institute, Washington University School of Medicine, St. Louis, MO, USA 4. Department of Comparative Genetics, Biomedical Primate Research Centre, Rijswijk, The Netherlands 5. Institut de Biologia Evolutiva, (UPF-CSIC) Barcelona, Spain 6. Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Spain 7. Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA Correspondence to: Evan Eichler Department of Genome Sciences University of Washington School of Medicine Foege S-413A, Box 355065 3720 15th Ave NE Seattle, WA 98195 E-mail: [email protected] 1 Downloaded from genome.cshlp.org on September 24, 2021 - Published by Cold Spring Harbor Laboratory Press ABSTRACT Copy number variation (CNV) contributes to the genetic basis of disease and has significantly restructured the genomes of humans and great apes. The diversity and rate of this process, however, has not been extensively explored among the great ape lineages. We analyzed 97 deeply sequenced great ape and human genomes and estimate that 16% (469 Mbp) of the hominid genome has been affected by recent copy number changes.
    [Show full text]