Gene Network Analysis in a Pediatric Cohort Identifies Novel Lung Function Genes

Total Page:16

File Type:pdf, Size:1020Kb

Gene Network Analysis in a Pediatric Cohort Identifies Novel Lung Function Genes Gene Network Analysis in a Pediatric Cohort Identifies Novel Lung Function Genes Bruce A. Ong1.¤,JinLi2., Joseph M. McDonough1, Zhi Wei3, Cecilia Kim2, Rosetta Chiavacci2, Frank Mentch2, Jason B. Caboot4, Jonathan Spergel5,7,8, Julian L. Allen1,7, Patrick M. A. Sleiman2,6,7*, Hakon Hakonarson1,2,6,7* 1 Division of Pulmonary Medicine and Cystic Fibrosis Center, The Children’s Hospital of Philadelphia, Philadelphia, Pennsylvania, United States of America, 2 Center for Applied Genomics, The Children’s Hospital of Philadelphia, Philadelphia, Pennsylvania, United States of America, 3 Department of Computer Science, New Jersey Institute of Technology, Newark, New Jersey, United States of America, 4 Division of Pediatric Pulmonology, Madigan Army Medical Center, Tacoma, Washington, United States of America, 5 Center for Pediatric Eosinophilic Disorders, The Children’s Hospital of Philadelphia, Philadelphia, Pennsylvania, United States of America, 6 Division of Human Genetics, The Children’s Hospital of Philadelphia, Philadelphia, Pennsylvania, United States of America, 7 Department of Pediatrics, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania, United States of America, 8 Division of Allergy and Immunology, The Children’s Hospital of Philadelphia, Philadelphia, Pennsylvania, United States of America Abstract Lung function is a heritable trait and serves as an important clinical predictor of morbidity and mortality for pulmonary conditions in adults, however, despite its importance, no studies have focused on uncovering pediatric-specific loci influencing lung function. To identify novel genetic determinants of pediatric lung function, we conducted a genome-wide association study (GWAS) of four pulmonary function traits, including FVC, FEV1, FEV1/FVC and FEF25–75% in 1556 children. Further, we carried out gene network analyses for each trait including all SNPs with a P-value of ,1.061023 from the individual GWAS. The GWAS identified SNPs with notable trends towards association with the pulmonary function 27 measures, including the previously described INTS12 locus association with FEV1 (pmeta = 1.41610 ). The gene network analyses identified 34 networks of genes associated with pulmonary function variables in Caucasians. Of those, the 24 glycoprotein gene network reached genome-wide significance for all four variables. P-value range pmeta = 6.29610 - 2.8061028 on meta-analysis. In this study, we report on specific pathways that are significantly associated with pediatric lung function at genome-wide significance. In addition, we report the first loci associated with lung function in both pediatric Caucasian and African American populations. Citation: Ong BA, Li J, McDonough JM, Wei Z, Kim C, et al. (2013) Gene Network Analysis in a Pediatric Cohort Identifies Novel Lung Function Genes. PLoS ONE 8(9): e72899. doi:10.1371/journal.pone.0072899 Editor: Wenge Guo, New Jersey Institute of Technology, United States of America Received April 8, 2013; Accepted July 15, 2013; Published September 2, 2013 Copyright: ß 2013 Ong et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Funding: This study was funded by an Institutional Development Fund to The Center for Applied Genomics, and by a Pennsylvania State Grant (SAP Number: 4100042728). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. Competing Interests: The authors have declared that no competing interests exist. * E-mail: [email protected] (HH); [email protected] (PMAS) ¤ Current address: Division of Pediatric Pulmonology, Tripler Army Medical Center, Honolulu, Hawaii, United States of America . These authors contributed equally to this work. Introduction gene with large effect underlying normal variation in lung function has been identified, indicating that a polygenic model of The respiratory system plays a central role in overall health and inheritance is most likely to determine lung function [5,7]. In lung-related disease. Spirometry is a commonly used technique to addition to a genetic determinants, environmental variables are measure lung function as well as identify respiratory illness and known to affect lung function. Smoking is a well established follow lung disease progression. Commonly used spirometric environmental detriment to lung function, and children living near indices include the forced expiratory volume in 1 second (FEV1), highways have also been shown to have significantly reduced lung the forced vital capacity (FVC), and the FEV1/FVC ratio. Prior function [8]. studies have demonstrated that pulmonary function can predict The first genome-wide association study (GWAS) to look at both pulmonary morbidity and mortality [1,2]. adult pulmonary function, specifically FEV1/FVC was reported in Both genetic and environmental factors influence lung function. 2009 [9]. Recently, three large-cohort-based studies have reported Population-based studies have described significant correlations a total of 28 additional loci significantly associated with pulmonary between pulmonary function measures among monozygotic twins function measures in adults [10,11,12]. However, these loci are [3], and heritability estimates of lung function in excess of 40% estimated to explain only a small portion of the variation in FEV1 have been reported [4,5]. Longitudinal studies have also and FEV1/FVC and do not address the unique aspects of determined that the decline in lung function is also influenced pulmonary function in developing children or African-Americans. by genetic factors [6]. While the heritability estimates of lung It is likely that a large number of loci influencing the heritable function are high, no definitive evidence of a single Mendelian variation in lung function remain unidentified [10,11,12]. PLOS ONE | www.plosone.org 1 September 2013 | Volume 8 | Issue 9 | e72899 Network Analysis and Pediatric Lung Function Genes Two factors make children a unique population to study lung genotype clustering files were used to process the genotyping function. First, lung function rapidly increases from childhood to signals and generate genotype calls. adolescence and by age 21 reaches a plateau phase followed by a slow but steady decline [13]. Thus, studying pulmonary function Quality Control in children could yield associations with loci related to lung After genotyping, only samples with a call rate .95% and development which may not manifest phenotypic expression in genotyping markers with a missing rate ,5%, minor allele adults. Second, it is possible that effects such as cigarette smoke frequency .0.01 and HWE-p-value .0.0001 were retained for exposure or other environmental factors over time may weaken analysis. We further extracted markers shared by 550k and 610k phenotype expression and mask associations with genetic loci. It is chips for the association study. The population structure of all the plausible that the pediatric population may exhibit a stronger samples was confirmed by Multi-Dimensional Scaling (MDS) association between pulmonary function measures and genetic analysis using PLINK software version 1.06 [15]. We detected determinants. cryptic related or duplicated samples in the cohort by determining Using pulmonary function data from children undergoing the whole genome identity-by-descent (IBD) scores with PLINK spirometry testing at the Children’s Hospital of Philadelphia, we software and excluded one sample from each pair with IBD scores conducted a GWAS to identify loci influencing pediatric lung .0.25. function. To enhance study power, we additionally applied a pathway-based approach to search for gene networks influencing Association Analysis lung function. We and others have previously shown that GWAS Linear regression was performed at each SNP for the four lung derived gene network analyses can effectively uncover significant function traits using PLINK software version 1.06 [15]. In the associations between gene pathways and complex diseases that are Caucasian cohort, a subset of the subjects had cystic fibrosis, not discovered by individual SNP analyses [14]. asthma or cancer diagnoses, all of which were treated as covariates in the regression model. In the African American cohort, Materials and Methods diagnoses of sickle cell disease, asthma and cancer were similarly treated as covariates. All four lung function phenotypes were Subjects approximately normally distributed (Figure S1), a linear regression This study was approved by the institutional review board of the model was utilized to evaluate the association between each of the Children’s Hospital of Philadelphia. In accordance with IRB phenotypes and the SNP genotype within Caucasian and African protocol, all subjects eligible for participation in this study were American cohorts separately. We also conducted meta-analyses enrolled after obtaining informed written consent either from their using the PLINK and METAL packages [16]. parents, if under the age of 18 with the child’s assent, or directly from the subject if aged 18 or over. The study cohort consisted of Genotype Imputation children and young adults between 5–21 years of age that had completed pulmonary function testing and genotype analysis at the We performed pre-phasing imputation of the two loci on Children’s Hospital of Philadelphia.
Recommended publications
  • Variation in Protein Coding Genes Identifies Information
    bioRxiv preprint doi: https://doi.org/10.1101/679456; this version posted June 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. Animal complexity and information flow 1 1 2 3 4 5 Variation in protein coding genes identifies information flow as a contributor to 6 animal complexity 7 8 Jack Dean, Daniela Lopes Cardoso and Colin Sharpe* 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 Institute of Biological and Biomedical Sciences 25 School of Biological Science 26 University of Portsmouth, 27 Portsmouth, UK 28 PO16 7YH 29 30 * Author for correspondence 31 [email protected] 32 33 Orcid numbers: 34 DLC: 0000-0003-2683-1745 35 CS: 0000-0002-5022-0840 36 37 38 39 40 41 42 43 44 45 46 47 48 49 Abstract bioRxiv preprint doi: https://doi.org/10.1101/679456; this version posted June 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. Animal complexity and information flow 2 1 Across the metazoans there is a trend towards greater organismal complexity. How 2 complexity is generated, however, is uncertain. Since C.elegans and humans have 3 approximately the same number of genes, the explanation will depend on how genes are 4 used, rather than their absolute number.
    [Show full text]
  • PDF Output of CLIC (Clustering by Inferred Co-Expression)
    PDF Output of CLIC (clustering by inferred co-expression) Dataset: Num of genes in input gene set: 14 Total number of genes: 16493 CLIC PDF output has three sections: 1) Overview of Co-Expression Modules (CEMs) Heatmap shows pairwise correlations between all genes in the input query gene set. Red lines shows the partition of input genes into CEMs, ordered by CEM strength. Each row shows one gene, and the brightness of squares indicates its correlations with other genes. Gene symbols are shown at left side and on the top of the heatmap. 2) Details of each CEM and its expansion CEM+ Top panel shows the posterior selection probability (dataset weights) for top GEO series datasets. Bottom panel shows the CEM genes (blue rows) as well as expanded CEM+ genes (green rows). Each column is one GEO series dataset, sorted by their posterior probability of being selected. The brightness of squares indicates the gene's correlations with CEM genes in the corresponding dataset. CEM+ includes genes that co-express with CEM genes in high-weight datasets, measured by LLR score. 3) Details of each GEO series dataset and its expression profile: Top panel shows the detailed information (e.g. title, summary) for the GEO series dataset. Bottom panel shows the background distribution and the expression profile for CEM genes in this dataset. Overview of Co-Expression Modules (CEMs) with Dataset Weighting Scale of average Pearson correlations Num of Genes in Query Geneset: 14. Num of CEMs: 1. 0.0 0.2 0.4 0.6 0.8 1.0 Cpsf3l Polr2b Ints3 Ints7 Ints1 Ints4 Ints9 Ints2
    [Show full text]
  • PHF22 (INTS12) (NM 020395) Human Over-Expression Lysate Product Data
    OriGene Technologies, Inc. 9620 Medical Center Drive, Ste 200 Rockville, MD 20850, US Phone: +1-888-267-4436 [email protected] EU: [email protected] CN: [email protected] Product datasheet for LC412498 PHF22 (INTS12) (NM_020395) Human Over-expression Lysate Product data: Product Type: Over-expression Lysates Description: INTS12 HEK293T cell transient overexpression lysate (as WB positive control) Species: Human Expression Host: HEK293T Expression cDNA Clone TrueORF Clone RC204838 or AA Sequence: Tag: C-Myc/DDK Detection Antibodies: Clone OTI4C5, Anti-DDK (FLAG) monoclonal antibody (TA50011-100) Accession Number: NM_020395, NP_065128 Synonyms: INT12; PHF22; SBBI22 Predicted MW: 48.8 kDa Components: 1 vial of 20 ug lyophilized gene specific transient over-expression cell lysate Storage: The lysate can be shipped at ambient temperature. Upon receiving, store the sample at - 20°C. Lysate samples can be reconstituted with SDS Sample Buffer. Avoid repeated freeze- thaw cycles after reconstitution. Lysate samples are stable for 12 months from date of receipt when stored at -20°C. Preparation: HEK293T cells in 10-cm dishes were transiently transfected withM egaTran Transfection Reagent (TT200002) and 5ug TrueORF cDNA plasmid. Transfected cells were cultured for 48hrs before collection. The cells were lysed in modified RIPA buffer (25mM Tris-HCl pH7.6, 150mM NaCl, 1% NP-40, 1mM EDTA, 1xProteinase inhibitor cocktail mix (Sigma), 1mM PMSF and 1mM Na3VO4), and then centrifuged to clarify the lysate. Protein concentration was measured by BCA kit (Thermo Scientific Inc.). To facilitate transportation and protein, the products are supplied as lyophilized proteins. RefSeq: NP_065128 Locus ID: 57117 Cytogenetics: 4q24 Protein Families: Druggable Genome, Transcription Factors This product is to be used for laboratory only.
    [Show full text]
  • (Q36.1;Q24) with a Concurrent Submicroscopic Del(4)(Q23q24) in an Adult with Polycythemia Vera
    cancers Case Report A Novel Acquired t(2;4)(q36.1;q24) with a Concurrent Submicroscopic del(4)(q23q24) in An Adult with Polycythemia Vera Eigil Kjeldsen Cancer Cytogenetic Section, HemoDiagnostic Laboratory, Department of Hematology, Aarhus University Hospital, Tage-Hansens Gade 2, DK-8000 Aarhus C, Denmark; [email protected]; Tel.: +45-7846-7398; Fax: +45-7846-7399 Received: 6 June 2018; Accepted: 21 June 2018; Published: 25 June 2018 Abstract: Background: Polycythemia vera (PV) is a clonal myeloid stem cell disease characterized by a growth-factor independent erythroid proliferation with an inherent tendency to transform into overt acute myeloid malignancy. Approximately 95% of the PV patients harbor the JAK2V617F mutation while less than 35% of the patients harbor cytogenetic abnormalities at the time of diagnosis. Methods and Results: Here we present a JAK2V617F positive PV patient where G-banding revealed an apparently balanced t(2;4)(q35;q21), which was confirmed by 24-color karyotyping. Oligonucleotide array-based Comparative Genomic Hybridization (aCGH) analysis revealed an interstitial 5.4 Mb large deletion at 4q23q24. Locus-specific fluorescent in situ hybridization (FISH) analyses confirmed the mono-allelic 4q deletion and that it was located on der(4)t(2;4). Additional locus-specific bacterial artificial chromosome (BAC) probes and mBanding refined the breakpoint on chromosome 2. With these methods the karyotype was revised to 46,XX,t(2;4)(q36.1;q24)[18]/46,XX[7]. Conclusions: This is the first report on a PV patient associated with an acquired novel t(2;4)(q36.1;q24) and a concurrent submicroscopic deletion del(4)(q23q24).
    [Show full text]
  • GSTCD and INTS12 Regulation and Expression in the Human Lung
    GSTCD and INTS12 Regulation and Expression in the Human Lung Ma’en Obeidat1,4☯, Suzanne Miller1☯, Kelly Probert1, Charlotte K. Billington1, Amanda P. Henry1, Emily Hodge1, Carl P. Nelson1, Ceri E. Stewart1, Caroline Swan1, Louise V. Wain2, María Soler Artigas2, Erik Melén3, Kevin Ushey4, Ke Hao5, Maxime Lamontagne7, Yohan Bossé6,7, Dirkje S. Postma8, Martin D. Tobin2,9, Ian Sayers1, Ian P. Hall1* 1 Division of Respiratory Medicine, University of Nottingham, Queen’s Medical Center, Nottingham, United Kingdom, 2 Genetic Epidemiology Group, Department of Health Sciences, University of Leicester, Leicester, United Kingdom, 3 Institute of Environmental Medicine, Karolinska Institutet and Sachs’ Children’s Hospital, Stockholm, Sweden, 4 James Hogg Research Centre, Institute for Heart and Lung Health, University of British Columbia, Vancouver, British Columbia, Canada, 5 Department of Genetics and Genomic Sciences, Icahn Institute of Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, United States of America, 6 Department of Molecular Medicine, Laval University, Québec City, Canada, 7 Institut Universitaire de Cardiologie et de Pneumologie de Québec, Laval University, Québec City, Canada, 8 Department of Pulmonology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands, 9 National Institute for Health Research (NIHR) Leicester Respiratory Biomedical Research Unit, Glenfield Hospital, Leicester, United Kingdom Abstract Genome-Wide Association Study (GWAS) meta-analyses have identified a strong association signal for lung function, which maps to a region on 4q24 containing two oppositely transcribed genes: glutathione S-transferase, C- terminal domain containing (GSTCD) and integrator complex subunit 12 (INTS12). Both genes were found to be expressed in a range of human airway cell types.
    [Show full text]
  • Coexpression Networks Based on Natural Variation in Human Gene Expression at Baseline and Under Stress
    University of Pennsylvania ScholarlyCommons Publicly Accessible Penn Dissertations Fall 2010 Coexpression Networks Based on Natural Variation in Human Gene Expression at Baseline and Under Stress Renuka Nayak University of Pennsylvania, [email protected] Follow this and additional works at: https://repository.upenn.edu/edissertations Part of the Computational Biology Commons, and the Genomics Commons Recommended Citation Nayak, Renuka, "Coexpression Networks Based on Natural Variation in Human Gene Expression at Baseline and Under Stress" (2010). Publicly Accessible Penn Dissertations. 1559. https://repository.upenn.edu/edissertations/1559 This paper is posted at ScholarlyCommons. https://repository.upenn.edu/edissertations/1559 For more information, please contact [email protected]. Coexpression Networks Based on Natural Variation in Human Gene Expression at Baseline and Under Stress Abstract Genes interact in networks to orchestrate cellular processes. Here, we used coexpression networks based on natural variation in gene expression to study the functions and interactions of human genes. We asked how these networks change in response to stress. First, we studied human coexpression networks at baseline. We constructed networks by identifying correlations in expression levels of 8.9 million gene pairs in immortalized B cells from 295 individuals comprising three independent samples. The resulting networks allowed us to infer interactions between biological processes. We used the network to predict the functions of poorly-characterized human genes, and provided some experimental support. Examining genes implicated in disease, we found that IFIH1, a diabetes susceptibility gene, interacts with YES1, which affects glucose transport. Genes predisposing to the same diseases are clustered non-randomly in the network, suggesting that the network may be used to identify candidate genes that influence disease susceptibility.
    [Show full text]
  • Powerful Gene-Based Testing by Integrating Long-Range Chromatin Interactions and Knockoff Genotypes
    medRxiv preprint doi: https://doi.org/10.1101/2021.07.14.21260405; this version posted July 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license . Powerful gene-based testing by integrating long-range chromatin interactions and knockoff genotypes Shiyang Ma1, James L. Dalgleish1, Justin Lee2, Chen Wang1, Linxi Liu3, Richard Gill4;5, Joseph D. Buxbaum6, Wendy Chung7, Hugues Aschard8, Edwin K. Silverman9, Michael H. Cho9, Zihuai He2;10, Iuliana Ionita-Laza1;# 1 Department of Biostatistics, Columbia University, New York, NY, 10032, USA 2 Quantitative Sciences Unit, Department of Medicine, Stanford University, Stanford, CA, 94305, USA 3 Department of Statistics, University of Pittsburgh, Pittsburgh, PA, 15260, USA 4 Department of Human Genetics, Genentech, South San Francisco, CA, 94080, USA 5 Department of Epidemiology, Columbia University, New York, NY, 10032, USA 6 Departments of Psychiatry, Neuroscience, and Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA 7 Department of Pediatrics and Medicine, Herbert Irving Comprehensive Cancer Center, Columbia Uni- versity Irving Medical Center, New York, NY, 10032, USA 8 Department of Computational Biology, Institut Pasteur, Paris, France 9 Channing Division of Network Medicine and Division of Pulmonary and Critical Care Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA 02115, USA 10 Department of Neurology and Neurological Sciences, Stanford University, Stanford, CA 94305, USA # e-mail: [email protected] Abstract Gene-based tests are valuable techniques for identifying genetic factors in complex traits.
    [Show full text]
  • Whole-Genome Association Analyses of Sleep-Disordered Breathing Phenotypes in the NHLBI Topmed Program Brian E
    Cade et al. Genome Medicine (2021) 13:136 https://doi.org/10.1186/s13073-021-00917-8 RESEARCH Open Access Whole-genome association analyses of sleep-disordered breathing phenotypes in the NHLBI TOPMed program Brian E. Cade1,2,3* , Jiwon Lee1, Tamar Sofer1,2, Heming Wang1,2,3, Man Zhang4, Han Chen5,6, Sina A. Gharib7, Daniel J. Gottlieb1,2,8, Xiuqing Guo9, Jacqueline M. Lane1,2,3,10, Jingjing Liang11, Xihong Lin12, Hao Mei13, Sanjay R. Patel14, Shaun M. Purcell1,2,3, Richa Saxena1,2,3,10, Neomi A. Shah15, Daniel S. Evans16, Craig L. Hanis5, David R. Hillman17, Sutapa Mukherjee18,19, Lyle J. Palmer20, Katie L. Stone16, Gregory J. Tranah16, NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium, Gonçalo R. Abecasis21,EricA.Boerwinkle5,22, Adolfo Correa23,24, L. Adrienne Cupples25,26,RobertC.Kaplan27, Deborah A. Nickerson28,29,KariE.North30,BruceM.Psaty31,32, Jerome I. Rotter9,StephenS.Rich33, Russell P. Tracy34, Ramachandran S. Vasan26,35,36, James G. Wilson37,XiaofengZhu11, Susan Redline1,2,38 andTOPMedSleepWorkingGroup Abstract Background: Sleep-disordered breathing is a common disorder associated with significant morbidity. The genetic architecture of sleep-disordered breathing remains poorly understood. Through the NHLBI Trans-Omics for Precision Medicine (TOPMed) program, we performed the first whole-genome sequence analysis of sleep-disordered breathing. Methods: The study sample was comprised of 7988 individuals of diverse ancestry. Common-variant and pathway analyses included an additional 13,257 individuals. We examined five complementary traits describing different aspects of sleep-disordered breathing: the apnea-hypopnea index, average oxyhemoglobin desaturation per event, average and minimum oxyhemoglobin saturation across the sleep episode, and the percentage of sleep with oxyhemoglobin saturation < 90%.
    [Show full text]
  • Us 2018 / 0305689 A1
    US 20180305689A1 ( 19 ) United States (12 ) Patent Application Publication ( 10) Pub . No. : US 2018 /0305689 A1 Sætrom et al. ( 43 ) Pub . Date: Oct. 25 , 2018 ( 54 ) SARNA COMPOSITIONS AND METHODS OF plication No . 62 /150 , 895 , filed on Apr. 22 , 2015 , USE provisional application No . 62/ 150 ,904 , filed on Apr. 22 , 2015 , provisional application No. 62 / 150 , 908 , (71 ) Applicant: MINA THERAPEUTICS LIMITED , filed on Apr. 22 , 2015 , provisional application No. LONDON (GB ) 62 / 150 , 900 , filed on Apr. 22 , 2015 . (72 ) Inventors : Pål Sætrom , Trondheim (NO ) ; Endre Publication Classification Bakken Stovner , Trondheim (NO ) (51 ) Int . CI. C12N 15 / 113 (2006 .01 ) (21 ) Appl. No. : 15 /568 , 046 (52 ) U . S . CI. (22 ) PCT Filed : Apr. 21 , 2016 CPC .. .. .. C12N 15 / 113 ( 2013 .01 ) ; C12N 2310 / 34 ( 2013. 01 ) ; C12N 2310 /14 (2013 . 01 ) ; C12N ( 86 ) PCT No .: PCT/ GB2016 /051116 2310 / 11 (2013 .01 ) $ 371 ( c ) ( 1 ) , ( 2 ) Date : Oct . 20 , 2017 (57 ) ABSTRACT The invention relates to oligonucleotides , e . g . , saRNAS Related U . S . Application Data useful in upregulating the expression of a target gene and (60 ) Provisional application No . 62 / 150 ,892 , filed on Apr. therapeutic compositions comprising such oligonucleotides . 22 , 2015 , provisional application No . 62 / 150 ,893 , Methods of using the oligonucleotides and the therapeutic filed on Apr. 22 , 2015 , provisional application No . compositions are also provided . 62 / 150 ,897 , filed on Apr. 22 , 2015 , provisional ap Specification includes a Sequence Listing . SARNA sense strand (Fessenger 3 ' SARNA antisense strand (Guide ) Mathew, Si Target antisense RNA transcript, e . g . NAT Target Coding strand Gene Transcription start site ( T55 ) TY{ { ? ? Targeted Target transcript , e .
    [Show full text]
  • Molecular Basis for the Interaction Between Integrator Subunits Ints9 and Ints11 and Its Functional Importance
    Molecular basis for the interaction between Integrator subunits IntS9 and IntS11 and its functional importance Yixuan Wua,1, Todd R. Albrechtb,1, David Baillatb, Eric J. Wagnerb,2, and Liang Tonga,2 aDepartment of Biological Sciences, Columbia University, New York, NY 10027; and bDepartment of Biochemistry and Molecular Biology, The University of Texas Medical Branch, Galveston, TX 77550 Edited by Michael Sattler, Helmholtz Zentrum München, Neuherberg, Germany, and accepted by Editorial Board Member Dinshaw J. Patel, March 16, 2017 (received for review October 5, 2016) The metazoan Integrator complex (INT) has important functions in catalysis (Fig. 1A and Fig. S1) (18). CPSF-73, IntS11, and their the 3′-end processing of noncoding RNAs, including the uridine- close homologs also have a β-CASP domain (named after its rich small nuclear RNA (UsnRNA) and enhancer RNA (eRNA), and in founding members: CPSF73, Artemis, SNM, and PSO), which is the transcription of coding genes by RNA polymerase II. The INT an insert in the metallo-β-lactamase domain (Fig. 1A). The active contains at least 14 subunits, but its molecular mechanism of ac- site is located at the interface between the two domains, and tion is poorly understood, because currently there is little struc- therefore the β-CASP domain likely regulates substrate access to tural information about its subunits. The endonuclease activity of the active site. The metallo-β-lactamase and the β-CASP do- INT is mediated by its subunit 11 (IntS11), which belongs to the mains of human IntS11 (residues 1–450) (Fig. S1) and CPSF- metallo-β-lactamase superfamily and is a paralog of CPSF-73, the 73 share 40% amino acid sequence identity, indicating their high endonuclease for pre-mRNA 3′-end processing.
    [Show full text]
  • DSIF and NELF Interact with Integrator to Specify the Correct Post-Transcriptional Fate of Snrna Genes
    ARTICLE Received 28 Mar 2014 | Accepted 1 Jun 2014 | Published 27 Jun 2014 DOI: 10.1038/ncomms5263 DSIF and NELF interact with Integrator to specify the correct post-transcriptional fate of snRNA genes Junichi Yamamoto1, Yuri Hagiwara1, Kunitoshi Chiba1, Tomoyasu Isobe1, Takashi Narita2, Hiroshi Handa3 & Yuki Yamaguchi1,4 The elongation factors DSIF and NELF are responsible for promoter-proximal RNA polymerase II (Pol II) pausing. NELF is also involved in 30 processing of replication-dependent histone genes, which produce non-polyadenylated mRNAs. Here we show that DSIF and NELF contribute to the synthesis of small nuclear RNAs (snRNAs) through their association with Integrator, the large multisubunit complex responsible for 30 processing of pre-snRNAs. In HeLa cells, Pol II, Integrator, DSIF and NELF accumulate at the 30 end of the U1 snRNA gene. Knockdown of NELF results in misprocessing of U1, U2, U4 and U5 snRNAs, while DSIF is required for proper transcription of these genes. Knocking down NELF also disrupts transcription termination and induces the production of polyadenylated U1 transcripts caused by an enhanced recruitment of cleavage stimulation factor. Our results indicate that NELF plays a key role in determining the post-transcriptional fate of Pol II-transcribed genes. 1 Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, Yokohama 226-8501, Japan. 2 Graduate School of Frontier Biosciences, Osaka University, Suita 565-0871, Japan. 3 Department of Nanoparticle Translational Research, Tokyo Medical University, Tokyo 160-8402, Japan. 4 PRESTO, Japan Science and Technology Agency, Kawaguchi 332-0012, Japan. Correspondence and requests for materials should be addressed to Y.Y.
    [Show full text]
  • The Integrator Complex Terminates Promoter-Proximal Transcription at Protein- Coding Genes
    bioRxiv preprint doi: https://doi.org/10.1101/725507; this version posted August 5, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. The Integrator complex terminates promoter-proximal transcription at protein- coding genes Nathan D. Elrod1#, Telmo Henriques2#, Kai-Lieh Huang1, Deirdre C. Tatomer3, Jeremy E. Wilusz3, Eric J. Wagner1* & Karen Adelman2* Affiliations: 1Department of Biochemistry and Molecular Biology, University of Texas Medical Branch at Galveston, Galveston, TX 77550, USA. 2Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, Boston, MA 02115, USA. 3Department of Biochemistry and Biophysics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA. #These authors contributed equally to this work. *Corresponding authors E-mail: [email protected] (K.A.); [email protected] (E.J.W.) Lead Contact: [email protected] Highlights: • The Integrator complex inhibits transcription elongation at ~15% of mRNA genes • Integrator targets promoter-proximally paused Pol II for termination • The RNA endonuclease of Integrator subunit 11 is critical for gene attenuation • Integrator-repressed genes are enriched in signaling and growth-responsive pathways 1 bioRxiv preprint doi: https://doi.org/10.1101/725507; this version posted August 5, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.
    [Show full text]