Feature Selection for Longitudinal Data by Using Sign Averages to Summarize Gene Expression Values Over Time

Total Page:16

File Type:pdf, Size:1020Kb

Feature Selection for Longitudinal Data by Using Sign Averages to Summarize Gene Expression Values Over Time University of Kentucky UKnowledge Biostatistics Faculty Publications Biostatistics 3-19-2019 Feature Selection for Longitudinal Data by Using Sign Averages to Summarize Gene Expression Values over Time Suyan Tian The First Hospital of Jilin University, China, [email protected] Chi Wang University of Kentucky, [email protected] Right click to open a feedback form in a new tab to let us know how this document benefits oy u. Follow this and additional works at: https://uknowledge.uky.edu/biostatistics_facpub Part of the Biostatistics Commons, Computational Biology Commons, Longitudinal Data Analysis and Time Series Commons, and the Microarrays Commons Repository Citation Tian, Suyan and Wang, Chi, "Feature Selection for Longitudinal Data by Using Sign Averages to Summarize Gene Expression Values over Time" (2019). Biostatistics Faculty Publications. 43. https://uknowledge.uky.edu/biostatistics_facpub/43 This Article is brought to you for free and open access by the Biostatistics at UKnowledge. It has been accepted for inclusion in Biostatistics Faculty Publications by an authorized administrator of UKnowledge. For more information, please contact [email protected]. Feature Selection for Longitudinal Data by Using Sign Averages to Summarize Gene Expression Values over Time Notes/Citation Information Published in BioMed Research International, v. 2019, article ID 1724898, p. 1-12. © 2019 Suyan Tian and Chi Wang. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Digital Object Identifier (DOI) https://doi.org/10.1155/2019/1724898 This article is available at UKnowledge: https://uknowledge.uky.edu/biostatistics_facpub/43 Hindawi BioMed Research International Volume 2019, Article ID 1724898, 12 pages https://doi.org/10.1155/2019/1724898 Research Article Feature Selection for Longitudinal Data by Using Sign Averages to Summarize Gene Expression Values over Time Suyan Tian 1,2 and Chi Wang 3,4 Division of Clinical Research, e First Hospital of Jilin University, Xinmin Street, Changchun, Jilin , China Center for Applied Statistical Research, School of Mathematics, Jilin University, Qianjin Street, Changchun, Jilin , China Department of Biostatistics, College of Public Health, University of Kentucky, Lexington, KY , USA Markey Cancer Center, University of Kentucky, Rose St., Lexington, KY , USA Correspondence should be addressed to Suyan Tian; [email protected] and Chi Wang; [email protected] Received 28 November 2018; Accepted 25 February 2019; Published 19 March 2019 Academic Editor: Betti Giusti Copyright © 2019 Suyan Tian and Chi Wang. Tis is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. With the rapid evolution of high-throughput technologies, time series/longitudinal high-throughput experiments have become possible and afordable. However, the development of statistical methods dealing with gene expression profles across time points has not kept up with the explosion of such data. Te feature selection process is of critical importance for longitudinal microarray data. In this study, we proposed aggregating a gene’s expression values across time into a single value using the sign average method, thereby degrading a longitudinal feature selection process into a classic one. Regularized logistic regression models with pseudogenes (i.e., the sign average of genes across time as predictors) were then optimized by either the coordinate descent method or the threshold gradient descent regularization method. By applying the proposed methods to simulated data and a traumatic injury dataset, we have demonstrated that the proposed methods, especially for the combination of sign average and threshold gradient descent regularization, outperform other competitive algorithms. To conclude, the proposed methods are highly recommended for studies with the objective of carrying out feature selection for longitudinal gene expression data. 1. Introduction with expression profles across time points has not kept pace. Feature selection, a mighty tool to tackle the high dimen- One commonly used strategy is to stratify time series sionality issue accompanying high-throughput experiments data into separate time points and then analyze these points where the number of measured features (e.g., genes or separately. Tis approach may lead to inefciency in statistical metabolites), is much larger than that of samples and has been power by ignoring the highly correlated structure of gene employed with increasing frequency in many research areas, expression values across time and thus result in failure to including biomedical research. Te ultimate goal of feature detect patterns of change across time [1–3]. selection is to correctly identify features associated with the An alternative strategy to conduct feature selection for phenotypes of interest while ruling out irrelevant features as longitudinal gene expression data is to use statistical methods much as possible. capable of detecting diferent expression patterns across time Because biological systems or processes are dynamic, between groups. Examples include Signifcance Analysis of it is useful for researchers to investigate gene expression Microarray [4], Extraction of Diferential Gene Expression patterns across time in order to capture biologically mean- (EDGE) [1, 5], Linear Models for Microarray Data (limma) ingful dynamic changes. With the rapid evolution of high- [6], and Microarray Signifcant profles [7]. EDGE uses a throughput technology, time series/longitudinal microarray spline approach and is one of the frst methods to specifcally experiments have become possible and even afordable. address identifcation of diferentially expressed genes across However, development of specifc statistical methods dealing time [8]. In contrast, the limma method has a more general 2 BioMed Research International purpose and is easily understood and implemented [7]; association with the phenotype of interest at diferent time therefore, it has gained extreme popularity and become points, the mean average operator that does not account for the gold standard to detect diferentially expressed genes efect directions may cancel out the diferent time efects of under diferent scenarios (e.g., two-group or multiple-group this gene [20]. In contrast, construction of pathway dereg- comparison) for microarray data. Nevertheless, because the ulation scores is more theoretically complicated and com- limma method usually does not correctly account for the putationally intensive, requiring involvement of an expert order of time points or the correlation structure introduced statistician. For the frst principal component summary, it is by multiple observations from the same subject, it tends well known that the direction of the largest variance of the to be outperformed by other relevant methods. Since these gene expression values is pinpointed instead of the genes that statistical strategies usually screen genes one by one according are most related to the phenotypes of interest. to the magnitude of a gene’s relevance to the phenotype of Te sign average [21, 22], also known as the Direction interest, they may be classifed as the flter methods [9]. Te Aware Average [20], takes into account the directions of big drawback of flter methods is that many false positive association between genes and the phenotype of interest as genes remain in the fnal model [9]. well as genes’ expression values and might be less subject Some researchers have extended two typical longitudinal to overftting since these directions are more robust than data analysis strategies, namely, the generalized estimating their estimated efects [20, 21]. As opposed to the average equation (GEE) method [10] and a mixed model [11], to operator, the sign average considers not only the expression carry out feature selection for time series gene expression values of a gene at each time point but also the direction profles. Te GEE-based screening procedure [3], penalized- of its association with the phenotype of interest at those GEE (PGEE) [2], and glmmLasso [12] methods belong to this time points; therefore, positive and negative associations do category. Among them, the GEE-based screening procedure not cancel each other out. Te sign average is an average in fts a GEE model to each gene and then flters out the non- essence, however, and not as sophisticated as the pathway signifcant genes. By fltering genes one by one, this procedure deregulation scores. Given the fact that the sign average is is very likely to mistakenly include redundant genes highly capable of mitigating these two drawbacks simultaneously, correlated with the true relevant genes in the fnal gene list. it may be a more suitable choice to summarize a genes’ Te PGEE algorithm [2] adds the SCAD penalty term [13] to expression value at the gene set level or a gene’s expression the corresponding quasilikelihood function of a GEE model values over time. to implement feature selection and model construction. In In this study, we consider a scenario that has a long contrast, the glmmLasso method [12] maximizes the corre- history—traumatic injury with subsequent infection. In sponding penalized log likelihood function of a generalized ancient times, traumatic injury with
Recommended publications
  • Deregulated Gene Expression Pathways in Myelodysplastic Syndrome Hematopoietic Stem Cells
    Leukemia (2010) 24, 756–764 & 2010 Macmillan Publishers Limited All rights reserved 0887-6924/10 $32.00 www.nature.com/leu ORIGINAL ARTICLE Deregulated gene expression pathways in myelodysplastic syndrome hematopoietic stem cells A Pellagatti1, M Cazzola2, A Giagounidis3, J Perry1, L Malcovati2, MG Della Porta2,MJa¨dersten4, S Killick5, A Verma6, CJ Norbury7, E Hellstro¨m-Lindberg4, JS Wainscoat1 and J Boultwood1 1LRF Molecular Haematology Unit, NDCLS, John Radcliffe Hospital, Oxford, UK; 2Department of Hematology Oncology, University of Pavia Medical School, Fondazione IRCCS Policlinico San Matteo, Pavia, Italy; 3Medizinische Klinik II, St Johannes Hospital, Duisburg, Germany; 4Division of Hematology, Department of Medicine, Karolinska Institutet, Stockholm, Sweden; 5Department of Haematology, Royal Bournemouth Hospital, Bournemouth, UK; 6Albert Einstein College of Medicine, Bronx, NY, USA and 7Sir William Dunn School of Pathology, University of Oxford, Oxford, UK To gain insight into the molecular pathogenesis of the the World Health Organization.6,7 Patients with refractory myelodysplastic syndromes (MDS), we performed global gene anemia (RA) with or without ringed sideroblasts, according to expression profiling and pathway analysis on the hemato- poietic stem cells (HSC) of 183 MDS patients as compared with the the French–American–British classification, were subdivided HSC of 17 healthy controls. The most significantly deregulated based on the presence or absence of multilineage dysplasia. In pathways in MDS include interferon signaling, thrombopoietin addition, patients with RA with excess blasts (RAEB) were signaling and the Wnt pathways. Among the most signifi- subdivided into two categories, RAEB1 and RAEB2, based on the cantly deregulated gene pathways in early MDS are immuno- percentage of bone marrow blasts.
    [Show full text]
  • Analysis of Gene Expression Data for Gene Ontology
    ANALYSIS OF GENE EXPRESSION DATA FOR GENE ONTOLOGY BASED PROTEIN FUNCTION PREDICTION A Thesis Presented to The Graduate Faculty of The University of Akron In Partial Fulfillment of the Requirements for the Degree Master of Science Robert Daniel Macholan May 2011 ANALYSIS OF GENE EXPRESSION DATA FOR GENE ONTOLOGY BASED PROTEIN FUNCTION PREDICTION Robert Daniel Macholan Thesis Approved: Accepted: _______________________________ _______________________________ Advisor Department Chair Dr. Zhong-Hui Duan Dr. Chien-Chung Chan _______________________________ _______________________________ Committee Member Dean of the College Dr. Chien-Chung Chan Dr. Chand K. Midha _______________________________ _______________________________ Committee Member Dean of the Graduate School Dr. Yingcai Xiao Dr. George R. Newkome _______________________________ Date ii ABSTRACT A tremendous increase in genomic data has encouraged biologists to turn to bioinformatics in order to assist in its interpretation and processing. One of the present challenges that need to be overcome in order to understand this data more completely is the development of a reliable method to accurately predict the function of a protein from its genomic information. This study focuses on developing an effective algorithm for protein function prediction. The algorithm is based on proteins that have similar expression patterns. The similarity of the expression data is determined using a novel measure, the slope matrix. The slope matrix introduces a normalized method for the comparison of expression levels throughout a proteome. The algorithm is tested using real microarray gene expression data. Their functions are characterized using gene ontology annotations. The results of the case study indicate the protein function prediction algorithm developed is comparable to the prediction algorithms that are based on the annotations of homologous proteins.
    [Show full text]
  • Trait Locus Chr Position P-Value Effect Effect SE Ref MA MAF Gene Gene Function FP Rs137787931 14 1880378 4.27E-07
    Table S1. The SNPs which reached significance in single-locus association for fat percentages at suggestive threshold (P < 2.22 × 10-5). Trait Locus Chr Position P-Value Effect Effect SE Ref MA MAF Gene Gene function FP rs137787931 14 1,880,378 4.27E-07 -0.06 0.01 T C 0.42 MROH1 - rs134432442 14 1,736,599 8.69E-07 0.06 0.01 C T 0.49 CPSF1 mRNA polyadenylation FP = fat percentage, Chr = chromosome, Ref = reference allele, MA = minor allele, MAF = minor allele frequency, MROH1 = maestro heat like repeat family member 1, CPSF1 = cleavage and polyadenylation specific factor 1. Table S2. The 1-Mb SNP windows surpass suggestive significance level that is proportion of genetic variance (PVE) at 0.19 % and their window posterior probability of association (WPPA) for percentages of milk fat (FP) and crude protein (CPP), milk urea (MU) and efficiency of crude protein utilization (ECPU). Trait Window Chr Start-end window (Mb) Start SNP End SNP No. of SNP PVE (%) WPPA Gene Gene function FP 1849 18 15-16 15,039,844 15,954,290 21 0.31 0.22 GPT2 Regulation of biosynthesis CPP 323 3 26-27 26,020,004 26,925,312 18 0.36 0.22 TRIM45 Protein ubiquitination 1567 14 62-63 62,081,472 62,960,995 17 0.34 0.12 UBR5 Protein ubiquitination MU 563 5 23-24 23,019,369 23,949,571 19 0.34 0.08 UBE2N Protein ubiquitination 1084 9 75-76 75,026,578 75,935,285 19 0.33 0.14 TNFAIP3 Protein ubiquitination 1677 16 1-2 1,033,239 1,972,109 24 0.32 0.14 ATP2B4, Urinary bladder smooth muscle contraction, REN Kidney development 4 1 3-4 3,079,342 3,987,104 19 0.31 0.18 UBR1 Protein
    [Show full text]
  • Seq2pathway Vignette
    seq2pathway Vignette Bin Wang, Xinan Holly Yang, Arjun Kinstlick May 19, 2021 Contents 1 Abstract 1 2 Package Installation 2 3 runseq2pathway 2 4 Two main functions 3 4.1 seq2gene . .3 4.1.1 seq2gene flowchart . .3 4.1.2 runseq2gene inputs/parameters . .5 4.1.3 runseq2gene outputs . .8 4.2 gene2pathway . 10 4.2.1 gene2pathway flowchart . 11 4.2.2 gene2pathway test inputs/parameters . 11 4.2.3 gene2pathway test outputs . 12 5 Examples 13 5.1 ChIP-seq data analysis . 13 5.1.1 Map ChIP-seq enriched peaks to genes using runseq2gene .................... 13 5.1.2 Discover enriched GO terms using gene2pathway_test with gene scores . 15 5.1.3 Discover enriched GO terms using Fisher's Exact test without gene scores . 17 5.1.4 Add description for genes . 20 5.2 RNA-seq data analysis . 20 6 R environment session 23 1 Abstract Seq2pathway is a novel computational tool to analyze functional gene-sets (including signaling pathways) using variable next-generation sequencing data[1]. Integral to this tool are the \seq2gene" and \gene2pathway" components in series that infer a quantitative pathway-level profile for each sample. The seq2gene function assigns phenotype-associated significance of genomic regions to gene-level scores, where the significance could be p-values of SNPs or point mutations, protein-binding affinity, or transcriptional expression level. The seq2gene function has the feasibility to assign non-exon regions to a range of neighboring genes besides the nearest one, thus facilitating the study of functional non-coding elements[2]. Then the gene2pathway summarizes gene-level measurements to pathway-level scores, comparing the quantity of significance for gene members within a pathway with those outside a pathway.
    [Show full text]
  • DNA Topoisomerases and Cancer Topoisomerases and TOP Genes in Humans Humans Vs
    DNA Topoisomerases Review DNA Topoisomerases And Cancer Topoisomerases and TOP Genes in Humans Humans vs. Escherichia Coli Topoisomerase differences Comparisons Topoisomerase and genomes Top 1 Top1 and Top2 differences Relaxation of DNA Top1 DNA supercoiling DNA supercoiling In the context of chromatin, where the rotation of DNA is constrained, DNA supercoiling (over- and under-twisting and writhe) is readily generated. TOP1 and TOP1mt remove supercoiling by DNA untwisting, acting as “swivelases”, whereas TOP2a and TOP2b remove writhe, acting as “writhases” at DNA crossovers (see TOP2 section). Here are some basic facts concerning DNA supercoiling that are relevant to topoisomerase activity: • Positive supercoiling (Sc+) tightens the DNA helix whereas negative supercoiling (Sc-) facilitates the opening of the duplex and the generation of single-stranded segments. • Nucleosome formation and disassembly absorbs and releases Sc-, respectively. • Polymerases generate Sc+ ahead and Sc- behind their tracks. • Excess of Sc+ arrests DNA tracking enzymes (helicases and polymerases), suppresses transcription elongation and initiation, and destabilizes nucleosomes. • Sc- facilitates DNA melting during the initiation of replication and transcription, D-loop formation and homologous recombination and nucleosome formation. • Excess of Sc- favors the formation of alternative DNA structures (R-loops, guanine quadruplexes, right-handed DNA (Z-DNA), plectonemic structures), which then absorb Sc- upon their formation and attract regulatory proteins. The
    [Show full text]
  • Processing of 3′-Blocked Dna Double-Strand Breaks by Tyrosyl-Dna Phosphodiesterase 1, Artemis and Polynucleotide Kinase/ Phosphatase
    Virginia Commonwealth University VCU Scholars Compass Theses and Dissertations Graduate School 2018 PROCESSING OF 3′-BLOCKED DNA DOUBLE-STRAND BREAKS BY TYROSYL-DNA PHOSPHODIESTERASE 1, ARTEMIS AND POLYNUCLEOTIDE KINASE/ PHOSPHATASE Ajinkya S. Kawale Virginia Commonwealth University Follow this and additional works at: https://scholarscompass.vcu.edu/etd Part of the Biochemistry Commons, Molecular Biology Commons, and the Molecular Genetics Commons © The Author Downloaded from https://scholarscompass.vcu.edu/etd/5329 This Dissertation is brought to you for free and open access by the Graduate School at VCU Scholars Compass. It has been accepted for inclusion in Theses and Dissertations by an authorized administrator of VCU Scholars Compass. For more information, please contact [email protected]. © Ajinkya Kawale 2018 All Rights Reserved PROCESSING OF 3′-BLOCKED DNA DOUBLE-STRAND BREAKS BY TYROSYL-DNA PHOSPHODIESTERASE 1, ARTEMIS AND POLYNUCLEOTIDE KINASE/ PHOSPHATASE A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy at Virginia Commonwealth University. Ajinkya Kawale, MS, Pharmacology and Toxicology, Virginia Commonwealth University, 2015 B.Tech, Biotechnology, Dr. D. Y. Patil University, 2013 Advisor: Dr. Lawrence F. Povirk, Professor, Department of Pharmacology and Toxicology Virginia Commonwealth University Richmond, Virginia, April, 2018. ii ACKNOWLEDGEMENTS Several people have played important roles in helping me reach this stage in my professional career and the little success that I have achieved is a direct extension of all of their hardwork and blessings. First and foremost, I would like to express my sincere gratitude to my PhD Advisor, Dr. Lawrence F. Povirk. He has been a fantastic mentor throughout my time in his lab and has helped me at each and every stage of my graduate career.
    [Show full text]
  • Snotarget Shows That Human Orphan Snorna Targets Locate Close to Alternative Splice Junctions
    Available online at www.sciencedirect.com Gene 408 (2008) 172–179 www.elsevier.com/locate/gene snoTARGET shows that human orphan snoRNA targets locate close to alternative splice junctions Peter S. Bazeley a, Valery Shepelev b, Zohreh Talebizadeh c, Merlin G. Butler c, Larisa Fedorova d, ⁎ Vadim Filatov e, Alexei Fedorov a,d, a Program in Bioinformatics and Proteomics/Genomics, University of Toledo Health Science Campus, Toledo, OH 43614, USA b Department of Bioinformatics, Institute of Molecular Genetics, RAS, Moscow 123182, Russia c Section of Medical Genetics and Molecular Medicine, Children's Mercy Hospitals and Clinics and University of Missouri, Kansas City School of Medicine, Kansas City, MO, USA d Department of Medicine, University of Toledo Health Science Campus, Toledo, OH 43614, USA e Dinom LLC, 8/44 Pedagogicheskaya st., Moscow 115404, Russia Received 31 July 2007; received in revised form 19 October 2007; accepted 24 October 2007 Available online 21 November 2007 Received by Takashi Gojobori Abstract Among thousands of non-protein-coding RNAs which have been found in humans, a significant group represents snoRNA molecules that guide other types of RNAs to specific chemical modifications, cleavages, or proper folding. Yet, hundreds of mammalian snoRNAs have unknown function and are referred to as “orphan” molecules. In 2006, for the first time, it was shown that a particular orphan snoRNA (HBII-52) plays an important role in the regulation of alternative splicing of the serotonin receptor gene in humans and other mammals. In order to facilitate the investigation of possible involvement of snoRNAs in the regulation of pre-mRNA processing, we developed a new computational web resource, snoTARGET, which searches for possible guiding sites for snoRNAs among the entire set of human and rodent exonic and intronic sequences.
    [Show full text]
  • Androgen Receptor Interacting Proteins and Coregulators Table
    ANDROGEN RECEPTOR INTERACTING PROTEINS AND COREGULATORS TABLE Compiled by: Lenore K. Beitel, Ph.D. Lady Davis Institute for Medical Research 3755 Cote Ste Catherine Rd, Montreal, Quebec H3T 1E2 Canada Telephone: 514-340-8260 Fax: 514-340-7502 E-Mail: [email protected] Internet: http://androgendb.mcgill.ca Date of this version: 2010-08-03 (includes articles published as of 2009-12-31) Table Legend: Gene: Official symbol with hyperlink to NCBI Entrez Gene entry Protein: Protein name Preferred Name: NCBI Entrez Gene preferred name and alternate names Function: General protein function, categorized as in Heemers HV and Tindall DJ. Endocrine Reviews 28: 778-808, 2007. Coregulator: CoA, coactivator; coR, corepressor; -, not reported/no effect Interactn: Type of interaction. Direct, interacts directly with androgen receptor (AR); indirect, indirect interaction; -, not reported Domain: Interacts with specified AR domain. FL-AR, full-length AR; NTD, N-terminal domain; DBD, DNA-binding domain; h, hinge; LBD, ligand-binding domain; C-term, C-terminal; -, not reported References: Selected references with hyperlink to PubMed abstract. Note: Due to space limitations, all references for each AR-interacting protein/coregulator could not be cited. The reader is advised to consult PubMed for additional references. Also known as: Alternate gene names Gene Protein Preferred Name Function Coregulator Interactn Domain References Also known as AATF AATF/Che-1 apoptosis cell cycle coA direct FL-AR Leister P et al. Signal Transduction 3:17-25, 2003 DED; CHE1; antagonizing regulator Burgdorf S et al. J Biol Chem 279:17524-17534, 2004 CHE-1; AATF transcription factor ACTB actin, beta actin, cytoplasmic 1; cytoskeletal coA - - Ting HJ et al.
    [Show full text]
  • Genetic and Genomic Analysis of Hyperlipidemia, Obesity and Diabetes Using (C57BL/6J × TALLYHO/Jngj) F2 Mice
    University of Tennessee, Knoxville TRACE: Tennessee Research and Creative Exchange Nutrition Publications and Other Works Nutrition 12-19-2010 Genetic and genomic analysis of hyperlipidemia, obesity and diabetes using (C57BL/6J × TALLYHO/JngJ) F2 mice Taryn P. Stewart Marshall University Hyoung Y. Kim University of Tennessee - Knoxville, [email protected] Arnold M. Saxton University of Tennessee - Knoxville, [email protected] Jung H. Kim Marshall University Follow this and additional works at: https://trace.tennessee.edu/utk_nutrpubs Part of the Animal Sciences Commons, and the Nutrition Commons Recommended Citation BMC Genomics 2010, 11:713 doi:10.1186/1471-2164-11-713 This Article is brought to you for free and open access by the Nutrition at TRACE: Tennessee Research and Creative Exchange. It has been accepted for inclusion in Nutrition Publications and Other Works by an authorized administrator of TRACE: Tennessee Research and Creative Exchange. For more information, please contact [email protected]. Stewart et al. BMC Genomics 2010, 11:713 http://www.biomedcentral.com/1471-2164/11/713 RESEARCH ARTICLE Open Access Genetic and genomic analysis of hyperlipidemia, obesity and diabetes using (C57BL/6J × TALLYHO/JngJ) F2 mice Taryn P Stewart1, Hyoung Yon Kim2, Arnold M Saxton3, Jung Han Kim1* Abstract Background: Type 2 diabetes (T2D) is the most common form of diabetes in humans and is closely associated with dyslipidemia and obesity that magnifies the mortality and morbidity related to T2D. The genetic contribution to human T2D and related metabolic disorders is evident, and mostly follows polygenic inheritance. The TALLYHO/ JngJ (TH) mice are a polygenic model for T2D characterized by obesity, hyperinsulinemia, impaired glucose uptake and tolerance, hyperlipidemia, and hyperglycemia.
    [Show full text]
  • 1 Supporting Information for a Microrna Network Regulates
    Supporting Information for A microRNA Network Regulates Expression and Biosynthesis of CFTR and CFTR-ΔF508 Shyam Ramachandrana,b, Philip H. Karpc, Peng Jiangc, Lynda S. Ostedgaardc, Amy E. Walza, John T. Fishere, Shaf Keshavjeeh, Kim A. Lennoxi, Ashley M. Jacobii, Scott D. Rosei, Mark A. Behlkei, Michael J. Welshb,c,d,g, Yi Xingb,c,f, Paul B. McCray Jr.a,b,c Author Affiliations: Department of Pediatricsa, Interdisciplinary Program in Geneticsb, Departments of Internal Medicinec, Molecular Physiology and Biophysicsd, Anatomy and Cell Biologye, Biomedical Engineeringf, Howard Hughes Medical Instituteg, Carver College of Medicine, University of Iowa, Iowa City, IA-52242 Division of Thoracic Surgeryh, Toronto General Hospital, University Health Network, University of Toronto, Toronto, Canada-M5G 2C4 Integrated DNA Technologiesi, Coralville, IA-52241 To whom correspondence should be addressed: Email: [email protected] (M.J.W.); yi- [email protected] (Y.X.); Email: [email protected] (P.B.M.) This PDF file includes: Materials and Methods References Fig. S1. miR-138 regulates SIN3A in a dose-dependent and site-specific manner. Fig. S2. miR-138 regulates endogenous SIN3A protein expression. Fig. S3. miR-138 regulates endogenous CFTR protein expression in Calu-3 cells. Fig. S4. miR-138 regulates endogenous CFTR protein expression in primary human airway epithelia. Fig. S5. miR-138 regulates CFTR expression in HeLa cells. Fig. S6. miR-138 regulates CFTR expression in HEK293T cells. Fig. S7. HeLa cells exhibit CFTR channel activity. Fig. S8. miR-138 improves CFTR processing. Fig. S9. miR-138 improves CFTR-ΔF508 processing. Fig. S10. SIN3A inhibition yields partial rescue of Cl- transport in CF epithelia.
    [Show full text]
  • Chromosomal Rearrangements Are Commonly Post-Transcriptionally Attenuated in Cancer
    bioRxiv preprint doi: https://doi.org/10.1101/093369; this version posted February 1, 2017. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. Chromosomal rearrangements are commonly post-transcriptionally attenuated in cancer 1 3 1 3, 4, 5 Emanuel Gonçalves ,​ Athanassios Fragoulis ,​ Luz Garcia-Alonso ,​ Thorsten Cramer ,​ ​ ​ ​ ​ 1,2# 1# Julio Saez-Rodriguez ,​ Pedro Beltrao ​ ​ 1 ​ European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Cambridge CB10 1SD, UK 2 ​ RWTH Aachen University, Faculty of Medicine, Joint Research Centre for Computational Biomedicine, Aachen 52057, Germany 3 ​ Molecular Tumor Biology, Department of General, Visceral and Transplantation Surgery, RWTH University Hospital, Pauwelsstraße 30, 52074 Aachen, Germany 4 ​ NUTRIM School of Nutrition and Translational Research in Metabolism, Maastricht University, Maastricht, The Netherlands 5 ​ ESCAM – European Surgery Center Aachen Maastricht, Germany and The Netherlands # ​ co-last authors: [email protected]; [email protected] ​ ​ ​ Running title: Chromosomal rearrangement attenuation in cancer Keywords: Cancer; Gene dosage; Proteomics; Copy-number variation; Protein complexes 1 bioRxiv preprint doi: https://doi.org/10.1101/093369; this version posted February 1, 2017. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. Abstract Chromosomal rearrangements, despite being detrimental, are ubiquitous in cancer and often act as driver events.
    [Show full text]
  • Differential and Common DNA Repair Pathways for Topoisomerase I- and II-Targeted Drugs in a Genetic DT40 Repair Cell Screen Panel
    Published OnlineFirst October 15, 2013; DOI: 10.1158/1535-7163.MCT-13-0551 Molecular Cancer Cancer Biology and Signal Transduction Therapeutics Differential and Common DNA Repair Pathways for Topoisomerase I- and II-Targeted Drugs in a Genetic DT40 Repair Cell Screen Panel Yuko Maede1, Hiroyasu Shimizu4, Toru Fukushima2, Toshiaki Kogame1, Terukazu Nakamura3, Tsuneharu Miki4, Shunichi Takeda1, Yves Pommier5, and Junko Murai1,5 Abstract Clinical topoisomerase I (Top1) and II (Top2) inhibitors trap topoisomerases on DNA, thereby inducing protein-linked DNA breaks. Cancer cells resist the drugs by removing topoisomerase-DNA complexes, and repairing the drug-induced DNA double-strand breaks (DSB) by homologous recombination and nonhomol- ogous end joining (NHEJ). Because numerous enzymes and cofactors are involved in the removal of the topoisomerase-DNA complexes and DSB repair, it has been challenging to comprehensively analyze the relative contribution of multiple genetic pathways in vertebrate cells. Comprehending the relative contribution of individual repair factors would give insights into the lesions induced by the inhibitors and genetic determinants of response. Ultimately, this information would be useful to target specific pathways to augment the therapeutic activity of topoisomerase inhibitors. To this end, we put together 48 isogenic DT40 mutant cells deficient in DNA repair and generated one cell line deficient in autophagy (ATG5). Sensitivity profiles were established for three clinically relevant Top1 inhibitors (camptothecin and the indenoisoquinolines LMP400 and LMP776) and three Top2 inhibitors (etoposide, doxorubicin, and ICRF-193). Highly significant correlations were found among Top1 inhibitors as well as Top2 inhibitors, whereas the profiles of Top1 inhibitors were different from those of Top2 inhibitors.
    [Show full text]