R.ROSETTA: a Package for Analysis of Rule-Based Classification Models

Total Page:16

File Type:pdf, Size:1020Kb

R.ROSETTA: a Package for Analysis of Rule-Based Classification Models R.ROSETTA: a package for analysis of rule-based classification models Supplementary material Mateusz Garbulowski1,*, Klev Diamanti1,#, Karolina Smolińska1,#, Patricia Stoll2, Susanne Bornelöv3, Aleksander Øhrn4 and Jan Komorowski1,* 1Department of Cell and Molecular Biology, Uppsala University, Sweden 2Department of Biosystems Science and Engineering, ETH Zurich, Switzerland 3Wellcome Trust Medical Research Council Stem Cell Institute, University of Cambridge, England 4Department of Informatics, Oslo University, Norway #These authors contributed equally to the work as second authors. *Corresponding author E-mail: [email protected] (Mateusz Garbulowski) E-mail: [email protected] (Jan Komorowski) R.ROSETTA is freely available at: https://github.com/komorowskilab/R.ROSETTA Table of contents 1. Supplementary notes ....................................................................................................... 2 1.1. Package architecture .......................................................................................................... 2 1.2. Main upgrades ................................................................................................................... 2 1.1.1 Undersampling .......................................................................................................................... 2 1.1.2 Rule p-value estimation ............................................................................................................. 2 1.1.3 Votes normalization in class prediction ...................................................................................... 2 1.1.4 Retrieval of support sets from rules ............................................................................................ 2 1.1.5 Rule visualization ...................................................................................................................... 2 1.3. Data preprocessing ............................................................................................................ 2 1.4. Feature selection ............................................................................................................... 3 1.5. Feature validation .............................................................................................................. 3 1.6. Classification .................................................................................................................... 3 2. Supplementary figures .................................................................................................... 4 3. Supplementary tables ...................................................................................................... 9 4. Supplementary references ............................................................................................. 17 1. Supplementary notes 1.1. Package architecture The ROSETTA framework comes in a GUI version for Windows systems and a command line version for the UNIX-based systems. The R.ROSETTA package is a cross-platform application that uses command line ROSETTA. However, UNIX-based systems require installation of the compatibility layer software wine. For more information we recommend you read the original ROSETTA articles (Øhrn, 1999; Øhrn, 2000; Øhrn et al., 1998) and the technical reference manual (Øhrn, 2001). Detailed instructions for R.ROSETTA installation, functions and a sample code are available in the package manual. 1.2. Main upgrades 1.1.1 Undersampling Classification models are trained with labeled examples that are a priori assigned to a decision class. Ideally, each decision class shall contain approximately the same number of examples. However, decision classes that are not represented equally may lead to a biased performance of the model. In such cases, we suggested that a number of examples equal to the population of the minority class is randomly sampled without replacement a sufficient number of times. The approach of balancing the data (Liu et al., 2009) is generally known as undersampling. We have implemented an option that creates sets of equal sizes by undersampling the larger sets of the input dataset. By default, each example is selected at least once, although the user can set a custom number of sampled sets, as well as a custom size for each set. The final classification models for each undersampled set are merged into a single model that consist of unique rules from each classifier. The overall accuracy of the model is estimated as the mean value of all the submodels. Finally, all the statistics of each rule shall be recalculated according to the original input dataset. 1.1.2 Rule p-value estimation Classification models generated by R.ROSETTA consist of a set of varying number of rules estimated from different reduct computation algorithms. In case of the Johnson algorithm (Johnson, 1974) this set is manageable, while in case of the Genetic algorithm (Wróblewski, 1995) is considerably large (Supplementary Figure S3; Supplementary Table S4). In both cases, supervised pruning of rules from the models would not heavily affect the overall performance of the classifier. To better assess the quality of each rule we assume a hypergeometric distribution to calculate p-values followed by multiple testing correction. The hypergeometric distribution follows the idea of estimating the power of representation of rule support against the total number of examples. Such optimized models illustrate the essential interactions among features and can be applied to external data. 1.1.3 Votes normalization in class prediction The rule-based models allow straightforward class prediction. Rules can be applied for the classification of unseen data. We collected the votes from the rules for each object individually and examine the predominance of each class. For some models an imbalanced number of rules for the outcome was generated. We propose to adjust for the rule-imbalance by normalizing on vote counts. We implemented various vote normalization methods in R.ROSETTA. Votes normalization can be performed by dividing the number of votes by its mean, median, maximum, total number of rules or square root of the sum of squares. We compared the performance of these methods in (Supplementary Table S8). Yet an additional necessary condition to be fulfilled is that the validation set shall contain the same set of features as the training set. In this case the class prediction is performed by applying the rules to the data. If the condition is fulfilled, the votes are estimated. 1.1.4 Retrieval of support sets from rules R.ROSETTA is able to recover support sets that represent the contribution of the objects to rules (Supplementary Figure S4). As a result, each rule is characterized with the set of objects that fulfill the given IF or IF-THEN component. There are several advantages of knowing such information. Support sets contribute to uncovering objects whose levels of variables might have shared patterns. Such sets may be further investigated to uncover specific subsets within decision classes. Moreover, low accuracy support sets allow detecting objects that may potentially introduce a bias in the model. 1.1.5 Rule visualization The package provides several ways of rule visualization. One of these is a heatmap (Supplementary Figure S4) that illustrates the discrete levels of each feature, that is a member of the selected conjunctive rule for each object. The heatmap gathers the objects into those that belong to the support set for the given class, do not belong to the support set for the given class and the remaining objects for the other classes. The heatmap can assist the interpretation of individual rules of interest and visualization of interactions. A more holistic approach displays the entire model as an interaction network (Shmulevich et al., 2002). The package allows exporting the rules in a line-by-line format which is suitable with rule visualization software such as Ciruvis (Bornelöv et al., 2014) or VisuNet (Supplementary Figure S5) (Anyango, 2016). These approaches provide a different point of view of the rule-based models that allow discovering known proof-of-concept and novel interactions among the features (Dramiński et al., 2016; Enroth et al., 2012). 1.3. Data preprocessing The gene expression data (Alter et al., 2011) for autistic and healthy young males (Supplementary Table S1) was downloaded from the GEO repository, under the accession number GSE25507. Expression levels of 54678 genes were measured with the Affymetrix Human Genome 2 U133 Plus 2.0 array. The so-called autism-control dataset was loaded and processed using the getGEO function from the GEOquery R library (Davis and Meltzer, 2007). The data was normalized with the Robust Multi-array Average (RMA) functions ReadAffy and rma from the affy R package (Gautier et al., 2004). Gene names were annotated using AnnotationDbi (Pages et al., 2008) and hgu133plus2.db (Carlson, 2016) R packages. To identify unknown probe names, the annotation table HG-U133_Plus_2.na36 (http://www.affymetrix.com/site/mainPage.affx) was processed and the probe coordinates were intersected with the unknown probes using the GenomicRanges (Lawrence et al., 2013) R package. For one probe the gene name could not be identified due to the lack of coordinates. The clinical data was investigated for potential batch effects using Pearson correlation with the rcorr function from the Hmisc R library (Harrell and Dupont, 2008). The age of the samples was highly correlated to the outcome (Supplementary
Recommended publications
  • Injury by Mechanical Ventilation Gene Transcription and Promotion Of
    Modulation of Lipopolysaccharide-Induced Gene Transcription and Promotion of Lung Injury by Mechanical Ventilation This information is current as William A. Altemeier, Gustavo Matute-Bello, Sina A. of September 29, 2021. Gharib, Robb W. Glenny, Thomas R. Martin and W. Conrad Liles J Immunol 2005; 175:3369-3376; ; doi: 10.4049/jimmunol.175.5.3369 http://www.jimmunol.org/content/175/5/3369 Downloaded from Supplementary http://www.jimmunol.org/content/suppl/2005/08/23/175.5.3369.DC1 Material http://www.jimmunol.org/ References This article cites 37 articles, 7 of which you can access for free at: http://www.jimmunol.org/content/175/5/3369.full#ref-list-1 Why The JI? Submit online. • Rapid Reviews! 30 days* from submission to initial decision by guest on September 29, 2021 • No Triage! Every submission reviewed by practicing scientists • Fast Publication! 4 weeks from acceptance to publication *average Subscription Information about subscribing to The Journal of Immunology is online at: http://jimmunol.org/subscription Permissions Submit copyright permission requests at: http://www.aai.org/About/Publications/JI/copyright.html Email Alerts Receive free email-alerts when new articles cite this article. Sign up at: http://jimmunol.org/alerts The Journal of Immunology is published twice each month by The American Association of Immunologists, Inc., 1451 Rockville Pike, Suite 650, Rockville, MD 20852 Copyright © 2005 by The American Association of Immunologists All rights reserved. Print ISSN: 0022-1767 Online ISSN: 1550-6606. The Journal of Immunology Modulation of Lipopolysaccharide-Induced Gene Transcription and Promotion of Lung Injury by Mechanical Ventilation1 William A.
    [Show full text]
  • Genetics of Cell Surface Receptors for Bioactive Polypeptides
    Proc. Nati. Acad. Sci. USA Vol. 77, No. 6, pp. 3600-3604, June 1980 Genetics Genetics of cell surface receptors for bioactive polypeptides: Binding of epidermal growth factor is associated with the presence of human chromosome 7 in human-mouse cell hybrids (hormone/gene mapping/gene regulation) NOBUYOSHI SHIMIZU, M. ALI BEHZADIAN, AND YOSHIKO SHIMIZU Department of Cellular and Developmental Biology, University of Arizona, Tucson, Arizona 85721 Communicated by Frank H. Ruddle, March 24, 1980 ABSTRACT Mouse A9 cells, L-cell-derived mutants defi- Although recent work has identified the EGF receptor as a cient in hypoxanthine phosphoribosyltransferase (HPRT; glycoprotein with subunit structure (6-8), little is known about IMP:pyrophosphate Ihosphoribosyltransferase, EC 2.4.2.8) were its genetics and biosynthesis. By an application of somatic cell foun to e incapable of binding 25I-labeled epidermal growth factor (EGF) to the cell surface. The A9 cells were fused with genetics we have been studying the genetic and molecular basis human diploid fibroblasts (WI-38) possessing EGF-binding of receptor-mediated mitogenic action of EGF (9). In this report ability, and human-mouse cell hybrids (TA series) were isolated we present the evidence for the dominance of EGF-binding after hypoxanthine/aminopterin/thymidine/ouabain selection. ability and its linkage with human chromosome 7 based on Analyses of isozyme markers and chromosomes of four repre- analysis of human-mouse cell hybrids. sentative clones of TA hybrids indicated that the expression of EGF-binding ability is correlated with the presence of human chromosome 7 or 19. Four subclones were isolated from an MATERIALS AND METHODS EGF-binding-positive line, TA4, and segregation of EGF- Cell Lines.
    [Show full text]
  • B Inhibition in a Mouse Model of Chronic Colitis1
    The Journal of Immunology Differential Expression of Inflammatory and Fibrogenic Genes and Their Regulation by NF-␬B Inhibition in a Mouse Model of Chronic Colitis1 Feng Wu and Shukti Chakravarti2 Fibrosis is a major complication of chronic inflammation, as seen in Crohn’s disease and ulcerative colitis, two forms of inflam- matory bowel diseases. To elucidate inflammatory signals that regulate fibrosis, we investigated gene expression changes under- lying chronic inflammation and fibrosis in trinitrobenzene sulfonic acid-induced murine colitis. Six weekly 2,4,6-trinitrobenzene sulfonic acid enemas were given to establish colitis and temporal gene expression patterns were obtained at 6-, 8-, 10-, and 12-wk time points. The 6-wk point, TNBS-w6, was the active, chronic inflammatory stage of the model marked by macrophage, neu- trophil, and CD3؉ and CD4؉ T cell infiltrates in the colon, consistent with the idea that this model is T cell immune response driven. Proinflammatory genes Cxcl1, Ccl2, Il1b, Lcn2, Pla2g2a, Saa3, S100a9, Nos2, Reg2, and Reg3g, and profibrogenic extra- cellular matrix genes Col1a1, Col1a2, Col3a1, and Lum (lumican), encoding a collagen-associated proteoglycan, were up-regulated at the active/chronic inflammatory stages. Rectal administration of the NF-␬B p65 antisense oligonucleotide reduced but did not abrogate inflammation and fibrosis completely. The antisense oligonucleotide treatment reduced total NF-␬B by 60% and down- regulated most proinflammatory genes. However, Ccl2, a proinflammatory chemokine known to promote fibrosis, was not down- regulated. Among extracellular matrix gene expressions Lum was suppressed while Col1a1 and Col3a1 were not. Thus, effective treatment of fibrosis in inflammatory bowel disease may require early and complete blockade of NF-␬B with particular attention to specific proinflammatory and profibrogenic genes that remain active at low levels of NF-␬B.
    [Show full text]
  • In This Table Protein Name, Uniprot Code, Gene Name P-Value
    Supplementary Table S1: In this table protein name, uniprot code, gene name p-value and Fold change (FC) for each comparison are shown, for 299 of the 301 significantly regulated proteins found in both comparisons (p-value<0.01, fold change (FC) >+/-0.37) ALS versus control and FTLD-U versus control. Two uncharacterized proteins have been excluded from this list Protein name Uniprot Gene name p value FC FTLD-U p value FC ALS FTLD-U ALS Cytochrome b-c1 complex P14927 UQCRB 1.534E-03 -1.591E+00 6.005E-04 -1.639E+00 subunit 7 NADH dehydrogenase O95182 NDUFA7 4.127E-04 -9.471E-01 3.467E-05 -1.643E+00 [ubiquinone] 1 alpha subcomplex subunit 7 NADH dehydrogenase O43678 NDUFA2 3.230E-04 -9.145E-01 2.113E-04 -1.450E+00 [ubiquinone] 1 alpha subcomplex subunit 2 NADH dehydrogenase O43920 NDUFS5 1.769E-04 -8.829E-01 3.235E-05 -1.007E+00 [ubiquinone] iron-sulfur protein 5 ARF GTPase-activating A0A0C4DGN6 GIT1 1.306E-03 -8.810E-01 1.115E-03 -7.228E-01 protein GIT1 Methylglutaconyl-CoA Q13825 AUH 6.097E-04 -7.666E-01 5.619E-06 -1.178E+00 hydratase, mitochondrial ADP/ATP translocase 1 P12235 SLC25A4 6.068E-03 -6.095E-01 3.595E-04 -1.011E+00 MIC J3QTA6 CHCHD6 1.090E-04 -5.913E-01 2.124E-03 -5.948E-01 MIC J3QTA6 CHCHD6 1.090E-04 -5.913E-01 2.124E-03 -5.948E-01 Protein kinase C and casein Q9BY11 PACSIN1 3.837E-03 -5.863E-01 3.680E-06 -1.824E+00 kinase substrate in neurons protein 1 Tubulin polymerization- O94811 TPPP 6.466E-03 -5.755E-01 6.943E-06 -1.169E+00 promoting protein MIC C9JRZ6 CHCHD3 2.912E-02 -6.187E-01 2.195E-03 -9.781E-01 Mitochondrial 2-
    [Show full text]
  • Mclean, Chelsea.Pdf
    COMPUTATIONAL PREDICTION AND EXPERIMENTAL VALIDATION OF NOVEL MOUSE IMPRINTED GENES A Dissertation Presented to the Faculty of the Graduate School of Cornell University In Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy by Chelsea Marie McLean August 2009 © 2009 Chelsea Marie McLean COMPUTATIONAL PREDICTION AND EXPERIMENTAL VALIDATION OF NOVEL MOUSE IMPRINTED GENES Chelsea Marie McLean, Ph.D. Cornell University 2009 Epigenetic modifications, including DNA methylation and covalent modifications to histone tails, are major contributors to the regulation of gene expression. These changes are reversible, yet can be stably inherited, and may last for multiple generations without change to the underlying DNA sequence. Genomic imprinting results in expression from one of the two parental alleles and is one example of epigenetic control of gene expression. So far, 60 to 100 imprinted genes have been identified in the human and mouse genomes, respectively. Identification of additional imprinted genes has become increasingly important with the realization that imprinting defects are associated with complex disorders ranging from obesity to diabetes and behavioral disorders. Despite the importance imprinted genes play in human health, few studies have undertaken genome-wide searches for new imprinted genes. These have used empirical approaches, with some success. However, computational prediction of novel imprinted genes has recently come to the forefront. I have developed generalized linear models using data on a variety of sequence and epigenetic features within a training set of known imprinted genes. The resulting models were used to predict novel imprinted genes in the mouse genome. After imposing a stringency threshold, I compiled an initial candidate list of 155 genes.
    [Show full text]
  • The Mammalian Odz Gene Family
    Developmental Biology 217, 107–120 (2000) doi:10.1006/dbio.1999.9532, available online at http://www.idealibrary.com on View metadata, citation and similar papers at core.ac.uk brought to you by CORE The Mammalian Odz Gene Family: Homologs provided by Elsevier - Publisher Connector of a Drosophila Pair-Rule Gene with Expression Implying Distinct yet Overlapping Developmental Roles Tali Ben-Zur, Erez Feige, Benny Motro, and Ron Wides1 Faculty of Life Sciences, Bar-Ilan University, Ramat-Gan, 52900 Israel The Drosophila pair-rule gene odz (Tenm) has many patterning roles throughout development. We have identified four mammalian homologs of this gene, including one previously described as a mouse ER stress response gene, Doc4 (Wang et al., 1998). The Odz genes encode large polypeptides displaying the hallmarks of Drosophila Odz: a putative signal peptide; eight EGF-like repeats; and a putative transmembrane domain followed by a 1800-amino-acid stretch without homology to any proteins outside of this family. The mouse genes Odz3 and Doc4/Odz4 exhibit partially overlapping, but clearly distinct, embryonic expression patterns. The major embryonic sites of expression are in the nervous system, including the tectum, optic recess, optic stalk, and developing eye. Additional sites of expression include trachea and mesodermally derived tissues, such as mesentery, and forming limb and bone. Expression of the Odz2 gene is restricted to the nervous system. The expression patterns suggest that each of the genes has its own distinct developmental role. Comparisons of Drosophila and vertebrate Odz expression patterns suggest evolutionarily conserved functions. © 2000 Academic Press Key Words: odz (odd Oz); pattern formation; central nervous system (CNS); embryonic development; eye.
    [Show full text]
  • Scin Is an Outer Membrane Lipoprotein Required for Type VI Secretion in Enteroaggregative Escherichia Coliᰔ Marie-Ste´Phanie Aschtgen, Christophe S
    JOURNAL OF BACTERIOLOGY, Nov. 2008, p. 7523–7531 Vol. 190, No. 22 0021-9193/08/$08.00ϩ0 doi:10.1128/JB.00945-08 Copyright © 2008, American Society for Microbiology. All Rights Reserved. SciN Is an Outer Membrane Lipoprotein Required for Type VI Secretion in Enteroaggregative Escherichia coliᰔ Marie-Ste´phanie Aschtgen, Christophe S. Bernard, Sophie De Bentzmann, Roland Lloube`s, and Eric Cascales* Laboratoire d’Inge´nierie des Syste`mes Macromole´culaires, Institut de Biologie Structurale et Microbiologie, CNRS, UPR 9027, 31 Chemin Joseph Aiguier, 13402 Marseille Cedex 20, France Received 10 July 2008/Accepted 29 August 2008 Enteroaggregative Escherichia coli (EAEC) is a pathogen implicated in several infant diarrhea or diarrheal outbreaks in areas of endemicity. Although multiple genes involved in EAEC pathogenesis have been identified, the overall mechanism of virulence is not well understood. Recently, a novel secretion system, called type VI secretion (T6S) system (T6SS), has been identified in EAEC and most animal or plant gram-negative patho- gens. T6SSs are multicomponent cell envelope machines responsible for the secretion of at least two putative substrates, Hcp and VgrG. In EAEC, two copies of T6S gene clusters, called sci-1 and sci-2, are present on the pheU pathogenicity island. In this study, we focused our work on the sci-1 gene cluster. The Sci-1 apparatus is probably composed of all, or a subset of, the 21 gene products encoded on the cluster. Among these subunits, some are shared by all T6SSs identified to date, including a ClpV-type AAA؉ ATPase (SciG) and an IcmF (SciS) and an IcmH (SciP) homologue, as well as a putative lipoprotein (SciN).
    [Show full text]
  • A Monomeric Photoconvertible Fluorescent Protein for Imaging of Dynamic Protein Localization
    doi:10.1016/j.jmb.2010.06.056 J. Mol. Biol. (2010) 401, 776–791 Available online at www.sciencedirect.com A Monomeric Photoconvertible Fluorescent Protein for Imaging of Dynamic Protein Localization Hiofan Hoi1, Nathan C. Shaner2, Michael W. Davidson3, Christopher W. Cairo1,4, Jiwu Wang2 and Robert E. Campbell1⁎ 1Department of Chemistry, The use of green-to-red photoconvertible fluorescent proteins (FPs) enables University of Alberta, researchers to highlight a subcellular population of a fusion protein of interest Edmonton, Alberta, Canada and to image its dynamics in live cells. In an effort to enrich the arsenal of T6G 2G2 photoconvertible FPs and to overcome the limitations imposed by the oligomeric structure of natural photoconvertible FPs, we designed and 2Allele Biotechnology, 9924 optimized a new monomeric photoconvertible FP. Using monomeric Mesa Rim Road, San Diego, versions of Clavularia sp. cyan FP as template, we employed sequence- CA 92121, USA alignment-guided design to create a chromophore environment analogous to 3National High Magnetic Field that shared by known photoconvertible FPs. The designed gene was Laboratory and Department of synthesized and, when expressed in Escherichia coli, found to produce green Biological Science, The Florida fluorescent colonies that gradually switched to red after exposure to white State University, 1800 East light. We subjected this first-generation FP [named mClavGR1 (monomeric Paul Dirac Drive, Tallahassee, Clavularia-derived green-to-red photoconvertible 1)] to a combination of FL 32310, USA random and targeted mutageneses and screened libraries for efficient 4 photoconversion using a custom-built system for illuminating a 10-cm Petri Alberta Ingenuity Center for plate with 405-nm light.
    [Show full text]
  • Original Article Scinderin Is a Novel Transcriptional Target of BRMS1 Involved in Regulation of Hepatocellular Carcinoma Cell Apoptosis
    Am J Cancer Res 2018;8(6):1008-1018 www.ajcr.us /ISSN:2156-6976/ajcr0077479 Original Article Scinderin is a novel transcriptional target of BRMS1 involved in regulation of hepatocellular carcinoma cell apoptosis Xiaojing Qiao1, Yiren Zhou1, Wenjuan Xie1, Yi Wang1, Yicheng Zhang1, Tian Tian1,3, Jianming Dou1, Xi Yang1, Suqin Shen1, Jianwei Hu2, Shouyi Qiao1, Yanhua Wu1 1School of Life Sciences, Fudan University, Shanghai 200433, P. R. China; 2Endoscopy Center and Department of General Surgery, Zhongshan Hospital of Fudan University, Shanghai 200032, P. R. China; 3Centre for Discovery Brain Sciences, University of Edinburgh, Edinburgh, EH89XD, Scotland Received April 6, 2018; Accepted May 18, 2018; Epub June 1, 2018; Published June 15, 2018 Abstract: Tumor metastasis suppressor factor BRMS1 can regulate the metastasis of breast cancer and other tumors. Here we report scinderin (SCIN) as a novel transcriptional target of BRMS1. SCIN protein belongs to the cy- toskeletal gelsolin protein superfamily and its involvement in tumorigenesis remains largely illusive. An inverse cor- relation between the expression levels of BRMS1 and SCIN was observed in hepatocellular carcinoma (HCC) cells and tissues. On the molecular level, BRMS1 binds to SCIN promoter and exerts a suppressive role in regulating SCIN transcription. FACS analysis and caspase 9 immunoblot reveal that knockdown of SCIN expression can sensitize HCC cells to chemotherapeutic drugs, leading to suppression of tumor growth in vivo. Consistently, overexpression of SCIN protects cells from apoptotic death, contributing to increased xenografted HCC cell growth. In summary, our study reveals SCIN as a functional apoptosis regulator as well as a novel target of BRMS1 during HCC tumorigen- esis.
    [Show full text]
  • Transcriptomes of Major Renal Collecting Duct Cell Types In
    Transcriptomes of major renal collecting duct cell types PNAS PLUS in mouse identified by single-cell RNA-seq Lihe Chena, Jae Wook Leeb, Chung-Lin Choua, Anil V. Nairc, Maria A. Battistonec, Teodor G. Paunescu˘ c, Maria Merkulovac, Sylvie Bretonc, Jill W. Verlanderd, Susan M. Walle,f, Dennis Brownc, Maurice B. Burga,1, and Mark A. Kneppera,1 aSystems Biology Center, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD 20892; bNephrology Clinic, National Cancer Center, Goyang, 10408, South Korea; cCenter for Systems Biology, Program in Membrane Biology and Division of Nephrology, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114; dDivision of Nephrology, Hypertension and Renal Transplantation, University of Florida College of Medicine, Gainesville, FL 32610; eDepartment of Medicine, Emory University School of Medicine, Atlanta, GA 30322; and fDepartment of Physiology, Emory University School of Medicine, Atlanta, GA 30322 Contributed by Maurice B. Burg, October 3, 2017 (sent for review June 22, 2017; reviewed by Andrew P. McMahon and George J. Schwartz) Prior RNA sequencing (RNA-seq) studies have identified complete count for a small fraction of the kidney parenchyma. Therefore, transcriptomes for most renal epithelial cell types. The exceptions methods were required for selective enrichment of the three cell are the cell types that make up the renal collecting duct, namely types from mouse kidney-cell suspensions. Here, we have identi- intercalated cells (ICs) and principal cells (PCs), which account for fied cell-surface markers for A-ICs, B-ICs, and PCs, allowing these only a small fraction of the kidney mass, but play critical physio- cell types to be enriched from kidney-cell suspensions by using logical roles in the regulation of blood pressure, extracellular fluid FACS.
    [Show full text]
  • Quantitative Apical Membrane Proteomics Reveals Vasopressin-Induced Actin Dynamics in Collecting Duct Cells
    Quantitative apical membrane proteomics reveals vasopressin-induced actin dynamics in collecting duct cells Chin-San Looa,1, Cheng-Wei Chena,1, Po-Jen Wanga, Pei-Yu Chena, Shu-Yu Linb, Kay-Hooi Khoob, Robert A. Fentonc, Mark A. Knepperd, and Ming-Jiun Yua,2 aInstitute of Biochemistry and Molecular Biology, National Taiwan University College of Medicine, Taipei 10051, Taiwan; bInstitute of Biological Chemistry, Academia Sinica, Taipei 11529, Taiwan; cDepartment of Biomedicine, InterPrET Center, Aarhus University, Aarhus 8000, Denmark; and dSystems Biology Center, National Heart, Lung, and Blood Institute, Bethesda, MD 20892 Edited by Maurice B. Burg, National Heart, Lung, and Blood Institute, Bethesda, MD, and approved September 5, 2013 (received for review May 20, 2013) In kidney collecting duct cells, filamentous actin (F-actin) depoly- plasma membrane proteins of native rat kidney collecting ducts merization is a critical step in vasopressin-induced trafficking of and identified these proteins by mass spectrometry (MS) (12). A aquaporin-2 to the apical plasma membrane. However, the molec- number of apical membrane proteins involved in vesicular traf- ular components of this response are largely unknown. Using stable ficking and cytoskeletal regulation were identified. However, isotope-based quantitative protein mass spectrometry and surface several technical barriers prohibited deeper quantitative analysis: biotinylation, we identified 100 proteins that showed significant (i) labeling of the apical plasma membrane of native collecting abundance changes in the apical plasma membrane of mouse ducts required technically difficult perfusion of the biotinylation cortical collecting duct cells in response to vasopressin. Fourteen reagent into the collecting duct lumens; (ii) AQP2, the major of these proteins are involved in actin cytoskeleton regulation, vasopressin target, was not efficiently biotinylated because it has including actin itself, 10 actin-associated proteins, and 3 regulatory only one extracellular NHS-reactive lysine that is likely occluded fi proteins.
    [Show full text]
  • Scin (NM 009132) Mouse Tagged ORF Clone Product Data
    OriGene Technologies, Inc. 9620 Medical Center Drive, Ste 200 Rockville, MD 20850, US Phone: +1-888-267-4436 [email protected] EU: [email protected] CN: [email protected] Product datasheet for MR218765L3 Scin (NM_009132) Mouse Tagged ORF Clone Product data: Product Type: Expression Plasmids Product Name: Scin (NM_009132) Mouse Tagged ORF Clone Tag: Myc-DDK Symbol: Scin Synonyms: adseverin; AW545522 Vector: pLenti-C-Myc-DDK-P2A-Puro (PS100092) E. coli Selection: Chloramphenicol (34 ug/mL) Cell Selection: Puromycin ORF Nucleotide The ORF insert of this clone is exactly the same as(MR218765). Sequence: Restriction Sites: SgfI-MluI Cloning Scheme: ACCN: NM_009132 ORF Size: 1845 bp This product is to be used for laboratory only. Not for diagnostic or therapeutic use. View online » ©2021 OriGene Technologies, Inc., 9620 Medical Center Drive, Ste 200, Rockville, MD 20850, US 1 / 2 Scin (NM_009132) Mouse Tagged ORF Clone – MR218765L3 OTI Disclaimer: The molecular sequence of this clone aligns with the gene accession number as a point of reference only. However, individual transcript sequences of the same gene can differ through naturally occurring variations (e.g. polymorphisms), each with its own valid existence. This clone is substantially in agreement with the reference, but a complete review of all prevailing variants is recommended prior to use. More info OTI Annotation: This clone was engineered to express the complete ORF with an expression tag. Expression varies depending on the nature of the gene. RefSeq: NM_009132.2, NP_033158.2 RefSeq Size: 2708 bp RefSeq ORF: 1848 bp Locus ID: 20259 UniProt ID: Q60604, Q3UWV5 Gene Summary: Ca(2+)-dependent actin filament-severing protein that has a regulatory function in exocytosis by affecting the organization of the microfilament network underneath the plasma membrane (PubMed:9671468).
    [Show full text]