Modeling and Analysis of Regulatory Elements in Arabidopsis Thaliana from Annotated Genomes and Gene Expression Data

Total Page:16

File Type:pdf, Size:1020Kb

Modeling and Analysis of Regulatory Elements in Arabidopsis Thaliana from Annotated Genomes and Gene Expression Data Modeling and Analysis of Regulatory Elements in Arabidopsis thaliana from Annotated Genomes and Gene Expression Data Amrita Pati Thesis submitted to the faculty of the Virginia Polytechnic Institute and State University in partial fulfillment of the requirements for the degree of MASTER OF SCIENCE in Computer Science and Applications Lenwood S. Heath, Ph.D., Chairman Ruth Grene, Ph.D. T. M. Murali, Ph.D. July, 2005 Blacksburg, Virginia Copyright 2005, Amrita Pati Modeling and Analysis of Regulatory Elements in Arabidopsis thaliana from Annotated Genomes and Gene Expression Data by Amrita Pati Committee Chairman: Lenwood S. Heath, Ph.D. Computer Science (ABSTRACT) Modeling of cis-elements in the upstream regions of genes is a challenging computational prob- lem. A set of regulatory motifs present in the promoters of a set of genes can be modeled by a biclique. Combinations of cis-elements play a vital role in ascertaining that the correct co-action of transcription factors binding to the gene promoter, results in appropriate gene expression in response to various stimuli. Geometrical and spatial constraints in transcription factor binding also impose restrictions on order and separation of cis-elements. Not all regulatory elements that coexist are biologically significant. If the set of genes in which a set of regulatory elements co-occur, are tightly correlated with respect to gene expression data over a set of treatments, the regulatory element combination can be biologically directed. The system developed in this work, XcisClique, consists of a comprehensive infrastructure for an- notated genome and gene expression data for Arabidopsis thaliana. XcisClique models cis-regulatory elements as regular expressions and detects maximal bicliques of genes and motifs, called itemsets. An itemset consists of a set of genes (called a geneset) and a set of motifs (called a motifset) such that every motif in the motifset occurs in the promoter of every gene in the geneset. XcisClique differs from existing tools of the same kind in that, it offers a common platform for the integration of sequence and gene expression data. Itemsets identified by XcisClique are not only evaluated for statistical over-representation in sequence data, but are also examined with respect to the expres- sion patterns of the corresponding geneset. Thus, the results produced are biologically directed. XcisClique is also the only tool of its kind for Arabidopsis thaliana, and can also be used for other organisms in the presence of appropriate sequence, expression, and regulatory element data. The web-interface to a subset of functionalities, source code and supplemental material are available online at http://bioinformatics.cs.vt.edu/~xcisclique. iii ACKNOWLEDGMENTS I express my sincere gratitude to my advisor, Dr. Lenwood S. Heath, for being a constant source of inspiration, support, and guidance. I thank him for patiently listening to all my ideas, meticulously going over cumbersome results, giving valuable direction to my work, and always being there to talk. I am also thankful to my committee members Dr. Ruth Grene and Dr. T. M. Murali. Dr. Grene has provided the much needed biological insight in all stages of this work. Dr. Murali understood my work with the briefest of meetings, had a solution ready for almost any dilemma, and pointed me to the appropriate places to look for more information. I thank the Department of Computer Science, Virginia Tech, for supporting me through my first year of graduate study. I would like to acknowledge National Science Foundation Grant ITR- 0219322. Thanks to the National Center for Biotechnology Information, Nottingham database, and Plant Acting cis-Element database for the databases used. Thanks are due to Allan for listening to all my technical problems patiently and helping me work them out. I also thank Cecilia for her help in interpreting the results. Thanks to Animesh, Kiran, my roommates, everybody in Avataar, and all my friends for being there and making graduate school fun. Finally, I thank my parents Sanjukta and Aloke Pati for their love and understanding. iv TABLE OF CONTENTS 1 Introduction 1 2 Preliminaries 4 2.1 Definitions . 4 2.1.1 Biological Terms . 4 2.1.2 Computational Terms and Concepts . 8 2.1.3 Statistical Terms and Concepts . 14 2.2 Transcriptional Regulation . 20 2.2.1 Basal Promoter . 20 2.2.2 Proximal Promoter . 21 2.2.3 Distal Promoter . 22 3 Promoter Analysis 25 3.1 Promoter Discovery . 25 3.2 Motif Discovery . 27 3.2.1 Enumerative Approaches . 27 3.2.2 Probabilistic Methods . 32 4 The Regulatory Biclique problem 34 4.1 Computational Problem Formulation and Solution . 36 4.2 Motivation . 36 4.3 Identifying Putative Co-Regulated Genes . 37 4.4 Combinatorial Analysis Programs . 38 v 4.5 Solution to The Regulatory Biclique problem . 40 5 Data Sources and Databases 44 5.1 Sequence Data . 44 5.2 Gene Expression Data . 49 5.3 Cis-Element Data Sources . 51 6 Probabilistic Methods 52 6.1 Correlation Methods . 52 6.1.1 Pearson's Correlation Coefficient . 52 6.1.2 Spearman's Rank Correlation . 53 6.2 The Hypergeometric Distribution . 55 6.3 The Chi-Square Test of Independence . 56 7 XcisClique Process Flow 58 7.1 Modeling Gene Expression Data . 61 7.2 XcisClique Inputs . 64 7.3 Analysis of Over-representation of Individual Motifs . 64 7.4 Identification of Itemsets . 65 7.4.1 Identification of Significant Itemsets . 66 7.5 Integrating Itemset and Expression Data . 67 7.5.1 Using p-values of Correlation . 68 7.5.2 Sum of Absolute Values of Correlation (SAV): A New Statistic . 70 7.5.3 Calculating Final p-values . 76 7.6 System Requirements . 78 8 Biological Case Studies 81 8.1 Heat Shock Response . 81 8.1.1 Heat Shock Protein Families . 81 8.1.2 Heat Shock Regulation . 83 8.1.3 Regular Expressions for HSEs . 84 8.1.4 Results of Combinatorial Analysis using XcisClique . 89 8.2 Cold Long Term Up-regulated Genes in Metabolism . 95 vi 8.3 Cold Long Term Down-regulated genes in Metabolism . 101 8.4 Analysis of Senescence genes involved in Stress, Pathogenicity, and Secondary Metabo- lites . 104 9 Conclusions and Future Directions 106 A cis-Elements used in XcisClique 117 B Slides used for retrieving Expression Data 122 C Perl Code to Retrieve Entrez Data Using E-Utilities 130 C.1 Perl Code: createGeneDB.pl . 130 C.2 Perl Code: createProteinDB.pl . 136 C.3 Perl Code: ExtractATUpstream.pl . 143 D Shell Scrits and Perl Code from the XcisClique Pipeline 150 D.1 Perl Code: GetPromoter.pl . 150 D.2 Perl Code: SelectSequences.pl . 151 D.3 Perl Code: GetXUpstream.pl . 153 D.4 Perl Code: FindMotifs.pl . 154 D.5 Perl Code: Chi-Square.pl . 159 D.6 Shell Script: CountMotifs.sh . 160 D.7 Perl Code: MakeAprioriMatrix.pl . 161 D.8 Perl Code: Hypergeometric.pl . 162 D.9 Perl Code: Eval Itemset.pl . 165 E Expression Data Analysis: Perl and MATLAB Code 179 E.1 Perl Code: MakeExpressionVectors.pl . 179 E.2 Perl Code: CorrelateRepVectors.pl . 183 E.3 Perl Code: MakeAverageVectors.pl . 186 E.4 Perl Code: GeneCorrelate.pl . 188 E.5 Perl Code: SelectTreatmentVectors.pl . 194 E.6 MATLAB Code: GeneStat.m . 196 E.7 MATLAB Code: GeneCorrelateInter.m . 197 vii E.8 MATLAB Code: GeneCorrelateWithGenome.m . 199 E.9 Perl Code: copyCorrstoDB.pl . 201 E.10 MATLAB Code: CallSimulate.m . 204 E.11 MATLAB Code: simulate.m . 205 E.12 Perl Code: getSDistribution.pl . 206 viii LIST OF FIGURES 2.1 Organization of the eukaryotic gene . 5 2.2 Transcription of DNA into mRNA . 6 2.3 Example of a biclique . 13 2.4 Organization of the eukaryotic promoter . 20 2.5 Transcription in eukaryotes . 21 4.1 Curation of yeast regulatory motifs in Pilpel et al. [2001]. 43 5.1 Entity-Relationship diagram of the POPS database . 45 5.2 POPS database tables-I . 46 5.3 POPS database tables-II . 47 5.4 POPS database tables-III . 47 5.5 POPS database tables-IV . 49 5.6 POPS database tables-V . 50 6.1 The Hypergeometric model . 56 7.1 XcisClique schematic . 60 7.2 Distribution of ρ . ..
Recommended publications
  • Early Pregnancy Vitamin D Status and Risk of Preeclampsia
    Early pregnancy vitamin D status and risk of preeclampsia Hooman Mirzakhani, … , Joseph Loscalzo, Scott T. Weiss J Clin Invest. 2016;126(12):4702-4715. https://doi.org/10.1172/JCI89031. Clinical Medicine Clinical trials Reproductive biology BACKGROUND. Low vitamin D status in pregnancy was proposed as a risk factor of preeclampsia. METHODS. We assessed the effect of vitamin D supplementation (4,400 vs. 400 IU/day), initiated early in pregnancy (10–18 weeks), on the development of preeclampsia. The effects of serum vitamin D (25-hydroxyvitamin D [25OHD]) levels on preeclampsia incidence at trial entry and in the third trimester (32–38 weeks) were studied. We also conducted a nested case-control study of 157 women to investigate peripheral blood vitamin D–associated gene expression profiles at 10 to 18 weeks in 47 participants who developed preeclampsia. RESULTS. Of 881 women randomized, outcome data were available for 816, with 67 (8.2%) developing preeclampsia. There was no significant difference between treatment (N = 408) or control (N = 408) groups in the incidence of preeclampsia (8.08% vs. 8.33%, respectively; relative risk: 0.97; 95% CI, 0.61–1.53). However, in a cohort analysis and after adjustment for confounders, a significant effect of sufficient vitamin D status (25OHD ≥30 ng/ml) was observed in both early and late pregnancy compared with insufficient levels (25OHD <30 ng/ml) (adjusted odds ratio, 0.28; 95% CI, 0.10–0.96). Differential expression of 348 vitamin D–associated […] Find the latest version: https://jci.me/89031/pdf CLINICAL MEDICINE The Journal of Clinical Investigation Early pregnancy vitamin D status and risk of preeclampsia Hooman Mirzakhani,1 Augusto A.
    [Show full text]
  • Supplementary Table S4. FGA Co-Expressed Gene List in LUAD
    Supplementary Table S4. FGA co-expressed gene list in LUAD tumors Symbol R Locus Description FGG 0.919 4q28 fibrinogen gamma chain FGL1 0.635 8p22 fibrinogen-like 1 SLC7A2 0.536 8p22 solute carrier family 7 (cationic amino acid transporter, y+ system), member 2 DUSP4 0.521 8p12-p11 dual specificity phosphatase 4 HAL 0.51 12q22-q24.1histidine ammonia-lyase PDE4D 0.499 5q12 phosphodiesterase 4D, cAMP-specific FURIN 0.497 15q26.1 furin (paired basic amino acid cleaving enzyme) CPS1 0.49 2q35 carbamoyl-phosphate synthase 1, mitochondrial TESC 0.478 12q24.22 tescalcin INHA 0.465 2q35 inhibin, alpha S100P 0.461 4p16 S100 calcium binding protein P VPS37A 0.447 8p22 vacuolar protein sorting 37 homolog A (S. cerevisiae) SLC16A14 0.447 2q36.3 solute carrier family 16, member 14 PPARGC1A 0.443 4p15.1 peroxisome proliferator-activated receptor gamma, coactivator 1 alpha SIK1 0.435 21q22.3 salt-inducible kinase 1 IRS2 0.434 13q34 insulin receptor substrate 2 RND1 0.433 12q12 Rho family GTPase 1 HGD 0.433 3q13.33 homogentisate 1,2-dioxygenase PTP4A1 0.432 6q12 protein tyrosine phosphatase type IVA, member 1 C8orf4 0.428 8p11.2 chromosome 8 open reading frame 4 DDC 0.427 7p12.2 dopa decarboxylase (aromatic L-amino acid decarboxylase) TACC2 0.427 10q26 transforming, acidic coiled-coil containing protein 2 MUC13 0.422 3q21.2 mucin 13, cell surface associated C5 0.412 9q33-q34 complement component 5 NR4A2 0.412 2q22-q23 nuclear receptor subfamily 4, group A, member 2 EYS 0.411 6q12 eyes shut homolog (Drosophila) GPX2 0.406 14q24.1 glutathione peroxidase
    [Show full text]
  • Hippo and Sonic Hedgehog Signalling Pathway Modulation of Human Urothelial Tissue Homeostasis
    Hippo and Sonic Hedgehog signalling pathway modulation of human urothelial tissue homeostasis Thomas Crighton PhD University of York Department of Biology November 2020 Abstract The urinary tract is lined by a barrier-forming, mitotically-quiescent urothelium, which retains the ability to regenerate following injury. Regulation of tissue homeostasis by Hippo and Sonic Hedgehog signalling has previously been implicated in various mammalian epithelia, but limited evidence exists as to their role in adult human urothelial physiology. Focussing on the Hippo pathway, the aims of this thesis were to characterise expression of said pathways in urothelium, determine what role the pathways have in regulating urothelial phenotype, and investigate whether the pathways are implicated in muscle-invasive bladder cancer (MIBC). These aims were assessed using a cell culture paradigm of Normal Human Urothelial (NHU) cells that can be manipulated in vitro to represent different differentiated phenotypes, alongside MIBC cell lines and The Cancer Genome Atlas resource. Transcriptomic analysis of NHU cells identified a significant induction of VGLL1, a poorly understood regulator of Hippo signalling, in differentiated cells. Activation of upstream transcription factors PPARγ and GATA3 and/or blockade of active EGFR/RAS/RAF/MEK/ERK signalling were identified as mechanisms which induce VGLL1 expression in NHU cells. Ectopic overexpression of VGLL1 in undifferentiated NHU cells and MIBC cell line T24 resulted in significantly reduced proliferation. Conversely, knockdown of VGLL1 in differentiated NHU cells significantly reduced barrier tightness in an unwounded state, while inhibiting regeneration and increasing cell cycle activation in scratch-wounded cultures. A signalling pathway previously observed to be inhibited by VGLL1 function, YAP/TAZ, was unaffected by VGLL1 manipulation.
    [Show full text]
  • Genetic Network Analysis of Human CD34+ Hematopoietic Stem
    ■ ORIGINAL ARTICLE ■ + GENETIC NETWORK ANALYSIS OF HUMAN CD34 HEMATOPOIETIC STEM/PRECURSOR CELLS Shing-Jyh Chang1,2, Tse-Sung Huang3, Kung-Liahng Wang4,5,6, Tao-Yeuan Wang7, Yuh-Cheng Yang4, Margaret Dah-Tsyr Chang2, Yu-Hsuan Wu3, Hsei-Wei Wang3,8,9,10* 1Department of Obstetrics and Gynecology, Mackay Memorial Hospital, 2National Tsing Hua University, HsinChu, 3Institute of Microbiology and Immunology, National Yang-Ming University, 4Department of Obstetrics and Gynecology, Mackay Memorial Hospital, 5National Taipei College of Nursing, 6Mackay Medicine, Nursing and Management College, 7Department of Pathology, Mackay Memorial Hospital, 8Institute of Clinical Medicine and 9VGH-YM Genome Center, National Yang-Ming University, and 10Department of Education and Research, Taipei City Hospital, Taipei, Taiwan. SUMMARY Objective: Somatic CD34+ hematopoietic stem/precursor cells (HSPCs) give rise to hematopoietic cells and endothelial cells and have been used in clinical applications. Understanding the genes responsible for stemness and how they interact with each other will help us to manipulate these cells more efficiently in the future. Materials and Methods: We performed microarray analysis on human CD34+ HSPCs and on two different progeny cell types, i.e. microvascular endothelial cells and peripheral blood mononuclear cells. Systems biology and advanced bioinformatics tools were used to help clarify the genetic networks associated with these stem cell genes. Results: We identified CD34+ HSPC genes and found that they were involved in critical biologic processes such as cell cycle regulation, chromosome organization, and DNA repair. We also identified a novel precursor gene cluster on chromosome 19p13.3. Analysis of HSPC-enriched genes using systems biology tools revealed a com- plex genetic network functioning in CD34+ cells, in which several genes acted as hubs to maintain the stability (such as GATA1) or connectivity (such as hepatic growth factor) of the whole network.
    [Show full text]
  • Human Induced Pluripotent Stem Cell–Derived Podocytes Mature Into Vascularized Glomeruli Upon Experimental Transplantation
    BASIC RESEARCH www.jasn.org Human Induced Pluripotent Stem Cell–Derived Podocytes Mature into Vascularized Glomeruli upon Experimental Transplantation † Sazia Sharmin,* Atsuhiro Taguchi,* Yusuke Kaku,* Yasuhiro Yoshimura,* Tomoko Ohmori,* ‡ † ‡ Tetsushi Sakuma, Masashi Mukoyama, Takashi Yamamoto, Hidetake Kurihara,§ and | Ryuichi Nishinakamura* *Department of Kidney Development, Institute of Molecular Embryology and Genetics, and †Department of Nephrology, Faculty of Life Sciences, Kumamoto University, Kumamoto, Japan; ‡Department of Mathematical and Life Sciences, Graduate School of Science, Hiroshima University, Hiroshima, Japan; §Division of Anatomy, Juntendo University School of Medicine, Tokyo, Japan; and |Japan Science and Technology Agency, CREST, Kumamoto, Japan ABSTRACT Glomerular podocytes express proteins, such as nephrin, that constitute the slit diaphragm, thereby contributing to the filtration process in the kidney. Glomerular development has been analyzed mainly in mice, whereas analysis of human kidney development has been minimal because of limited access to embryonic kidneys. We previously reported the induction of three-dimensional primordial glomeruli from human induced pluripotent stem (iPS) cells. Here, using transcription activator–like effector nuclease-mediated homologous recombination, we generated human iPS cell lines that express green fluorescent protein (GFP) in the NPHS1 locus, which encodes nephrin, and we show that GFP expression facilitated accurate visualization of nephrin-positive podocyte formation in
    [Show full text]
  • Characterizing Genomic Duplication in Autism Spectrum Disorder by Edward James Higginbotham a Thesis Submitted in Conformity
    Characterizing Genomic Duplication in Autism Spectrum Disorder by Edward James Higginbotham A thesis submitted in conformity with the requirements for the degree of Master of Science Graduate Department of Molecular Genetics University of Toronto © Copyright by Edward James Higginbotham 2020 i Abstract Characterizing Genomic Duplication in Autism Spectrum Disorder Edward James Higginbotham Master of Science Graduate Department of Molecular Genetics University of Toronto 2020 Duplication, the gain of additional copies of genomic material relative to its ancestral diploid state is yet to achieve full appreciation for its role in human traits and disease. Challenges include accurately genotyping, annotating, and characterizing the properties of duplications, and resolving duplication mechanisms. Whole genome sequencing, in principle, should enable accurate detection of duplications in a single experiment. This thesis makes use of the technology to catalogue disease relevant duplications in the genomes of 2,739 individuals with Autism Spectrum Disorder (ASD) who enrolled in the Autism Speaks MSSNG Project. Fine-mapping the breakpoint junctions of 259 ASD-relevant duplications identified 34 (13.1%) variants with complex genomic structures as well as tandem (193/259, 74.5%) and NAHR- mediated (6/259, 2.3%) duplications. As whole genome sequencing-based studies expand in scale and reach, a continued focus on generating high-quality, standardized duplication data will be prerequisite to addressing their associated biological mechanisms. ii Acknowledgements I thank Dr. Stephen Scherer for his leadership par excellence, his generosity, and for giving me a chance. I am grateful for his investment and the opportunities afforded me, from which I have learned and benefited. I would next thank Drs.
    [Show full text]
  • Views of the NIDA, NINDS Or the National Summed Across the Three Auditory Forebrain Lobule Sec- Institutes of Health
    Xie et al. BMC Biology 2010, 8:28 http://www.biomedcentral.com/1741-7007/8/28 RESEARCH ARTICLE Open Access The zebra finch neuropeptidome: prediction, detection and expression Fang Xie1, Sarah E London2,6, Bruce R Southey1,3, Suresh P Annangudi1,6, Andinet Amare1, Sandra L Rodriguez-Zas2,3,5, David F Clayton2,4,5,6, Jonathan V Sweedler1,2,5,6* Abstract Background: Among songbirds, the zebra finch (Taeniopygia guttata) is an excellent model system for investigating the neural mechanisms underlying complex behaviours such as vocal communication, learning and social interactions. Neuropeptides and peptide hormones are cell-to-cell signalling molecules known to mediate similar behaviours in other animals. However, in the zebra finch, this information is limited. With the newly-released zebra finch genome as a foundation, we combined bioinformatics, mass-spectrometry (MS)-enabled peptidomics and molecular techniques to identify the complete suite of neuropeptide prohormones and final peptide products and their distributions. Results: Complementary bioinformatic resources were integrated to survey the zebra finch genome, identifying 70 putative prohormones. Ninety peptides derived from 24 predicted prohormones were characterized using several MS platforms; tandem MS confirmed a majority of the sequences. Most of the peptides described here were not known in the zebra finch or other avian species, although homologous prohormones exist in the chicken genome. Among the zebra finch peptides discovered were several unique vasoactive intestinal and adenylate cyclase activating polypeptide 1 peptides created by cleavage at sites previously unreported in mammalian prohormones. MS-based profiling of brain areas required for singing detected 13 peptides within one brain nucleus, HVC; in situ hybridization detected 13 of the 15 prohormone genes examined within at least one major song control nucleus.
    [Show full text]
  • University of Southern Denmark the Correlation of Copy Number
    University of Southern Denmark The correlation of copy number variations with longevity in a genome-wide association study of Han Chinese Zhao, Xin; Liu, Xiaomin; Zhang, Aiping; Chen, Huashuai; Huo, Qing; Li, Weiyang; Ye, Rui; Chen, Zhihua; Liang, Liping; Liu, Qiong A; Shen, Juan; Jin, Xin; Li, Wenwen; Nygaard, Marianne; Liu, Xiao; Hou, Yong; Ni, Ting; Bolund, Lars; Gottschalk, William; Tao, Wei; Gu, Jun; Tian, Xiao-Li; Yang, Huanming; Wang, Jian; Xu, Xun; Lutz, Michael W; Min, Junxia; Zeng, Yi; Nie, Chao Published in: Aging DOI: 10.18632/aging.101461 Publication date: 2018 Document version: Final published version Document license: CC BY Citation for pulished version (APA): Zhao, X., Liu, X., Zhang, A., Chen, H., Huo, Q., Li, W., Ye, R., Chen, Z., Liang, L., Liu, Q. A., Shen, J., Jin, X., Li, W., Nygaard, M., Liu, X., Hou, Y., Ni, T., Bolund, L., Gottschalk, W., ... Nie, C. (2018). The correlation of copy number variations with longevity in a genome-wide association study of Han Chinese. Aging, 10(6), 1206-1222. https://doi.org/10.18632/aging.101461 Go to publication entry in University of Southern Denmark's Research Portal Terms of use This work is brought to you by the University of Southern Denmark. Unless otherwise specified it has been shared according to the terms for self-archiving. If no other license is stated, these terms apply: • You may download this work for personal use only. • You may not further distribute the material or use it for any profit-making activity or commercial gain • You may freely distribute the URL identifying this open access version If you believe that this document breaches copyright please contact us providing details and we will investigate your claim.
    [Show full text]
  • A Community-Based Transcriptomics Classification and Nomenclature Of
    COMMENT | FOCUS comment | FOCUS A community-based transcriptomics classifcation and nomenclature of neocortical cell types To understand the function of cortical circuits, it is necessary to catalog their cellular diversity. Past attempts to do so using anatomical, physiological or molecular features of cortical cells have not resulted in a unifed taxonomy of neuronal or glial cell types, partly due to limited data. Single-cell transcriptomics is enabling, for the frst time, systematic high-throughput measurements of cortical cells and generation of datasets that hold the promise of being complete, accurate and permanent. Statistical analyses of these data reveal clusters that often correspond to cell types previously defned by morphological or physiological criteria and that appear conserved across cortical areas and species. To capitalize on these new methods, we propose the adoption of a transcriptome-based taxonomy of cell types for mammalian neocortex. This classifcation should be hierarchical and use a standardized nomenclature. It should be based on a probabilistic defnition of a cell type and incorporate data from diferent approaches, developmental stages and species. A community-based classifcation and data aggregation model, such as a knowledge graph, could provide a common foundation for the study of cortical circuits. This community-based classifcation, nomenclature and data aggregation could serve as an example for cell type atlases in other parts of the body. Rafael Yuste, Michael Hawrylycz, Nadia Aalling, Argel Aguilar-Valles, Detlev Arendt, Ruben Armananzas Arnedillo, Giorgio A. Ascoli, Concha Bielza, Vahid Bokharaie, Tobias Borgtoft Bergmann, Irina Bystron, Marco Capogna, Yoonjeung Chang, Ann Clemens, Christiaan P. J. de Kock, Javier DeFelipe, Sandra Esmeralda Dos Santos, Keagan Dunville, Dirk Feldmeyer, Richárd Fiáth, Gordon James Fishell, Angelica Foggetti, Xuefan Gao, Parviz Ghaderi, Natalia A.
    [Show full text]
  • An Integrated Analysis Reveals Geniposide Extracted from Gardenia Jasminoides Ellis Regulates Calcium Signaling Pathway Essential for Infuenza a Virus Replication
    An Integrated Analysis Reveals Geniposide Extracted From Gardenia Jasminoides Ellis Regulates Calcium Signaling Pathway Essential for Inuenza a Virus Replication LiRun Zhou China Academy of Chinese Medical Sciences https://orcid.org/0000-0002-1350-940X Lei Bao China Academy of Chinese Medical Sciences Yaxin Wang China Academy of Chinese Medical Sciences Mengping Chen China Academy of Chinese Medical Sciences Yingying Zhang Beijing University of Chinese Medicine Aliated Dongzhimen Hospital Zihan Geng China Academy of Chinese Medical Sciences Ronghua Zhao China Academy of Chinese Medical Sciences Jing Sun China Academy of Chinese Medical Sciences Yanyan Bao China Academy of Chinese Medical Sciences Yujing Shi China Academy of Chinese Medical Sciences Rongmei Yao China Academy of Chinese Medical Sciences Shanshan Guo ( [email protected] ) China Academy of Chinese Medical Sciences Xiaolan Cui China Academy of Chinese Medical Sciences Research Page 1/30 Keywords: Geniposide, Inuenza A virus, RNA polymerase, Calcium signaling pathway, CAMKII Posted Date: August 9th, 2021 DOI: https://doi.org/10.21203/rs.3.rs-764303/v1 License: This work is licensed under a Creative Commons Attribution 4.0 International License. Read Full License Page 2/30 Abstract Background Geniposide, an iridoid glycoside puried from the fruit of Gardenia jasminoides Ellis, has been reported to possess pleiotropic activity against different diseases. In particular, geniposide possesses a variety of biological activities and exerts good therapeutic effects in the treatment of several strains of the inuenza virus. However, the molecular mechanism for the therapeutic effect has not been well dened. Methods This study aimed to investigate the mechanism of geniposide on inuenza A virus (IAV).
    [Show full text]
  • Integrated Functional Genomic Analysis Enables Annotation of Kidney Genome-Wide Association Study Loci
    BASIC RESEARCH www.jasn.org Integrated Functional Genomic Analysis Enables Annotation of Kidney Genome-Wide Association Study Loci Karsten B. Sieber,1 Anna Batorsky,2 Kyle Siebenthall,2 Kelly L. Hudkins,3 Jeff D. Vierstra,2 Shawn Sullivan,4 Aakash Sur,4,5 Michelle McNulty,6 Richard Sandstrom,2 Alex Reynolds,2 Daniel Bates,2 Morgan Diegel,2 Douglass Dunn,2 Jemma Nelson,2 Michael Buckley,2 Rajinder Kaul,2 Matthew G. Sampson,6 Jonathan Himmelfarb,7,8 Charles E. Alpers,3,8 Dawn Waterworth,1 and Shreeram Akilesh3,8 Due to the number of contributing authors, the affiliations are listed at the end of this article. ABSTRACT Background Linking genetic risk loci identified by genome-wide association studies (GWAS) to their causal genes remains a major challenge. Disease-associated genetic variants are concentrated in regions con- taining regulatory DNA elements, such as promoters and enhancers. Although researchers have previ- ously published DNA maps of these regulatory regions for kidney tubule cells and glomerular endothelial cells, maps for podocytes and mesangial cells have not been available. Methods We generated regulatory DNA maps (DNase-seq) and paired gene expression profiles (RNA-seq) from primary outgrowth cultures of human glomeruli that were composed mainly of podo- cytes and mesangial cells. We generated similar datasets from renal cortex cultures, to compare with those of the glomerular cultures. Because regulatory DNA elements can act on target genes across large genomic distances, we also generated a chromatin conformation map from freshly isolated human glomeruli. Results We identified thousands of unique regulatory DNA elements, many located close to transcription factor genes, which the glomerular and cortex samples expressed at different levels.
    [Show full text]
  • Table S8: Summary of CN Loss Genes Identified by Fisher Test on SNP-FASST2 Segmentation
    Table S8: Summary of CN Loss Genes identified by Fisher Test on SNP-FASST2 Segmentation Gene Symbol Chromosome Start End Length SLC6A19 chr5 1201709 1225230 23522 SLC6A18 chr5 1225469 1246304 20836 TERT chr5 1253286 1295162 41877 SURF6 chr9 136197551 136203047 5497 MED22 chr9 136207754 136214972 7219 SNORD24 chr9 136216250 136216325 76 RPL7A chr9 136215068 136218280 3213 SNORD36B chr9 136216948 136217019 72 SNORD36A chr9 136217310 136217382 73 SNORD36C chr9 136217700 136217767 68 SURF1 chr9 136218665 136223361 4697 SURF2 chr9 136223420 136228040 4621 SURF4 chr9 136228339 136242970 14632 C9orf96 chr9 136243283 136271220 27938 REXO4 chr9 136271185 136283164 11980 ADAMTS13 chr9 136279458 136324508 45051 CACFD1 chr9 136325086 136335909 10824 SLC2A6 chr9 136336215 136344276 8062 TMEM8C chr9 136379707 136390068 10362 ADAMTSL2 chr9 136397285 136440641 43357 FAM163B chr9 136443536 136445368 1833 DBH chr9 136501484 136524466 22983 SARDH chr9 136528683 136605077 76395 VAV2 chr9 136627015 136857446 230432 LINC00094 chr9 136890560 136896719 6160 BRD3 chr9 136895445 136933141 37697 WDR5 chr9 137001209 137025094 23886 RNU6ATAC chr9 137029561 137029686 126 OLFM1 chr9 137967088 138013030 45943 LOC401557 chr9 138073454 138079152 5699 DDX11L10 chr16 61554 64090 2537 POLR3K chr16 96978 103632 6655 SNRNP25 chr16 103828 107669 3842 RHBDF1 chr16 108057 122629 14573 MPG chr16 127017 135850 8834 NPRL3 chr16 135803 188697 52895 HBZ chr16 202853 204504 1652 HBM chr16 215972 216767 796 HBA2 chr16 222845 223709 865 HBA1 chr16 226678 227520 843 HBQ1 chr16 230332 231178
    [Show full text]