Highly Conserved Upstream Sequences for Transcription Factor Genes and Implications for the Regulatory Network

Total Page:16

File Type:pdf, Size:1020Kb

Highly Conserved Upstream Sequences for Transcription Factor Genes and Implications for the Regulatory Network Highly conserved upstream sequences for transcription factor genes and implications for the regulatory network Hisakazu Iwama*† and Takashi Gojobori*‡§ *Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Research Organization of Information and Systems, Yata 1111, Mishima, 411-8540 Japan; and ‡Integrated Database Group, Biological Information Research Center, National Institute of Advanced Industrial Science and Technology, Time 24 Building, 10th Floor, 2-45 Aomi, Koto-ku, Tokyo 135-0064, Japan Communicated by Wen-Hsiung Li, University of Chicago, Chicago, IL, October 15, 2004 (received for review May 27, 2004) Identifying evolutionarily conserved blocks in orthologous We report here that the genes with high upstream conserva- genomic sequences is an effective way to detect regulatory ele- tion are predominantly transcription factor (TF) genes. Further- ments. In this study, with the aim of elucidating the architecture of more, we show that the developmental process-related TF genes the regulatory network, we systematically estimated the degree of have significantly higher conservation of the upstream sequences conservation of the upstream sequences of 3,750 human–mouse than other TF genes. orthologue pairs along 8-kb stretches. We found that the genes with high upstream conservation are predominantly transcription Materials and Methods factor (TF) genes. In particular, developmental process-related TF Orthologue Identification and Upstream Sequence Collection. We genes showed significantly higher conservation of the upstream searched the human and mouse Reference Sequence (RefSeq) sequences than other TF genes. Such extreme upstream conserva- (8) annotations from the National Center for Biotechnology tion of the developmental process-related TF genes suggests that Information (ftp:͞͞ftp.ncbi.nih.gov͞refseq͞LocusLink͞ the regulatory networks involved with developmental processes LL࿝tmpl) for genes whose human and mouse official gene have been evolutionarily well conserved in both human and mouse symbols were identical (9,207 gene pairs, as of February 2, 2004). lineages. Next, we selected only the nuclear protein-coding genes (7,408 genes). For these genes, we collected the corresponding genomic cis-element ͉ development ͉ noncoding ͉ ZFHX1B ͉ Hirschsprung disease sequences, i.e., the RefSeq contig entries, according to the contig feature descriptions in the RefSeq annotations. We surveyed the ross-species genome-wide comparison of noncoding or- entire annotation of every contig to check whether there were Cthologous sequences has been demonstrated to be effective any genic sequences within the 9-kb stretch upstream of the first for identifying regulatory sequences for Saccharomyces species coding site for each of the genes collected. Then, we excised the (1, 2). For higher eukaryotes, orthologous noncoding sequence 8-kb genomic sequence immediately upstream of the coding start comparison has been successfully applied to human and mouse site for every gene that did not contain any descriptions of genic sequences (3–5). These results can contribute to the elucidation regions within its 9-kb upstream stretch. We set a 1-kb margin to decrease the frequency of cases in which the excised 8-kb of the architecture of regulatory networks. Ј However, because comprehensive knowledge regarding reg- sequences overlapped with promoter regions or 3 regulatory sequences of adjacent genes. For genes having alternative coding ulatory networks remains to be elucidated, particularly for Ј higher eukaryotes, direct comparison of their regulatory net- start sites, we always used the most 5 coding start site according works is still difficult. Thus, in the present study, with the aim of to the annotation. elucidating the features of regulatory networks that are charac- teristic of higher eukaryotes, we systematically estimated the Genomic Global Alignment. Initially, we made local nucleotide degree of the sequence conservation upstream of human–mouse alignments of every human–mouse orthologue pair of genomic orthologous genes and categorized the gene function according sequences by using BLAST 2 sequences (9). To appropriately align to the Gene Ontology (GO) Consortium (6). the short conserved regulatory sequences in the noncoding regions, we reduced the mismatch penalty to Ϫ2 and shortened In higher eukaryotes, the regulatory sequences are located in the word size to 7. We processed the resultant set of alignments a wider range outside the coding sequences than in yeast. by using the program REALIGNER, which we developed to obtain However, to date, 85% of mouse regulatory sequences have been genomic global alignments based on the results from BLAST 2 estimated to be located within 2 kb from the promoter, and most sequences. First, we selected the local alignments by using the promoters reside immediately upstream of the transcription start following set of criteria: hit length Ͼ7 bps, identity of 70% or site (7), both of which play major roles in gene expression higher, and hit strand in the same direction. For these local control. Thus, between humans and mice, we can expect that the alignments, REALIGNER performed the following two steps: (i) degree of orthologous upstream sequence conservation in the when two local alignments overlapped, the program removed the kilobase range could reflect the evolutionary conservation of alignment with the lower bit score and retained the other and (ii) features related to gene expression control. when two local alignments were not syntenic, the alignment with In the present study, we examined the upstream sequences of 3,750 human–mouse orthologous gene pairs and constructed a global alignment of the 8-kb upstream sequences for each of the Freely available online through the PNAS open access option. orthologous gene pairs based on their local alignments. To Abbreviations: TF, transcription factor; GO, gene ontology; EPD, Eukaryotic Promoter identify human–mouse orthologous genes, we focused on genes Database. that have been assigned an identical official gene symbol (www. †Present address: Information Technology Center, Kagawa University, 1750-1 Ikenobe, gene.ucl.ac.uk͞nomenclature͞) between humans and mice, be- Miki-cho, Kita-gun, Kagawa Prefect 761-0793, Japan. cause these kinds of genes are annotated not only on the basis §To whom correspondence should be addressed at the * address. E-mail: tgojobor@ of sequence homology but also on evidence from functional and genes.nig.ac.jp. physiological experiments. © 2004 by The National Academy of Sciences of the USA 17156–17161 ͉ PNAS ͉ December 7, 2004 ͉ vol. 101 ͉ no. 49 www.pnas.org͞cgi͞doi͞10.1073͞pnas.0407670101 Downloaded by guest on September 27, 2021 We finally confirmed 347 developmental process-related genes (Ndev). Statistical Analysis. We counted the total number of genes that were assigned any of the GO terms in the categories of molecular function or biological process for either the mouse or human annotation (Ntotal ϭ 2,883). Assuming a binomial distribution of ptf ϭ Ntf͞Ntotal, we calculated the cumulative probability, p,of observing T or more TF genes in the top n genes as follows, unless specified otherwise: n n p ϭ ͸ͩ ͪpi ͑ Ϫ p ͒nϪi i tf 1 tf . Fig. 1. Bar graph showing the frequencies of the 3,750 human–mouse iϭT orthologue pairs relative to the number of identical sites along the 8-kb upstream sequences. The area of each bar corresponds to each relative fre- Retrieval of SNP Information for the ZFHX1B Gene. We searched the quency. The line graph shows the relative frequency of the result of the RefSeq contig annotations for every description of variation simulation study in which 10,000 randomly generated 8-kb sequence pairs linked to the SNP Database (11) (www.ncbi.nlm.nih.gov͞SNP) were processed in the same way as the human–mouse orthologue alignments. within the range of the 8-kb upstream stretch of the ZFHX1B gene. We also confirmed the variation information according to the lower bit score was removed and the other was retained. the H-Invitational Database (12) (www.h-invitational.jp). REALIGNER performed steps i and ii in decreasing order of the Results bit score for each local alignment of each sequence pair. In these steps, if the bit scores to be compared were equal, then the longer Alignment of the 8-kb Upstream Sequences of the Human–Mouse hit-stretch and then the more downstream alignments had the Orthologue Pairs. We identified 9,207 genes whose human and higher priority. Finally, the numbers of identical sites for every mouse official gene symbols were identical. We then selected local alignment were summed for each orthologue pair. only the nuclear protein-coding genes, which amounted to 7,408 genes. We regarded these gene pairs as orthologues. Among Simulation Analysis. We generated 10,000 pairs of 8-kb random these, we were able to collect 9-kb genomic upstream nucleotide sequences. Each pair of 8-kb sequences was generated so that its sequences without any described genic regions for 3,750 ortholo- frequencies of A, T, G, C, and N became proportional to the gous gene pairs. Then, we excised 8-kb stretches upstream of the observed average counts of all of the examined human and translation start sites. We set a 1-kb margin to decrease the mouse 8-kb sequences, respectively. These 10,000 random se- frequency of cases in which the excised 8-kb sequences over- quence pairs were then processed in the same way as described lapped with promoter regions or 3Ј regulatory sequences of above. adjacent genes. For all of the 3,750 pairs of human and mouse genes, the accessions of the contig entries used are shown in Validation of Genomic Alignment Procedures by the Eukaryotic Pro- Table 3, which is published as supporting information on the moter Database (EPD) Data Set. We downloaded the EPD (10) data PNAS web site, together with the positions of the excised set (Release 77࿝1, February 2004) from ftp:͞͞ftp.epd.unil.ch͞ sequences. Finally, we were able to make a global pairwise pub͞databases͞epd͞77࿝1.
Recommended publications
  • Supplementary Table 1: Adhesion Genes Data Set
    Supplementary Table 1: Adhesion genes data set PROBE Entrez Gene ID Celera Gene ID Gene_Symbol Gene_Name 160832 1 hCG201364.3 A1BG alpha-1-B glycoprotein 223658 1 hCG201364.3 A1BG alpha-1-B glycoprotein 212988 102 hCG40040.3 ADAM10 ADAM metallopeptidase domain 10 133411 4185 hCG28232.2 ADAM11 ADAM metallopeptidase domain 11 110695 8038 hCG40937.4 ADAM12 ADAM metallopeptidase domain 12 (meltrin alpha) 195222 8038 hCG40937.4 ADAM12 ADAM metallopeptidase domain 12 (meltrin alpha) 165344 8751 hCG20021.3 ADAM15 ADAM metallopeptidase domain 15 (metargidin) 189065 6868 null ADAM17 ADAM metallopeptidase domain 17 (tumor necrosis factor, alpha, converting enzyme) 108119 8728 hCG15398.4 ADAM19 ADAM metallopeptidase domain 19 (meltrin beta) 117763 8748 hCG20675.3 ADAM20 ADAM metallopeptidase domain 20 126448 8747 hCG1785634.2 ADAM21 ADAM metallopeptidase domain 21 208981 8747 hCG1785634.2|hCG2042897 ADAM21 ADAM metallopeptidase domain 21 180903 53616 hCG17212.4 ADAM22 ADAM metallopeptidase domain 22 177272 8745 hCG1811623.1 ADAM23 ADAM metallopeptidase domain 23 102384 10863 hCG1818505.1 ADAM28 ADAM metallopeptidase domain 28 119968 11086 hCG1786734.2 ADAM29 ADAM metallopeptidase domain 29 205542 11085 hCG1997196.1 ADAM30 ADAM metallopeptidase domain 30 148417 80332 hCG39255.4 ADAM33 ADAM metallopeptidase domain 33 140492 8756 hCG1789002.2 ADAM7 ADAM metallopeptidase domain 7 122603 101 hCG1816947.1 ADAM8 ADAM metallopeptidase domain 8 183965 8754 hCG1996391 ADAM9 ADAM metallopeptidase domain 9 (meltrin gamma) 129974 27299 hCG15447.3 ADAMDEC1 ADAM-like,
    [Show full text]
  • Genome-Wide DNA Methylation Analysis of KRAS Mutant Cell Lines Ben Yi Tew1,5, Joel K
    www.nature.com/scientificreports OPEN Genome-wide DNA methylation analysis of KRAS mutant cell lines Ben Yi Tew1,5, Joel K. Durand2,5, Kirsten L. Bryant2, Tikvah K. Hayes2, Sen Peng3, Nhan L. Tran4, Gerald C. Gooden1, David N. Buckley1, Channing J. Der2, Albert S. Baldwin2 ✉ & Bodour Salhia1 ✉ Oncogenic RAS mutations are associated with DNA methylation changes that alter gene expression to drive cancer. Recent studies suggest that DNA methylation changes may be stochastic in nature, while other groups propose distinct signaling pathways responsible for aberrant methylation. Better understanding of DNA methylation events associated with oncogenic KRAS expression could enhance therapeutic approaches. Here we analyzed the basal CpG methylation of 11 KRAS-mutant and dependent pancreatic cancer cell lines and observed strikingly similar methylation patterns. KRAS knockdown resulted in unique methylation changes with limited overlap between each cell line. In KRAS-mutant Pa16C pancreatic cancer cells, while KRAS knockdown resulted in over 8,000 diferentially methylated (DM) CpGs, treatment with the ERK1/2-selective inhibitor SCH772984 showed less than 40 DM CpGs, suggesting that ERK is not a broadly active driver of KRAS-associated DNA methylation. KRAS G12V overexpression in an isogenic lung model reveals >50,600 DM CpGs compared to non-transformed controls. In lung and pancreatic cells, gene ontology analyses of DM promoters show an enrichment for genes involved in diferentiation and development. Taken all together, KRAS-mediated DNA methylation are stochastic and independent of canonical downstream efector signaling. These epigenetically altered genes associated with KRAS expression could represent potential therapeutic targets in KRAS-driven cancer. Activating KRAS mutations can be found in nearly 25 percent of all cancers1.
    [Show full text]
  • Accompanies CD8 T Cell Effector Function Global DNA Methylation
    Global DNA Methylation Remodeling Accompanies CD8 T Cell Effector Function Christopher D. Scharer, Benjamin G. Barwick, Benjamin A. Youngblood, Rafi Ahmed and Jeremy M. Boss This information is current as of October 1, 2021. J Immunol 2013; 191:3419-3429; Prepublished online 16 August 2013; doi: 10.4049/jimmunol.1301395 http://www.jimmunol.org/content/191/6/3419 Downloaded from Supplementary http://www.jimmunol.org/content/suppl/2013/08/20/jimmunol.130139 Material 5.DC1 References This article cites 81 articles, 25 of which you can access for free at: http://www.jimmunol.org/content/191/6/3419.full#ref-list-1 http://www.jimmunol.org/ Why The JI? Submit online. • Rapid Reviews! 30 days* from submission to initial decision • No Triage! Every submission reviewed by practicing scientists by guest on October 1, 2021 • Fast Publication! 4 weeks from acceptance to publication *average Subscription Information about subscribing to The Journal of Immunology is online at: http://jimmunol.org/subscription Permissions Submit copyright permission requests at: http://www.aai.org/About/Publications/JI/copyright.html Email Alerts Receive free email-alerts when new articles cite this article. Sign up at: http://jimmunol.org/alerts The Journal of Immunology is published twice each month by The American Association of Immunologists, Inc., 1451 Rockville Pike, Suite 650, Rockville, MD 20852 Copyright © 2013 by The American Association of Immunologists, Inc. All rights reserved. Print ISSN: 0022-1767 Online ISSN: 1550-6606. The Journal of Immunology Global DNA Methylation Remodeling Accompanies CD8 T Cell Effector Function Christopher D. Scharer,* Benjamin G. Barwick,* Benjamin A. Youngblood,*,† Rafi Ahmed,*,† and Jeremy M.
    [Show full text]
  • Identification of Potential Key Genes and Pathway Linked with Sporadic Creutzfeldt-Jakob Disease Based on Integrated Bioinformatics Analyses
    medRxiv preprint doi: https://doi.org/10.1101/2020.12.21.20248688; this version posted December 24, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission. Identification of potential key genes and pathway linked with sporadic Creutzfeldt-Jakob disease based on integrated bioinformatics analyses Basavaraj Vastrad1, Chanabasayya Vastrad*2 , Iranna Kotturshetti 1. Department of Biochemistry, Basaveshwar College of Pharmacy, Gadag, Karnataka 582103, India. 2. Biostatistics and Bioinformatics, Chanabasava Nilaya, Bharthinagar, Dharwad 580001, Karanataka, India. 3. Department of Ayurveda, Rajiv Gandhi Education Society`s Ayurvedic Medical College, Ron, Karnataka 562209, India. * Chanabasayya Vastrad [email protected] Ph: +919480073398 Chanabasava Nilaya, Bharthinagar, Dharwad 580001 , Karanataka, India NOTE: This preprint reports new research that has not been certified by peer review and should not be used to guide clinical practice. medRxiv preprint doi: https://doi.org/10.1101/2020.12.21.20248688; this version posted December 24, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission. Abstract Sporadic Creutzfeldt-Jakob disease (sCJD) is neurodegenerative disease also called prion disease linked with poor prognosis. The aim of the current study was to illuminate the underlying molecular mechanisms of sCJD. The mRNA microarray dataset GSE124571 was downloaded from the Gene Expression Omnibus database. Differentially expressed genes (DEGs) were screened.
    [Show full text]
  • Single Cell Regulatory Landscape of the Mouse Kidney Highlights Cellular Differentiation Programs and Disease Targets
    ARTICLE https://doi.org/10.1038/s41467-021-22266-1 OPEN Single cell regulatory landscape of the mouse kidney highlights cellular differentiation programs and disease targets Zhen Miao 1,2,3,8, Michael S. Balzer 1,2,8, Ziyuan Ma 1,2,8, Hongbo Liu1,2, Junnan Wu 1,2, Rojesh Shrestha 1,2, Tamas Aranyi1,2, Amy Kwan4, Ayano Kondo 4, Marco Pontoglio 5, Junhyong Kim6, ✉ Mingyao Li 7, Klaus H. Kaestner2,4 & Katalin Susztak 1,2,4 1234567890():,; Determining the epigenetic program that generates unique cell types in the kidney is critical for understanding cell-type heterogeneity during tissue homeostasis and injury response. Here, we profile open chromatin and gene expression in developing and adult mouse kidneys at single cell resolution. We show critical reliance of gene expression on distal regulatory elements (enhancers). We reveal key cell type-specific transcription factors and major gene- regulatory circuits for kidney cells. Dynamic chromatin and expression changes during nephron progenitor differentiation demonstrates that podocyte commitment occurs early and is associated with sustained Foxl1 expression. Renal tubule cells follow a more complex differentiation, where Hfn4a is associated with proximal and Tfap2b with distal fate. Mapping single nucleotide variants associated with human kidney disease implicates critical cell types, developmental stages, genes, and regulatory mechanisms. The single cell multi-omics atlas reveals key chromatin remodeling events and gene expression dynamics associated with kidney development. 1 Renal, Electrolyte, and Hypertension Division, Department of Medicine, University of Pennsylvania, Perelman School of Medicine, Philadelphia, PA, USA. 2 Institute for Diabetes, Obesity, and Metabolism, University of Pennsylvania, Perelman School of Medicine, Philadelphia, PA, USA.
    [Show full text]
  • Diversification of Behavior and Postsynaptic Properties by Netrin-G
    Zhang et al. Molecular Brain (2016) 9:6 DOI 10.1186/s13041-016-0187-5 RESEARCH Open Access Diversification of behavior and postsynaptic properties by netrin-G presynaptic adhesion family proteins Qi Zhang, Hiromichi Goto, Sachiko Akiyoshi-Nishimura, Pavel Prosselkov, Chie Sano, Hiroshi Matsukawa, Kunio Yaguchi, Toshiaki Nakashiba and Shigeyoshi Itohara* Abstract Background: Vertebrate-specific neuronal genes are expected to play a critical role in the diversification and evolution of higher brain functions. Among them, the glycosylphosphatidylinositol (GPI)-anchored netrin-G subfamily members in the UNC6/netrin family are unique in their differential expression patterns in many neuronal circuits, and differential binding ability to their cognate homologous post-synaptic receptors. Results: To gain insight into the roles of these genes in higher brain functions, we performed comprehensive behavioral batteries using netrin-G knockout mice. We found that two netrin-G paralogs that recently diverged in evolution, netrin-G1 and netrin-G2 (gene symbols: Ntng1 and Ntng2, respectively), were responsible for complementary behavioral functions. Netrin-G2, but not netrin-G1, encoded demanding sensorimotor functions. Both paralogs were responsible for complex vertebrate-specific cognitive functions and fine-scale regulation of basic adaptive behaviors conserved between invertebrates and vertebrates, such as spatial reference and working memory, attention, impulsivity and anxiety etc. Remarkably, netrin-G1 and netrin-G2 encoded a genetic “division of labor” in behavioral regulation, selectively mediating different tasks or even different details of the same task. At the cellular level, netrin-G1 and netrin-G2 differentially regulated the sub-synaptic localization of their cognate receptors and differentiated the properties of postsynaptic scaffold proteins in complementary neural pathways.
    [Show full text]
  • Evidence for a Role of Developmental Genes in the Origin of Obesity and Body Fat Distribution
    Evidence for a role of developmental genes in the origin of obesity and body fat distribution Stephane Gesta*, Matthias Blu¨ her†, Yuji Yamamoto*, Andrew W. Norris*, Janin Berndt†, Susan Kralisch†, Jeremie Boucher*, Choy Lewis*, and C. Ronald Kahn*‡ *Joslin Diabetes Center and Harvard Medical School, Boston, MA 02215; and †Department of Internal Medicine III, University of Leipzig, 04103 Leipzig, Germany Contributed by C. Ronald Kahn, March 9, 2006 Obesity, especially central obesity, is a hereditable trait associated In the present study, we have explored the hypothesis that with a high risk for development of diabetes and metabolic patterns of fat distribution and, perhaps, to some degree, obesity disorders. Combined gene expression analysis of adipocyte- and itself may have a developmental genetic origin. Indeed, we find preadipocyte-containing fractions from intraabdominal and sub- major differences in expression of multiple genes involved in cutaneous adipose tissue of mice revealed coordinated depot- embryonic development and pattern specification between adipo- specific differences in expression of multiple genes involved in cytes taken from intraabdominal and s.c. depots in rodents and embryonic development and pattern specification. These differ- humans. We also demonstrate similar differences in the stromo- ences were intrinsic and persisted during in vitro culture and vascular fraction (SVF)-containing preadipocytes and that these differentiation. Similar depot-specific differences in expression of differences persist in culture. Most importantly, we demonstrate developmental genes were observed in human subcutaneous ver- that some of these developmental genes exhibit changes in expres- sus visceral adipose tissue. Furthermore, in humans, several genes sion that are closely correlated with the level of obesity and the exhibited changes in expression that correlated closely with body pattern of fat distribution.
    [Show full text]
  • Supplementary Material Sumoylation Regulates the Chromatin
    Supplementary material SUMOylation Regulates the Chromatin Occupancy and Anti-Proliferative Gene Programs of Glucocorticoid Receptor Ville Paakinaho, Sanna Kaikkonen, Harri Makkonen, Vladimir Benes, and Jorma J. Palvimo A FRT wtGR GR3KR clone wt-1 wt-2 wt-3 3KR-1 3KR-2 3KR-3 - - - - - - - dex + + + + + + + kDa a-GR - 100 a-GAPDH - 35 A549 + - - HEK293-wtGR - + - - - HEK293-GR3KR + kDa a-GR - 100 a-GAPDH - 35 B wtGR GR3KR 43°C - - + + - - + + dex - + - + - + - + kDa - 170 a-GR - 130 - 100 a-GAPDH - 35 - 170 a-SUMO-1 - 130 R - 100 G - a : P I a-SUMO-2/3 - 170 - 130 - 100 Figure S1. Expression and SUMOylation of GR in stable isogenic HEK293 cells. (A) Upper panel, immunoblot analysis of three HEK293 cell clones expressing either wtGR (wt-1-3) or GR3KR (3KR-1-3). The clones and background HEK293 (FRT) cells were exposed to vehicle or dex for 17 h, and cell samples were immunoblotting with anti-GR (Santa Cruz, sc-1003) and anti- GAPDH (Santa Cruz, sc-25778) antibodies. Lower panel, comparison of the GR level in A549 cells (from ATCC) with that of the selected clones from isogenic HEK293 cells (wt-3 and 3KR-3) by immunoblotting. (B) To analyse GR SUMOylation, wtGR- and GR3KR-expressing cells were grown with of without dex for 2 h and exposed for 30 min to 43°C. Immunoprecipitation (IP) with anti- GR antibody and immunoblotting with anti-SUMO-1 (Invitrogen, 33-2400) or anti-SUMO-2/3 (MBL M114-3) antibody were performed essentially as described (Rytinki et al. 2011, Methods Mol. Biol.). A total dex- sensitivity induced for dex expression ADARB1
    [Show full text]
  • Supplementary Material Computational Prediction of SARS
    Supplementary_Material Computational prediction of SARS-CoV-2 encoded miRNAs and their putative host targets Sheet_1 List of potential stem-loop structures in SARS-CoV-2 genome as predicted by VMir. Rank Name Start Apex Size Score Window Count (Absolute) Direct Orientation 1 MD13 2801 2864 125 243.8 61 2 MD62 11234 11286 101 211.4 49 4 MD136 27666 27721 104 205.6 119 5 MD108 21131 21184 110 204.7 210 9 MD132 26743 26801 119 188.9 252 19 MD56 9797 9858 128 179.1 59 26 MD139 28196 28233 72 170.4 133 28 MD16 2934 2974 76 169.9 71 43 MD103 20002 20042 80 159.3 403 46 MD6 1489 1531 86 156.7 171 51 MD17 2981 3047 131 152.8 38 87 MD4 651 692 75 140.3 46 95 MD7 1810 1872 121 137.4 58 116 MD140 28217 28252 72 133.8 62 122 MD55 9712 9758 96 132.5 49 135 MD70 13171 13219 93 130.2 131 164 MD95 18782 18820 79 124.7 184 173 MD121 24086 24135 99 123.1 45 176 MD96 19046 19086 75 123.1 179 196 MD19 3197 3236 76 120.4 49 200 MD86 17048 17083 73 119.8 428 223 MD75 14534 14600 137 117 51 228 MD50 8824 8870 94 115.8 79 234 MD129 25598 25642 89 115.6 354 Reverse Orientation 6 MR61 19088 19132 88 197.8 271 10 MR72 23563 23636 148 188.8 286 11 MR11 3775 3844 136 185.1 116 12 MR94 29532 29582 94 184.6 271 15 MR43 14973 15028 109 183.9 226 27 MR14 4160 4206 89 170 241 34 MR35 11734 11792 111 164.2 37 52 MR5 1603 1652 89 152.7 118 53 MR57 18089 18132 101 152.7 139 94 MR8 2804 2864 122 137.4 38 107 MR58 18474 18508 72 134.9 237 117 MR16 4506 4540 72 133.8 311 120 MR34 10010 10048 82 132.7 245 133 MR7 2534 2578 90 130.4 75 146 MR79 24766 24808 75 127.9 59 150 MR65 21528 21576 99 127.4 83 180 MR60 19016 19049 70 122.5 72 187 MR51 16450 16482 75 121 363 190 MR80 25687 25734 96 120.6 75 198 MR64 21507 21544 70 120.3 35 206 MR41 14500 14542 84 119.2 94 218 MR84 26840 26894 108 117.6 94 Sheet_2 List of stable stem-loop structures based on MFE.
    [Show full text]
  • (12) United States Patent (10) Patent No.: US 9,476,099 B2 Spinella Et Al
    US009476.099B2 (12) United States Patent (10) Patent No.: US 9,476,099 B2 Spinella et al. (45) Date of Patent: Oct. 25, 2016 (54) METHOD FOR DETERMINING FOREIGN PATENT DOCUMENTS SENSTIVITY TO DECTABINE WO WO 2012/031008 A2 3, 2012 TREATMENT WO WO 2012, 11885.6 A1 9, 2012 (71) Applicant: TRUSTEES OF DARTMOUTH COLLEGE, Hanover, NH (US) OTHER PUBLICATIONS (72) Inventors: Michael Spinella, Hanover, NH (US); Tsai et al Cancer Cell. Mar. 20, 2012. 21(3):43.0-446 and Supple Maroun J. Beyrouthy, Lebanon, NH mental pages 1-18. (US) Agilent Technologies. DNA Oligo Microarray Gene Lists and Annotations, available via url: <chem.agilent.com/cag?bSp? gene (73) Assignee: Trustees of Dartmouth College, lists.asp printed on Mar. 1, 2016.* Hanover, NH (US) Abele et al. “The EORTC Early Clinical Trials Cooperative Group Experience with 5-Aza-2'-deoxycytidine (NSC 127716) in Patients (*) Notice: Subject to any disclaimer, the term of this with Colo-rectal, Head and Neck, Renal Carcinomas and Malignant patent is extended or adjusted under 35 Melanomas' European Journal of Cancer and Clinical Oncology U.S.C. 154(b) by 0 days. 1987 23(12): 1921-1924. Adewumi et al. “Characterization of Human Embryonic Stem Cell (21) Appl. No.: 14/416,142 Lines by the International StemCell Initiative” Nature Biotechnol ogy 2007 vol. 25(7):803-816. (22) PCT Filed: Jul. 31, 2013 Al-Hajj et al. "Prospective Identification of Tumorigenic Breast Cancer Cells' Proceedings of the National Academy of Sciences (86). PCT No.: PCT/US2013/052899 2003 100(7):3983-3988 with correction.
    [Show full text]
  • Aberrant HOXC Expression Accompanies the Malignant Phenotype in Human Prostate1
    [CANCER RESEARCH 63, 5879–5888, September 15, 2003] Aberrant HOXC Expression Accompanies the Malignant Phenotype in Human Prostate1 Gary J. Miller,2 Heidi L. Miller, Adrie van Bokhoven, James R. Lambert, Priya N. Werahera, Osvaldo Schirripa,3 M. Scott Lucia, and Steven K. Nordeen4 Department of Pathology, University of Colorado Health Sciences Center, Denver, Colorado 80262 ABSTRACT breast (13, 14), and renal (15) carcinomas; melanomas (16); and squamous carcinomas of the skin (17). Because the genes implicated Dysregulation of HOX gene expression has been implicated as a factor show little consensus, the dysregulation may be a tissue-specific in malignancies for a number of years. However, no consensus has perturbation of the existing HOX expression pattern rather than a emerged regarding specific causative genes. Using a degenerate reverse transcription-PCR technique, we show up-regulation of genes from the single causative gene. Tissue-specific expression patterns have been HOXC cluster in malignant prostate cell lines and lymph node metastases. reported in kidney and colon, by Northern blot analysis (12, 15). When relative expression levels of the four HOX clusters were examined, Primary tumors in both kidney and colon showed variations in spe- lymph node metastases and cell lines derived from lymph node metastases cific HOX gene expression from the corresponding normal tissue, but exhibited very similar patterns, patterns distinct from those in benign cells overall expression patterns for individual tumors were not reported. or malignant cell lines derived from other tumor sites. Specific reverse Only primary kidney tumors were examined (15), but liver metastases transcription-PCR for HOXC4, HOXC5, HOXC6, and HOXC8 confirmed from colon tumors reportedly displayed expression of specific HOX overexpression of these genes in malignant cell lines and lymph node genes similar to that seen in either primary colon tumors or normal metastases.
    [Show full text]
  • Strand Breaks for P53 Exon 6 and 8 Among Different Time Course of Folate Depletion Or Repletion in the Rectosigmoid Mucosa
    SUPPLEMENTAL FIGURE COLON p53 EXONIC STRAND BREAKS DURING FOLATE DEPLETION-REPLETION INTERVENTION Supplemental Figure Legend Strand breaks for p53 exon 6 and 8 among different time course of folate depletion or repletion in the rectosigmoid mucosa. The input of DNA was controlled by GAPDH. The data is shown as ΔCt after normalized to GAPDH. The higher ΔCt the more strand breaks. The P value is shown in the figure. SUPPLEMENT S1 Genes that were significantly UPREGULATED after folate intervention (by unadjusted paired t-test), list is sorted by P value Gene Symbol Nucleotide P VALUE Description OLFM4 NM_006418 0.0000 Homo sapiens differentially expressed in hematopoietic lineages (GW112) mRNA. FMR1NB NM_152578 0.0000 Homo sapiens hypothetical protein FLJ25736 (FLJ25736) mRNA. IFI6 NM_002038 0.0001 Homo sapiens interferon alpha-inducible protein (clone IFI-6-16) (G1P3) transcript variant 1 mRNA. Homo sapiens UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acetylgalactosaminyltransferase 15 GALNTL5 NM_145292 0.0001 (GALNT15) mRNA. STIM2 NM_020860 0.0001 Homo sapiens stromal interaction molecule 2 (STIM2) mRNA. ZNF645 NM_152577 0.0002 Homo sapiens hypothetical protein FLJ25735 (FLJ25735) mRNA. ATP12A NM_001676 0.0002 Homo sapiens ATPase H+/K+ transporting nongastric alpha polypeptide (ATP12A) mRNA. U1SNRNPBP NM_007020 0.0003 Homo sapiens U1-snRNP binding protein homolog (U1SNRNPBP) transcript variant 1 mRNA. RNF125 NM_017831 0.0004 Homo sapiens ring finger protein 125 (RNF125) mRNA. FMNL1 NM_005892 0.0004 Homo sapiens formin-like (FMNL) mRNA. ISG15 NM_005101 0.0005 Homo sapiens interferon alpha-inducible protein (clone IFI-15K) (G1P2) mRNA. SLC6A14 NM_007231 0.0005 Homo sapiens solute carrier family 6 (neurotransmitter transporter) member 14 (SLC6A14) mRNA.
    [Show full text]