Distal Cpg Islands Can Serve As Alternative Promoters to Transcribe 2 Genes with Silenced Proximal-Promoters

Total Page:16

File Type:pdf, Size:1020Kb

Distal Cpg Islands Can Serve As Alternative Promoters to Transcribe 2 Genes with Silenced Proximal-Promoters Downloaded from genome.cshlp.org on September 25, 2021 - Published by Cold Spring Harbor Laboratory Press 1 Distal CpG islands can serve as alternative promoters to transcribe 2 genes with silenced proximal-promoters. 3 4 Shrutii Sarda1, Avinash Das1, Charles Vinson2 and Sridhar Hannenhalli1,* 5 1Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD, USA 6 2Center for Cancer Research, National Cancer Institute, NIH, Bethesda, MD, USA 7 8 *Corresponding author: 9 Sridhar Hannenhalli 10 3104G Biomolecular Sciences Building 11 University of Maryland 12 College Park, MD 20742 13 301 405 8219 (v) 14 301 314 1341 (f) 15 [email protected] 16 17 Subject categories: Transcriptional Regulation, Epigenetics and Computational Biology 18 Keywords: DNA methylation, promoter silencing, CpG islands, alternative promoters, genome-scale. 19 20 Running title: Orphan CGIs transcribe methylated-promoter genes. 21 22 23 24 25 26 27 Downloaded from genome.cshlp.org on September 25, 2021 - Published by Cold Spring Harbor Laboratory Press 28 ABSTRACT 29 DNA methylation at the promoter of a gene is presumed to render it silent, yet, a sizable fraction of 30 genes with methylated proximal-promoters exhibit elevated expression. Here, we show, through 31 extensive analysis of the methylome and transcriptome in 34 tissues, that in many such cases, 32 transcription is initiated by a distal upstream CpG island (CGI) located several kilobases away that 33 functions as an alternative promoter. Specifically, such genes are expressed precisely when the 34 neighboring CGI remains unmethylated, but remain silenced otherwise. Based on CAGE and PolII 35 localization data, we found strong evidence of transcription initiation at the upstream CGI and a lack 36 thereof at the methylated proximal promoter itself. Consistent with their alternative promoter activity, 37 CGI-initiated transcripts are associated with signals of stable elongation and splicing that extend into the 38 gene body, as evidenced by tissue-specific RNA-seq and other DNA-encoded splice signals. Furthermore, 39 based on both inter and intra-species analyses, such CGIs were found to be under greater purifying 40 selection relative to CGIs upstream of silenced genes. Overall, our study describes a hitherto unreported 41 conserved mechanism of transcription of genes with methylated proximal promoters in a tissue-specific 42 fashion. Importantly, this phenomenon explains the aberrant expression patterns of some cancer driver 43 genes, potentially due to aberrant hypomethylation of distal CGIs, despite methylation at proximal 44 promoters. 45 INTRODUCTION 46 In mammalian DNA, cytosines within CpG dinucleotides are heavily methylated throughout the genome, 47 yet there are several discrete “islands” that contain a high frequency of unmethylated CpG sites. These 48 are called CpG islands (CGI), and their identification has long been considered important in the 49 annotation of functional landmarks within the genome. Historically, CGIs served as landing strips to 50 locate annotated genes (Larsen et al. 1992), and it was for good reason as it was later discovered that 51 55-60% of all genes contain CGIs at their annotated promoters. While about half of all CGIs in the 52 genome coincide with gene promoters, the remaining half are either intragenic or intergenic and were 53 termed “orphan CGIs” due to their remote location that suggested the uncertainty over their biological 54 significance (Deaton and Bird 2011). 55 Does there exist evidence to support the idea that orphan CGIs are involved in gene regulation? Indeed, 56 several specific examples, showing promoter activity at orphan CGIs were uncovered in the context of 57 critical functions like imprinting and development (Deaton et al. 2011). For example, a CGI in intron 10 58 of the imprinted Kcnq1 gene (Mancini-DiNardo et al. 2003) promotes the initiation of a noncoding 59 transcript (Kcnq1ot1) required for the imprinting of several genes at this locus. Tissue-specific 60 alternative promoter activity was detected at another orphan CGI that promotes a specific isoform of 61 the Rapgef4 gene (Hoivik et al. 2013). Cumulative evidence suggest that most, perhaps all, CGIs have 62 promoter-like characteristics and are sites of transcription initiation (Illingworth et al. 2010). 63 Additionally, most of the conserved methylation differences between tissues occurred at orphan CGIs 64 (Illingworth and Bird 2009), suggesting that they are tightly regulated. A recent study that derived CGI 65 annotations from experimental methylation data (eCGIs) also showed that promoter-distal eCGIs 66 exhibited the most tissue-specific methylation patterns, and were linked to the tissue-specific 67 production of alternative transcripts (Mendizabal and Yi 2015). In fact, studies profiling CpG methylation 68 patterns have identified differentially methylated regions (DMRs) even in the shores of CpG islands 69 (Pollard et al. 2009). These regions of lower CpG density in close proximity (up to 2 kb) to CGIs, whose Downloaded from genome.cshlp.org on September 25, 2021 - Published by Cold Spring Harbor Laboratory Press 70 differential methylation patterns are strongly related to gene expression, are highly conserved and have 71 distinct tissue-specific methylation patterns (Irizarry et al. 2009a). Thus, over time, CG-dense genomic 72 loci (viz., CGIs and their shores) have been realized to be increasingly important in many functional 73 contexts, whose immense regulatory potential outside of annotated promoters is only beginning to be 74 understood. 75 Typically, methylation at a gene’s promoter renders it silent (Han et al. 2011) by modifying DNA 76 accessibility to the transcriptional machinery (Suzuki and Bird 2008), or by recruiting factors that aid in 77 generating a refractory chromatin conformation unsuitable for transcription (Kouzarides 2007). While 78 several prior studies (Suzuki and Bird 2008; van Eijk et al. 2012; Deaton and Bird 2011; Smith et al. 2012; 79 Sproul et al. 2011) have observed strongly negative correlations between promoter methylation and 80 gene expression, others report more nuanced relationships between the two (Wagner et al. 2014; Wan 81 et al. 2015; Shilpa et al. 2014; Martino and Saffery 2015), including a lack thereof. Additionally, there are 82 several instances of genes in cancer cells, wherein abnormal expression is persistent despite widespread 83 promoter hypermethylation (Guillaumet-Adkins et al. 2014; Moarii et al. 2015; Van Vlodrop et al. 2011). 84 These collectively indicate that additional factors controlling expression of genes with methylated 85 promoters have not been identified. Furthermore, due to the long standing interest in CGI-promoter 86 genes, most of this knowledge is based on analysis of CGI-promoters (Jones 2012), and the details of the 87 role of methylation in controlling non-CGI transcription start sites (TSSs) have largely been overlooked. 88 This lack of consensus prompted us to explore the global transcriptional landscape of methylated- 89 promoter genes. We found that substantial numbers of methylated-promoter genes (~1500 in each of 90 34 tissues) are expressed at high levels; such promoters are predominantly non-CGI, which is consistent 91 with prevailing knowledge on the rarity of methylation at CGI promoters (Illingworth et al. 2010; 92 Brandeis et al. 1994; Lienert et al. 2011). While the expression of many such genes can be attributed to 93 the use of alternate gene body promoters, as has been shown in some normal and cancer cells 94 (Nagarajan et al. 2014; Maunakea et al. 2010), we estimate that the high levels of expression realized by 95 almost 50% of all methylated-promoter genes remain completely unexplained (see Results). 96 Here we show, through detailed analyses across 34 primary tissues and cell types, that genes with 97 methylated and silenced promoters can in some instances utilize an upstream, hitherto unknown, CpG 98 island as an alternative promoter to express their gene product. Our results strongly support this 99 previously unreported general regulatory mechanism, which may play a role in promoting aberrant 100 transcription of driver genes in cancer cells. 101 RESULTS 102 Highly expressed genes with methylated promoters 103 We obtained RNA-seq expression and whole genome bisulfite sequencing (WGBS) methylation data for 104 30 primary tissues and 4 cell lines from the Roadmap Epigenomics Project (Bernstein et al. 2010) and 105 other sources (Lay et al. 2015; Ziller et al. 2013; Djebali et al. 2012; Menafra et al. 2014) (Supplemental 106 Table 1). Henceforth, we will refer to these 34 samples simply as ‘tissue types’. In a given tissue type, 107 there exists, on average about 9000 genes whose primary promoters are maintained in a heavily 108 methylated state (see Methods). Although methylation at a gene’s promoter is expected to render it 109 silent, we observed that ~1500 of such genes exhibited high levels of expression. We then excluded 110 genes whose expression could be explained by alternative gene body promoter activity (see Methods), Downloaded from genome.cshlp.org on September 25, 2021 - Published by Cold Spring Harbor Laboratory Press 111 and this resulted in 700 genes in each tissue whose expression remains unexplained. To specifically 112 assess the involvement of the closest upstream CGI in the expression of these genes, we restricted 113 downstream analysis to only those genes that did not have another gene annotated (including non- 114 coding RNAs) in the genomic region between the gene’s transcription start site and the CGI. This 115 eliminates potential biases owing to intervening transcriptional activity. We further verified that this 116 subset of genes was not enriched for any specific biological function or expression status compared to 117 the set of all methylated-promoter genes (Supplemental Fig. 1). These filters resulted in a set of ~3200 118 methylated-promoter genes (down from ~9000 overall) out of which ~440 (down from ~1500) are highly 119 expressed per tissue. The number of genes at various filtering stages across tissues are provided in 120 Supplemental Table 2.
Recommended publications
  • Reconstructing Cell Cycle Pseudo Time-Series Via Single-Cell Transcriptome Data—Supplement
    School of Natural Sciences and Mathematics Reconstructing Cell Cycle Pseudo Time-Series Via Single-Cell Transcriptome Data—Supplement UT Dallas Author(s): Michael Q. Zhang Rights: CC BY 4.0 (Attribution) ©2017 The Authors Citation: Liu, Zehua, Huazhe Lou, Kaikun Xie, Hao Wang, et al. 2017. "Reconstructing cell cycle pseudo time-series via single-cell transcriptome data." Nature Communications 8, doi:10.1038/s41467-017-00039-z This document is being made freely available by the Eugene McDermott Library of the University of Texas at Dallas with permission of the copyright owner. All rights are reserved under United States copyright law unless specified otherwise. File name: Supplementary Information Description: Supplementary figures, supplementary tables, supplementary notes, supplementary methods and supplementary references. CCNE1 CCNE1 CCNE1 CCNE1 36 40 32 34 32 35 30 32 28 30 30 28 28 26 24 25 Normalized Expression Normalized Expression Normalized Expression Normalized Expression 26 G1 S G2/M G1 S G2/M G1 S G2/M G1 S G2/M Cell Cycle Stage Cell Cycle Stage Cell Cycle Stage Cell Cycle Stage CCNE1 CCNE1 CCNE1 CCNE1 40 32 40 40 35 30 38 30 30 28 36 25 26 20 20 34 Normalized Expression Normalized Expression Normalized Expression 24 Normalized Expression G1 S G2/M G1 S G2/M G1 S G2/M G1 S G2/M Cell Cycle Stage Cell Cycle Stage Cell Cycle Stage Cell Cycle Stage Supplementary Figure 1 | High stochasticity of single-cell gene expression means, as demonstrated by relative expression levels of gene Ccne1 using the mESC-SMARTer data. For every panel, 20 sample cells were randomly selected for each of the three stages, followed by plotting the mean expression levels at each stage.
    [Show full text]
  • 1 AGING Supplementary Table 2
    SUPPLEMENTARY TABLES Supplementary Table 1. Details of the eight domain chains of KIAA0101. Serial IDENTITY MAX IN COMP- INTERFACE ID POSITION RESOLUTION EXPERIMENT TYPE number START STOP SCORE IDENTITY LEX WITH CAVITY A 4D2G_D 52 - 69 52 69 100 100 2.65 Å PCNA X-RAY DIFFRACTION √ B 4D2G_E 52 - 69 52 69 100 100 2.65 Å PCNA X-RAY DIFFRACTION √ C 6EHT_D 52 - 71 52 71 100 100 3.2Å PCNA X-RAY DIFFRACTION √ D 6EHT_E 52 - 71 52 71 100 100 3.2Å PCNA X-RAY DIFFRACTION √ E 6GWS_D 41-72 41 72 100 100 3.2Å PCNA X-RAY DIFFRACTION √ F 6GWS_E 41-72 41 72 100 100 2.9Å PCNA X-RAY DIFFRACTION √ G 6GWS_F 41-72 41 72 100 100 2.9Å PCNA X-RAY DIFFRACTION √ H 6IIW_B 2-11 2 11 100 100 1.699Å UHRF1 X-RAY DIFFRACTION √ www.aging-us.com 1 AGING Supplementary Table 2. Significantly enriched gene ontology (GO) annotations (cellular components) of KIAA0101 in lung adenocarcinoma (LinkedOmics). Leading Description FDR Leading Edge Gene EdgeNum RAD51, SPC25, CCNB1, BIRC5, NCAPG, ZWINT, MAD2L1, SKA3, NUF2, BUB1B, CENPA, SKA1, AURKB, NEK2, CENPW, HJURP, NDC80, CDCA5, NCAPH, BUB1, ZWILCH, CENPK, KIF2C, AURKA, CENPN, TOP2A, CENPM, PLK1, ERCC6L, CDT1, CHEK1, SPAG5, CENPH, condensed 66 0 SPC24, NUP37, BLM, CENPE, BUB3, CDK2, FANCD2, CENPO, CENPF, BRCA1, DSN1, chromosome MKI67, NCAPG2, H2AFX, HMGB2, SUV39H1, CBX3, TUBG1, KNTC1, PPP1CC, SMC2, BANF1, NCAPD2, SKA2, NUP107, BRCA2, NUP85, ITGB3BP, SYCE2, TOPBP1, DMC1, SMC4, INCENP. RAD51, OIP5, CDK1, SPC25, CCNB1, BIRC5, NCAPG, ZWINT, MAD2L1, SKA3, NUF2, BUB1B, CENPA, SKA1, AURKB, NEK2, ESCO2, CENPW, HJURP, TTK, NDC80, CDCA5, BUB1, ZWILCH, CENPK, KIF2C, AURKA, DSCC1, CENPN, CDCA8, CENPM, PLK1, MCM6, ERCC6L, CDT1, HELLS, CHEK1, SPAG5, CENPH, PCNA, SPC24, CENPI, NUP37, FEN1, chromosomal 94 0 CENPL, BLM, KIF18A, CENPE, MCM4, BUB3, SUV39H2, MCM2, CDK2, PIF1, DNA2, region CENPO, CENPF, CHEK2, DSN1, H2AFX, MCM7, SUV39H1, MTBP, CBX3, RECQL4, KNTC1, PPP1CC, CENPP, CENPQ, PTGES3, NCAPD2, DYNLL1, SKA2, HAT1, NUP107, MCM5, MCM3, MSH2, BRCA2, NUP85, SSB, ITGB3BP, DMC1, INCENP, THOC3, XPO1, APEX1, XRCC5, KIF22, DCLRE1A, SEH1L, XRCC3, NSMCE2, RAD21.
    [Show full text]
  • Original Article Prognostic Value and Interrelated Expression Profile of GINS Complex Subunits in Lung Adenocarcinoma
    Int J Clin Exp Med 2020;13(10):8068-8085 www.ijcem.com /ISSN:1940-5901/IJCEM0117492 Original Article Prognostic value and interrelated expression profile of GINS complex subunits in lung adenocarcinoma Fangyan Zhong1*, Jianxiong Deng1*, Xiaotong Duan2, Junming Lai3, Xin Long4, Ruoyan Shen5, Yi Zhong6, Hong Lin1, Hui Luo1 1Department of Oncology, The First Affiliated Hospital of Nanchang University, Nanchang University, Nanchang 330006, Jiangxi, P. R. China; 2Department of Radiation Oncology, Nanfang Hospital, Southern Medical University, Guangzhou 510515, Guangdong, P. R. China; 3Department of Radiation Oncology, Yiwu Central Hospital, The Affiliated Yiwu Hospital of Wenzhou Medical University, Yiwu 322000, Zhejiang, P. R. China; 4Department of Radiation and Chemotherapy Oncology, Zhongnan Hospital, Wuhan University, Wuhan 430000, Hubei, P. R. China; 5Department of Second Institute of Clinical Medicine, Zhejiang Chinese Medical University, Hangzhou 310013, Zhejiang, P. R. China; 6Department of Acupuncture and Immune Rheumatism, First People’s Hospital of Tongxiang, Tongxiang 314500, Zhejiang, P. R. China. *Equal contributors. Received July 3, 2020; Accepted August 30, 2020; Epub October 15, 2020; Published October 30, 2020 Abstract: Lung adenocarcinoma (LUAD) is one of the common pathological types of lung cancer, it has a poor 5-year overall survival rate. The GINS complex was found to be a core component of replication helicases in eukaryotes, and its abnormal function may lead to the cause of human cancers and other diseases. However, few attempts have been made to find an association between GINS complex subunits (GINSs) and LUAD. Therefore, in this study, we will attempt to explore the prognostic value of GINSs in patients with LUAD and the interrelated expression profile.
    [Show full text]
  • Biallelic GINS2 Variant P.(Arg114leu) Causes Meier-Gorlin Syndrome With
    Genotype- phenotype correlations J Med Genet: first published as 10.1136/jmedgenet-2020-107572 on 5 August 2021. Downloaded from Short report Biallelic GINS2 variant p.(Arg114Leu) causes Meier- Gorlin syndrome with craniosynostosis Maria J Nabais Sá ,1,2 Kerry A Miller,3 Mary McQuaid,4 Nils Koelling,3 Andrew O M Wilkie ,3 Hugo Wurtele,4 Arjan P M de Brouwer,1 Jorge Oliveira 5,6 ► Additional supplemental ABSTRACT complex (Cdc45/MCM2–7/GINS) is a DNA heli- material is published online Introduction Replication of the nuclear genome is an case that separates the two strands of the DNA only. To view, please visit essential step for cell division. Pathogenic variants in double helix at replication origins, subsequently the journal online (http:// dx. 8 doi. org/ 10. 1136/ jmedgenet- genes coding for highly conserved components of the enabling their replication. 2020- 107572). DNA replication machinery cause Meier- Gorlin syndrome The genotype of individuals with MGORS (MGORS). requires at least one allele of genes encoding essen- For numbered affiliations see Objective Identification of novel genes associated with tial DNA replication factors allowing for some end of article. MGORS. residual activity (hypomorphic variant).2–6 Further- more, less severe phenotypes are often associated Correspondence to Methods Exome sequencing was performed to Dr Maria J Nabais Sá, investigate the genotype of an individual presenting with with two hypomorphic variants, while more severe Department of Human Genetics prenatal and postnatal growth restriction, a craniofacial phenotypes result from a combination of a hypo- (route 836), Radboud University gestalt of MGORS and coronal craniosynostosis. The morphic and a loss- of- function variant.1 Never- Medical Center, P.O.
    [Show full text]
  • Identification of Biomarkers in Colon Cancer Based on Bioinformatic Analysis
    4895 Original Article Identification of biomarkers in colon cancer based on bioinformatic analysis Ying Zhu1#^, Leitao Sun2#^, Jieru Yu3, Yuying Xiang1^, Minhe Shen2^, Harpreet S. Wasan4^, Shanming Ruan2^, Shengliang Qiu2^ 1The First Clinical Medical College of Zhejiang Chinese Medical University, Hangzhou, China; 2Department of Medical Oncology, The First Affiliated Hospital of Zhejiang Chinese Medical University, Hangzhou, China; 3College of Basic Medical Science, Zhejiang Chinese Medical University, Hangzhou, China; 4Department of Cancer Medicine, Hammersmith Hospital, Imperial College Healthcare NHS Trust, London, UK Contributions: (I) Conception and design: Y Zhu, L Sun; (II) Administrative support: M Shen, S Ruan, S Qiu; (III) Provision of study materials or patients: Z Ying, L Sun; (IV) Collection and assembly of data: J Yu, Y Xiang; (V) Data analysis and interpretation: Ying Zhu, Leitao Sun, Harpreet S. Wasan, S Qiu, S Ruan; (VI) Manuscript writing: All authors; (VII) Final approval of manuscript: All authors. #These authors contributed equally to this work. Correspondence to: Shengliang Qiu. Department of Medical Oncology, The First Affiliated Hospital of Zhejiang Chinese Medical University, 54 Youdian Road, Shangcheng, Hangzhou 310006, China. Email: [email protected]; Shanming Ruan. Department of Medical Oncology, The First Affiliated Hospital of Zhejiang Chinese Medical University, 54 Youdian Road, Shangcheng, Hangzhou 310006, China. Email: shanmingruan@ zcmu.edu.cn. Background: Colon cancer is one of the most common cancers in the world. Targeting biomarkers is helpful for the diagnosis and treatment of colon cancer. This study aimed to identify biomarkers in colon cancer, in addition to those that have already been reported, using microarray datasets and bioinformatics analysis.
    [Show full text]
  • Node and Edge Attributes
    Cytoscape User Manual Table of Contents Cytoscape User Manual ........................................................................................................ 3 Introduction ...................................................................................................................... 48 Development .............................................................................................................. 4 License ...................................................................................................................... 4 What’s New in 2.7 ....................................................................................................... 4 Please Cite Cytoscape! ................................................................................................. 7 Launching Cytoscape ........................................................................................................... 8 System requirements .................................................................................................... 8 Getting Started .......................................................................................................... 56 Quick Tour of Cytoscape ..................................................................................................... 12 The Menus ............................................................................................................... 15 Network Management ................................................................................................. 18 The
    [Show full text]
  • Distal Cpg Islands Can Serve As Alternative Promoters to Transcribe Genes with Silenced Proximal Promoters
    Downloaded from genome.cshlp.org on September 25, 2021 - Published by Cold Spring Harbor Laboratory Press Research Distal CpG islands can serve as alternative promoters to transcribe genes with silenced proximal promoters Shrutii Sarda,1 Avinash Das,1 Charles Vinson,2 and Sridhar Hannenhalli1 1Center for Bioinformatics and Computational Biology, University of Maryland, College Park, Maryland 20742, USA; 2Center for Cancer Research, National Cancer Institute, NIH, Bethesda, Maryland 20892, USA DNA methylation at the promoter of a gene is presumed to render it silent, yet a sizable fraction of genes with methylated proximal promoters exhibit elevated expression. Here, we show, through extensive analysis of the methylome and transcrip- tome in 34 tissues, that in many such cases, transcription is initiated by a distal upstream CpG island (CGI) located several kilobases away that functions as an alternative promoter. Specifically, such genes are expressed precisely when the neigh- boring CGI is unmethylated but remain silenced otherwise. Based on CAGE and Pol II localization data, we found strong evidence of transcription initiation at the upstream CGI and a lack thereof at the methylated proximal promoter itself. Consistent with their alternative promoter activity, CGI-initiated transcripts are associated with signals of stable elongation and splicing that extend into the gene body, as evidenced by tissue-specific RNA-seq and other DNA-encoded splice signals. Furthermore, based on both inter- and intra-species analyses, such CGIs were found to be under greater purifying selection relative to CGIs upstream of silenced genes. Overall, our study describes a hitherto unreported conserved mechanism of transcription of genes with methylated proximal promoters in a tissue-specific fashion.
    [Show full text]
  • A High-Throughput Approach to Uncover Novel Roles of APOBEC2, a Functional Orphan of the AID/APOBEC Family
    Rockefeller University Digital Commons @ RU Student Theses and Dissertations 2018 A High-Throughput Approach to Uncover Novel Roles of APOBEC2, a Functional Orphan of the AID/APOBEC Family Linda Molla Follow this and additional works at: https://digitalcommons.rockefeller.edu/ student_theses_and_dissertations Part of the Life Sciences Commons A HIGH-THROUGHPUT APPROACH TO UNCOVER NOVEL ROLES OF APOBEC2, A FUNCTIONAL ORPHAN OF THE AID/APOBEC FAMILY A Thesis Presented to the Faculty of The Rockefeller University in Partial Fulfillment of the Requirements for the degree of Doctor of Philosophy by Linda Molla June 2018 © Copyright by Linda Molla 2018 A HIGH-THROUGHPUT APPROACH TO UNCOVER NOVEL ROLES OF APOBEC2, A FUNCTIONAL ORPHAN OF THE AID/APOBEC FAMILY Linda Molla, Ph.D. The Rockefeller University 2018 APOBEC2 is a member of the AID/APOBEC cytidine deaminase family of proteins. Unlike most of AID/APOBEC, however, APOBEC2’s function remains elusive. Previous research has implicated APOBEC2 in diverse organisms and cellular processes such as muscle biology (in Mus musculus), regeneration (in Danio rerio), and development (in Xenopus laevis). APOBEC2 has also been implicated in cancer. However the enzymatic activity, substrate or physiological target(s) of APOBEC2 are unknown. For this thesis, I have combined Next Generation Sequencing (NGS) techniques with state-of-the-art molecular biology to determine the physiological targets of APOBEC2. Using a cell culture muscle differentiation system, and RNA sequencing (RNA-Seq) by polyA capture, I demonstrated that unlike the AID/APOBEC family member APOBEC1, APOBEC2 is not an RNA editor. Using the same system combined with enhanced Reduced Representation Bisulfite Sequencing (eRRBS) analyses I showed that, unlike the AID/APOBEC family member AID, APOBEC2 does not act as a 5-methyl-C deaminase.
    [Show full text]
  • Bioinformatics Analysis of Differentially Expressed Genes and Pathways in the Development of Cervical Cancer Baojie Wu* and Shuyi Xi
    Wu and Xi BMC Cancer (2021) 21:733 https://doi.org/10.1186/s12885-021-08412-4 RESEARCH Open Access Bioinformatics analysis of differentially expressed genes and pathways in the development of cervical cancer Baojie Wu* and Shuyi Xi Abstract Background: This study aimed to explore and identify key genes and signaling pathways that contribute to the progression of cervical cancer to improve prognosis. Methods: Three gene expression profiles (GSE63514, GSE64217 and GSE138080) were screened and downloaded from the Gene Expression Omnibus database (GEO). Differentially expressed genes (DEGs) were screened using the GEO2R and Venn diagram tools. Then, Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses were performed. Gene set enrichment analysis (GSEA) was performed to analyze the three gene expression profiles. Moreover, a protein–protein interaction (PPI) network of the DEGs was constructed, and functional enrichment analysis was performed. On this basis, hub genes from critical PPI subnetworks were explored with Cytoscape software. The expression of these genes in tumors was verified, and survival analysis of potential prognostic genes from critical subnetworks was conducted. Functional annotation, multiple gene comparison and dimensionality reduction in candidate genes indicated the clinical significance of potential targets. Results: A total of 476 DEGs were screened: 253 upregulated genes and 223 downregulated genes. DEGs were enriched in 22 biological processes, 16 cellular components and 9 molecular functions in precancerous lesions and cervical cancer. DEGs were mainly enriched in 10 KEGG pathways. Through intersection analysis and data mining, 3 key KEGG pathways and related core genes were revealed by GSEA.
    [Show full text]
  • Microarray Bioinformatics and Its Applications to Clinical Research
    Microarray Bioinformatics and Its Applications to Clinical Research A dissertation presented to the School of Electrical and Information Engineering of the University of Sydney in fulfillment of the requirements for the degree of Doctor of Philosophy i JLI ··_L - -> ...·. ...,. by Ilene Y. Chen Acknowledgment This thesis owes its existence to the mercy, support and inspiration of many people. In the first place, having suffering from adult-onset asthma, interstitial cystitis and cold agglutinin disease, I would like to express my deepest sense of appreciation and gratitude to Professors Hong Yan and David Levy for harbouring me these last three years and providing me a place at the University of Sydney to pursue a very meaningful course of research. I am also indebted to Dr. Craig Jin, who has been a source of enthusiasm and encouragement on my research over many years. In the second place, for contexts concerning biological and medical aspects covered in this thesis, I am very indebted to Dr. Ling-Hong Tseng, Dr. Shian-Sehn Shie, Dr. Wen-Hung Chung and Professor Chyi-Long Lee at Change Gung Memorial Hospital and University of Chang Gung School of Medicine (Taoyuan, Taiwan) as well as Professor Keith Lloyd at University of Alabama School of Medicine (AL, USA). All of them have contributed substantially to this work. In the third place, I would like to thank Mrs. Inge Rogers and Mr. William Ballinger for their helpful comments and suggestions for the writing of my papers and thesis. In the fourth place, I would like to thank my swim coach, Hirota Homma.
    [Show full text]
  • Novel Gene Signatures for Stage Classification of the Squamous Cell
    www.nature.com/scientificreports OPEN Novel gene signatures for stage classifcation of the squamous cell carcinoma of the lung Angel Juarez‑Flores, Gabriel S. Zamudio & Marco V. José* The squamous cell carcinoma of the lung (SCLC) is one of the most common types of lung cancer. As GLOBOCAN reported in 2018, lung cancer was the frst cause of death and new cases by cancer worldwide. Typically, diagnosis is made in the later stages of the disease with few treatment options available. The goal of this work was to fnd some key components underlying each stage of the disease, to help in the classifcation of tumor samples, and to increase the available options for experimental assays and molecular targets that could be used in treatment development. We employed two approaches. The frst was based in the classic method of diferential gene expression analysis, network analysis, and a novel concept known as network gatekeepers. The second approach was using machine learning algorithms. From our combined approach, we identifed two sets of genes that could function as a signature to identify each stage of the cancer pathology. We also arrived at a network of 55 nodes, which according to their biological functions, they can be regarded as drivers in this cancer. Although biological experiments are necessary for their validation, we proposed that all these genes could be used for cancer development treatments. As GLOBOCAN reported in 2018, lung cancer was the frst cause of deaths and new cases by cancer worldwide 1. Squamous cell carcinoma of the lung (SCC) is one type of lung cancer which comprises approximately 30% of all lung cancer cases.
    [Show full text]
  • Tissue-Specific Requirement for the GINS Complex During Zebrafish
    fcell-08-00373 May 26, 2020 Time: 17:57 # 1 ORIGINAL RESEARCH published: 28 May 2020 doi: 10.3389/fcell.2020.00373 Tissue-Specific Requirement for the GINS Complex During Zebrafish Development Máté Varga1,2*, Kitti Csályi1, István Bertyák1, Dóra K. Menyhárd3, Richard J. Poole2, Kara L. Cerveny4, Dorottya Kövesdi5,6, Balázs Barátki6, Hannah Rouse2, Zsuzsa Vad1, Thomas A. Hawkins2, Heather L. Stickney2, Florencia Cavodeassi2,7, Quenten Schwarz8, Rodrigo M. Young2† and Stephen W. Wilson2* Edited by: 1 Department of Genetics, ELTE Eötvös Loránd University, Budapest, Hungary, 2 Department of Cell and Developmental Brian C. Schaefer, Biology, Division of Biosciences, University College London, London, United Kingdom, 3 HAS-ELTE Protein Modeling Uniformed Services University of the Research Group and Laboratory of Structural Chemistry and Biology, ELTE Eötvös Loránd University, Budapest, Hungary, Health Sciences, United States 4 Biology Department, Reed College, Portland, OR, United States, 5 Office of Supported Research Groups of the Hungarian 6 Reviewed by: Academy of Sciences, Budapest, Hungary, Department of Immunology, ELTE Eötvös Loránd University, Budapest, 7 Robert M. Brosh, Hungary, Institute of Medical and Biomedical Education, St. George’s University of London, London, United Kingdom, 8 National Institute on Aging, National Centre for Cancer Biology, University of South Australia and SA Pathology, Adelaide, SA, Australia Institutes of Health (NIH), United States Efficient and accurate DNA replication is particularly critical in stem and progenitor Masato Kanemaki, National Institute of Genetics, Japan cells for successful proliferation and survival. The replisome, an amalgam of protein *Correspondence: complexes, is responsible for binding potential origins of replication, unwinding the Máté Varga double helix, and then synthesizing complimentary strands of DNA.
    [Show full text]