Integrated Data Analysis Reveals Potential Drivers and Pathways

Total Page:16

File Type:pdf, Size:1020Kb

Load more

Beltrami et al. Clinical Epigenetics (2017) 9:45 DOI 10.1186/s13148-017-0346-2 RESEARCH Open Access Integrated data analysis reveals potential drivers and pathways disrupted by DNA methylation in papillary thyroid carcinomas Caroline Moraes Beltrami1†, Mariana Bisarro dos Reis1,2†, Mateus Camargo Barros-Filho1, Fabio Albuquerque Marchi1, Hellen Kuasne1,2, Clóvis Antônio Lopes Pinto3, Srikant Ambatipudi4, Zdenko Herceg4, Luiz Paulo Kowalski1,5 and Silvia Regina Rogatto2,6* Abstract Background: Papillary thyroid carcinoma (PTC) is a common endocrine neoplasm with a recent increase in incidence in many countries. Although PTC has been explored by gene expression and DNA methylation studies, the regulatory mechanisms of the methylation on the gene expression was poorly clarified. In this study, DNA methylation profile (Illumina HumanMethylation 450K) of 41 PTC paired with non-neoplastic adjacent tissues (NT) was carried out to identify and contribute to the elucidation of the role of novel genic and intergenic regions beyond those described in the promoter and CpG islands (CGI). An integrative and cross-validation analysis were performed aiming to identify molecular drivers and pathways that are PTC-related. Results: The comparisons between PTC and NT revealed 4995 methylated probes (88% hypomethylated in PTC) and 1446 differentially expressed transcripts cross-validated by the The Cancer Genome Atlas data. The majority of these probes was found in non-promoters regions, distant from CGI and enriched by enhancers. The integrative analysis between gene expression and DNA methylation revealed 185 and 38 genes (mainly in the promoter and body regions, respectively) with negative and positive correlation, respectively. Genes showing negative correlation underlined FGF and retinoic acid signaling as critical canonical pathways disrupted by DNA methylation in PTC. BRAF mutation was detected in 68% (28 of 41) of the tumors, which presented a higher level of demethylation (95% hypomethylated probes) compared with BRAF wild-type tumors. A similar integrative analysis uncovered 40 of 254 differentially expressed genes, which are potentially regulated by DNA methylation in BRAFV600E-positive tumors. The methylation and expression pattern of six selected genes (ERBB3, FGF1, FGFR2, GABRB2, HMGA2, and RDH5) were confirmed as altered by pyrosequencing and RT-qPCR. Conclusions: DNA methylation loss in non-promoter, poor CGI and enhancer-enriched regions was a significant event in PTC, especially in tumors harboring BRAFV600E. In addition to the promoter region, gene body and 3’UTR methylation have also the potential to influence the gene expression levels (both, repressing and inducing). The integrative analysis revealed genes potentially regulated by DNA methylation pointing out potential drivers and biomarkers related to PTC development. Keywords: Papillary thyroid cancer, DNA methylation, Integrative analysis, FGF signaling pathway, Retinoic acid pathway, BRAFV600E mutation * Correspondence: [email protected]; [email protected] †Equal contributors 2Department of Urology, Faculty of Medicine, UNESP, Sao Paulo State University, Botucatu, São Paulo, Brazil 6Department of Clinical Genetics, Vejle Hospital and Institute of Regional Health Research, University of Southern Denmark, Kabbeltoft 25, Vejle 7100, Denmark Full list of author information is available at the end of the article © The Author(s). 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. Beltrami et al. Clinical Epigenetics (2017) 9:45 Page 2 of 11 Background retrospectively. Forty one papillary thyroid carcinomas of Thyroid cancer is the most common tumor of the head patients treated with total thyroidectomy followed by and neck region, with the highest incidence among the radioiodine therapy and matched non-neoplastic adjacent endocrine neoplasias [1]. Papillary thyroid cancer (PTC) is tissues (NT) samples were included in this study. Cases the histological subtype with higher incidence (80% of with incomplete clinical data in medical records or diag- cases) worldwide [2]. nosed with previous or synchronic malignancies were The thyroid carcinogenesis involves a constitutive excluded (except basal skin cell carcinoma). Clinical and activation of two major pathways associated to tyrosine- histopathological data are summarized in Table 1. kinase, including mitogen-activated protein kinase (MAPK) and phosphatidylinositol 3-kinase (PI3K) [3]. The activation Nucleic acids extraction and analysis of somatic of these pathways occurs mainly due to point mutations in mutations BRAF and RAS and chromosomal rearrangements in RET Genomic DNA extraction was performed according to [3]. MAPK signaling pathway activated by genetic alter- conventional protocol using enzymatic degradation with ations is frequently involved in cell proliferation, growth, proteinase K followed by purification with organic solvents and survival [4]. (phenol/chloroform). RNA was isolated as previously Approximately 60% of PTC cases are characterized by reported [26]. T1799A BRAF transversion nucleotide change (over 95% Somatic point mutations of BRAF (codon 599), KRAS of the mutations), resulting in V600E mutant protein (codon 12/13), HRAS (codon 61), and NRAS (codon 61) with constitutive activation of BRAF kinase [5–7]. BRAF were evaluated by pyrosequencing using a Pyromark Q96 mutation has been associated with unfavorable prognosis ID system (Qiagen, Valencia, CA, USA). RET rearrange- including large primary tumors, lymph node and vascu- ments (RET/PTC1 and RET/PTC3) were detected by lar invasion, advanced stage, extrathyroidal extension, RT-qPCR on a 7500 Real Time PCR System (Applied distant metastases, and recurrence [8, 9]. However, there Biosystems, Foster, CA, USA) (detailed in Additional file 1). are no consensus in literature, since many studies have not found this association [10–12]. The methylation pattern of several genes has been DNA methylation and gene expression profiling assessed in PTC, and most of them plays a role in thyroid Five hundred nanograms of DNA (Qubit® dsDNA BR gland function (TSHR) [13] and iodine metabolism (NIS Assay no Qubit® 2.0 Fluorometer (Life Technologies, and SLC26A4) [14] or acts as a tumor suppressor gene Carlsbad, CA, USA) were bisulfite-modified using (RASSF1A, TIMP3,andRARβ2) [15, 16]. In addition, an EZ-DNA Methylation-Gold Kit (Zymo Research, Irvine, association between BRAFV600E mutation and aberrant CA, USA) according to the manufacturer’s recommen- DNA methylation profile has been reported in thyroid dations. Converted DNA was used for the genome-wide cancer [17–21]. methylation assays (Infinium Human Methylation450 Recently, large-scale approaches were used to investigate BeadChip array-Illumina, San Diego, CA, USA). Arrays the methylation profile of thyroid cancer [17–23]. These were scanned by HiScan system (Illumina), and methylation technologies allow the assessment of not only CpG islands data were analyzed as β values. Genome-wide DNA methy- (CGI) and promoter regions, as previously reported, but lation data processing was done as reported previously [27] unveiled novel regions involved in neoplastic process, (detailed in Additional file 1). Limma package [28] was used such as CGI shores/shelf, non-CGI promoter and to identify significant probes adopting adjusted (Bonferroni) enhancers [24, 25]. However, the methylation as a regula- p value <0.05 and mean delta β value (Δβ)of0.15asa tory mechanism of gene expression is poorly explored in threshold for differential DNA methylation. A hypergeo- PTC, even in the multiplatform robust study performed metric test (p < 0.05) was performed from phyper function by The Cancer Genome Atlas (TCGA) [21]. of STATS package in R language to compare differentially To our knowledge, the current study is the first to assess methylated probes in relation to genomic regions (Illumina a substantial matched-sample subset with methylation 450K array annotation). and expression data addressing the available data from Gene expression data were obtained from our previously TCGA study to cross-validate the results. The genes reported study (GEO accession number GSE50901) [26]. signature potentially regulated by methylation inferred the The unsupervised hierarchical clustering analysis was role of this epigenetic event in PTC development. performed using the most variable probes (interquartile range >0.2 to methylation and >1.0 to gene expression). Methods Euclidean distance with average linkage method was used Sample population in all clustering analysis by BRB array tool software Snap frozen PTC samples stored at tissue biobank of the (https://brb.nci.nih.gov/BRB-ArrayTools/download.html). A.C. Camargo Cancer Center, SP, Brazil, were obtained Student t test was assessed to verify the association Beltrami et al. Clinical Epigenetics (2017) 9:45 Page 3 of 11 Table 1 Clinicopathological features of 41 patients diagnosed Table 1 Clinicopathological features of 41 patients diagnosed with papillary
Recommended publications
  • Metagenomics Approaches for the Detection and Surveillance of Emerging and Recurrent Plant Pathogens

    Metagenomics Approaches for the Detection and Surveillance of Emerging and Recurrent Plant Pathogens

    microorganisms Review Metagenomics Approaches for the Detection and Surveillance of Emerging and Recurrent Plant Pathogens Edoardo Piombo 1,2 , Ahmed Abdelfattah 3,4 , Samir Droby 5, Michael Wisniewski 6,7, Davide Spadaro 1,8,* and Leonardo Schena 9 1 Department of Agricultural, Forest and Food Sciences (DISAFA), University of Torino, 10095 Grugliasco, Italy; [email protected] 2 Department of Forest Mycology and Plant Pathology, Uppsala Biocenter, Swedish University of Agricultural Sciences, P.O. Box 7026, 75007 Uppsala, Sweden 3 Institute of Environmental Biotechnology, Graz University of Technology, Petersgasse 12, 8010 Graz, Austria; [email protected] 4 Department of Ecology, Environment and Plant Sciences, University of Stockholm, Svante Arrhenius väg 20A, 11418 Stockholm, Sweden 5 Department of Postharvest Science, Agricultural Research Organization (ARO), The Volcani Center, Rishon LeZion 7505101, Israel; [email protected] 6 U.S. Department of Agriculture—Agricultural Research Service (USDA-ARS), Kearneysville, WV 25430, USA; [email protected] 7 Department of Biological Sciences, Virginia Technical University, Blacksburg, VA 24061, USA 8 AGROINNOVA—Centre of Competence for the Innovation in the Agroenvironmental Sector, University of Torino, 10095 Grugliasco, Italy 9 Department of Agriculture, Università Mediterranea, 89122 Reggio Calabria, Italy; [email protected] * Correspondence: [email protected]; Tel.: +39-0116708942 Abstract: Globalization has a dramatic effect on the trade and movement of seeds, fruits and vegeta- bles, with a corresponding increase in economic losses caused by the introduction of transboundary Citation: Piombo, E.; Abdelfattah, A.; plant pathogens. Current diagnostic techniques provide a useful and precise tool to enact surveillance Droby, S.; Wisniewski, M.; Spadaro, protocols regarding specific organisms, but this approach is strictly targeted, while metabarcoding D.; Schena, L.
  • The Tumor Suppressor Notch Inhibits Head and Neck Squamous Cell

    The Tumor Suppressor Notch Inhibits Head and Neck Squamous Cell

    The Texas Medical Center Library DigitalCommons@TMC The University of Texas MD Anderson Cancer Center UTHealth Graduate School of The University of Texas MD Anderson Cancer Biomedical Sciences Dissertations and Theses Center UTHealth Graduate School of (Open Access) Biomedical Sciences 12-2015 THE TUMOR SUPPRESSOR NOTCH INHIBITS HEAD AND NECK SQUAMOUS CELL CARCINOMA (HNSCC) TUMOR GROWTH AND PROGRESSION BY MODULATING PROTO-ONCOGENES AXL AND CTNNAL1 (α-CATULIN) Shhyam Moorthy Shhyam Moorthy Follow this and additional works at: https://digitalcommons.library.tmc.edu/utgsbs_dissertations Part of the Biochemistry, Biophysics, and Structural Biology Commons, Cancer Biology Commons, Cell Biology Commons, and the Medicine and Health Sciences Commons Recommended Citation Moorthy, Shhyam and Moorthy, Shhyam, "THE TUMOR SUPPRESSOR NOTCH INHIBITS HEAD AND NECK SQUAMOUS CELL CARCINOMA (HNSCC) TUMOR GROWTH AND PROGRESSION BY MODULATING PROTO-ONCOGENES AXL AND CTNNAL1 (α-CATULIN)" (2015). The University of Texas MD Anderson Cancer Center UTHealth Graduate School of Biomedical Sciences Dissertations and Theses (Open Access). 638. https://digitalcommons.library.tmc.edu/utgsbs_dissertations/638 This Dissertation (PhD) is brought to you for free and open access by the The University of Texas MD Anderson Cancer Center UTHealth Graduate School of Biomedical Sciences at DigitalCommons@TMC. It has been accepted for inclusion in The University of Texas MD Anderson Cancer Center UTHealth Graduate School of Biomedical Sciences Dissertations and Theses (Open Access) by an authorized administrator of DigitalCommons@TMC. For more information, please contact [email protected]. THE TUMOR SUPPRESSOR NOTCH INHIBITS HEAD AND NECK SQUAMOUS CELL CARCINOMA (HNSCC) TUMOR GROWTH AND PROGRESSION BY MODULATING PROTO-ONCOGENES AXL AND CTNNAL1 (α-CATULIN) by Shhyam Moorthy, B.S.
  • Histone-Related Genes Are Hypermethylated in Lung Cancer

    Histone-Related Genes Are Hypermethylated in Lung Cancer

    Published OnlineFirst October 1, 2019; DOI: 10.1158/0008-5472.CAN-19-1019 Cancer Genome and Epigenome Research Histone-Related Genes Are Hypermethylated in Lung Cancer and Hypermethylated HIST1H4F Could Serve as a Pan-Cancer Biomarker Shihua Dong1,Wei Li1, Lin Wang2, Jie Hu3,Yuanlin Song3, Baolong Zhang1, Xiaoguang Ren1, Shimeng Ji3, Jin Li1, Peng Xu1, Ying Liang1, Gang Chen4, Jia-Tao Lou2, and Wenqiang Yu1 Abstract Lung cancer is the leading cause of cancer-related deaths lated in all 17 tumor types from TCGA datasets (n ¼ 7,344), worldwide. Cytologic examination is the current "gold stan- which was further validated in nine different types of cancer dard" for lung cancer diagnosis, however, this has low sensi- (n ¼ 243). These results demonstrate that HIST1H4F can tivity. Here, we identified a typical methylation signature of function as a universal-cancer-only methylation (UCOM) histone genes in lung cancer by whole-genome DNA methyl- marker, which may aid in understanding general tumorigen- ation analysis, which was validated by The Cancer Genome esis and improve screening for early cancer diagnosis. Atlas (TCGA) lung cancer cohort (n ¼ 907) and was further confirmed in 265 bronchoalveolar lavage fluid samples with Significance: These findings identify a new biomarker for specificity and sensitivity of 96.7% and 87.0%, respectively. cancer detection and show that hypermethylation of histone- More importantly, HIST1H4F was universally hypermethy- related genes seems to persist across cancers. Introduction to its low specificity, LDCT is far from satisfactory as a screening tool for clinical application, similar to other currently used cancer Lung cancer is one of the most common malignant tumors and biomarkers, such as carcinoembryonic antigen (CEA), neuron- the leading cause of cancer-related deaths worldwide (1, 2).
  • Histone H3.1 (Human) Cell-Based ELISA Kit

    Histone H3.1 (Human) Cell-Based ELISA Kit

    Histone H3.1 (Human) Cell-Based ELISA Kit Catalog # : KA2761 規格 : [ 1 Kit ] List All Specification Application Image Product Histone H3.1 (Human) Cell-Based ELISA Kit is an indirect enzyme-linked Qualitative Description: immunoassay for qualitative determination of Histone H3 expression in cultured cells. Reactivity: Human, Mouse, Rat Storage Store the kit at 4°C. Instruction: Protocol: Protocol Download Suitable Attached Cell, Loosely Attached Cell, Suspension Cell Sample: Label: HRP-conjugated Detection Colorimetric Method: Regulation For research use only (RUO) Status: Datasheet: Download Applications Qualitative HIST1H3A HIST1H3D HIST1H3C HIST1H3E HIST1H3I HIST1H3G HIST1H3J HIST1H3H HIST1H3B HIST1H3F Gene Information Entrez GeneID: 8350 Protein P68431 Accession#: Gene Name: HIST1H3A Gene Alias: H3/A,H3FA Gene histone cluster 1, H3a Description: Omim ID: 602810 Gene Ontology: Hyperlink Gene Summary: Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. This structure consists of approximately 146 bp of DNA wrapped around a Page 1 of 6 2021/6/18 nucleosome, an octamer composed of pairs of each of the four core histones (H2A, H2B, H3, and H4). The chromatin fiber is further compacted through the interaction of a linker histone, H1, with the DNA between the nucleosomes to form higher order chromatin structures. This gene is intronless and encodes a member of the histone H3 family. Transcripts from this gene lack polyA tails; instead, they contain a palindromic termination element. This gene is found in the large histone gene cluster on chromosome 6p22-p21.3. [provided by RefSeq Other H3 histone family, member A,histone 1, H3a Designations: Gene Information Entrez GeneID: 8351 Protein P68431 Accession#: Gene Name: HIST1H3D Gene Alias: H3/b,H3FB Gene histone cluster 1, H3d Description: Omim ID: 602811 Gene Ontology: Hyperlink Gene Summary: Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes.
  • High-Throughput Pyrosequencing (To Use GS 20; Whole Genome Sequencer)

    High-Throughput Pyrosequencing (To Use GS 20; Whole Genome Sequencer)

    2007 International Meeting of the Microbiological Society of Korea >>> W1-1 High-throughput Pyrosequencing (to use GS 20; whole genome sequencer) Youseak Go Macrogen Corporation Genomic Analysis Division The Genome sequencer FLX system, developed by 454 Life Science Corporation, is an ultra-high-throughput automatic DNA sequencing system capable carrying out and monitoring sequencing reactions in a massively parallel fashion. Hundreds of thousands of simultaneous reactions, in the “well” of a PicoTiterPlate device. The basic chemistry utilize the release of pyrophosphate (PPi) that occurs with each nucleotide addition during DNA-directed DNA synthesis to generate an amount of light commensurate with the amount of PPi released; this light is captured by a charge-coupled device camera and converted into a digital signal. The raw data resulting from a sequencing run consists of a series of digital images captured by the camera, where the images are a representation of the surface of the PicoTiterPlate device over which the sequencing reaction are taking place; and each image corresponding to one reagent flow over that surface, as defined by the Run script. If the sample DNA fragment present in a given PicoTiterPlate well is extended during a nucleotide flow, light is emitted from the well and captured on the image corresponding to that flow. Furthermore, the amount of light emitted is proportional to the number of nucleotide extended. Knowledge of the nucleotide flowed while each image is being captured, of location on the PicoTiterPlate device where light is being emitted during each flow allows the software to identify PicoTiterPlate wells that contain a DNA library fragment and determine the sequence of the DNA fragments present in each well.
  • Identification of Genetic Factors Underpinning Phenotypic Heterogeneity in Huntington’S Disease and Other Neurodegenerative Disorders

    Identification of Genetic Factors Underpinning Phenotypic Heterogeneity in Huntington’S Disease and Other Neurodegenerative Disorders

    Identification of genetic factors underpinning phenotypic heterogeneity in Huntington’s disease and other neurodegenerative disorders. By Dr Davina J Hensman Moss A thesis submitted to University College London for the degree of Doctor of Philosophy Department of Neurodegenerative Disease Institute of Neurology University College London (UCL) 2020 1 I, Davina Hensman Moss confirm that the work presented in this thesis is my own. Where information has been derived from other sources, I confirm that this has been indicated in the thesis. Collaborative work is also indicated in this thesis. Signature: Date: 2 Abstract Neurodegenerative diseases including Huntington’s disease (HD), the spinocerebellar ataxias and C9orf72 associated Amyotrophic Lateral Sclerosis / Frontotemporal dementia (ALS/FTD) do not present and progress in the same way in all patients. Instead there is phenotypic variability in age at onset, progression and symptoms. Understanding this variability is not only clinically valuable, but identification of the genetic factors underpinning this variability has the potential to highlight genes and pathways which may be amenable to therapeutic manipulation, hence help find drugs for these devastating and currently incurable diseases. Identification of genetic modifiers of neurodegenerative diseases is the overarching aim of this thesis. To identify genetic variants which modify disease progression it is first necessary to have a detailed characterization of the disease and its trajectory over time. In this thesis clinical data from the TRACK-HD studies, for which I collected data as a clinical fellow, was used to study disease progression over time in HD, and give subjects a progression score for subsequent analysis. In this thesis I show blood transcriptomic signatures of HD status and stage which parallel HD brain and overlap with Alzheimer’s disease brain.
  • A Multiprotein Occupancy Map of the Mrnp on the 3 End of Histone

    A Multiprotein Occupancy Map of the Mrnp on the 3 End of Histone

    Downloaded from rnajournal.cshlp.org on October 6, 2021 - Published by Cold Spring Harbor Laboratory Press A multiprotein occupancy map of the mRNP on the 3′ end of histone mRNAs LIONEL BROOKS III,1 SHAWN M. LYONS,2 J. MATTHEW MAHONEY,1 JOSHUA D. WELCH,3 ZHONGLE LIU,1 WILLIAM F. MARZLUFF,2 and MICHAEL L. WHITFIELD1 1Department of Genetics, Dartmouth Geisel School of Medicine, Hanover, New Hampshire 03755, USA 2Integrative Program for Biological and Genome Sciences, University of North Carolina, Chapel Hill, North Carolina 27599, USA 3Department of Computer Science, University of North Carolina, Chapel Hill, North Carolina 27599, USA ABSTRACT The animal replication-dependent (RD) histone mRNAs are coordinately regulated with chromosome replication. The RD-histone mRNAs are the only known cellular mRNAs that are not polyadenylated. Instead, the mature transcripts end in a conserved stem– loop (SL) structure. This SL structure interacts with the stem–loop binding protein (SLBP), which is involved in all aspects of RD- histone mRNA metabolism. We used several genomic methods, including high-throughput sequencing of cross-linked immunoprecipitate (HITS-CLIP) to analyze the RNA-binding landscape of SLBP. SLBP was not bound to any RNAs other than histone mRNAs. We performed bioinformatic analyses of the HITS-CLIP data that included (i) clustering genes by sequencing read coverage using CVCA, (ii) mapping the bound RNA fragment termini, and (iii) mapping cross-linking induced mutation sites (CIMS) using CLIP-PyL software. These analyses allowed us to identify specific sites of molecular contact between SLBP and its RD-histone mRNA ligands. We performed in vitro crosslinking assays to refine the CIMS mapping and found that uracils one and three in the loop of the histone mRNA SL preferentially crosslink to SLBP, whereas uracil two in the loop preferentially crosslinks to a separate component, likely the 3′hExo.
  • Research Article Common Expression Quantitative Trait Loci Shared by Histone Genes

    Research Article Common Expression Quantitative Trait Loci Shared by Histone Genes

    Hindawi International Journal of Genomics Volume 2017, Article ID 6202567, 14 pages https://doi.org/10.1155/2017/6202567 Research Article Common Expression Quantitative Trait Loci Shared by Histone Genes Hanseol Kim, Yujin Suh, and Chaeyoung Lee Department of Bioinformatics and Life Science, Soongsil University, Seoul, Republic of Korea Correspondence should be addressed to Chaeyoung Lee; [email protected] Received 29 May 2017; Revised 26 July 2017; Accepted 2 August 2017; Published 27 August 2017 Academic Editor: Marco Gerdol Copyright © 2017 Hanseol Kim et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. A genome-wide association study (GWAS) was conducted to examine expression quantitative trait loci (eQTLs) for histone genes. We examined common eQTLs for multiple histone genes in 373 European lymphoblastoid cell lines (LCLs). A linear regression model was employed to identify single-nucleotide polymorphisms (SNPs) associated with expression of the histone genes, and the number of eQTLs was determined by linkage disequilibrium analysis. Additional associations of the identified eQTLs with other genes were also examined. We identified 31 eQTLs for 29 histone genes through genome-wide analysis using 29 histone genes (P <297 × 10−10). Among them, 12 eQTLs were associated with the expression of multiple histone genes. Transcriptome- wide association analysis using the identified eQTLs showed their associations with additional 80 genes (P <475 × 10−6). In particular, expression of RPPH1, SCARNA2, and SCARNA7 genes was associated with 26, 25, and 23 eQTLs, respectively.
  • Pyrosequencing Technology for Short DNA Sequencing and Whole

    Pyrosequencing Technology for Short DNA Sequencing and Whole

    生物物理 47(2),129-132(2007) 理論/実験 技術 Pyrosequencing Technology for Short DNA Sequencing and Whole Genome Sequencing Baback Gharizadeh1, Mehran Ghaderi2 and Pål Nyrén3 1Stanford Genome Technology Center, Stanford University 2Clinical Pathology/Cytology, Karolinska University Hospital 3Department of Biotechnology, Royal Institute of Technology 1.Introduction 2.Pyrosequencing chemistry Recent impressive advances in DNA sequencing technolo- Pyrosequencing technique is based on sequencing-by-syn- gies have accelerated the detailed analysis of genomes from thesis principle8), 9) and employs a series of four enzymes to ac- many organisms. We have been observing reports of complete curately detect short nucleic acid sequences during DNA syn- or draft versions of the genome sequence of several well-stud- thesis. In Pyrosequencing10), the sequencing primer is ied, multicellular organisms. Human biology and medicine hybridized to a single-stranded DNA biotin-labeled template are in the midst of a revolution by Human Genome Project1) (post-PCR alkali treated) and mixed with the enzymes; DNA as the main catalyst. polymerase, ATP sulfurylase, luciferase and apyrase, and the The chain termination sequencing method, also known as substrates adenosine 5′ phosphosulfate (APS) and luciferin. Sanger sequencing, developed by Frederick Sanger and col- Fig. 1 illustrates schematically the chemistry of Pyrose- leagues2), has been the most widely used sequencing method quencing method. Cycles of four deoxynucleotide triphos- since its advent in 1977 and still is in use after more than 29 phates (dNTPs) are separately added to the reaction mixture years since its advent. The remarkable advances in chemistry iteratively. After each nucleotide dispensation, DNA poly- and automation to the Sanger sequencing method has made it merase catalyzes the incorporation of complementary dNTP to a simple and elegant technique, central to almost all past into the template strand.
  • Leveraging Omics Data to Expand the Value and Understanding of Alternative Splicing

    Leveraging Omics Data to Expand the Value and Understanding of Alternative Splicing

    Leveraging Omics Data to Expand the Value and Understanding of Alternative Splicing By ______________________ Nathan T. Johnson A Dissertation Submitted to the Faculty of WORCESTER POLYTECHNIC INSTITUTE In Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy In Bioinformatics & Computational Biology APPROVED: __________________________ __________________________ Dmitry Korkin, Ph.D. Amity L. Manning, Ph.D. Advisor Committee Member Program Director __________________________ __________________________ Zheyang Wu, Ph.D. Scarlet Shell, Ph.D. Committee Member Committee Member __________________________ Ben Raphael, Ph.D. External Committee Member i “Science, my boy, is made up of mistakes, but they are mistakes which it is useful to make, because they lead little by little to the truth.” Jules Verne, Journey to the Center of the Earth ii ABSTRACT Utilizing ‘omics’ data of diverse types such as genomics, proteomics, transcriptomics, epigenomics, and others has largely been attributed as holding great promise for solving the complexity of many health and ecological problems such as complex genetic diseases and parasitic destruction of farming crops. By using bioinformatics, it is possible to take advantage of ‘omics’ data to gain a systems level molecular perspective to achieve insight into possible solutions. One possible solution is understanding and expanding the use of alternative splicing (AS) of mRNA precursors. Typically, genes are considered the focal point as the main players in the molecular world. However, due to recent ‘omics’ analysis across the past decade, AS has been demonstrated to be the main player in causing protein diversity. This is possible as AS rearranges the key components of a gene (exon, intron, and untranslated regions) to generate diverse functionally unique proteins and regulatory RNAs.
  • Searching for Compact Hierarchical Structures in DNA by Means of the Smallest Grammar Problem Matthias Gallé

    Searching for Compact Hierarchical Structures in DNA by Means of the Smallest Grammar Problem Matthias Gallé

    Searching for Compact Hierarchical Structures in DNA by means of the Smallest Grammar Problem Matthias Gallé To cite this version: Matthias Gallé. Searching for Compact Hierarchical Structures in DNA by means of the Smallest Grammar Problem. Modeling and Simulation. Université Rennes 1, 2011. English. tel-00595494 HAL Id: tel-00595494 https://tel.archives-ouvertes.fr/tel-00595494 Submitted on 24 May 2011 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. N° d’ordre : ANNÉE 2011 THÈSE / UNIVERSITÉ DE RENNES 1 sous le sceau de l’Université Européenne de Bretagne pour le grade de DOCTEUR DE L’UNIVERSITÉ DE RENNES 1 Mention : Informatique Ecole doctorale MATISSE présentée par Matthias Gallé préparée au équipe Symbiose du laboratoire INRIA Rennes - Bretagne Atlantique Searching for Compact Hierarchical Structures in DNA by means of the Smallest Grammar Problem Thèse soutenue à Rennes le 15 février 2011 devant le jury composé de : Alberto APOSTOLICO Professeur Georgia Tech, Atlanta, USA François COSTE Charge de Recherche INRIA, France Alexander CLARK Professeur, University of London, London, UK Jacques NICOLAS Directeur de Recherche INRIA, France Eric RIVALS Directeur de Recherche CNRS, LIRMM, France Gabriel INFANTE-LOPEZ Professeur, Universidad de Córdoba, Argentina Twas brillig, and the slithy toves Did gyre and gimble in the wabe; All mimsy were the borogoves, And the mome raths outgrabe.
  • Multiplexed Microsatellite Recovery Using Massively Parallel Sequencing

    Multiplexed Microsatellite Recovery Using Massively Parallel Sequencing

    Molecular Ecology Resources (2011) 11, 1060–1067 doi: 10.1111/j.1755-0998.2011.03033.x Multiplexed microsatellite recovery using massively parallel sequencing T. N. JENNINGS,* B. J. KNAUS,* T. D. MULLINS,† S. M. HAIG† and R. C. CRONN* *Pacific Northwest Research Station, USDA Forest Service, 3200 SW Jefferson Way, Corvallis, OR 97331, USA, †Forest and Rangeland Ecosystem Science Center, US Geological Survey, 3200 SW Jefferson Way, Corvallis, OR 97331, USA Abstract Conservation and management of natural populations requires accurate and inexpensive genotyping methods. Traditional microsatellite, or simple sequence repeat (SSR), marker analysis remains a popular genotyping method because of the com­ paratively low cost of marker development, ease of analysis and high power of genotype discrimination. With the availabil­ ity of massively parallel sequencing (MPS), it is now possible to sequence microsatellite-enriched genomic libraries in multiplex pools. To test this approach, we prepared seven microsatellite-enriched, barcoded genomic libraries from diverse taxa (two conifer trees, five birds) and sequenced these on one lane of the Illumina Genome Analyzer using paired-end 80-bp reads. In this experiment, we screened 6.1 million sequences and identified 356 958 unique microreads that contained di- or trinucleotide microsatellites. Examination of four species shows that our conversion rate from raw sequences to polymorphic markers compares favourably to Sanger- and 454-based methods. The advantage of multiplexed MPS is that the staggering capacity of modern microread sequencing is spread across many libraries; this reduces sample preparation and sequencing costs to less than $400 (USD) per species. This price is sufficiently low that microsatellite libraries could be prepared and sequenced for all 1373 organisms listed as ‘threatened’ and ‘endangered’ in the United States for under $0.5 M (USD).