Brno University of Technology Vysoké Učení Technické V Brně

Total Page:16

File Type:pdf, Size:1020Kb

Brno University of Technology Vysoké Učení Technické V Brně BRNO UNIVERSITY OF TECHNOLOGY VYSOKÉ UČENÍ TECHNICKÉ V BRNĚ FACULTY OF ELECTRICAL ENGINEERING AND COMMUNICATION FAKULTA ELEKTROTECHNIKY A KOMUNIKAČNÍCH TECHNOLOGIÍ DEPARTMENT OF BIOMEDICAL ENGINEERING ÚSTAV BIOMEDICÍNSKÉHO INŽENÝRSTVÍ TRANSCRIPTOMIC CHARACTERIZATION USING RNA-SEQ DATA ANALYSIS TRANSKRIPTOMICKÁ CHARAKTERIZACE POMOCÍ ANALÝZY RNA-SEQ DAT DOCTORAL THESIS DIZERTAČNÍ PRÁCE AUTHOR Layal Abo Khayal AUTOR PRÁCE SUPERVISOR prof. Ing. Ivo Provazník, Ph.D. ŠKOLITEL BRNO 2017 ABSTRACT The high-throughputs sequence technologies produce a massive amount of data, that can reveal new genes, identify splice variants, and quantify gene expression genome-wide. However, the volume and the complexity of data from RNA-seq experiments necessitate a scalable, and mathematical analysis based on a robust statistical model. Therefore, it is challenging to design integrated workflow, that incorporates the various analysis procedures. Particularly, the comparative transcriptome analysis is complicated due to several sources of measurement variability and poses numerous statistical challenges. In this research, we performed an integrated transcriptional profiling pipeline, which generates novel reproducible codes to obtain biologically interpretable results. Starting with the annotation of RNA-seq data and quality assessment, we provided a set of codes to serve the quality assessment visualization needed for establishing the RNA-Seq data analysis experiment. Additionally, we performed comprehensive differential gene expression analysis, presenting descriptive methods to interpret the RNA-Seq data. For implementing alternative splicing and differential exons usage analysis, we improved the performance of the Bioconductor package DEXSeq by defining the open reading frame of the exonic regions, which are differentially used between biological conditions due to the alternative splicing of the transcripts. Furthermore, we present a new methodology to analyze the differentially expressed long non-coding RNA, by finding the functional correlation of the long non-coding RNA with neighboring differential expressed protein coding genes. Thus, we obtain a clearer view of the regulation mechanism, and give a hypothesis about the role of long non-coding RNA in gene expression regulation. KEYWORDS: RNA-Seq, Differential Gene Expression (DGE), Alternative splicing, Differential Exon Usage (DEU), long non-coding RNA (lncRNA). I ABSTRAKT Vysoce výkonné sekvenční technologie produkují obrovské množství dat, která mohou odhalit nové geny, identifikovat splice varianty a kvantifikovat genovou expresi v celém genomu. Objem a složitost dat z RNA-seq experimentů vyžadují škálovatelné metody matematické analýzy založené na robustníchstatistických modelech. Je náročné navrhnout integrované pracovní postupy, které zahrnují různé postupy analýzy. Konkrétně jsou to srovnávací testy transkriptů, které jsou komplikovány několika zdroji variability měření a představují řadu statistických problémů. V tomto výzkumu byla sestavena integrovaná transkripční profilová pipeline k produkci nových reprodukovatelných kódů pro získání biologicky interpretovovatelných výsledků. Počínaje anotací údajů RNA-seq a hodnocení kvality je navržen soubor kódů, který slouží pro vizualizaci hodnocení kvality, potřebné pro zajištění RNA-Seq experimentu s analýzou dat. Dále je provedena komplexní diferenciální analýza genových expresí, která poskytuje popisné metody pro testované RNA-Seq data. Pro implementaci analýzy alternativního sestřihu a diferenciálních exonů jsme zlepšili výkon DEXSeq definováním otevřeného čtecího rámce exonového regionu, který se používá alternativně. Dále je popsána nová metodologie pro analýzu diferenciálně exprimované dlouhé nekódující RNA nalezením funkční korelace této RNA se sousedícími diferenciálně exprimovanými geny kódujícími proteiny. Takto je získán jasnější pohled na regulační mechanismus a poskytnuta hypotéza o úloze dlouhé nekódující RNA v regulaci genové exprese. KLÍČOVÁ SLOVA RNA-Seq, diferenciální genová exprese (DGE), alternativní splicing, diferenciální použití exonů (DEU), dlouhá nekódující RNA (lncRNA). II BIBLIOGRAPHIC CITATION Abo Khayal, Layal. Transcriptomic characterization using RNA-Seq data analysis Brno: Brno University of Technology, Faculty of Electrical Engineering and Communication, 2018. 125 p. Academic advisor: Prof. Ing. Ivo Provazník, Ph.D. III DECLARATION I declare that I have written my doctoral thesis on the theme of “Transcriptomic characterization using RNA-Seq data analysis” independently, under the guidance of the doctoral thesis supervisor and using the technical literature and other sources of information which are all quoted in the thesis and detailed in the list of literature at the end of the thesis. As the author of the doctoral thesis I furthermore declare that, as regards the creation of this doctoral thesis, I have not infringed any copyright. In particular, I have not unlawfully encroached on anyone’s personal and/or ownership rights and I am fully aware of the consequences in the case of breaking Regulation §11 and the following of the Copyright Act No 121/2000 Sb., and of the rights related to intellectual property right and changes in some Acts (Intellectual Property Act) and formulated in later regulations, inclusive of the possible consequences resulting from the provisions of Criminal Act No 40/2009 Sb., Section 2, Head VI, Part 4. Brno 18.10.2017 .................................. Ing. Layal Abo Khayal IV ACKNOWLEDGEMENT I would like to thank all the people who supported me during my doctoral study in the Department of Biomedical Engineering at Brno University of Technology, and during my research stay in Charité medical university, Germany. Especially, I thank my supervisor Prof. Ing. Ivo Provaznik, Ph.D. head of the Department of Biomedical Engineering at Brno University of Technology for his endless support. I would like also to express my gratitude and appreciation to my external supervisor Prof. Dr. Med. Peter N Robinson from Charité Berlin, for his professional guidance. Special thanks to my family in Syria and friends for their great patience, persistence, and inspiration. I dedicate my PhD thesis to my mother, and to my great mother and homeland Syria, wishing peace to return to its whole lands soon. Brno, 18.10.2017 …………………………….. Ing. Layal Abo Khayal V TABLE OF CONTENTS Introduction to RNA Sequencing ........................................................................................................................ 1 1. THE THEORETICAL REVIEW ............................................................................................................... 3 1.1. Isolation of RNAs ................................................................................................................................. 3 1.2. Quality Control of RNA ........................................................................................................................ 3 1.3. Library Preparation ............................................................................................................................... 4 1.4. RNA-Seq Platforms .............................................................................................................................. 6 1.4.1 Illumina ........................................................................................................................................ 6 1.4.2 SOLID .......................................................................................................................................... 7 1.4.3 Roche 454 ..................................................................................................................................... 7 1.4.4 Ion Torrent .................................................................................................................................... 9 1.4.5 Pacific Biosciences ..................................................................................................................... 10 1.4.6 Nanopore Technologies .............................................................................................................. 13 1.5. RNA-Seq Applications ........................................................................................................................ 14 1.5.1 Protein Coding Gene Structure ................................................................................................... 15 1.5.2 Novel Protein-Coding Genes ...................................................................................................... 16 1.5.3 Quantifying and Comparing Gene Expression ........................................................................... 16 1.5.4 Expression Quantitative Trait Loci (eQTL) ............................................................................... 18 1.5.5 Single-Cell RNA-Seq ................................................................................................................. 18 1.5.6 Fusion Genes .............................................................................................................................. 18 1.5.7 Gene Variations .......................................................................................................................... 19 1.5.8 Long Noncoding RNAs .............................................................................................................. 19 1.5.9 Small Noncoding RNAs (miRNA-seq) .....................................................................................
Recommended publications
  • Analysis of Trans Esnps Infers Regulatory Network Architecture
    Analysis of trans eSNPs infers regulatory network architecture Anat Kreimer Submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in the Graduate School of Arts and Sciences COLUMBIA UNIVERSITY 2014 © 2014 Anat Kreimer All rights reserved ABSTRACT Analysis of trans eSNPs infers regulatory network architecture Anat Kreimer eSNPs are genetic variants associated with transcript expression levels. The characteristics of such variants highlight their importance and present a unique opportunity for studying gene regulation. eSNPs affect most genes and their cell type specificity can shed light on different processes that are activated in each cell. They can identify functional variants by connecting SNPs that are implicated in disease to a molecular mechanism. Examining eSNPs that are associated with distal genes can provide insights regarding the inference of regulatory networks but also presents challenges due to the high statistical burden of multiple testing. Such association studies allow: simultaneous investigation of many gene expression phenotypes without assuming any prior knowledge and identification of unknown regulators of gene expression while uncovering directionality. This thesis will focus on such distal eSNPs to map regulatory interactions between different loci and expose the architecture of the regulatory network defined by such interactions. We develop novel computational approaches and apply them to genetics-genomics data in human. We go beyond pairwise interactions to define network motifs, including regulatory modules and bi-fan structures, showing them to be prevalent in real data and exposing distinct attributes of such arrangements. We project eSNP associations onto a protein-protein interaction network to expose topological properties of eSNPs and their targets and highlight different modes of distal regulation.
    [Show full text]
  • Genomic Correlates of Relationship QTL Involved in Fore- Versus Hind Limb Divergence in Mice
    Loyola University Chicago Loyola eCommons Biology: Faculty Publications and Other Works Faculty Publications 2013 Genomic Correlates of Relationship QTL Involved in Fore- Versus Hind Limb Divergence in Mice Mihaela Palicev Gunter P. Wagner James P. Noonan Benedikt Hallgrimsson James M. Cheverud Loyola University Chicago, [email protected] Follow this and additional works at: https://ecommons.luc.edu/biology_facpubs Part of the Biology Commons Recommended Citation Palicev, M, GP Wagner, JP Noonan, B Hallgrimsson, and JM Cheverud. "Genomic Correlates of Relationship QTL Involved in Fore- Versus Hind Limb Divergence in Mice." Genome Biology and Evolution 5(10), 2013. This Article is brought to you for free and open access by the Faculty Publications at Loyola eCommons. It has been accepted for inclusion in Biology: Faculty Publications and Other Works by an authorized administrator of Loyola eCommons. For more information, please contact [email protected]. This work is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 3.0 License. © Palicev et al., 2013. GBE Genomic Correlates of Relationship QTL Involved in Fore- versus Hind Limb Divergence in Mice Mihaela Pavlicev1,2,*, Gu¨ nter P. Wagner3, James P. Noonan4, Benedikt Hallgrı´msson5,and James M. Cheverud6 1Konrad Lorenz Institute for Evolution and Cognition Research, Altenberg, Austria 2Department of Pediatrics, Cincinnati Children‘s Hospital Medical Center, Cincinnati, Ohio 3Yale Systems Biology Institute and Department of Ecology and Evolutionary Biology, Yale University 4Department of Genetics, Yale University School of Medicine 5Department of Cell Biology and Anatomy, The McCaig Institute for Bone and Joint Health and the Alberta Children’s Hospital Research Institute for Child and Maternal Health, University of Calgary, Calgary, Canada 6Department of Anatomy and Neurobiology, Washington University *Corresponding author: E-mail: [email protected].
    [Show full text]
  • Coupling of Spliceosome Complexity to Intron Diversity
    bioRxiv preprint doi: https://doi.org/10.1101/2021.03.19.436190; this version posted March 20, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. Coupling of spliceosome complexity to intron diversity Jade Sales-Lee1, Daniela S. Perry1, Bradley A. Bowser2, Jolene K. Diedrich3, Beiduo Rao1, Irene Beusch1, John R. Yates III3, Scott W. Roy4,6, and Hiten D. Madhani1,6,7 1Dept. of Biochemistry and Biophysics University of California – San Francisco San Francisco, CA 94158 2Dept. of Molecular and Cellular Biology University of California - Merced Merced, CA 95343 3Department of Molecular Medicine The Scripps Research Institute, La Jolla, CA 92037 4Dept. of Biology San Francisco State University San Francisco, CA 94132 5Chan-Zuckerberg Biohub San Francisco, CA 94158 6Corresponding authors: [email protected], [email protected] 7Lead Contact 1 bioRxiv preprint doi: https://doi.org/10.1101/2021.03.19.436190; this version posted March 20, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. SUMMARY We determined that over 40 spliceosomal proteins are conserved between many fungal species and humans but were lost during the evolution of S. cerevisiae, an intron-poor yeast with unusually rigid splicing signals. We analyzed null mutations in a subset of these factors, most of which had not been investigated previously, in the intron-rich yeast Cryptococcus neoformans.
    [Show full text]
  • A Gene-Level Methylome-Wide Association Analysis Identifies Novel
    bioRxiv preprint doi: https://doi.org/10.1101/2020.07.13.201376; this version posted July 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. 1 A gene-level methylome-wide association analysis identifies novel 2 Alzheimer’s disease genes 1 1 2 3 4 3 Chong Wu , Jonathan Bradley , Yanming Li , Lang Wu , and Hong-Wen Deng 1 4 Department of Statistics, Florida State University; 2 5 Department of Biostatistics & Data Science, University of Kansas Medical Center; 3 6 Population Sciences in the Pacific Program, University of Hawaii Cancer center; 4 7 Tulane Center for Biomedical Informatics and Genomics, Deming Department of Medicine, 8 Tulane University School of Medicine 9 Corresponding to: Chong Wu, Assistant Professor, Department of Statistics, Florida State 10 University, email: [email protected] 1 bioRxiv preprint doi: https://doi.org/10.1101/2020.07.13.201376; this version posted July 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. 11 Abstract 12 Motivation: Transcriptome-wide association studies (TWAS) have successfully facilitated the dis- 13 covery of novel genetic risk loci for many complex traits, including late-onset Alzheimer’s disease 14 (AD). However, most existing TWAS methods rely only on gene expression and ignore epige- 15 netic modification (i.e., DNA methylation) and functional regulatory information (i.e., enhancer- 16 promoter interactions), both of which contribute significantly to the genetic basis ofAD.
    [Show full text]
  • A Computational Approach for Defining a Signature of Β-Cell Golgi Stress in Diabetes Mellitus
    Page 1 of 781 Diabetes A Computational Approach for Defining a Signature of β-Cell Golgi Stress in Diabetes Mellitus Robert N. Bone1,6,7, Olufunmilola Oyebamiji2, Sayali Talware2, Sharmila Selvaraj2, Preethi Krishnan3,6, Farooq Syed1,6,7, Huanmei Wu2, Carmella Evans-Molina 1,3,4,5,6,7,8* Departments of 1Pediatrics, 3Medicine, 4Anatomy, Cell Biology & Physiology, 5Biochemistry & Molecular Biology, the 6Center for Diabetes & Metabolic Diseases, and the 7Herman B. Wells Center for Pediatric Research, Indiana University School of Medicine, Indianapolis, IN 46202; 2Department of BioHealth Informatics, Indiana University-Purdue University Indianapolis, Indianapolis, IN, 46202; 8Roudebush VA Medical Center, Indianapolis, IN 46202. *Corresponding Author(s): Carmella Evans-Molina, MD, PhD ([email protected]) Indiana University School of Medicine, 635 Barnhill Drive, MS 2031A, Indianapolis, IN 46202, Telephone: (317) 274-4145, Fax (317) 274-4107 Running Title: Golgi Stress Response in Diabetes Word Count: 4358 Number of Figures: 6 Keywords: Golgi apparatus stress, Islets, β cell, Type 1 diabetes, Type 2 diabetes 1 Diabetes Publish Ahead of Print, published online August 20, 2020 Diabetes Page 2 of 781 ABSTRACT The Golgi apparatus (GA) is an important site of insulin processing and granule maturation, but whether GA organelle dysfunction and GA stress are present in the diabetic β-cell has not been tested. We utilized an informatics-based approach to develop a transcriptional signature of β-cell GA stress using existing RNA sequencing and microarray datasets generated using human islets from donors with diabetes and islets where type 1(T1D) and type 2 diabetes (T2D) had been modeled ex vivo. To narrow our results to GA-specific genes, we applied a filter set of 1,030 genes accepted as GA associated.
    [Show full text]
  • Supplementary Table 1: Adhesion Genes Data Set
    Supplementary Table 1: Adhesion genes data set PROBE Entrez Gene ID Celera Gene ID Gene_Symbol Gene_Name 160832 1 hCG201364.3 A1BG alpha-1-B glycoprotein 223658 1 hCG201364.3 A1BG alpha-1-B glycoprotein 212988 102 hCG40040.3 ADAM10 ADAM metallopeptidase domain 10 133411 4185 hCG28232.2 ADAM11 ADAM metallopeptidase domain 11 110695 8038 hCG40937.4 ADAM12 ADAM metallopeptidase domain 12 (meltrin alpha) 195222 8038 hCG40937.4 ADAM12 ADAM metallopeptidase domain 12 (meltrin alpha) 165344 8751 hCG20021.3 ADAM15 ADAM metallopeptidase domain 15 (metargidin) 189065 6868 null ADAM17 ADAM metallopeptidase domain 17 (tumor necrosis factor, alpha, converting enzyme) 108119 8728 hCG15398.4 ADAM19 ADAM metallopeptidase domain 19 (meltrin beta) 117763 8748 hCG20675.3 ADAM20 ADAM metallopeptidase domain 20 126448 8747 hCG1785634.2 ADAM21 ADAM metallopeptidase domain 21 208981 8747 hCG1785634.2|hCG2042897 ADAM21 ADAM metallopeptidase domain 21 180903 53616 hCG17212.4 ADAM22 ADAM metallopeptidase domain 22 177272 8745 hCG1811623.1 ADAM23 ADAM metallopeptidase domain 23 102384 10863 hCG1818505.1 ADAM28 ADAM metallopeptidase domain 28 119968 11086 hCG1786734.2 ADAM29 ADAM metallopeptidase domain 29 205542 11085 hCG1997196.1 ADAM30 ADAM metallopeptidase domain 30 148417 80332 hCG39255.4 ADAM33 ADAM metallopeptidase domain 33 140492 8756 hCG1789002.2 ADAM7 ADAM metallopeptidase domain 7 122603 101 hCG1816947.1 ADAM8 ADAM metallopeptidase domain 8 183965 8754 hCG1996391 ADAM9 ADAM metallopeptidase domain 9 (meltrin gamma) 129974 27299 hCG15447.3 ADAMDEC1 ADAM-like,
    [Show full text]
  • Dynamic Transcriptomic Profiles of Zebrafish Gills in Response to Zinc
    Zheng et al. BMC Genomics 2010, 11:548 http://www.biomedcentral.com/1471-2164/11/548 RESEARCH ARTICLE Open Access Dynamic transcriptomic profiles of zebrafish gills in response to zinc depletion Dongling Zheng1,4, Peter Kille2, Graham P Feeney2, Phil Cunningham1, Richard D Handy3, Christer Hogstrand1* Abstract Background: Zinc deficiency is detrimental to organisms, highlighting its role as an essential micronutrient contributing to numerous biological processes. To investigate the underlying molecular events invoked by zinc depletion we performed a temporal analysis of transcriptome changes observed within the zebrafish gill. This tissue represents a model system for studying ion absorption across polarised epithelial cells as it provides a major pathway for fish to acquire zinc directly from water whilst sharing a conserved zinc transporting system with mammals. Results: Zebrafish were treated with either zinc-depleted (water = 2.61 μgL-1; diet = 26 mg kg-1) or zinc-adequate (water = 16.3 μgL-1; diet = 233 mg kg-1) conditions for two weeks. Gill samples were collected at five time points and transcriptome changes analysed in quintuplicate using a 16K oligonucleotide array. Of the genes represented the expression of a total of 333 transcripts showed differential regulation by zinc depletion (having a fold-change greater than 1.8 and an adjusted P-value less than 0.1, controlling for a 10% False Discovery Rate). Down-regulation was dominant at most time points and distinct sets of genes were regulated at different stages. Annotation enrichment analysis revealed that ‘Developmental Process’ was the most significantly overrepresented Biological Process GO term (P = 0.0006), involving 26% of all regulated genes.
    [Show full text]
  • The Interactome of KRAB Zinc Finger Proteins Reveals the Evolutionary History of Their Functional Diversification
    Resource The interactome of KRAB zinc finger proteins reveals the evolutionary history of their functional diversification Pierre-Yves Helleboid1,†, Moritz Heusel2,†, Julien Duc1, Cécile Piot1, Christian W Thorball1, Andrea Coluccio1, Julien Pontis1, Michaël Imbeault1, Priscilla Turelli1, Ruedi Aebersold2,3,* & Didier Trono1,** Abstract years ago (MYA) (Imbeault et al, 2017). Their products harbor an N-terminal KRAB (Kru¨ppel-associated box) domain related to that of Krüppel-associated box (KRAB)-containing zinc finger proteins Meisetz (a.k.a. PRDM9), a protein that originated prior to the diver- (KZFPs) are encoded in the hundreds by the genomes of higher gence of chordates and echinoderms, and a C-terminal array of zinc vertebrates, and many act with the heterochromatin-inducing fingers (ZNF) with sequence-specific DNA-binding potential (Urru- KAP1 as repressors of transposable elements (TEs) during early tia, 2003; Birtle & Ponting, 2006; Imbeault et al, 2017). KZFP genes embryogenesis. Yet, their widespread expression in adult tissues multiplied by gene and segment duplication to count today more and enrichment at other genetic loci indicate additional roles. than 350 and 700 representatives in the human and mouse Here, we characterized the protein interactome of 101 of the ~350 genomes, respectively (Urrutia, 2003; Kauzlaric et al, 2017). A human KZFPs. Consistent with their targeting of TEs, most KZFPs majority of human KZFPs including all primate-restricted family conserved up to placental mammals essentially recruit KAP1 and members target sequences derived from TEs, that is, DNA trans- associated effectors. In contrast, a subset of more ancient KZFPs posons, ERVs (endogenous retroviruses), LINEs, SINEs (long and rather interacts with factors related to functions such as genome short interspersed nuclear elements, respectively), or SVAs (SINE- architecture or RNA processing.
    [Show full text]
  • Análise Integrativa De Perfis Transcricionais De Pacientes Com
    UNIVERSIDADE DE SÃO PAULO FACULDADE DE MEDICINA DE RIBEIRÃO PRETO PROGRAMA DE PÓS-GRADUAÇÃO EM GENÉTICA ADRIANE FEIJÓ EVANGELISTA Análise integrativa de perfis transcricionais de pacientes com diabetes mellitus tipo 1, tipo 2 e gestacional, comparando-os com manifestações demográficas, clínicas, laboratoriais, fisiopatológicas e terapêuticas Ribeirão Preto – 2012 ADRIANE FEIJÓ EVANGELISTA Análise integrativa de perfis transcricionais de pacientes com diabetes mellitus tipo 1, tipo 2 e gestacional, comparando-os com manifestações demográficas, clínicas, laboratoriais, fisiopatológicas e terapêuticas Tese apresentada à Faculdade de Medicina de Ribeirão Preto da Universidade de São Paulo para obtenção do título de Doutor em Ciências. Área de Concentração: Genética Orientador: Prof. Dr. Eduardo Antonio Donadi Co-orientador: Prof. Dr. Geraldo A. S. Passos Ribeirão Preto – 2012 AUTORIZO A REPRODUÇÃO E DIVULGAÇÃO TOTAL OU PARCIAL DESTE TRABALHO, POR QUALQUER MEIO CONVENCIONAL OU ELETRÔNICO, PARA FINS DE ESTUDO E PESQUISA, DESDE QUE CITADA A FONTE. FICHA CATALOGRÁFICA Evangelista, Adriane Feijó Análise integrativa de perfis transcricionais de pacientes com diabetes mellitus tipo 1, tipo 2 e gestacional, comparando-os com manifestações demográficas, clínicas, laboratoriais, fisiopatológicas e terapêuticas. Ribeirão Preto, 2012 192p. Tese de Doutorado apresentada à Faculdade de Medicina de Ribeirão Preto da Universidade de São Paulo. Área de Concentração: Genética. Orientador: Donadi, Eduardo Antonio Co-orientador: Passos, Geraldo A. 1. Expressão gênica – microarrays 2. Análise bioinformática por module maps 3. Diabetes mellitus tipo 1 4. Diabetes mellitus tipo 2 5. Diabetes mellitus gestacional FOLHA DE APROVAÇÃO ADRIANE FEIJÓ EVANGELISTA Análise integrativa de perfis transcricionais de pacientes com diabetes mellitus tipo 1, tipo 2 e gestacional, comparando-os com manifestações demográficas, clínicas, laboratoriais, fisiopatológicas e terapêuticas.
    [Show full text]
  • Mouse Fam32a Knockout Project (CRISPR/Cas9)
    https://www.alphaknockout.com Mouse Fam32a Knockout Project (CRISPR/Cas9) Objective: To create a Fam32a knockout Mouse model (C57BL/6J) by CRISPR/Cas-mediated genome engineering. Strategy summary: The Fam32a gene (NCBI Reference Sequence: NM_026455 ; Ensembl: ENSMUSG00000003039 ) is located on Mouse chromosome 8. 4 exons are identified, with the ATG start codon in exon 1 and the TAA stop codon in exon 4 (Transcript: ENSMUST00000003123). Exon 1~4 will be selected as target site. Cas9 and gRNA will be co-injected into fertilized eggs for KO Mouse production. The pups will be genotyped by PCR followed by sequencing analysis. Note: Exon 1 starts from about 0.3% of the coding region. Exon 1~4 covers 100.0% of the coding region. The size of effective KO region: ~2461 bp. The KO region does not have any other known gene. Page 1 of 8 https://www.alphaknockout.com Overview of the Targeting Strategy Wildtype allele 5' gRNA region gRNA region 3' 1 2 3 4 Legends Exon of mouse Fam32a Knockout region Page 2 of 8 https://www.alphaknockout.com Overview of the Dot Plot (up) Window size: 15 bp Forward Reverse Complement Sequence 12 Note: The 2000 bp section upstream of start codon is aligned with itself to determine if there are tandem repeats. No significant tandem repeat is found in the dot plot matrix. So this region is suitable for PCR screening or sequencing analysis. Overview of the Dot Plot (down) Window size: 15 bp Forward Reverse Complement Sequence 12 Note: The 2000 bp section downstream of stop codon is aligned with itself to determine if there are tandem repeats.
    [Show full text]
  • About the Existence of Common Determinants of Gene Expression In
    González-Prendes et al. BMC Genomics (2019) 20:518 https://doi.org/10.1186/s12864-019-5889-5 RESEARCH ARTICLE Open Access About the existence of common determinants of gene expression in the porcine liver and skeletal muscle Rayner González-Prendes1,6†, Emilio Mármol-Sánchez1†, Raquel Quintanilla2, Anna Castelló1,3, Ali Zidi1, Yuliaxis Ramayo-Caldas1, Tainã Figueiredo Cardoso1,4, Arianna Manunza1,ÁngelaCánovas1,5 and Marcel Amills 1,3* Abstract Background: The comparison of expression QTL (eQTL) maps obtained in different tissues is an essential step to understand how gene expression is genetically regulated in a context-dependent manner. In the current work, we have compared the transcriptomic and eQTL profiles of two porcine tissues (skeletal muscle and liver) which typically show highly divergent expression profiles, in 103 Duroc pigs genotyped with the Porcine SNP60 BeadChip (Illumina) and with available microarray-based measurements of hepatic and muscle mRNA levels. Since structural variation could have effects on gene expression, we have also investigated the co-localization of cis-eQTLs with copy number variant regions (CNVR) segregating in this Duroc population. Results: The analysis of differential expresssion revealed the existence of 1204 and 1490 probes that were overexpressed and underexpressed in the gluteus medius muscle when compared to liver, respectively (|fold- change| > 1.5, q-value < 0.05). By performing genome scans in 103 Duroc pigs with available expression and genotypic data, we identified 76 and 28 genome-wide significant cis-eQTLs regulating gene expression in the gluteus medius muscle and liver, respectively. Twelve of these cis-eQTLs were shared by both tissues (i.e.
    [Show full text]
  • Mouse Mtarc2 Conditional Knockout Project (CRISPR/Cas9)
    https://www.alphaknockout.com Mouse Mtarc2 Conditional Knockout Project (CRISPR/Cas9) Objective: To create a Mtarc2 conditional knockout Mouse model (C57BL/6J) by CRISPR/Cas-mediated genome engineering. Strategy summary: The Mtarc2 gene (NCBI Reference Sequence: NM_133684.3 ; Ensembl: ENSMUSG00000073481 ) is located on Mouse chromosome 1. 8 exons are identified, with the ATG start codon in exon 1 and the TAG stop codon in exon 7 (Transcript: ENSMUST00000068725). Exon 3 will be selected as conditional knockout region (cKO region). Deletion of this region should result in the loss of function of the Mouse Mtarc2 gene. To engineer the targeting vector, homologous arms and cKO region will be generated by PCR using BAC clone RP23-221J17 as template. Cas9, gRNA and targeting vector will be co-injected into fertilized eggs for cKO Mouse production. The pups will be genotyped by PCR followed by sequencing analysis. Note: Exon 3 starts from about 44.08% of the coding region. The knockout of Exon 3 will result in frameshift of the gene. The size of intron 2 for 5'-loxP site insertion: 7225 bp, and the size of intron 3 for 3'-loxP site insertion: 1143 bp. The size of effective cKO region: ~669 bp. The cKO region does not have any other known gene. Page 1 of 8 https://www.alphaknockout.com Overview of the Targeting Strategy Wildtype allele gRNA region 5' gRNA region 3' 1 3 4 8 Targeting vector Targeted allele Constitutive KO allele (After Cre recombination) Legends Exon of mouse Mtarc2 Homology arm cKO region loxP site Page 2 of 8 https://www.alphaknockout.com Overview of the Dot Plot Window size: 10 bp Forward Reverse Complement Sequence 12 Note: The sequence of homologous arms and cKO region is aligned with itself to determine if there are tandem repeats.
    [Show full text]