Flexible Approach for Novel Transcript Reconstruction from RNA-Seq Data Using Maximum Likelihood Integer Programming

Total Page:16

File Type:pdf, Size:1020Kb

Flexible Approach for Novel Transcript Reconstruction from RNA-Seq Data Using Maximum Likelihood Integer Programming Flexible Approach for Novel Transcript Reconstruction from RNA-Seq Data using Maximum Likelihood Integer Programming Serghei Mangul Adrian Caciula Sahar A. Seesi Georgia State University Georgia State University University of Connecticut Atlanta, GA, 30303, USA Atlanta, GA, 30303, USA Storrs, CT, 06269, USA [email protected] [email protected] [email protected] Dumitru Brinza Abdul R. Banday Rahul Kanadia Life Technologies Corporation University of Connecticut University of Connecticut Foster City, CA, 94404, USA Storrs, CT, 06269, USA Storrs, CT, 06269, USA [email protected] [email protected] [email protected] Ion Mandoiu Alex. Zelikovsky University of Connecticut Georgia State University Storrs, CT, 06269, USA Atlanta, GA, 30303, USA [email protected] [email protected] Abstract sis, rapidly replacing array-based technologies. Most cur- In this paper, we propose a novel, intuitive and flex- rent research, using RNA-Seq, employs methods that de- ible approach for transcriptome reconstruction from sin- pend on existing transcriptome annotations. Unfortunately, gle RNA-Seq reads, called “ Maximum Likelihood Inte- as shown by recent targeted RNA-Seq studies [1], existing ger Programming ” (MLIP) method. MLIP creates a splice transcript libraries still miss large numbers of transcripts. graph based on aligned RNA-Seq reads and enumerates all The incompleteness of annotation libraries poses a serious maximal paths corresponding to putative transcripts. The limitation to using this powerful technology since accurate problem of selecting true transcripts is formulated as an normalization of RNA-Seq data critically requires knowl- integer program which minimizes the number of selected edge of expressed transcript sequences [2, 3, 4, 5]. As a re- candidate transcripts. Our method purpose is to predict the sult, transcript discovery from RNA-Seq has been the focus minimum number of transcripts explaining the set of in- of many research in recent years. The sequences of novel put reads with the highest quantification accuracy. This is transcripts can be reconstructed from deep RNA-Seq data, achieved by coupling a integer programming formulation but this is computationally challenging due to sequencing with an expectation maximization model for transcript ex- errors, uneven coverage of expressed transcripts, and the pression estimation. MLIP has the advantage of offering need to distinguish between highly similar transcripts pro- different levels of stringency that would gear the results to- duced by alternative splicing. wards higher precision or higher sensitivity, according to the user preference. We test MLIP method on simulated and real data, and we show that MLIP outperforms both 2 Related Work Cufflinks and IsoLasso. A number of recent works have addressed the prob- lem of transcriptome reconstruction from RNA-Seq reads. 1 Introduction These methods fall into three categories: “genome- guided”, “genome-independent” and “annotation-guided” Massively parallel whole transcriptome sequencing, methods [6]. Genome-independent methods such as Trin- commonly referred to as RNA-Seq, and its ability to gen- ity [7] or transAbyss [8] directly assemble reads into tran- erate full transcriptome data at the single transcript level, scripts. A commonly used approach for such methods is de provides a powerful tool for gene and isoform specific ex- Brujin graph [9] utilizing ”k-mers”. The use of genome- pression profiling. As a result, RNA-Seq has become the independent methods becomes essential when there is no technology of choice for performing transcriptome analy- trusted genome reference that can be used to guide recon- struction. On the other end of the spectrum, annotation fragment for paired-end technology[22] . The primary goal guided methods [10, 11, 12] make use of available infor- of our study is to developed a method for longer single-end mation in existing transcript annotations to aid in the dis- reads. In future we plan to extend out method to support covery of novel transcripts. RNA-Seq reads can be mapped paired-end reads. We test our method on simulated and real onto reference genome, reference annotations, exon-exon data and compare with two of the available genome guided junction libraries, or combinations thereof, and the result- methods, Cufflinks and IsoLasso. ing alignments are used to reconstruct transcripts. Many transcriptome reconstruction methods fall in the genome-guided category. They typically start by map- 3 Methods ping sequencing reads onto the reference genome,using spliced alignment tools, such as TopHat [13] or SpliceMap The maximum likelihood integer programming [14]. The spliced alignments are used to identify exons and (MLIP) method starts from a set of putative transcripts and transcripts that explain the alignments. While some meth- selects the subset of this transcripts with the highest sup- ods aim to achieve the highest sensitivity, others work to port from the RNA-Seq reads. We formulate this prob- predict the smallest set of transcripts explaining the given lem as an integer program. The objective is to select the input reads. Furthermore, some methods aim to recon- smallest set of putative transcripts that sufficiently explain struct the set of transcripts that would insure the highest the RNA-Seq data. Further, maximum likelihood estima- quantification accuracy. Scripture [15] construct a splic- tor is applied to all possible combinations of putative tran- ing graph from the mapped reads and reconstructs isoforms scripts of minimum size determined by integer program. corresponding to all possible paths in this graph. It then Our method offers different level of stringency from low to uses paired-end information to filter out some transcripts. high. Low stringency gives priority to sensitivity of recon- Although scripture achieves very high sensitivity, it may struction over precision of reconstruction, high stringency predict a lot of incorrect isoforms. The method of Trap- gives priority to precision over sensitivity. The default pa- nell et al. [16, 17], referred to as Cufflinks, constructs a rameter of the MLIP method is medium stringency that read overlap graph and generates candidate transcripts by achieves balance between sensitivity and precision of re- finding a minimal size path cover via a reduction to max- construction imum matching in a weighted bipartite graph. TRIP [18] uses an integer programming model where the objective is to select the smallest set of putative transcripts that yields 3.1 Model description a good statistical fit between the fragment length distribu- tion empirically determined during library preparation and We use a splice graph (SG) to represent alternatively fragment lengths implied by mapping read pairs to selected spliced isoforms for every gene in a sample. A SG is a transcripts. Cufflinks, Scripture, and TRIP do not target directed acyclic graph where each vertex in the graph rep- the quantification accuracy. IsoLasso [19] uses the LASSO resents a segment of a gene. Two segments are connected [20] algorithm, and it aims to achieve a balance between by an edge if they are adjacent in at least one transcript. quantification accuracy and predicting the minimum num- To partition a gene into a set of non-overlapping segments, ber of isoforms. It formulates the problem as a quadratic information about alternative variants is used. Typically, programming one, with additional constraints to ensure that alternative variants occurs due alternative transcriptional all exons and junctions supported by the reads are included events and alternative splicing events [23] . Transcriptional in the predicted isoforms. CLIIQ [21] uses an integer linear events include: alternative first exon, alternative last exon. programming solution that minimizes the number of pre- Splicing events include: exon skipping, intron retention, al- dicted isoforms explaining the RNA-Seq reads while mini- ternative 5’ splice site (A5SS), and alternative 3’ splice site mizing the difference between estimated and observed ex- (A3SS). Transcriptional events may consist only of non- pression levels of exons and junctions within the predicted overlapping exons. If exons partially overlap and they serve isoforms. as a first or last exons we will refer to such event as A5SS In this paper, we present a genome guided method or A3SS respectively. for transcriptome reconstruction from RNA-Seq reads. Our Figure 1-A shows an example of a gene with 4 differ- method aims to predict the minimum number of transcripts ent exons, and 3 transcripts produced by alternative splic- explaining the set of input reads with the highest quantifica- ing. To represent such alternative variants we suggest tion accuracy. This is achieved by coupling a integer pro- to process the gene as a set of so called “pseudo-exons” gramming formulation with an expectation maximization based on alternative variants obtained from aligned RNA- model for isoform expression estimation. seq reads. A pseudo-exon is a region of a gene between Recent advances in Next Generation Sequencing consecutive transcriptional or splicing events, i.e. starting (NGS) technologies made it possible to produce longer or ending of an exon, as shown in figure 1-B. Hence ev- single-end reads with the length comparable to length of ery gene has a set of non-overlapping pseudo-exons, from which it is possible to reconstruct a set of putative tran- can use existing annotation for P AS or specialized proto- scripts. cols such as the PolyA-Seq protocol [24]. PQC7R$ 3.2 Maximum Likelihood Integer Programming AAA A. Map single reads to Solution genome and identify the pseudo-exons Genome Here we introduce 2-step approach for novel tran- pse intron pse pse pse 1 2 3 4 script reconstruction from single-end RNA-Seq reads. First, we introduce the integer program (IP ) formulation, which has an objective to minimize number of transcripts B. Splice Graph (SG) 1 2 3 4 sufficiently well covering observed reads. Since such for- mulation can lead to many identical optimal solutions we will use the additional step to select maximum likelihood Pseudo-exons Introns solution based on deviation between observed and expected read frequencies.
Recommended publications
  • Transposon Activation Mutagenesis As a Screening Tool for Identifying
    Chen et al. BMC Cancer 2013, 13:93 http://www.biomedcentral.com/1471-2407/13/93 RESEARCH ARTICLE Open Access Transposon activation mutagenesis as a screening tool for identifying resistance to cancer therapeutics Li Chen1,2*, Lynda Stuart2, Toshiro K Ohsumi3, Shawn Burgess4, Gaurav K Varshney4, Anahita Dastur1, Mark Borowsky3, Cyril Benes1, Adam Lacy-Hulbert2 and Emmett V Schmidt2 Abstract Background: The development of resistance to chemotherapies represents a significant barrier to successful cancer treatment. Resistance mechanisms are complex, can involve diverse and often unexpected cellular processes, and can vary with both the underlying genetic lesion and the origin or type of tumor. For these reasons developing experimental strategies that could be used to understand, identify and predict mechanisms of resistance in different malignant cells would be a major advance. Methods: Here we describe a gain-of-function forward genetic approach for identifying mechanisms of resistance. This approach uses a modified piggyBac transposon to generate libraries of mutagenized cells, each containing transposon insertions that randomly activate nearby gene expression. Genes of interest are identified using next- gen high-throughput sequencing and barcode multiplexing is used to reduce experimental cost. Results: Using this approach we successfully identify genes involved in paclitaxel resistance in a variety of cancer cell lines, including the multidrug transporter ABCB1, a previously identified major paclitaxel resistance gene. Analysis of co-occurring transposons integration sites in single cell clone allows for the identification of genes that might act cooperatively to produce drug resistance a level of information not accessible using RNAi or ORF expression screening approaches. Conclusion: We have developed a powerful pipeline to systematically discover drug resistance in mammalian cells in vitro.
    [Show full text]
  • Algorithms for Transcriptome Quantification and Reconstruction from RNA-Seq Data
    Georgia State University ScholarWorks @ Georgia State University Computer Science Dissertations Department of Computer Science Fall 11-16-2012 Algorithms for Transcriptome Quantification and Reconstruction from RNA-Seq Data Serghei Mangul Follow this and additional works at: https://scholarworks.gsu.edu/cs_diss Recommended Citation Mangul, Serghei, "Algorithms for Transcriptome Quantification and Reconstruction from RNA-Seq Data." Dissertation, Georgia State University, 2012. https://scholarworks.gsu.edu/cs_diss/71 This Dissertation is brought to you for free and open access by the Department of Computer Science at ScholarWorks @ Georgia State University. It has been accepted for inclusion in Computer Science Dissertations by an authorized administrator of ScholarWorks @ Georgia State University. For more information, please contact [email protected]. ALGORITHMS FOR TRANSCRIPTOME QUANTIFICATION AND RECONSTRUCTION FROM RNA-SEQ DATA by SERGHEI MANGUL Under the Direction of Dr. Alexander Zelikovsky ABSTRACT Massively parallel whole transcriptome sequencing and its ability to generate full tran- scriptome data at the single transcript level provides a powerful tool with multiple inter- related applications, including transcriptome reconstruction, gene/isoform expression esti- mation, also known as transcriptome quantification. As a result, whole transcriptome se- quencing has become the technology of choice for performing transcriptome analysis, rapidly replacing array-based technologies. The most commonly used transcriptome sequencing pro- tocol, referred to as RNA-Seq, generates short (single or paired) sequencing tags from the ends of randomly generated cDNA fragments. RNA-Seq protocol reduces the sequencing cost and significantly increases data throughput, but is computationally challenging to re- construct full-length transcripts and accurately estimate their abundances across all cell types. We focus on two main problems in transcriptome data analysis, namely, transcriptome reconstruction and quantification.
    [Show full text]
  • Muscle Enriched Lamin Interacting Protein (Mlip) Binds Chromatin and Is Required for Myoblast Differentiation
    cells Article Muscle Enriched Lamin Interacting Protein (Mlip) Binds Chromatin and Is Required for Myoblast Differentiation Elmira Ahmady 1,2, Alexandre Blais 1,3,4 and Patrick G. Burgon 5,* 1 Department of Biochemistry, Microbiology, and Immunology, University of Ottawa, Ottawa, ON K1H 8M5, Canada; [email protected] (E.A.); [email protected] (A.B.) 2 Molecular Signaling Laboratory, University of Ottawa Heart Institute, Ottawa, ON K1Y 4W7, Canada 3 Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, ON K1H 8M5, Canada 4 University of Ottawa Centre for Infection, Immunity and Inflammation (CI3), Ottawa, ON K1H 8M5, Canada 5 Department of Chemistry and Earth Sciences, College of Arts and Sciences, Qatar University, Doha 999043, Qatar * Correspondence: [email protected] Abstract: Muscle-enriched A-type lamin-interacting protein (Mlip) is a recently discovered Amniota gene that encodes proteins of unknown biological function. Here we report Mlip’s direct interaction with chromatin, and it may function as a transcriptional co-factor. Chromatin immunoprecipitations with microarray analysis demonstrated a propensity for Mlip to associate with genomic regions in close proximity to genes that control tissue-specific differentiation. Gel mobility shift assays confirmed that Mlip protein complexes with genomic DNA. Blocking Mlip expression in C2C12 myoblasts down- regulates myogenic regulatory factors (MyoD and MyoG) and subsequently significantly inhibits myogenic differentiation and the formation of myotubes. Collectively our data demonstrate that Mlip is required for C2C12 myoblast differentiation into myotubes. Mlip may exert this role as a transcriptional regulator of a myogenic program that is unique to amniotes. Citation: Ahmady, E.; Blais, A.; Burgon, P.G.
    [Show full text]
  • S41467-019-13840-9.Pdf
    ARTICLE https://doi.org/10.1038/s41467-019-13840-9 OPEN Accurate quantification of circular RNAs identifies extensive circular isoform switching events Jinyang Zhang 1,2, Shuai Chen1, Jingwen Yang1 & Fangqing Zhao1,2,3* Detection and quantification of circular RNAs (circRNAs) face several significant challenges, including high false discovery rate, uneven rRNA depletion and RNase R treatment efficiency, and underestimation of back-spliced junction reads. Here, we propose a novel algorithm, fi 1234567890():,; CIRIquant, for accurate circRNA quanti cation and differential expression analysis. By con- structing pseudo-circular reference for re-alignment of RNA-seq reads and employing sophisticated statistical models to correct RNase R treatment biases, CIRIquant can provide more accurate expression values for circRNAs with significantly reduced false discovery rate. We further develop a one-stop differential expression analysis pipeline implementing two independent measures, which helps unveil the regulation of competitive splicing between circRNAs and their linear counterparts. We apply CIRIquant to RNA-seq datasets of hepa- tocellular carcinoma, and characterize two important groups of linear-circular switching and circular transcript usage switching events, which demonstrate the promising ability to explore extensive transcriptomic changes in liver tumorigenesis. 1 Computational Genomics Lab, Beijing Institutes of Life Science, Chinese Academy of Sciences, 100101 Beijing, China. 2 University of Chinese Academy of Sciences, 100049 Beijing, China.
    [Show full text]
  • Population-Haplotype Models for Mapping and Tagging Structural Variation Using Whole Genome Sequencing
    Population-haplotype models for mapping and tagging structural variation using whole genome sequencing Eleni Loizidou Submitted in part fulfilment of the requirements for the degree of Doctor of Philosophy Section of Genomics of Common Disease Department of Medicine Imperial College London, 2018 1 Declaration of originality I hereby declare that the thesis submitted for a Doctor of Philosophy degree is based on my own work. Proper referencing is given to the organisations/cohorts I collaborated with during the project. 2 Copyright Declaration The copyright of this thesis rests with the author and is made available under a Creative Commons Attribution Non-Commercial No Derivatives licence. Researchers are free to copy, distribute or transmit the thesis on the condition that they attribute it, that they do not use it for commercial purposes and that they do not alter, transform or build upon it. For any reuse or redistribution, researchers must make clear to others the licence terms of this work 3 Abstract The scientific interest in copy number variation (CNV) is rapidly increasing, mainly due to the evidence of phenotypic effects and its contribution to disease susceptibility. Single nucleotide polymorphisms (SNPs) which are abundant in the human genome have been widely investigated in genome-wide association studies (GWAS). Despite the notable genomic effects both CNVs and SNPs have, the correlation between them has been relatively understudied. In the past decade, next generation sequencing (NGS) has been the leading high-throughput technology for investigating CNVs and offers mapping at a high-quality resolution. We created a map of NGS-defined CNVs tagged by SNPs using the 1000 Genomes Project phase 3 (1000G) sequencing data to examine patterns between the two types of variation in protein-coding genes.
    [Show full text]
  • Characterizing Genomic Duplication in Autism Spectrum Disorder by Edward James Higginbotham a Thesis Submitted in Conformity
    Characterizing Genomic Duplication in Autism Spectrum Disorder by Edward James Higginbotham A thesis submitted in conformity with the requirements for the degree of Master of Science Graduate Department of Molecular Genetics University of Toronto © Copyright by Edward James Higginbotham 2020 i Abstract Characterizing Genomic Duplication in Autism Spectrum Disorder Edward James Higginbotham Master of Science Graduate Department of Molecular Genetics University of Toronto 2020 Duplication, the gain of additional copies of genomic material relative to its ancestral diploid state is yet to achieve full appreciation for its role in human traits and disease. Challenges include accurately genotyping, annotating, and characterizing the properties of duplications, and resolving duplication mechanisms. Whole genome sequencing, in principle, should enable accurate detection of duplications in a single experiment. This thesis makes use of the technology to catalogue disease relevant duplications in the genomes of 2,739 individuals with Autism Spectrum Disorder (ASD) who enrolled in the Autism Speaks MSSNG Project. Fine-mapping the breakpoint junctions of 259 ASD-relevant duplications identified 34 (13.1%) variants with complex genomic structures as well as tandem (193/259, 74.5%) and NAHR- mediated (6/259, 2.3%) duplications. As whole genome sequencing-based studies expand in scale and reach, a continued focus on generating high-quality, standardized duplication data will be prerequisite to addressing their associated biological mechanisms. ii Acknowledgements I thank Dr. Stephen Scherer for his leadership par excellence, his generosity, and for giving me a chance. I am grateful for his investment and the opportunities afforded me, from which I have learned and benefited. I would next thank Drs.
    [Show full text]
  • Cell Atlas of Aqueous Humor Outflow Pathways in Eyes of Humans and Four Model Species Provides Insight Into Glaucoma Pathogenesis
    Cell atlas of aqueous humor outflow pathways in eyes of humans and four model species provides insight into glaucoma pathogenesis Tavé van Zyla,b,c,1,2, Wenjun Yanb,c,1, Alexi McAdamsa,b,c, Yi-Rong Pengb,c,3, Karthik Shekhard,e,4, Aviv Regevd,e,f, Dejan Juricg, and Joshua R. Sanesb,c,2 aDepartment of Ophthalmology, Harvard Medical School and Massachusetts Eye and Ear, Boston, MA 02114; bCenter for Brain Science, Harvard University, Cambridge, MA 02138; cDepartment of Molecular and Cellular Biology, Harvard University, Cambridge, MA 02138; dHoward Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA 02142; eKoch Institute of Integrative Cancer Research, Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02142; fKlarman Cell Observatory, Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA 02142; and gDepartment of Medicine, Harvard Medical School and Massachusetts General Hospital Cancer Center, Boston, MA 02114 Contributed by Joshua R. Sanes, March 10, 2020 (sent for review January 22, 2020; reviewed by Iok-Hou Pang and Joel S. Schuman) Increased intraocular pressure (IOP) represents a major risk factor juxtacanalicular tissue (JCT) prior to being conveyed into a for glaucoma, a prevalent eye disease characterized by death of specialized vessel called Schlemm canal (SC). From SC, AH exits retinal ganglion cells; lowering IOP is the only proven treatment the eye through a network of collector channels (CCs) continu- strategy to delay disease progression. The main determinant of ous with the venous system. The remaining AH exits the anterior IOP is the equilibrium between production and drainage of chamber via the uveoscleral pathway, draining through the in- aqueous humor, with compromised drainage generally viewed terstices of the ciliary muscle (CM) and ultimately exiting the eye as the primary contributor to dangerous IOP elevations.
    [Show full text]
  • MLIP (C-18): Sc-242224
    SAN TA C RUZ BI OTEC HNOL OG Y, INC . MLIP (C-18): sc-242224 BACKGROUND APPLICATIONS Making up nearly 6% of the human genome, chromosome 6 contains around MLIP (C-18) is recommended for detection of MLIP of human origin by 1,200 genes within 170 million base pairs of sequence. Deletion of a portion Western Blotting (starting dilution 1:200, dilution range 1:100-1:1000), of the q arm of chromosome 6 is associated with early onset intestinal can cer immunoprecipitation [1-2 µg per 100-500 µg of total protein (1 ml of cell suggesting the presence of a cancer susceptibility locus. Porphyria cutanea lysate)], immunofluorescence (starting dilution 1:50, dilution range 1:50- tarda is associated with chromosome 6 through the HFE gene which, when 1:500) and solid phase ELISA (starting dilution 1:30, dilution range 1:30- mutated, predisposes an individual to developing this porphyria. Notably, the 1:3000). PARK2 gene, which is associated with Parkinson’s disease, and the genes MLIP (C-18) is also recommended for detection of MLIP in additional species, encoding the major histocompatiblity complex proteins, which are key molecu - including canine. lar components of the immune system and determine predisposition to rheu - matic diseases, are also located on chromosome 6. Stickler syndrome, Suitable for use as control antibody for MLIP siRNA (h): sc-95453, MLIP 21- hydroxylase deficiency and maple syrup urine disease are also associated shRNA Plasmid (h): sc-95453-SH and MLIP shRNA (h) Lentiviral Particles: with genes on chromosome 6. A bipolar disorder susceptibility locus has been sc-95453-V.
    [Show full text]
  • Final Dissertation V5
    UCLA UCLA Electronic Theses and Dissertations Title Heart Failure Genetics in Mice and Men Permalink https://escholarship.org/uc/item/4c59k544 Author Wang, Jessica Jen-Chu Publication Date 2014 Peer reviewed|Thesis/dissertation eScholarship.org Powered by the California Digital Library University of California UNIVERSITY OF CALIFORNIA Los Angeles Heart Failure Genetics in Mice and Men A dissertation submitted in partial satisfaction of the requirements for the degree Doctor of Philosophy in Human Genetics by Jessica Jen-Chu Wang 2014 ABSTRACT OF THE DISSERTATION Heart Failure Genetics in Mice and Men by Jessica Jen-Chu Wang Doctor of Philosophy in Human Genetics University of California, Los Angeles, 2014 Professor Aldons Jake Lusis, Chair The genetics of heart failure is complex. In familial cases of cardiomyopathy, where mutations of large effects predominate in theory, genetic testing using a gene panel of up to 76 genes returned negative results in about half of the cases. In common forms of heart failure (HF), where a large number of genes with small to modest effects are expected to modify disease, only a few candidate genomic loci have been identified through genome-wide association (GWA) analyses in humans. We aimed to use exome sequencing to rapidly identify rare causal mutations in familial cardiomyopathy cases and effectively filter and classify the variants based on family pedigree and family member samples. We identified a number of existing variants and novel genes with great potential to be disease causing. On the other hand, we also set out to understand genetic factors that predispose to common late-onset forms of heart failure by performing GWA in the isoproterenol-induced HF model across the Hybrid Mouse Diversity Panel (HMDP) of 105 strains of mice.
    [Show full text]
  • MLIP Genotype As a Predictor of Pharmacological Response in Primary Open‑Angle Glaucoma and Ocular Hypertension María I
    www.nature.com/scientificreports OPEN MLIP genotype as a predictor of pharmacological response in primary open‑angle glaucoma and ocular hypertension María I. Canut1,8, Olaya Villa2,8, Bachar Kudsieh3, Heidi Mattlin2, Isabel Banchs2, Juan R. González4,5, Lluís Armengol2 & Ricardo P. Casaroli‑Marano6,7* Predicting the therapeutic response to ocular hypotensive drugs is crucial for the clinical treatment and management of glaucoma. Our aim was to identify a possible genetic contribution to the response to current pharmacological treatments of choice in a white Mediterranean population with primary open‑angle glaucoma (POAG) or ocular hypertension (OH). We conducted a prospective, controlled, randomized, partial crossover study that included 151 patients of both genders, aged 18 years and older, diagnosed with and requiring pharmacological treatment for POAG or OH in one or both eyes. We sought to identify copy number variants (CNVs) associated with diferences in pharmacological response, using a DNA pooling strategy of carefully phenotyped treatment responders and non‑responders, treated for a minimum of 6 weeks with a beta‑blocker (timolol maleate) and/or prostaglandin analog (latanoprost). Diurnal intraocular pressure reduction and comparative genome wide CNVs were analyzed. Our fnding that copy number alleles of an intronic portion of the MLIP gene is a predictor of pharmacological response to beta blockers and prostaglandin analogs could be used as a biomarker to guide frst‑tier POAG and OH treatment. Our fnding improves understanding of the genetic factors modulating pharmacological response in POAG and OH, and represents an important contribution to the establishment of a personalized approach to the treatment of glaucoma.
    [Show full text]
  • The Emerging Role of the RBM20 and PTBP1 Ribonucleoproteins in Heart Development and Cardiovascular Diseases
    G C A T T A C G G C A T genes Review The Emerging Role of the RBM20 and PTBP1 Ribonucleoproteins in Heart Development and Cardiovascular Diseases Stefania Fochi, Pamela Lorenzi, Marilisa Galasso, Chiara Stefani , Elisabetta Trabetti, Donato Zipeto and Maria Grazia Romanelli * Department of Neurosciences, Biomedicine and Movement Sciences, Section of Biology and Genetics, University of Verona, 37134 Verona, Italy; [email protected] (S.F.); [email protected] (P.L.); [email protected] (M.G.); [email protected] (C.S.); [email protected] (E.T.); [email protected] (D.Z.) * Correspondence: [email protected]; Tel.: +39-045-802-7182 Received: 9 March 2020; Accepted: 6 April 2020; Published: 8 April 2020 Abstract: Alternative splicing is a regulatory mechanism essential for cell differentiation and tissue organization. More than 90% of human genes are regulated by alternative splicing events, which participate in cell fate determination. The general mechanisms of splicing events are well known, whereas only recently have deep-sequencing, high throughput analyses and animal models provided novel information on the network of functionally coordinated, tissue-specific, alternatively spliced exons. Heart development and cardiac tissue differentiation require thoroughly regulated splicing events. The ribonucleoprotein RBM20 is a key regulator of the alternative splicing events required for functional and structural heart properties, such as the expression of TTN isoforms. Recently, the polypyrimidine tract-binding protein PTBP1 has been demonstrated to participate with RBM20 in regulating splicing events. In this review, we summarize the updated knowledge relative to RBM20 and PTBP1 structure and molecular function; their role in alternative splicing mechanisms involved in the heart development and function; RBM20 mutations associated with idiopathic dilated cardiovascular disease (DCM); and the consequences of RBM20-altered expression or dysfunction.
    [Show full text]
  • Discovery of Rare Variants Associated with Blood Pressure Regulation Through Meta- 2 Analysis of 1.3 Million Individuals 3 4 Praveen Surendran1,2,3,4,266, Elena V
    1 Discovery of rare variants associated with blood pressure regulation through meta- 2 analysis of 1.3 million individuals 3 4 Praveen Surendran1,2,3,4,266, Elena V. Feofanova5,266, Najim Lahrouchi6,7,8,266, Ioanna Ntalla9,266, Savita 5 Karthikeyan1,266, James Cook10, Lingyan Chen1, Borbala Mifsud9,11, Chen Yao12,13, Aldi T. Kraja14, James 6 H. Cartwright9, Jacklyn N. Hellwege15, Ayush Giri15,16, Vinicius Tragante17,18, Gudmar Thorleifsson18, 7 Dajiang J. Liu19, Bram P. Prins1, Isobel D. Stewart20, Claudia P. Cabrera9,21, James M. Eales22, Artur 8 Akbarov22, Paul L. Auer23, Lawrence F. Bielak24, Joshua C. Bis25, Vickie S. Braithwaite20,26,27, Jennifer A. 9 Brody25, E. Warwick Daw14, Helen R. Warren9,21, Fotios Drenos28,29, Sune Fallgaard Nielsen30, Jessica D. 10 Faul31, Eric B. Fauman32, Cristiano Fava33,34, Teresa Ferreira35, Christopher N. Foley1,36, Nora 11 Franceschini37, He Gao38,39, Olga Giannakopoulou9,40, Franco Giulianini41, Daniel F. Gudbjartsson18,42, 12 Xiuqing Guo43, Sarah E. Harris44,45, Aki S. Havulinna45,46, Anna Helgadottir18, Jennifer E. Huffman47, Shih- 13 Jen Hwang48,49, Stavroula Kanoni9,50, Jukka Kontto46, Martin G. Larson51,52, Ruifang Li-Gao53, Jaana 14 Lindström46, Luca A. Lotta20, Yingchang Lu54,55, Jian’an Luan20, Anubha Mahajan56,57, Giovanni Malerba58, 15 Nicholas G. D. Masca59,60, Hao Mei61, Cristina Menni62, Dennis O. Mook-Kanamori53,63, David Mosen- 16 Ansorena38, Martina Müller-Nurasyid64,65,66, Guillaume Paré67, Dirk S. Paul1,2,68, Markus Perola46,69, Alaitz 17 Poveda70, Rainer Rauramaa71,72, Melissa Richard73, Tom G. Richardson74, Nuno Sepúlveda75,76, Xueling 18 Sim77,78, Albert V. Smith79,80,81, Jennifer A.
    [Show full text]