Supplemental Materials for Genome and Transcriptome Sequencing of Lung Cancers Reveal Diverse Mutational and Splicing Events
Total Page:16
File Type:pdf, Size:1020Kb

Load more
Recommended publications
-
PARSANA-DISSERTATION-2020.Pdf
DECIPHERING TRANSCRIPTIONAL PATTERNS OF GENE REGULATION: A COMPUTATIONAL APPROACH by Princy Parsana A dissertation submitted to The Johns Hopkins University in conformity with the requirements for the degree of Doctor of Philosophy Baltimore, Maryland July, 2020 © 2020 Princy Parsana All rights reserved Abstract With rapid advancements in sequencing technology, we now have the ability to sequence the entire human genome, and to quantify expression of tens of thousands of genes from hundreds of individuals. This provides an extraordinary opportunity to learn phenotype relevant genomic patterns that can improve our understanding of molecular and cellular processes underlying a trait. The high dimensional nature of genomic data presents a range of computational and statistical challenges. This dissertation presents a compilation of projects that were driven by the motivation to efficiently capture gene regulatory patterns in the human transcriptome, while addressing statistical and computational challenges that accompany this data. We attempt to address two major difficulties in this domain: a) artifacts and noise in transcriptomic data, andb) limited statistical power. First, we present our work on investigating the effect of artifactual variation in gene expression data and its impact on trans-eQTL discovery. Here we performed an in-depth analysis of diverse pre-recorded covariates and latent confounders to understand their contribution to heterogeneity in gene expression measurements. Next, we discovered 673 trans-eQTLs across 16 human tissues using v6 data from the Genotype Tissue Expression (GTEx) project. Finally, we characterized two trait-associated trans-eQTLs; one in Skeletal Muscle and another in Thyroid. Second, we present a principal component based residualization method to correct gene expression measurements prior to reconstruction of co-expression networks. -
Distinguishing Pleiotropy from Linked QTL Between Milk Production Traits
Cai et al. Genet Sel Evol (2020) 52:19 https://doi.org/10.1186/s12711-020-00538-6 Genetics Selection Evolution RESEARCH ARTICLE Open Access Distinguishing pleiotropy from linked QTL between milk production traits and mastitis resistance in Nordic Holstein cattle Zexi Cai1*†, Magdalena Dusza2†, Bernt Guldbrandtsen1, Mogens Sandø Lund1 and Goutam Sahana1 Abstract Background: Production and health traits are central in cattle breeding. Advances in next-generation sequencing technologies and genotype imputation have increased the resolution of gene mapping based on genome-wide association studies (GWAS). Thus, numerous candidate genes that afect milk yield, milk composition, and mastitis resistance in dairy cattle are reported in the literature. Efect-bearing variants often afect multiple traits. Because the detection of overlapping quantitative trait loci (QTL) regions from single-trait GWAS is too inaccurate and subjective, multi-trait analysis is a better approach to detect pleiotropic efects of variants in candidate genes. However, large sample sizes are required to achieve sufcient power. Multi-trait meta-analysis is one approach to deal with this prob- lem. Thus, we performed two multi-trait meta-analyses, one for three milk production traits (milk yield, protein yield and fat yield), and one for milk yield and mastitis resistance. Results: For highly correlated traits, the power to detect pleiotropy was increased by multi-trait meta-analysis com- pared with the subjective assessment of overlapping of single-trait QTL confdence intervals. Pleiotropic efects of lead single nucleotide polymorphisms (SNPs) that were detected from the multi-trait meta-analysis were confrmed by bivariate association analysis. The previously reported pleiotropic efects of variants within the DGAT1 and MGST1 genes on three milk production traits, and pleiotropic efects of variants in GHR on milk yield and fat yield were con- frmed. -
Noelia Díaz Blanco
Effects of environmental factors on the gonadal transcriptome of European sea bass (Dicentrarchus labrax), juvenile growth and sex ratios Noelia Díaz Blanco Ph.D. thesis 2014 Submitted in partial fulfillment of the requirements for the Ph.D. degree from the Universitat Pompeu Fabra (UPF). This work has been carried out at the Group of Biology of Reproduction (GBR), at the Department of Renewable Marine Resources of the Institute of Marine Sciences (ICM-CSIC). Thesis supervisor: Dr. Francesc Piferrer Professor d’Investigació Institut de Ciències del Mar (ICM-CSIC) i ii A mis padres A Xavi iii iv Acknowledgements This thesis has been made possible by the support of many people who in one way or another, many times unknowingly, gave me the strength to overcome this "long and winding road". First of all, I would like to thank my supervisor, Dr. Francesc Piferrer, for his patience, guidance and wise advice throughout all this Ph.D. experience. But above all, for the trust he placed on me almost seven years ago when he offered me the opportunity to be part of his team. Thanks also for teaching me how to question always everything, for sharing with me your enthusiasm for science and for giving me the opportunity of learning from you by participating in many projects, collaborations and scientific meetings. I am also thankful to my colleagues (former and present Group of Biology of Reproduction members) for your support and encouragement throughout this journey. To the “exGBRs”, thanks for helping me with my first steps into this world. Working as an undergrad with you Dr. -
Greg's Awesome Thesis
Analysis of alignment error and sitewise constraint in mammalian comparative genomics Gregory Jordan European Bioinformatics Institute University of Cambridge A dissertation submitted for the degree of Doctor of Philosophy November 30, 2011 To my parents, who kept us thinking and playing This dissertation is the result of my own work and includes nothing which is the out- come of work done in collaboration except where specifically indicated in the text and acknowledgements. This dissertation is not substantially the same as any I have submitted for a degree, diploma or other qualification at any other university, and no part has already been, or is currently being submitted for any degree, diploma or other qualification. This dissertation does not exceed the specified length limit of 60,000 words as defined by the Biology Degree Committee. November 30, 2011 Gregory Jordan ii Analysis of alignment error and sitewise constraint in mammalian comparative genomics Summary Gregory Jordan November 30, 2011 Darwin College Insight into the evolution of protein-coding genes can be gained from the use of phylogenetic codon models. Recently sequenced mammalian genomes and powerful analysis methods developed over the past decade provide the potential to globally measure the impact of natural selection on pro- tein sequences at a fine scale. The detection of positive selection in particular is of great interest, with relevance to the study of host-parasite conflicts, immune system evolution and adaptive dif- ferences between species. This thesis examines the performance of methods for detecting positive selection first with a series of simulation experiments, and then with two empirical studies in mammals and primates. -
Genomic Study of RNA Polymerase II and III Snapc-Bound Promoters Reveals a Gene Transcribed by Both Enzymes and a Broad Use of Common Activators
Genomic Study of RNA Polymerase II and III SNAPc-Bound Promoters Reveals a Gene Transcribed by Both Enzymes and a Broad Use of Common Activators Nicole James Faresse1., Donatella Canella1., Viviane Praz1,2, Joe¨lle Michaud1¤, David Romascano1, Nouria Hernandez1* 1 Center for Integrative Genomics, Faculty of Biology and Medicine, University of Lausanne, Lausanne, Switzerland, 2 Swiss Institute of Bioinformatics, Lausanne, Switzerland Abstract SNAPc is one of a few basal transcription factors used by both RNA polymerase (pol) II and pol III. To define the set of active SNAPc-dependent promoters in human cells, we have localized genome-wide four SNAPc subunits, GTF2B (TFIIB), BRF2, pol II, and pol III. Among some seventy loci occupied by SNAPc and other factors, including pol II snRNA genes, pol III genes with type 3 promoters, and a few un-annotated loci, most are primarily occupied by either pol II and GTF2B, or pol III and BRF2. A notable exception is the RPPH1 gene, which is occupied by significant amounts of both polymerases. We show that the large majority of SNAPc-dependent promoters recruit POU2F1 and/or ZNF143 on their enhancer region, and a subset also recruits GABP, a factor newly implicated in SNAPc-dependent transcription. These activators associate with pol II and III promoters in G1 slightly before the polymerase, and ZNF143 is required for efficient transcription initiation complex assembly. The results characterize a set of genes with unique properties and establish that polymerase specificity is not absolute in vivo. Citation: James Faresse N, Canella D, Praz V, Michaud J, Romascano D, et al. -
Mapping of a Chromosome 12 Region Associated with Airway Hyperresponsiveness in a Recombinant Congenic Mouse Strain and Selectio
Mapping of a Chromosome 12 Region Associated with Airway Hyperresponsiveness in a Recombinant Congenic Mouse Strain and Selection of Potential Candidate Genes by Expression and Sequence Variation Analyses Cynthia Kanagaratham1*, Rafael Marino2, Pierre Camateros2, John Ren3, Daniel Houle4, Robert Sladek1,2,5, Silvia M. Vidal1,3, Danuta Radzioch1,2 1 Department of Human Genetics, McGill University, Montreal, Quebec, Canada, 2 Faculty of Medicine, Division of Experimental Medicine, McGill University, Montreal, Quebec, Canada, 3 Department of Microbiology and Immunology, McGill University, Montreal, Quebec, Canada, 4 Research Institute of the McGill University Health Center, Montreal, Quebec, Canada, 5 McGill University and Genome Quebec Innovation Centre, Montreal, Quebec, Canada Abstract In a previous study we determined that BcA86 mice, a strain belonging to a panel of AcB/BcA recombinant congenic strains, have an airway responsiveness phenotype resembling mice from the airway hyperresponsive A/J strain. The majority of the BcA86 genome is however from the hyporesponsive C57BL/6J strain. The aim of this study was to identify candidate regions and genes associated with airway hyperresponsiveness (AHR) by quantitative trait locus (QTL) analysis using the BcA86 strain. Airway responsiveness of 205 F2 mice generated from backcrossing BcA86 strain to C57BL/6J strain was measured and used for QTL analysis to identify genomic regions in linkage with AHR. Consomic mice for the QTL containing chromosomes were phenotyped to study the contribution of each chromosome to lung responsiveness. Candidate genes within the QTL were selected based on expression differences in mRNA from whole lungs, and the presence of coding non- synonymous mutations that were predicted to have a functional effect by amino acid substitution prediction tools. -
Characterizing Genomic Duplication in Autism Spectrum Disorder by Edward James Higginbotham a Thesis Submitted in Conformity
Characterizing Genomic Duplication in Autism Spectrum Disorder by Edward James Higginbotham A thesis submitted in conformity with the requirements for the degree of Master of Science Graduate Department of Molecular Genetics University of Toronto © Copyright by Edward James Higginbotham 2020 i Abstract Characterizing Genomic Duplication in Autism Spectrum Disorder Edward James Higginbotham Master of Science Graduate Department of Molecular Genetics University of Toronto 2020 Duplication, the gain of additional copies of genomic material relative to its ancestral diploid state is yet to achieve full appreciation for its role in human traits and disease. Challenges include accurately genotyping, annotating, and characterizing the properties of duplications, and resolving duplication mechanisms. Whole genome sequencing, in principle, should enable accurate detection of duplications in a single experiment. This thesis makes use of the technology to catalogue disease relevant duplications in the genomes of 2,739 individuals with Autism Spectrum Disorder (ASD) who enrolled in the Autism Speaks MSSNG Project. Fine-mapping the breakpoint junctions of 259 ASD-relevant duplications identified 34 (13.1%) variants with complex genomic structures as well as tandem (193/259, 74.5%) and NAHR- mediated (6/259, 2.3%) duplications. As whole genome sequencing-based studies expand in scale and reach, a continued focus on generating high-quality, standardized duplication data will be prerequisite to addressing their associated biological mechanisms. ii Acknowledgements I thank Dr. Stephen Scherer for his leadership par excellence, his generosity, and for giving me a chance. I am grateful for his investment and the opportunities afforded me, from which I have learned and benefited. I would next thank Drs. -
Network-Based Method for Drug Target Discovery at the Isoform Level
www.nature.com/scientificreports OPEN Network-based method for drug target discovery at the isoform level Received: 20 November 2018 Jun Ma1,2, Jenny Wang2, Laleh Soltan Ghoraie2, Xin Men3, Linna Liu4 & Penggao Dai 1 Accepted: 6 September 2019 Identifcation of primary targets associated with phenotypes can facilitate exploration of the underlying Published: xx xx xxxx molecular mechanisms of compounds and optimization of the structures of promising drugs. However, the literature reports limited efort to identify the target major isoform of a single known target gene. The majority of genes generate multiple transcripts that are translated into proteins that may carry out distinct and even opposing biological functions through alternative splicing. In addition, isoform expression is dynamic and varies depending on the developmental stage and cell type. To identify target major isoforms, we integrated a breast cancer type-specifc isoform coexpression network with gene perturbation signatures in the MCF7 cell line in the Connectivity Map database using the ‘shortest path’ drug target prioritization method. We used a leukemia cancer network and diferential expression data for drugs in the HL-60 cell line to test the robustness of the detection algorithm for target major isoforms. We further analyzed the properties of target major isoforms for each multi-isoform gene using pharmacogenomic datasets, proteomic data and the principal isoforms defned by the APPRIS and STRING datasets. Then, we tested our predictions for the most promising target major protein isoforms of DNMT1, MGEA5 and P4HB4 based on expression data and topological features in the coexpression network. Interestingly, these isoforms are not annotated as principal isoforms in APPRIS. -
Promoter-Specific Dynamics of TATA-Binding Protein Association with the Human Genome
Downloaded from genome.cshlp.org on October 2, 2021 - Published by Cold Spring Harbor Laboratory Press Research Promoter-specific dynamics of TATA-binding protein association with the human genome Yuko Hasegawa1,2 and Kevin Struhl1 1Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, Boston, Massachusetts 02115, USA; 2Department of Human Genetics, University of Chicago, Chicago, Illinois 60637, USA Transcription factor binding to target sites in vivo is a dynamic process that involves cycles of association and dissociation, with individual proteins differing in their binding dynamics. The dynamics at individual sites on a genomic scale have been investigated in yeast cells, but comparable experiments have not been done in multicellular eukaryotes. Here, we describe a tamoxifen-inducible, time-course ChIP-seq approach to measure transcription factor binding dynamics at target sites throughout the human genome. As observed in yeast cells, the TATA-binding protein (TBP) typically displays rapid turn- over at RNA polymerase (Pol) II-transcribed promoters, slow turnover at Pol III promoters, and very slow turnover at the Pol I promoter. Turnover rates vary widely among Pol II promoters in a manner that does not correlate with the level of TBP occupancy. Human Pol II promoters with slow TBP dissociation preferentially contain a TATA consensus motif, support high transcriptional activity of downstream genes, and are linked with specific activators and chromatin remodelers. These properties of human promoters with slow TBP turnover differ from those of yeast promoters with slow turnover. These observations suggest that TBP binding dynamics differentially affect promoter function and gene expression, possibly at the level of transcriptional reinitiation/bursting. -
Cryo-EM Structures of Human RNA Polymerase III in Its Unbound and Transcribing States
bioRxiv preprint doi: https://doi.org/10.1101/2020.06.29.177642; this version posted June 29, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. Cryo-EM structures of human RNA polymerase III in its unbound and transcribing states Mathias Girbig1,3,4, Agata D. Misiaszek1,3,4, Matthias K. Vorländer1, Aleix Lafita2, Helga Grötsch1, Florence Baudin1, Alex Bateman2, Christoph W. Müller1* 1European Molecular Biology Laboratory (EMBL), Structural and Computational Biology Unit, Meyerhofstrasse 1, 69117 Heidelberg, Germany. 2European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK. 3Candidate for joint PhD degree from EMBL and Heidelberg University, Faculty of Biosciences, 69120 Heidelberg, Germany. 4These authors contributed equally: Mathias Girbig, Agata D. Misiaszek. * Correspondence to CWM ([email protected]) 1 bioRxiv preprint doi: https://doi.org/10.1101/2020.06.29.177642; this version posted June 29, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. ABSTRACT RNA polymerase III (Pol III) synthesises tRNAs and other short, essential RNAs. Human Pol III misregulation is linked to tumour transformation, neurodegenerative and developmental disorders, and increased sensitivity to viral infections. Pol III inhibition increases longevity in different animals but also promotes intracellular bacterial growth owing to its role in the immune system. -
Supplementary Table 1 Double Treatment Vs Single Treatment
Supplementary table 1 Double treatment vs single treatment Probe ID Symbol Gene name P value Fold change TC0500007292.hg.1 NIM1K NIM1 serine/threonine protein kinase 1.05E-04 5.02 HTA2-neg-47424007_st NA NA 3.44E-03 4.11 HTA2-pos-3475282_st NA NA 3.30E-03 3.24 TC0X00007013.hg.1 MPC1L mitochondrial pyruvate carrier 1-like 5.22E-03 3.21 TC0200010447.hg.1 CASP8 caspase 8, apoptosis-related cysteine peptidase 3.54E-03 2.46 TC0400008390.hg.1 LRIT3 leucine-rich repeat, immunoglobulin-like and transmembrane domains 3 1.86E-03 2.41 TC1700011905.hg.1 DNAH17 dynein, axonemal, heavy chain 17 1.81E-04 2.40 TC0600012064.hg.1 GCM1 glial cells missing homolog 1 (Drosophila) 2.81E-03 2.39 TC0100015789.hg.1 POGZ Transcript Identified by AceView, Entrez Gene ID(s) 23126 3.64E-04 2.38 TC1300010039.hg.1 NEK5 NIMA-related kinase 5 3.39E-03 2.36 TC0900008222.hg.1 STX17 syntaxin 17 1.08E-03 2.29 TC1700012355.hg.1 KRBA2 KRAB-A domain containing 2 5.98E-03 2.28 HTA2-neg-47424044_st NA NA 5.94E-03 2.24 HTA2-neg-47424360_st NA NA 2.12E-03 2.22 TC0800010802.hg.1 C8orf89 chromosome 8 open reading frame 89 6.51E-04 2.20 TC1500010745.hg.1 POLR2M polymerase (RNA) II (DNA directed) polypeptide M 5.19E-03 2.20 TC1500007409.hg.1 GCNT3 glucosaminyl (N-acetyl) transferase 3, mucin type 6.48E-03 2.17 TC2200007132.hg.1 RFPL3 ret finger protein-like 3 5.91E-05 2.17 HTA2-neg-47424024_st NA NA 2.45E-03 2.16 TC0200010474.hg.1 KIAA2012 KIAA2012 5.20E-03 2.16 TC1100007216.hg.1 PRRG4 proline rich Gla (G-carboxyglutamic acid) 4 (transmembrane) 7.43E-03 2.15 TC0400012977.hg.1 SH3D19 -
Program in Human Neutrophils Fails To
Downloaded from http://www.jimmunol.org/ by guest on September 25, 2021 is online at: average * The Journal of Immunology Anaplasma phagocytophilum , 20 of which you can access for free at: 2005; 174:6364-6372; ; from submission to initial decision 4 weeks from acceptance to publication J Immunol doi: 10.4049/jimmunol.174.10.6364 http://www.jimmunol.org/content/174/10/6364 Insights into Pathogen Immune Evasion Mechanisms: Fails to Induce an Apoptosis Differentiation Program in Human Neutrophils Dori L. Borjesson, Scott D. Kobayashi, Adeline R. Whitney, Jovanka M. Voyich, Cynthia M. Argue and Frank R. DeLeo cites 28 articles Submit online. Every submission reviewed by practicing scientists ? is published twice each month by Receive free email-alerts when new articles cite this article. Sign up at: http://jimmunol.org/alerts http://jimmunol.org/subscription Submit copyright permission requests at: http://www.aai.org/About/Publications/JI/copyright.html http://www.jimmunol.org/content/suppl/2005/05/03/174.10.6364.DC1 This article http://www.jimmunol.org/content/174/10/6364.full#ref-list-1 Information about subscribing to The JI No Triage! Fast Publication! Rapid Reviews! 30 days* • Why • • Material References Permissions Email Alerts Subscription Supplementary The Journal of Immunology The American Association of Immunologists, Inc., 1451 Rockville Pike, Suite 650, Rockville, MD 20852 Copyright © 2005 by The American Association of Immunologists All rights reserved. Print ISSN: 0022-1767 Online ISSN: 1550-6606. This information is current as of September 25, 2021. The Journal of Immunology Insights into Pathogen Immune Evasion Mechanisms: Anaplasma phagocytophilum Fails to Induce an Apoptosis Differentiation Program in Human Neutrophils1 Dori L.