Arxiv:2108.07396V1 [Cs.LG] 17 Aug 2021
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
PARSANA-DISSERTATION-2020.Pdf
DECIPHERING TRANSCRIPTIONAL PATTERNS OF GENE REGULATION: A COMPUTATIONAL APPROACH by Princy Parsana A dissertation submitted to The Johns Hopkins University in conformity with the requirements for the degree of Doctor of Philosophy Baltimore, Maryland July, 2020 © 2020 Princy Parsana All rights reserved Abstract With rapid advancements in sequencing technology, we now have the ability to sequence the entire human genome, and to quantify expression of tens of thousands of genes from hundreds of individuals. This provides an extraordinary opportunity to learn phenotype relevant genomic patterns that can improve our understanding of molecular and cellular processes underlying a trait. The high dimensional nature of genomic data presents a range of computational and statistical challenges. This dissertation presents a compilation of projects that were driven by the motivation to efficiently capture gene regulatory patterns in the human transcriptome, while addressing statistical and computational challenges that accompany this data. We attempt to address two major difficulties in this domain: a) artifacts and noise in transcriptomic data, andb) limited statistical power. First, we present our work on investigating the effect of artifactual variation in gene expression data and its impact on trans-eQTL discovery. Here we performed an in-depth analysis of diverse pre-recorded covariates and latent confounders to understand their contribution to heterogeneity in gene expression measurements. Next, we discovered 673 trans-eQTLs across 16 human tissues using v6 data from the Genotype Tissue Expression (GTEx) project. Finally, we characterized two trait-associated trans-eQTLs; one in Skeletal Muscle and another in Thyroid. Second, we present a principal component based residualization method to correct gene expression measurements prior to reconstruction of co-expression networks. -
MUC4/MUC16/Muc20high Signature As a Marker of Poor Prognostic for Pancreatic, Colon and Stomach Cancers
Jonckheere and Van Seuningen J Transl Med (2018) 16:259 https://doi.org/10.1186/s12967-018-1632-2 Journal of Translational Medicine RESEARCH Open Access Integrative analysis of the cancer genome atlas and cancer cell lines encyclopedia large‑scale genomic databases: MUC4/MUC16/ MUC20 signature is associated with poor survival in human carcinomas Nicolas Jonckheere* and Isabelle Van Seuningen* Abstract Background: MUC4 is a membrane-bound mucin that promotes carcinogenetic progression and is often proposed as a promising biomarker for various carcinomas. In this manuscript, we analyzed large scale genomic datasets in order to evaluate MUC4 expression, identify genes that are correlated with MUC4 and propose new signatures as a prognostic marker of epithelial cancers. Methods: Using cBioportal or SurvExpress tools, we studied MUC4 expression in large-scale genomic public datasets of human cancer (the cancer genome atlas, TCGA) and cancer cell line encyclopedia (CCLE). Results: We identifed 187 co-expressed genes for which the expression is correlated with MUC4 expression. Gene ontology analysis showed they are notably involved in cell adhesion, cell–cell junctions, glycosylation and cell signal- ing. In addition, we showed that MUC4 expression is correlated with MUC16 and MUC20, two other membrane-bound mucins. We showed that MUC4 expression is associated with a poorer overall survival in TCGA cancers with diferent localizations including pancreatic cancer, bladder cancer, colon cancer, lung adenocarcinoma, lung squamous adeno- carcinoma, skin cancer and stomach cancer. We showed that the combination of MUC4, MUC16 and MUC20 signature is associated with statistically signifcant reduced overall survival and increased hazard ratio in pancreatic, colon and stomach cancer. -
Investigation of the Underlying Hub Genes and Molexular Pathogensis in Gastric Cancer by Integrated Bioinformatic Analyses
bioRxiv preprint doi: https://doi.org/10.1101/2020.12.20.423656; this version posted December 22, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. Investigation of the underlying hub genes and molexular pathogensis in gastric cancer by integrated bioinformatic analyses Basavaraj Vastrad1, Chanabasayya Vastrad*2 1. Department of Biochemistry, Basaveshwar College of Pharmacy, Gadag, Karnataka 582103, India. 2. Biostatistics and Bioinformatics, Chanabasava Nilaya, Bharthinagar, Dharwad 580001, Karanataka, India. * Chanabasayya Vastrad [email protected] Ph: +919480073398 Chanabasava Nilaya, Bharthinagar, Dharwad 580001 , Karanataka, India bioRxiv preprint doi: https://doi.org/10.1101/2020.12.20.423656; this version posted December 22, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. Abstract The high mortality rate of gastric cancer (GC) is in part due to the absence of initial disclosure of its biomarkers. The recognition of important genes associated in GC is therefore recommended to advance clinical prognosis, diagnosis and and treatment outcomes. The current investigation used the microarray dataset GSE113255 RNA seq data from the Gene Expression Omnibus database to diagnose differentially expressed genes (DEGs). Pathway and gene ontology enrichment analyses were performed, and a proteinprotein interaction network, modules, target genes - miRNA regulatory network and target genes - TF regulatory network were constructed and analyzed. Finally, validation of hub genes was performed. The 1008 DEGs identified consisted of 505 up regulated genes and 503 down regulated genes. -
Neurological Disorders, Genetic Correlations, and the Role of Exome Sequencing
Journal of Translational Science Review Article ISSN: 2059-268X Neurological disorders, genetic correlations, and the role of exome sequencing Tony L Brown1* and Theresa M Meloche2 1Columbia University, USA 2Advanced Research and Human Development Institute, USA Abstract Genomic information access and utilization by researchers and clinicians have barely begun the journey for fulfillment of their full potential in the research and clinical arenas. Exciting is the potential depth and breadth of research, clinical applications, and more personalized medicine, that remain on the horizon. Exome sequencing has clarified the responsibilities of over 130 genes, greatly expanding the medical genetics database and enabling the development of orphan disease- based pharmaceuticals. The focus of our research efforts was to review several literature sources related to rare genomic disease research and exome sequencing, as well as to review the new research and diagnostic strategies that were utilized. Using a systems approach, under discussion are neurology, neuropathy, and the central nervous and musculoskeletal systems. Also discussed will be the topics of inborn errors of metabolism, and the genetic targets related to developmental delay. Recommendations for future research will also be discussed. Exome sequencing neuronal ceroid lipofuscinoses the most common group of inherited neurological degenerative disorders [6]. Whether examining the A review of new strategies for rare genomic disease research mitochondrial defect implicated in prenatal ventriculomegaly -
Gene Promoter and Exon DNA Methylation Changes in Colon
Molnár et al. BMC Cancer (2018) 18:695 https://doi.org/10.1186/s12885-018-4609-x RESEARCH ARTICLE Open Access Gene promoter and exon DNA methylation changes in colon cancer development – mRNA expression and tumor mutation alterations Béla Molnár1,2*†, Orsolya Galamb1†, Bálint Péterfia2, Barnabás Wichmann1, István Csabai3, András Bodor3,4, Alexandra Kalmár1, Krisztina Andrea Szigeti2, Barbara Kinga Barták2, Zsófia Brigitta Nagy2, Gábor Valcz1, Árpád V. Patai2, Péter Igaz1,2 and Zsolt Tulassay1,2 Abstract Background: DNA mutations occur randomly and sporadically in growth-related genes, mostly on cytosines. Demethylation of cytosines may lead to genetic instability through spontaneous deamination. Aims were whole genome methylation and targeted mutation analysis of colorectal cancer (CRC)-related genes and mRNA expression analysis of TP53 pathway genes. Methods: Long interspersed nuclear element-1 (LINE-1) BS-PCR followed by pyrosequencing was performed for the estimation of global DNA metlyation levels along the colorectal normal-adenoma-carcinoma sequence. Methyl capture sequencing was done on 6 normal adjacent (NAT), 15 adenomatous (AD) and 9 CRC tissues. Overall quantitative methylation analysis, selection of top hyper/hypomethylated genes, methylation analysis on mutation regions and TP53 pathway gene promoters were performed. Mutations of 12 CRC-related genes (APC, BRAF, CTNNB1, EGFR, FBXW7, KRAS, NRAS, MSH6, PIK3CA, SMAD2, SMAD4, TP53) were evaluated. mRNA expression of TP53 pathway genes was also analyzed. Results: According to the LINE-1 methylation results, overall hypomethylation was observed along the normal- adenoma-carcinoma sequence. Within top50 differential methylated regions (DMRs), in AD-N comparison TP73, NGFR, PDGFRA genes were hypermethylated, FMN1, SLC16A7 genes were hypomethylated. -
Cytogenetic Analysis of a Pseudoangiomatous Pleomorphic/Spindle Cell Lipoma
ANTICANCER RESEARCH 37 : 2219-2223 (2017) doi:10.21873/anticanres.11557 Cytogenetic Analysis of a Pseudoangiomatous Pleomorphic/Spindle Cell Lipoma IOANNIS PANAGOPOULOS 1, LUDMILA GORUNOVA 1, INGVILD LOBMAIER 2, HEGE KILEN ANDERSEN 1, BODIL BJERKEHAGEN 2 and SVERRE HEIM 1,3 1Section for Cancer Cytogenetics, Institute for Cancer Genetics and Informatics, The Norwegian Radium Hospital, Oslo University Hospital, Oslo, Norway; 2Department of Pathology, The Norwegian Radium Hospital, Oslo University Hospital, Oslo, Norway; 3Faculty of Medicine, University of Oslo, Oslo, Norway Abstract. Background: Pseudoangiomatous pleomorphic/ appearance’ (1). To date, only 20 patients have been described spindle cell lipoma is a rare subtype of pleomorphic/spindle in the literature with this diagnosis, 15 of whom were males cell lipoma. Only approximately 20 such tumors have been (1-10). The pseudoangiomatous pleomorphic/spindle cell described. Genetic information on pseudoangiomatous lipomas were mostly found in the neck (seven patients) and pleomorphic/spindle cell lipoma is restricted to a single case shoulders (four patients), but have also been seen in the in which deletion of the forkhead box O1 (FOXO1) gene was cheek, chest, chin, elbow, finger, subscapular region, and found, using fluorescence in situ hybridization (FISH). thumb. Genetic information on pseudoangiomatous Materials and Methods: G-banding and FISH analyses were pleomorphic/ spindle cell lipoma is restricted to one case only performed on a pseudoangiomatous pleomorphic/spindle cell (8) in which fluorescence in situ hybridization (FISH) with a lipoma. Results: G-banding of tumor cells showed complex probe for the forkhead box O1 ( FOXO1 ) gene, which maps karyotypic changes including loss of chromosome 13. FISH to chromosome sub-band 13q14.11, showed a signal pattern analysis revealed that the deleted region contained the RB1 indicating monoallelic loss of the gene in 57% of the gene (13q14.2) and the part of chromosome arm 13q (q14.2- examined cells. -
ECM Alterations in Fndc3a (Fibronectin Domain Containing
www.nature.com/scientificreports OPEN ECM alterations in Fndc3a (Fibronectin Domain Containing Protein 3A) defcient zebrafsh Received: 22 April 2019 Accepted: 5 September 2019 cause temporal fn development Published: xx xx xxxx and regeneration defects Daniel Liedtke 1, Melanie Orth1, Michelle Meissler1, Sinje Geuer2,3, Sabine Knaup1, Isabell Köblitz1,4 & Eva Klopocki 1 Fin development and regeneration are complex biological processes that are highly relevant in teleost fsh. They share genetic factors, signaling pathways and cellular properties to coordinate formation of regularly shaped extremities. Especially correct tissue structure defned by extracellular matrix (ECM) formation is essential. Gene expression and protein localization studies demonstrated expression of fndc3a (fbronectin domain containing protein 3a) in both developing and regenerating caudal fns of zebrafsh (Danio rerio). We established a hypomorphic fndc3a mutant line (fndc3awue1/wue1) via CRISPR/ Cas9, exhibiting phenotypic malformations and changed gene expression patterns during early stages of median fn fold development. These developmental efects are mostly temporary, but result in a fraction of adults with permanent tail fn deformations. In addition, caudal fn regeneration in adult fndc3awue1/wue1 mutants is hampered by interference with actinotrichia formation and epidermal cell organization. Investigation of the ECM implies that loss of epidermal tissue structure is a common cause for both of the observed defects. Our results thereby provide a molecular link between these developmental processes and foreshadow Fndc3a as a novel temporal regulator of epidermal cell properties during extremity development and regeneration in zebrafsh. A wide number of conserved genetic and structural features have been identifed regulating fn development in ray fnned fsh species, like zebrafsh (Danio rerio), and imply shared mechanisms throughout evolution1. -
Nicotinic Receptors in Sleep-Related Hypermotor Epilepsy: Pathophysiology and Pharmacology
brain sciences Review Nicotinic Receptors in Sleep-Related Hypermotor Epilepsy: Pathophysiology and Pharmacology Andrea Becchetti 1,* , Laura Clara Grandi 1 , Giulia Colombo 1 , Simone Meneghini 1 and Alida Amadeo 2 1 Department of Biotechnology and Biosciences, University of Milano-Bicocca, 20126 Milano, Italy; [email protected] (L.C.G.); [email protected] (G.C.); [email protected] (S.M.) 2 Department of Biosciences, University of Milano, 20133 Milano, Italy; [email protected] * Correspondence: [email protected] Received: 13 October 2020; Accepted: 21 November 2020; Published: 25 November 2020 Abstract: Sleep-related hypermotor epilepsy (SHE) is characterized by hyperkinetic focal seizures, mainly arising in the neocortex during non-rapid eye movements (NREM) sleep. The familial form is autosomal dominant SHE (ADSHE), which can be caused by mutations in genes encoding subunits of the neuronal nicotinic acetylcholine receptor (nAChR), Na+-gated K+ channels, as well as non-channel signaling proteins, such as components of the gap activity toward rags 1 (GATOR1) macromolecular complex. The causative genes may have different roles in developing and mature brains. Under this respect, nicotinic receptors are paradigmatic, as different pathophysiological roles are exerted by distinct nAChR subunits in adult and developing brains. The widest evidence concerns α4 and β2 subunits. These participate in heteromeric nAChRs that are major modulators of excitability in mature neocortical circuits as well as regulate postnatal synaptogenesis. However, growing evidence implicates mutant α2 subunits in ADSHE, which poses interpretive difficulties as very little is known about the function of α2-containing (α2*) nAChRs in the human brain. -
Genome-Wide Gene Expression Profiles of Ovarian Carcinoma: Identification of Molecular Targets for the Treatment of Ovarian Carcinoma
365-384 30/3/2009 09:55 Ì Page 365 MOLECULAR MEDICINE REPORTS 2: 365-384, 2009 365 Genome-wide gene expression profiles of ovarian carcinoma: Identification of molecular targets for the treatment of ovarian carcinoma DRAGOMIRA NIKOLAEVA NIKOLOVA1,4, NIKOLAI DOGANOV2, RUMEN DIMITROV2, KRASIMIR ANGELOV3, SIEW-KEE LOW1, IVANKA DIMOVA4, DRAGA TONCHEVA4, YUSUKE NAKAMURA1 and HITOSHI ZEMBUTSU1 1Laboratory of Molecular Medicine, Human Genome Center, Institute of Medical Science, The University of Tokyo, Tokyo 108-8639, Japan; 2University Hospital of Obstetrics and Gynecology ‘Maichin Dom’, Sofia 1431; 3National Hospital of Oncology, Sofia 1233; 4Department of Medical Genetics, Medical University of Sofia, Sofia 1431, Bulgaria Received October 31, 2008; Accepted January 7, 2009 DOI: 10.3892/mmr_00000109 Abstract. This study aimed to clarify the molecular mecha- in women. As there are no specific indicators or symptoms of nisms involved in ovarian carcinogenesis, and to identify ovarian cancer during the early stages of the disease, the candidate molecular targets for its diagnosis and treatment. majority of patients with epithelial ovarian cancer (EOC) are The genome-wide gene expression profiles of 22 epithelial diagnosed at an advanced stage, with involvement of other ovarian carcinomas were analyzed with a microarray represent- sites such as the upper abdomen, pleural space and paraaortic ing 38,500 genes, in combination with laser microbeam lymph nodes. The cancer antigen 125 assay (CA-125) has been microdissection. A total of 273 commonly up-regulated used to screen for ovarian cancer, but is not specific; only 50% transcripts and 387 down-regulated transcripts were identified of patients with early ovarian cancer test positive using this in the ovarian carcinoma samples. -
Identification of Genomic Regions Associated with Phenotypic Variation Between Dog Breeds Using Selection Mapping
Identification of Genomic Regions Associated with Phenotypic Variation between Dog Breeds using Selection Mapping Amaury Vaysse1., Abhirami Ratnakumar2., Thomas Derrien1, Erik Axelsson2, Gerli Rosengren Pielberg2, Snaevar Sigurdsson3, Tove Fall4, Eija H. Seppa¨la¨ 5, Mark S. T. Hansen6, Cindy T. Lawley6, Elinor K. Karlsson3,7, The LUPA Consortium, Danika Bannasch8, Carles Vila` 9, Hannes Lohi5, Francis Galibert1, Merete Fredholm10, Jens Ha¨ggstro¨ m11,A˚ ke Hedhammar11, Catherine Andre´ 1, Kerstin Lindblad-Toh2,3, Christophe Hitte1, Matthew T. Webster2* 1 Institut de Ge´ne´tique et De´veloppement de Rennes, CNRS-UMR6061, Universite´ de Rennes 1, Rennes, France, 2 Science for Life Laboratory, Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden, 3 Broad Institute of Harvard and Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America, 4 Department of Medical Epidemiology and Biostatistics, Karolinska Institute, Stockholm, Sweden, 5 Department of Veterinary Biosciences, Research Programs Unit, Molecular Medicine, University of Helsinki and Folkha¨lsan Research Center, Helsinki, Finland, 6 Illumina, San Diego, California, United States of America, 7 FAS Center for Systems Biology, Harvard University, Cambridge, Massachusetts, United States of America, 8 Department of Population Health and Reproduction, School of Veterinary Medicine, University of California Davis, Davis, California, United States of America, 9 Department of Integrative Ecology, Don˜ana Biological Station (CSIC), Seville, Spain, 10 Faculty of Life Sciences, Division of Genetics and Bioinformatics, Department of Basic Animal and Veterinary Sciences, University of Copenhagen, Frederiksberg, Denmark, 11 Department of Clinical Sciences, Swedish University of Agricultural Sciences, Uppsala, Sweden Abstract The extraordinary phenotypic diversity of dog breeds has been sculpted by a unique population history accompanied by selection for novel and desirable traits. -
ID AKI Vs Control Fold Change P Value Symbol Entrez Gene Name *In
ID AKI vs control P value Symbol Entrez Gene Name *In case of multiple probesets per gene, one with the highest fold change was selected. Fold Change 208083_s_at 7.88 0.000932 ITGB6 integrin, beta 6 202376_at 6.12 0.000518 SERPINA3 serpin peptidase inhibitor, clade A (alpha-1 antiproteinase, antitrypsin), member 3 1553575_at 5.62 0.0033 MT-ND6 NADH dehydrogenase, subunit 6 (complex I) 212768_s_at 5.50 0.000896 OLFM4 olfactomedin 4 206157_at 5.26 0.00177 PTX3 pentraxin 3, long 212531_at 4.26 0.00405 LCN2 lipocalin 2 215646_s_at 4.13 0.00408 VCAN versican 202018_s_at 4.12 0.0318 LTF lactotransferrin 203021_at 4.05 0.0129 SLPI secretory leukocyte peptidase inhibitor 222486_s_at 4.03 0.000329 ADAMTS1 ADAM metallopeptidase with thrombospondin type 1 motif, 1 1552439_s_at 3.82 0.000714 MEGF11 multiple EGF-like-domains 11 210602_s_at 3.74 0.000408 CDH6 cadherin 6, type 2, K-cadherin (fetal kidney) 229947_at 3.62 0.00843 PI15 peptidase inhibitor 15 204006_s_at 3.39 0.00241 FCGR3A Fc fragment of IgG, low affinity IIIa, receptor (CD16a) 202238_s_at 3.29 0.00492 NNMT nicotinamide N-methyltransferase 202917_s_at 3.20 0.00369 S100A8 S100 calcium binding protein A8 215223_s_at 3.17 0.000516 SOD2 superoxide dismutase 2, mitochondrial 204627_s_at 3.04 0.00619 ITGB3 integrin, beta 3 (platelet glycoprotein IIIa, antigen CD61) 223217_s_at 2.99 0.00397 NFKBIZ nuclear factor of kappa light polypeptide gene enhancer in B-cells inhibitor, zeta 231067_s_at 2.97 0.00681 AKAP12 A kinase (PRKA) anchor protein 12 224917_at 2.94 0.00256 VMP1/ mir-21likely ortholog -
The Human Gene Connectome As a Map of Short Cuts for Morbid Allele Discovery
The human gene connectome as a map of short cuts for morbid allele discovery Yuval Itana,1, Shen-Ying Zhanga,b, Guillaume Vogta,b, Avinash Abhyankara, Melina Hermana, Patrick Nitschkec, Dror Friedd, Lluis Quintana-Murcie, Laurent Abela,b, and Jean-Laurent Casanovaa,b,f aSt. Giles Laboratory of Human Genetics of Infectious Diseases, Rockefeller Branch, The Rockefeller University, New York, NY 10065; bLaboratory of Human Genetics of Infectious Diseases, Necker Branch, Paris Descartes University, Institut National de la Santé et de la Recherche Médicale U980, Necker Medical School, 75015 Paris, France; cPlateforme Bioinformatique, Université Paris Descartes, 75116 Paris, France; dDepartment of Computer Science, Ben-Gurion University of the Negev, Beer-Sheva 84105, Israel; eUnit of Human Evolutionary Genetics, Centre National de la Recherche Scientifique, Unité de Recherche Associée 3012, Institut Pasteur, F-75015 Paris, France; and fPediatric Immunology-Hematology Unit, Necker Hospital for Sick Children, 75015 Paris, France Edited* by Bruce Beutler, University of Texas Southwestern Medical Center, Dallas, TX, and approved February 15, 2013 (received for review October 19, 2012) High-throughput genomic data reveal thousands of gene variants to detect a single mutated gene, with the other polymorphic genes per patient, and it is often difficult to determine which of these being of less interest. This goes some way to explaining why, variants underlies disease in a given individual. However, at the despite the abundance of NGS data, the discovery of disease- population level, there may be some degree of phenotypic homo- causing alleles from such data remains somewhat limited. geneity, with alterations of specific physiological pathways under- We developed the human gene connectome (HGC) to over- come this problem.