Text Mining Gene Selection to Understand Pathological Phenotype Using Biological Big Data
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Supplementary Information
doi: 10.1038/nature08795 SUPPLEMENTARY INFORMATION Supplementary Discussion Population naming In some contexts, the indigenous hunter-gatherer and pastoralist peoples of southern Africa are referred to collectively as the Khoisan (Khoi-San) or more recently Khoesan (Khoe-San) people. This grouping is based on the unique linguistic use of click-consonants1. Many names, often country-specific, have been used by Bantu pastoralists and European settlers to describe the hunter-gatherers, including San, Saan, Sonqua, Soaqua, Souqua, Sanqua, Kwankhala, Basarwa, Batwa, Abathwa, Baroa, Bushmen, Bossiesmans, Bosjemans, or Bosquimanos. In addition, group-specific names such as !Kung and Khwe are often used for the broader population. The two most commonly used names, “San” and “Bushmen”, have both been associated with much controversy due to derogatory connotations2. “San” has become the more popular term used in Western literature, although “Bushmen” is arguably the more commonly recognized term within the communities. Since they have no collective name for themselves, the term Bushmen was selected for use in this paper as the term most familiar to the participants themselves. Regarding identification of individuals The five men identified in this study have all elected to have their identity made public knowledge. Thus we present two complete personal genomes (KB1 and ABT), a low-coverage personal genome (NB1), and personal exomes for all five men. On a scientific level, identification allows for current and future correlation of genetic data with demographic and medical histories. On a social level, identification allows for maximizing community benefit. For !Gubi, G/aq’o, D#kgao and !Aî, their name represents not only themselves, but importantly their extended family unit and a way of life severely under threat. -
Transcriptomic Signatures of Brain Regional Vulnerability to Parkinsonâ
ARTICLE https://doi.org/10.1038/s42003-020-0804-9 OPEN Transcriptomic signatures of brain regional vulnerability to Parkinson’s disease Arlin Keo 1,2, Ahmed Mahfouz 1,2, Angela M.T. Ingrassia3, Jean-Pascal Meneboo4,5, Celine Villenet4, Eugénie Mutez6,7,8, Thomas Comptdaer 6,7, Boudewijn P.F. Lelieveldt2,9, Martin Figeac4,5, ✉ ✉ ✉ Marie-Christine Chartier-Harlin 6,7 , Wilma D.J. van de Berg 3 , Jacobus J. van Hilten 10 & 1234567890():,; ✉ Marcel J.T. Reinders 1,2 The molecular mechanisms underlying caudal-to-rostral progression of Lewy body pathology in Parkinson’s disease remain poorly understood. Here, we identified transcriptomic sig- natures across brain regions involved in Braak Lewy body stages in non-neurological adults from the Allen Human Brain Atlas. Among the genes that are indicative of regional vulner- ability, we found known genetic risk factors for Parkinson’s disease: SCARB2, ELOVL7, SH3GL2, SNCA, BAP1, and ZNF184. Results were confirmed in two datasets of non- neurological subjects, while in two datasets of Parkinson’s disease patients we found altered expression patterns. Co-expression analysis across vulnerable regions identified a module enriched for genes associated with dopamine synthesis and microglia, and another module related to the immune system, blood-oxygen transport, and endothelial cells. Both were highly expressed in regions involved in the preclinical stages of the disease. Finally, altera- tions in genes underlying these region-specific functions may contribute to the selective regional vulnerability in Parkinson’s disease brains. 1 Leiden Computational Biology Center, Leiden University Medical Center, Leiden, The Netherlands. 2 Delft Bioinformatics Lab, Delft University of Technology, Delft, The Netherlands. -
Sequence Overlap Between Autosomal and Sex-Linked Probes on the Illumina Humanmethylation27 Microarray
View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Elsevier - Publisher Connector Genomics 97 (2011) 214–222 Contents lists available at ScienceDirect Genomics journal homepage: www.elsevier.com/locate/ygeno Sequence overlap between autosomal and sex-linked probes on the Illumina HumanMethylation27 microarray Yi-an Chen a,b, Sanaa Choufani a, Jose Carlos Ferreira a,b, Daria Grafodatskaya a, Darci T. Butcher a, Rosanna Weksberg a,b,⁎ a Program in Genetics and Genome Biology, Hospital for Sick Children Research Institute, Toronto, Ontario, Canada b Institute of Medical Science, University of Toronto, Toronto, Ontario, Canada article info abstract Article history: The Illumina Infinium HumanMethylation27 BeadChip (Illumina 27k) microarray is a high-throughput Received 12 August 2010 platform capable of interrogating the human DNA methylome. In a search for autosomal sex-specific DNA Accepted 18 December 2010 methylation using this microarray, we discovered autosomal CpG loci showing significant methylation Available online 4 January 2011 differences between the sexes. However, we found that the majority of these probes cross-reacted with sequences from sex chromosomes. Moreover, we determined that 6–10% of the microarray probes are non- Keywords: specific and map to highly homologous genomic sequences. Using probes targeting different CpGs that are Illumina Infinium HumanMethylation 27 BeadChip exact duplicates of each other, we investigated the precision of these repeat measurements and concluded Non-specific cross-reactive probe that the overall precision of this microarray is excellent. In addition, we identified a small number of probes Sex-specific DNA methylation targeting CpGs that include single-nucleotide polymorphisms. -
A Computational Approach for Defining a Signature of Β-Cell Golgi Stress in Diabetes Mellitus
Page 1 of 781 Diabetes A Computational Approach for Defining a Signature of β-Cell Golgi Stress in Diabetes Mellitus Robert N. Bone1,6,7, Olufunmilola Oyebamiji2, Sayali Talware2, Sharmila Selvaraj2, Preethi Krishnan3,6, Farooq Syed1,6,7, Huanmei Wu2, Carmella Evans-Molina 1,3,4,5,6,7,8* Departments of 1Pediatrics, 3Medicine, 4Anatomy, Cell Biology & Physiology, 5Biochemistry & Molecular Biology, the 6Center for Diabetes & Metabolic Diseases, and the 7Herman B. Wells Center for Pediatric Research, Indiana University School of Medicine, Indianapolis, IN 46202; 2Department of BioHealth Informatics, Indiana University-Purdue University Indianapolis, Indianapolis, IN, 46202; 8Roudebush VA Medical Center, Indianapolis, IN 46202. *Corresponding Author(s): Carmella Evans-Molina, MD, PhD ([email protected]) Indiana University School of Medicine, 635 Barnhill Drive, MS 2031A, Indianapolis, IN 46202, Telephone: (317) 274-4145, Fax (317) 274-4107 Running Title: Golgi Stress Response in Diabetes Word Count: 4358 Number of Figures: 6 Keywords: Golgi apparatus stress, Islets, β cell, Type 1 diabetes, Type 2 diabetes 1 Diabetes Publish Ahead of Print, published online August 20, 2020 Diabetes Page 2 of 781 ABSTRACT The Golgi apparatus (GA) is an important site of insulin processing and granule maturation, but whether GA organelle dysfunction and GA stress are present in the diabetic β-cell has not been tested. We utilized an informatics-based approach to develop a transcriptional signature of β-cell GA stress using existing RNA sequencing and microarray datasets generated using human islets from donors with diabetes and islets where type 1(T1D) and type 2 diabetes (T2D) had been modeled ex vivo. To narrow our results to GA-specific genes, we applied a filter set of 1,030 genes accepted as GA associated. -
Radial Glia Reveal Complex Regulation by the Neuropeptide Secretoneurin
Transcriptomic and proteomic characterizations of goldfish (Carassius auratus) radial glia reveal complex regulation by the neuropeptide secretoneurin Dillon Da Fonte Thesis submitted to the Faculty of Graduate and Postdoctoral Studies University of Ottawa in partial fulfillment of the requirements for the Master of Science degree in biology Department of Biology Faculty of Science University of Ottawa © Dillon Da Fonte, Ottawa, Canada, 2016 Acknowledgements Finishing this thesis has been both a challenging and rewarding experience. This accomplishment would not have been possible without the never-ending support of colleagues, friends, family. First, I would like to express my most sincere gratitude to my supervisor and mentor, Dr. Vance Trudeau. Thank you for the opportunities you have given me, this experience has truly solidified my passion for research. I appreciate our many conversations that were enjoyed over a beer – it was truly a memorable experience. I would also like to thank my M.Sc. advisory committee, Dr. Michael Jonz and Dr. Marc Ekker for your time and insightful comments. A special thank you to Dr. Chris Martynuik who taught me the bioinformatics needed to analyze both transcriptomic and proteomic data and for all your help during my time at the University of Florida. I would like to also acknowledge my funding support from University of Ottawa, NSERC, and the Michael Smith Foreign Study Award for supporting my research stay at the University of Florida. To all current and past members of TeamENDO, thank you for the sense of community you all instilled in the lab. Both inside and outside the lab, I have made memories with all of you that I will cherish forever. -
Integrating Single-Step GWAS and Bipartite Networks Reconstruction Provides Novel Insights Into Yearling Weight and Carcass Traits in Hanwoo Beef Cattle
animals Article Integrating Single-Step GWAS and Bipartite Networks Reconstruction Provides Novel Insights into Yearling Weight and Carcass Traits in Hanwoo Beef Cattle Masoumeh Naserkheil 1 , Abolfazl Bahrami 1 , Deukhwan Lee 2,* and Hossein Mehrban 3 1 Department of Animal Science, University College of Agriculture and Natural Resources, University of Tehran, Karaj 77871-31587, Iran; [email protected] (M.N.); [email protected] (A.B.) 2 Department of Animal Life and Environment Sciences, Hankyong National University, Jungang-ro 327, Anseong-si, Gyeonggi-do 17579, Korea 3 Department of Animal Science, Shahrekord University, Shahrekord 88186-34141, Iran; [email protected] * Correspondence: [email protected]; Tel.: +82-31-670-5091 Received: 25 August 2020; Accepted: 6 October 2020; Published: 9 October 2020 Simple Summary: Hanwoo is an indigenous cattle breed in Korea and popular for meat production owing to its rapid growth and high-quality meat. Its yearling weight and carcass traits (backfat thickness, carcass weight, eye muscle area, and marbling score) are economically important for the selection of young and proven bulls. In recent decades, the advent of high throughput genotyping technologies has made it possible to perform genome-wide association studies (GWAS) for the detection of genomic regions associated with traits of economic interest in different species. In this study, we conducted a weighted single-step genome-wide association study which combines all genotypes, phenotypes and pedigree data in one step (ssGBLUP). It allows for the use of all SNPs simultaneously along with all phenotypes from genotyped and ungenotyped animals. Our results revealed 33 relevant genomic regions related to the traits of interest. -
Supplementary Table 1: Adhesion Genes Data Set
Supplementary Table 1: Adhesion genes data set PROBE Entrez Gene ID Celera Gene ID Gene_Symbol Gene_Name 160832 1 hCG201364.3 A1BG alpha-1-B glycoprotein 223658 1 hCG201364.3 A1BG alpha-1-B glycoprotein 212988 102 hCG40040.3 ADAM10 ADAM metallopeptidase domain 10 133411 4185 hCG28232.2 ADAM11 ADAM metallopeptidase domain 11 110695 8038 hCG40937.4 ADAM12 ADAM metallopeptidase domain 12 (meltrin alpha) 195222 8038 hCG40937.4 ADAM12 ADAM metallopeptidase domain 12 (meltrin alpha) 165344 8751 hCG20021.3 ADAM15 ADAM metallopeptidase domain 15 (metargidin) 189065 6868 null ADAM17 ADAM metallopeptidase domain 17 (tumor necrosis factor, alpha, converting enzyme) 108119 8728 hCG15398.4 ADAM19 ADAM metallopeptidase domain 19 (meltrin beta) 117763 8748 hCG20675.3 ADAM20 ADAM metallopeptidase domain 20 126448 8747 hCG1785634.2 ADAM21 ADAM metallopeptidase domain 21 208981 8747 hCG1785634.2|hCG2042897 ADAM21 ADAM metallopeptidase domain 21 180903 53616 hCG17212.4 ADAM22 ADAM metallopeptidase domain 22 177272 8745 hCG1811623.1 ADAM23 ADAM metallopeptidase domain 23 102384 10863 hCG1818505.1 ADAM28 ADAM metallopeptidase domain 28 119968 11086 hCG1786734.2 ADAM29 ADAM metallopeptidase domain 29 205542 11085 hCG1997196.1 ADAM30 ADAM metallopeptidase domain 30 148417 80332 hCG39255.4 ADAM33 ADAM metallopeptidase domain 33 140492 8756 hCG1789002.2 ADAM7 ADAM metallopeptidase domain 7 122603 101 hCG1816947.1 ADAM8 ADAM metallopeptidase domain 8 183965 8754 hCG1996391 ADAM9 ADAM metallopeptidase domain 9 (meltrin gamma) 129974 27299 hCG15447.3 ADAMDEC1 ADAM-like, -
Cellular and Molecular Signatures in the Disease Tissue of Early
Cellular and Molecular Signatures in the Disease Tissue of Early Rheumatoid Arthritis Stratify Clinical Response to csDMARD-Therapy and Predict Radiographic Progression Frances Humby1,* Myles Lewis1,* Nandhini Ramamoorthi2, Jason Hackney3, Michael Barnes1, Michele Bombardieri1, Francesca Setiadi2, Stephen Kelly1, Fabiola Bene1, Maria di Cicco1, Sudeh Riahi1, Vidalba Rocher-Ros1, Nora Ng1, Ilias Lazorou1, Rebecca E. Hands1, Desiree van der Heijde4, Robert Landewé5, Annette van der Helm-van Mil4, Alberto Cauli6, Iain B. McInnes7, Christopher D. Buckley8, Ernest Choy9, Peter Taylor10, Michael J. Townsend2 & Costantino Pitzalis1 1Centre for Experimental Medicine and Rheumatology, William Harvey Research Institute, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, Charterhouse Square, London EC1M 6BQ, UK. Departments of 2Biomarker Discovery OMNI, 3Bioinformatics and Computational Biology, Genentech Research and Early Development, South San Francisco, California 94080 USA 4Department of Rheumatology, Leiden University Medical Center, The Netherlands 5Department of Clinical Immunology & Rheumatology, Amsterdam Rheumatology & Immunology Center, Amsterdam, The Netherlands 6Rheumatology Unit, Department of Medical Sciences, Policlinico of the University of Cagliari, Cagliari, Italy 7Institute of Infection, Immunity and Inflammation, University of Glasgow, Glasgow G12 8TA, UK 8Rheumatology Research Group, Institute of Inflammation and Ageing (IIA), University of Birmingham, Birmingham B15 2WB, UK 9Institute of -
The Role of Pregnancy-Specific Glycoproteins on Trophoblast Motility in Three-Dimensional Gelatin Hydrogels
bioRxiv preprint doi: https://doi.org/10.1101/2020.09.25.314195; this version posted September 26, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The role of pregnancy-specific glycoproteins on trophoblast motility in three-dimensional gelatin hydrogels Samantha G. Zambuto1, Shemona Rattila2, Gabriela Dveksler2, Brendan A.C. Harley3,4,* 1 Dept. of Bioengineering University of Illinois at Urbana-Champaign Urbana, IL, USA 61801 2 Dept. of Pathology UniformeD Services University of Health Sciences BethesDa, MD, USA 20814 3 Dept. Chemical anD Biomolecular Engineering 4 Carl R. Woese Institute for Genomic Biology University of Illinois at Urbana-Champaign Urbana, IL, USA 61801 *Correspondence: B.A.C. Harley Dept. of Chemical anD Biomolecular Engineering Carl R. Woese Institute for Genomic Biology University of Illinois at Urbana-Champaign 110 Roger Adams Laboratory 600 S. Mathews Ave. Urbana, IL 61801 Phone: (217) 244-7112 Fax: (217) 333-5052 e-mail: [email protected] bioRxiv preprint doi: https://doi.org/10.1101/2020.09.25.314195; this version posted September 26, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. SUMMARY Trophoblast invasion is a complex biological process necessary for establishment of pregnancy; however, much remains unknown regarding what signaling factors coordinate the extent of invasion. Pregnancy-specific glycoproteins (PSGs) are some of the most abundant circulating trophoblastic proteins in maternal blooD during human pregnancy, with maternal serum concentrations rising to as high as 200-400 μg/mL at term. -
Transcription-Replication Collisions—A Series of Unfortunate Events
biomolecules Review Transcription-Replication Collisions—A Series of Unfortunate Events Commodore St Germain 1,2,*, Hongchang Zhao 2 and Jacqueline H. Barlow 2,* 1 School of Mathematics and Science, Solano Community College, 4000 Suisun Valley Road, Fairfield, CA 94534, USA 2 Department of Microbiology and Molecular Genetics, University of California Davis, One Shields Avenue, Davis, CA 95616, USA; [email protected] * Correspondence: [email protected] (C.S.G.); [email protected] (J.H.B.) Abstract: Transcription-replication interactions occur when DNA replication encounters genomic regions undergoing transcription. Both replication and transcription are essential for life and use the same DNA template making conflicts unavoidable. R-loops, DNA supercoiling, DNA secondary structure, and chromatin-binding proteins are all potential obstacles for processive replication or transcription and pose an even more potent threat to genome integrity when these processes co-occur. It is critical to maintaining high fidelity and processivity of transcription and replication while navigating through a complex chromatin environment, highlighting the importance of defining cellular pathways regulating transcription-replication interaction formation, evasion, and resolution. Here we discuss how transcription influences replication fork stability, and the safeguards that have evolved to navigate transcription-replication interactions and maintain genome integrity in mammalian cells. Keywords: DNA replication; transcription; R-loops; replication stress Citation: St Germain, C.; Zhao, H.; Barlow, J.H. Transcription-Replication Collisions—A Series of Unfortunate Events. Biomolecules 2021, 11, 1249. 1. Introduction https://doi.org/10.3390/ From bacteria to humans, transcription has been identified as a source of genome biom11081249 instability, initially observed as spontaneous recombination referred to as transcription- associated recombination (TAR) [1]. -
Mysterious Inhibitory Cell Regulator Investigated and Found Likely to Be Secretogranin II Related
Mysterious inhibitory cell regulator investigated and found likely to be secretogranin II related John E. Hart1, Iain J. Clarke2, Gail P. Risbridger3, Ben Ferneyhough4 and Mo´nica Vega-Herna´ndez5 1 Endocrine Pharmaceuticals, Tadley, Hampshire, UK 2 Department of Physiology, Neuroscience Program, Monash Biomedical Discovery Institute, Monash University, Clayton, VIC, Australia 3 Department of Anatomy and Developmental Biology, Biomedical Discovery Institute, Monash University, Clayton, VIC, Australia 4 Systems Biology Laboratory UK, Abingdon, Oxfordshire, UK 5 Department of Zoology, Lawrence Laboratory, University of Cambridge, Cambridge, Cambridgeshire, UK ABSTRACT In the context of a hunt for a postulated hormone that is tissue-mass inhibiting and reproductively associated, there is described probable relatedness to a granin protein. A 7–8 kDa polypeptide candidate (gels/MS) appeared in a bioassay-guided fractionation campaign involving sheep plasma. An N-terminal sequence of 14 amino acids was obtained for the polypeptide by Edman degradation. Bioinformatics and molecular biology failed to illuminate any ovine or non-ovine protein which might relate to this sequence. The N-terminal sequence was synthesized as the 14mer EPL001 peptide and surprisingly found to be inhibitory in an assay in vivo of compensatory renal growth in the rat and modulatory of nematode fecundity, in line with the inhibitory hormone hypothesis. Antibodies were raised to EPL001 and their deployment upheld the hypothesis that the EPL001 amino acid sequence is meaningful and relevant, notwithstanding bioinformatic obscurity. Immunohistochemistry (IHC) in sheep, rodents and humans yielded Submitted 6 July 2017 staining of seeming endocrine relevance (e.g. hypothalamus, gonads and Accepted 30 August 2017 neuroendocrine cells in diverse tissues), with apparent upregulation in certain Published 13 October 2017 human tumours (e.g. -
Metastatic Adrenocortical Carcinoma Displays Higher Mutation Rate and Tumor Heterogeneity Than Primary Tumors
ARTICLE DOI: 10.1038/s41467-018-06366-z OPEN Metastatic adrenocortical carcinoma displays higher mutation rate and tumor heterogeneity than primary tumors Sudheer Kumar Gara1, Justin Lack2, Lisa Zhang1, Emerson Harris1, Margaret Cam2 & Electron Kebebew1,3 Adrenocortical cancer (ACC) is a rare cancer with poor prognosis and high mortality due to metastatic disease. All reported genetic alterations have been in primary ACC, and it is 1234567890():,; unknown if there is molecular heterogeneity in ACC. Here, we report the genetic changes associated with metastatic ACC compared to primary ACCs and tumor heterogeneity. We performed whole-exome sequencing of 33 metastatic tumors. The overall mutation rate (per megabase) in metastatic tumors was 2.8-fold higher than primary ACC tumor samples. We found tumor heterogeneity among different metastatic sites in ACC and discovered recurrent mutations in several novel genes. We observed 37–57% overlap in genes that are mutated among different metastatic sites within the same patient. We also identified new therapeutic targets in recurrent and metastatic ACC not previously described in primary ACCs. 1 Endocrine Oncology Branch, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA. 2 Center for Cancer Research, Collaborative Bioinformatics Resource, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA. 3 Department of Surgery and Stanford Cancer Institute, Stanford University, Stanford, CA 94305, USA. Correspondence and requests for materials should be addressed to E.K. (email: [email protected]) NATURE COMMUNICATIONS | (2018) 9:4172 | DOI: 10.1038/s41467-018-06366-z | www.nature.com/naturecommunications 1 ARTICLE NATURE COMMUNICATIONS | DOI: 10.1038/s41467-018-06366-z drenocortical carcinoma (ACC) is a rare malignancy with types including primary ACC from the TCGA to understand our A0.7–2 cases per million per year1,2.