Searching for Rare Variants Associated with Osahs-Related Phenotypes

Total Page:16

File Type:pdf, Size:1020Kb

Searching for Rare Variants Associated with Osahs-Related Phenotypes SEARCHING FOR RARE VARIANTS ASSOCIATED WITH OSAHS-RELATED PHENOTYPES THROUGH PEDIGREES by JINGJING LIANG Dissertation Advisor: Dr. Xiaofeng Zhu Department of Population and Quantitative Health Sciences CASE WESTERN RESERVE UNIVERSITY May 29, 2019 CASE WESTERN RESERVE UNIVERSITY SCHOOL OF GRADUATE STUDIES We hereby approve the thesis/dissertation of Jingjing Liang candidate for the degree of Ph.D Committee Chair Scott M. Williams Committee Member Jonathan L. Haines Committee Member Xiaofeng Zhu Committee Member Rong Xu Committee Member Curtis M. Tatsuoka Date of Defense January 29, 2019 *We also certify that written approval has been obtained for any proprietary material contained therein. 1 Table of Contents CHAPTER 1: LITERATURE REVIEW AND SPECIFIC AIMS ………………14 1.1 Obstructive sleep apnea-hypopnea syndrome …………………………………..14 1.2 AHI and SpO2 …………………………………………………………………...16 1.3 Rare variants and missing heritability …………………………………………..22 1.4 Rare variant association analysis ...………………………………………………24 1.5 Rare variant test using pedigree………………………………………………….27 1.6 Annotating variants in genetic regions ………………………………………….29 1.7 Mendelian randomization………………………………………………………..32 1.8 Specific aims ……………………………………………………………………36 CHAPTER 2: IDENTIFYING LOW FREQUENCY AND RARE VARIANTS ASSOCIATED WITH AVSPO2S USING PEDIGREES .…….………38 2.1 Introduction ……………………………………………………………………...38 2.2 Material and methods……………………………………………………………..42 2.2.1 Description of study samples…..……………………………………………..42 2.2.2 Overview of the method………………………………………………………45 2.2.3 Primary phenotype……………………………………………………………47 2.2.4 Whole-genome sequencing…………………………………………………...47 2.2.5 Imputation of replication cohorts……………………………………………...48 2 2.2.6 Linkage analysis of AvSpO2S………………………………………………...48 2.2.7 Simulated family data…………………………………………………………50 2.2.8 Analysis of CFS stage I families………………….…………………………50 2.2.9 Gene-based association test…… …………………………………………….51 2.2.10 Identifying variants contributing to linkage evidence in CFS-EAs……...…..54 2.2.11 Estimating the proportion of AvSpO2S variability explained by the Identified variants in DLC1 ……………………….………………………...….……55 2.2.12 Test the aggregation of variant effect size directions in DLC1…….……….55 2.2.13 Gene expression Analysis …………………………………………………56 2.2.14 Cell type specific regulatory annotation enrichment analysis ……………....57 2.2.15 Mendelian randomization analysis …………………………………………58 2.2.16 AvSpO2S and methylation in the DLC1 gene in MESA ……………………58 2.2.17 AvSpO2S and DLC1 gene expression in the Sleep Heart Health Study from Framingham Heart Study…...……………..……………………….…………..59 2.3 Results ……………………………………………………………………….......60 2.3.1 Persistent linkage evidence of AvSpO2S on chromosome 8p23………….....60 2.3.2 Choice of family specific LOD score threshold 0.1………………………….62 2.3.3 Multiple low frequency and rare variants in DLC1 are associated with AvSpO2S……………………………………………………………………………..65 3 2.3.4 Conditioning on the effects of identified DLC1 variants reduces the linkage evidence in CFS-EAs………...……………………………………………...73 2.3.5 Identified DLC1 variants are enriched in regulatory regions………………...76 2.3.6 DLC1 non-coding variants are associated with DLC1 expression level in human skin cells-transformed fibroblasts……………………………………………77 2.3.7 DLC1 DNA methylation is weakly associated with AvSpO2S and AHI ….....82 2.3.8 DLC1 expression is weakly associated with AvSpO2S and AHI …………....84 2.3.9 Association of DLC1 with AvSpO2S is not mediated by AHI……………….84 2.3.10 Potential pleiotropic effect between DLC1, AvSpO2S and lung function ….84 2.4 Discussion ……………………………………………………………………….88 CHAPTER 3: ASSESSING THE INDEPENDENCE OF ASSOCIATION TESTS AND LINKAGE EVIDENCE OBTAINED IN THE SAME DATA ..……..93 3.1 Introduction ………………………………………………………………………93 3.2 Method .………………………………………………………………………94 3.3 Results ……………………………………………………………………...…….95 3.3.1 Assess the empirical P-values for burden and SKAT tests using genome-wide variants ……………………………………………………………..95 3.3.2 Enrichment of 5% significant gene-based tests on chromosome 8 target region in CFS-EAs ………………………………………………...……..……98 3.4 Discussion ………………………………………………………………………99 4 CHAPTER 4: DEVELOPING AN ANALYSIS PIPELINE TO SEARCH RARE VARIANTS THROUGH LINKAGE ANALYIS USING WGS DATA …..102 4.1 Introduction …………………………………………………………………….102 4.2 Method …………………………………………………………………….104 4.2.1 Overview of the analysis pipeline ………………………………………….104 4.2.2 Variance component linkage analysis ………………………………………106 4.2.3 Identify candidate variants in WGS data …………………………………107 4.2.4 Gene-based association tests using WGS data ……………………………107 4.3 Results ………………………………………………………………………….109 4.3.1 Multiple rare variants in CAV1 are associated with AHI ……………………109 4.3.2 CAV1 variants regulatory annotation and gene expression analysis …………110 4.4 Discussion ……………………………………………………………………112 CHAPTER 5: DISCUSSION AND FUTURE WORK …………………………118 5 List of Tables Table 1.1 Summary of the features, the pros and cons of the different type of methods for rare variants association test ……………………………………………26 Table 2.1 Sample characteristics of TOPMed WGS and imputed genotype studies…...42 Table 2.2 Selected coding and non-coding variants on 105 genes in target region ……....52 Table 2.3 Type I error of burden and SKAT test ………………………………………66 Table 2.4 Power of burden and SKAT tests when simulated rare variants account for 1% of variability of phenotype ………………………………………………………66 Table 2.5 Power of burden and SKAT tests when simulated rare variants account for 0.5% of variability of phenotype…………………………………………………...…67 Table 2.6 Stage I and II gene-based association tests with AvSpO2S……………..……..71 Table 2.7 Gene-based association test for DLC1 with AvSpO2S in TOPMed sequencing and independent replication data with imputed genotypes….……………... 72 Table 2.8 Mendelian randomization analysis to assess causal effects of DLC1 expression in skin cell-transformed fibroblasts to AvSpO2S …………...………………..81 Table 2.9 Sample characteristics and results of DLC1 methylation association test with AvSpO2S ……………………………………………………………………...83 Table 2.10 Sample characteristics and results of DLC1 expression level association with AHI and AvSpO2S …………………………..…………………………83 6 Table 2.11 Stage I and II gene-based association tests with AvSpO2S by adjusting AHI as covariate ……………………………………………………................85 Table 2.12 Stage I and II gene-based association tests for DLC1 with AvSpO2S Or FEV1/FVC by adjusting FEV1/FVC or AvSpO2S as covariates ……………………86 Table 3.1 Summary of the genome-wide gene based test using CFS-EA stage I family filtered variants for AvSpO2S…………………………………………………….96 Table 3.2 Enrichment of 5% significant AvSpO2S gene-based tests on chromosome 8 target region in CFS-EA ………………………………...………………98 Table 4.1 Stage I and II gene-based association tests with AHI …………………111 7 List of Figures Figure 1.1 Normal breathing, snoring and OSAHS …………………………………...15 Figure 1.2 Example of polysomnography trace during repetitive apnoeas …………...17 Figure 1.3 Distribution of AHI and AvSpO2S raw measurement in CFS-EAs………..18 Figure 1.4 Boxplots to compare AHI and AvSpO2S in healthy people and OSAHS patients in CFS-EAs……………..……………………………………………..19 Figure 1.5 Variance component linkage analysis of AHI ………………………………21 Figure 1.6 Genetic variant frequencies and effect sizes ………………………………22 Figure 1.7 Non-coding regions affect proximal and distal regulators ………………30 Figure 2.1 Variance component linkage analysis of AvSpO2S in CFS-EAs on chromosome 8……………………………………………………………………………62 Figure 2.2 Mixture distribution of family specific LOD scores in CFS families……...63 Figure 2.3 Distribution of family specific LOD score (All 40 causal variants have the same effect directions.)………………..…………… …………………………..64 Figure 2.4 Distribution of family specific LOD score (Half of 40 causal variants have the same effect directions.)………… …………………………………...64 Figure 2.5 The analysis flow chart for searching low frequency and rare variants associated with AvSpO2S using the TOPMed WGS data ………………………………68 Figure 2.6 Linkage evidence of AvSpO2S on chromosome 8 in CFS-EAs …………..70 8 Figure 2.7 Conditional linkage analysis for AvSpO2S on chromosome 8 in CFS-EAs TOPMed individuals ……………...………………………………………….75 Figure 2.8 Effect sizes of the 57 variants in DLC1 estimated using the stage II samples conditional on MAF …..………………………………………………………..76 Figure 2.9 Cell type specific regulatory annotation enrichment tests for the identified non-coding variants in DLC1 …………...……………………………………78 Figure 2.10 51 non-coding variants and the corresponding effect sizes in DLC1 genes plotted against physical locations ………………………………………….79 Figure 2.11 Gene based test for DLC1, MYOM2 and CSMD1 selected variants with corresponding expression level in GTEx tissues …………………………………..80 Figure 2.12 Mendelian randomization analysis using 24 DLC1 variants as instrumental variables…………………………………………………...……………….81 Figure 2.13 Comparison of the effect sizes of single variant associations with and without adjusting for FEV1, FVC and AHI as covariate for DLC1 selected variants ………………………………………………………………………....87 Figure 3.1 Distribution of variants number in each gene in genome-wide gene-based test using CFS-EA Stage I families filtered for coding and non-coding variants ……………………………………………………………………………..……96 Figure 3.2 Quantile-quantile plots for genome-wide gene-based test using CFS-EA stage I family filtered variants for AvSpO2S……………………………..……97 Figure 4.1 Overview of the analysis pipeline processes ……………………………..106 9 Figure 4.2 Variance component linkage analysis of AHI in
Recommended publications
  • Network Medicine Approach for Analysis of Alzheimer's Disease Gene Expression Data
    International Journal of Molecular Sciences Article Network Medicine Approach for Analysis of Alzheimer’s Disease Gene Expression Data David Cohen y, Alexander Pilozzi y and Xudong Huang * Neurochemistry Laboratory, Department of Psychiatry, Massachusetts General Hospital and Harvard Medical School, Charlestown, MA 02129, USA; [email protected] (D.C.); [email protected] (A.P.) * Correspondence: [email protected]; Tel./Fax: +1-617-724-9778 These authors contributed equally to this work. y Received: 15 November 2019; Accepted: 30 December 2019; Published: 3 January 2020 Abstract: Alzheimer’s disease (AD) is the most widespread diagnosed cause of dementia in the elderly. It is a progressive neurodegenerative disease that causes memory loss as well as other detrimental symptoms that are ultimately fatal. Due to the urgent nature of this disease, and the current lack of success in treatment and prevention, it is vital that different methods and approaches are applied to its study in order to better understand its underlying mechanisms. To this end, we have conducted network-based gene co-expression analysis on data from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database. By processing and filtering gene expression data taken from the blood samples of subjects with varying disease states and constructing networks based on that data to evaluate gene relationships, we have been able to learn about gene expression correlated with the disease, and we have identified several areas of potential research interest. Keywords: Alzheimer’s disease; network medicine; gene expression; neurodegeneration; neuroinflammation 1. Introduction Alzheimer’s disease (AD) is the most widespread diagnosed cause of dementia in the elderly [1].
    [Show full text]
  • Funktionelle in Vitro Und in Vivo Charakterisierung Des Putativen Tumorsuppressorgens SFRP1 Im Humanen Mammakarzinom
    Funktionelle in vitro und in vivo Charakterisierung des putativen Tumorsuppressorgens SFRP1 im humanen Mammakarzinom Von der Fakult¨at fur¨ Mathematik, Informatik und Naturwissenschaften der RWTH Aachen University zur Erlangung des akademischen Grades einer Doktorin der Naturwissenschaften genehmigte Dissertation vorgelegt von Diplom-Biologin Laura Huth (geb. Franken) aus Julich¨ Berichter: Universit¨atsprofessor Dr. rer. nat. Edgar Dahl Universit¨atsprofessor Dr. rer. nat. Ralph Panstruga Tag der mundlichen¨ Prufung:¨ 6. August 2014 Diese Dissertation ist auf den Internetseiten der Hochschulbibliothek online verfugbar.¨ Zusammenfassung Krebserkrankungen stellen weltweit eine der h¨aufigsten Todesursachen dar. Aus diesem Grund ist die Aufkl¨arung der zugrunde liegenden Mechanismen und Ur- sachen ein essentielles Ziel der molekularen Onkologie. Die Tumorforschung der letzten Jahre hat gezeigt, dass die Entstehung solider Karzinome ein Mehrstufen- Prozess ist, bei dem neben Onkogenen auch Tumorsuppresorgene eine entschei- dende Rolle spielen. Viele der heute bekannten Gene des WNT-Signalweges wur- den bereits als Onkogene oder Tumorsuppressorgene charakterisiert. Eine Dere- gulation des WNT-Signalweges wird daher mit der Entstehung und Progression vieler humaner Tumorentit¨aten wie beispielsweise auch dem Mammakarzinom, der weltweit h¨aufigsten Krebserkrankung der Frau, assoziiert. SFRP1, ein nega- tiver Regulator der WNT-Signalkaskade, wird in Brusttumoren haupts¨achlich durch den epigenetischen Mechanismus der Promotorhypermethylierung
    [Show full text]
  • Systems and Chemical Biology Approaches to Study Cell Function and Response to Toxins
    Dissertation submitted to the Combined Faculties for the Natural Sciences and for Mathematics of the Ruperto-Carola University of Heidelberg, Germany for the degree of Doctor of Natural Sciences Presented by MSc. Yingying Jiang born in Shandong, China Oral-examination: Systems and chemical biology approaches to study cell function and response to toxins Referees: Prof. Dr. Rob Russell Prof. Dr. Stefan Wölfl CONTRIBUTIONS The chapter III of this thesis was submitted for publishing under the title “Drug mechanism predominates over toxicity mechanisms in drug induced gene expression” by Yingying Jiang, Tobias C. Fuchs, Kristina Erdeljan, Bojana Lazerevic, Philip Hewitt, Gordana Apic & Robert B. Russell. For chapter III, text phrases, selected tables, figures are based on this submitted manuscript that has been originally written by myself. i ABSTRACT Toxicity is one of the main causes of failure during drug discovery, and of withdrawal once drugs reached the market. Prediction of potential toxicities in the early stage of drug development has thus become of great interest to reduce such costly failures. Since toxicity results from chemical perturbation of biological systems, we combined biological and chemical strategies to help understand and ultimately predict drug toxicities. First, we proposed a systematic strategy to predict and understand the mechanistic interpretation of drug toxicities based on chemical fragments. Fragments frequently found in chemicals with certain toxicities were defined as structural alerts for use in prediction. Some of the predictions were supported with mechanistic interpretation by integrating fragment- chemical, chemical-protein, protein-protein interactions and gene expression data. Next, we systematically deciphered the mechanisms of drug actions and toxicities by analyzing the associations of drugs’ chemical features, biological features and their gene expression profiles from the TG-GATEs database.
    [Show full text]
  • Seq2pathway Vignette
    seq2pathway Vignette Bin Wang, Xinan Holly Yang, Arjun Kinstlick May 19, 2021 Contents 1 Abstract 1 2 Package Installation 2 3 runseq2pathway 2 4 Two main functions 3 4.1 seq2gene . .3 4.1.1 seq2gene flowchart . .3 4.1.2 runseq2gene inputs/parameters . .5 4.1.3 runseq2gene outputs . .8 4.2 gene2pathway . 10 4.2.1 gene2pathway flowchart . 11 4.2.2 gene2pathway test inputs/parameters . 11 4.2.3 gene2pathway test outputs . 12 5 Examples 13 5.1 ChIP-seq data analysis . 13 5.1.1 Map ChIP-seq enriched peaks to genes using runseq2gene .................... 13 5.1.2 Discover enriched GO terms using gene2pathway_test with gene scores . 15 5.1.3 Discover enriched GO terms using Fisher's Exact test without gene scores . 17 5.1.4 Add description for genes . 20 5.2 RNA-seq data analysis . 20 6 R environment session 23 1 Abstract Seq2pathway is a novel computational tool to analyze functional gene-sets (including signaling pathways) using variable next-generation sequencing data[1]. Integral to this tool are the \seq2gene" and \gene2pathway" components in series that infer a quantitative pathway-level profile for each sample. The seq2gene function assigns phenotype-associated significance of genomic regions to gene-level scores, where the significance could be p-values of SNPs or point mutations, protein-binding affinity, or transcriptional expression level. The seq2gene function has the feasibility to assign non-exon regions to a range of neighboring genes besides the nearest one, thus facilitating the study of functional non-coding elements[2]. Then the gene2pathway summarizes gene-level measurements to pathway-level scores, comparing the quantity of significance for gene members within a pathway with those outside a pathway.
    [Show full text]
  • Stelios Pavlidis3, Matthew Loza3, Fred Baribaud3, Anthony
    Supplementary Data Th2 and non-Th2 molecular phenotypes of asthma using sputum transcriptomics in UBIOPRED Chih-Hsi Scott Kuo1.2, Stelios Pavlidis3, Matthew Loza3, Fred Baribaud3, Anthony Rowe3, Iaonnis Pandis2, Ana Sousa4, Julie Corfield5, Ratko Djukanovic6, Rene 7 7 8 2 1† Lutter , Peter J. Sterk , Charles Auffray , Yike Guo , Ian M. Adcock & Kian Fan 1†* # Chung on behalf of the U-BIOPRED consortium project team 1Airways Disease, National Heart & Lung Institute, Imperial College London, & Biomedical Research Unit, Biomedical Research Unit, Royal Brompton & Harefield NHS Trust, London, United Kingdom; 2Department of Computing & Data Science Institute, Imperial College London, United Kingdom; 3Janssen Research and Development, High Wycombe, Buckinghamshire, United Kingdom; 4Respiratory Therapeutic Unit, GSK, Stockley Park, United Kingdom; 5AstraZeneca R&D Molndal, Sweden and Areteva R&D, Nottingham, United Kingdom; 6Faculty of Medicine, Southampton University, Southampton, United Kingdom; 7Faculty of Medicine, University of Amsterdam, Amsterdam, Netherlands; 8European Institute for Systems Biology and Medicine, CNRS-ENS-UCBL, Université de Lyon, France. †Contributed equally #Consortium project team members are listed under Supplementary 1 Materials *To whom correspondence should be addressed: [email protected] 2 List of the U-BIOPRED Consortium project team members Uruj Hoda & Christos Rossios, Airways Disease, National Heart & Lung Institute, Imperial College London, UK & Biomedical Research Unit, Biomedical Research Unit, Royal
    [Show full text]
  • A Computational Approach for Defining a Signature of Β-Cell Golgi Stress in Diabetes Mellitus
    Page 1 of 781 Diabetes A Computational Approach for Defining a Signature of β-Cell Golgi Stress in Diabetes Mellitus Robert N. Bone1,6,7, Olufunmilola Oyebamiji2, Sayali Talware2, Sharmila Selvaraj2, Preethi Krishnan3,6, Farooq Syed1,6,7, Huanmei Wu2, Carmella Evans-Molina 1,3,4,5,6,7,8* Departments of 1Pediatrics, 3Medicine, 4Anatomy, Cell Biology & Physiology, 5Biochemistry & Molecular Biology, the 6Center for Diabetes & Metabolic Diseases, and the 7Herman B. Wells Center for Pediatric Research, Indiana University School of Medicine, Indianapolis, IN 46202; 2Department of BioHealth Informatics, Indiana University-Purdue University Indianapolis, Indianapolis, IN, 46202; 8Roudebush VA Medical Center, Indianapolis, IN 46202. *Corresponding Author(s): Carmella Evans-Molina, MD, PhD ([email protected]) Indiana University School of Medicine, 635 Barnhill Drive, MS 2031A, Indianapolis, IN 46202, Telephone: (317) 274-4145, Fax (317) 274-4107 Running Title: Golgi Stress Response in Diabetes Word Count: 4358 Number of Figures: 6 Keywords: Golgi apparatus stress, Islets, β cell, Type 1 diabetes, Type 2 diabetes 1 Diabetes Publish Ahead of Print, published online August 20, 2020 Diabetes Page 2 of 781 ABSTRACT The Golgi apparatus (GA) is an important site of insulin processing and granule maturation, but whether GA organelle dysfunction and GA stress are present in the diabetic β-cell has not been tested. We utilized an informatics-based approach to develop a transcriptional signature of β-cell GA stress using existing RNA sequencing and microarray datasets generated using human islets from donors with diabetes and islets where type 1(T1D) and type 2 diabetes (T2D) had been modeled ex vivo. To narrow our results to GA-specific genes, we applied a filter set of 1,030 genes accepted as GA associated.
    [Show full text]
  • Enteric Alpha Defensins in Norm and Pathology Nikolai a Lisitsyn1*, Yulia a Bukurova1, Inna G Nikitina1, George S Krasnov1, Yuri Sykulev2 and Sergey F Beresten1
    Lisitsyn et al. Annals of Clinical Microbiology and Antimicrobials 2012, 11:1 http://www.ann-clinmicrob.com/content/11/1/1 REVIEW Open Access Enteric alpha defensins in norm and pathology Nikolai A Lisitsyn1*, Yulia A Bukurova1, Inna G Nikitina1, George S Krasnov1, Yuri Sykulev2 and Sergey F Beresten1 Abstract Microbes living in the mammalian gut exist in constant contact with immunity system that prevents infection and maintains homeostasis. Enteric alpha defensins play an important role in regulation of bacterial colonization of the gut, as well as in activation of pro- and anti-inflammatory responses of the adaptive immune system cells in lamina propria. This review summarizes currently available data on functions of mammalian enteric alpha defensins in the immune defense and changes in their secretion in intestinal inflammatory diseases and cancer. Keywords: Enteric alpha defensins, Paneth cells, innate immunity, IBD, colon cancer Introduction hydrophobic structure with a positively charged hydro- Defensins are short, cysteine-rich, cationic peptides philic part) is essential for the insertion into the micro- found in vertebrates, invertebrates and plants, which bial membrane and the formation of a pore leading to play an important role in innate immunity against bac- membrane permeabilization and lysis of the microbe teria, fungi, protozoa, and viruses [1]. Mammalian [10]. Initial recognition of numerous microbial targets is defensins are predominantly expressed in epithelial cells a consequence of electrostatic interactions between the of skin, respiratory airways, gastrointestinal and geni- defensins arginine residues and the negatively charged tourinary tracts, which form physical barriers to external phospholipids of the microbial cytoplasmic membrane infectious agents [2,3], and also in leukocytes (mostly [2,5].
    [Show full text]
  • Duplications and Copy Number Variants of 8P23.1 Are Cytogenetically Indistinguishable but Distinct at the Molecular Level
    European Journal of Human Genetics (2005) 13, 1131–1136 & 2005 Nature Publishing Group All rights reserved 1018-4813/05 $30.00 www.nature.com/ejhg ARTICLE Duplications and copy number variants of 8p23.1 are cytogenetically indistinguishable but distinct at the molecular level John CK Barber*,1,2,3, Viv Maloney2, Edward J Hollox4, Annegret Stuke-Sontheimer5, Gabi du Bois6, Eva Daumiller6, Ute Klein-Vogler7, Andreas Dufke7, John AL Armour4 and Thomas Liehr8 1Wessex Regional Genetics Laboratory, Salisbury Hospital NHS Trust, Salisbury, Wiltshire, UK; 2National Genetics Reference Laboratory (Wessex), Salisbury Hospital NHS Trust, Salisbury, Wiltshire, UK; 3Human Genetics Division, Southampton University School of Medicine, Southampton General Hospital, Southampton, UK; 4Institute of Genetics, University of Nottingham, Queen’s Medical Centre, Nottingham, UK; 5Genetics Clinic, Wernigerode, Germany; 6Institute for Chromosome Diagnostics and Genetic Counselling, Boeblingen, Germany; 7Department of Medical Genetics, Eberhard-Karls University, Tuebingen, Germany; 8Institute for Human Genetics and Anthropology, Friedrich-Schiller University, Jena, Germany It has been proposed that duplications of 8p23.1 are either euchromatic variants of the 8p23.1 defensin domain with no phenotypic consequences or true duplications associated with developmental delay and heart defects. Here, we provide evidence for both alternatives in two new families. A duplication of most of band 8p23.1 (circa 5 Mb) was found in a girl of 8 years with pulmonary stenosis and mild language delay. BAC fluorescence in situ hybridisation (FISH) and multiplex amplifiable probe hybridisation (MAPH) showed that the two copies of the duplicated segment were sited, in an alternating fashion, between three copies of a circa 300–450 kb segment from 8p23.1 distal to REPD.
    [Show full text]
  • Downloaded from the Protein Data Bank (PDB
    bioRxiv preprint doi: https://doi.org/10.1101/2021.07.07.451411; this version posted July 7, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. CAT, AGTR2, L-SIGN and DC-SIGN are potential receptors for the entry of SARS-CoV-2 into human cells Dongjie Guo 1, 2, #, Ruifang Guo1, 2, #, Zhaoyang Li 1, 2, Yuyang Zhang 1, 2, Wei Zheng 3, Xiaoqiang Huang 3, Tursunjan Aziz 1, 2, Yang Zhang 3, 4, Lijun Liu 1, 2, * 1 College of Life and Health Sciences, Northeastern University, Shenyang, Liaoning, China 2 Key Laboratory of Data Analytics and Optimization for Smart Industry (Ministry of Education), Northeastern University, Shenyang, Liaoning, China 3 Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, USA 4 Department of Biological Chemistry, University of Michigan, Ann Arbor, USA * Corresponding author. College of Life and Health Sciences, Northeastern University, Shenyang, 110169, China. E-mail address: [email protected] (L. Liu) # These authors contributed equally to this work. 1 bioRxiv preprint doi: https://doi.org/10.1101/2021.07.07.451411; this version posted July 7, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. Abstract Since December 2019, the COVID-19 caused by SARS-CoV-2 has been widely spread all over the world.
    [Show full text]
  • Supplementary Table S4. FGA Co-Expressed Gene List in LUAD
    Supplementary Table S4. FGA co-expressed gene list in LUAD tumors Symbol R Locus Description FGG 0.919 4q28 fibrinogen gamma chain FGL1 0.635 8p22 fibrinogen-like 1 SLC7A2 0.536 8p22 solute carrier family 7 (cationic amino acid transporter, y+ system), member 2 DUSP4 0.521 8p12-p11 dual specificity phosphatase 4 HAL 0.51 12q22-q24.1histidine ammonia-lyase PDE4D 0.499 5q12 phosphodiesterase 4D, cAMP-specific FURIN 0.497 15q26.1 furin (paired basic amino acid cleaving enzyme) CPS1 0.49 2q35 carbamoyl-phosphate synthase 1, mitochondrial TESC 0.478 12q24.22 tescalcin INHA 0.465 2q35 inhibin, alpha S100P 0.461 4p16 S100 calcium binding protein P VPS37A 0.447 8p22 vacuolar protein sorting 37 homolog A (S. cerevisiae) SLC16A14 0.447 2q36.3 solute carrier family 16, member 14 PPARGC1A 0.443 4p15.1 peroxisome proliferator-activated receptor gamma, coactivator 1 alpha SIK1 0.435 21q22.3 salt-inducible kinase 1 IRS2 0.434 13q34 insulin receptor substrate 2 RND1 0.433 12q12 Rho family GTPase 1 HGD 0.433 3q13.33 homogentisate 1,2-dioxygenase PTP4A1 0.432 6q12 protein tyrosine phosphatase type IVA, member 1 C8orf4 0.428 8p11.2 chromosome 8 open reading frame 4 DDC 0.427 7p12.2 dopa decarboxylase (aromatic L-amino acid decarboxylase) TACC2 0.427 10q26 transforming, acidic coiled-coil containing protein 2 MUC13 0.422 3q21.2 mucin 13, cell surface associated C5 0.412 9q33-q34 complement component 5 NR4A2 0.412 2q22-q23 nuclear receptor subfamily 4, group A, member 2 EYS 0.411 6q12 eyes shut homolog (Drosophila) GPX2 0.406 14q24.1 glutathione peroxidase
    [Show full text]
  • A Study on Acute Myeloid Leukemias with Trisomy 8, 11, Or 13, Monosomy 7, Or Deletion 5Q
    Leukemia (2005) 19, 1224–1228 & 2005 Nature Publishing Group All rights reserved 0887-6924/05 $30.00 www.nature.com/leu Genomic gains and losses influence expression levels of genes located within the affected regions: a study on acute myeloid leukemias with trisomy 8, 11, or 13, monosomy 7, or deletion 5q C Schoch1, A Kohlmann1, M Dugas1, W Kern1, W Hiddemann1, S Schnittger1 and T Haferlach1 1Laboratory for Leukemia Diagnostics, Department of Internal Medicine III, University Hospital Grosshadern, Ludwig-Maximilians-University, Munich, Germany We performed microarray analyses in AML with trisomies 8 aim of this study to investigate whether gains and losses on the (n ¼ 12), 11 (n ¼ 7), 13 (n ¼ 7), monosomy 7 (n ¼ 9), and deletion genomic level translate into altered genes expression also in 5q (n ¼ 7) as sole changes to investigate whether genomic gains and losses translate into altered expression levels of other areas of the genome in AML. genes located in the affected chromosomal regions. Controls were 104 AML with normal karyotype. In subgroups with trisomy, the median expression of genes located on gained Materials and methods chromosomes was higher, while in AML with monosomy 7 and deletion 5q the median expression of genes located in deleted Samples regions was lower. The 50 most differentially expressed genes, as compared to all other subtypes, were equally distributed Bone marrow samples of AML patients at diagnosis were over the genome in AML subgroups with trisomies. In contrast, 30 and 86% of the most differentially expressed genes analyzed: 12 cases with trisomy 8 (AML-TRI8), seven with characteristic for AML with 5q deletion and monosomy 7 are trisomy 11 (AML-TRI11), seven with trisomy 13 (AML-TRI13), located on chromosomes 5 or 7.
    [Show full text]
  • Literature Mining Sustains and Enhances Knowledge Discovery from Omic Studies
    LITERATURE MINING SUSTAINS AND ENHANCES KNOWLEDGE DISCOVERY FROM OMIC STUDIES by Rick Matthew Jordan B.S. Biology, University of Pittsburgh, 1996 M.S. Molecular Biology/Biotechnology, East Carolina University, 2001 M.S. Biomedical Informatics, University of Pittsburgh, 2005 Submitted to the Graduate Faculty of School of Medicine in partial fulfillment of the requirements for the degree of Doctor of Philosophy University of Pittsburgh 2016 UNIVERSITY OF PITTSBURGH SCHOOL OF MEDICINE This dissertation was presented by Rick Matthew Jordan It was defended on December 2, 2015 and approved by Shyam Visweswaran, M.D., Ph.D., Associate Professor Rebecca Jacobson, M.D., M.S., Professor Songjian Lu, Ph.D., Assistant Professor Dissertation Advisor: Vanathi Gopalakrishnan, Ph.D., Associate Professor ii Copyright © by Rick Matthew Jordan 2016 iii LITERATURE MINING SUSTAINS AND ENHANCES KNOWLEDGE DISCOVERY FROM OMIC STUDIES Rick Matthew Jordan, M.S. University of Pittsburgh, 2016 Genomic, proteomic and other experimentally generated data from studies of biological systems aiming to discover disease biomarkers are currently analyzed without sufficient supporting evidence from the literature due to complexities associated with automated processing. Extracting prior knowledge about markers associated with biological sample types and disease states from the literature is tedious, and little research has been performed to understand how to use this knowledge to inform the generation of classification models from ‘omic’ data. Using pathway analysis methods to better understand the underlying biology of complex diseases such as breast and lung cancers is state-of-the-art. However, the problem of how to combine literature- mining evidence with pathway analysis evidence is an open problem in biomedical informatics research.
    [Show full text]