Gene and Protein Networks in Understanding Cellular Function

Total Page:16

File Type:pdf, Size:1020Kb

Gene and Protein Networks in Understanding Cellular Function Gene and Protein Networks in Understanding Cellular Function Sonja Katriina Lehtinen A thesis sumbitted to University College London for the degree of Doctor of Philosophy March 2015 1 I, Sonja Lehtinen, confirm that the work presented in this thesis is my own. Where information has been derived from other sources, I confirm that this has been indicated in the thesis. Sonja Lehtinen 2 March, 2015 2 Abstract Over the past decades, networks have emerged as a useful way of representing complex large-scale systems in a variety of fields. In cellular and molecular biology, gene and protein networks have attracted considerable interest as tools for making sense of increasingly large volumes of data. Despite this interest, there is still substantial debate over how to best exploit network models in cellular biology. This thesis explores the use of gene and protein networks in various biological contexts. The first part of the thesis (Chapter 2) examines protein function prediction using network-based `guilt-by-association' approaches. Given the falling costs of genome sequencing and the availability of large volumes of biological data, automated annotation of gene and protein function is becoming increasingly useful. Chapter 2 describes the development of a new network-based protein function prediction method and compares it to a leading algorithm on a number of benchmarks. Biases in benchmarking methods are also explicitly explored. The second part (Chapters 3 and 4) explores network approaches in under- standing loss of function variation in the human genome. For a number of genes, homozygous loss of function appears to have no detrimental effect. A possible ex- planation is that these genes are only necessary in specific genetic backgrounds. Chapter 3 develops methods for identifying these types of relationships between apparently loss of function tolerant genes. Chapter 4 describes the use of net- works in predicting the functional effects of loss of function mutations. The third part of the thesis (Chapters 5 and 6) uses network representations to model the effects of cellular stress on yeast cells. Chapter 5 examines stress induced changes in co-expression and protein interaction networks, finding evi- dence of increased modularisation in both types of network. Chapter 6 explores the effect of stress on resilience to node removal in the co-expression networks. 3 Acknowledgements If theses are written from the shoulders of giants, I have been lucky to work with particularly big and friendly ones. First and foremost, I would like to thank my primary supervisor Christine Orengo for her outstanding guidance and genuine kindness. I am also grateful for the excellent input and good humour of my second supervisor J¨urg B¨ahlerand the support of my thesis chair Konstantinos Thalassinos. I have had the pleasure of collaborating with a number of excellent scientists during this PhD. My thesis benefited greatly from the expertise and insight of John Shawe-Taylor, Jon Lees, Vera Pancaldi, Suganthi Balasubramanian and Koon-Kiu Yan. I would also like to thank everybody in the Orengo group for their help along the way. In my final year, I had the chance to visit the Gerstein Lab at Yale for three months. This was a fantastic experience - a big thank you to Mark Gerstein, UCL graduate school and the Yale-UCL Collaborative for making my visit possible and all of the Gerstein Lab for their friendly welcome. This PhD would not have been possible without the financial and academic support of CoMPLEX or the friendship of the CoMPLEX people. I could not have asked for a better PhD cohort. Finally, to my family and the rest of my friends: thank you for rarely asking about the PhD - it was appreciated. 4 Contents 1 Introduction 12 1.1 Overview: Network Biology . 12 1.2 Network Concepts and Terminology . 13 1.3 Network Approaches . 15 1.3.1 Characterising Nodes . 15 1.3.2 Characterising Networks . 16 1.3.3 Modelling Networks . 18 1.4 Network Types . 20 1.4.1 Protein-Protein Interaction Networks . 22 1.4.2 Co-Expression Networks . 30 1.4.3 Genetic Interaction Networks . 32 1.4.4 Other Functional Association Networks . 32 1.4.5 Dynamic Networks . 33 1.5 Thesis Overview . 34 2 A Graph Kernel Method for Gene Function Prediction 36 2.1 Introduction . 36 2.1.1 Gene Function Prediction: Scope and Definitions . 36 2.1.2 Exploiting Networks for Function Prediction . 37 2.1.3 Problems with Network Based Approaches . 38 2.1.4 Aims and Objectives . 42 2.2 Technical Background . 42 2.2.1 Kernel Methods . 42 2.2.2 Kernelized Prediction Algorithms . 45 2.2.3 Protein Function Prediction and Negative Examples . 50 2.3 Benchmark Development . 50 2.3.1 GO Rollback Benchmark . 50 2.3.2 Phenotypic RNAi Benchmark . 51 2.3.3 Fission Yeast Ageing Benchmark . 52 2.4 Preliminary Work . 54 2.4.1 Regression and Support Vector Machines . 54 2.4.2 Dimensionality Reduction Approaches . 55 2.5 Algorithms . 57 2.5.1 Compass Algorithm . 57 2.5.2 GeneMANIA algorithm . 60 2.5.3 Network Construction and Weighting . 62 2.6 Comparison to GeneMANIA . 62 2.6.1 Results Summary . 62 2.6.2 RNAi Benchmark . 63 2.6.3 GO Benchmark . 64 2.6.4 Ageing Benchmark . 64 5 2.6.5 GeneMANIA weighting scheme . 65 2.7 Detailed Investigation of Prediction . 65 2.7.1 Cross-Validation vs Rollback . 65 2.7.2 Effect of Gene Degree on Label Predictability . 66 2.7.3 Effect of Discovery Date on Label Predictability . 68 2.7.4 Effect of Degree on Discovery of New Labels . 68 2.8 Discussion . 71 2.8.1 Relative Performance of GeneMANIA and Compass . 71 2.8.2 Further Investigation of the Effects of Network Quality on Predictive Performance . 71 2.8.3 Cross-Validation and Parameter Selection . 72 2.8.4 Temporal Effects . 72 2.8.5 Problems with CAFA-Style Benchmarks . 73 2.8.6 Conclusion and Further Work . 73 3 Identifying Genetic Interactions between Loss of Function Tol- erant Genes 75 3.1 Introduction . 75 3.1.1 Loss of Function Variation . 75 3.1.2 Challenges in LoF Variant Identification . 76 3.1.3 Interactions between Genes: Recessive LoF Variants and Epistasis . 77 3.1.4 Aims and Objectives . 77 3.2 LoF Data . 77 3.3 Identifying Pairwise Genetic Interactions . 78 3.3.1 Hypergeometric Model . 80 3.3.2 Confounding Factors and Model Refinement . 80 3.3.3 Pairwise Interactions: Results . 86 3.3.4 Interpretation and Evaluation of Putative Interactions . 86 3.4 Network Approaches to LoF pairs . 89 3.4.1 Introduction to Modularity . 90 3.4.2 Anti-Community Clustering . 92 3.4.3 Identification of Epistatic Communities from Co-Occurrence Data . 92 3.4.4 Evaluation of Partition Approaches . 95 3.4.5 Epistatic Communities . 103 3.5 Conclusion . 103 4 Functional Association Networks For Prediction of Loss of Func- tion Tolerance 107 4.1 Introduction . 107 4.2 Datasets . 108 4.3 Network properties . 108 4.3.1 Protein Interaction Networks . 109 4.3.2 Genetic Interaction Networks . 109 4.3.3 Metabolic Networks . 111 4.4 Prediction Using Centrality . 113 4.5 Guilt-by-Association . 114 4.6 Integrated Prediction . 115 4.7 Discussion and Further Work . 117 6 5 Network Approaches to Modelling the Stress Response in Fis- sion Yeast 121 5.1 Introduction . 121 5.1.1 Stress Response . 121 5.1.2 Studying Changing Networks . 122 5.1.3 Work Undertaken . 123 5.2 Methods . 124 5.2.1 Co-Expression Network Construction . 124 5.2.2 Protein Interaction Network Construction . 129 5.2.3 Network Modularity . 130 5.3 Stress Induced Changes to Network Structure . 133 5.3.1 Co-Expression Networks . 133 5.3.2 Protein Interaction Networks . 142 5.4 Biological Correlates of Network Change . 148 5.4.1 Principles of Enrichment Analysis . 148 5.4.2 Co-Expression Networks . 148 5.4.3 PPI Networks . 149 5.4.4 Summary of Enrichment Analyses . 150 5.5 Possible Extensions of this Work . 150 5.6 Conclusion . 151 6 Network Resilience to Node Removal: Variability in Network Models and Co-Expression Networks 152 6.1 Introduction . 152 6.1.1 Robustness and Stress . 154 6.1.2 Variability of Resilience . 154 6.1.3 Aims and Objectives . 154 6.2 Methods . 155 6.2.1 Network Models . 155 6.2.2 Stress Networks . 156 6.2.3 Resilience Measure . 156 6.3 Network Models . 156 6.4 Stress Networks . 157 6.5 Discussion and Conclusion . 163 7 Discussion 166 7.1 Protein Function Prediction . 166 7.2 Loss of Function Variation . 168 7.3 Stress Response . 169 7.4 Overall Conclusions . 170 7 List of Figures 1.1 Examples of regular lattice, random and `small-world' networks . 13 1.2 Degree distributions in random and `scale-free' networks . 19 1.3 Illustration of a yeast two hybrid system . 23 1.4 Illustration of tandem affinity purification . 23 1.5 The number of physical protein-protein interactions in the Bi- oGRID database . 26 2.1 Illustration of the effect of information transfer across databases on benchmarking . 40 2.2 Illustration of how a mapping into a different space can make patterns in data more detectable . 43 2.3 The relative performance of different prediction approaches on.
Recommended publications
  • Funktionelle in Vitro Und in Vivo Charakterisierung Des Putativen Tumorsuppressorgens SFRP1 Im Humanen Mammakarzinom
    Funktionelle in vitro und in vivo Charakterisierung des putativen Tumorsuppressorgens SFRP1 im humanen Mammakarzinom Von der Fakult¨at fur¨ Mathematik, Informatik und Naturwissenschaften der RWTH Aachen University zur Erlangung des akademischen Grades einer Doktorin der Naturwissenschaften genehmigte Dissertation vorgelegt von Diplom-Biologin Laura Huth (geb. Franken) aus Julich¨ Berichter: Universit¨atsprofessor Dr. rer. nat. Edgar Dahl Universit¨atsprofessor Dr. rer. nat. Ralph Panstruga Tag der mundlichen¨ Prufung:¨ 6. August 2014 Diese Dissertation ist auf den Internetseiten der Hochschulbibliothek online verfugbar.¨ Zusammenfassung Krebserkrankungen stellen weltweit eine der h¨aufigsten Todesursachen dar. Aus diesem Grund ist die Aufkl¨arung der zugrunde liegenden Mechanismen und Ur- sachen ein essentielles Ziel der molekularen Onkologie. Die Tumorforschung der letzten Jahre hat gezeigt, dass die Entstehung solider Karzinome ein Mehrstufen- Prozess ist, bei dem neben Onkogenen auch Tumorsuppresorgene eine entschei- dende Rolle spielen. Viele der heute bekannten Gene des WNT-Signalweges wur- den bereits als Onkogene oder Tumorsuppressorgene charakterisiert. Eine Dere- gulation des WNT-Signalweges wird daher mit der Entstehung und Progression vieler humaner Tumorentit¨aten wie beispielsweise auch dem Mammakarzinom, der weltweit h¨aufigsten Krebserkrankung der Frau, assoziiert. SFRP1, ein nega- tiver Regulator der WNT-Signalkaskade, wird in Brusttumoren haupts¨achlich durch den epigenetischen Mechanismus der Promotorhypermethylierung
    [Show full text]
  • A Computational Approach for Defining a Signature of Β-Cell Golgi Stress in Diabetes Mellitus
    Page 1 of 781 Diabetes A Computational Approach for Defining a Signature of β-Cell Golgi Stress in Diabetes Mellitus Robert N. Bone1,6,7, Olufunmilola Oyebamiji2, Sayali Talware2, Sharmila Selvaraj2, Preethi Krishnan3,6, Farooq Syed1,6,7, Huanmei Wu2, Carmella Evans-Molina 1,3,4,5,6,7,8* Departments of 1Pediatrics, 3Medicine, 4Anatomy, Cell Biology & Physiology, 5Biochemistry & Molecular Biology, the 6Center for Diabetes & Metabolic Diseases, and the 7Herman B. Wells Center for Pediatric Research, Indiana University School of Medicine, Indianapolis, IN 46202; 2Department of BioHealth Informatics, Indiana University-Purdue University Indianapolis, Indianapolis, IN, 46202; 8Roudebush VA Medical Center, Indianapolis, IN 46202. *Corresponding Author(s): Carmella Evans-Molina, MD, PhD ([email protected]) Indiana University School of Medicine, 635 Barnhill Drive, MS 2031A, Indianapolis, IN 46202, Telephone: (317) 274-4145, Fax (317) 274-4107 Running Title: Golgi Stress Response in Diabetes Word Count: 4358 Number of Figures: 6 Keywords: Golgi apparatus stress, Islets, β cell, Type 1 diabetes, Type 2 diabetes 1 Diabetes Publish Ahead of Print, published online August 20, 2020 Diabetes Page 2 of 781 ABSTRACT The Golgi apparatus (GA) is an important site of insulin processing and granule maturation, but whether GA organelle dysfunction and GA stress are present in the diabetic β-cell has not been tested. We utilized an informatics-based approach to develop a transcriptional signature of β-cell GA stress using existing RNA sequencing and microarray datasets generated using human islets from donors with diabetes and islets where type 1(T1D) and type 2 diabetes (T2D) had been modeled ex vivo. To narrow our results to GA-specific genes, we applied a filter set of 1,030 genes accepted as GA associated.
    [Show full text]
  • Supplementary Table 3 Complete List of RNA-Sequencing Analysis of Gene Expression Changed by ≥ Tenfold Between Xenograft and Cells Cultured in 10%O2
    Supplementary Table 3 Complete list of RNA-Sequencing analysis of gene expression changed by ≥ tenfold between xenograft and cells cultured in 10%O2 Expr Log2 Ratio Symbol Entrez Gene Name (culture/xenograft) -7.182 PGM5 phosphoglucomutase 5 -6.883 GPBAR1 G protein-coupled bile acid receptor 1 -6.683 CPVL carboxypeptidase, vitellogenic like -6.398 MTMR9LP myotubularin related protein 9-like, pseudogene -6.131 SCN7A sodium voltage-gated channel alpha subunit 7 -6.115 POPDC2 popeye domain containing 2 -6.014 LGI1 leucine rich glioma inactivated 1 -5.86 SCN1A sodium voltage-gated channel alpha subunit 1 -5.713 C6 complement C6 -5.365 ANGPTL1 angiopoietin like 1 -5.327 TNN tenascin N -5.228 DHRS2 dehydrogenase/reductase 2 leucine rich repeat and fibronectin type III domain -5.115 LRFN2 containing 2 -5.076 FOXO6 forkhead box O6 -5.035 ETNPPL ethanolamine-phosphate phospho-lyase -4.993 MYO15A myosin XVA -4.972 IGF1 insulin like growth factor 1 -4.956 DLG2 discs large MAGUK scaffold protein 2 -4.86 SCML4 sex comb on midleg like 4 (Drosophila) Src homology 2 domain containing transforming -4.816 SHD protein D -4.764 PLP1 proteolipid protein 1 -4.764 TSPAN32 tetraspanin 32 -4.713 N4BP3 NEDD4 binding protein 3 -4.705 MYOC myocilin -4.646 CLEC3B C-type lectin domain family 3 member B -4.646 C7 complement C7 -4.62 TGM2 transglutaminase 2 -4.562 COL9A1 collagen type IX alpha 1 chain -4.55 SOSTDC1 sclerostin domain containing 1 -4.55 OGN osteoglycin -4.505 DAPL1 death associated protein like 1 -4.491 C10orf105 chromosome 10 open reading frame 105 -4.491
    [Show full text]
  • Functional Annotations of Single-Nucleotide Polymorphism
    CLINICAL RESEARCH e-ISSN 1643-3750 © Med Sci Monit, 2020; 26: e922710 DOI: 10.12659/MSM.922710 Received: 2020.01.08 Accepted: 2020.02.20 Functional Annotations of Single-Nucleotide Available online: 2020.03.30 Published: 2020.05.25 Polymorphism (SNP)-Based and Gene-Based Genome-Wide Association Studies Show Genes Affecting Keratitis Susceptibility Authors’ Contribution: BCDEF 1 Yue Xu* 1 Department of Ophthalmology, First Affiliated Hospital of Soochow University, Study Design A BCDEF 2 Xiao-Lin Yang* Suzhou, Jiangsu, P.R. China Data Collection B 2 Center for Genetic Epidemiology and Genomics, School of Public Health, Medical Statistical Analysis C BCD 1 Xiao-Long Yang College of Soochow University, Suzhou, Jiangsu, P.R. China Data Interpretation D BC 1 Ya-Ru Ren Manuscript Preparation E BC 1 Xin-Yu Zhuang Literature Search F Funds Collection G ADE 2 Lei Zhang ADE 1 Xiao-Feng Zhang * Yue Xu and Xiao-Lin Yang contributed equally Corresponding Authors: Xiao-Feng Zhang, e-mail: [email protected], Lei Zhang, e-mail: [email protected] Source of support: Departmental sources Background: Keratitis is a complex condition in humans and is the second most common cause of legal blindness worldwide. Material/Methods: To reveal the genomic loci underlying keratitis, we performed functional annotations of SNP-based and gene- based genome-wide association studies of keratitis in the UK Biobank (UKB) cohort with 337 199 subjects of European ancestry. Results: The publicly available SNP-based association results showed a total of 34 SNPs, from 14 distinct loci, associated with keratitis in the UKB. Gene-based association analysis identified 2 significant genes:IQCF3 (p=2.0×10–6) and SOD3 (p=2.0×10–6).
    [Show full text]
  • Molecular Characterization of Acute Myeloid Leukemia by Next Generation Sequencing: Identification of Novel Biomarkers and Targets of Personalized Therapies
    Alma Mater Studiorum – Università di Bologna Dipartimento di Medicina Specialistica, Diagnostica e Sperimentale Dottorato di Ricerca in Oncologia, Ematologia e Patologia XXX Ciclo Settore Scientifico Disciplinare: MED/15 Settore Concorsuale:06/D3 Molecular characterization of acute myeloid leukemia by Next Generation Sequencing: identification of novel biomarkers and targets of personalized therapies Presentata da: Antonella Padella Coordinatore Prof. Pier-Luigi Lollini Supervisore: Prof. Giovanni Martinelli Esame finale anno 2018 Abstract Acute myeloid leukemia (AML) is a hematopoietic neoplasm that affects myeloid progenitor cells and it is one of the malignancies best studied by next generation sequencing (NGS), showing a highly heterogeneous genetic background. The aim of the study was to characterize the molecular landscape of 2 subgroups of AML patients carrying either chromosomal number alterations (i.e. aneuploidy) or rare fusion genes. We performed whole exome sequencing and we integrated the mutational data with transcriptomic and copy number analysis. We identified the cell cycle, the protein degradation, response to reactive oxygen species, energy metabolism and biosynthetic process as the pathways mostly targeted by alterations in aneuploid AML. Moreover, we identified a 3-gene expression signature including RAD50, PLK1 and CDC20 that characterize this subgroup. Taking advantage of RNA sequencing we aimed at the discovery of novel and rare gene fusions. We detected 9 rare chimeric transcripts, of which partner genes were transcription factors (ZEB2, BCL11B and MAFK) or tumor suppressors (SAV1 and PUF60) rarely translocated across cancer types. Moreover, we detected cryptic events hiding the loss of NF1 and WT1, two recurrently altered genes in AML. Finally, we explored the oncogenic potential of the ZEB2-BCL11B fusion, which revealed no transforming ability in vitro.
    [Show full text]
  • Cellular and Molecular Signatures in the Disease Tissue of Early
    Cellular and Molecular Signatures in the Disease Tissue of Early Rheumatoid Arthritis Stratify Clinical Response to csDMARD-Therapy and Predict Radiographic Progression Frances Humby1,* Myles Lewis1,* Nandhini Ramamoorthi2, Jason Hackney3, Michael Barnes1, Michele Bombardieri1, Francesca Setiadi2, Stephen Kelly1, Fabiola Bene1, Maria di Cicco1, Sudeh Riahi1, Vidalba Rocher-Ros1, Nora Ng1, Ilias Lazorou1, Rebecca E. Hands1, Desiree van der Heijde4, Robert Landewé5, Annette van der Helm-van Mil4, Alberto Cauli6, Iain B. McInnes7, Christopher D. Buckley8, Ernest Choy9, Peter Taylor10, Michael J. Townsend2 & Costantino Pitzalis1 1Centre for Experimental Medicine and Rheumatology, William Harvey Research Institute, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, Charterhouse Square, London EC1M 6BQ, UK. Departments of 2Biomarker Discovery OMNI, 3Bioinformatics and Computational Biology, Genentech Research and Early Development, South San Francisco, California 94080 USA 4Department of Rheumatology, Leiden University Medical Center, The Netherlands 5Department of Clinical Immunology & Rheumatology, Amsterdam Rheumatology & Immunology Center, Amsterdam, The Netherlands 6Rheumatology Unit, Department of Medical Sciences, Policlinico of the University of Cagliari, Cagliari, Italy 7Institute of Infection, Immunity and Inflammation, University of Glasgow, Glasgow G12 8TA, UK 8Rheumatology Research Group, Institute of Inflammation and Ageing (IIA), University of Birmingham, Birmingham B15 2WB, UK 9Institute of
    [Show full text]
  • Understanding Chronic Kidney Disease: Genetic and Epigenetic Approaches
    University of Pennsylvania ScholarlyCommons Publicly Accessible Penn Dissertations 2017 Understanding Chronic Kidney Disease: Genetic And Epigenetic Approaches Yi-An Ko Ko University of Pennsylvania, [email protected] Follow this and additional works at: https://repository.upenn.edu/edissertations Part of the Bioinformatics Commons, Genetics Commons, and the Systems Biology Commons Recommended Citation Ko, Yi-An Ko, "Understanding Chronic Kidney Disease: Genetic And Epigenetic Approaches" (2017). Publicly Accessible Penn Dissertations. 2404. https://repository.upenn.edu/edissertations/2404 This paper is posted at ScholarlyCommons. https://repository.upenn.edu/edissertations/2404 For more information, please contact [email protected]. Understanding Chronic Kidney Disease: Genetic And Epigenetic Approaches Abstract The work described in this dissertation aimed to better understand the genetic and epigenetic factors influencing chronic kidney disease (CKD) development. Genome-wide association studies (GWAS) have identified single nucleotide polymorphisms (SNPs) significantly associated with chronic kidney disease. However, these studies have not effectively identified target genes for the CKD variants. Most of the identified variants are localized to non-coding genomic regions, and how they associate with CKD development is not well-understood. As GWAS studies only explain a small fraction of heritability, we hypothesized that epigenetic changes could explain part of this missing heritability. To identify potential gene targets of the genetic variants, we performed expression quantitative loci (eQTL) analysis, using genotyping arrays and RNA sequencing from human kidney samples. To identify the target genes of CKD-associated SNPs, we integrated the GWAS-identified SNPs with the eQTL results using a Bayesian colocalization method, coloc. This resulted in a short list of target genes, including PGAP3 and CASP9, two genes that have been shown to present with kidney phenotypes in knockout mice.
    [Show full text]
  • Myopia in African Americans Is Significantly Linked to Chromosome 7P15.2-14.2
    Genetics Myopia in African Americans Is Significantly Linked to Chromosome 7p15.2-14.2 Claire L. Simpson,1,2,* Anthony M. Musolf,2,* Roberto Y. Cordero,1 Jennifer B. Cordero,1 Laura Portas,2 Federico Murgia,2 Deyana D. Lewis,2 Candace D. Middlebrooks,2 Elise B. Ciner,3 Joan E. Bailey-Wilson,1,† and Dwight Stambolian4,† 1Department of Genetics, Genomics and Informatics and Department of Ophthalmology, University of Tennessee Health Science Center, Memphis, Tennessee, United States 2Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Baltimore, Maryland, United States 3The Pennsylvania College of Optometry at Salus University, Elkins Park, Pennsylvania, United States 4Department of Ophthalmology, University of Pennsylvania, Philadelphia, Pennsylvania, United States Correspondence: Joan E. PURPOSE. The purpose of this study was to perform genetic linkage analysis and associ- Bailey-Wilson, NIH/NHGRI, 333 ation analysis on exome genotyping from highly aggregated African American families Cassell Drive, Suite 1200, Baltimore, with nonpathogenic myopia. African Americans are a particularly understudied popula- MD 21131, USA; tion with respect to myopia. [email protected]. METHODS. One hundred six African American families from the Philadelphia area with a CLS and AMM contributed equally to family history of myopia were genotyped using an Illumina ExomePlus array and merged this work and should be considered co-first authors. with previous microsatellite data. Myopia was initially measured in mean spherical equiv- JEB-W and DS contributed equally alent (MSE) and converted to a binary phenotype where individuals were identified as to this work and should be affected, unaffected, or unknown.
    [Show full text]
  • Wo2017/132291
    (12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) (19) World Intellectual Property Organization International Bureau (10) International Publication Number (43) International Publication Date W O 2017/132291 A l 3 August 2017 (03.08.2017) P O P C T (51) International Patent Classification: [US/US]; 77 Massachusetts Avenue, Cambridge, MA A61K 48/00 (2006.01) C12Q 1/68 (2006.01) 02139 (US). THE GENERAL HOSPITAL CORPORA¬ A61K 39/395 (2006.01) G01N 33/574 (2006.01) TION [US/US]; 55 Fruit Street, Boston, MA 021 14 (US). C12N 15/11 (2006.01) (72) Inventors; and (21) International Application Number: (71) Applicants : REGEV, Aviv [US/US]; 415 Main Street, PCT/US2017/014995 Cambridge, MA 02142 (US). BERNSTEIN, Bradley [US/US]; 55 Fruit Street, Boston, MA 021 14 (US). (22) International Filing Date: TIROSH, Itay [US/US]; 415 Main Street, Cambridge, 25 January 20 17 (25.01 .2017) MA 02142 (US). SUVA, Mario [US/US]; 55 Fruit Street, (25) Filing Language: English Bostn, MA 02144 (US). ROZENBALTT-ROSEN, Orit [US/US]; 415 Main Street, Cambridge, MA 02142 (US). (26) Publication Language: English (74) Agent: NIX, F., Brent; Johnson, Marcou & Isaacs, LLC, (30) Priority Data: 317A East Liberty St., Savannah, GA 31401 (US). 62/286,850 25 January 2016 (25.01.2016) US 62/437,558 2 1 December 201 6 (21. 12.2016) US (81) Designated States (unless otherwise indicated, for every kind of national protection available): AE, AG, AL, AM, (71) Applicants: THE BROAD INSTITUTE, INC. [US/US]; AO, AT, AU, AZ, BA, BB, BG, BH, BN, BR, BW, BY, 415 Main Street, Cambridge, MA 02142 (US).
    [Show full text]
  • Supplementary Information.Pdf
    Supplementary Information Whole transcriptome profiling reveals major cell types in the cellular immune response against acute and chronic active Epstein‐Barr virus infection Huaqing Zhong1, Xinran Hu2, Andrew B. Janowski2, Gregory A. Storch2, Liyun Su1, Lingfeng Cao1, Jinsheng Yu3, and Jin Xu1 Department of Clinical Laboratory1, Children's Hospital of Fudan University, Minhang District, Shanghai 201102, China; Departments of Pediatrics2 and Genetics3, Washington University School of Medicine, Saint Louis, Missouri 63110, United States. Supplementary information includes the following: 1. Supplementary Figure S1: Fold‐change and correlation data for hyperactive and hypoactive genes. 2. Supplementary Table S1: Clinical data and EBV lab results for 110 study subjects. 3. Supplementary Table S2: Differentially expressed genes between AIM vs. Healthy controls. 4. Supplementary Table S3: Differentially expressed genes between CAEBV vs. Healthy controls. 5. Supplementary Table S4: Fold‐change data for 303 immune mediators. 6. Supplementary Table S5: Primers used in qPCR assays. Supplementary Figure S1. Fold‐change (a) and Pearson correlation data (b) for 10 cell markers and 61 hypoactive and hyperactive genes identified in subjects with acute EBV infection (AIM) in the primary cohort. Note: 23 up‐regulated hyperactive genes were highly correlated positively with cytotoxic T cell (Tc) marker CD8A and NK cell marker CD94 (KLRD1), and 38 down‐regulated hypoactive genes were highly correlated positively with B cell, conventional dendritic cell
    [Show full text]
  • A High-Resolution Integrated Map of Copy Number Polymorphisms Within
    Nicholas et al. BMC Genomics 2011, 12:414 http://www.biomedcentral.com/1471-2164/12/414 RESEARCHARTICLE Open Access A high-resolution integrated map of copy number polymorphisms within and between breeds of the modern domesticated dog Thomas J Nicholas1, Carl Baker1, Evan E Eichler1,2 and Joshua M Akey1* Abstract Background: Structural variation contributes to the rich genetic and phenotypic diversity of the modern domestic dog, Canis lupus familiaris, although compared to other organisms, catalogs of canine copy number variants (CNVs) are poorly defined. To this end, we developed a customized high-density tiling array across the canine genome and used it to discover CNVs in nine genetically diverse dogs and a gray wolf. Results: In total, we identified 403 CNVs that overlap 401 genes, which are enriched for defense/immunity, oxidoreductase, protease, receptor, signaling molecule and transporter genes. Furthermore, we performed detailed comparisons between CNVs located within versus outside of segmental duplications (SDs) and find that CNVs in SDs are enriched for gene content and complexity. Finally, we compiled all known dog CNV regions and genotyped them with a custom aCGH chip in 61 dogs from 12 diverse breeds. These data allowed us to perform the first population genetics analysis of canine structural variation and identify CNVs that potentially contribute to breed specific traits. Conclusions: Our comprehensive analysis of canine CNVs will be an important resource in genetically dissecting canine phenotypic and behavioral variation. Background domestication, examine relatedness among breeds, and The domestication of the modern dog from their wolf identify signatures of artificial selection [4,7-9].
    [Show full text]
  • Single Cell Derived Clonal Analysis of Human Glioblastoma Links
    SUPPLEMENTARY INFORMATION: Single cell derived clonal analysis of human glioblastoma links functional and genomic heterogeneity ! Mona Meyer*, Jüri Reimand*, Xiaoyang Lan, Renee Head, Xueming Zhu, Michelle Kushida, Jane Bayani, Jessica C. Pressey, Anath Lionel, Ian D. Clarke, Michael Cusimano, Jeremy Squire, Stephen Scherer, Mark Bernstein, Melanie A. Woodin, Gary D. Bader**, and Peter B. Dirks**! ! * These authors contributed equally to this work.! ** Correspondence: [email protected] or [email protected]! ! Supplementary information - Meyer, Reimand et al. Supplementary methods" 4" Patient samples and fluorescence activated cell sorting (FACS)! 4! Differentiation! 4! Immunocytochemistry and EdU Imaging! 4! Proliferation! 5! Western blotting ! 5! Temozolomide treatment! 5! NCI drug library screen! 6! Orthotopic injections! 6! Immunohistochemistry on tumor sections! 6! Promoter methylation of MGMT! 6! Fluorescence in situ Hybridization (FISH)! 7! SNP6 microarray analysis and genome segmentation! 7! Calling copy number alterations! 8! Mapping altered genome segments to genes! 8! Recurrently altered genes with clonal variability! 9! Global analyses of copy number alterations! 9! Phylogenetic analysis of copy number alterations! 10! Microarray analysis! 10! Gene expression differences of TMZ resistant and sensitive clones of GBM-482! 10! Reverse transcription-PCR analyses! 11! Tumor subtype analysis of TMZ-sensitive and resistant clones! 11! Pathway analysis of gene expression in the TMZ-sensitive clone of GBM-482! 11! Supplementary figures and tables" 13" "2 Supplementary information - Meyer, Reimand et al. Table S1: Individual clones from all patient tumors are tumorigenic. ! 14! Fig. S1: clonal tumorigenicity.! 15! Fig. S2: clonal heterogeneity of EGFR and PTEN expression.! 20! Fig. S3: clonal heterogeneity of proliferation.! 21! Fig.
    [Show full text]