A General Statistic Framework for Genome-Based Disease Risk Prediction

Total Page:16

File Type:pdf, Size:1020Kb

A General Statistic Framework for Genome-Based Disease Risk Prediction A General Statistic Framework for Genome-based Disease Risk Prediction Long Ma1, Nan Lin1 and Momiao Xiong1,* 1 Division of Biostatistics, The University of Texas School of Public Health, Houston, TX 77030, USA Running Title: biomarker identification for risk prediction Key Words: genetic variants, clinical utility, sufficient dimension reduction, risk prediction, convex optimization. *Address for correspondence and reprints: Dr. Momiao Xiong, Human Genetics Center, The University of Texas Health Science Center at Houston, P.O. Box 20186, Houston, Texas 77225, (Phone): 713-500-9894, (Fax): 713-500-0900, E-mail: [email protected]. 1 Abstract Fast and more economical next generation sequencing (NGS) technologies will generate unprecedentedly massive and highly-dimensional genomic and epigenomic variation data. In the near future, a routine part of medical records will include the sequenced genomes. How to efficiently extract biomarkers for risk prediction and treatment selection from millions or dozens of millions of genomic variants raises a great challenge. Traditional paradigms for identifying variants of clinical validity are to test association of the variants. However, significantly associated genetic variants may or may not be usefulness for diagnosis and prognosis of diseases. Alternative to association studies for finding genetic variants of predictive utility is to systematically search variants that contain sufficient information for phenotype prediction. To achieve this, we introduce concepts of sufficient dimension reduction (SDR) and coordinate hypothesis which project the original high dimensional data to very low dimensional space while preserving all information on response phenotypes. We then formulate a clinically significant genetic variant discovery problem into the sparse SDR problem and develop algorithms that can select significant genetic variants from up to or even ten millions of predictors with the aid of a split-and-conquer approach. The sparse SDR is in turn formulated as a sparse optimal scoring problem, but with penalty which can remove row vectors from the basis matrix. To speed up computation, we apply the alternating direction method of multipliers to solving the sparse optimal scoring problem which can easily be implemented in parallel. To illustrate its application, the proposed method is applied to a coronary artery disease (CAD) dataset from the Wellcome Trust Case Control Consortium (WTCCC) study, Rheumatoid Arthritis (RA) dataset from the GWAS of North American Rheumatoid Arthritis Consortium (NARAC) and the early- onset myocardial infarction (EOMI) exome sequence datasets which have European origin from the NHLBI’s Exome Sequencing Project. 2 Introduction Although there is heated debate about whether DNA variation has value to predict diseases1-4, genetic variations that are being generated by next generation sequencing (NGS) technologies will provide invaluable information for disease prediction and prevention5. However, how to efficiently extract genetic variants for risk prediction and treatment selection from millions or dozens of millions of genomic variants is among the biggest challenges in genomic research and medicine6. Innovative approaches should be developed to address these challenges. Traditional paradigms for identifying variants of clinical validity test association of the variants7. The variants are ranked by P-values or odds ratios. A few “top-SNPs” with the smallest P-values are then used to predict disease risk of individuals8,9. However, most variants have small effect sizes and the often low to moderate heritability of common diseases. Many associated genetic variants, on their own, have little or no predictive utility. The studies based on a few number of the loci demonstrate limited predictive value of disease10-14. The best predictors may not always be at the top of the list and variable selection by P-values will miss many important predictors8. To overcome this limitation, rather than using a few highly significant SNPs from genome- wide association studies (GWAS), some researchers use penalized techniques such as the penalized pseudo-likelihood approaches and the bridge regression15, the LASSO16, the SCAD and other folded-concave penalties17. The Dantzig selector18 and their related methods19 have recently been developed to directly search a set of SNPs for disease risk prediction from a large number of genetic variants20 with mild success. 3 The widely used methods for disease risk prediction have several serious limitations. Dimension reduction in which we attempt to identify linear or nonlinear combinations of the original set of variables or chose a subset of variables from the original set21 is an essential issue for genome-based prediction of common diseases. The first limitation of the current methods for disease risk prediction is to use unsupervised dimension reduction in which we fail to make use of the phenotype information. The second limitation of these methods is their lack of algorithms to systematically search millions of genetic variants for disease risk prediction. We observe that the maximum number of features which the LASSO type methods are predictive is less than 50,000. Simple application of the penalized pseudo-likelihood approaches to millions or dozen of millions of features is infeasible. How to select an informative subset of features from millions of variables is still an open question. The genome-based prediction of common diseases often demands extremely intensive computations. A practical solution is to employ the power of parallel computing. The third limitation is that it is difficult to adapt the current methods for risk prediction to parallel computing. The purpose of this paper is to address the limitations of these traditional methods for disease risk prediction. Dimension reduction and variable selection are a dominant theme in the genome-based disease risk prediction. To overcome the most important limitation of the currently used unsupervised dimension reduction in disease risk prediction, which will lose individual disease status information, we propose a supervised dimension reduction method in which both genetic variant information and disease information will be used. An essential concept in supervised dimension reduction is sufficient dimension reduction (SDR) and testing of the coordinate relation hypothesis22,23. Since the dimension of genomic variation data is extremely high and the most genetic variants across the genome are irrelevant with disease, we 4 replace the original predictor vector (genetic variants) with its prediction onto a low dimensional space of the predictor space –which results in a minimal set of linear combinations of the original predictors, without loss of information on disease or response24,25. Such low dimensional space is called dimension reduction subspace22. Minimum dimension reduction subspace is referred to as central subspace (CS). All information of the predictors about disease or response in the genetic dataset is contained in the central subspace. The central subspace provides sufficient information to predict response or risk. We only need to study relationships between responses (disease risk) and variables in the central space regardless of the original predictors. The number of methods including sliced inverse regression (SIR)26, sliced average variance estimation27, principal Hessian direction25,28, directional regression29, and likelihood-based SDR30 have been developed to estimate the central subspace. Although great progress in SDR has been made the most current methods for dimension reduction still have serious limitation in their practice applications. The first limitation is that the current popular methods for the SDR require the inverse of the sample covariance matrix of the predictors which are often near singular for large number of predictors31. Second, the CS estimation needs to use all the original predictors. Therefore, the estimations are not efficient due to the presence of a lot of irrelevant and noise data and the results are difficult to interpret32. These methods cannot be used to identify genetic variants of clinical significance. In the past several years, the variable selection methods have been explored in the SDR analysis to address these two problems. The variable selection methods for the SDR include model-free variable selection33, the shrinkage sliced inverse regression34, sparse SDR35, constrained canonical correlation and variable filtering for dimension reduction36. The common feature of these methods is that the identification of CS and variable selection are separated. At 5 the first step, the relevant SDR methods are used to estimate the CS. At the second step, the regularization methods are applied to the estimated basis vector forming the CS to select predictors. In addition, these methods still cannot solve the singularity problem of the covariance matrix of the predictors if at the first stage all predictors are used for SDR. Chen et al.32 proposed coordinate-independent sparse sufficient dimension reduction and variable selection for SDR which can simultaneously reduce dimension and select variables. However, they need to partition all response variables into slices and still require the inverse of the sample covariance matrix of the predictors. As a consequence, their method can only solve SDR with a small number of predictors. To avoid the inverse of covariance of predictors, Wang and Zhu31 used the optimal scoring
Recommended publications
  • Genetic Variation Across the Human Olfactory Receptor Repertoire Alters Odor Perception
    bioRxiv preprint doi: https://doi.org/10.1101/212431; this version posted November 1, 2017. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. Genetic variation across the human olfactory receptor repertoire alters odor perception Casey Trimmer1,*, Andreas Keller2, Nicolle R. Murphy1, Lindsey L. Snyder1, Jason R. Willer3, Maira Nagai4,5, Nicholas Katsanis3, Leslie B. Vosshall2,6,7, Hiroaki Matsunami4,8, and Joel D. Mainland1,9 1Monell Chemical Senses Center, Philadelphia, Pennsylvania, USA 2Laboratory of Neurogenetics and Behavior, The Rockefeller University, New York, New York, USA 3Center for Human Disease Modeling, Duke University Medical Center, Durham, North Carolina, USA 4Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, North Carolina, USA 5Department of Biochemistry, University of Sao Paulo, Sao Paulo, Brazil 6Howard Hughes Medical Institute, New York, New York, USA 7Kavli Neural Systems Institute, New York, New York, USA 8Department of Neurobiology and Duke Institute for Brain Sciences, Duke University Medical Center, Durham, North Carolina, USA 9Department of Neuroscience, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania, USA *[email protected] ABSTRACT The human olfactory receptor repertoire is characterized by an abundance of genetic variation that affects receptor response, but the perceptual effects of this variation are unclear. To address this issue, we sequenced the OR repertoire in 332 individuals and examined the relationship between genetic variation and 276 olfactory phenotypes, including the perceived intensity and pleasantness of 68 odorants at two concentrations, detection thresholds of three odorants, and general olfactory acuity.
    [Show full text]
  • Database Tool the Systematic Annotation of the Three Main GPCR
    Database, Vol. 2010, Article ID baq018, doi:10.1093/database/baq018 ............................................................................................................................................................................................................................................................................................. Database tool The systematic annotation of the three main Downloaded from https://academic.oup.com/database/article-abstract/doi/10.1093/database/baq018/406672 by guest on 15 January 2019 GPCR families in Reactome Bijay Jassal1, Steven Jupe1, Michael Caudy2, Ewan Birney1, Lincoln Stein2, Henning Hermjakob1 and Peter D’Eustachio3,* 1European Bioinformatics Institute, Hinxton, Cambridge, CB10 1SD, UK, 2Ontario Institute for Cancer Research, Toronto, ON M5G 0A3, Canada and 3New York University School of Medicine, New York, NY 10016, USA *Corresponding author: Tel: +212 263 5779; Fax: +212 263 8166; Email: [email protected] Submitted 14 April 2010; Revised 14 June 2010; Accepted 13 July 2010 ............................................................................................................................................................................................................................................................................................. Reactome is an open-source, freely available database of human biological pathways and processes. A major goal of our work is to provide an integrated view of cellular signalling processes that spans from ligand–receptor
    [Show full text]
  • Table S1 the Four Gene Sets Derived from Gene Expression Profiles of Escs and Differentiated Cells
    Table S1 The four gene sets derived from gene expression profiles of ESCs and differentiated cells Uniform High Uniform Low ES Up ES Down EntrezID GeneSymbol EntrezID GeneSymbol EntrezID GeneSymbol EntrezID GeneSymbol 269261 Rpl12 11354 Abpa 68239 Krt42 15132 Hbb-bh1 67891 Rpl4 11537 Cfd 26380 Esrrb 15126 Hba-x 55949 Eef1b2 11698 Ambn 73703 Dppa2 15111 Hand2 18148 Npm1 11730 Ang3 67374 Jam2 65255 Asb4 67427 Rps20 11731 Ang2 22702 Zfp42 17292 Mesp1 15481 Hspa8 11807 Apoa2 58865 Tdh 19737 Rgs5 100041686 LOC100041686 11814 Apoc3 26388 Ifi202b 225518 Prdm6 11983 Atpif1 11945 Atp4b 11614 Nr0b1 20378 Frzb 19241 Tmsb4x 12007 Azgp1 76815 Calcoco2 12767 Cxcr4 20116 Rps8 12044 Bcl2a1a 219132 D14Ertd668e 103889 Hoxb2 20103 Rps5 12047 Bcl2a1d 381411 Gm1967 17701 Msx1 14694 Gnb2l1 12049 Bcl2l10 20899 Stra8 23796 Aplnr 19941 Rpl26 12096 Bglap1 78625 1700061G19Rik 12627 Cfc1 12070 Ngfrap1 12097 Bglap2 21816 Tgm1 12622 Cer1 19989 Rpl7 12267 C3ar1 67405 Nts 21385 Tbx2 19896 Rpl10a 12279 C9 435337 EG435337 56720 Tdo2 20044 Rps14 12391 Cav3 545913 Zscan4d 16869 Lhx1 19175 Psmb6 12409 Cbr2 244448 Triml1 22253 Unc5c 22627 Ywhae 12477 Ctla4 69134 2200001I15Rik 14174 Fgf3 19951 Rpl32 12523 Cd84 66065 Hsd17b14 16542 Kdr 66152 1110020P15Rik 12524 Cd86 81879 Tcfcp2l1 15122 Hba-a1 66489 Rpl35 12640 Cga 17907 Mylpf 15414 Hoxb6 15519 Hsp90aa1 12642 Ch25h 26424 Nr5a2 210530 Leprel1 66483 Rpl36al 12655 Chi3l3 83560 Tex14 12338 Capn6 27370 Rps26 12796 Camp 17450 Morc1 20671 Sox17 66576 Uqcrh 12869 Cox8b 79455 Pdcl2 20613 Snai1 22154 Tubb5 12959 Cryba4 231821 Centa1 17897
    [Show full text]
  • Human Artificial Chromosome (Hac) Vector
    Europäisches Patentamt *EP001559782A1* (19) European Patent Office Office européen des brevets (11) EP 1 559 782 A1 (12) EUROPEAN PATENT APPLICATION published in accordance with Art. 158(3) EPC (43) Date of publication: (51) Int Cl.7: C12N 15/09, C12N 1/15, 03.08.2005 Bulletin 2005/31 C12N 1/19, C12N 1/21, C12N 5/10, C12P 21/02 (21) Application number: 03751334.8 (86) International application number: (22) Date of filing: 03.10.2003 PCT/JP2003/012734 (87) International publication number: WO 2004/031385 (15.04.2004 Gazette 2004/16) (84) Designated Contracting States: • KATOH, Motonobu, Tottori University AT BE BG CH CY CZ DE DK EE ES FI FR GB GR Yonago-shi, Tottori 683-8503 (JP) HU IE IT LI LU MC NL PT RO SE SI SK TR • TOMIZUKA, Kazuma, Designated Extension States: Kirin Beer Kabushiki Kaisha AL LT LV MK Takashi-shi, Gunma 370-1295 (JP) • KUROIWA, Yoshimi, (30) Priority: 04.10.2002 JP 2002292853 Kirin Beer Kabushiki Kaisha Takasaki-shi, Gunma 370-1295 (JP) (71) Applicant: KIRIN BEER KABUSHIKI KAISHA • KAKEDA, Minoru, Kirin Beer Kabushiki Kaisha Tokyo 104-8288 (JP) Takasaki-shi, Gunma 370-1295 (JP) (72) Inventors: (74) Representative: HOFFMANN - EITLE • OSHIMURA, Mitsuo, Tottori University Patent- und Rechtsanwälte Yonago-shi, Tottori 683-8503 (JP) Arabellastrasse 4 81925 München (DE) (54) HUMAN ARTIFICIAL CHROMOSOME (HAC) VECTOR (57) The present invention relates to a human arti- ing a cell which expresses foreign DNA. Furthermore, ficial chromosome (HAC) vector and a method for pro- the present invention relates to a method for producing ducing the same.
    [Show full text]
  • HMGB1 in Health and Disease R
    Donald and Barbara Zucker School of Medicine Journal Articles Academic Works 2014 HMGB1 in health and disease R. Kang R. C. Chen Q. H. Zhang W. Hou S. Wu See next page for additional authors Follow this and additional works at: https://academicworks.medicine.hofstra.edu/articles Part of the Emergency Medicine Commons Recommended Citation Kang R, Chen R, Zhang Q, Hou W, Wu S, Fan X, Yan Z, Sun X, Wang H, Tang D, . HMGB1 in health and disease. 2014 Jan 01; 40():Article 533 [ p.]. Available from: https://academicworks.medicine.hofstra.edu/articles/533. Free full text article. This Article is brought to you for free and open access by Donald and Barbara Zucker School of Medicine Academic Works. It has been accepted for inclusion in Journal Articles by an authorized administrator of Donald and Barbara Zucker School of Medicine Academic Works. Authors R. Kang, R. C. Chen, Q. H. Zhang, W. Hou, S. Wu, X. G. Fan, Z. W. Yan, X. F. Sun, H. C. Wang, D. L. Tang, and +8 additional authors This article is available at Donald and Barbara Zucker School of Medicine Academic Works: https://academicworks.medicine.hofstra.edu/articles/533 NIH Public Access Author Manuscript Mol Aspects Med. Author manuscript; available in PMC 2015 December 01. NIH-PA Author ManuscriptPublished NIH-PA Author Manuscript in final edited NIH-PA Author Manuscript form as: Mol Aspects Med. 2014 December ; 0: 1–116. doi:10.1016/j.mam.2014.05.001. HMGB1 in Health and Disease Rui Kang1,*, Ruochan Chen1, Qiuhong Zhang1, Wen Hou1, Sha Wu1, Lizhi Cao2, Jin Huang3, Yan Yu2, Xue-gong Fan4, Zhengwen Yan1,5, Xiaofang Sun6, Haichao Wang7, Qingde Wang1, Allan Tsung1, Timothy R.
    [Show full text]
  • DNA Supercoiling with a Twist Edwin Kamau Louisiana State University and Agricultural and Mechanical College, [email protected]
    Louisiana State University LSU Digital Commons LSU Doctoral Dissertations Graduate School 2005 DNA supercoiling with a twist Edwin Kamau Louisiana State University and Agricultural and Mechanical College, [email protected] Follow this and additional works at: https://digitalcommons.lsu.edu/gradschool_dissertations Recommended Citation Kamau, Edwin, "DNA supercoiling with a twist" (2005). LSU Doctoral Dissertations. 999. https://digitalcommons.lsu.edu/gradschool_dissertations/999 This Dissertation is brought to you for free and open access by the Graduate School at LSU Digital Commons. It has been accepted for inclusion in LSU Doctoral Dissertations by an authorized graduate school editor of LSU Digital Commons. For more information, please [email protected]. DNA SUPERCOILING WITH A TWIST A Dissertation Submitted to the Graduate Faculty of the Louisiana State University and Agricultural and Mechanical College in partial fulfillment of the requirements for the degree of Doctor of Philosophy in The Department of Biological Sciences By Edwin Kamau B.S. Horticulture, Egerton University, Kenya, 1996, May, 2005 DEDICATIONS I would like to dedicate this dissertation to my son, Ashford Kimani Kamau. Your age marks the life of my graduate school career. I have been a weekend dad, broke and with enormous financial responsibilities; it kills me to see how sometimes you cry your heart out on Sunday evening when our weekend time together is over. I miss you too every moment, you have encouraged me to be a better person, a better dad so that I can make you proud. At your tender age, I have learned a lot from you. We will soon spend all the time in the world we need together; travel, sports, music, pre-historic discoveries and adventures; the road is wide open.
    [Show full text]
  • Supplementary Methods
    Heterogeneous Contribution of Microdeletions in the Development of Common Generalized and Focal epilepsies. SUPPLEMENTARY METHODS Epilepsy subtype extended description. Genetic Gereralized Epilepsy (GGE): Features unprovoked tonic and/or clonic seizures, originated inconsistently at some focal point within the brain that rapidly generalizes engaging bilateral distributed spikes and waves discharges on the electroencephalogram. This generalization can include cortical and sub cortical structures but not necessarily the entire cortex[1]. GGE is the most common group of epilepsies accounting for 20% of all cases[2]. It is characterized by an age-related onset and a strong familial aggregation and heritability which allows the assumption of a genetic cause. Although genetic associations have been identified, a broad spectrum of causes is acknowledged and remains largely unsolved [3]. Rolandic Epilepsy (RE): Commonly known also as Benign Epilepsy with Centrotemporal Spikes (BECTS), hallmarks early onset diagnosis (mean onset = 7 years old) with brief, focal hemifacial or oropharyngeal sensorimotor seizures alongside speech arrest and secondarily generalized tonic– clonic seizures, which mainly occur during sleep[4]. Rolandic epilepsy features a broad spectrum of less benign related syndromes called atypical Rolandic epilepsy (ARE), including benign partial epilepsy (ABPE), Landau–Kleffner syndrome(LKS) and epileptic encephalopathy with continuous spike-and-waves during sleep (CSWSS)[5]. Together they are the most common childhood epilepsy with a prevalence of 0.2–0.73/1000 (i.e. _1/2500)[6]. Adult Focal Epilepsy (AFE). Focal epilepsy is characterized by sporadic events of seizures originated within a specific brain region and restricted to one hemisphere. Although they can exhibit more than one network of wave discharges on the electroencephalogram, and different degrees of spreading, they feature a consistent site of origin.
    [Show full text]
  • Greg's Awesome Thesis
    Analysis of alignment error and sitewise constraint in mammalian comparative genomics Gregory Jordan European Bioinformatics Institute University of Cambridge A dissertation submitted for the degree of Doctor of Philosophy November 30, 2011 To my parents, who kept us thinking and playing This dissertation is the result of my own work and includes nothing which is the out- come of work done in collaboration except where specifically indicated in the text and acknowledgements. This dissertation is not substantially the same as any I have submitted for a degree, diploma or other qualification at any other university, and no part has already been, or is currently being submitted for any degree, diploma or other qualification. This dissertation does not exceed the specified length limit of 60,000 words as defined by the Biology Degree Committee. November 30, 2011 Gregory Jordan ii Analysis of alignment error and sitewise constraint in mammalian comparative genomics Summary Gregory Jordan November 30, 2011 Darwin College Insight into the evolution of protein-coding genes can be gained from the use of phylogenetic codon models. Recently sequenced mammalian genomes and powerful analysis methods developed over the past decade provide the potential to globally measure the impact of natural selection on pro- tein sequences at a fine scale. The detection of positive selection in particular is of great interest, with relevance to the study of host-parasite conflicts, immune system evolution and adaptive dif- ferences between species. This thesis examines the performance of methods for detecting positive selection first with a series of simulation experiments, and then with two empirical studies in mammals and primates.
    [Show full text]
  • European Patent Office of Opposition to That Patent, in Accordance with the Implementing Regulations
    (19) TZZ Z_T (11) EP 2 884 280 B1 (12) EUROPEAN PATENT SPECIFICATION (45) Date of publication and mention (51) Int Cl.: of the grant of the patent: G01N 33/566 (2006.01) 09.05.2018 Bulletin 2018/19 (21) Application number: 13197310.9 (22) Date of filing: 15.12.2013 (54) Method for evaluating the scent performance of perfumes and perfume mixtures Verfahren zur Bewertung des Duftverhaltens von Duftstoffen und Duftstoffmischungen Procédé d’evaluation de senteur performance du parfums et mixtures de parfums (84) Designated Contracting States: (56) References cited: AL AT BE BG CH CY CZ DE DK EE ES FI FR GB WO-A2-03/091388 GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR • BAGHAEI KAVEH A: "Deorphanization of human olfactory receptors by luciferase and Ca-imaging (43) Date of publication of application: methods.",METHODS IN MOLECULAR BIOLOGY 17.06.2015 Bulletin 2015/25 (CLIFTON, N.J.) 2013, vol. 1003, 19 June 2013 (2013-06-19), pages229-238, XP008168583, ISSN: (73) Proprietor: Symrise AG 1940-6029 37603 Holzminden (DE) • KAVEH BAGHAEI ET AL: "Olfactory receptors coded by segregating pseudo genes and (72) Inventors: odorants with known specific anosmia.", 33RD • Hatt, Hanns ANNUAL MEETING OF THE ASSOCIATION FOR 44789 Bochum (DE) CHEMORECEPTION, 1 April 2011 (2011-04-01), • Gisselmann, Günter XP055111507, 58456 Witten (DE) • TOUHARA ET AL: "Deorphanizing vertebrate • Ashtibaghaei, Kaveh olfactory receptors: Recent advances in 44801 Bochum (DE) odorant-response assays", NEUROCHEMISTRY • Panten, Johannes INTERNATIONAL, PERGAMON PRESS, 37671 Höxter (DE) OXFORD, GB, vol.
    [Show full text]
  • Misexpression of Cancer/Testis (Ct) Genes in Tumor Cells and the Potential Role of Dream Complex and the Retinoblastoma Protein Rb in Soma-To-Germline Transformation
    Michigan Technological University Digital Commons @ Michigan Tech Dissertations, Master's Theses and Master's Reports 2019 MISEXPRESSION OF CANCER/TESTIS (CT) GENES IN TUMOR CELLS AND THE POTENTIAL ROLE OF DREAM COMPLEX AND THE RETINOBLASTOMA PROTEIN RB IN SOMA-TO-GERMLINE TRANSFORMATION SABHA M. ALHEWAT Michigan Technological University, [email protected] Copyright 2019 SABHA M. ALHEWAT Recommended Citation ALHEWAT, SABHA M., "MISEXPRESSION OF CANCER/TESTIS (CT) GENES IN TUMOR CELLS AND THE POTENTIAL ROLE OF DREAM COMPLEX AND THE RETINOBLASTOMA PROTEIN RB IN SOMA-TO- GERMLINE TRANSFORMATION", Open Access Master's Thesis, Michigan Technological University, 2019. https://doi.org/10.37099/mtu.dc.etdr/933 Follow this and additional works at: https://digitalcommons.mtu.edu/etdr Part of the Cancer Biology Commons, and the Cell Biology Commons MISEXPRESSION OF CANCER/TESTIS (CT) GENES IN TUMOR CELLS AND THE POTENTIAL ROLE OF DREAM COMPLEX AND THE RETINOBLASTOMA PROTEIN RB IN SOMA-TO-GERMLINE TRANSFORMATION By Sabha Salem Alhewati A THESIS Submitted in partial fulfillment of the requirements for the degree of MASTER OF SCIENCE In Biological Sciences MICHIGAN TECHNOLOGICAL UNIVERSITY 2019 © 2019 Sabha Alhewati This thesis has been approved in partial fulfillment of the requirements for the Degree of MASTER OF SCIENCE in Biological Sciences. Department of Biological Sciences Thesis Advisor: Paul Goetsch. Committee Member: Ebenezer Tumban. Committee Member: Zhiying Shan. Department Chair: Chandrashekhar Joshi. Table of Contents List of figures .......................................................................................................................v
    [Show full text]
  • WO 2019/068007 Al Figure 2
    (12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) (19) World Intellectual Property Organization I International Bureau (10) International Publication Number (43) International Publication Date WO 2019/068007 Al 04 April 2019 (04.04.2019) W 1P O PCT (51) International Patent Classification: (72) Inventors; and C12N 15/10 (2006.01) C07K 16/28 (2006.01) (71) Applicants: GROSS, Gideon [EVIL]; IE-1-5 Address C12N 5/10 (2006.0 1) C12Q 1/6809 (20 18.0 1) M.P. Korazim, 1292200 Moshav Almagor (IL). GIBSON, C07K 14/705 (2006.01) A61P 35/00 (2006.01) Will [US/US]; c/o ImmPACT-Bio Ltd., 2 Ilian Ramon St., C07K 14/725 (2006.01) P.O. Box 4044, 7403635 Ness Ziona (TL). DAHARY, Dvir [EilL]; c/o ImmPACT-Bio Ltd., 2 Ilian Ramon St., P.O. (21) International Application Number: Box 4044, 7403635 Ness Ziona (IL). BEIMAN, Merav PCT/US2018/053583 [EilL]; c/o ImmPACT-Bio Ltd., 2 Ilian Ramon St., P.O. (22) International Filing Date: Box 4044, 7403635 Ness Ziona (E.). 28 September 2018 (28.09.2018) (74) Agent: MACDOUGALL, Christina, A. et al; Morgan, (25) Filing Language: English Lewis & Bockius LLP, One Market, Spear Tower, SanFran- cisco, CA 94105 (US). (26) Publication Language: English (81) Designated States (unless otherwise indicated, for every (30) Priority Data: kind of national protection available): AE, AG, AL, AM, 62/564,454 28 September 2017 (28.09.2017) US AO, AT, AU, AZ, BA, BB, BG, BH, BN, BR, BW, BY, BZ, 62/649,429 28 March 2018 (28.03.2018) US CA, CH, CL, CN, CO, CR, CU, CZ, DE, DJ, DK, DM, DO, (71) Applicant: IMMP ACT-BIO LTD.
    [Show full text]
  • Sean Raspet – Molecules
    1. Commercial name: Fructaplex© IUPAC Name: 2-(3,3-dimethylcyclohexyl)-2,5,5-trimethyl-1,3-dioxane SMILES: CC1(C)CCCC(C1)C2(C)OCC(C)(C)CO2 Molecular weight: 240.39 g/mol Volume (cubic Angstroems): 258.88 Atoms number (non-hydrogen): 17 miLogP: 4.43 Structure: Biological Properties: Predicted Druglikenessi: GPCR ligand -0.23 Ion channel modulator -0.03 Kinase inhibitor -0.6 Nuclear receptor ligand 0.15 Protease inhibitor -0.28 Enzyme inhibitor 0.15 Commercial name: Fructaplex© IUPAC Name: 2-(3,3-dimethylcyclohexyl)-2,5,5-trimethyl-1,3-dioxane SMILES: CC1(C)CCCC(C1)C2(C)OCC(C)(C)CO2 Predicted Olfactory Receptor Activityii: OR2L13 83.715% OR1G1 82.761% OR10J5 80.569% OR2W1 78.180% OR7A2 77.696% 2. Commercial name: Sylvoxime© IUPAC Name: N-[4-(1-ethoxyethenyl)-3,3,5,5tetramethylcyclohexylidene]hydroxylamine SMILES: CCOC(=C)C1C(C)(C)CC(CC1(C)C)=NO Molecular weight: 239.36 Volume (cubic Angstroems): 252.83 Atoms number (non-hydrogen): 17 miLogP: 4.33 Structure: Biological Properties: Predicted Druglikeness: GPCR ligand -0.6 Ion channel modulator -0.41 Kinase inhibitor -0.93 Nuclear receptor ligand -0.17 Protease inhibitor -0.39 Enzyme inhibitor 0.01 Commercial name: Sylvoxime© IUPAC Name: N-[4-(1-ethoxyethenyl)-3,3,5,5tetramethylcyclohexylidene]hydroxylamine SMILES: CCOC(=C)C1C(C)(C)CC(CC1(C)C)=NO Predicted Olfactory Receptor Activity: OR52D1 71.900% OR1G1 70.394% 0R52I2 70.392% OR52I1 70.390% OR2Y1 70.378% 3. Commercial name: Hyperflor© IUPAC Name: 2-benzyl-1,3-dioxan-5-one SMILES: O=C1COC(CC2=CC=CC=C2)OC1 Molecular weight: 192.21 g/mol Volume
    [Show full text]