Novel P-Value Combination Methods for Signal Detection in Large-Scale Data Analysis

Total Page:16

File Type:pdf, Size:1020Kb

Novel P-Value Combination Methods for Signal Detection in Large-Scale Data Analysis Novel P-value Combination Methods for Signal Detection in Large-Scale Data Analysis by Hong Zhang A Dissertation Submitted to the Faculty of the WORCESTER POLYTECHNIC INSTITUTE in partial fulfillment of the requirements for the Degree of Doctor of Philosophy in Mathematical Sciences April 2018 c 2018 - Hong Zhang All rights reserved. Thesis advisor Author Zheyang Wu Hong Zhang Novel P-value Combination Methods for Signal Detection in Large-Scale Data Analysis Abstract In this dissertation, we first study the distributional properties of gGOF, a family of maximum based goodness-of-fit statistical tests and we propose TFisher, a new family of aggregation based tests that generalize and optimize the classic Fisher's p-value combination method. The robust data-adaptive versions of these tests are proposed to reduce the sensitivity of statistical power to different signal patterns. We also develop analytical algorithms to efficiently find the p-values of both tests under arbitrary correlation structures so that these optimal methods are not only powerful but also computationally feasible for analyzing large-scale correlated data. Both families of tests are successfully applied to detect the joint genetic effect of human complex diseases by analyzing genome-wide association study (GWAS) data and whole exome sequencing data. In Chapter 1, we study analytical distribution calculations for gGOF statistics, which covers the optimal tests, φ-divergence statistics, under arbitrary independent and continuous H0 and H1 models. Comparing with a rich literature of analytical p-value calculations, this work possesses advantages in its generality, accuracy, and computational simplicity. We also provide a general data analysis framework to apply gGOF statistics into SNP-set based GWAS for either quantitative or categorical traits. An application to Crohn's disease study shows that these optimal tests do have a good potential for detecting novel disease genes. iii Abstract iv In Chapter 2, we address the issue for gGOF under general settings of corre- lated data. We provide a novel p-value calculation approach which often possess an improved accuracy than commonly used moment-matching approach under various correlation structures. We also propose a strategy of combining innovated trans- formation and gGOF statistics, called igGOF. Furthermore, igGOF allows a nature double-ominbus test, called digGOF, which adapt both the functional of statistics and the truncation of input p-values to unknown data and signal patterns. We applied the tests in genetic association studies, both by simulations and a real exome-sequencing data analysis of amyotrophic lateral sclerosis (ALS). In Chapter 3, we propose a unifying family of Fisher's p-value combination statis- tics, called TFisher, with general p-value truncation and weighting schemes. Analyt- ical calculations for the p-value and the statistical power of TFisher under general hypotheses are given. A soft-thresholding scheme is shown to be optimal for signal detection in a large space of signal patterns. When prior information of signal pattern is unavailable, an omnibus test, oTFisher, can adapt to the given data. Simulations evidenced the accuracy of calculations and validated the theoretical properties. The TFisher tests were applied to analyzing a whole exome sequencing data of ALS. In Chapter 4, we propose an approximation for the p-value of TFisher and oT- Fisher for analyzing correlated data with general correlation structures. The methods extend the Brown's method [1] to a more general Gamma distribution. An analytical approximation of the variance of TFisher is also provided. Numerical results show that both approximations are accurate. Contents Title Page....................................i Abstract..................................... iii Table of Contents................................v 1 Distributions and Statistical Power of Optimal Signal-Detection Meth- ods In Finite Cases1 1.1 Introduction................................2 1.2 The gGOF Family for Weak-Sparse Signals.............. 10 1.3 Analytical Results............................. 14 1.4 Numerical Results............................. 23 1.5 A Framework for GWAS And Application to Crohn's Disease Study. 34 1.6 Discussion................................. 38 2 digGOF: Double-Omnibus Innovated Goodness-Of-Fit Tests For De- pendent Data Analysis 40 2.1 Introduction................................ 41 2.2 Models of Hypotheses........................... 46 2.3 The gGOF Family Under Dependence................. 50 2.4 Innovated Transformation........................ 56 2.5 Numerical Studies............................. 66 2.6 Application to Genome-wide Association Study............ 85 2.7 Discussion................................. 88 3 TFisher Tests: Optimal and Adaptive Thresholding for Combining p-Values 91 3.1 Introduction................................ 92 3.2 TFisher Tests and Hypotheses...................... 95 3.3 TFisher Distribution Under H0 ..................... 100 3.4 TFisher Distribution Under General H1 ................. 104 3.5 Asymptotic Optimality for Signal Detection.............. 106 3.6 Statistical Power Comparison For Signal Detection.......... 119 v Contents vi 3.7 ALS Exome-seq Data Analysis...................... 124 3.8 Discussion................................. 129 4 TFisher Distribution Under Dependent Input Statistics 131 4.1 Introduction................................ 132 4.2 Approximate Null Distribution of TFisher............... 133 4.3 Numerical Results............................. 136 4.4 Extension to oTFisher.......................... 143 A Proofs of Chapter1 156 B Proofs of Chapter2 169 C Proofs of Chapter3 177 D Proofs of Chapter4 186 Chapter 1 Distributions and Statistical Power of Optimal Signal-Detection Methods In Finite Cases 1 Chapter 1: gGOF under Independence 2 1.1 Introduction In big data analysis, signals are often buried within a large amount of noises and are thus relatively weak and sparse. Developing optimal tests for detecting weak- sparse signals is important for many data-driven scientific researches. For example, in large-scale genetic association studies millions of genetic variants are quarried. Only a relatively small proportion are expected to be truly associated with a given disease, and most genetic effects are relatively weak especially comparing with the cumulative noise level of high-throughput data [2,3]. In recent years, theoretical studies have revealed a collection of asymptotically optimal tests in the sense that they can reach the boundary of detectable region. In other words, they are capable of detecting the signals at an intensity level that is minimally required for any statistics to detect reliably. In particular, the Higher Criticism (HC) type tests [4,5,6,7], the Berk-Jones (B-J) type tests [8], the φ-divergence type tests [9], etc., have been proven to share the same property of such asymptotic optimality. Despite these exciting progresses, some practical questions still remain to be an- swered. In particular, the asymptotic optimality is mostly proven under theoretical assumptions such as mixture model and testing group size n ! 1 [4,9, 10, 11]. For real data analysis, however, the hypothesis model could be more arbitrary and n is al- ways finite (even small or moderate). Optimal tests that are equivalent in asymptotic sense could in fact perform quite differently. In order to apply these optimal tests to real data analysis as well as to make an appropriate choice based on signal patterns, it is important to analytically calculate p-values as well as statistical power under more general and realistic assumptions. Moreover, in order to better understand the Chapter 1: gGOF under Independence 3 performance of relevant tests in a broader context, it is beneficial to study a generic statistic family that unifies the common style of these test statistics. This chapter is to address these issues. 1.1.1 Scope of The Work This chapter considers a generic family of goodness-of-fit (GOF) test statistics, called gGOF. Following the essential idea of GOF tests, a gGOF statistic is loosely defined as a summary statistic based on the maximal contrast between ordered input p-values and their expectations under the null hypothesis. Let P1; :::; Pn, for given n > 1, be a set of input p-values, and P(1) ≤ ::: ≤ P(n) be the ordered. A gGOF statistic is defined as the supremum of a generic contrast function f over a truncation domain R: i \ Sn;R = sup f( ;P(i)); where R = fi : k0 ≤ i ≤ k1g fP(i) : α0 < P(i) < α1g; (1.1) R n for given k0 ≤ k1 2 f1; :::; ng and α0 ≤ α1 2 [0; 1]. The gGOF requires the null hypothesis i:i:d: H0 : Pi ∼ Uniform[0; 1]; i = 1; :::; n; (1.2) i so that the null expectation E(P(i)) = n+1 . If the null is untrue, P(i) will differ from their null expectations, which is to be captured by the contrast function f. In practice, smaller p-values indicate signals or positive outcomes. Therefore f(x; y) can be any monotonically decreasing function in y at fixed x, so that the smaller the input p-values, the larger the statistic and the stronger the evidence against the null i i hypothesis. In other words, gGOF is one-sided in nature. Note that n , instead of n+1 , Chapter 1: gGOF under Independence 4 is used to represent the null mean of P(i) following the tradition of GOF definition. In fact, either fraction can be used and they impose no practical difference. For both theoretical and practical reasons, it is important to allow a general trun- cation domain R to restrict both the index i and the magnitude of the p-values P(i). Aside the benefit of computational efficiency (e.g., big input p-values can be safely truncated as they are likely not signals), some gGOF statistics could be improved by excluding too small p-values. For example, HC could have the long-tail problem under the null due to small p-values, while removing P(1);P(2), etc., may not guar- antee the fix [4, 12]. It is better to directly restrict the magnitude of the p-values, and thus a modified version of HC was created with R = f1 < i ≤ n=2;P(i) ≥ 1=ng [4, 13]. The significant influence of restricting P(i) ≥ 1=n under finite n is further demonstrated in Section 1.4.
Recommended publications
  • WO 2014/071279 A2 8 May 20 14 (08.05.2014) W P O P C T
    (12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) (19) World Intellectual Property Organization International Bureau (10) International Publication Number (43) International Publication Date WO 2014/071279 A2 8 May 20 14 (08.05.2014) W P O P C T (51) International Patent Classification: (74) Agent: McCLELLAN, Kelly Brett; Genomic Health, C12Q 1/68 (2006.01) Inc., 301 Penobscot Drive, Redwood City, California 94063 (US). (21) International Application Number: PCT/US2013/068236 (81) Designated States (unless otherwise indicated, for every kind of national protection available): AE, AG, AL, AM, (22) International Filing Date: AO, AT, AU, AZ, BA, BB, BG, BH, BN, BR, BW, BY, 4 November 2013 (04.1 1.2013) BZ, CA, CH, CL, CN, CO, CR, CU, CZ, DE, DK, DM, (25) Filing Language: English DO, DZ, EC, EE, EG, ES, FI, GB, GD, GE, GH, GM, GT, HN, HR, HU, ID, IL, IN, IR, IS, JP, KE, KG, KN, KP, KR, (26) Publication Language: English KZ, LA, LC, LK, LR, LS, LT, LU, LY, MA, MD, ME, (30) Priority Data: MG, MK, MN, MW, MX, MY, MZ, NA, NG, NI, NO, NZ, 61/722,634 5 November 2012 (05. 11.2012) US OM, PA, PE, PG, PH, PL, PT, QA, RO, RS, RU, RW, SA, 61/766,561 19 February 2013 (19.02.2013) US SC, SD, SE, SG, SK, SL, SM, ST, SV, SY, TH, TJ, TM, TN, TR, TT, TZ, UA, UG, US, UZ, VC, VN, ZA, ZM, (71) Applicant: GENOMIC HEALTH, INC. [US/US]; 301 ZW. Penobscot Drive, Redwood City, California 94063 (US).
    [Show full text]
  • Caractérisation De Nouveaux Gènes Et Polymorphismes Potentiellement Impliqués Dans Les Interactions Hôtes-Pathogènes
    Aix-Marseille Université, Faculté de Médecine de Marseille Ecole Doctorale des Sciences de la Vie et de la Santé THÈSE DE DOCTORAT Présentée par Charbel ABOU-KHATER Date et lieu de naissance: 08-Juilllet-1990, Zahlé, LIBAN En vue de l’obtention du grade de Docteur de l’Université d’Aix-Marseille Mention: Biologie, Spécialité: Microbiologie Caractérisation de nouveaux gènes et polymorphismes potentiellement impliqués dans les interactions hôtes-pathogènes Publiquement soutenue le 5 Juillet 2017 devant le jury composé de : Pr. Daniel OLIVE Directeur de Thèse Pr. Brigitte CROUAU-ROY Rapporteur Dr. Benoît FAVIER Rapporteur Dr. Pierre PONTAROTTI Examinateur Thèse codirigée par Pr. Daniel OLIVE et Dr Laurent ABI-RACHED Laboratoires d’accueil URMITE Research Unit on Emerging Infectious and Tropical Diseases, UMR 6236, Faculty of Medicine, 27, Boulevard Jean Moulin, 13385 Marseille, France CRCM, Centre de Recherche en Cancérologie de Marseille,Inserm 1068, 27 Boulevard Leï Roure, BP 30059, 13273 Marseille Cedex 09, France 2 Acknowledgements First and foremost, praises and thanks to God, Holy Mighty, Holy Immortal, All-Holy Trinity, for His showers of blessings throughout my whole life and to whom I owe my very existence. Glory to the Father, and to the Son, and to the Holy Spirit: now and ever and unto ages of ages. I would like to express my sincere gratitude to my advisors Prof. Daniel Olive and Dr. Laurent Abi-Rached, for the continuous support, for their patience, motivation, and immense knowledge. Someday, I hope to be just like you. A special thanks to my “Godfather” who perfectly fulfilled his role, Dr.
    [Show full text]
  • Embedded Human Breast Cancer Tissues Reveals a Link to Tumor Progression
    Fusion Transcript Discovery in Formalin-Fixed Paraffin- Embedded Human Breast Cancer Tissues Reveals a Link to Tumor Progression Yan Ma, Ranjana Ambannavar, James Stephans, Jennie Jeong, Andrew Dei Rossi, Mei-Lan Liu, Adam J. Friedman, Jason J. Londry, Richard Abramson, Ellen M. Beasley, Joffre Baker, Samuel Levy, Kunbin Qu* Genomic Health Inc., Redwood City, California, United States of America Abstract The identification of gene fusions promises to play an important role in personalized cancer treatment decisions. Many rare gene fusion events have been identified in fresh frozen solid tumors from common cancers employing next-generation sequencing technology. However the ability to detect transcripts from gene fusions in RNA isolated from formalin-fixed paraffin-embedded (FFPE) tumor tissues, which exist in very large sample repositories for which disease outcome is known, is still limited due to the low complexity of FFPE libraries and the lack of appropriate bioinformatics methods. We sought to develop a bioinformatics method, named gFuse, to detect fusion transcripts in FFPE tumor tissues. An integrated, cohort based strategy has been used in gFuse to examine single-end 50 base pair (bp) reads generated from FFPE RNA-Sequencing (RNA-Seq) datasets employing two breast cancer cohorts of 136 and 76 patients. In total, 118 fusion events were detected transcriptome-wide at base-pair resolution across the 212 samples. We selected 77 candidate fusions based on their biological relevance to cancer and supported 61% of these using TaqMan assays. Direct sequencing of 19 of the fusion sequences identified by TaqMan confirmed them. Three unique fused gene pairs were recurrent across the 212 patients with 6, 3, 2 individuals harboring these fusions respectively.
    [Show full text]
  • 82712428.Pdf
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Elsevier - Publisher Connector Available online at www.sciencedirect.com Genomics 90 (2007) 595–609 www.elsevier.com/locate/ygeno Identification of six putative human transporters with structural similarity to the drug transporter SLC22 family ⁎ Josefin A. Jacobsson, Tatjana Haitina, Jonas Lindblom, Robert Fredriksson Department of Neuroscience, Unit of Pharmacology, Uppsala University, BMC, Uppsala SE 75124, Sweden Received 5 February 2007; accepted 24 March 2007 Available online 22 August 2007 Abstract The solute carrier family 22 (SLC22) is a large family of organic cation and anion transporters. These are transmembrane proteins expressed predominantly in kidneys and liver and mediate the uptake and excretion of environmental toxins, endogenous substances, and drugs from the body. Through a comprehensive database search we identified six human proteins not yet cloned or annotated in the reference sequence databases. Five of these belong to the SLC22 family, SLC22A20, SLC22A23, SLC22A24, SLC22A25, and SPNS3, and the sixth gene, SVOPL, is a paralog to the synaptic vesicle protein SVOP. We identified the orthologs for these genes in mouse and rat and additional homologous proteins and performed the first phylogenetic analysis on the entire SLC22 family in human, mouse, and rat. In addition, we performed a phylogenetic analysis which showed that SVOP and SV2A-C are, in a comparison with all vertebrate proteins, most similar to the SLC22 family. Finally, we performed a tissue localization study on 15 genes on a panel of 30 rat tissues using quantitative real-time polymerase chain reaction.
    [Show full text]
  • Unraveling the Functional Role of the Orphan Solute Carrier, SLC22A24 in the Transport of Steroid Conjugates Through Metabolomic and Genome-Wide Association Studies
    UCSF UC San Francisco Previously Published Works Title Unraveling the functional role of the orphan solute carrier, SLC22A24 in the transport of steroid conjugates through metabolomic and genome-wide association studies. Permalink https://escholarship.org/uc/item/2383h9q2 Journal PLoS genetics, 15(9) ISSN 1553-7390 Authors Yee, Sook Wah Stecula, Adrian Chien, Huan-Chieh et al. Publication Date 2019-09-25 DOI 10.1371/journal.pgen.1008208 Peer reviewed eScholarship.org Powered by the California Digital Library University of California RESEARCH ARTICLE Unraveling the functional role of the orphan solute carrier, SLC22A24 in the transport of steroid conjugates through metabolomic and genome-wide association studies 1☯ 1☯ 1 1 2 Sook Wah Yee , Adrian Stecula , Huan-Chieh Chien , Ling Zou , Elena V. FeofanovaID , 1 1 3,4 5 Marjolein van BorselenID , Kit Wun Kathy Cheung , Noha A. Yousri , Karsten Suhre , Jason M. Kinchen6, Eric Boerwinkle2,7, Roshanak Irannejad8, Bing Yu2, Kathleen 1,9 a1111111111 M. GiacominiID * a1111111111 a1111111111 1 Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, California, United States of America, 2 Human Genetics Center, University of Texas Health Science Center at Houston, a1111111111 Houston, Texas, United States of America, 3 Genetic Medicine, Weill Cornell Medicine-Qatar, Doha, Qatar, a1111111111 4 Computer and Systems Engineering, Alexandria University, Alexandria, Egypt, 5 Physiology and Biophysics, Weill Cornell Medicine-Qatar, Doha, Qatar, 6 Metabolon, Inc, Durham, United States of America, 7 Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, United States of America, 8 The Cardiovascular Research Institute, University of California, San Francisco, California, United States of America, 9 Institute for Human Genetics, University of California San Francisco, California, United States of America OPEN ACCESS Citation: Yee SW, Stecula A, Chien H-C, Zou L, ☯ These authors contributed equally to this work.
    [Show full text]
  • The Porcine Translational Research Database: a Manually Curated, Genomics and Proteomics-Based Research Resource Harry D
    Dawson et al. BMC Genomics (2017) 18:643 DOI 10.1186/s12864-017-4009-7 DATABASE Open Access The porcine translational research database: a manually curated, genomics and proteomics-based research resource Harry D. Dawson1* , Celine Chen1, Brady Gaynor2, Jonathan Shao2 and Joseph F. Urban Jr1 Abstract Background: The use of swine in biomedical research has increased dramatically in the last decade. Diverse genomic- and proteomic databases have been developed to facilitate research using human and rodent models. Current porcine gene databases, however, lack the robust annotation to study pig models that are relevant to human studies and for comparative evaluation with rodent models. Furthermore, they contain a significant number of errors due to their primary reliance on machine-based annotation. To address these deficiencies, a comprehensive literature-based survey was conducted to identify certain selected genes that have demonstrated function in humans, mice or pigs. Results: The process identified 13,054 candidate human, bovine, mouse or rat genes/proteins used to select potential porcine homologs by searching multiple online sources of porcine gene information. The data in the Porcine Translational Research Database ((http://www.ars.usda.gov/Services/docs.htm?docid=6065) is supported by >5800 references, and contains 65 data fields for each entry, including >9700 full length (5′ and 3′) unambiguous pig sequences, >2400 real time PCR assays and reactivity information on >1700 antibodies. It also contains gene and/or protein expression data for >2200 genes and identifies and corrects 8187 errors (gene duplications artifacts, mis-assemblies, mis-annotations, and incorrect species assignments) for 5337 porcine genes.
    [Show full text]
  • 1 Supplementary Material Figure S1. Volcano Plot of Differentially
    Supplementary material Figure S1. Volcano Plot of differentially expressed genes between preterm infants fed own mother’s milk (OMM) or pasteurized donated human milk (DHM). Table S1. The 10 most representative biological processes filtered for enrichment p- value in preterm infants. Biological Processes p-value Quantity of DEG* Transcription, DNA-templated 3.62x10-24 189 Regulation of transcription, DNA-templated 5.34x10-22 188 Transport 3.75x10-17 140 Cell cycle 1.03x10-13 65 Gene expression 3.38x10-10 60 Multicellular organismal development 6.97x10-10 86 1 Protein transport 1.73x10-09 56 Cell division 2.75x10-09 39 Blood coagulation 3.38x10-09 46 DNA repair 8.34x10-09 39 Table S2. Differential genes in transcriptomic analysis of exfoliated epithelial intestinal cells between preterm infants fed own mother’s milk (OMM) and pasteurized donated human milk (DHM). Gene name Gene Symbol p-value Fold-Change (OMM vs. DHM) (OMM vs. DHM) Lactalbumin, alpha LALBA 0.0024 2.92 Casein kappa CSN3 0.0024 2.59 Casein beta CSN2 0.0093 2.13 Cytochrome c oxidase subunit I COX1 0.0263 2.07 Casein alpha s1 CSN1S1 0.0084 1.71 Espin ESPN 0.0008 1.58 MTND2 ND2 0.0138 1.57 Small ubiquitin-like modifier 3 SUMO3 0.0037 1.54 Eukaryotic translation elongation EEF1A1 0.0365 1.53 factor 1 alpha 1 Ribosomal protein L10 RPL10 0.0195 1.52 Keratin associated protein 2-4 KRTAP2-4 0.0019 1.46 Serine peptidase inhibitor, Kunitz SPINT1 0.0007 1.44 type 1 Zinc finger family member 788 ZNF788 0.0000 1.43 Mitochondrial ribosomal protein MRPL38 0.0020 1.41 L38 Diacylglycerol
    [Show full text]
  • Whole-Exome Sequencing Identifies Common and Rare Variant Metabolic
    ARTICLE DOI: 10.1038/s41467-017-01972-9 OPEN Whole-exome sequencing identifies common and rare variant metabolic QTLs in a Middle Eastern population Noha A. Yousri1,2, Khalid A. Fakhro1,3, Amal Robay1, Juan L. Rodriguez-Flores 4, Robert P. Mohney 5, Hassina Zeriri1, Tala Odeh1, Sara Abdul Kader6, Eman K. Aldous7, Gaurav Thareja6, Manish Kumar6, Alya Al-Shakaki1, Omar M. Chidiac1, Yasmin A. Mohamoud7, Jason G. Mezey4, Joel A. Malek 1,7, Ronald G. Crystal4 & Karsten Suhre 6 1234567890 Metabolomics-genome-wide association studies (mGWAS) have uncovered many metabolic quantitative trait loci (mQTLs) influencing human metabolic individuality, though pre- dominantly in European cohorts. By combining whole-exome sequencing with a high- resolution metabolomics profiling for a highly consanguineous Middle Eastern population, we discover 21 common variant and 12 functional rare variant mQTLs, of which 45% are novel altogether. We fine-map 10 common variant mQTLs to new metabolite ratio associations, and 11 common variant mQTLs to putative protein-altering variants. This is the first work to report common and rare variant mQTLs linked to diseases and/or pharmacological targets in a consanguineous Arab cohort, with wide implications for precision medicine in the Middle East. 1 Genetic Medicine, Weill Cornell Medicine-Qatar, PO Box 24144 Doha, Qatar. 2 Computer and Systems Engineering, Alexandria University, Alexandria, Egypt. 3 Sidra Medical Research Center, Department of Human Genetics, PO Box 26999 Doha, Qatar. 4 Genetic Medicine, Weill Cornell Medicine, New York, NY 10065, USA. 5 Metabolon Inc, Durham, NC 27560, USA. 6 Physiology and Biophysics, Weill Cornell Medicine-Qatar, PO Box 24144 Doha, Qatar.
    [Show full text]
  • Identificação De Mutações Germinativas Em Pacientes Com Câncer Papilífero Familiar De Tireoide Através De Análise De Exoma
    UNIVERSIDADE FEDERAL DE MINAS GERAIS PROGRAMA DE PÓS-GRADUAÇÃO EM MEDICINA MOLECULAR IDENTIFICAÇÃO DE MUTAÇÕES GERMINATIVAS EM PACIENTES COM CÂNCER PAPILÍFERO FAMILIAR DE TIREOIDE ATRAVÉS DE ANÁLISE DE EXOMA Débora Chaves Moraes Ribeiro Orientador: Prof. Luiz Armando Cunha de Marco Co-orientadora: Profª. Luciana Bastos Rodrigues Belo Horizonte 2018 IDENTIFICAÇÃO DE MUTAÇÕES GERMINATIVAS EM PACIENTES COM CÂNCER PAPILÍFERO FAMILIAR DE TIREOIDE ATRAVÉS DE ANÁLISE DE EXOMA Débora Chaves Moraes Ribeiro Débora Chaves Moraes Ribeiro IDENTIFICAÇÃO DE MUTAÇÕES GERMINATIVAS EM PACIENTES COM CÂNCER PAPILÍFERO FAMILIAR DE TIREOIDE ATRAVÉS DE ANÁLISE DE EXOMA Tese apresentada ao Programa de Pós-graduação em Medicina Molecular da Universidade Federal de Minas Gerais, como requisito para qualificação no doutorado em Medicina Molecular Área de concentração: Medicina Molecular Linha de pesquisa: Oncologia Orientador: Prof. Dr. Luiz Armando Cunha de Marco Co-Orientadora: Profª. Drª. Luciana Bastos Rodrigues Belo Horizonte Faculdade de Medicina da UFMG 2018 Dedico esse trabalho aos meus pais, ao Eugênio Ribeiro, ao João Guilherme, as minhas irmãs e ao Eduardo Galo. A eles, toda minha admiração, amor e gratidão. i AGRADECIMENTOS A Deus por todas as oportunidades de aprendizado e crescimento, e pela beleza e grandiosidade da vida; Aos meus pais, meus grandes exemplos de integridade, superação, virtude e compaixão, agradeço, com toda admiração e gratidão, pela convivência, força, amor, paciência, carinho e cuidado; Ao Eugênio, meu grande companheiro, pessoa
    [Show full text]