Whole Genome Sequencing of Familial Non-Medullary Thyroid Cancer Identifies Germline Alterations in MAPK/ERK and PI3K/AKT Signaling Pathways

Preprints (www.preprints.org) | NOT PEER-REVIEWED | Posted: 13 October 2019 doi:10.20944/preprints201910.0154.v1 Article Whole genome sequencing of familial non-medullary thyroid cancer identifies germline alterations in MAPK/ERK and PI3K/AKT signaling pathways Aayushi Srivastava 1,2,3,4, Abhishek Kumar 1,5,6, Sara Giangiobbe 1, Elena Bonora 7, Kari Hemminki 1, Asta Försti 1,2,3 and Obul Reddy Bandapalli 1,2,3,* 1 Division of Molecular Genetic Epidemiology, German Cancer Research Center (DKFZ), D-69120, Heidelberg, Germany; [email protected] (A.S.), [email protected] (A.K.); [email protected] (S.G.); [email protected] (K.H.); [email protected] (A.F.) 2 Hopp Children’s Cancer Center (KiTZ), D-69120, Heidelberg, Germany 3 Division of Pediatric Neurooncology, German Cancer Research Center (DKFZ), German Cancer Consortium (DKTK), D-69120, Heidelberg, Germany 4 Medical Faculty, Heidelberg University, D-69120, Heidelberg, Germany 5 Institute of Bioinformatics, International Technology Park, Bangalore, 560066, India 6 Manipal Academy of Higher Education (MAHE), Manipal, Karnataka, 576104, India 7 S.Orsola-Malphigi Hospital, Unit of Medical Genetics,40138, Bologna, Italy ; [email protected] * Correspondence: [email protected]; Tel.: +49-6221-42-1709 Abstract: Evidence of familial inheritance in non-medullary thyroid cancer (NMTC) has accumulated over the last few decades. However, known variants account for a very small percentage of the genetic burden. Here, we focused on the identification of common pathways and networks enriched in NMTC families to better understand its pathogenesis with the final aim of identifying one novel high/moderate-penetrance germline predisposition variant segregating with the disease in each studied family. We performed whole genome sequencing on 23 affected and 3 unaffected family members from five NMTC-prone families and prioritized the identified variants using our Familial Cancer Variant Prioritization Pipeline (FCVPPv2). In total, 31 coding variants and 39 variants located in upstream, downstream, 5′ or 3′ untranslated regions passed FCVPPv2 filtering. Altogether, 210 genes affected by variants that passed the first three steps of the FCVPPv2 were analyzed using Ingenuity Pathway Analysis software. These genes were enriched in tumorigenic signaling pathways mediated by receptor tyrosine kinases and G-protein coupled receptors, implicating a central role of PI3K/AKT and MAPK/ERK signaling in familial NMTC. Our approach can facilitate the identification and functional validation of causal variants in each family as well as the screening and genetic counseling of other individuals at risk of developing NMTC. Keywords: papillary thyroid cancer; germline mutations; whole genome sequencing; predisposition markers; pathway analysis 1. Introduction Thyroid cancer is the most common endocrine malignancy with an age adjusted incidence of 0.5-20/100,000 persons per year [1]. Significant regional differences exist with Italy being among the countries with the highest incidence rates in the world [1]. An increasing incidence has been observed worldwide during the past decades, which can to a certain extent be related to changes in the availability of medical services and in standard clinical practice. On the other hand, regional differences in incidence as well as changes over time may also be related to lifestyle, nutritional iodine, ionizing radiation and genetic factors [2]. For instance, the high incidence of thyroid cancer in Italy can © 2019 by the author(s). Distributed under a Creative Commons CC BY license. Preprints (www.preprints.org) | NOT PEER-REVIEWED | Posted: 13 October 2019 doi:10.20944/preprints201910.0154.v1 2 of 25 be attributed to the disruptive and carcinogenic effect of volcanic environments on the endocrine system [3]. The familial relative risk of developing thyroid cancer is estimated to be increased 6.7-fold in a study based on the Swedish Family-Cancer Database, in which 3.4% of all thyroid cancer cases had a concordant family history [4]. Approximately 90-95% of all thyroid cancers are non-medullary thyroid cancers (NMTC) [5] and can be classified into four histological subtypes: papillary, follicular, Hürthle cell and anaplastic thyroid cancer, with papillary thyroid cancer (PTC) being the most common one. Familial NMTC (FNMTC) accounts for only a small percentage of all NMTCs and can be divided into non-syndromic and syndromic forms. in the first, it occurs as the primary feature and in the second, as a minor component of a familial cancer syndrome, such as familial adenomatous polyposis, Gardner’s syndrome, Cowden’s disease, Carney’s complex type 1, Werner’s syndrome, papillary renal neoplasia, and DICER1 syndrome [6]. Known syndromes explain only a small proportion of all FNMTCs. Unlike the case of familial medullary thyroid cancer, in which there is extensive evidence linking germline point mutations in the RET proto-oncogene to the development of thyroid cancer, the genetic causes for FNMTC remain largely unknown. Over the years, studies seeking genetic factors predisposing to NMTC have been performed using linkage analysis, candidate gene sequencing and recently also whole genome sequencing. These studies have suggested several genes as potential NMTC-predisposing genes, including, FOXE1, SRGAP1, TITF-1/NKX2.1, SRRM2, and HABP2 [7-11]. In addition, an imbalance of the telomere-telomerase complex has been demonstrated in the peripheral blood of familial PTC patients [12]. Nonetheless, NMTC is one of the most heritable cancers wherein first degree relatives of an affected individual have an 8-10-fold increased risk of developing the disease [13]. Therefore, there are many underlying germline mutations that are yet to be discovered. The identification of such predisposition genes could be of great value in the screening of individuals at risk of developing NMTCs as well as in the development of personalized adjuvant therapies based on the affected pathways. It has been observed that hereditary NMTC is characterized by early onset, a higher degree of aggressiveness and more frequent multifocal disease and recurrence compared with its sporadic counterpart [13]. Thus, medical centers recommend more aggressive treatment of affected family members, reinforcing the importance of identifying such cases. Here we report the germline genomic landscape of five families with NMTC aggregation consistent with an autosomal dominant pattern of inheritance. The aim of the current study was to use whole genome sequencing (WGS) data to discern pathways affected in the FNMTC families to facilitate the identification of possible disease-causing high/moderate-penetrance germline variants in each family. With our results, we hope to facilitate genetic counseling and targeted therapy in these families and improve screening of other individuals at risk of developing NMTC. 2. Materials and Methods 2.1. Ethical Approval Blood samples were collected from the participants with informed consent following ethical guidelines approved by “Comitato Etico Indipendente dell ‘Azienda Ospedaliero-Universitaria di Bologna, Policlinico S. Orsola-Malpighi (Bologna, Italy)” and “comité utiltative de protection des personnes dans la recherche biomédicale, Le centre de ute contre le cancer Léon-Bérard (Lyon, France)” (ethical code: 736/2016). 2.2. NMTC Families Five families with NMTC aggregation consistent with an autosomal dominant pattern of inheritance were provided by the S. Orsola-Malpighi Hospital, Unit of Medical Genetics in Bologna, Italy. Samples from a total of 23 affected and 3 unaffected family members from the five families were submitted for WGS. Their respective pedigrees are shown in Figure 1. In family 1, the mother (I-1) was affected by insular carcinoma of the thyroid whereas three of her children and her grandchild were diagnosed with PTC or micro-PTC (II-2, II-3, II-6, III-1) and one child with benign nodules (II-1). Her Preprints (www.preprints.org) | NOT PEER-REVIEWED | Posted: 13 October 2019 doi:10.20944/preprints201910.0154.v1 3 of 25 unaffected son was deemed a reliable control (II-4). WGS (*) was performed on five family members. In family 2, there were six cases (III-1, III-3, III-4, IV-3, IV-4, IV-5), one probable case (IV-1) and one control (IV-2) out of which six underwent WGS. Family 3 consisted of two related cases (IV-4, IV-5) and one unrelated case (III-1) of which all three underwent WGS. Family 4 is characterized by bilateral PTCs concurrent with other subtypes of NMTCs (Hürthle cell cancer, follicular cancer). Four family members were diagnosed with thyroid cancer of which all underwent WGS (II-2, III-1, III-2, III-3). WGS was performed on eight family members of family 5. Five members were affected by PTC, Hürthle cell cancer, micro-PTC or a combination of two of the subtypes (II-2, II-3, II-5, II-8, II-9). Four members were possible carriers either affected by benign nodules or deceased (I-1, II-4, II-6) and two were unaffected (II-1, II-7). Figure 1. Pedigrees of the five non-medullary thyroid cancer (NMTC)-prone families analyzed in this study. 2.3. Whole genome sequencing and variant evaluation WGS for 23 cases and 3 controls was performed using Illumina-based small read sequencing after DNA was isolated from peripheral blood using the QIAamp ® DNA Mini Kit (Qiagen, Cat No. 51104) according to the manufacturer’s instructions. 2.4. Variant Calling Annotation and Filtering Sequencing data was mapped to a reference human genome (assembly version Hs37d5) using BWA mem (version 0.7.8) and duplicates were removed using biobambam (version 0.0.148). Single nucleotide variants (SNVs) and indels were called from all the samples in a family together using Platypus (version 0.8.1). ANNOVAR, 1000 Genomes, dbSNP and ExAC (Exome Aggregation Consortium) were used in the annotation of variants as explained in detail in our previous paper [14]. Variants to be evaluated further were selected using the following criteria: i) A quality score greater than 20, and a coverage greater than 5x; ii) All Platypus filters were met.

Load more