A Gene-Based Risk Score Model for Predicting Recurrence-Free Survival

Total Page:16

File Type:pdf, Size:1020Kb

A Gene-Based Risk Score Model for Predicting Recurrence-Free Survival Wang et al. BMC Cancer (2021) 21:6 https://doi.org/10.1186/s12885-020-07692-6 RESEARCH ARTICLE Open Access A gene-based risk score model for predicting recurrence-free survival in patients with hepatocellular carcinoma Wenhua Wang1,2, Lingchen Wang1,2, Xinsheng Xie3, Yehong Yan4, Yue Li1,2 and Quqin Lu1,2* Abstract Background: Hepatocellular carcinoma (HCC) remains the most frequent liver cancer, accounting for approximately 90% of primary liver cancers worldwide. The recurrence-free survival (RFS) of HCC patients is a critical factor in devising a personal treatment plan. Thus, it is necessary to accurately forecast the prognosis of HCC patients in clinical practice. Methods: Using The Cancer Genome Atlas (TCGA) dataset, we identified genes associated with RFS. A robust likelihood-based survival modeling approach was used to select the best genes for the prognostic model. Then, the GSE76427 dataset was used to evaluate the prognostic model’s effectiveness. Results: We identified 1331 differentially expressed genes associated with RFS. Seven of these genes were selected to generate the prognostic model. The validation in both the TCGA cohort and GEO cohort demonstrated that the 7-gene prognostic model can predict the RFS of HCC patients. Meanwhile, the results of the multivariate Cox regression analysis showed that the 7-gene risk score model could function as an independent prognostic factor. In addition, according to the time-dependent ROC curve, the 7-gene risk score model performed better in predicting the RFS of the training set and the external validation dataset than the classical TNM staging and BCLC. Furthermore, these seven genes were found to be related to the occurrence and development of liver cancer by exploring three other databases. Conclusion: Our study identified a seven-gene signature for HCC RFS prediction that can be used as a novel and convenient prognostic tool. These seven genes might be potential target genes for metabolic therapy and the treatment of HCC. Keywords: TCGA, Hepatocellular carcinoma, Recurrence-free survival, Risk score, Prognostic model * Correspondence: [email protected] 1Jiangxi Provincial Key Laboratory of Preventive Medicine, Nanchang University, Nanchang 330006, Jiangxi, China 2Department of Biostatistics and Epidemiology, School of Public Health, Nanchang University, Nanchang 330006, Jiangxi, China Full list of author information is available at the end of the article © The Author(s). 2021 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. Wang et al. BMC Cancer (2021) 21:6 Page 2 of 15 Background prognosis of liver cancer remain poor with a 2-year re- In 2018, liver cancer remained among the top six preva- currence rate of 76.9% [6–8]. And many studies have lent carcinomas. There were 841,080 new patients, and shown that HCC is the most difficult to cure cancer, and 781,631 patients died of liver cancer according to the because of this, HCC has been described as a “chemore- Global Cancer Statistics [1, 2]. Hepatocellular carcinoma sistant” tumor [9]. Because of this, the prognosis of HCC (HCC) is the most frequent liver cancer, accounting for is poor. The recurrence-free survival (RFS) of HCC pa- approximately 90% of primary liver cancers [3]. Cur- tients is a critical factor in devising a personal treatment rently, Hepatectomy and Radiofrequency ablation are plan [10]. Thus, it is necessary to accurately forecast the main two ways to treat HCC [4, 5]. Despite the con- HCC patients’ prognosis to improve the prognosis of tinuous development of medical technology, the out- HCC. Most previous studies constructed prognostic come of many patients who receive treatment and the models using the Tumor-Node-Metastasis (TNM) Fig. 1 GO functional and KEGG pathway analyses. a Summary of the differentially expressed genes and GO pathway enrichment. Red, blue, and green bars represent the biological process, cellular component, and molecular function categories, respectively. The height of the bar represents the number of differentially expressed genes observed in each category. b The top 10 pathways of genes associated with RFS Wang et al. BMC Cancer (2021) 21:6 Page 3 of 15 staging system to assess the prognosis of HCC patients Identification of the best genes for modeling [11]. However, the TNM staging system does not predict A robust likelihood-based survival approach was used to the prognosis of HCC. Therefore, it is important to de- identify the best genes for modeling after determining velop a reliable tool for clinicians to predict the progno- the genes associated with RFS [22]. We used the sis of patients with HCC. “rbsurv” package in R to complete this modeling Given the remarkable advances in high-throughput process. technologies, the development of The Cancer Genome Atlas (TCGA) (https://portal.gdc.cancer.gov/) and the Construction and validation of the risk score system intergovernmental Gene Expression Omnibus (GEO) A multivariate Cox regression analysis and “rbsurv” ana- (https://www.ncbi.nlm.nih.gov/gds) database provides an lysis were performed to identify the genes related to RFS abundance of high-quality information regarding HCC and construct the prognostic gene signature. The “survi- [12]. Hence, it is urgent to develop methods to identify valROC” package in R was used to investigate the time- reliable therapeutic gene targets that could enable earlier dependent prognostic value. The optimal cut-off values prognostic evaluation and better therapeutic strategies based on ROC curves were obtained to classify the pa- [13]. Therefore, we considered whether we could build a tients into low-risk groups and high-risk groups. A cali- gene-based risk score model [14]. Our goal was to gen- bration curve and the concordance index (C-index) were erate simple and effective prognostic tools based on sev- used to evaluate the risk score system. eral genes and other factors that may affect RFS [13, 15]. Using the TCGA dataset, we selected 7 genes by robust likelihood-based survival modeling and built a risk score External validation of the risk score system system [16, 17]. We used an independent dataset We calculated the risk score in the GSE76427 dataset. (GSE76427) to validate the effectiveness of the risk score Then, the AUCs of the 12-month, 15-month, and 18- system and demonstrate that its clinical value in predict- month RFS and Kaplan-Meier curves were used to verify ing RFS in HCC patients is better than that of the TNM the risk score system. A calibration curve was used to staging system. validate the risk score system. In addition, the prognosis-related genes included in the risk score system were verified at the protein level by using The Human Methods Protein Atlas database. The CBioPortal for cancer gen- Data collection and survival analyses omics was used to study genetic alterations in the risk First, we downloaded gene expression profiles and clin- score system [23]. ical information from The Cancer Genome Atlas-liver hepatocellular carcinoma (TCGA-LIHC) dataset, which included 334 HCC samples [18]. We used GSE76427, Statistical analysis which contained the gene expression and clinical infor- The statistical tests were performed using R software mation of 115 HCC samples, as the validation group. and SPSS. Univariate and multivariate Cox regression The samples in TCGA-LIHC and GSE76427 that met analyses were performed using a forward stepwise pro- p the following inclusion criteria were included in this cedure. A -value less than 0.05 was considered statisti- study: all samples had mRNA sequencing data and clin- cally significant [23]. ical information related to RFS [19]. Table 1 The best genes predicting recurrence-free survival of hepatocellular carcinoma patients Identification of genes associated with RFS The raw count data were normalized with a log(a + 1) Gene symbol nloglik AIC Select transformation. Then, using the “survfit” function in the TTK 808.79 1619.59 * “survival” package, we plotted Kaplan-Meier curves for C16orf105 797.58 1599.16 * the high and low expression groups of each gene. A log PPAT 791.22 1588.43 * p rank test with a -value less than 0.05 was considered CD3EAP 788.83 1585.66 * statistically significant [20]. SLCO2A1 787.91 1585.83 * ACAT1 786.25 1584.50 * Enrichment analysis of GO functions and KEGG pathways GAS2L3 784.91 1583.83 * For the selected genes, we used WebGestalt (http:// SH2D5 784.84 1585.68 bioinfo.vanderbilt.edu/webgestalt) based on Gene Ontol- ogy (GO) functions and the Kyoto Encyclopedia of ATP8A2 784.75 1587.50 Genes and Genomes (KEGG) to understand the bio- PABPC5 784.74 1589.49 logical significance of the identified genes [21]. *Gene selected for the risk score Wang et al. BMC Cancer (2021) 21:6 Page 4 of 15 Fig. 2 Analysis of the seven-gene signature of HCC in TCGA dataset. a Risk score of each patient; b The RFS time and RFS status of the HCC patients; c the expression levels of TTK, C16orf105, PPAT, CD3EAP, SLCO2A1, ACAT1 and GAS2L3 in the signature; Kaplan-Meier analysis of the TCGA dataset; d The Kaplan-Meier curve for the risk score model in TCGA dataset Wang et al.
Recommended publications
  • Insights Into the Genetic Basis of the Renal Cell Carcinomas from the Cancer Genome Atlas Scott M
    Published OnlineFirst June 21, 2016; DOI: 10.1158/1541-7786.MCR-16-0115 Review Molecular Cancer Research Insights into the Genetic Basis of the Renal Cell Carcinomas from The Cancer Genome Atlas Scott M. Haake, Jamie D. Weyandt, and W. Kimryn Rathmell Abstract The renal cell carcinomas (RCC), clear cell, papillary, and mutations in chromatin modifier genes. Importantly, the pap- chromophobe, have recently undergone an unmatched geno- illary RCC classification encompasses a heterogeneous group mic characterization by The Cancer Genome Atlas. This anal- of diseases, each with highly distinct genetic and molecular ysis has revealed new insights into each of these malignancies features. In conclusion, this review summarizes RCCs that and underscores the unique biology of clear cell, papillary, and represent a diverse set of malignancies, each with novel bio- chromophobe RCC. Themes that have emerged include dis- logic programs that define new paradigms for cancer biology. tinct mechanisms of metabolic dysregulation and common Mol Cancer Res; 14(7); 589–98. Ó2016 AACR. Introduction ullary carcinoma and translocation carcinoma. It is anticipated that a revised World Health Organization (WHO) pathologic Cancers of the kidney have long fascinated physicians with classification recognizing even more rare and largely low-risk their highly divergent phenotypes and patterns of behavior (1). entities, such as clear cell tubulopapillary RCC and multi- Kidney tumors are conventionally grouped based upon their locular cystic RCC, will be published in 2016
    [Show full text]
  • Deep Learning-Based Gene Selection in Comprehensive Gene Analysis In
    www.nature.com/scientificreports OPEN Deep learning‑based gene selection in comprehensive gene analysis in pancreatic cancer Yasukuni Mori1*, Hajime Yokota2, Isamu Hoshino3, Yosuke Iwatate4, Kohei Wakamatsu5, Takashi Uno2 & Hiroki Suyari1 The selection of genes that are important for obtaining gene expression data is challenging. Here, we developed a deep learning‑based feature selection method suitable for gene selection. Our novel deep learning model includes an additional feature‑selection layer. After model training, the units in this layer with high weights correspond to the genes that worked efectively in the processing of the networks. Cancer tissue samples and adjacent normal pancreatic tissue samples were collected from 13 patients with pancreatic ductal adenocarcinoma during surgery and subsequently frozen. After processing, gene expression data were extracted from the specimens using RNA sequencing. Task 1 for the model training was to discriminate between cancerous and normal pancreatic tissue in six patients. Task 2 was to discriminate between patients with pancreatic cancer (n = 13) who survived for more than one year after surgery. The most frequently selected genes were ACACB, ADAMTS6, NCAM1, and CADPS in Task 1, and CD1D, PLA2G16, DACH1, and SOWAHA in Task 2. According to The Cancer Genome Atlas dataset, these genes are all prognostic factors for pancreatic cancer. Thus, the feasibility of using our deep learning‑based method for the selection of genes associated with pancreatic cancer development and prognosis was confrmed. Deep learning techniques have produced impressive results in many felds, including image classifcation, image generation, automated driving, and even board gameplay 1–6. A great deal of research is now being conducted to apply the superior inference performance of deep learning to the feld of medicine 7,8.
    [Show full text]
  • Association of RYR2 Mutation with Tumor Mutation Burden, Prognosis
    ORIGINAL RESEARCH published: 17 May 2021 doi: 10.3389/fgene.2021.669694 Association of RYR2 Mutation With Tumor Mutation Burden, Prognosis, and Antitumor Immunity in Patients With Esophageal Adenocarcinoma Zaoqu Liu 1,2,3†, Long Liu 4†, Dechao Jiao 1, Chunguang Guo 5, Libo Wang 4, Zhaonan Li 1, Zhenqiang Sun 6, Yanan Zhao 1,2,3* and Xinwei Han 1,2,3* 1 Department of Interventional Radiology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, China, 2 Interventional Institute of Zhengzhou University, Zhengzhou, China, 3 Interventional Treatment and Clinical Research Center of Henan Province, Zhengzhou, China, 4 Department of Hepatobiliary and Pancreatic Surgery, The First Affiliated Hospital of Edited by: 5 Valerio Costa, Zhengzhou University, Zhengzhou, China, Department of Endovascular Surgery, The First Affiliated Hospital of Zhengzhou 6 Consiglio Nazionale delle Ricerche University, Zhengzhou, China, Department of Colorectal Surgery, The First Affiliated Hospital of Zhengzhou University, (CNR), Italy Zhengzhou, China Reviewed by: Umberto Malapelle, Background: Esophageal adenocarcinoma (EAC) remains a leading cause of cancer- University of Naples Federico II, Italy related deaths worldwide and demonstrates a predominant rising incidence in Western Valentina Silvestri, Sapienza University of Rome, Italy countries. Recently, immunotherapy has dramatically changed the landscape of treatment *Correspondence: for many advanced cancers, with the benefit in EAC thus far been limited to a small fraction Yanan Zhao of patients. [email protected] Xinwei Han Methods: Using somatic mutation data of The Cancer Genome Atlas (TCGA) and the [email protected] International Cancer Genome Consortium, we delineated the somatic mutation landscape † These authors have contributed of EAC patients from US and England.
    [Show full text]
  • Oncogenomic Portals for the Visualization and Analysis of Genome-Wide Cancer Data
    www.impactjournals.com/oncotarget/ Oncotarget, Vol. 7, No. 1 Oncogenomic portals for the visualization and analysis of genome-wide cancer data Katarzyna Klonowska1, Karol Czubak1, Marzena Wojciechowska1, Luiza Handschuh1,2, Agnieszka Zmienko1,3, Marek Figlerowicz1,3, Hanna Dams- Kozlowska4,5 and Piotr Kozlowski1 1 European Centre for Bioinformatics and Genomics, Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland 2 Department of Hematology and Bone Marrow Transplantation, Poznan University of Medical Sciences, Poznan, Poland 3 Institute of Computing Sciences, Poznan University of Technology, Poznan, Poland 4 Department of Diagnostics and Cancer Immunology, Greater Poland Cancer Centre, Poznan, Poland 5 Chair of Medical Biotechnology, Poznan University of Medical Sciences, Poznan, Poland Correspondence to: Piotr Kozlowski, email: [email protected] Keywords: cBioPortal, COSMIC, IntOGen, PPISURV/MIRUMIR, Tumorscape Received: June 18, 2015 Accepted: September 28, 2015 Published: October 15, 2015 This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. ABSTRACT Somatically acquired genomic alterations that drive oncogenic cellular processes are of great scientific and clinical interest. Since the initiation of large-scale cancer genomic projects (e.g., the Cancer Genome Project, The Cancer Genome Atlas, and the International Cancer Genome Consortium cancer genome projects), a number of web-based portals have been created to facilitate access to multidimensional oncogenomic data and assist with the interpretation of the data. The portals provide the visualization of small-size mutations, copy number variations, methylation, and gene/protein expression data that can be correlated with the available clinical, epidemiological, and molecular features.
    [Show full text]
  • Metastatic Adrenocortical Carcinoma Displays Higher Mutation Rate and Tumor Heterogeneity Than Primary Tumors
    ARTICLE DOI: 10.1038/s41467-018-06366-z OPEN Metastatic adrenocortical carcinoma displays higher mutation rate and tumor heterogeneity than primary tumors Sudheer Kumar Gara1, Justin Lack2, Lisa Zhang1, Emerson Harris1, Margaret Cam2 & Electron Kebebew1,3 Adrenocortical cancer (ACC) is a rare cancer with poor prognosis and high mortality due to metastatic disease. All reported genetic alterations have been in primary ACC, and it is 1234567890():,; unknown if there is molecular heterogeneity in ACC. Here, we report the genetic changes associated with metastatic ACC compared to primary ACCs and tumor heterogeneity. We performed whole-exome sequencing of 33 metastatic tumors. The overall mutation rate (per megabase) in metastatic tumors was 2.8-fold higher than primary ACC tumor samples. We found tumor heterogeneity among different metastatic sites in ACC and discovered recurrent mutations in several novel genes. We observed 37–57% overlap in genes that are mutated among different metastatic sites within the same patient. We also identified new therapeutic targets in recurrent and metastatic ACC not previously described in primary ACCs. 1 Endocrine Oncology Branch, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA. 2 Center for Cancer Research, Collaborative Bioinformatics Resource, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA. 3 Department of Surgery and Stanford Cancer Institute, Stanford University, Stanford, CA 94305, USA. Correspondence and requests for materials should be addressed to E.K. (email: [email protected]) NATURE COMMUNICATIONS | (2018) 9:4172 | DOI: 10.1038/s41467-018-06366-z | www.nature.com/naturecommunications 1 ARTICLE NATURE COMMUNICATIONS | DOI: 10.1038/s41467-018-06366-z drenocortical carcinoma (ACC) is a rare malignancy with types including primary ACC from the TCGA to understand our A0.7–2 cases per million per year1,2.
    [Show full text]
  • The Challenge of Developing Cancer Biomarkers
    Downloaded from genome.cshlp.org on September 26, 2021 - Published by Cold Spring Harbor Laboratory Press Perspective Translational genomics: The challenge of developing cancer biomarkers James D. Brooks1 Department of Urology, Stanford University, Stanford, California 94305, USA Early detection and definitive treatment of cancer have been shown to decrease death and suffering in epidemiologic and intervention studies. Application of genomic approaches to many malignancies has produced thousands of candidate biomarkers for detection and prognostication, yet very few have become established in clinical practice. Fundamental issues related to tumor heterogeneity, cancer progression, natural history, and biomarker performance have provided challenges to biomarker development. Technical issues in biomarker assay detection limits, specificity, clinical de- ployment, and regulation have also slowed progress. The recent emergence of biomarkers and molecular imaging strategies for treatment selection and monitoring demonstrates the promise of cancer biomarkers. Organized efforts by interdisciplinary teams will spur progress in cancer diagnostics. Since the war on cancer was declared in 1972, cancer death rates, and many have been tested on clinical samples and show some after rising for several decades, have begun to slowly decline (Siegel utility. There the story usually stops (Ludwig and Weinstein 2005; et al. 2011). This drop can be attributed to preventive efforts (e.g., Ioannidis and Panagiotou 2011). In fact, the number of biomarkers smoking cessation), improved treatments for advanced disease, approved by the FDA each year for clinical use is in the single digits. and early detection and treatment of localized cancers. Future This drop-off rate shows that the development of biomarkers is progress in prevention and treatment of advanced disease requires every bit as difficult as the development and approval of a new drug.
    [Show full text]
  • 2019 AACR TET TCGA Poster FINAL
    Comprehensive analysis of thymic epithelial tumors using an integrated bioinformatics platform reveals novel biological characteristics Milan Radovich1, Jeffrey P. Solzak1, J Ireland2, Wade Webster2, Jesse Paquette2 1Indiana University Melvin and Bren Simon Cancer Center, Indianapolis, IN, USA; 2Tag.bio, San Francisco, CA, USA Background Methods Results Source data: Thymoma TCGA dataset cBioPortal (http://www.cbioportal.orG/study?id=thym_tcGa) 1. Differential expression identifies high tumoral overexpression of muscle autoantigens in patients with Myasthenia Gravis • Thymic epithelial tumors (TETs) are rare maliGnancies Statistical methods with ~500 cases diaGnosed in the US per year Statistical analyses for results 1A, 1B, 2 and 3A were performed usinG TaG.bio protocol apps. Upstream Predicted https://omics.taG.bio/fc-thym/custom/confiG A B Gene P-value Fold-change C Transcriptional Activation • AmonG TETs, thymoma is the most predominant, Analysis 1 - Differential expression for patients with Myasthenia Gravis The expression distribution for each Gene was compared between two cohorts: patients with Myasthenia PPARGC1A Regulator State Z-score P-value -4 characterized by a unique association with myasthenia Gravis (focus cohort) and patients without Myasthenia Gravis (comparison cohort). PPARGC1A <1x10 5.81 PPARGC1A Activated 3.796 0.00347 Statistical test: Mann-Whitney u-test. Raw p-values and false discovery rate estimates (q-values) were Gravis, followed by thymic carcinoma which is less -4 SOX1 Activated 2.887 0.0322 generated. NEFM NEFM <1x10 27.66 common but more clinically aGGressive. Analysis 2 - Differential expression for patients with metastatic Thymoma SOX2 Activated 2.581 0.0178 -3 The expression distribution for each Gene was compared between two cohorts: patients with Masaoka- NEFL <1x10 35.87 AIRE Activated 2.433 0.24 • The Cancer Genome Atlas (TCGA) recently published a KoGa staGe IVa or IVb (focus cohort), and patients with other staGes of the disease (comparison cohort).
    [Show full text]
  • A Prognostic Model for Predicting Recurrence-Free Survival in Hepatocellular Carcinoma Patients
    A prognostic model for predicting recurrence-free survival in hepatocellular carcinoma patients Wenhua Wang Nanchang University Lingchen Wang Nanchang University Xinsheng Xie Nanchang University Yehong Yan First Aliated Hospital of Nanchang University Yue Li Nanchang University Quqin Lu ( [email protected] ) Nanchang University https://orcid.org/0000-0003-2813-197X Research article Keywords: TCGA, hepatocellular carcinoma, recurrence-free survival, risk score, prognostic model Posted Date: May 28th, 2020 DOI: https://doi.org/10.21203/rs.3.rs-30658/v1 License: This work is licensed under a Creative Commons Attribution 4.0 International License. Read Full License Version of Record: A version of this preprint was published on January 5th, 2021. See the published version at https://doi.org/10.1186/s12885-020-07692-6. Page 1/23 Abstract Hepatocellular carcinoma (HCC) remains the most frequent liver cancer, accounting for approximately 90% of primary liver cancers worldwide. The recurrence-free survival (RFS) of HCC patients is a critical factor in devising a personal treatment plan. Thus, it is necessary to accurately forecast the prognosis of HCC patients in clinical practice. Using the The Cancer Genome Atlas TCGA dataset, we identied genes that are associated with RFS. A robust likelihood-based survival modeling approach was used to select the best genes for the prognostic model. Then, the GSE76427 dataset was used to evaluate the prognostic model’s effectiveness. We identied 1331 differentially expressed genes associated with RFS. Seven of these genes were selected to generate the prognostic model. Validation in both the TCGA cohort and the GEO cohort demonstrated that the 7-gene prognostic model has the capability of predicting the RFS of HCC patients.
    [Show full text]
  • Nci-Plan-2013.Pdf
    THE NATIONAL CANCER PROGRAM MANAGING THE NATION’S RESEARCH PORTFOLIO AN ANNUAL PLAN AND BUDGET PROPOSAL FOR FISCAL YEAR 2013 National Cancer Institute U.S. DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health Contents 1 Foreword 4 Provocative Questions 8 Center for Cancer Genomics 16 Introduction to Cancer Profiles 18 Pancreatic Ductal Adenocarcinoma 22 Colorectal Cancer 26 B-Cell Lymphoma 30 Renal Cell (Kidney) Cancer 34 GI Stromal Cancer 39 Research Focus: Comorbidities 42 Research Focus: Circulating Tumor Cells 44 Research Focus: Reducing Cancer Health Disparities 50 The Frederick National Laboratory for Cancer Research 61 Small Business Innovation Research, Small BusinessTechnology Transfer (SBIR, STTR) 66 NCI Center for Global Health 72 NCI-Designated Cancer Centers 83 Glossary of Terms 84 Budget FOREWORD The National Cancer Program MANAGING THE NATION’S RESEARCH PORTFOLIO his year marks the 75th anniversary of the founding of the National Cancer Institute, the first so-called “categorical institute” of what has become the National Institutes of Health, with its 27 Institutes and Centers. Over the years, the Congress and the public have generously supported the TNCI—and the NIH generally—with sustained budgetary increases. This was especially true for the NCI during the rapid expansion of its budget following the National Cancer Act of 1971 and for the NIH, including the NCI, during the 5-year doubling of its budget, launched in 1998. Both of these eras of rapid growth were remarkably fruitful. The first launched the pursuit of cancer genes and the molecular basis of oncogenesis, laying the foundation for the transformation of clinical oncology that is now occurring.
    [Show full text]
  • Characteristics of the Cancer Genome Atlas Cases Relative to U.S. General Population Cancer Cases
    www.nature.com/bjc ARTICLE Epidemiology Characteristics of The Cancer Genome Atlas cases relative to U.S. general population cancer cases Xiaoyan Wang1, Joseph T. Steensma1, Matthew H. Bailey2,3, Qianxi Feng1, Hannah Padda1 and Kimberly J. Johnson1,4 BACKGROUND: Despite anecdotal reports of differences in clinical and demographic characteristics of The Cancer Genome Atlas (TCGA) relative to general population cancer cases, differences have not been systematically evaluated. METHODS: Data from 11,160 cases with 33 cancer types were ascertained from TCGA data portal. Corresponding data from the Surveillance, Epidemiology, and End Results (SEER) 18 and North American Association of Central Cancer Registries databases were obtained. Differences in characteristics were compared using Student’s t, Chi-square, and Fisher’s exact tests. Differences in mean survival months were assessed using restricted mean survival time analysis and generalised linear model. RESULTS: TCGA cases were 3.9 years (95% CI 1.7–6.2) younger on average than SEER cases, with a significantly younger mean age for 20/33 cancer types. Although most cancer types had a similar sex distribution, race and stage at diagnosis distributions were disproportional for 13/18 and 25/26 assessed cancer types, respectively. Using 12 months as an end point, the observed mean survival months were longer for 27 of 33 TCGA cancer types. CONCLUSIONS: Differences exist in the characteristics of TCGA vs. general population cancer cases. Our study highlights population subgroups where increased sample collection is warranted to increase the applicability of cancer genomic research results to all individuals. British Journal of Cancer (2018) 119:885–892; https://doi.org/10.1038/s41416-018-0140-8 INTRODUCTION factors associated with race can impact the somatic mutation In recent years, progress in genome sequencing technologies and landscape.
    [Show full text]
  • Genomics and Functional Genomics of Malignant Pleural Mesothelioma
    International Journal of Molecular Sciences Review Genomics and Functional Genomics of Malignant Pleural Mesothelioma Ece Cakiroglu 1,2 and Serif Senturk 1,2,* 1 Izmir Biomedicine and Genome Center, Izmir 35340, Turkey; [email protected] 2 Department of Genome Sciences and Molecular Biotechnology, Izmir International Biomedicine and Genome Institute, Dokuz Eylul University, Izmir 35340, Turkey * Correspondence: [email protected] Received: 22 July 2020; Accepted: 20 August 2020; Published: 1 September 2020 Abstract: Malignant pleural mesothelioma (MPM) is a rare, aggressive cancer of the mesothelial cells lining the pleural surface of the chest wall and lung. The etiology of MPM is strongly associated with prior exposure to asbestos fibers, and the median survival rate of the diagnosed patients is approximately one year. Despite the latest advancements in surgical techniques and systemic therapies, currently available treatment modalities of MPM fail to provide long-term survival. The increasing incidence of MPM highlights the need for finding effective treatments. Targeted therapies offer personalized treatments in many cancers. However, targeted therapy in MPM is not recommended by clinical guidelines mainly because of poor target definition. A better understanding of the molecular and cellular mechanisms and the predictors of poor clinical outcomes of MPM is required to identify novel targets and develop precise and effective treatments. Recent advances in the genomics and functional genomics fields have provided groundbreaking insights into the genomic and molecular profiles of MPM and enabled the functional characterization of the genetic alterations. This review provides a comprehensive overview of the relevant literature and highlights the potential of state-of-the-art genomics and functional genomics research to facilitate the development of novel diagnostics and therapeutic modalities in MPM.
    [Show full text]
  • A Machine Learning Approach for the Classification of Kidney Cancer
    applied sciences Article A Machine Learning Approach for the Classification of Kidney Cancer Subtypes Using miRNA Genome Data Ali Muhamed Ali † , Hanqi Zhuang *,†, Ali Ibrahim †, Oneeb Rehman †, Michelle Huang † and Andrew Wu † CEECS Department, Florida Atlantic University, Boca Raton, FL 33431, USA; [email protected] (A.M.A.); [email protected] (A.I.); [email protected] (O.R.); [email protected] (M.H.); [email protected] (A.W.) * Correspondence: [email protected]; Tel.: +1-561-756-5372 † Current address: 777 Glades Rd, Boca Raton, FL 33431, USA. Received: 17 September 2018; Accepted: 20 November 2018; Published: 29 November 2018 Abstract: Kidney cancer is one of the deadliest diseases and its diagnosis and subtype classification are crucial for patients’ survival. Thus, developing automated tools that can accurately determine kidney cancer subtypes is an urgent challenge. It has been confirmed by researchers in the biomedical field that miRNA dysregulation can cause cancer. In this paper, we propose a machine learning approach for the classification of kidney cancer subtypes using miRNA genome data. Through empirical studies we found 35 miRNAs that possess distinct key features that aid in kidney cancer subtype diagnosis. In the proposed method, Neighbourhood Component Analysis (NCA) is employed to extract discriminative features from miRNAs and Long Short Term Memory (LSTM), a type of Recurrent Neural Network, is adopted to classify a given miRNA sample into kidney cancer subtypes. In the literature, only a couple of kidney subtypes have been considered for classification. In the experimental study, we used the miRNA quantitative read counts data, which was provided by The Cancer Genome Atlas data repository (TCGA).
    [Show full text]