From Hand-Crafted to Deep Learning-Based Cancer Radiomics

2 Radiomics consists of a wide range of (partially intercon- nected) research areas with each individual branch possibly worth a complete exploration, the purpose of this article is to provide an inclusive introduction to Radiomics for the signal processing community, as such, we will focus on the progression of signal processing algorithms within the context of Radiomics. In brief, we aim to present an overview of the current state, opportunities, and challenges of Radiomics from signal processing perspective to facilitate further advancement and innovation in this critical multidisciplinary research area. As such, the article will cover the following four main areas: (i) Hand-crafted Radiomics, where we introduce and in- Fig. 1: Increasing interest in Radiomics based on data from Google vestigate different feature extraction, feature reduction, Scholar (“Radiomics” is used as the keyword). It is observed that and classification approaches used within the context there is an increasing interest in both types of the Radiomics. of Radiomics. Most of the techniques utilized in any as cancer stage. This clinical study has illustrated/validated of the aforementioned steps lie within the broad area effectiveness of Radiomics for tumor related predictions and of “Machine Learning,” where the goal is to improve showed that Radiomics has the capability to identify lung the performance of different computational models using and head-and-neck cancers from a single-time point CT scan. past experiences (data) [17]. In other words, the underly- Consequently, there has been a recent surge of interest [8]– ing models are capable of learning from past data, lead- [12] on this multidisciplinary research area as Radiomics has ing to the automatic process of prediction and diagno- the potential to provide significant assistance for assessing sis. Furthermore, since hundreds of Radiomics features the risk of recurrence of cancer [13]; Evaluating the risk are extracted, an appropriate feature selection/extraction of radiation-induced side-effects on non-cancer tissues [14], strategy should be adopted to reduce the “curse of and; Predicting the risk for cancer development in healthy dimensionality” and overfitting of the prediction models. subjects [14]. In a very recent article by Vallieres et al. [16], it Most of these strategies, themselves, lie within the field is shown that the Radiomics features extracted in Reference [5] of “Machine Learning”, as they are aimed to learn the have a noticeable dependency on the tumor volume, which best set of features, based on the available data. is a strong prognostic factor, and revised calculations are (ii) Deep learning-based Radiomics, where we provide an proposed that are less correlated to the tumor volume. In other overview of different deep architectures used in Ra- words, more powerful Radiomics features and procedures are diomics along with interpretability requirements. being introduced, illustrating the ongoing research potentials (iii) Hybrid solutions developed to simultaneously benefit of Radiomics. from the advantages of each of the above two mentioned The key underlying hypothesis in the Radiomics is that categories. the constructed descriptive models (based on medical imaging (iv) Challenges, Open Problems, and Opportunities, where data, sometimes complemented by biological and/or medical we focus on the limitations of processing techniques data) are capable of providing relevant and beneficial predic- unique in nature to the Radiomics, and introduce open tive, prognostic, and/or diagnostic information. In this regard, problems and potential opportunities for signal process- one can identify two main categories of Radiomics. Con- ing researchers. ventional pipeline based on Hand-Crafted Radiomic features Fig. 1 shows the increasing interest in Radiomics within the (HCR) that consists of the following four main processing research community. Although there have been few recent arti- tasks: (i) Image acquisition/reconstruction; (ii) Image segmen- cles [3], [18] reviewing and introducing Radiomics, to the best tation; (iii) Feature extraction and quantification, and; (iv) of our knowledge, most of them are from outside the signal Statistical analysis and model building. On the other hand, the processing (SP) community. References within the SP society Deep Learning-based Radiomics (DLR) pipeline has recently such as the work by J. Edwards [19] have investigated recent emerged which differs from the former category since deep advancements in medical imaging devices and technologies networks do not necessarily need the segmented Region Of without reviewing the role of Radiomics in medical applica- Interest (ROI), and their feature extraction and analysis parts tions. Other existing papers outside SP community (e.g., [3]) are partially or fully coupled. We will elaborate on these have failed to clearly describe the underlying signal processing properties in section IV. technologies and have narrowed down their scope only to More clinical studies are being approved and conducted hand-crafted Radiomics and its diagnosis capability. While to further investigate and advance the unparalleled oppor- Reference [18] has briefly touched upon the deep learning tunities the Radiomics posed to offer for clinical applica- pipeline as an emerging technology that can extract Radiomics tions. Information Post I provides an overview of different features, it has not studied applicability of different deep screening technologies used within the Radiomics pipeline architectures [20] and left the interpretability topic untouched. along with supporting data sources and available datasets to Furthermore, Reference [21] has mostly focused on the hand- develop Radiomics-based predictive/prognostic models. While crafted Radiomics, while deep learning-based Radiomics is 3 explained briefly without addressing different architectures, especially since SP is one of the main building blocks of the interpretability, and hybrid models. Although both types of Radiomics. Radiomics are covered in Reference [22], combination of The reminder of this article is organized as follows: first in hand-crafted and deep learning-based features are not con- Section II, we will discuss several applications of Radiomics sidered. Besides, challenges associated with Radiomics and in cancer-related fields, followed by Hand-Crafted solutions in the relation between Radiomics and gene-expression (Radio- Section III. The Deep learning-based Radiomics is presented genomics) are also not discussed thoroughly. Finally, the scope in Section IV, where several aspects of DLR is investigated. In of Reference [23] is limited to deep learning-based Radiomics, Section V, we explain different hybrid solutions to Radiomics, without addressing hand-crafted features, their stability, hybrid which aim to take advantage of both DLR and HCR. Finally in Radiomics, and Radiogenomics. All these call for an urgent Section VI various challenges and opportunities of Radiomics, and timely quest to introduce Radiomics to our community especially for SP community, are discussed. We conclude our work is Section VII. 4 Information Post I: Radiomics Supporting Resources Several potential medical resources provide information to the • Magnetic Resonance Imaging (MRI): Unlike CT, proper- Radiomics pipeline, some of which are directly used to extract ties of MRI images are not directly associated with tissue Radiomics features and some serve the decision making process, density and specific methods are required to obtain the so- as complementary information sets. Below we review the most called signal intensity. Besides, several imager and vendor- important data resources for Radiomics. dependant factors such as gradient and coil systems [26], pulse sequence design, slice thickness, and other parameters Screening Technologies: The Radiomics features can be ex- such as artifacts and magnetic field strength affect the prop- tracted from several imaging modalities, among which the fol- erties of the MRI images [2], which should be consistent lowing are the most commonly used modalities: across different institutions. • Computed Tomography (CT) Scans: The CT is the modal- ity of choice for the diagnosis of many diseases in different parts of the body, and by providing high resolution images [1] paves the path for extracting comparable Ra- diomics features. Nonetheless, the CT imaging performance depends on different components of the utilized protocol including the following three main properties: (i) Slice thickness, which is the distance in millimeter (mm) between two consecutive slices; (ii) The capability for projecting the density variations into image intensities, and; (iii) Re- construction algorithm, which aims at converting tomo- Complimentary Data Sources: In addition to imaging resources, graphic measurements to cross-sectional images. Although the following clinical data sources are typically combined with CT protocols for specific clinical indications are usually Radiomics features: similar across different institutions, Radiomics features can even differ between different scanners with the same set- • Gene expression: The process of converting DNA to func- tings [24]. Therefore, there is still a considerable need to tional product to have a global insight of cellular function. ensure consistency of Radiomics feature extraction amongst • Clinical characteristics: Patient’s characteristics such age, different scanners and imaging protocols [2]. CT images gender, and past medical

From Hand-Crafted to Deep Learning-Based Cancer Radiomics

Intro to Radiomics

Radiomics-Based Prediction of Overall Survival in Lung Cancer Using Different Volumes-Of-Interest

Evaluation of Shape and Textural Features from CT As Prognostic

Identifying Robust Radiomics Features for Lung Cancer by Using In-Vivo and Phantom Lung Lesions

Preliminary Report on Computed Tomography Radiomics Features As Biomarkers to Immunotherapy Selection in Lung Adenocarcinoma Patients

Repeatability of Multiparametric Prostate MRI Radiomics Features

Radiomics in Oncology, Part 1: Technical Principles and Gastrointestinal Application in CT and MRI

The Impact of the Variation of Imaging Factors on the Robustness of Computed Tomography Radiomic Features: a Review

Radiomics in Oncology, Part 2: Thoracic, Genito-Urinary, Breast, Neurological, Hematologic and Musculoskeletal Applications

Identification of Non–Small Cell Lung Cancer Sensitive to Systemic

An Exploratory Study on the Stable Radiomics Features of Metastatic Small Pulmonary Nodules in Colorectal Cancer Patients

Radiomics: the Facts and the Challenges of Image Analysis