Convolutional Neural Network Model Layers Improvement for Segmentation and Classification on Kidney Stone Images Using Keras and Tensorflow

Journal of Multidisciplinary Engineering Science and Technology (JMEST) ISSN: 2458-9403 Vol. 8 Issue 6, June - 2021
www.jmest.org JMESTN42353822 14151

Orobosa Libert Joseph
Department of Computer / Electrical and Electronics Engineering, The Federal University of Technology, Akure, Nigeria.
[email protected]

Waliu Olalekan Apena
(1) Department of Computer / Electrical and Electronics Engineering, The Federal University of Technology, Akure, Nigeria.
(2) Biomedical Computing and Engineering Technology, Applied Research Group, Coventry University, Coventry, United Kingdom
[email protected], [email protected]

Abstract: Convolutional neural network (CNN) models are beneficial for training image classification algorithms on highly abstract features, and they work with fewer parameters. Over-fitting, exploding gradients, and class imbalance are major challenges in CNN training; with appropriately managed training, these issues can be diminished and model performance enhanced. The models studied are the 128 by 128 CNN-ML and the 256 by 256 CNN-ML, covering training (or learning) and classification. The results were compared for each model classifier. The 128 by 128 CNN-ML model has the following evaluation results, with consideration of error: (i) without validation, the accuracy was 85.1%; (ii) with validation, the accuracy was 86.8%; (iii) Absolute Error (AE) of 0.0696 and (iv) Relative Error (RE) of ± 0.0897; while the 256 by 256 CNN-ML model has the following evaluation results: (i) without validation, the accuracy was 86.3%; (ii) with validation, the accuracy was 85.6%; (iii) Absolute Error (AE) of 0.0257 and (iv) Relative Error (RE) of ± 0.0320. These values show that the higher the number of nodes in the layer, the better the performance. Node scaling could be deployed in biomedical decision support systems and patient management.

Keywords: CNN nodes; layers; Keras; TensorFlow; Performance

I. INTRODUCTION

The application of artificial intelligence (AI) to real-life situations is limitless, as seen in decision support systems for positive healthcare outcomes. Convolutional neural networks (CNN) are deployed for image analysis with limited initial parameters and are gaining popularity in biomedical engineering research and healthcare patient management. The process of extracting useful knowledge from huge data is known as Data Mining. Data Mining is an exploration and analysis, by automatic or semiautomatic means, of large quantities of data in order to discover meaningful patterns [1,2]. The domains of data mining include image mining, opinion mining, web mining, text mining, graph mining and so on. Some of its applications include anomaly detection, financial data analysis, medical data analysis, social network analysis, and market analysis [3]. Recent progress in deep learning using CNN machine learning (CNN-ML) has been helpful in decision support and has contributed significantly to positive outcomes. The application of CNN-ML to diverse areas of soft computing has been adapted to diagnosis procedures to improve time and accuracy [4]. The study investigated CNN-ML layer node improvement with adapted Keras and TensorFlow for kidney stone image classification.

II. LITERATURE REVIEW

A. Deep learning

Deep learning can be adapted into computational models that are composed of multiple processing layers to learn representations of data with multiple levels of abstraction [5]. This can be deployed in speech recognition and in visual object detection and recognition, with respect to the initial input and the internal parameters (weights). Deep learning was first introduced by [6] for a class of deep probabilistic generative models called Deep Belief Networks (DBNs) [7].

B. Neural Network

In 1943, Warren McCulloch and Walter Pitts developed the first mathematical model of a neuron. In their research paper "A logical calculus of the ideas immanent in nervous activity", they described a simple mathematical model for a neuron, which represents a single cell of the neural system that takes inputs, processes those inputs, and returns an output. This model is known as the McCulloch-Pitts neural model [8].

C. Convolutional Neural Network

The name "convolutional neural network" indicates that the network employs a mathematical operation called convolution. Convolution is a specialized kind of linear operation. Convolutional networks are simply neural networks that use convolution in place of general matrix multiplication in at least one of their layers [9]. Convolutional networks were inspired by biological processes [10].

D. Convolutional layer

The main building blocks of a CNN model are the convolutional layers. When programming a CNN, the input is a tensor [11] with shape (number of images) x (image height) x (image width) x (image depth). After passing through a convolutional layer, the image becomes abstracted to a feature map, with shape (number of images) x (feature map height) x (feature map width) x (feature map channels). Convolutional layers convolve the input and pass the result to the next layer [12, 13]. Figure (1) shows a typical CNN architecture.

Figure 1: A configuration of typical CNN layer [14]

E. TensorFlow Backend Platform

TensorFlow is an open-source artificial intelligence library that uses data flow graphs to build models. It allows developers to create large-scale neural networks with many layers. TensorFlow is mainly used for: Classification, Perception, Understanding, Discovering, Prediction and Creation [15, 16].

F. Keras Library

In 2017, Google's TensorFlow team decided to support Keras in TensorFlow's core library [17]. Keras is an open-source neural-network library written in the Python language. It is capable of running on top of TensorFlow, Microsoft Cognitive Toolkit, R, Theano, or PlaidML. Designed to enable fast experimentation with deep neural networks, it focuses on being user-friendly, modular, and extensible [17, 18].

G. Related Works

The work in [19] adopted neural networks for kidney stone detection and diagnosis. The study was based on empirical data and developed an algorithm using two neural network algorithms, viz. radial basis function and learning vector quantization. The work used many complex dependent variables with few specimens: 5 instances, each having 7 attributes such as age, sex, lymphocytes, monocytes, neutrophil, S. creatinine, and eosinophils, which is an unusually low amount of data.

The paper [7] worked on deep learning-based classification of focal liver lesions with contrast-enhanced ultrasound; it focused on the classification of liver masses, which is important for the early diagnosis of patients. The study proposed a diagnostic system for liver disease classification based on contrast-enhanced ultrasound (CEUS) imaging. In the proposed system, the dynamic CEUS videos of hepatic perfusion are first retrieved. Secondly, time intensity curves (TICs) are extracted from the dynamic CEUS videos using sparse non-negative matrix factorization. Finally, deep learning is employed to classify benign and malignant focal liver lesions based on these TICs, which makes the approach robust, though perhaps too elaborate to adopt readily.

H. CNN Model Classifier Evaluation

To evaluate the performance of the neural network classifiers for the kidney stone and non-kidney stone classes, the Accuracy (AC), Sensitivity (SE), Precision (P), Specificity (SP) and Effectiveness (E) parameters were used [20, 21, 22].

III. METHOD

The study implemented and trained a segmentation and classification algorithm on an AMD Quad-Core 1.7 GHz processor with 6.0 GB RAM, running a 64-bit Windows 7 operating system, using Python and Jupyter Notebook. The training images acquired were of size 120 by 100 pixels. The image slices were resized to 50 by 50 pixels before features were extracted from the images. All the images supplied were passed through OpenCV [23] preprocessing steps to enhance their contrast as pure grayscale. The two Convolutional Neural Network Machine Learning (CNN-ML) models were the 128 by 128 CNN-ML and the 256 by 256 CNN-ML models. The study compared the training (or learning) and classification results for each classifier.

A. Neural Network Model of the Training Phase

The study deployed an artificial neural network model for the image operation, as expressed below. The image inputs are in vector (matrix) form: given a set of input vectors \(\vec{x}_k\) (which represents the image matrix), the vector and its corresponding transpose are expressed in equations (1) and (2).

\[ \vec{x}_k = (x_{k1}\ x_{k2}\ x_{k3}\ \ldots\ x_{km}) \tag{1} \]

\[ \vec{x}_k = [x_{k1}\ x_{k2}\ x_{k3}\ \ldots\ x_{km}]^{T} \tag{2} \]

where k = 1, 2, 3, …, m; and m = number of interconnected neurons at the input.

The corresponding set of output vectors, \(\vec{y}_k\), with its transpose, is expressed in (3) and (4).

\[ \vec{y}_k = (y_{k1}\ y_{k2}\ y_{k3}\ \ldots\ y_{km}) \tag{3} \]

\[ \vec{y}_k = [y_{k1}\ y_{k2}\ y_{k3}\ \ldots\ y_{km}]^{T} \tag{4} \]

The weight vector \(\vec{w}_{jk}\) associated with the input vector \(\vec{x}_k\) is expressed in (5).

\[ \vec{w}_{jm} = [w_{j1}(k)\ w_{j2}(k)\ \ldots\ w_{jm}(k)]^{T} \tag{5} \]

The output \(\vec{y}_k\) can thus be expressed as:

\[ y_{kj} = \sum_{i=1}^{m} w_{ji}(k)\, x_{ki} \tag{6} \]

C. Machine Learning Model with Keras and TensorFlow

Two different Convolutional Neural Network Machine Learning (CNN-ML) models were adopted in this work, namely the 128 by 128 CNN-ML and the 256 by 256 CNN-ML models.
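As an illustration of the weighted sum in equation (6), the following is a minimal sketch in plain Python rather than the study's own implementation. The function name `neuron_output` and the numeric values are hypothetical and chosen only for illustration; the lists stand in for the input vector of equations (1) and (2) and the weight vector of equation (5).

```python
from typing import Sequence


def neuron_output(weights: Sequence[float], inputs: Sequence[float]) -> float:
    """Weighted sum of equation (6): y_kj = sum over i of w_ji(k) * x_ki."""
    if len(weights) != len(inputs):
        raise ValueError("weights and inputs must both have length m")
    return sum(w * x for w, x in zip(weights, inputs))


# Hypothetical example with m = 3 inputs (illustrative values, not study data).
x_k = [0.5, 0.25, 1.0]   # input vector x_k, equations (1)-(2)
w_j = [2.0, 4.0, -1.0]   # weight vector for neuron j, equation (5)
y_kj = neuron_output(w_j, x_k)
print(y_kj)
```

In a trained network this sum would normally be followed by an activation function; that step is not part of equation (6) and is left out of the sketch.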

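The evaluation parameters named in Section II.H can be sketched from binary confusion-matrix counts. The text above does not give formulas, so the code below assumes the usual textbook definitions of accuracy, sensitivity, precision and specificity. Effectiveness (E) is omitted because its definition is not stated here. The class `ConfusionCounts` and the example counts are hypothetical and are not results from the study.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    """Binary confusion-matrix counts; 'positive' means the kidney stone class."""
    tp: int  # stone images predicted as stone
    fp: int  # non-stone images predicted as stone
    tn: int  # non-stone images predicted as non-stone
    fn: int  # stone images predicted as non-stone


def accuracy(c: ConfusionCounts) -> float:
    """Share of all images that were classified correctly."""
    return (c.tp + c.tn) / (c.tp + c.fp + c.tn + c.fn)


def sensitivity(c: ConfusionCounts) -> float:
    """Share of actual stone images that were detected (recall)."""
    return c.tp / (c.tp + c.fn)


def precision(c: ConfusionCounts) -> float:
    """Share of predicted stone images that really are stone images."""
    return c.tp / (c.tp + c.fp)


def specificity(c: ConfusionCounts) -> float:
    """Share of actual non-stone images that were correctly rejected."""
    return c.tn / (c.tn + c.fp)


# Hypothetical counts for illustration only. A zero denominator would raise
# ZeroDivisionError; this sketch does not handle that case.
counts = ConfusionCounts(tp=45, fp=5, tn=40, fn=10)
for name, metric in (("AC", accuracy), ("SE", sensitivity),
                     ("P", precision), ("SP", specificity)):
    print(f"{name}: {metric(counts):.3f}")
```

Reporting sensitivity and specificity alongside accuracy matters when the two classes are imbalanced, which the abstract lists as one of the training challenges.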