Computational Intelligence in Patient-Sensitive Medical Decision Systems Report from Walter Karplus Summer Research Grant awarded by IEEE Computational Intelligence Society

Summer 2007

Maciej A. Mazurowski1

Academic Advisors: Jacek M. Zurada1 and Georgia D. Tourassi2

1 Computational Intelligence Laboratory Electrical and Computer Engineering Department University of Louisville 2 Duke Advanced Imaging Laboratories Duke University Medical Center

Abstract This report summarizes the work sponsored by IEEE Walter Karplus Summer Research grant awarded in 2007 by IEEE Computational Intelligence Society. Within this project a cooperative re- search was performed at the Computational Intelligence Laboratory at the University of Louisville and Duke Advanced Imaging Laboratories at Duke University Medical Center. In the project, en- semble techniques were applied to improve performance of a previously presented computer-aided detection (CAD) classifier. The classifier was designed for detection of masses in mammograms. As a result of the project, a paper was written and presented at SPIE Medical Imaging 2008.

1 Introduction and motivations

Computer-aided decision (CAD) systems have gained interest and popularity in medical diagnosis. CAD systems provide a second opinion regarding patients diagnosis based on previously acquired clinical information. Among other tasks, CAD systems have been thus far successfully applied in breast, lung and colon detection [1, 2, 3, 4, 5]. Some of the presented systems utilize computational intelligence techniques [6, 7, 8, 9, 10] Recently, some efforts have been made to incorporate ensemble techniques to CAD systems [11, 12, 13, 14, 15, 16]. In this project, we have made further investigations on this topic. In our studies we used a modular approach where the cases were initially divided into separate subgroups. Then, classifiers based on each of these groups served as experts in the classification task. After several sub-classifiers were constructed, a combiner was trained to fuse the particular decisions in one final response.

1 2 Multidisciplinary nature of the project

The implemented project had a highly multidisciplinary character. It involved knowledge from the fields of computational and artificial intelligence, database management and medical diagnosis. Following this requirement, the project involved the collaboration of scientists from different domains. Maciej Mazurowski and Dr. Jacek Zurada (Computational Intelligence Laboratory, University of Louisville) were responsible for the development and application of computational intelligence techniques. Dr. Georgia Tourassi (Department of Radiology, Duke University Medical Center) provided her expertise on the medical imaging and medical decision aspects of the project. I have visited the associated University (Duke) twice for a period of 2 months total. The visit allowed for a better exposure to the medical imaging aspects of CAD systems development. The time spent at Duke University also strengthened the cooperation between the Computational Intelligence Laboratory at the University of Louisville and Duke Advanced Imaging Laboratory at Duke University Medical Center.

3 Background

3.1 Feature-based and featureless CAD systems The study was conducted in the context of the CAD systems for breast cancer detection. Such systems utilize the information extracted from mammograms (X-ray images of breast). Two main approaches were utilized: feature-based and featureless. In the feature-based approach, a set of features is extracted from an x-ray image using an image processing algorithm. Then, the features are utilized in the process of classification of the case. Within the classifiers that can be used in this paradigm are neural networks (NN), linear discriminant analysis (LDA), Bayesian classifier, case-based classifier and others. In the featureless approach, proposed by Dr. Tourassi [17], [18], a classical information-theoretic concept of mutual information is used in order to evaluate similarity between two images. Then, case-based principles are used to classify new, incoming cases based on the similarity between the query case and cases stored in the database of the system.

4 Methods

The majority of the methods and experiments are presented in the conference article which is the result of the research conducted within this project [19]. This paper is currently in press for Proceedings of SPIE Medical Imaging 2008. Therefore, only a brief summary of the methods and results are presented here. In this paper, we propose to use an ensemble technique to improve performance of previously pre- sented information-theoretic CAD system (IT-CAD). In order to do so, the database of examples is di- vided into several exclusive subsets. The division is done randomly and using k-means clustering based on 7 textural features extracted from the ROIs. Then, linear discriminant analysis (LDA) is applied to combine the outputs of classifiers built based on these subsets. It has been demonstrated in the paper that this approach provides statistically significant improvement in the performance measured using the receiver operator characteristic (ROC). In this document, two additional experiments are presented demonstrating an application of two

2 0.94

0.93

0.92

0.91

0.9 AUC

0.89

0.88

0.87 ORIGINAL PERFORMANCE

0.86 2 4 6 8 10 12 14 16 18 20 number of clusters

Figure 1: Performance of the system from Experiment 1.

computational intelligence techniques, artificial neural networks and fuzzy clustering, to the ensemble framework.

5 Experiments

5.1 Database For the experimental purposes mammograms extracted from Digital Database for Screening Mammog- raphy (DDSM) [20] (University of Florida) were used. The mammograms were digitized using a LU- MISYS scanner. Regions of interest (ROIs) of 512 × 512 pixels were extracted from the mammograms. The resulting dataset consisted of 901 ROIs depicting masses and 919 ROIs depicting normal cases. The ROIs depicting normal cases were extracted from the normal mammograms in a random way.

5.2 Experiment 1: Incorporating a neural network to the ensemble system In the first experiment, feedforward neural networks (instead of LDA) were applied to combine the ensemble elements. The set of training examples was constructed using leave-one-out approach, as shown in [19]. Three hidden neurons and one output neuron were used. The activation function for all the neurons was linear. Error backpropagation with momentum was used in the training. Here, I concentrate on k-means approach to the database decomposition as it turned out to be superior over random division. To assess the performance of k-means + NN approach, 10-fold crossvalidation

3 0.94

0.93

0.92

0.91

0.9

0.89 AUC

0.88

0.87 ORIGINAL PERFORMANCE

0.86

0.85

0.84 2 4 6 8 10 12 14 16 18 20 number of clusters

Figure 2: Performance of the system from Experiment 2.

data handling was used to train and test the ensemble. I varied the number of elements of the ensemble between 2 and 20 with a step of 2. Figure 5.1 presents the performance of the system measured by the area under the ROC curve (AUC) for different number of elements of the ensemble. A large improvement of the performance can be clearly seen for all of the examined scenarios.

5.3 Experiment 2: Incorporating fuzzy clustering to the ensemble system In this experiment I went a step further and introduced c-means fuzzy clustering instead of previously used k-means clustering. Similarly as in Experiment 1, NN was used to combine the responses of the ensemble elements into one final decision. Ten-fold crossvalidation was used to estimate the performance of the system. The results are shown in Fig. 2. It can be seen that the performance of the c-means-based system is comparable to the performance of the system using k-means. The only two observed difference are that the system based on fuzzy c-means had noticeably lower performance than the system based on k-means when only two clusters are used but on the other hand it provided more stable performance for higher number of clusters.

6 Conclusions

In this document, I presented the outcomes from the project sponsored by IEEE Computational Intel- ligence Society. The project involved cooperation between the Computational Intelligence Laboratory

4 at the University of Louisville and Duke Advanced Imaging Laboratories at Duke University Medical Center and it involved my visit at the later lab. The results of the research performed in this project were included in the conference article [19] and presented by me at the SPIE Medical Imaging 2008 conference in February 2008. Additional results were presented in this document.

7 Acknowledgments

I would like to acknowledge the two professors advising me in this project who co-authored the resulting article: Dr. Jacek M. Zurada and Dr. Georgia D. Tourassi, as well as Dr. Vasilis Megaloikonomou for his help in this project. I would also like to thank the members of the Computational Intelligence Laboratory at the University of Louisville and Duke Advanced Imaging Laboratories at Duke University Medical Center for their support.

References

[1] B. van Ginneken, B. M. ter Haar Romeny, and M. A. Viergever, “Computer-aided diagnosis in chest radiography: a survey,” IEEE Transactions on Medical Imaging, vol. 20, pp. 1228–1241, 2001.

[2] M. P. Sampat, M. K. Markey, and A. C. Bovik, Handbook of Image and Video Processing, 2005, ch. Computer-Aided Detection and Diagnosis in , pp. 1195–1217.

[3] J. J. Perumpillichira, H. Yoshida, and D. V. Sahani, “Computer-aided detection for virtual colonoscopy.” Cancer Imaging, vol. 5, no. 1, pp. 11–16, 2005. [Online]. Available: http://dx.doi.org/10.1102/1470-7330.2005.0016

[4] J. Y. Lo, A. O. Bilska-Wolak, M. K. Markey, G. D. Tourassi, J. A. Baker, and C. E. F. Jr., Recent Advances In Breast Imaging, Mammography, and Computer-Aided Diagnosis of Breast Cancer. SPIE PRESS, 2006, ch. Computer-aided diagnosis in breast imaging: where do we go after detec- tion?, pp. 871–900.

[5] K. Doi, “Diagnostic imaging over the last 50 years: research and development in medical imaging science and technology,” Physics in Medicine and Biology, vol. 51, no. 13, pp. R5–R27, 2006.

[6] M. A. Anastasio, H. Yoshida, R. Nagel, R. M. Nishikawa, and K. Doi, “A genetic algorithm-based method for optimizing the performance of a computer-aided diagnosis scheme for detection of clustered microcalcifications in mammograms,” , vol. 25, pp. 1613–1620, 1998.

[7] D. B. Fogel, E. C. Wasson, III, E. M. Boughton, and V. W. Porto, “Evolving artificial neural net- works for screening features from mammograms,” Artificial Intelligence in Medicine, vol. 14, pp. 317–326, 1998.

[8] C. A. Pena-Reyes and M. Sipper, “A fuzzy-genetic approach to breast cancer diagnosis,” Artificial Intelligence in Medicine, vol. 17, pp. 131–155, 1999.

[9] ——, “Evolutionary computation in medicine: an overview,” Artificial Intelligence in Medicine, vol. 19, pp. 1–23, 2000.

5 [10] M. A. Mazurowski, P. A. Habas, J. M. Zurada, and G. D. Tourassi, “Decision optimization of case-based computer aided decision systems using genetic algorithms with application to mam- mography,” Physics in Medicine and Biology, vol. 53, pp. 895–908, 2008.

[11] I. Bloch, “Some aspects of dempster-shafer evidence theory for classification of multi-modality medical images taking partial volume effect into account,” Pattern Recognition Letters, vol. 17, pp. 905–919, 1996.

[12] L. I. Kuncheva, “Switching between selection and fusion in combining classifiers: An experiment,” IEEE Transactions on Systems, Man, and Cybernetics – Part B: Cybernetics, vol. 32, pp. 146–156, 2002.

[13] Z.-H. Zhou and Y. Jiang, “Medical diagnosis with c4.5 rule preceded by artificial neural network ensemble,” IEEE Transaction on Information Technology in Biomedicine, vol. 7, pp. 37–42, 2003.

[14] D. Greene, A. Tsymbal, N. Bolshakova, and P. Cunningham, “Ensemble clustering in medical diagnostics,” in Proceedings of 17th IEEE Symposium on Computer-Based Medical Systems (CBMS 2004), 2004.

[15] D. West, P. Mangiameli, R. Rampal, and V. West, “Ensemble strategies for a medical diagnostic decision support system: A breast cancer diagnosis application,” European Journal of Operational Research, vol. 162, pp. 532–551, 2005.

[16] M. Raza, I. Gondal, and R. L. C. David Green, “Classifier fusion using dempster-shafer theory of evidence to predict breast cancer tumors,” in 2006 IEEE Region 10 Conference (TENCON 2006), 2006, pp. 1–4.

[17] G. D. Tourassi, R. Vargas-Voracek, J. David M. Catarious, and J. Carey E. Floyd, “Computer- assisted detection of mammographic masses: A template matching scheme based on mutual infor- mation,” Medical Physics, vol. 30, pp. 2123–2130, 2003.

[18] G. D. Tourassi, B. Haarawood, S. Singh, J. Y. Lo, and C. E. Floyd, “Evaluation of information- theoretic similarity measures for content-based retrieval and detection of masses in mammograms,” Medical Physics, vol. 34, pp. 140–150, 2007.

[19] M. A. Mazurowski, J. M. Zurada, and G. D. Tourassi, “Database decomposition of a knowledge base cad system in mammography; an ensemble approach to improve detection performance,” in Proc. SPIE Medical Imaging 2008 (in press), 2008.

[20] M. Heath and et al., Digital Mammography. Kluwer Academic, Dordrecht, 1998, ch. Current status of the digital database for screening mammography.

6