Lancaster-Dissertation-2018

Total Page:16

File Type:pdf, Size:1020Kb

Lancaster-Dissertation-2018 c Copyright by Keith Lancaster 2018 All Rights Reserved ASYMMETRY MEASURES FOR AUTOMATED MELANOMA DETECTION IN DERMOSCOPIC IMAGES A Dissertation Presented to the Faculty of the Department of Electrical and Computer Engineering University of Houston In Partial Fulfillment of the Requirements for the Degree Doctor of Philosophy By Keith Lancaster May 2018 ASYMMETRY MEASURES FOR AUTOMATED MELANOMA DETECTION IN DERMOSCOPIC IMAGES Keith Lancaster Approved: Chair of the Committee Dr. George Zouridakis, Professor Electrical and Computer Engineering Committee Members: Dr. Ji Chen, Professor Electrical and Computer Engineering Dr. Ben H. Jansen, Professor Emeritus Electrical and Computer Engineering Dr. John Glover, Professor Emeritus Electrical and Computer Engineering Dr. Xiaojing Yuan, Associate Professor Computer Engineering Technology Dr. Suresh Khator, Associate Dean, Dr. Badri Roysam, Professor and Chair Cullen College of Engineering Electrical & Computer Engineering Acknowledgments First, I would like to thank Dr. George Zouridakis who has supported and en- couraged me throughout this long process. It was an honor to work with him, and I will be forever grateful. I would be remiss not to thank Dr. Ben Jansen, Dr. John Glover, Dr. Ji Chen, and Dr. Xiaojing Yuan for serving on my committee. I would also like to thank my lab mates Ning Situ, Tarun Wadhawan, and Rui Hu for their friendship and support. Finally, I must thank my wife Amy, son Garrett, and his wife, Cindy. Without their continued encouragement over the years, I would never have completed this journey. v ASYMMETRY MEASURES FOR AUTOMATED MELANOMA DETECTION IN DERMOSCOPIC IMAGES An Abstract of a Dissertation Presented to the Faculty of the Department of Electrical and Computer Engineering University of Houston In Partial Fulfillment of the Requirements for the Degree Doctor of Philosophy By Keith Lancaster May 2018 vi Abstract Dermoscopic rules such as the ABCD and Menzies rules are employed by der- matologists to determine the likelihood that a suspicious lesion is cancerous. This dissertation focuses on the improvement of automated melanoma recognition systems that implement these rules, specifically by enhancing the ability of these systems to recognize lesion asymmetry, a significant indicator of melanoma. Two approaches are proposed for asymmetry classification. The first utilizes the irregularity of the outer contour of the lesion combined with measures that com- pare quadrants of the lesion with respect to area, color, and melanin content. The second method uses size theory as the basis for determining asymmetry. In this ap- proach, measuring functions are employed to expose relevant characteristics of the lesion. The one-dimensional measuring functions are mapped into size functions in R2 and compared using the bottleneck distance. The distances are used as features for classification. Annotated dermoscopic images were used to train classifiers for both methods. Classification rates were competitive with other approaches for both methods inde- pendently, with the combined method exhibiting 95% accuracy. Additionally, decision fusion strategies were investigated as a means of combining the results from individual melanoma classifiers using the asymmetry methods devel- oped in this study. The best approach showed 100% sensitivity and 64% specificity, exceeding the performance of the individual classifiers. Finally, a software framework for the development of medical applications is pre- sented. This framework attempts to provide biomedical researchers with a simplified approach to creating mobile applications for medical processing. vii Table of Contents Acknowledgments ................................ v Abstract ...................................... vii Table of Contents ................................ viii List of Figures .................................. xi List of Tables ................................... xiii 1 Introduction .................................. 1 1.1 Research Objectives . 2 2 Background .................................. 5 2.1 Identifying Suspected Melanoma . 5 2.1.1 Diagnostic Methods Used by PCPs . 5 2.1.2 Diagnostic Methods Used by Dermatologists . 6 2.2 Dermoscopy . 6 2.2.1 Dermoscopic Rules . 7 2.3 Automating Melanoma Detection . 10 2.4 Classifier Fusion . 12 2.4.1 Fusion Topologies . 12 2.5 Utilizing Smartphones for Melanoma Screening . 14 2.6 Image Processing Frameworks for Mobile Devices . 15 viii 2.7 Related Works . 15 2.7.1 Automation of Dermoscopic Rules . 15 2.7.2 DermoScreen . 19 3 Methods .................................... 22 3.1 Data . 22 3.2 Process Overview . 23 3.2.1 Image Segmentation . 23 3.2.2 Feature Extraction . 23 3.2.3 Feature Selection . 24 3.2.4 Classification . 27 3.2.5 Assessing Classifier Performance . 30 3.3 Assessment of Lesion Asymmetry . 33 3.3.1 Introduction . 33 3.3.2 DermoScreen Asymmetry Algorithm . 33 3.3.3 Asymmetry Based on Size Theory . 35 3.3.4 Asymmetry from Border and Area Characteristics . 39 3.4 Decision Fusion . 42 3.4.1 Majority Voting . 44 3.4.2 Sequential Conditional Application . 44 3.5 Mobile Software Framework . 44 4 Results ..................................... 49 4.1 Asymmetry . 49 4.1.1 Data Preprocessing . 49 4.1.2 DermoScreen Asymmetry Analysis . 50 4.1.3 Asymmetry from Border and Area . 51 ix 4.1.4 Asymmetry using Size Functions . 56 4.1.5 Summary of Asymmetry Classifier Performance . 67 4.1.6 Effect of Improved Asymmetry on Melanoma Detection . 68 4.2 Decision Fusion . 71 4.2.1 Decision Fusion Based on Voting Strategies . 72 4.2.2 Decision Fusion Based on Sequential Application . 72 4.3 Mobile Software Framework . 73 4.3.1 Domain Specific Language (DSL) Implementation . 73 4.3.2 Performance Testing . 75 5 Conclusions .................................. 78 5.1 Contributions . 78 5.1.1 Objective 1: Develop an adaptable software framework . 78 5.1.2 Objective 2: Explore methods for assessing asymmetry . 80 5.1.3 Objective 3: Explore decision fusion methods . 81 5.1.4 Additional contributions . 82 5.2 Future work . 83 5.2.1 Software framework . 83 5.2.2 Asymmetry detection . 83 Bibliography ................................... 84 Appendices .................................... 93 A Publications and Patents .......................... 94 x List of Figures 2.1 Dermline II dermatoscope . 7 2.2 The ABCD rule . 9 2.3 7-Point checklist . 10 2.4 Typical analysis architecture for medical image processing . 11 2.5 MCS topologies . 13 2.6 Dermoscopic analysis on iPhone . 20 3.1 Support vector machines . 28 3.2 A simple decision tree . 29 3.3 Receiver operating characteristic curve . 32 3.4 Lesion quadrants based on centroid . 34 3.5 Size functions for a simple curve . 36 3.6 Matching distance between two size functions . 37 3.7 Pixels used in calculating measuring functions . 38 3.8 Tangent values on lesion boundary . 40 3.9 Areas used to determine area irregularity measure . 42 3.10 Conditional application of dermoscopic rules . 45 3.11 Conceptual architecture for proposed imaging framework . 47 4.1 Correlation plot for area and contour features . 53 4.2 Boruta plot for all features using the PHm data set. 54 4.3 ROC curves for Boruta features . 55 xi 4.4 Border distance size functions . 58 4.5 Correlation plot for measuring functions . 59 4.6 Boruta diagram for size function features . 60 4.7 ROC curves for Boruta for size functions . 61 4.8 Correlation plot for combined features . 63 4.9 Boruta diagram for combined features . 64 4.10 ROC curves for LR classifiers and full feature set . 66 4.11 Asymmetry classifier accuracy . 67 4.12 Classification of melanoma using the ABCD rule. 69 4.13 Classification of melanoma using the Menzies rule. 70 4.14 Benign lesion with interal asymmetry . 71 4.15 IPC selection screen . 74 4.16 Task object model . 75 4.17 DSL Example: Creating a mask image . 75 xii List of Tables 2.1 Common melanoma scoring schemes . 8 3.1 Features and their corresponding area descriptors . 43 4.1 PH2 dataset after removal of poor images . 50 4.2 Classification results for Dermoscreen and PH2m dataset . 51 4.3 Features and their abbreviations . 52 4.4 CI and VIF measures for features with CI 30 . 52 ≥ 4.5 Classification results for PH2m dataset . 56 4.6 Classification results for PH2m dataset using size function approach . 62 4.7 Classification results for PH2m dataset using combined feature set . 65 4.8 Results for the LR classifier on the PH20;1 and PH20;2 datasets. 65 4.9 Summary of asymmetry accuracy for LASSO . 67 4.10 Classifier performance on melanoma images . 68 4.11 Summary of dermoscopic rule classifier results . 72 4.12 Results for simple voting strategies . 72 4.13 Fast median filtering performance . 76 xiii Chapter 1 Introduction An estimated two million people are diagnosed with some form of skin cancer annually [60]. These cancers include malignant melanoma, basal cell carcinoma, and squamous cell carcinoma. Melanoma represents only 5% of all new cases, however it accounts for the vast majority of skin cancer deaths globally [49]. In the United States alone, melanoma was estimated to claim 9,730 lives in 2016 [60]. Melanoma in situ, a very early stage of melanoma, can add tens of thousands of more cases, as it is growing at 15% per year [63] . Five-year survival rates for patients whose melanoma is detected early is about 98%. If detection is much later, after the cancer has spread to the internal organs, the rate drops to only 15% [60]. It is readily apparent that early detection is critical to patient survival. A substantial percentage of melanomas are initially detected through self-examination [6][51], using the checklists recommended by the American Academy of Dermatology. It is difficult, however, for even trained physicians to differentiate between melanoma and other pigmented skin lesions solely by visual inspection [48][10]. As a result, physicians routinely order biopsies if the lesion looks suspicious. The inaccuracy of visual inspection methods has motivated the development of automated screening systems that can recognize melanoma at an early stage. If suffi- ciently accurate, these systems have the potential to detect melanoma at a treatable stage, while eliminating at least some of the unnecessary invasive procedures such as scraping or excision.
Recommended publications
  • Homology Inference from Point Cloud Data
    Homology Inference from Point Cloud Data Yusu Wang Abstract. In this paper, we survey a common framework of estimating topo- logical information from point cloud data (PCD) that has been developed in the field of computational topology in the past twenty years. Specifically, we focus on the inference of homological information. We briefly explain the main ingredients involved, and present some basic results. This chapter is part of the AMS short course on Geometry and Topology in Statistical Inference. It aims to introduce the general mathematical audience to the problem of topol- ogy inference, but is not meant to be a comprehensive review of the field in general. 1. Introduction The past two decades have witnessed tremendous development in the field of computational and applied topology. Much progress has been made not only in the theoretical and computational fronts, but also in the application of topological methods / ideas for data analysis. For example, on the theoretical front, there has been elegant work on the so-called persistent homology, originally proposed in [52] 1, and further developed in [15, 16, 19, 22, 33, 94] etc. On the application front, topological methods have been successfully used in fields such as computer graphics e.g, [67, 82, 42], visualization e.g, [10, 77, 76], geometric reconstruction and meshing e.g, [2, 41, 90], sensor network e.g, [38, 39, 61, 88, 91], high dimensional data analysis e.g, [59, 60, 84, 89] and so on. Indeed, topological data analysis (TDA) has become a vibrant field attracting researchers from mathematics, computer science and statistics.
    [Show full text]
  • Evolutionary Homology on Coupled Dynamical Systems
    Evolutionary homology on coupled dynamical systems Zixuan Cang1, Elizabeth Munch1;2 and Guo-Wei Wei1;3;4 ∗ 1 Department of Mathematics 2Department of Computational Mathematics, Science and Engineering 3 Department of Biochemistry and Molecular Biology 4 Department of Electrical and Computer Engineering Michigan State University, MI 48824, USA February 14, 2018 Abstract Time dependence is a universal phenomenon in nature, and a variety of mathematical models in terms of dynamical systems have been developed to understand the time-dependent behavior of real-world problems. Originally constructed to analyze the topological persistence over spatial scales, persistent homology has rarely been devised for time evolution. We propose the use of a new filtration function for persistent homology which takes as input the adjacent oscillator trajecto- ries of coupled dynamical systems. We also regulate the dynamical system by a weighted graph Laplacian matrix derived from the network of interest, which embeds the topological connectivity of the network into the dynamical system. The resulting topological signatures, which we call evolution- ary homology (EH) barcodes, reveal the topology-function relationship of the network and thus give rise to the quantitative analysis of nodal properties. The proposed EH is applied to protein residue networks for protein thermal fluctuation analysis, rendering the most accurate B-factor prediction of a set of 364 proteins. This work extends the utility of dynamical systems to the quantitative modeling and analysis of realistic physical systems. Contents 1 Introduction 2 2 Methods 4 2.1 Coupled dynamical systems . .4 arXiv:1802.04677v1 [math.AT] 13 Feb 2018 2.1.1 Systems configuration .
    [Show full text]
  • Computational Topology in Image Context Massimo Ferri, Patrizio Frosini, Claudia Landi and Barbara Di Fabio
    1 Computational Topology in Image Context Massimo Ferri, Patrizio Frosini, Claudia Landi and Barbara Di Fabio Keywords—Shape description, shape comparison, persistent ho- a single point. The persistence diagram is obtained by pairing mology, Reeb graphs births and death of topological feature along a filtration of X by sublevel sets of f . I. INTRODUCTION f1 f2 It is well known that visual perception and topology are closely related to each other. In his book “The child’s concep- m m tion of space” Jean Piaget and Barbel¨ Inhelder wrote “the child e e starts by building up and using certain primitive relationships ... [which] correspond to those termed ‘topological’ by geo- d d metricians.” Even if further research in cognitive science has c c moderated this position, the importance of topology in visual perception is no longer questionable. b b The topological approach can be seen both as a method a a to reduce the usually big amount of information that is X Y associated with visual data, and as a particularly convenient and meaningful pass key to human perception [1], [2]. Fig. 1. Two curves which are not distinguishable by persistence modules, Recent progress in applied topology allows for its use but with nonvanishing natural pseudodistance. beyond low level image processing, extending the application of topological techniques to image interpretation and analysis. However, this often requires advanced and not trivial theoret- Metrics measuring the dissimilarity of Reeb graphs and ical frameworks that are still not well known in the computer persistence spaces are thus a proxy to direct shape comparison.
    [Show full text]
  • The Topology of Data
    techniques. In the approximation that the vacuum density is just the volume form on moduli space, the surface area of the boundary will just be thesurfaceareaoftheboundary in moduli space. Taking the region R to be a sphere in moduli space of radius r,wefind A(S1) √K Developing Topological Observables V (S1) ∼ r so the condition Eq. (5.2) becomes K L> . (5.3) in Cosmology r2 Thus, if we consider a large enough region, or the entire moduli space in order to find the total number of vacua, the condition for the asymptotic vacuum counting formulas we have discussed in this work to hold is L>cKwith some order one coe∼cient. But if we Alex Cole subdivide the region into subregions which do not satisfy Eq.(5.3),wewillfindthatthe number of vacua in each subregion will show oscillations around this centralThe scaling. In Topology of Data:PhD Thesis Defense fact, most regions will contain a smaller number of vacua (like S above), while a few should ∼ August 12, 2020 have anomalously large numbers (like S above), averaging out toFrom Eq. (5.1). String Theory to Cosmology to Phases of Matter 5.1 Flux vacua on rigid Calabi-Yau As an illustration of this, consider the following toy problem with K =4,studiedin[1].Theconfiguration space is simply the fundamental region of the upper 3 half plane, parameterized by ∼.Thefluxsuperpoten- tials are taken to be W = A∼ + B with A = a1 + ia2 and B = b1 + ib2 each taking values 2 in Z+iZ.Thiswouldbeobtainedifweconsideredflux vacua on a rigid Calabi-Yau, with no complex structure moduli, b3 =2,andtheperiods√1 =1and√2 = i.
    [Show full text]
  • Arxiv:0811.1868V2 [Cs.CG]
    NECESSARY CONDITIONS FOR DISCONTINUITIES OF MULTIDIMENSIONAL SIZE FUNCTIONS A. CERRI AND P. FROSINI Abstract. Some new results about multidimensional Topological Persistence are presented, proving that the discontinuity points of a k-dimensional size function are necessarily related to the pseudocritical or special values of the associated measuring function. Introduction Topological Persistence is devoted to the study of stable properties of sublevel sets of topological spaces and, in the course of its development, has revealed itself to be a suitable framework when dealing with applications in the field of Shape Analysis and Comparison. Since the beginning of the 1990s research on this sub- ject has been carried out under the name of Size Theory, studying the concept of size function, a mathematical tool able to describe the qualitative properties of a shape in a quantitative way. More precisely, the main idea is to model a shape by a topological space endowed with a continuous function ϕ, called measuring function. Such a functionM is chosen according to applications and can be seen as a descriptor of the features considered relevant for shape characterization. Under these assumptions, the size function ℓ( ,ϕ) associated with the pair ( , ϕ) is a de- scriptor of the topological attributes thatM persist in the sublevel setsM of induced by the variation of ϕ. According to this approach, the problem of comparingM two shapes can be reduced to the simpler comparison of the related size functions. Since their introduction, these shape descriptors have been widely studied and applied in quite a lot of concrete applications concerning Shape Comparison and Pattern Recognition (cf., e.g., [4, 8, 15, 34, 35, 36]).
    [Show full text]
  • Size Functions for Shape Recognition in the Presence of Occlusions
    Size functions for shape recognition in the presence of occlusions Barbara Di Fabio1 Claudia Landi2∗ 1Dipartimento di Matematica, Universita` di Bologna, Italia 2Dipartimento di Scienze e Metodi dell'Ingegneria, Universita` di Modena e Reggio Emilia, Italia September 2, 2009 Abstract In Computer Vision the ability to recognize objects in the presence of occlusions is a necessary requirement for any shape representation method. In this paper we investigate how the size function of an object shape changes when a portion of the object is occluded by another object. More precisely, considering a set X = A [ B and a measuring function ϕ on X, we establish a condition so that `(X;j) = `(A;jjA) + `(B;jjB) − `(A\B;jjA\B). The main tool we use is the Mayer-Vietoris sequence of Cechˇ homology groups. This result allows us to prove that size functions are able to detect partial matching between shapes by showing a common subset of cornerpoints. Keywords: Cechˇ homology, Mayer-Vietoris sequence, persistent homology, shape occlusion MSC (2000): 55N05, 68U05 1 Introduction Shape matching and retrieval are key aspects in the design of search engines based on visual, rather than keyword, information. Generally speaking, shape matching methods rely on the computation of a shape description, also called a signature, that effectively captures some essential features of the object. The ability to perform not only global matching, but also partial matching, is regarded as one of the most meaningful properties in order to evaluate the performance of a shape matching method (cf., e.g., [36]). Basically, the interest in robustness against partial occlusions is motivated by the problem of recognizing an object partially hidden by some other foreground object in the same image.
    [Show full text]
  • Algebraic Topological Methods in Computer Science (Atmcs) Iii
    ALGEBRAIC TOPOLOGICAL METHODS IN COMPUTER SCIENCE (ATMCS) III Paris, France July 7-11, 2008 1 2 ALGEBRAIC TOPOLOGICAL METHODS IN COMPUTER SCIENCE (ATMCS) III July 7 8:00 8:50 refreshments and orientation 8:50 9:00 opening remarks 9:00 10:00 invited talk Trace spaces: Organization, calculations, applications (Martin Raussen) 10:00 10:15 break 10:15 11:15 invited talk (Timothy Porter) 11:15 11:30 break 11:30 12:00 Geometric databases (David Spivak) 12:00 12:30 Simulation hemi-metrics for timed Systems, with relations to ditopology (Uli Fahren- berg) 12:30 14:30 lunch 14:30 15:30 invited talk Computation and the periodic table (John Baez) 15:30 15:45 break 15:45 16:45 invited talk Structure discovery, routing and information brokerage in sensor net- works (Leonidas Guibas) 16:45 17:00 break 17:00 17:30 Topological methods for predicting the character of response surface z = f(x; y) while elastomer systems studying (Ilya Gorelkin) 17:30 18:00 Extremal models of concurrent systems, the fundamental bipartite graph, and di- rected van Kampen theorems (Peter Bubenik) The end of the booklet contains all available abstracts, in alphabetical order of presenter. ALGEBRAIC TOPOLOGICAL METHODS IN COMPUTER SCIENCE (ATMCS) III 3 July 8 8:30 9:00 refreshments 9:00 10:00 invited talk (speaker Dmitry Kozlov) 10:00 10:15 break 10:15 11:15 invited talk Locally ordered spaces as sheaves (Krzysztof Worytkiewicz) 11:15 11:30 break 11:30 12:30 A survey on homological perturbation theory (Johannes Huebschmann) 12:30 14:30 lunch 14:30 15:30 invited talk Path categories
    [Show full text]
  • Proceedings in a Single
    Schedule of Talks 27th Annual Fall Workshop on Computational Geometry November 3–4, 2017 Stony Brook University Stony Brook, NY Sponsored by the National Science Foundation and the Department of Applied Mathematics and Statistics and the Department of Computer Science, College of Engineering and Applied Sciences, Stony Brook University Program Committee: GregAloupis(Tufts) PeterBrass(CCNY) WilliamRandolphFranklin(RPI) JieGao(co-chair,StonyBrook) MayankGoswami(QueensCollege) MatthewP.Johnson(Lehman College) Joseph S. B. Mitchell (co-chair, Stony Brook) Don Sheehy (University of Connecticut) Jinhui Xu (University at Buffalo) Friday, November 3, 2017: Computer Science (NCS), Room 120 9:20 Opening remarks Contributed Session 1 Chair: Joe Mitchell 9:30–9:45 “Sampling Conditions for Clipping-free Voronoi Meshing by the VoroCrust Algorithm”, Scott Mitchell, Ahmed Abdelkader, Ahmad Rushdi, Mohamed Ebeida, Ahmed Mahmoud, John Owens and Chandrajit Bajaj 9:45–10:00 “Quasi-centroidal Simplicial Meshes for Optimal Variational Methods”, Xiangmin Jiao 10:00–10:15 “A Parallel Approach for Computing Discrete Gradients on Mulfiltrations”, Federico Iuricich, Sara Scaramuccia, Claudia Landi and Leila De Floriani 10:15–10:30 “Parallel Intersection Detection in Massive Sets of Cubes”, W. Randolph Franklin and Salles V.G. Magalh˜aes 10:30–11:00 Break. Contributed Session 2 Chair: Jie Gao 11:00–11:15 “Dynamic Orthogonal Range Searching on the RAM, Revisited”, Timothy M. Chan and Konstantinos Tsakalidis 11:15–11:30 “Faster Algorithm for Truth Discovery via Range
    [Show full text]
  • Shape Comparison by Size Functions and Natural Size Distances
    RECENT ADVANCES IN MULTIDIMENSIONAL PERSISTENT TOPOLOGY Patrizio Frosini Vision Mathematics Group University of Bologna - Italy OUTLINE • Introduction and history • Multidimensional size functions • Reduction by the foliation method • Consequences: 1) STABILITY 2) LOCALIZATION OF DISCONTINUITIES • Conclusions Patrizio Frosini (U. Bologna) - MULTIDIMENSIONAL PERSISTENT TOPOLOGY 2/50 Let us start from three questions... Patrizio Frosini (U. Bologna) - MULTIDIMENSIONAL PERSISTENT TOPOLOGY 3/50 How similar are the colorings of these leaves? Patrizio Frosini (U. Bologna) - MULTIDIMENSIONAL PERSISTENT TOPOLOGY 4/50 How similar are the Riemannian structures of these manifolds? Patrizio Frosini (U. Bologna) - MULTIDIMENSIONAL PERSISTENT TOPOLOGY 5/50 How similar are the spatial positions of these wires? Patrizio Frosini (U. Bologna) - MULTIDIMENSIONAL PERSISTENT TOPOLOGY 6/50 All previous questions concern the comparison between pairs (M,ϕ), where M is a topological space and ϕ: M → IRk is a continuous function: ●LEAVES: ϕ(P)=color (r,g,b) at P ●3D MODELS: ϕ(P)=vector describing the first fundamental form at P •WIRES: ϕ(P)=(x(P),y(P),z(P)) Patrizio Frosini (U. Bologna) - MULTIDIMENSIONAL PERSISTENT TOPOLOGY 7/50 Multidimensional Persistent Topology studies the topological properties of the sublevel sets of continuous functions ϕ: M → IRk. It focuses on those properties that “persist” under change of the level. REMARK: the word multidimensional refers to the dimension of IRk, not the dimension of M! Patrizio Frosini (U. Bologna) - MULTIDIMENSIONAL PERSISTENT TOPOLOGY 8/50 From the beginning of the 90's, the study of Persistent Topology has originated different lines of research. Patrizio Frosini (U. Bologna) - MULTIDIMENSIONAL PERSISTENT TOPOLOGY 9/50 Persistence of connected components (size functions): P.
    [Show full text]
  • Dipartimento Di Informatica, Bioingegneria, Robotica Ed Ingegneria Dei Sistemi
    Dipartimento di Informatica, Bioingegneria, Robotica ed Ingegneria dei Sistemi Computational and Theoretical Issues of Multiparameter Persistent Homology for Data Analysis by Sara Scaramuccia Theses Series DIBRIS-TH-2018-16 DIBRIS, Università di Genova Via Opera Pia, 13 16145 Genova, Italy http://www.dibris.unige.it/ Università degli Studi di Genova Dipartimento di Informatica, Bioingegneria, Robotica ed Ingegneria dei Sistemi Ph.D. Thesis in Computer Science and Systems Engineering Computer Science Curriculum Computational and Theoretical Issues of Multiparameter Persistent Homology for Data Analysis by Sara Scaramuccia May, 2018 Dottorato di Ricerca in Informatica ed Ingegneria dei Sistemi Indirizzo Informatica Dipartimento di Informatica, Bioingegneria, Robotica ed Ingegneria dei Sistemi Università degli Studi di Genova DIBRIS, Univ. di Genova Via Opera Pia, 13 I-16145 Genova, Italy http://www.dibris.unige.it/ Ph.D. Thesis in Computer Science and Systems Engineering Computer Science Curriculum (S.S.D. INF/01) Submitted by Sara Scaramuccia DIBRIS, Univ. di Genova [email protected] Date of submission: January 2018 Title: Computational and Theoretical Issues of Multiparameter Persistent Homology for Data Analysis Advisors: Leila De Floriani Dipartimento di Informatica, Bioingegneria, Robotica ed Ingegneria dei Sistemi Università di Genova, Italy leila.defl[email protected] Claudia Landi Dipartimento di Scienze e Metodi dell’Ingegneria Università di Modena e Reggio Emilia, Italy [email protected] Ext. Reviewers: *Silvia Biasotti, `Heike Leitte *Istituto di Matematica Applicata e Tecnologie Informatiche "Enrico Magenes" Consiglio Nazionale delle Ricerche: Genova, Italy [email protected] ` Fachbereich Informatik Technische Universität Kaiserslautern, Germany [email protected] 4 Abstract The basic goal of topological data analysis is to apply topology-based descriptors to understand and describe the shape of data.
    [Show full text]
  • Persistence, Metric Invariants, and Simplification
    Persistence, Metric Invariants, and Simplification Dissertation Presented in Partial Fulfillment of the Requirements for the Degree Doctor of Philosophy in the Graduate School of The Ohio State University By Osman Berat Okutan, M.S. Graduate Program in Mathematics The Ohio State University 2019 Dissertation Committee: Facundo M´emoli,Advisor Matthew Kahle Jean-Franc¸oisLafont c Copyright by Osman Berat Okutan 2019 Abstract Given a metric space, a natural question to ask is how to obtain a simpler and faithful approximation of it. In such a situation two main concerns arise: How to construct the approximating space and how to measure and control the faithfulness of the approximation. In this dissertation, we consider the following simplification problems: Finite approx- imations of compact metric spaces, lower cardinality approximations of filtered simplicial complexes, tree metric approximations of metric spaces and finite metric graph approxima- tions of compact geodesic spaces. In each case, we give a simplification construction, and measure the faithfulness of the process by using the metric invariants of the original space, including the Vietoris-Rips persistence barcodes. ii For Esra and Elif Beste iii Acknowledgments First and foremost I'd like to thank my advisor, Facundo M´emoli,for our discussions and for his continual support, care and understanding since I started working with him. I'd like to thank Mike Davis for his support especially on Summer 2016. I'd like to thank Dan Burghelea for all the courses and feedback I took from him. Finally, I would like to thank my wife Esra and my daughter Elif Beste for their patience and support.
    [Show full text]
  • Multidimensional Size Functions for Shape Comparison
    J Math Imaging Vis DOI 10.1007/s10851-008-0096-z Multidimensional Size Functions for Shape Comparison S. Biasotti · A. Cerri · P. Frosini · D. Giorgi · C. Landi © Springer Science+Business Media, LLC 2008 Abstract Size Theory has proven to be a useful framework proof of their stability with respect to that distance. Experi- for shape analysis in the context of pattern recognition. Its ments are carried out to show the feasibility of the method. main tool is a shape descriptor called size function. Size Theory has been mostly developed in the 1-dimensional set- Keywords Multidimensional size function · ting, meaning that shapes are studied with respect to func- Multidimensional measuring function · Natural tions, defined on the studied objects, with values in R.The pseudo-distance potentialities of the k-dimensional setting, that is using func- tions with values in Rk, were not explored until now for lack of an efficient computational approach. In this paper we pro- 1 Introduction vide the theoretical results leading to a concise and complete shape descriptor also in the multidimensional case. This is In the shape analysis literature many methods have been de- possible because we prove that in Size Theory the compar- veloped, which describe shapes making use of properties ison of multidimensional size functions can be reduced to of real-valued functions defined on the studied object. Usu- the 1-dimensional case by a suitable change of variables. In- ally, the role of these functions is to quantitatively measure deed, a foliation in half-planes can be given, such that the re- the geometric properties of the shape while taking into ac- striction of a multidimensional size function to each of these count its topology.
    [Show full text]