A Learning Classifier System Approach

Total Page:16

File Type:pdf, Size:1020Kb

A Learning Classifier System Approach The Detection and Characterization of Epistasis and Heterogeneity: A Learning Classifier System Approach A Thesis Submitted to the Faculty in partial fulfillment of the requirements for the degree of Doctor of Philosophy in Genetics by Ryan John Urbanowicz DARTMOUTH COLLEGE Hanover, New Hampshire March 2012 Examining Committee: Professor Jason H. Moore (chair) Professor Michael L. Whitfield Professor Margaret J. Eppstein Professor Robert H. Gross Professor Tricia A. Thornton-Wells Brian W. Pogue, Ph.D. Dean of Graduate Studies c 2012 - Ryan John Urbanowicz All rights reserved. Abstract As the ubiquitous complexity of common disease has become apparent, so has the need for novel tools and strategies that can accommodate complex patterns of association. In particular, the analytic challenges posed by the phenomena known as epistasis and heterogeneity have been largely ignored due to the inherent difficulty of approaching multifactor non-linear relationships. The term epistasis refers to an interaction effect between factors contributing to diesease risk. Heterogeneity refers to the occurrence of similar or identical phenotypes by means of independent con- tributing factors. Here we focus on the concurrent occurrence of these phenomena as they are likely to appear in studies of common complex disease. In order to ad- dress the unique demands of heterogeneity we break from the traditional paradigm of epidemiological modeling wherein the objective is the identification of a single best model describing factors contributing to disease risk. Here we develop, evaluate, and apply a learning classifier system (LCS) algorithm to the identification, modeling, and characterization of susceptibility factors in the concurrent presence of heterogeneity and epistasis. This work includes an examination of existing LCS algorithms, the development of new strategies to address the inherent problem of knowledge discov- ery in LCSs, the introduction of novel heuristics that improve LCS performance and allow for the explicit characterization of heterogeneity, and an application of these ii Abstract collective advancements to an investigation of bladder cancer susceptibility. Success- ful analysis of simulated and real-world complex disease associations demonstrates the validity and unique potential of this LCS approach. iii Citations to Previously Published Work Chapter 1 has been published as: R. J. Urbanowicz, and J. H. Moore. Learning Classifier Systems: A Com- plete Introduction, Review, and Roadmap. Journal of Artificial Evolution and Applications. (2009):1, 2009. Chapter 2 has been published as: R. J. Urbanowicz, and J. H. Moore. The Application of Michigan-Style Learning Classifier Systems to Address Genetic Heterogeneity and Epis- tasis in Association Studies. Proceedings of the 12th Annual Conference on Genetic and Evolutionary Computation. 195–202, 2010. Chapter 3 has been published as: R. J. Urbanowicz, and J. H. Moore. The Application of Pittsburgh-Style Learning Classifier Systems to Address Genetic Heterogeneity and Epis- tasis in Association Studies. Parallel Problem Solving from Nature–PPSN XI. 404–413, 2011 Chapter 4 has been submitted as: R. J. Urbanowicz, A. Granizo-Mackenzie, and J. H. Moore. An Analysis Pipeline with Visualization-Guided Knowledge Discovery for Michigan- Style Learning Classifier Systems. Chapter 5 has been submitted as: R. J. Urbanowicz, A. Granizo-Mackenzie, and J. H. Moore. Instance- Linked Attribute Tracking and Feedback for Michigan-Style Supervised Learning Classifier Systems. iv Acknowledgments This work would not have been possible without the financial support of the Na- tional Institute of Health (LM010098, LM009012, and AI59694), and the William H. Neukom Institute for Computational Science at Dartmouth College. I am grateful for the expertise of my thesis committee including Margaret Eppstein and Michael Whit- field, each of whom provided excellent professional guidance throughout this research. I would especially like to thank my advisor Jason Moore for his support, trust, and guidance throughout the development of this work. I would also like to acknowledge the contribution of my co-authors who have been instrumental in the publication of works included in this thesis. Ambrose Granizo-Mackenzie has contributed to chap- ters 4 and 5. Jeff Kiralis, Jonathan Fisher, Nicholas Sinnott-Armstrong, and Tamra Heberling contributed to the development of the model simulation strategy utilized in chapters 2, 3, 4, and 5 (submitted for publication outside of this thesis). I am grate- ful to Angeline Andrew and Margaret Karagas for their collaboration on chapter 6, and for making the bladder cancer susceptibility data available for analysis. I am fortunate to have worked alongside such a diverse and inquisitive group of students and post-docs. In particular I would like to thank Chantel Sloan, Anna Tyler, Casey Greene, Nicholas Sinnott-Armstrong, Delaney Granizo-Mackenzie, Ambrose Granizo- Mackenzie, Arvis Sulovari, Ting Hu, Richard Cowper, Cristian Dabaros, Josh Payne, and Dov Pechenick for their insights and day to day collaboration. I’d also like to v Acknowledgments thank Peter Andrews, Jonathan Fisher, and Bill White for their programming ex- pertise and Anna Green for having developed the LaTeX thesis template used here. Lastly I thank my all my teachers, mentors, family, and friends for their encourage- ment, love, and support. A special thanks to the Andrews family. vi Contents TitlePage.................................... i Abstract..................................... ii Citations to Previously Published Work . iv Acknowledgments................................ v TableofContents................................ vii ListofTables .................................. xi ListofFigures.................................. xii Dedication.................................... xvi 1 An Introduction and Review of Learning Classifier Systems 1 1.1 Introduction................................ 2 1.2 AGeneralLCS .............................. 5 1.3 TheDrivingMechanisms......................... 6 1.3.1 Discovery - The Genetic Algorithm . 6 1.3.2 Learning.............................. 10 1.4 AMinimalClassifierSystem ....................... 12 1.5 HistoricalPerspective........................... 16 1.5.1 TheEarlyYears.......................... 17 1.5.2 TheRevolution .......................... 29 1.5.3 IntheWakeofXCS ....................... 32 1.5.4 Revisiting the Pitt . 34 1.5.5 Visualization . 35 1.6 ProblemDomains............................. 35 1.7 Biological Applications . 37 1.8 OptimizingLCS.............................. 38 1.9 ComponentRoadmap........................... 39 1.9.1 DetectorsandEffectors. 40 vii Contents 1.9.2 Population............................. 41 1.9.3 PerformanceComponentandSelection . 44 1.9.4 ReinforcementComponent . 47 1.9.4.1 BucketBrigade ..................... 47 1.9.4.2 Q-Learning-based. 49 1.9.4.3 More Credit Assignment . 51 1.9.5 DiscoveryComponents . 52 1.9.6 BeyondtheBasics ........................ 54 1.10Conclusion................................. 55 1.11Resources ................................. 57 1.12Bibliography................................ 58 2 The Application of Michigan-Style Learning Classifier Systems to Address Genetic Heterogeneity and Epistasis in Association Studies 97 2.1 Introduction................................ 98 2.2 Methods.................................. 102 2.2.1 LearningClassifierSystems . 102 2.2.1.1 Implementing LCS . 104 2.2.1.2 Three Michigan-Style LCSs . 107 2.2.2 Data Simulation . 108 2.2.3 SystemEvaluations. .. .. 110 2.3 Results................................... 113 2.3.1 ParameterSweep ......................... 113 2.3.2 Michigan LCS Evaluation . 114 2.4 DiscussionandConclusions . 117 2.5 Bibliography................................ 121 3 The Application of Pittsburgh-Style Learning Classifier Systems to Address Genetic Heterogeneity and Epistasis in Association Studies128 3.1 Introduction................................ 129 3.2 Methods.................................. 132 3.2.1 LearningClassifierSystems . 132 3.2.1.1 Implementing LCSs . 133 3.2.1.2 Two Pittsburgh-Style LCSs . 134 3.2.2 SystemEvaluations. .. .. 135 3.3 Results................................... 137 3.3.1 ParameterSweep ......................... 137 3.3.2 PittsburghLCSEvaluations . 139 viii Contents 3.4 DiscussionandConclusions . 144 3.5 Bibliography................................ 146 4 An Analysis Pipeline with Visualization-Guided Knowledge Discov- ery for Michigan-Style Learning Classifier Systems: Interpreting the Black Box 149 4.1 Introduction................................ 150 4.2 LearningClassifierSystems . 154 4.3 BioinformaticsProblem. .. .. 156 4.4 Analysis Pipeline . 160 4.4.1 RuntheM-LCS.......................... 160 4.4.2 Significance Testing of M-LCS Statistics . 161 4.4.3 Visualization - Heat-Map . 169 4.4.4 Visualization - Co-Occurrence Network . 173 4.4.5 Guided Knowledge Discovery . 174 4.4.6 NegativeControlAnalysis . 176 4.5 Conclusions ................................ 177 4.6 Bibliography................................ 178 5 Instance-Linked Attribute Tracking and Feedback for Michigan-Style Supervised Learning Classifier Systems 184 5.1 Introduction................................ 185 5.2 Methods.................................. 188 5.2.1 M-LCS............................... 189 5.2.2 AttributeTracking ........................ 190 5.2.3 AttributeFeedback . 193 5.2.3.1 Mutation ........................ 194 5.2.3.2 UniformCrossover . 195 5.2.4 Evaluation............................
Recommended publications
  • Unified Modeling Language 2.0 Part 1 - Introduction
    UML 2.0 – Tutorial (v4) 1 Unified Modeling Language 2.0 Part 1 - Introduction Prof. Dr. Harald Störrle Dr. Alexander Knapp University of Innsbruck University of Munich mgm technology partners (c) 2005-2006, Dr. H. Störrle, Dr. A. Knapp UML 2.0 – Tutorial (v4) 2 1 - Introduction History and Predecessors • The UML is the “lingua franca” of software engineering. • It subsumes, integrates and consolidates most predecessors. • Through the network effect, UML has a much broader spread and much better support (tools, books, trainings etc.) than other notations. • The transition from UML 1.x to UML 2.0 has – resolved a great number of issues; – introduced many new concepts and notations (often feebly defined); – overhauled and improved the internal structure completely. • While UML 2.0 still has many problems, current version (“the standard”) it is much better than what we ever had formal/05-07-04 of August ‘05 before. (c) 2005-2006, Dr. H. Störrle, Dr. A. Knapp UML 2.0 – Tutorial (v4) 3 1 - Introduction Usage Scenarios • UML has not been designed for specific, limited usages. • There is currently no consensus on the role of the UML: – Some see UML only as tool for sketching class diagrams representing Java programs. – Some believe that UML is “the prototype of the next generation of programming languages”. • UML is a really a system of languages (“notations”, “diagram types”) each of which may be used in a number of different situations. • UML is applicable for a multitude of purposes, during all phases of the software lifecycle, and for all sizes of systems - to varying degrees.
    [Show full text]
  • Meta-Class Features for Large-Scale Object Categorization on a Budget
    Meta-Class Features for Large-Scale Object Categorization on a Budget Alessandro Bergamo Lorenzo Torresani Dartmouth College Hanover, NH, U.S.A. faleb, [email protected] Abstract cation accuracy over a predefined set of classes, and without consideration of the computational costs of the recognition. In this paper we introduce a novel image descriptor en- We believe that these two assumptions do not meet the abling accurate object categorization even with linear mod- requirements of modern applications of large-scale object els. Akin to the popular attribute descriptors, our feature categorization. For example, test-recognition efficiency is a vector comprises the outputs of a set of classifiers evaluated fundamental requirement to be able to scale object classi- on the image. However, unlike traditional attributes which fication to Web photo repositories, such as Flickr, which represent hand-selected object classes and predefined vi- are growing at rates of several millions new photos per sual properties, our features are learned automatically and day. Furthermore, while a fixed set of object classifiers can correspond to “abstract” categories, which we name meta- be used to annotate pictures with a set of predefined tags, classes. Each meta-class is a super-category obtained by the interactive nature of searching and browsing large im- grouping a set of object classes such that, collectively, they age collections calls for the ability to allow users to define are easy to distinguish from other sets of categories. By us- their own personal query categories to be recognized and ing “learnability” of the meta-classes as criterion for fea- retrieved from the database, ideally in real-time.
    [Show full text]
  • Sysml Distilled: a Brief Guide to the Systems Modeling Language
    ptg11539604 Praise for SysML Distilled “In keeping with the outstanding tradition of Addison-Wesley’s techni- cal publications, Lenny Delligatti’s SysML Distilled does not disappoint. Lenny has done a masterful job of capturing the spirit of OMG SysML as a practical, standards-based modeling language to help systems engi- neers address growing system complexity. This book is loaded with matter-of-fact insights, starting with basic MBSE concepts to distin- guishing the subtle differences between use cases and scenarios to illu- mination on namespaces and SysML packages, and even speaks to some of the more esoteric SysML semantics such as token flows.” — Jeff Estefan, Principal Engineer, NASA’s Jet Propulsion Laboratory “The power of a modeling language, such as SysML, is that it facilitates communication not only within systems engineering but across disci- plines and across the development life cycle. Many languages have the ptg11539604 potential to increase communication, but without an effective guide, they can fall short of that objective. In SysML Distilled, Lenny Delligatti combines just the right amount of technology with a common-sense approach to utilizing SysML toward achieving that communication. Having worked in systems and software engineering across many do- mains for the last 30 years, and having taught computer languages, UML, and SysML to many organizations and within the college setting, I find Lenny’s book an invaluable resource. He presents the concepts clearly and provides useful and pragmatic examples to get you off the ground quickly and enables you to be an effective modeler.” — Thomas W. Fargnoli, Lead Member of the Engineering Staff, Lockheed Martin “This book provides an excellent introduction to SysML.
    [Show full text]
  • UML Basics: the Component Diagram
    English Sign in (or register) Technical topics Evaluation software Community Events UML basics: The component diagram Donald Bell ([email protected]), IT Architect, IBM Corporation Summary: from The Rational Edge: This article introduces the component diagram, a structure diagram within the new Unified Modeling Language 2.0 specification. Date: 15 Dec 2004 Level: Introductory Also available in: Chinese Vietnamese Activity: 259392 views Comments: 3 (View | Add comment - Sign in) Average rating (629 votes) Rate this article This is the next installment in a series of articles about the essential diagrams used within the Unified Modeling Language, or UML. In my previous article on the UML's class diagram, (The Rational Edge, September 2004), I described how the class diagram's notation set is the basis for all UML 2's structure diagrams. Continuing down the track of UML 2 structure diagrams, this article introduces the component diagram. The diagram's purpose The component diagram's main purpose is to show the structural relationships between the components of a system. In UML 1.1, a component represented implementation items, such as files and executables. Unfortunately, this conflicted with the more common use of the term component," which refers to things such as COM components. Over time and across successive releases of UML, the original UML meaning of components was mostly lost. UML 2 officially changes the essential meaning of the component concept; in UML 2, components are considered autonomous, encapsulated units within a system or subsystem that provide one or more interfaces. Although the UML 2 specification does not strictly state it, components are larger design units that represent things that will typically be implemented using replaceable" modules.
    [Show full text]
  • The Validation Possibility of Topological Functioning Model Using the Cameo Simulation Toolkit
    The Validation Possibility of Topological Functioning Model using the Cameo Simulation Toolkit Viktoria Ovchinnikova and Erika Nazaruka Department of Applied Computer Science, Riga Technical University, Setas Street 1, Riga, Latvia Keywords: Topological Functioning Model, Execution Model, Foundational UML, UML Activity Diagram. Abstract: According to requirements provided by customers, the description of to-be functionality of software systems needs to be provided at the beginning of the software development process. Documentation and functionality of this system can be displayed as the Topological Functioning Model (TFM) in the form of a graph. The TFM must be correctly and traceably validated, according to customer’s requirements and verified, according to TFM construction rules. It is necessary for avoidance of mistakes in the early stage of development. Mistakes are a risk that can bring losses of resources or financial problems. The hypothesis of this research is that the TFM can be validated during this simulation of execution of the UML activity diagram. Cameo Simulation Toolkit from NoMagic is used to supplement UML activity diagram with execution and allows to simulate this execution, providing validation and verification of the diagram. In this research an example of TFM is created from the software system description. The obtained TFM is manually transformed to the UML activity diagram. The execution of actions of UML activity diagrams was manually implemented which allows the automatic simulation of the model. It helps to follow the traceability of objects and check the correctness of relationships between actions. 1 INTRODUCTION It represents the full scenario of system functionality and its relationships. Development of the software system is a complex The simulation of models can help to see some and stepwise process.
    [Show full text]
  • UML Class Diagrams UML Is a Graphical Language for Recording Aspects of the Requirements and Design of Software Systems
    The Unified Modeling Language UML class diagrams UML is a graphical language for recording aspects of the requirements and design of software systems. Nigel Goddard It provides many diagram types; all the diagrams of a system together form a UML model. Three important types of diagram: School of Informatics 1. Use-case diagram. Already seen in requirements lecture. University of Edinburgh 2. Class diagram. Today. 3. Interaction diagram. In the future. Reminder: a simple use case diagram A class Reserve book Browse Browser BookBorrower Book Borrow copy of book A class as design entity is an example of a model element: the Return copy of book rectangle and text form an example of a corresponding presentation element. Extend loan UML explicitly separates concerns of actual symbols used vs Update catalogue meaning. Many other things can be model elements: use cases, actors, Borrow journal Librarian associations, generalisation, packages, methods,... Return journal JournalBorrower An object Classifiers and instances An aspect of the UML metamodel that it's helpful to understand up front. jo : Customer An instance is to a classifier as an object is to a class: instance and classifier are more general terms. This pattern generalises: always show an instance of a classifier In the metamodel, Class inherits from Classifier, Object inherits using the same symbol as for the classifier, labelled from Instance. instanceName : classifierName. UML defines many different classifiers. E.g., UseCase and Actor are classifiers. Showing attributes and operations Compartments We saw the standard: Book a compartment for attributes title : String I I a compartment for operations, below it copiesOnShelf() : Integer borrow(c:Copy) They can be suppressed in diagrams.
    [Show full text]
  • Integration of Model-Based Systems Engineering and Virtual Engineering Tools for Detailed Design
    Scholars' Mine Masters Theses Student Theses and Dissertations Spring 2011 Integration of model-based systems engineering and virtual engineering tools for detailed design Akshay Kande Follow this and additional works at: https://scholarsmine.mst.edu/masters_theses Part of the Systems Engineering Commons Department: Recommended Citation Kande, Akshay, "Integration of model-based systems engineering and virtual engineering tools for detailed design" (2011). Masters Theses. 5155. https://scholarsmine.mst.edu/masters_theses/5155 This thesis is brought to you by Scholars' Mine, a service of the Missouri S&T Library and Learning Resources. This work is protected by U. S. Copyright Law. Unauthorized use including reproduction for redistribution requires the permission of the copyright holder. For more information, please contact [email protected]. INTEGRATION OF MODEL-BASED SYSTEMS ENGINEERING AND VIRTUAL ENGINEERING TOOLS FOR DETAILED DESIGN by AKSHA Y KANDE A THESIS Presented to the Faculty of the Graduate School of the MISSOURI UNIVERSITY OF SCIENCE AND TECHNOLOGY In Partial Fulfillment of the Requirements for the Degree MASTER OF SCIENCE IN SYSTEMS ENGINEERING 2011 Approved by Steve Corns, Advisor Cihan Dagli Scott Grasman © 2011 Akshay Kande All Rights Reserved 111 ABSTRACT Design and development of a system can be viewed as a process of transferring and transforming data using a set of tools that form the system's development environment. Conversion of the systems engineering data into useful information is one of the prime objectives of the tools used in the process. With complex systems, the objective is further augmented with a need to represent the information in an accessible and comprehensible manner.
    [Show full text]
  • OMG Unified Modeling Languagetm (OMG UML), Infrastructure
    Date : August 2011 OMG Unified Modeling LanguageTM (OMG UML), Infrastructure Version 2.4.1 OMG Document Number: formal/2011-08-05 Standard document URL: http://www.omg.org/spec/UML/2.4.1/Infrastructure Associated Normative Machine-Readable Files: http://www.omg.org/spec/UML/20110701/Infrastucture.xmi http://www.omg.org/spec/UML/20110701/L0.xmi http://www.omg.org/spec/UML/20110701/LM.xmi http://www.omg.org/spec/UML/20110701/PrimitiveTypes.xmi Version 2.4.1 supersedes formal/2010-05-04. Copyright © 1997-2011 Object Management Group Copyright © 2009-2010 88Solutions Copyright © 2009-2010 Artisan Software Tools Copyright © 2001-2010 Adaptive Copyright © 2009-2010 Armstrong Process Group, Inc. Copyright © 2001-2010 Alcatel Copyright © 2001-2010 Borland Software Corporation Copyright © 2009-2010 Commissariat à l'Energie Atomique Copyright © 2001-2010 Computer Associates International, Inc. Copyright © 2009-2010 Computer Sciences Corporation Copyright © 2009-2010 European Aeronautic Defence and Space Company Copyright © 2001-2010 Fujitsu Copyright © 2001-2010 Hewlett-Packard Company Copyright © 2001-2010 I-Logix Inc. Copyright © 2001-2010 International Business Machines Corporation Copyright © 2001-2010 IONA Technologies Copyright © 2001-2010 Kabira Technologies, Inc. Copyright © 2009-2010 Lockheed Martin Copyright © 2001-2010 MEGA International Copyright © 2009-2010 Mentor Graphics Corporation Copyright © 2009-2010 Microsoft Corporation Copyright © 2009-2010 Model Driven Solutions Copyright © 2001-2010 Motorola, Inc. Copyright © 2009-2010 National
    [Show full text]
  • Predicting the Generalization Ability of a Few-Shot Classifier
    information Article Predicting the Generalization Ability of a Few-Shot Classifier Myriam Bontonou 1,* , Louis Béthune 2 , Vincent Gripon 1 1 Département Electronique, IMT Atlantique, 44300 Nantes, France; [email protected] 2 Université Paul Sabatier, 31062 Toulouse, France; [email protected] * Correspondence: [email protected] Abstract: In the context of few-shot learning, one cannot measure the generalization ability of a trained classifier using validation sets, due to the small number of labeled samples. In this paper, we are interested in finding alternatives to answer the question: is my classifier generalizing well to new data? We investigate the case of transfer-based few-shot learning solutions, and consider three set- tings: (i) supervised where we only have access to a few labeled samples, (ii) semi-supervised where we have access to both a few labeled samples and a set of unlabeled samples and (iii) unsupervised where we only have access to unlabeled samples. For each setting, we propose reasonable measures that we empirically demonstrate to be correlated with the generalization ability of the considered classifiers. We also show that these simple measures can predict the generalization ability up to a certain confidence. We conduct our experiments on standard few-shot vision datasets. Keywords: few-shot learning; generalization; supervised learning; semi-supervised learning; transfer learning; unsupervised learning 1. Introduction In recent years, Artificial Intelligence algorithms, especially Deep Neural Networks (DNNs), have achieved outstanding performance in various domains such as vision [1], audio [2], games [3] or natural language processing [4]. They are now applied in a wide Citation: Bontonou, M.; Béthune, L.; range of fields including help in diagnosis in medicine [5], object detection [6], user behavior Gripon.
    [Show full text]
  • UML Basics: the Component Diagram
    IBM Copyright http://www.ibm.com/developerworks/rational/library/dec04/bell/index.html Country/region [select] Terms of use All of dW Home Products Services & solutions Support & downloads My account developerWorks > Rational > UML basics: The component diagram Level: Introductory Contents: The diagram's purpose Donald Bell The notation IBM Global Services, IBM The basics 15 Dec 2004 Beyond the basics from The Rational Edge: This article introduces the component diagram, a structure diagram Conclusion within the new Unified Modeling Language 2.0 specification. Notes About the author This is the next installment in a series of articles about the essential diagrams used within the Unified Modeling Language, Rate this article or UML. In my previous article on the UML’s class diagram, Subscriptions: (The Rational Edge, September 2004), I described how the dW newsletters class diagram’s notation set is the basis for all UML 2’s structure diagrams. Continuing down the track of UML 2 structure dW Subscription diagrams, this article introduces the component diagram. (CDs and downloads) The Rational Edge The diagram's purpose The component diagram’s main purpose is to show the structural relationships between the components of a system. In UML 1.1, a component represented implementation items, such as files and executables. Unfortunately, this conflicted with the more common use of the term “component," which refers to things such as COM components. Over time and across successive releases of UML, the original UML meaning of components was mostly lost. UML 2 officially changes the essential meaning of the component concept; in UML 2, components are considered autonomous, encapsulated units within a system or subsystem that provide one or more interfaces.
    [Show full text]
  • UML Components
    UML Components John Daniels Syntropy Limited [email protected] © Syntropy Limited 2001 All rights reserved Agenda • Aspects of a component • A process for component specification • Implications for the UML email:[email protected] Syntropy Limited Aspects of a component email:[email protected] Syntropy Limited UML Unified Modeling Language • The UML is a standardised language for describing the structure and behaviour of things • UML emerged from the world of object- oriented programming • UML has a set of notations, mostly graphical • There are tools that support some parts of the UML email:[email protected] Syntropy Limited Aspects of an Object ! Specification unit 1..* Interface ! Implementation unit * realization Class 0..1 Class Implementation source 1 1 instance * Object ! Execution unit email:[email protected] Syntropy Limited Components in context ObjectObject Evolving PrinciplesPrinciples developed within adopted by adopted by 1989- 1967- RMI 1995- Smalltalk EJB DistributedDistributed Object-orientedObject-oriented ObjectObject ComponentsComponents Programming DCOM Programming TechnologyTechnology C++ Java CORBA COM+/.NET Only interoperable Typically language neutral. A way of packaging within the language. Multiple address spaces. object implementations Single address space Non-integrated services to ease their use. Integrated services email:[email protected] Syntropy Limited Component standard features • Component Model: – defined set of services that support the software – set of rules that must be obeyed in order
    [Show full text]
  • Metaclasses and Zoning Mechanism Applied to Handwriting Recognition
    Journal of Universal Computer Science, vol. 14, no. 2 (2008), 211-223 submitted: 15/6/07, accepted: 17/11/07, appeared: 28/1/08 © J.UCS Metaclasses and Zoning Mechanism Applied to Handwriting Recognition Cinthia O. A. Freitas (Pontifical Catholic University of Paraná – PUCPR, Brazil [email protected]) Luiz S. Oliveira (Pontifical Catholic University of Paraná – PUCPR, Brazil [email protected]) Simone B. K. Aires (Technological Federal University of Paraná - UTFPR-PG, Brazil [email protected]) Flávio Bortolozzi (OPET College, Brazil [email protected]) Abstract: The contribution of this paper is twofold. First we investigate the use of the confusion matrices in order to get some insight to better define perceptual zoning for character recognition. The features considered in this work are based on concavities/convexities deficiencies, which are obtained by labelling the background pixels of the input image. Four different perceptual zoning (symmetrical and non-symmetrical) are discussed. Experiments show that this mechanism of zoning could be considered as a reasonable alternative to exhaustive search algorithms. The second contribution is a methodology to define metaclasses for the problem of handwritten character recognition. The proposed approach is based on the disagreement among the characters and it uses Euclidean distance computed between the confusion matrices. Through comprehensive experiments we demonstrate that the use of metaclasses can improve the performance of the system. Keywords: Character Recognition, Zoning Mechanism, Feature Measurement, Metaclasses, Confusion Matrix Categories: I.4.7, I.5.2 1 Introduction The handwriting character recognition is a special subject and has become important as ICR systems (Intelligent Character Recognition) become more powerful and commercially available.
    [Show full text]