Formal Concept Analysis

Formal Concept Analysis

Formal Concept Analysis Part I Radim BELOHLˇ AVEK´ Dept. Computer Science Palacky University, Olomouc [email protected] Radim Belohlavek (UP Olomouc) Formal Concept Analysis 2011 1 / 111 Introduction to Formal Concept Analysis (FCA) Radim Belohlavek (UP Olomouc) Formal Concept Analysis 2011 2 / 111 Introduction to Formal Concept Analysis { Formal Concept Analysis (FCA) = method of analysis of tabular data (Rudolf Wille, TU Darmstadt), { alternatively called: concept data analysis, concept lattices, Galois lattices, . { used for data mining, knowledge discovery, preprocessing data { input: objects (rows) × attributes (columns) table y1 y2 y3 y1 y2 y3 x1 1 1 1 x1 XXX 1 1 1 x2 1 0 1 or x2 XX or 1 0 1 0 1 1 x3 0 1 1 x3 XX ... ... ... ... Radim Belohlavek (UP Olomouc) Formal Concept Analysis 2011 3 / 111 Introduction to Formal Concept Analysis { output: 1 hierarchically ordered collection of clusters: { called concept lattice, { clusters are called formal concepts, { hierarchy = subconcept-superconcept, 2 data dependencies: { called attribute implications, { not all (would be redundant), only representative set Radim Belohlavek (UP Olomouc) Formal Concept Analysis 2011 4 / 111 Output 1: Concept Lattices input data: output concept lattice: y1 y2 y3 x1 XXX x2 XX x3 XX concept lattice = hierarchically ordered set of clusters cluster (formal concept) = hA; Bi, A = collection of objects covered by cluster, B = collection of attributes covered by cluster, example of formal concept: hfx1; x2g; fy1; y3gi, clusters = nodes in the Hasse diagram, Hasse diagram = represents partial order given by subconcept-superconcept hierarchy concept lattice = all potentially interesting concepts in data Radim Belohlavek (UP Olomouc) Formal Concept Analysis 2011 5 / 111 Output 2: Attribute Implications input data: attribute implications: y1 y2 y3 A ) B like x1 XXX fy2g ) fy3g, fy1; y2g ) fy3g, x2 XX but not fy1g ) fy2g, x3 XX attribute implication = particular data dependency, large number of attribute implications may be valid in given data, some of them redundant and thus not interesting (fy2g ) fy2g), reasonably small non-redundant set of attribute dependencies (non-redundant basis), connections to other types of data dependencies (functional dependencies from relational databases, association rules). Radim Belohlavek (UP Olomouc) Formal Concept Analysis 2011 6 / 111 History of FCA Port-Royal logic (traditional logic): formal notion of concept Arnauld A., Nicole P.: La logique ou l'art de penser, 1662 (Logic Or The Art Of Thinking, CUP, 2003): concept = extent (objects) + intent (attributes) G. Birkhoff (1940s): work on lattices and related mathematical structures, emphasizes applicational aspects of lattices in data analysis. Barbut M., Monjardet B.: Ordre et classification, algbre et combinatoire. Hachette, Paris, 1970. Wille R.: Restructuring lattice theory: an approach based on hierarchies of concepts. In: I. Rival (Ed.): Ordered Sets. Reidel, Dordrecht, 1982, pp. 445{470. Radim Belohlavek (UP Olomouc) Formal Concept Analysis 2011 7 / 111 Literature on FCA books Ganter B., Wille R.: Formal Concept Analysis. Springer, 1999. Carpineto C., Romano G.: Concept Data Analysis. Wiley, 2004. conferences ICFCA (Int. Conference of Formal Concept Analysis), Springer LNCS, http://www.isima.fr/icfca07/ CLA (Concept Lattices and Their Applications), http://www.lirmm.fr/cla07/index.htm ICCS (Int. Conference on Conceptual Structures), Springer LNCS, http://www.iccs.info/ conferences with focus on data analysis, information sciences, etc. web keywords: formal concept analysis, concept lattice, attribute implication, concept data analysis, Galois lattice Radim Belohlavek (UP Olomouc) Formal Concept Analysis 2011 8 / 111 Selected Applications of FCA clustering and classification (conceptual clustering), information retrieval, knowledge extraction (structured view on data, structured browsing), machine learning, software engineering G. Snelting, F. Tip: Understanding class hierarchies using concept analysis. ACM Trans. Program. Lang. Syst. 22(3):540{582, May 2000. U. Dekel, Y. Gill: Visualizing class interfaces with formal concept analysis. In OOPSLA'03, pp. 288{289, Anaheim, CA, October 2003. preprocessing method: e.g., Zaki M.: Mining non-redundant association rules. Data Mining and Knowl. Disc. 9(2004), 223{248. closed frequent itemsets instead of frequent itemsets ) non-redundant association rules (<< number) mathematics (new results in math. structures related to FCA) Radim Belohlavek (UP Olomouc) Formal Concept Analysis 2011 9 / 111 State of the art of FCA Ganter, B., Stumme, G., Wille, R. (Eds.): Formal Concept Analysis Foundations and Applications. Springer, LNCS 3626, 2005, development of theoretical foundations, development of algorithms, applications: increasingly popular (information retrieval, software engineering, social networks, . ), FCA as method of data preprocessing, interaction with other methods of data analysis, several software packages available. Radim Belohlavek (UP Olomouc) Formal Concept Analysis 2011 10 / 111 Concept Lattices Radim Belohlavek (UP Olomouc) Formal Concept Analysis 2011 11 / 111 What is a concept? central notion in FCA = formal concept but what is a concept? many approaches, including: psychology (approaches: classical, prototype, exemplar, knowledge) Murphy G. L.: The Big Book of Concepts. MIT Press, 2004. Margolis E., Laurence S.: Concepts: Core Readings. MIT Press, 1999. logic (rare, but Transparent Intensional Logic) Tichy P.: The Foundations of Frege's Logic. W. De Gryuter, 1988. Materna P.: Conceptual Systems. Logos Verlag, Berlin, 2004. artificial intelligence (frames, learning of concepts) Michalski, R. S., Bratko, I. and Kubat, M. (Eds.), Machine Learning and Data Mining: Methods and Applications, London, Wiley, 1998. conceptual graphs (Sowa) Sowa J. F.: Knowledge Representation: Logical, Philosophical, and Computational Foundations. Course Technology, 1999. \conceptual modeling", object-oriented paradigm, . traditional/Port-Royal logic Radim Belohlavek (UP Olomouc) Formal Concept Analysis 2011 12 / 111 Traditional (Port-Royal) view on concepts The notion of a concept as used in FCA | inspired by Port-Royal logic (traditional logic): Arnauld A., Nicole P.: La logique ou l'art de penser, 1662 (Logic Or The Art Of Thinking, CUP, 2003): concept (according to Port-Royal) := extent + intent extent = objects covered by concept intent = attributes covered by concept example: DOG (extent = collection of all dogs (foxhound, poodle, . ), intent = fbarks, has four limbs, has tail,. g) concept hierarchy subconcept/superconcept relation DOG ≤ MAMMAL ≤ ANIMAL concept1=(extent1,intent1) ≤ concept2=(extent2,intent2) , extent1 ⊆ extent2 (, intent1 ⊇ intent2) Radim Belohlavek (UP Olomouc) Formal Concept Analysis 2011 13 / 111 Formal Contexts (Tables With Binary Attributes) Definition (formal context (table with binary attributes)) A formal context is a triplet hX ; Y ; I i where X and Y are non-empty sets and I is a binary relation between X and Y , i.e., I ⊆ X × Y . interpretation: X . set of objects, Y . set of attributes, hx; yi 2 I . object x has attribute y formal context can be represented by table (table with binary attributes) hx; yi 2 I ... ×in table, hx; yi 62 I . blank in table, I y1 y2 y3 y4 x1 × × × × x2 × × × x3 × × × x4 × × × x5 × Radim Belohlavek (UP Olomouc) Formal Concept Analysis 2011 14 / 111 Concept-forming Operators " and # Definition (concept-forming operators) For a formal context hX ; Y ; I i, operators " : 2X ! 2Y and # : 2Y ! 2X are defined for every A ⊆ X and B ⊆ Y by A" = fy 2 Y j for each x 2 A : hx; yi 2 I g; B# = fx 2 X j for each y 2 B : hx; yi 2 I g: operator ": assigns subsets of Y to subsets of X , A" . set of all attributes shared by all objects from A, operator #: assigns subsets of X to subsets of Y , B" . set of all objects sharing all attributes from B. To emphasize that " and # are induced by hX ; Y ; I i, we use "I and #I . Radim Belohlavek (UP Olomouc) Formal Concept Analysis 2011 15 / 111 Concept-forming Operators " and # Example (concept-forming operators) For table I y1 y2 y3 y4 x1 × × × × x2 × × × x3 × × × x4 × × × x5 × we have: " " fx2g = fy1; y3; y4g, fx2; x3g = fy3; y4g, " fx1; x4; x5g = ;, X " = ;, ;" = Y , # # fy1g = fx1; x2; x5g, fy1; y2g = fx1g, # # fy2; y3g = fx1; x3; x4g, fy2; y3; y4g = fx1; x3; x4g, # # ; = X , Y = fx1g: Radim Belohlavek (UP Olomouc) Formal Concept Analysis 2011 16 / 111 Formal Concepts Definition (formal concept) A formal concept in hX ; Y ; I i is a pair hA; Bi of A ⊆ X and B ⊆ Y such that A" = B and B# = A. A . extent of hA; Bi, B . extent of hA; Bi, verbal description: hA; Bi is a formal concept iff A contains just objects sharing all attributes from B and B contains just attributes shared by all objects from A, mathematical description: hA; Bi is a formal concept iff hA; Bi is a fixpoint of h"; #i. Radim Belohlavek (UP Olomouc) Formal Concept Analysis 2011 17 / 111 Formal Concepts Example (formal concept) For table I y1 y2 y3 y4 x1 × × × × x2 × × × x3 × × × x4 × × × x5 × the highlighted rectangle represents formal concept hA1; B1i = hfx1; x2; x3; x4g; fy3; y4gi because " fx1; x2; x3; x4g = fy3; y4g, # fy3; y4g = fx1; x2; x3; x4g. Radim Belohlavek (UP Olomouc) Formal Concept Analysis 2011 18 / 111 Example (formal concept (cntd.)) But there are further formal concepts: I y1 y2 y3 y4 I y1 y2 y3 y4 x1 × × × × x1 ×× × × x2 × × × x2 × × × x3 × × × x3 × × × x4 × × × x4 × × × x5 × x5 × I y1 y2 y3 y4 x1 ×××× x2 × ×× x3 × × × x4 × × × x5 × i.e., hA2; B2i = hfx1;

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    111 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us