Statistical Relational AI: Papers from the AAAI-14 Workshop
Reasoning in the Description Logic BEL Using Bayesian Networks
Ismail˙ Ilkan˙ Ceylan⇤ Rafael Penaloza˜ † Theoretical Computer Science Theoretical Computer Science, TU Dresden, Germany TU Dresden, Germany Center for Advancing Electronics Dresden [email protected] [email protected]
Abstract 2 Proof Structures in EL is a light-weight DL that allows for polynomial-time rea- We study the problem of reasoning in the probabilistic De- EL scription Logic . Using a novel structure, we show that soning. It is based on concepts and roles, corresponding to probabilistic reasoningBEL in this logic can be reduced in poly- unary and binary predicates from first-order logic, respec- nomial time to standard inferences over a Bayesian network. tively. Formally, let NC and NR be disjoint sets of concept This reduction provides tight complexity bounds for proba- names and role names, respectively. concepts are defined bilistic reasoning in . through the syntactic rule C ::= A EL C C r.C, where BEL | > | u | 9 A NC and r NR. The2 semantics2 of is given in terms of an interpre- 1 Introduction EL tation =( I , I ) where I is a non-empty domain Description Logics (DLs) (Baader et al. 2007) are a fam- I · and I is an interpretation function that maps every con- ily of knowledge representation formalisms tailored towards · cept name A to a set AI I and every role name the representation of terminological knowledge in a formal ✓ r to a set of binary relations rI I I . The manner. In their classical form, DLs are unable to handle ✓ ⇥ interpretation function I is extended to concepts by the inherent uncertainty of many application domains. To · EL defining I := I , (C D)I := CI DI , and overcome this issue, several probabilistic extensions of DLs > u \ ( r.C)I := d I e I :(d, e) rI e CI . have been proposed. The choice of a specific probabilistic 9The knowledge{ 2 of a| 9 domain2 is represented2 ^ through2 a} set DL over others depends on the intended application; these of axioms restricting the interpretation of the concepts. logics differ in their logical expressivity, their semantics, and their independence assumptions. Definition 1 (TBox). A general concept inclusion (GCI) is an expression of the form C D, where C, D are con- Recently, the DL (Ceylan and Penaloza˜ 2014) was v introduced as a meansBEL of describing certain knowledge that cepts. A TBox is a finite set of GCIs. The signature of (sig( )) is the setT of concept and role names appearing in T. depends on an uncertain context, expressed by a Bayesian T T network (BN). An interesting property of this logic is that An interpretation satisfies the GCI C D iff CI DI ; is a model of theI TBox iff it satisfiesv all the GCIs✓ in . reasoning can be decoupled between the logical part and I T T the BN inferences. However, despite the logical component The main reasoning service in is subsumption check- of this logic being decidable in polynomial time, the best ing, i.e., deciding the sub-conceptEL relations between given known algorithm for probabilistic reasoning in runs in BEL concepts based on their semantic definitions. A concept exponential time. C is subsumed by D w.r.t. the TBox ( = C D) T T| v In this paper we use a novel structure, called the proof iff CI DI for all models of . It has been structure, to reduce probabilistic reasoning for a ✓ I T BEL shown that subsumption can be decided in in poly- knowledge base to probabilistic inferences in a BN. In a nut- nomial time by a completion algorithm (Baader,EL Brandt, shell, a proof structure describes the class of contexts that and Lutz 2005). This algorithm requires the TBox to entail the wanted consequence. A BN can be constructed to be in normal form; i.e., where all GCIs are one of the compute the probability of these contexts, which yields the forms A B A B C A r.B r.B A. It probability of the entailment. Since this reduction can be is well knownv that| everyu v TBox| canv9 be transformed| 9 v into an done in polynomial time, it provides tight upper bounds for equivalent one in normal form of linear size (Brandt 2004; the complexity of reasoning in . BEL Baader, Brandt, and Lutz 2005); for the rest of this paper, we assume that is a TBox in normal form and GCI denotes a ⇤Supported by DFG in the Research Training Group “RoSI” T (GRK 1907). normalized subsumption relation. In this paper, we are interested in deriving the GCIs in †Partially supported by DFG within the Cluster of Excellence normal form that follow from ; i.e. the normalised logical ‘cfAED’. T Copyright c 2014, Association for the Advancement of Artificial closure of . We introduce the deduction rules shown in Ta- Intelligence (www.aaai.org). All rights reserved. ble 1 to produceT the normalised logical closure of a TBox.
15 Table 1: Deduction rules for . EL Algorithm 1 Construction of the pruned proof structure Input: TBox Premises (S) Result (↵) Output: H =(T W, F ) 7! A B B C A C 1 , 1: V , E , i 0 2 hA v r.Bi h , vB ihC A v r.Ci 0 0 2: do T ; hA v9r.Bi hC v AihC v9r.Bi 3 , 3: i i +1 4 h r.Av9 Bi, hB v Cihr.Av9 C i h9 v i h v ih9v i 4: Vi := Vi 1 ↵ S ↵,S Vi 1 5 r.A B , C A r.C B [ { | 7! ✓ } h9 v i h v ih9v i 5: Ei = (S, ↵) S ↵,S Vi 1 6 r.A B , B r.C A C { | 7! ✓ } h9 v i h v9 ihv i 6: while Vi = Vi 1 or Ei = Ei 1 A r.B r.B C A C 7 , 7: W := (↵6 ,k) ↵ V ,6 0 k i hA v9B Ci h9C vX ihA vB i X k 8 , 8: E := {(S, (↵,k| )) 2(S, ↵) E, 0} k i 9 hA u B v Ci, hX v AihX u B v Ci { | 2 k } [ h u v i h v ihu v i 9: ( (↵,k 1) , (↵,k)) ↵ Vk 1, 0 k i 10 A B C , X B A X C 10: return{({W, E) } | 2 } 11 hX u Xv Ci h v ihX u Cv i h u v ihv i
Each rule maps a set of premises to a GCI that is implicitly der to avoid cycles. In this case, nodes are pairs of GCIs and labels, where the latter indicates to which level the nodes be- encoded in the premises. It is easy to see that the sets of i premises cover all pairwise combinations of GCIs in normal long in the hypergraph. We write S = (↵,i) ↵ S to denote the i-labeled set of GCIs in S. Let{ n :=| sig2( }) 3, form and that the deduction rules produce the normalised | T | logical closure of a TBox. Moreover, the given deduction we start with the set W0 := (↵, 0) ↵ and define the levels 0 i