<<

Describing

Claire Gardent Bonnie Webber Computational Computer and Science University of the Saarland University of Pennsylvania Saarbrücken, Gerrnany Philadelphia PA 19104-6389 USA [email protected] [email protected]

Descriptions. In recent years, both formal and (2) cdn(A,B) computational linguistics have been exploiting de­ ~ A B scriptions of structures where previously the struc­ 1 1 tures themselves were used. 1 a b Tue practice started with (Marcus et al., 1983), who demonstrated the of (syntactic) tree de­ scriptions for near-deterministic incremental pars­ The dashed lines indicate domination, the plain ing. Vijay-Shankar (Vijay-Shankar and Joshi, 1988; lines immediate domination. Labels on the nodes Vijay-Shankar, 1992) used descriptions to main­ are first-order terms abbreviating their associated tain the monotonicity of syntactic derivations in the semantic infonnation. Capital letters indicate vari­ framework ofFeature-Based Tree Adjoining Gram­ ables, lower letters indicate constants, and shared mar. In semantics, both (Muskens, 1997) and (Egg variables indicate re-entrancy. Whenever two node et al., 1997) have shown the value of descriptions as descriptions are identified and taken to refer to the an underspecified representation of ambigui­ same node, their labels must unify. ties. Tue description licenses a local tree whose root Tue current paper further extends the use of semantics is CDN(A,B), where A and B are the descriptions, from individual sentences to dis­ semantics of nodes dominating the nodes whose course, showing their benefit for incremental, semantics is a and b, respectively. Intuitively, A near-detenninistic discourse processing. In partic­ and B represent the final of the CON­ ular, we show that using descriptions to desc~be DITIONA L relation, whereas a and b stand for its the semantic representation of discourse penn1ts: current arguments. 1 Formally, A/a and B/b nodes (1) a monotone treatment of local ; (2) are quasi-nodes in the sense of Vijay-Shankar: they a detenninistic treatment of global ambiguity; and are related by dominance and therefore can (but (3) a distinction to be made between "simple" local need not) be identical. ambiguity and "garden-path" local ambiguity. Local· ambiguity. As (Marcus et aI.; 1983) has Discourse descriptions. Suppose we have the dis­ noted, descriptions facilitate a near-deterministic course: treatment of local attachment in incre­ mental parsing. This is also true at the discou!"Se (1) a. Jon. and Mary only go to the cinema level. For instance ( 1) can be continued in two b. when an Islandic film is playing ways: additional discourse material can "close off" the scope of the relation On hearing the second , the hearer infers a CONDITIONAL relation (CDN) to hold between the (3) a. so they rarely go. event partially specified in (la) and the event par­ b. Semantics: cause(cdn(a,b),c) tially specified in (1 b). We associate this with the following description of structure and semantics: Ioescriptions can be formulated more precisely. using tree (Vijay-Shankar, 1992). For this paper howevcr, we will use a graphic presentation, as in (2) above, which is easier to rcad than conjunclions of logical formulae.

50 or it can extend it: In summary, dominance permits underspecifying the syntactic link between nodes, while seman­ (4) a. or the film got a good review in The tically, quasi-nodes permits underspecifying the Nation. arguments of discourse relations. In both cases, b. Semantics: cdn(a,or(b,c)) monotonicity is preserved by manipulating descrip­ By using descriptions of trees rather than the trees tions of trees rather than the trees themselves. themselves, we have a representation which is com­ patible with both : In the first case GlobaJ ambiguity Discourse exhibits global scope ( 3), addition of the third clause will ambiguities in much the same way sentences do: Iead the hearer to infer a CAUSE relation to hold be­ (6) a. I try to read a tween (1) and (3). This extends the description in b. if I feel bored (2) to: c. or 1 am unhappy. (5) cause(C,Dn This discourse means either that the speaker tries to read a novel under one of two conditions (boredom ~ Ds ------1 or unhappiness), or that the speaker is unhappy if 1 cdn(A,8)3 CtJ s/he can't read a novel when bored. Discourse-level scope ambiguities can be captured as in (Muskens, 1997) by leaving the structural relations holding be­ ------tween scope bearing elements underspecified. For example, the (ambiguous) structure and semantics of (6) can be captured in the description:

By , if (1) is continued with (4), the initial A ifß3 description is expanded to: ~ A1 fü 1 ... 1 DJ bs

In the absence of additional information (i.e. when the respective scope of the discourse relations remains unspecified), no additional constraints where OR stands for disjunction. Both descriptions come into play, so that not one but two trees satisfy are compatible with the initial description (2) and the description: one with root semantics a if (b or both descriptions can be further constrained to yield c) and the with root semantics ( a if b) or c. the appropriate discourse semantics. Suppose that no further material is added: now the scope of the becomes known. Defaults, underspecification aml preferences. We This in turn licenses node identifications which con­ assume that a cognitive model of incremental dis­ flate final and current arguments. So if the discourse course processing should distinguish between those ends in (4), then node 6 is identified with node 5, cases of "simple" local ambiguity which do not trig­ fixing to b the left-hand of the OR rela­ ger repair when they are resolved by information tion. Similarly, nodes 8 and 9 can be identified, later in the discourse and those cases of "garden thereby fixing the right-hand argument to c'. Given path" ambiguity that do. these additionai constraints, the minimai tree struc­ No\'.' there is a ccntinuum of ways to dea! "eco­ ture which satisfies the resulting description is: nomically" with local ambiguity, without generat­ ing all the possible readings. At one end is a pure cdn(n.or(b,c))3 default approach, commiting to one reading and dis­ carding the others. At the other end is a pure under­ a1.1 or(b,c)4,7 ------specifzcation approach, with a single compact repre­ sentation of all possible readings but no indication ------of the reading of the text so far.

51 Neither of these "pure" approaches suffices to (8) a. The horse raced past the bam. distinguish simple local ambiguity from garden path b. The horse raced past the bam fell ambiguity in either sentence-level processing or dis­ course. While defaults can be subsequently overrid­ - is a matter of choosing whether "raced" takes den, there is no between overriding a sim­ "the horse" as its argument or whether it acts as ple local ambiguity and overriding a garden path. a modifier of "the horse" (in distinguisl.ing this On the other band, underspecification, which does horse from other ones, cf. (Crain and Steedman, not "commit" to any specific choice, provides no 1985)). This ambiguity cannot be captured by indication of the reading of the text so far and thus domination underspecification. As such, it can again, no way of distinguishing simple local ambi­ only be represented as a (disjuctive) alternative, a guity from cases where the reader garden-paths. matter of non-deterministic or preferential choice. However, in between these poJls are approaches If the choice is incorrect, revision is required, thus that combine features of both. One that seems providing a way of making the desired distinction able to meet the cognitive criteria given above between simple and garden-path Iocal ambiguity. is an approach that combines underspecification with a preference system that highlights a specific Biases in choosing a preferred reading. In any reading corresponding to the hearer's currently abductive process, there are many ways of explain­ preferred interpretation. Such a proposal was ing the given data, and biases are used to identify suggested in (Marcus et al„ 1983), and is the one one that is preferred. For example, in plan recog­ we are currently exploring for discourse. The two nition (identifying the structure of goals and sub­ aspects of the approach we want to discuss here goals that give rise to what is usually taken to be a are: (1) partial underspecification, and (2) biases in sequence of observed actions), Kautz (Kautz, 1990) choosing a preferred reading. suggested a "goal minimization" bias that preferred a tree with the fewest goals (non-terminal nodes) Partial Underspecification. Tue degree of under­ able to "explain" the sequence of actions. Where specification in a description is usually only par­ goal minimization is known to produce the wrong tial: there is always something that it commits to. , some other bias is needed to yield the For example, while underspecifying domination, one that is preferred (Gertner and Webber, I 996). the structural descriptions used above still rigidly Similarly, in associating a preferred reading with distinguish each branch of a tree from its sisters. a compact underspecified representation, (Marcus et Similarly, while allowing underspecification in each al., 1983) proposed a bias towards a tree that min­ individual argument to a , the descriptions imised the dominance relation. That is, if two node used here still rigidly distinguish one argument from stand in a dominance relation, they are taken another. We take this to be a "feature" with respect to refer to one and the same node, provided nothing to making a cognitive distinction between simple lo­ rules it out. Ofcourse, such a "min.dom" bias might cal ambiguity and local ambiguity that leads to gar­ yield several trees, each of which are equally mini­ den paths. mal. Typically, this is true of global ambiguities as In particular, we associate simple local ambiguity in (6) above, where dominance can be minimised by with domination underspecification, whether it be at identifying node 5 either with node 4 or with node the sentence-level or in discourse: the local ambi­ 6, each .move resulting in an equally minimal tree. guity associated with "my aunt" after processing "1 An alternative bias combines "min.dom" with saw my aunt ... " - whether it continues "right-association" (Frazier, 1995; Chen and Vijay­ Shankar, 1997), yielding a preference for a structure (7) a. I saw my aunt. in which the incoming unit attaches "low in the tree" b. I sa\v my aunt's cat. and c•m be obtained by minimising the most recent c. 1 saw my aunt's cat's litter box. dominance link. In example 6, this means identify­ ing node 6 with node 5 first so that the default read­ - is purely a matter of how the domination relation ing in this case is the reading where if scopes over eventually resolves itself. or. On the other hand, the ambiguity associated with Other biases are possible: Crain and Steedman "raced" after processing "Tue horse raced .„" - (Crain and Steedman, 1985) argue for a pref­ whether it continues erence for referring forms that distinguish one

52 already-evoked (discourse) entity from possible thank Mark Steedman and Aravind Joshi for com­ alternatives. We believe it is worth exploring what ments and suggestions. An early draft of this pa­ bias best models the preferences people have in per was presented at the Workshop on Underspeci­ discourse interpretation, and how it resembles their fication, Bad Teinach, Germany, May 1998. Claire preference at the sentence level. Gardent is grateful to the Deutsche Forschungsge­ sellschaft for financial support within the SFB 378, Comparison with related work. A related C2. approach to discourse structure and semantics is presented in (Webber and Joshi, 1998), where Lexicalised Tree Adjoining Grammar (LTAG) is N. Asher. 1993. to abstract objects in dis­ used to construct the compositional semantics of course. Kluwer, Dordrecht. discourse. Although the basic structures used here J. Chen and K. Vijay-Shankar. 1997. Towards a Re­ are different, we foresee no difficulty in modifiying duced Commitment, D-Theory Style TAG Parser them in order to integrate the additional information Proc. Int'l Workshop on Parsing Technologies. included in the LTAG discourse trees. Essentially, S. Crain and M. Steedman. 1985. On not being led up the garden path: Tue use of by the psycho­ the atomic labels representing the relations should logical parser. In D. Dowty, L. Kartunnen and A. be mapped into the feature structures used in Zwicky (eds.), Natural Parsing: Psycho­ (Webber and Joshi, 1998) and this information used logical, computational and Theoretical Perspectives, to labe! not the root node of a local tree but its Cambridge University Press. anchor. Second, the LTAG approach has focussed M. Egg, J. Niehren, P. Ruhrberg and F. Xu 1998. Con­ on describing the compositional semantics of straints over Lambda Structures in Semantic Under­ discourse - that is, the semantics explicitely given specification Proc. COLING/ACL, Montreal CA. by the text (as opposed to what can be inferred). In L. Frazier. 1995. Issues ofRepresentation in Psycholin­ contrast, the present approach does not differentiale guistics. In J.L. Miller and P.D. Eimas, eds., Speech, between compositional and inferred semantics, Languageand . Academic Press, New York. though again, the difference does not seem essential A. Gertner and B. Webber. 1996. A Bias towards Rel­ as the description based approach could be either evance: Recognizing plans where goal minimization extended to explicitely distinguish (e.g. by means fails. Proc. 1996 National Conference on Anificial In­ of features) between compositional and inferential te/ligence (AAA/-96), Portland OR, pp. 1133-1138. information, or restricted to describe those aspects J. Hobbs. and Cognition. CSLI Lecture of discourse semantics that are compositional. Notes, Number 21. 1990. Third, the LTAG proposal does not address the H. Kautz. A circumscriptive theory of plan recognition. of the current approach - incrementality and In Jerry Morgan Philip R. Cohen and Martha E. Pol­ underspecification. On the whole then, the two sys­ lack, editors, /lltentions in Communication. Bradford tems are complementary rather than antagonistic. Books, 1990. W. C. Mann and Sandra A. Thompson. Rhetorical struc­ ture theory. Technical Report RS-87-190, USC/ISI, Conclusion. We have argued that a technique 1987. developed to handle well-known problems in M. P. Marcus, D. Hindle, and M.M. Fleck. 1983. Talk­ sentence processing can also benefit the processing ing about talking about trees. In Proceedings ofACL, of monologic discourse. First, it provides a well­ Cambridge, MA. defined framework for monotonically describing R. Muskens. 1997. Order-independenceand uriderspec­ the incremental construction of discourse seman­ ification. University of Ti Iburg. tics. This departs from approaches in the discourse L. Vijay-Shankar and A. Joshi. 1988. Feature based literature which give up either monotonicity (Asher, tags. Proceedings of ACL, Budapest. 1993) or incrementality (Hob90; MT87). Second, K Vijay-Shankar. 1992. Using descriptions of trees in a it has a well-understood formal basis in tree logic. tree-adjoining grammar. Computational Linguisrics. Third, it permits a clear-cut distinction between B. Webber and A. Joshi 1998. Anchoring a Jexicalized local ambiguities that lead the hearer down the Tree-Adjoining Grammar for discourse. Proceedings garden path and those that don 't. of COLING!ACL workshop 011 Discourse Relations and Discourse Markers, Montreal CA. Acknowledgements. Tue authors would Iike to

53