<<

Making Sense of to the Unfamiliar



Helen Seville and Allan Ramsay

Centre for Computational Linguistics

UMIST, PO Box 88, Manchester M60 1QD, England

heleng/[email protected] c.uk

Abstract they must nevertheless denote individuals famil-

iar to conversants if they are successfully to re-

Computational approaches to reference resolu-

fer. However, there is another class of referring

tion, like Centering Theory, are b est at resolv-

expressions in relation to which we b elieve the

ing referring expressions which denote familiar

concept of uniqueness of meaning do es havean

referents. We demonstrate how, by taking a

essential role to play. These include such def-

pro of-theoretic approach to reference resolution

inite descriptions as \the rst man" and \the

within a Centering-typ e framework, we are able

rst snowdrop of Spring", along with suchvari-

to make sense of referring expressions for un-

ations on these as \the rst three men" and \the

familiar referents. These include, in addition

rst snowdrops of Spring".

to bridging descriptions, de nite descriptions

In implementing a system of reference resolu-

like \the rst man" and \the rst snowdrops of

tion, wehave attempted to reconcile the notions

Spring". We claim that the rst of these denotes

of familiarity and uniqueness. This enables us

a unique subset of a plural antecedent.

to dereference expressions like \the rst snow-

While the second has no discourse antecedent,

drop of Spring" in a uni ed framework alongside

we similarly treat it as denoting a unique subset

1

anaphors , pronouns, referential tenses, names,

of a familiar referent.

and other de nite descriptions like \the man".

1 Intro duction

1 Two men arrived.

How do referring expressions denote? Accord-

2 The rst man sp oke.

ing to Russell, a de nite description such as

\the King of France", denotes a unique individ-

In the case of a referring expression like \the

ual by virtue of its meaning. But, according to

rst man", there maybeanantecedent of sorts

Familiarity Theory Heim, 1983, referring ex-

in the discourse, but it is not the individual re-

pressions need not denote uniquely by virtue of

ferred to or indeed an individual at all. We

their meaning as they refer to individuals made

will say that the antecedent \two men" intro-

familiar by the discourse or other . This

duces a set, and that the referring expression

observation plays a key role in Centering The-

\the rst man" denotes, by virtue of the mean-

ory Grosz and Sidner, 1986; Grosz et al., 1995

ing of rst, a unique subset of this familiar set.

and other computational approaches in which

referring expressions are resolved by lo cating

1 Mary saw the rst snowdrop of

their antecedents in the discourse. The refer-

Spring.

ence of pronouns like \he", de nite descriptions

like \the woman", and referential tenses like

In the case of \the rst snowdrop of Spring",

\had" clearly has more to do with salience in

there need b e no explicit antecedent in the dis-

context than with uniqueness of meaning. Sim-

course. We will say that, in the same way

ilarly, while names like \Mary" need not denote

that \Mary" denotes a familiar individual, \the

individuals prominent in the discourse context,

snowdrops of Spring" denotes a familiar set, or

 1

Wewould like to thank the anonymous reviewers for We use this term to distinguish re exives like \her-

their detailed and helpful comments. self " from pronouns like \he" and \him".

Given that many referring expressions do not in property. Again, by virtue of the meaning of

themselves denote uniquely, however, we need rst, \the rst snowdrop of Spring" can b e said

a theory of reference resolution to enable us to denote a unique subset of the familiar set. We

to obtain the appropriate i.e., intended  ref- will not claim that it denotes a unique individ-

erent for any referring expression. We incorp o- ual, but that rather it denotes a unique subset

rate our theory of reference resolution into the of the sp eci ed cardinality, i.e., 1. This treat-

actual representation of referring expressions; ment has the advantage that it extends to plural

for example, we lab el anaphors with the prop- referring expressions.

erty \salient" and pronouns and also referential Below we outline the approach we have de-

3

tenses with the prop erty \centred" : velop ed to the representation and resolution of

referring expressions, b efore discussing in more

\himself "

detail its to deal with unfamiliar ref-

ref X sal ientX; r ef D cdsD &mX 

erents.

\she"

2 A Framework for Reference

ref X centr edX; r ef D cdsD &f X 

Resolution

Reference resolution relies on maintaining, as

Our framework for reference resolution has b een

in Centering Theory, a list of forward-lo oking

implemented in the system of language under-

centres for each discourse state corresp onding

standing describ ed in Ramsay, 1999. The

to an utterance in the discourse. Furthermore,

starting p oint for reference resolution is the log-

for the purp oses of reference resolution, the dis-

ical form we obtain from parsing. For example,

course states themselves are organized into a

the following is the logical form we get for the

discourse tree, which is constructed automati-

utterance \Mary slept."

4

cally based on referential cues , as describ ed in

Seville, 1999.

9A : fA is inter v al &

ends bef or eref B speech timeB; 1;Ag

0 1 a man died in a park .

i j

k

9C : faspectsimpl e; A; C g

j 2 he had b een sleeping there .

i j

k

C ; ag ent; r ef D namedD; M ary

1 3awoman loved him .

m i

l

& car dD; 1

/ n 4 she had hated him .

m i

l

& sl eepC 

2 3 5 he had hated himself .

i m i

& C is ev ent

/ jn 6 he had loved her .

i m

l

45 6

We use the inference engine describ ed in

Ramsay and Seville, 2000 to up date the dis-

The no des in such a tree corresp ond to dis-

course mo del with a new discourse state con-

course states. Those on the right-hand frontier

taining the information explicitly represented in

are open, which essentially means that the enti-

the logical form together with any further infer-

ties mentioned in them are available to pronom-

ences which are licensed given the existing dis-

inal reference.

course mo del. Reference resolution, which in-

The pro cess of reference resolution for the

volves carrying out a pro of that a referring ex-

various referring expressions can be brie y de-

pression denotes, is implemented as part of the

scrib ed as follows. Anaphors, characterised as

up date step. We anchor a referring expression

salient, are resolved to a less oblique argument

like ref D namedD; M ary&car dD; 1 in

of the same verb Pollard and Sag, 1994 within

the discourse mo del by proving the existence of

the current discourse state, which is constructed

an entity in the mo del which satis es the prop-

3

erties sp eci ed by the referring expression, in

Here ref D cdsD  is a reference to the current

2

discourse state and the prop erties m and f refer to male

this case D namedD; M ary&car dD; 1 .

and female gender resp ectively.

2 4

Strictly sp eaking, it is a set which is denoted. For The tree illustrated was constructed using pronomi-

readability, our referring expressions con ate the prop- nal cues. Each discourse state was attached as a daugh-

erties of sets and their memb ers. In this case, the car- ter of the highest no de in the discourse tree to which all

dinality is a prop erty of the set denoted, but the name pronouns and referential tenses like had  mentioned in

Mary is a prop erty of its memb er. it could b e anchored.

ref D namedD; M ary&car dD; 1 as incrementally. We also start our search for the

the following were amongst the facts contained referents of pronouns and other centred enti-

in Discourse State 0: ties in the current discourse state, which is nec-

essary if we are to resolve such referring ex-

Discourse State 0

pressions as \her" in \Mary to ok John with

|||||{

her." However, referring expressions contain-

f emal e94

ing the prop erty centred are prevented from

named94; M ar y 

b eing dereferenced to salient entities, thus en-

w oman94

suring that the constraint of disjoint reference

f 94

is met. If we fail to nd the centred en-

car d94; 1

tity in the current discourse state, we search

adul t94

the previous op en no de and, if necessary, fur-

ther op en no des in the discourse tree, in order

These were generated from the lexical meaning

to deal with long-distance pronominalisation.

p ostulates we stipulated for \Mary", \woman",

The dereferencing of other referring expressions

and \female":

like ref D namedD; M ary&car dD; 1 is

similar but less constrained in that we con-

9X namedX; M ary &w omanX &car dX; 1

sider entities mentioned in al l no des mentioned

8X w omanX  !

in the discourse tree, whether op en or closed,

f emal eX &X is human&adul tX 

in order of recency. This means that, essen-

8X f emal eX  ! f X 

tially, names and de nite descriptions are deref-

3 Unfamiliar Referents

erenced to the most recently mentioned refer-

ent whichis appropriate. Unlike in the case of

In this section we showhow, within the frame-

pronouns, we also consider Discourse State 0,

work ab ove, we are able to make sense of a vari-

which do esn't corresp ond to an utterance but,

ety of referring expressions denoting unfamiliar

rather, contains the background knowledge as-

referents. The most straightforward of these are

sumed in the mo del. This is how we are able

bridging descriptions, so we start with these.

to deal with the rst mention of a familiar

3.1 Bridging Descriptions

referent like Mary assuming that the prop er-

1 Mary loves her mother.

ties D namedD; M ary&car dD; 1 suce

to distinguish a particular entity in Discourse

In this rst case, \her mother", contains a refer-

State 0 from all the others.

ring expression nested within it. Having deref-

Our approach extends naturally to cases like

erenced this, the knowledge that mother of is a

\the rst snowdrop of Spring" b ecause it is

function enables us to obtain a unique referent.

pro of-theoretic and so able to exploit back-

Our representation of the referring expression

ground knowledge in reference resolution. This

to b e dereferenced is as follows:

can b e illustrated, in the rst instance, by exam-

ining the background knowledge which is used

\her mother"

in up dating the utterance \Mary slept." The

ref B of B;

up date step for this utterance yields Discourse

F mother F ;

State 1, containing amongst others the follow-

ref Gsal ient or centr edG; 1

ing facts:

& f G

& car dB; 1

Discourse State 1

|||||{

The rst step involves anchoring the referring

sl eep134

expression by dereferencing its nested referring

134; ag ent; 94

5

expression for \her" .

bef or e41; 133 ends

aspectsimpl e; 133; 134

5

The referent for this is characterised as

sal ient or centr ed as we allow p ossessive pronouns

We were able to prove named94; M ar y 

to be dereferenced as anaphors or, failing that, as

and car d94; 1 and so dereference pronouns.

8X houseX  _ car X  ! Current Mo del

9Y of Y; Z door Z ;X&car dY; 1 |||||{

at or af ter 41; 135 ends

This means that, having used utterance 1

aspectsimpl e; 135; 136

ab ove to up date the discourse mo del, we have

136; ag ent; 94

the following amongst the facts in Discourse

love136

State 1:

The partially constructed current discourse

Discourse State 1

state wehave when we do our dereferencing is as

|||||{

shown. \Mary" has already b een dereferenced

see1138

to 94 and this has b een entered into the list

138; ag ent; 94

of forward-lo oking centres for the current utter-

138; obj ect; 139

ance. We are able to prove b oth sal ient94

car d139; 1

and f 94, and so our nested referring expres-

house139

sion is dereferenced to this entity.

bef or e41; 137 ends

door 46139

ref B of B;

entr ance46139

F mother F ;

of 46139;Adoor A; 139

94

car d46139; 1

& car dB; 1

aspectsimpl e; 137; 138

It is then a straightforward matter to derefer-

In up dating utterance 2, the bridging descrip-

ence the anchored referring expression, given

tion which needs to b e dereferenced has the fol-

the following facts in Discourse State 0:

lowing representation:

Discourse State 0

ref E door E  & car dE; 1

|||||{

Since we cannot guarantee that there will only

mother 6094

be a single entity in our mo del satisfying the

of 6094;Amother A; 94

prop erties E door E  & car dE; 1, wewant

f 6094

to ensure that the referent we obtain is either

car d6094; 1

the most recently mentioned or that with the

most recently mentioned antecedent, i.e., in this

These derive from our meaning p ostulates for

6

case, the house 139. Our reference resolu-

\mother" and \of ":

tion pro cedure exploits the fact that the house,

8X X is animal &car dX; 1 !

139, is explicitly represented in the forward

9Y of Y; Z mother Z ;X

lo oking centres of Discourse State 1 and that

&car dY; 1&f Y 

the intended referent, 46139, is clearly a

8X 8Y 8Z of X; Y ; Z  ! Y:X 

function of this its dep endency having b een

preserved bySkolemization. In considering the

Dealing with other bridging descriptions is

p otential referents for our referring expression in

more complicated:

order of recency,we attempt to prove, not sim-

1 Mary saw a house.

ply, for each referent, X , whether door X  and

2 She found the door.

car dX; 1, but door Y  and car dY; 1 where

Y is a function of X . Since 46139 is

In order to give an analogous treatment to the

a function of the antecedent 139, we obtain

referring expression \the do or", wehave to treat

the appropriate referent in this case by proving

it as elliptical for an expression containing a

door 46139 and car d46139; 1.

nested referring expression, i.e., \the do or of the

house ". In the same way that wehave a mean-

3.2 Sup erlatives

ing p ostulate for the relation mother of,wehave

We are now in a p osition to describ e our treat-

one for the relation door of :

ment of the sup erlatives discussed in the intro-

6

duction. First, we consider a case in which there

Skolemization preserves the dep endency of Y on X,

is a discourse antecedent of sorts: i.e., 94 is present in 6094.

In this way, we prove that the referring ex- 1Two men arrived.

pression makes sense, i.e., denotes. However, 2The rst man sp oke.

unlike in the previous cases, we do not deref-

Discourse State 1 contains the following facts:

erence to a familiar referent. There are no

existing facts in the database ab out the ref-

Discourse State 1

erent 81Aear l y A;:::; 1; 2; 108; 108.

|||||{

Instead, in this case, we have to add to Dis-

arrive107

course State 2 the facts wehave proved.

107; ag ent; 108

car d108; 2

Discourse state 2

man108

|||||{

mal e108

speak 112

m108

theta112; ag ent;

adul t108

81Aear l y A;:::; 1; 2; 108; 108

bef or e41; 106 ends

bef or e42; 111 ends

aspectsimpl e; 106; 107

speech time42; 2

aspectsimpl e; 111; 112

Our representation of the referring expression

most81Aear l y A;:::; 1; 2; 108; 108;

\the rst man" is as follows:

C ear l y C; D manD ;

ref B mostB;

108

C ear l y C; D manD ;

car d81Aear l y A;:::; 1; 2; 108; 108;

ref E manE 

1

& car dB; 1

ear l y 81Aear l y A;:::; 1; 2; 108; 108;

The nested referring expression

C manC 

ref E manE  can be straightforwardly

man81Aear l y A;:::; 1; 2; 108; 108

dereferenced in this case to give the anchored

mal e81Aear l y A;:::; 1; 2; 108; 108

referring expression:

m81Aear l y A;:::; 1; 2; 108; 108

adul t81Aear l y A;:::; 1; 2; 108; 108

ref B mostB;

C ear l y C; D manD ;

The further facts we prove, ab out our referent

108

b eing early, male, etc., are required if we are to

& car dB; 1

b e able to subsequently refer to it using referring

Dereferencing this then involves our meaning

expressions such as \he". These are generated

p ostulate for sup erlatives:

from a set of asso ciated meaning p ostulates:

8X 8Z 8C car dZ; C &Z X &:C = 1&

8X 8Y 8P or der edP &mostY; P; X  ! P:Y 

8N 8P :mostX; P ;  !

8Aor der edB ear l y B; A

9Y mostY; P; X &car dY; N 

8X 8P ear l y X; P  ! P:X 

8X manX  !

This simply says that for any severalton set X ,

X is human&mal eX &adul tX 

any prop erty P and any N , there is some set Y

8X mal eX  ! mX 

containing the N \most P " memb ers of X. This

meaning p ostulate do es not translate into any

In addition to these, wehavetwo further mean-

facts in Discourse State 0, but remains as a rule.

ing p ostulates for sup erlatives:

When wehave a particular referring expression

to dereference, this rule enables us to prove that:

8X 8Y 8P 8C 8Z mostY; P; X &car dY; C 

&mostZ; P ; X &car dZ; C  most81Aear l y A;:::; 1; 2; 108; 108;

! Z = Y  C ear l y C; D manD ;

8X 8P 8Y 8N 8C mostX; P ; Y & 108

car dX; N &car dY; C  car d81Aear l y A;:::; 1; 2; 108; 108;

!:mor eN; C  1

anchor our referring expression by dereferenc- The rst of these, the uniqueness meaning p os-

ing the referring expression nested within it, we tulate, states that if there are two subsets of of

need to intro duce a meaning p ostulate for the a set which share the same cardinality and the

nested referent and one for its nested referent, same sup erlative prop erty, such as rst, then

7

Spring : they must be regarded as identical . The sec-

ond simply ensures that any unfamiliar refer-

9X seasonX &namedX ; S pr ing &car dX; 1

ent which we obtain via our meaning p ostu-

9X of X;

lates can sensibly regarded as a prop er subset

Y snow dr opY ;

of its antecedent; that is, it prevents us regard-

ref Z namedZ ; S pr ing 

ing \two men" as a p otential antecedent of \the

&car dX; pl

rst men":

These meaning p ostulates simply intro duce into

1 Two men arrived.

i

Discourse State 0 the fact that there are snow-

2 The rst men  sp oke.

f i

drops of Spring, in the same way that the mean-

ing p ostulate for \Mary" intro duced the fact

Our treatment of sup erlatives without dis-

that there is a singleton set containing an in-

course antecedents is similar to that ab ove.

dividual so named.

1 Mary saw the rst snowdrops of

Discourse state 0

Spring.

|||||{

season98

There is just one ma jor di erence.

named98; S pr ing 

ref E mostE;

car d98; 1

F ear l y F;

extended98

Gof G;

H snow dr opH ;

snow dr op101

ref I namedI;

of 101;A; snow dr opA; 98

S pr ing 

n101

&car dI; 1;

car d101;pl

ref J of J;

Given the ab ove facts in Discourse State 0, an-

K snow dr opK ;

choring our referring expression is straightfor-

ref LnamedL; S pr ing 

ward.

&car dL; 1

&car dE; pl

ref E mostE;

F ear l y F;

The representation we obtain for the referring

Gof G;

expression \the rst snowdrops of Spring" is

H snow dr opH ;

shown ab ove. Like that for \the rst man", this

98;

contains a nested referring expression:

101

&car dE; pl

ref J of J;

K snow dr opK ;

From this p oint onwards, the pro of that this

ref LnamedL; S pr ing 

referring expression denotes pro ceeeds in the

&car dL; 1

same way as in the previous example. Given

the meaning p ostulates for sup erlatives, we are

The di erence is that, in this case, there is

able to prove:

no discourse antecedent for the nested refer-

most81Aear l y A;:::; pl ; pl ; 101; 101;

ring expression. This means that, in order to

D ear l y D;

7

Practically, this meaning p ostulate seems to b e re-

E of E; F snow dr opF ; 98;

dundant. Our meaning p ostulates generate for us only

101

one such subset and it is imp ossible for another to be

car d81Aear l y A;:::; pl ; pl ; 101; 101;

intro duced through the discourse as \a rst man" is un-

pl  grammatical.

ries of reference resolution, likeFamiliarity The- Again, as in the example ab ove, the facts we

ory, and semantic theories, like Russell's, may have proved concern an unfamiliar referent, and

be reconciled. However, it is fair to say that so have to be added to the current discourse

the success of the approach is not yet proven. state.

This is b ecause we have yet to show that we

Discourse state 1

can deal with a set of related referring expres-

|||||{

sions within a single framework. The following

see1107

example illustrates the kinds of cases we have

theta107; ag ent; 94

in mind:

theta107;

1 Three men ate.

obj ect;

i

2 Two men slept.

81A;:::; 98; pl ; pl ; 101; 101

j

3 The rst men died.

bef or e41; 106 ends

i

aspectsimpl e; 106; 107

Here, \ rst" in \the rst men" is clearly p er-

most81A:::; 98; pl ; pl ; 101; 101 ;

forming a di erent, discourse-related function

D ear l y D;

from that it plays in the cases we have b een

E of E; F snow dr opF ; 98;

considering. Wehaveyet to tackle such dicult

101

cases but, since they seem to require reasoning

car d81A:::; 98; pl ; pl ; 101; 101 ;

ab out sets, we b elieve that our inference-based

pl 

approach to reference resolution is a go o d place

ear l y 81A:::; 98; pl ; pl ; 101; 101;

to start.

D of D; E snow dr opE ;

98

References

of 81A:::; 98; pl ; pl ; 101; 101 ;

B. J. Grosz and C. L. Sidner. 1986. Attention,

D snow dr opD ;

intentions, and the structure of discourse.

98

Computational Linguistics, 123:175{204.

snow dr op81A:::; 98;:::

B. J. Grosz, A. K. Joshi, and S. Weinstein.

n81A:::; 98; pl ; pl ; 101; 101

1995. Centering: A framework for mo deling

the lo cal coherence of discourse. Computa-

4 Conclusion

tional Linguistics, 212:203{225.

Wehave shown how, by taking a pro of-theoretic

I. Heim. 1983. File change and

approach to reference resolution, we can extend

the familiarity theory of de niteness. In

a Centering-typ e framework to make sense of

R. Bauerle, C. Schwarze, and A. von Stechow,

referring expressions for a variety of unfamiliar

editors, Meaning, Use, and Interpretation of

referents. Having made sense of such referring

Language, pages 164{189. de Gruyter, Berlin.

expressions, we add their referents to our dis-

C. Pollard and I. A. Sag. 1994. Head-Driven

course mo del. This is how we would normally

Phrase Structure Grammar. University of

deal with inde nites rather than de nites. How-

Chicago Press, London.

ever, this approach makes p erfect sense, given

A. Ramsay and Helen Seville. 2000. Mo dels and

our treatment of such referring expressions as

discourse mo dels. Journal of Language and

denoting unfamiliar subsets of familiar referents

Computation, 12:159{174. forthcoming.

regarded as sets. We claim that we are able

A. Ramsay. 1999. Do es it makeany sense? up-

to use de nite descriptions to refer to the ref-

dating = consistency checking. In K. Turner,

erents in question, despite their unfamiliarity,

editor, The Semantics/ Interface

so long as we can prove that, by virtue of their

from Di erent Points of View. Elsevier Sci-

meaning, they denote uniquely.

ence B.V.

Having implemented our approach in a sys-

H. Seville. 1999. Exp eriments with discourse

tem of language understanding which already

structure. In Third International Workshop

deals with a wide variety of referring expres-

on , pages 233{246,

sions, we have demonstrated its practicality.

Tilburg.

It also has interesting theoretical implications,

since it suggests a way in which pragmatic theo-