<<

Context-Free Grammar Rewriting and the Transfer of Packed Linguistic Representations

Marc Dymetman Fred´ eric´ Tendeau Xerox Research Centre Europe Lernout & Hauspie 6, chemin de Maupertuis Koning Albert-I laan 64 38240 Meylan, France B-1780 Wemmel, Belgium [email protected] [email protected]

Abstract The representations of (Emele and Dorna, 1998) and We propose an for the transfer of packed linguistic (Kay, 1999) are based on a notion of propositional con- structures, that is, finite collections of labelled graphs which texts (see (Maxwell and Kaplan, 1991)), where each share certain subparts. A labelled graph is seen as a word over possible non-ambiguous reading included in the packed a vocabulary of description elements (nodes, arcs, labels), and source representation is extracted by selecting the value a collection of graphs as a set of such words, that is, as a lan- (true or false) of a certain number of propositional vari- guage over description elements. A packed representation for ables that index elements of the labelled source graph. the collection of graphs is then viewed as a context-free gram- Transfer is then seen as a process of rewriting source mar which generates such a language. We present an algorithm graph elements (e.g, nodes labelled with French lexemes) that uses a conventional set of transfer rules but is capable of into target graph elements (e.g. nodes labelled with En- rewriting the CFG representing the source packed structure into glish lexemes), while preserving the propositional con- a CFG representing the target packed structure that preserves texts in which these graph elements were selected. the compaction properties of the source CFG. In contrast, our approach, following (Dymetman, 1997), views a packed representation as being a gram- 1 Introduction mar (more specifically, a context-free grammar) over the vocabulary of graph elements (labelled nodes and edges), There is currently much interest in translation models where each word (in the sense of formal language theory) that support some amount of ambiguity preservation be- tween source and target texts, so as to minimize disam- generated by the grammar represents one of the possible biguation decisions that the system, or an interactive user, non-ambiguous readings of the packed representation. In has to make during the translation process (Kay et al., other terms, the collection of non-ambiguous graphs be- longing to the packed representation is seen as a lan- 1994). guage over a vocabulary of graph elements, and a packed An important aspect of such models is the ability to handle, during all the stages of the translation process, representation is seen as a grammar which generates such packed linguistic structures, that is, structures which fac- a language. Packing comes from the fact that a context- torize in a compact fashion all the different readings of free grammar is an efficient representation for the lan- guage it generates. Another essential feature of such a a sentence and obviate the need to list and treat all these representation is that it is interaction-free, that is, each readings in isolation of each other (as is standard in more nondeterministic top-down traversal of the grammar suc- traditional models for machine translation). In the case of parsing, and more specifically, parsing ceeds without ever backtracking and it results in a certain with unification-based formalisms such as LFG, tech- reading, without the need for checking the consistency of a set of associated propositional constraints: the repre- niques for producing packed structures have been in sentation for the collection of readings is as direct as can existence for some time (Maxwell and Kaplan, 1991; be while permitting a factorization of common parts. Maxwell and Kaplan, 1993; Maxwell and Kaplan, 1996; D¨orre, 1997; Dymetman, 1997). More recently, tech- Based on this notion, we present an algorithm for niques have been appearing for the generation from transfer which, starting from a finite set of rewriting packed structures (Shemtov, 1997), the transfer between patterns (the transfer lexicon), associates with a given context-free grammar representing the source packed packed structures (Emele and Dorna, 1998; Rayner and structure a context-free grammar representing the tar- Bouillon, 1995), and the integration of such mechanisms into the whole translation process (Kay, 1999; Frank, get packed structure. Therefore, the target representa- 1999). tion remains interaction-free and transparently encodes This paper focuses on the problem of transfer. The the target structures; furthermore, under certain natural “locality” conditions on the rewriting rules (the graph el- method proposed is related to those of (Emele and Dorna, ements in their left-hand sides tend be be “close” from 1998) and (Kay, 1999). As in these approaches, we view each other in the source grammar derivations), the target packed representations as being descriptions of a finite collection of directed labelled graphs (similar to the func- grammar preserves much of the factorization and com- tional structures of LFG), each representing a different paction properties of the source grammar. non-ambiguous reading, which share certain subparts. The paper is structured in the following way. Sec- tion 2 explains how ambiguous graphs can be seen as way to describe such a graph is by listing a collection of commutative languages over graph description elements, “description elements” for it, where each such element

and how context-free grammars provide concise specifi- is either a labelled node such as see 0 or a labelled edge

cations for these languages. Section 3 extends the stan- such as mod27 . Using this format, the pragmatically pre-

f 01

dard notion of non-ambiguous transfer to that of am- ferred analysis for our sentence is the set see 0 , arg1 ,

02 2 27 7 23 3 34

biguous transfer. Section 4 presents the basic language- i1 , arg2 , light , mod , green1 , mod , on , arg2 ,

g

05 5 56 6 theoretic formalism needed and introduces some opera- hill4 , mod , with , arg2 , telescope . tors on languages. Section 5 presents the detailed rewrit- If we consider the collection of all possible analyses, ing algorithm, which applies these operators not directly we then obtain a collection of sets of description ele- to languages, but to the context-free grammars specify- ments. It is convenient to view such a collection as a ing them. Section 6 gives an example of the algorithm in commutative language over the vocabulary of all possi- operation. ble description elements; each word in such a language corresponds to one analysis and is a list of description 2 Ambiguous structures as languages elements the order of which is considered irrelevant. The main advantage of taking this view of ambiguous

0: see / saw structures is that formal language theory provides stan- arg1 arg2 mod mod dard tools for representing languages compactly. Thus it is well-known in computational lexicography that a 1: i 2: light mod mod large list of word strings can be represented efficiently by mod 3: on means of a finite-state automaton which factorizes com- 7: green1 / green2 arg2 mon substrings. Such a representation is both compact 4: hill mod and “explicit”: accessing and using it is as direct as the 5: with flat list of words would be. arg2 Although one might think of using finite-state mod- 6: telescope els for representing compactly the language associated with a collection of graphs, they do not seem as relevant Figure 1: An informal graphical representation of the 20 as context-free models for our purposes. The reason is possible analyses for “I saw the green light on the hill that the source packed representations are typically ob- with a telescope”. tained as the results of chart-parsing processes. A chart Let’s consider the sentence “I saw the green light on used in the parsing of a context-free grammar can itself the hill with a telescope”. In Fig. 1, we have repre- be viewed as a context-free grammar, which is a spe- sented informally the set of possible analyses for this cialization of the original grammar for the string being sentence. Labels on the nodes correspond to predicate parsed, and which directly generates the derivation trees names (‘on’, ‘hill’, etc). A slash is used to indicate dif- for this string relative to the original grammar (Billot and ferent possible readings for a node; for instance, we as- Lang, 1989).1 The generalization of this approach to uni- sume that the surface form “saw” can correspond to the fication grammars (of the LFG or DCG type) proposed verbs “to see” or “to saw”, and that “green” is ambigu- in (Dymetman, 1997) shows that, in turn, chart-parsing ous between the color adjective “green1” and the noun with these unification grammars conducts naturally to “green2” (grassy lawn). Relations between nodes are in- packed representations for the parse results very close to

dicated by labels on the edges joining two nodes: ‘arg1’ the ones we are about to introduce. G and ‘arg2’ for first and second argument, ‘mod’ for mod- Let’s consider the CFG 0 : ifier. The solid edges correspond to relations which are S ! SAW ON WITH D3

satisfied in all the readings for the sentence, dotted edges

!

1 02 SAW D0 arg101 i arg2 LIGHT

to relations that are satisfied only for certain readings. ! 2

LIGHT GREEN mod27 light

! j 7

Thus, the preprositional phrase “on the hill” can modify GREEN green17 green2

!

34 4

either “light” or “see/saw”, the phrase “with a telescope” ON on3 arg2 hill

!

56 6

either “hill”, “light”, or “see/saw”. The informal picture WITH with5 arg2 telescope

j ! 0

of Fig. 1 does not make explicit exactly which structures D0 see0 saw

! j 23

are actually possible analyses of the sentence. For in- D3 mod03 D30 mod D32

! j 45

D30 mod05 mod 25

stance the two crossing edges mod 03 and mod (where

! j j

25 45 indices are used to denote the origin and destination of D32 mod05 mod mod the edge) cannot appear together in a reading of the given Nonterminals of that grammar are written in upper-

sentence. As a consequence only five of the apparent case, terminals (which are graph description elements) in

 3 2 prepositional attachments combinations are possi- lowercase. It can be verified that the language generated ble, which multiplied by the four possible lexical variants by this grammar is the collection of commutative words for “saw” and “green” gives 20 possible readings for the sentence. 1This context-free grammar has polynomial size relative to the length of the string. While it is also possible in principle to use a finite- Each of these readings is a graph where nodes 0 and state model for representing the same set of derivation trees, it can be 7 now carry one label, and where one ‘mod’ edge has shown that such a model may be exponential relative to string length been selected for the attachment of nodes 3 and 5. One (remark due to John Maxwell). corresponding precisely to all the possible analyses for 4 Formal aspects the sentence. The commutative over an alphabet A is denoted

The fact that there are 20 such words can be es- 

A  by C , and its words are represented by vectors of

tablished by a simple bottom-up computation involving A

A N w 2 N , indexed by and with entries in . For each

multiplications and sums. If we call ambiguity degree A

a 2 A N , the component indexed by is denoted by

ad(N) of a nonterminal N the number of words it gen-

w a w

a] [ and tells how many ’s occur in . The product

erates, then it is obvious that, for instance, ad(D30) = 2, 

w w C A  w 2 2

(concatenation) of 1 and in is the vector

  

ad(D3) = 2+3, ad(S) = 4 1 1 5 = 20. In fact, it is the mul-

A

+ w 8a 2A w = w N

2 1

[a] [a] a]

tiplications which appear in such computations which are s.t. : [ . A language of



A  responsible for the compactness of the grammar as com- the commutative monoid is a subset of C .

pared to the direct listing of the words: each time a mul- The subword relation is denoted by . For a language

v L w 2 L v w

tiplication appears, a factorization is being cashed in.2 L, we write: iff there exists s.t. . L

The rewriting is performed from a source language S

 L T

3 Transfer as language rewriting over an alphabet S to a target language over an al-

  S

phabet T (disjoint from ) w.r.t. a set of rewriting

+ 

   !

When working with non-ambiguous structures, transfer R T

rules S (rules have the form ). We

2  is a rewriting process which takes as input a source- a

assume in the sequel that any S appears at most

language graph and constructs a target-language graph once in any left-hand side of each rule of R and also at

lhs ! rhs

by applying transfer rules of the form ,where L

most once in any word of S . This property is preserved rhs

lhs and are finite sets of description elements for by all the rewritings that we are going to introduce.

!= R R

source graph and target graph respectively. In outline, the Let’s define LH S .For ,wedefine

R =fa 2  j9r 2 R s:t: aLH S r g “non-ambiguous”transfer process works in the following eft et 

L S S . L

way: for each non-overlapping covering of the source  S The rewriting is a function R taking and yielding

graph with left-hand sides of transfer rules, the corre- L

T ,definedas:

sponding right-hand sides are produced and taken to-

 L  = f  j9w 2 L ;w =   ^

S 1 p S 1 p

gether represent a target graph (this is a non-deterministic R

 ! 2R^::: ^  ! 2Rg:

1 p p function as there can be several such coverings). 1 In the case of ambiguous structures, the aim of transfer

is to take as input a language of source graphs and to pro- 5 Algorithm 

duce a language of target graphs. The language of target In order to implement the function R ,itisusefulto

 

! 

graphs should be equal to the union of all the graphs that introduce rewriting functions and a . They apply



L C    =  [  T would have obtained if one had enumerated one-by-one to any language over ,where S .

the source graphs, applied non-ambiguous transfer, and They are defined as:

 L=fw j w 2 Lg

! 

taken the collection of all target graphs obtained. The 

L=fw 2 L j w =0g 

a

a]

goal of ambiguous transfer is to perform the same task [ .



! 

on the basis of a compact representation for the collec- The  functions are applied so that source sym- L

tion of source graphs, yielding a compact representation bols are guaranteed to be removed one by one from S :

 <

for the collection of target graphs. we consider S is totally ordered by and we write

 =[a ;a ; :::; a ] a

1 2 N i i+1

For illustration purposes, we will consider the follow- S , with ; then consider the

R R R R R R

2 N 1

ing collection of transfer rules: partition of : 1 , , ..., s.t. contains all

a R R a

2 2

rules with 1 in LHS, contains all rules with but

a R R a

! !

1 N N

0 0 0

see 0 voir , saw scier , not in LHS, etc, contains all rules with only



! !

R

7 7 7

7 in LHS. Then we define a third rewriting function :

green1 vert , green2 gazon , i

S

0

 L  L= L [

!

r R

a

27 27 7 2 7

2 . i

light ,mod , green1 feu ,mod , vert , i

r 2R

i

! 2

light2 lumi`ere ,etc.

L L S

Lemma. T can be obtained from by applying the R We have only listed a few rules, and have assumed that i iteratively in the following manner:

the remaining ones are straighforward one-to-one corre-

   L  = L :

R R R S T

N N 1 1

! !

1 03 03

spondences (1 1 je ,mod mod’ [we prime la-

 j  N bels such as mod, arg1,... in order to have disjointness of PROOF SKETCH.For1 ,wedefine

3



= f  x j9w 2 L ;x 2  ;p  0;w =

source and target vocabulary], etc.). L

j 1 p S S

   x; 8k  p ! 2[ R ; 8i  ja6 xg

p k k ij i i 1 .

2As the example shows, context-free representations of ambigu-

L = L L =

T 1 It is clear that N . Furthermore, we have

ous structures have the important property (related to their interaction-

2  n  N L   S R , and it is easy to show that, for ,

freeness as described in the introduction) of being easily “countable”. 1

L  L = 

n1 R

n . From this we have immediately

This is to be contrasted with other possible representations for ambigu- n

=    L    = L

ous structures, such as ones based on propositional axioms determining L

R R R S T

N .

N 1 1

which description elements can be jointly present in a given analysis. N

L L S In these representations, the problem of determining whether there ex- In order to obtain T , we will start from and ac-

ists one structure satisfying the specification can be of high complexity, 

tually apply the R ’s not on languages directly but on let alone the problem of counting such structures. i 3In practice, real transfer rules are not specialized for specific nodes, tain ground rules as the ones we are considering, a simple preprocessing but are patterns containing variables instead of numbers; in order to ob- step is necessary.

 x  = A A

1

k

!

the grammars that define them. This computation is per- function  is//

 xA  A 1

formed by the algorithm that we now present. // is searched within k

L G  N P S 9j 2f1; :::; k g 8a; aLA  

0 0 0 0 j

Let S be defined by the CFG =( , , , ). if s.t. // if falls

LA  A

A 2N A

j j

For 0 , the set of all rules having as LHS is no- // entirely within then the rewriting applies only to

P

A! A 

!

tated . This additive notation is a for- then add j to Agenda;

A! 2P

0

xA  A A  A A

A! j j A!0

j 1 j ! j +1 k

return 1 ;(4) 2

mal represention of 1 ... Hence means 

that no rule defines A. else // is searched within several symbols

 = yw w  w

 G G =

2 k

Consider 1 s.t.

0 1

First R is applied on , which builds

1

x  y

; N ; P ;S  G

 – the longest common subword of and is ,

1 1 1 1

,then R is applied on to produce

2

8aw ;aLA  w A 

j j

G j – j // is contribution to

2 and so forth. Each time, new non-terminals are in-



A A A

 if such decomposition of exists // that is, it is

R ! 

troduced: of the form , or a ,where

i

+ 

x A

j

2N  2   2  a 2 

A // entirely covered by and some ’s

1 S T S i , , ,and . Each one

is defined by a formal sum as we saw above. then /* it is unique: see below */ add to Agenda

A  w 6= 

w ! j all j s.t. ; // all those that contribute

The order of symbols in the RHSs of grammar rules j

Q Q

x=y  A   A  

w ! j

return j ;(5)

j

6= w =

is irrelevant since we consider commutative languages. w

j j

x x

Hence the RHSs of grammar rules can be denoted by // The rewriting is actually applied: y is deleted from ;

 

x 2 C   2 C N  N

 w

s.t. and ,where is the set of // each contributing (i.e. non ) j is to be deleted

 A A j all non-terminals considered. // (i.e. rewritten to in j ); non-contributing ’s

The algorithm consists of the procedure and functions // remain untouched; and  is inserted. x

described below and uses an agenda which contains new else //  cannot be produced by G

non-terminals to be defined in i . The agenda is handled return 0; // No rewriting is applicable

with a table: each non-terminal is treated once. end function;

j  

! procedure main is 

Unicity of in a , and unicity of the sequence in :

i 2f1; :::; N g

!xX Y 2P

for do A

1

consider i ; as each source symbol oc-

P P

L S

i1

Initialize i with ;

1

curs at most once in every word of ( i ), the same

R 6= ;

LA

if i then holds for hence the sets of source symbols occur-

S 

LX  L Y 

1 R Initialize Agenda with i ;

i ring in and are disjoint. repeat remove NonTerm from Agenda; 6 Example

case NonTerm is

 R

1 7 7 0

Consider S =[i , green1 , green2 , see , ...] so that

A P i

when R :addto

i

P

R f ! g R f !

1 1 2 7 7

is partitioned in 1 = i je , = green1 vert ,

! A    R

R ;

i i

A! 2P

i1

! g

27 2 2 27 7

green1 7 mod light feu mod vert , etc. Each

A P

!  i

when  :addto

R P

other i contains a single rule.

A !   

!  ! 

 ;

A! 2P

1

i The first iteration of the algorithm computes the gram-

A P

G =  G 

i

a

R 0

when :addto mar 1 . The result is:

1

P

A !    a

a ;

A! 2P

i1

S !

0 R

( )R ( AW) N ITH 3, 1

end case; 1 S O W D

!

01 02 1

( AW)R 0 arg1 arg2 IGHT je , S 1 D L

until Agenda is empty

! 2

LIGHT GREEN mod27 light ,

G S =S 

i i1 R

Reduce i whose axiom is ;

i

! j 7 GREEN green17 green2 ,

/*remove non-terminals that are non-productive

!

34 4

ON on3 arg2 hill ,

LA= ; S

( ) or inaccessible from i .*/

!

56 6 WITH with5 arg2 telescope ,etc. end for;

end procedure; We see that the only nonterminals which have been rede-

= S S

S AW

0 R

fined are 0 and S . The computation of ( )

1

 = A  A x 

1 k function R is//

i has been done through step (1) in the algorithm. This is

9j 2f1; :::; k g 8a 2 R ;aLA 

eft et R j if s.t. L S i

because the terminals in left-hand sides of 1 , namely

R A j // if all rewritings in i can only affect

the single terminal i1 , are all “concentrated” on the sin-

A 

S R

j Agenda AW then add to ; 0

i gle nonterminal S on the right-hand side of .This

xA  A A  A  A

j 1 j R j +1 k

1 AW return ;(1) R

i leads in turn to a requirement for a definition of (S ) ,

1

P

 x +  x  r else return a ;(2)

i which is fulfilled by step (5) in the algorithm, at which

r 2R

i 1

end function; time the rewriting of i1 into je is performed. R

For any group of rules i , as long as all terminals in

x   = A A

1

k R function a is//

the left-hand sides of rules of i are thus concentrated on

9j 2f1; :::; k g aLA 

if s.t. j at most one nonterminal in a right-hand side, no expan-

j A  j

then /* is unique, see below*/ add a to Agenda; sion of rules is necessary. It is only when the terminals

xA  A A  A A

1 j 1 j j +1 k

return a ;(3)start to be distributed on several RHS terminals or non-

x else if a then return 0; terminals that an expansion is required.

else return x ; This situation is illustrated by the second iteration

G G =  G 

1 2 R

end function; which maps into 1 . The result is: 2

S !

0 R R R

(( )R ) (( AW) ) N ITH 3,

2 1 2 1 S O W D representation which maintains these beneficial proper-

!

R 01 02 R 1

(( AW)R ) 0 arg1 arg2 ( IGHT) je ,

2 2

S 1 D L ties. Although proofs have not been provided here, the

 !

27 2 ( IGHT)R REEN mod light , L 2 G algorithm can be shown to satisfy our initial formal def-

green17

j   2

REEN 27

G ! mod light , inition of transfer as nondeterministic, exhaustive, non- 7

green17 vert

0

j   7

REEN 2

! G feu mod27 vert ,

green17 overlapping replacement of description elements in the

  ! !

3 34 4 GREEN green27 ON on arg2 hill , source structure by their counterparts as specified in the

green17

  ! !

5 56 6 REEN vert 7 ITH with arg2 telescope ,

G ! W rewriting rules. The method described in this paper bears 7

green17 vert

 ! 

 REEN  G ! ,etc.

green17 some obvious analogy to the classical problem of map-

ping a context-free language into another context-free R

This time, the terminals in left-hand sides of 2 are language by way of a finite-state transducer (Harrison,

27 2

green1 7 , mod and light . We first need to compute 1978). It would be an interesting research question to

S

R R

(( 0 ) ) . Again, our three terminals are all con- 2 1 make this analogy formal, the main difference here be- AW centrated on (S ) R . We thus only have to define 1 ing the need to work with a commutative concatenation,

AW R

((S ) R ) . Once again, the three terminals are con- 2 1 as opposed to the standard non-commutative concatena- IGHT IGHT centrated on L , and we have to define (L ) R . 2 tion which is more directly connected with the automaton At this point, something interesting happens. It is not view of transductions. the case any more that one nonterminal on the right- Acknowledgments hand side of the rule defining LIGHT concentrates all

REEN Thanks to our colleagues Eric de la Clergerie, Max Copper- our terminals. In fact, G only “touches” green17 , but not the other two terminals. The algorithm then man, Andreas Eisele, Martin Emele, Anette Frank, Pierre Is- has recourse to step (2), which leads it to define three abelle, Ron Kaplan, Martin Kay, Bernard Lang, John Maxwell and Hadar Shemtov for extended discussions and comments at

IGHT 

rules for (L ) R , involving recursive calls to , 2

green17 various stages in the preparation of this paper.

0

 

! , .Thefirst

! 7

green17 vert

27 2 2 7

green17 mod light feu mod vert 27 of these calls involves step (3), the second, step (4), and References the third, step (5), leading to the three expansions shown S. Billot and B. Lang. 1989. The structure of shared forests in

IGHT th

for (L ) R , and eventually to the definitions for the 2 ambiguous parsing. In 27 Meeting of the Association for three variants of the nonterminal GREEN. Computational . The remaining iterations of the rewriting procedures J. D¨orre. 1997. Efficient construction of underspecified seman- are of the same type as the first iteration. They lead fi- tics under massive ambiguity. In Proc. ACL,Madrid. nally to a target grammar of the form: M. Dymetman. 1997. Interaction-free grammars, chart-

parsing, and the compact representation of ambiguity. In

0 0 0 0 0 0 0 0 0 0

! S !

AW N ITH AW IGHT 1 02

S O W D3 S D0 arg101 arg2 L je Proc. IJCAI, Nagoya.

0 0 0 !

IGHT REEN 2

L G mod27 lumi`ere Martin Emele and Michael Dorna. 1998. Ambiguity preserv-

00 0 j

REEN 2

G mod27 lumi`ere ing machine translation using packed representations. In

0

j

2 7

feu mod27 vert Proceedings of Coling-ACL ’98, pages 365–371, Montreal,

0 0 0

! !

5 6

REEN 7 ITH

G gazon W avec arg256 lunette August.

00 0 0

! !

3 4

REEN 7 N

G vert O sur arg234 colline Anette Frank. 1999. From parallel grammar development to-

0 0 0 0 0 0 0 0

! j ! j

45 03 23

D30 mod05 mod D3 mod D30 mod D32 wards machine translation. In Proceedings of MT Summit

0 0 0 0 0

! ! j j j

0 0

25 45 D0 voir scier D32 mod05 mod mod VII. MT in the Great Translation Era, pages 134–142, Kent Ridge Digital Labs, Singapore, September. which is only slightly less compact than the source gram- Michael A. Harrison. 1978. Introduction to Formal Language mar. It can be checked that this grammar enumerates 30 Theory. Addison-Wesley, Reading, MA. target graphs, the difference of 10 with the source gram- M. Kay, J.M. Gawron, and P. Norvig. 1994. Verbmobil: a mar being due to the addition of the French variant “feu translation system for face to face dialog.CSLI. vert” along with “lumi`ere verte” for translating “green Martin Kay. 1999. Chart translation. In Proceedings of MT light”. Summit VII. MT in the Great Translation Era, pages 9–14, Kent Ridge Digital Labs, Singapore, September. 7 Conclusion John Maxwell and Ronald Kaplan. 1991. A method for dis- junctive constraint satisfaction. In Masaru Tomita, editor, We have presented a model and an algorithm for the Current Issues in Parsing Technology. Kluwer, Dordrecht. transfer of packed linguistic representations based on John T. Maxwell and Ronald M. Kaplan. 1993. The interface the view that: (1) packed representations are best seen between phrasal and functional constraints. Computational as context-free grammars over graph description ele- Linguistics, 19(4):571–590, December. ments, an approach which permits factorization of com- John T. Maxwell and Ronald M. Kaplan. mon parts while maintaining a transparent, easily com- 1996. An efficient parser for LFG. In First putable, relationship to the set of structures represented LFG Conference, Grenoble, France, August. (interaction-freeness, countability)4, and (2) transfer is http://www-csli.stanford.edu/user/mutt/. a rewriting process that takes as input such a context- M. Rayner and P. Bouillon. 1995. Hybrid Transfer in an free representation and that outputs a target context-free English-French Spoken Language Translator. In Proceed- ings of IA ’95, Montpellier, France, June. 4Properties that we believe are essential to all such representations, H. Shemtov. 1997. Ambiguity Management in Natural Lan- whether they are made explicit or not. guage Generation. Ph.D. thesis, Stanford.