Proceedings of the 16th International Conference on Head-Driven Phrase Structure Grammar
Georg-August-Universität Göttingen, Germany
Stefan Müller (Editor)
2009
CSLI Publications
http://csli-publications.stanford.edu/HPSG/2009
The papers are published under a CC-BY license: http://creativecommons.org/licenses/by/4.0/ Contents
1 Editor’s Note 4
Gabriela Bîlbîie, Frédéric Laurens: A Construction-based Analysis of Verb- less Relative Adjuncts in French and Romanian 5
Olivier Bonami, Pollet Samvelian: Inflectional periphrasis in Persian 26
Rui P. Chaves: Construction-based Cumulation and Adjunct Extraction 47
Berthold Crysmann: Deriving Superficial Ergativity in Nias 68
Marianne Desmets, Florence Villoing: French VN lexemes: morphological compounding in HPSG 89
Antske Fokkens, Laurie Poulson, Emily M. Bender: Inflectional Morphol- ogy in Turkish VP Coordination 110
Anke Holler: Towards an analysis of the adverbial use of German inter- rogative was (‘what’) 131
Gianina Iordachioaia,˘ Frank Richter: Negative Concord in Romanian as Polyadic Quantification 150
Paul Kay, Ivan A. Sag: How Hard a Problem Would This Be to Solve? 171
David Lahm: An Alternative to the HPSG Raising Principle on the Description- Level 192
Stefan Müller: On Predication 213
Stefan Müller, Janna Lipenkova: Serial Verb Constructions in Chinese: A HPSG Account 234
Bjarne Ørsnes: Preposed Negation in Danish 255
Shakthi Poornima, Jean-Pierre Koenig: Hindi Aspectual Complex Predi- cates 276
Frank Richter, Manfred Sailer: Phraseological Clauses in Constructional HPSG 297
Filip Skwarski: Accounting for Underlying Forms in HPSG Phonology 318
Jesse Tseng: Phonological change and grammaticalization in HPSG: The case of French final consonants 338
2 Frank Van Eynde: On the copula: from a Fregean to a Montagovian treat- ment 359
3 1 Editor’s Note
The 16th International Conference on Head-Driven Phrase Structure Grammar (2009) was held in Göttingen. The conference featured 2 invited talks and 19 papers selected by the program committee (Anne Abeille, Doug Arnold [chair], Olivier Bonami, Bob Borsley, Gosse Bouma, Rui Chaves, Berthold Crysmann, Markus Egg, Elisabet Engdahl, Dan Flickinger, Jonathan Ginzburg, Chikara Hashimoto, Jong-Bok Kim, Tibor Kiss, Anna Kupsc, Shalom Lappin, Bob Levine, Rob Malouf, Detmar Meurers, Stefan Müller, Tsuneko Nakazawa, Gerald Penn, Adam Przepiorkowski, Frank Richter, Louisa Sadler, Ivan Sag, Jesse Tseng, Stephen Wechsler). In total there were 34 submissions to the conference. We want to thank the program committee for putting this nice program together. Thanks go to Anke Holler, Manfred Sailer, Heike Walker, Gert Webelhuth [chair], who were in charge of local arrangements. As in the past years the contributions to the conference proceedings are based on the five page abstract that was reviewed by the respective program committees, but there is no additional reviewing of the longer contribution to the proceedings. To ensure easy access and fast publication we have chosen an electronic format. The proceedings include all the papers except those by Danièle Godard and Anne Abeillé and Polly Jacobson.
4 A Construction-based Analysis of Verbless Relative Adjuncts in French and Romanian
Gabriela Bîlbîie Frédéric Laurens LLF, Université Paris Diderot Paris 7 LLF, Université Paris Diderot Paris 7
Proceedings of the 16th International Conference on Head-Driven Phrase Structure Grammar
Georg-August-Universität Göttingen, Germany Stefan Müller (Editor) 2009 CSLI Publications pages 5–25 http://csli-publications.stanford.edu/HPSG/2009
Bîlbîie, Gabriela, & Laurens, Frédéric. 2009. A Construction-based Analysis of Verbless Relative Adjuncts in French and Romanian. In Müller, Stefan (Ed.), Pro- ceedings of the 16th International Conference on Head-Driven Phrase Structure Grammar, Georg-August-Universität Göttingen, Germany, 5–25. Stanford, CA: CSLI Publications. Abstract French and Romanian verbless relative adjuncts are incidental adjuncts which have been described as elliptical relative clauses. We show that this analysis is not empirically adequate and propose an alternative non-elliptical analysis. We analyze verbless relative adjuncts as sentential fragments whose head can be a cluster of phrases. They are marked by a functor phrase which displays selection properties with respect to the head phrase and makes an essential contribution to the semantics of the adjunct. The analysis relies on the interaction of grammatical constraints introduced by various linguistic objects, as well as on a constructional analysis of verbless relative adjuncts distinguishing several subtypes.
1 Introduction
Both French and Romanian have verbless adjuncts whose form is reminiscent of relative clauses. These verbless adjuncts are exemplified in (1) for French and in (2) for Romanian. They are characterized by the presence of a fronted constituent (shown in italics) which can either be a prepositional phrase containing a WH form (1a, 2a, 2b) or the form dont in French (1b). We label those constructions Verbless Relative Adjuncts (henceforth VRAs).
(1) a. Trois personnes, [parmi lesquelles Jean], sont venues. three people(FEM), [among which.FEM John], AUX come ‘Three people, among which John, have come.’
b. Trois personnes sont venues, [dont une hier]. three people(FEM) AUX come, [DONT one.FEM yesterday] ‘Three people have come, one of them yesterday.’
(2) a. Au venit trei persoane, [{ printre | ˆıntre } care (s¸i) Ion]. AUX come three people, [{among | among} which.ACC (also) John] ‘Three people have come, among which (also) John.’
b. Au venit trei persoane, [dintre care una ieri]. AUX come three people(FEM), [among which.ACC one.FEM yesterday] ‘Three people have come, one of them yesterday.’
French and Romanian VRAs have been described as elliptical relative clauses (Grevisse 1993, Gheorghe 2004 and Gheorghe 2005) on the basis of semantic and syntactic similarities with non-restrictive relative clauses (3).
†Part of this work has been presented in Bucharest at the 11th Conference of the English Depart- ment. Many thanks to A. Abeille,´ D. Arnold, O. Bonami, D. Godard, J-M. Marandin, the audience of the HPSG09 Conference and three anonymous reviewers for helpful discussions or useful sugges- tions.
6 (3) a. Il a ecrit´ de nombreux romans, [dont deux ont et´ e´ publies´ le mois dernier]. ‘He has written many novels, two of which have been published last month.’
b. El a scris multe romane, [dintre care doua˘ au fost publicate luna trecuta].˘ ‘He has written many novels, two of which have been published last month.’
Part of the data used in this work is based on a corpus study. The French data comes from the French Treebank of Paris 7 (Abeille´ et al. 2003). Lacking a similar corpus for Romanian, we collected examples from press texts. The paper is structured as follows. We first focus on the constituency of VRAs. We show that the elliptical analysis of VRAs is not empirically adequate and pro- pose an alternative non-elliptical analysis. Then, we discuss the distributional and functional properties of VRAs before turning to their semantic properties. The analysis is then couched in a constructional version of HPSG (i.e. SBCG, Sag 2007). The analysis relies on the interaction of grammatical constraints introduced by various linguistic objects, as well as on a constructional analysis of VRAs dis- tinguishing several subtypes. For reasons of space, properties of VRAs which are the same in both languages are only illustrated for French. Romanian examples are only used when the prop- erty is absent or less clear in French.
2 Constituency
French and Romanian VRAs are constituted of two parts. The first part (labeled the fronted constituent) is reminiscent of the extracted phrase or of the complementizer of a relative clause. The second part (labeled the body) is composed of either a single phrase or of a sequence of phrases which are not related by grammatical functions. We show that an analysis of VRAs as elliptical relative clauses is not empirically supported and propose an alternative analysis.
2.1 The fronted constituent
The fronted constituent of VRAs contains lexical items which are found in relative clauses. Those items include forms of the WH lexeme lequel in French and care in Romanian, and the form dont in French. While French lequel and Romanian care are found in interrogative clauses too, French dont is only found in relative clauses apart from its use in VRAs. The WH forms always appear in prepositional phrases in VRAs. The preposi- tions heading the fronted constituent all have an abstract partitive meaning similar to that of the preposition among in English. This includes parmi in French and dintre, ˆıntre, and printre in Romanian.
7 In both languages, more complex expressions are also found, such as au {pre- mier | second} rang desquels ‘in the {first | second} row of which’, au sommet desquels ‘at the top of which’, au sein desquels ‘in the middle of which’, au centre desquels ‘in the center of which’ or au nombre desquels ‘in the count of which’ in French, and ˆın randulˆ carora˘ ‘in the row of which’, ˆın mijlocul carora˘ ‘in the middle of which’ or ˆın fruntea carora˘ ‘at the top of which’ in Romanian. Although these expressions almost always compositionally denote spatial relations, they are only found with an abstract partitive meaning in VRAs. Note that some of these expressions additionally suggest a ranking among the subparts. The WH form appearing in the fronted phrase is coreferential with a noun phrase preceding the VRA in the host clause, which we label the licenser (see sec- tion 3.1). This is signalled by morphological agreement on the WH form in French: lesquels (MASC) vs. lesquelles (FEM). Agreement cannot be observed in Roma- nian because both the accusative form care and the genitive plural form carora˘ do not vary in gender. In French, the fronted phrase can also contain the form dont. Apart from its use in VRAs, the form dont appears only in relative clauses where it has been analyzed as a complementizer (Godard 1988). Like the complementizers que and qui which are also found in relative clauses, the complementizer dont only combines with finite clauses (4a). WH relative clauses on the other hand are not always finite. Unlike prepositional WH forms like duquel ‘of which’, the complementizer dont cannot be embedded within a filler phrase (4b), and does not show morphological agreement with an antecedent. Fi- nally, complementizer dont marks relative clauses containing a missing constituent which would be marked by the form de (4c) or any proform coreferential with the antecedent of the relative clause, as long as the proform is embedded under a propositional attitude predicate, such as penser ‘to think’ or etreˆ vrai ‘to be true’ (4d).
(4) a. * Un livre [dont parler] a book [DONT talk.INF] ‘A book which we should talk about’
b. Un homme, [le frere` { * dont | duquel } est venu hier] a man, [the brother { DONT | of.which } AUX come yesterday] ‘A man, the brother of which has come yesterday’
c. Un livre [dont on parle beaucoup] ‘A book which one talks about’
d. Un livre [dont il est vrai qu’il couteˆ cher] ‘A book of which it is true that it is expensive’
VRAs’ dont shares some of its properties with the complementizer dont. For instance, it cannot be embedded within the fronted constituent as well. However, it
8 is unclear whether other selection properties of the complementizer are shared by VRAs’ dont. Since an elliptical analysis of VRAs is not empirically adequate (see section 2.3), we argue that it is not the case. Moreover, while complementizer dont is assumed to have no semantic contribution, such a statement is hard to make for VRAs’ dont since it forces a partitive semantics (5), although the partitive semantics may also be assumed to be constructionally introduced in VRAs. (5) a. Au total, dix livres ont et´ e´ commandes,´ [(*dont) tous pour toi]. ‘In total, ten books have been ordered, all of them for you.’ b. Au total, dix livres ont et´ e´ commandes,´ [(dont) deux pour toi]. ‘In total, ten books have been ordered, two of them for you.’
The left edge of the fronted phrase must coincide with the left edge of the VRA. It cannot be linearized in or after the body (6a) or be preceded by adverbials (6b). (6) a. *Plusieurs personnes, [Jean parmi lesquelles], sont venues. ‘Several people, among which John, have come.’ b. *Plusieurs personnes, [notamment parmi lesquelles Jean], sont venues. ‘Several people, among which notably John, have come.’
2.2 The body
The body of VRAs is constituted of either a single phrase or a sequence of phrases. When the body is a single phrase, it can be either a noun phrase in the citation form (i.e. no prepositional marking in French and Romanian and nominative case in Romanian) or a phrase of any category whose form parallels the corresponding phrase in the host clause. Not every VRA type allows for the two options. In French, WHVRAs do not allow for marked single phrases and dont VRAs disprefer it. Dont VRAs with a single marked constituent are not as bad as their WHVRA counterparts and can be improved with adverbs, such as notamment ‘notably’ (7). (7) Un jeune homme annonce a` diverses personnes sa mort prochaine, [{*parmi lesquelles | % dont} notamment a` un psychiatre qui decide´ de l’aider]. ‘A young man announces his imminent death to several people, {among which | DONT} notably to a psychiatrist who decides to help him.’
In Romanian, dintre is incompatible with preposition or case marking (8a)1, while no such restriction is found with printre and ˆıntre as long as the adverb s¸i ‘also’ precedes them (8b). (8) a. Ion lucreaza˘ cu s¸apte doctori, [dintre care (*cu) doi rus¸i]. John works with seven doctors, [among which with two Russian] ‘John works with seven doctors, two of them are Russian.’
1Adverbs such as mai ales ‘especially’ can improve the acceptability of (8a).
9 b. Ion a oferit flori mai multor fete, [printre care s¸i John AUX offered flowers ADV many.DAT girls, [among which also {Maria | Mariei}]. {Mary.NOM | Mary.DAT}] ‘John has offered flowers to many girls, Mary among them.’
The body of a VRA can also be constituted of a sequence of phrases (i.e. a cluster). Three types need to be distinguished. Clusters of type I mimick the syntax of the host clause. In those clusters, each of the phrases has to be marked like the correponding phrase in the host clause (9a). Clusters of type II contain a noun phrase in the citation form and a predicative phrase expressing a property of that noun phrase (9b). Clusters of type III are only found in VRAs whose licenser is a past participle used to express a functional property assumed by some individuals within a event. For example, un blesse´ ‘an injured person’ is the patient of an event in which someone gets hurt. In clusters of that type, the event relation of the cluster is contributed by the past participle. As a result, adverbs, rather than adjectives, are used to modify that relation (9c).
(9) a. Certains ont parle´ a` mes amis, [dont Marie *(a)` Marc]. ‘Some have spoken with my friends, Mary with Mark.’
b. Je vends dix jeux, [dont la plupart encore dans leur boˆıte]. ‘I sell ten games, the majority of them still in their original box.’
c. L’accident a fait douze blesses,´ [dont cinq grievement].` ‘The accident left twelve injured, five of them critically.’
Fronted phrases show selection properties regarding the type of the cluster they combine with. For instance, in French VRAs with parmi, at least one phrase in the cluster must be a noun phrase. These properties of fronted phrases combined with properties of clusters of type I can result in ungrammaticality (10).
(10) J’ai parle´ a` plusieurs personnes hier, [{dont | *parmi lesquelles} a` Marie de linguistique]. ‘I spoke with several people yesterday, of which with Mary about linguistics.’
2.3 VRAs are not elliptical relative clauses
VRAs are usually referred to as elliptical relative clauses. However, an elliptical analysis of VRAs faces two kinds of problems. Under an elliptical approach, VRAs are assumed to be relative clauses which have the additional property of having some of their syntactic or phonological material removed. An elliptical analysis is of interest if, and only if, (I) one can reconstruct a relative clause from any VRA in a regular fashion and (II) the semantic properties of VRAs are the same as that of relative clauses. We argue that none of these conditions are verified.
10 2.3.1 Arguments against syntactic reconstruction
There are at least three options for the reconstruction of a verbal form in VRAs. For an example like (11), the first option is to reconstruct a verbal form of the same lexeme as the verb of the host clause of the VRA (11b). This is the obvious option when the VRA contains a cluster of type I. The second option is the reconstruction of an existential verb (11c). The third option is the reconstruction of a quotation verb (11d). In some cases, none of these options will work.
(11) a. Plusieurs ont eu un cadeau, [dont Marie un livre]. ‘Several have had a present, of which Mary a book.’
b. *Plusieurs ont eu un cadeau, [dont Marie a eu un livre]. ‘Several have had a present, of which Mary has had a book.’
c. *Plusieurs ont eu un cadeau, [dont est Marie un livre]. ‘Several have had a present, of which is Mary a book.’
d. *Plusieurs ont eu un cadeau, [dont on cite Marie un livre]. ‘Several have had a present, of which one mentions Mary a book.’
Within a syntactic reconstruction approach, the choice of a verbal form is de- pendent on lexical constraints, such as subcategorization properties, which are not correlated with semantic properties. For instance, it is possible to reconstruct the verb figurer ‘to appear’ within a parmi VRA but not in a dont VRA, because figurer can subcategorize a PP marked by parmi but not a PP marked by de. The reverse is true for an expression such as faire partie de ‘to belong to’.
2.3.2 VRAs do not have the same semantic properties as relative clauses Non-restrictive relative clauses behave semantically like independent clauses that contain a proform (Arnold 2004). As a result, their semantic contribution is largely independent from that of their host clause. This is not the case for VRAs, as shown by the contrast in (12). While the sequence of utterances in (12a), which contains a relative clause, is coherent, the sequence in (12b) is contradictory because whales are said both to have and not to have apparent ears. This is so because VRAs are sentential fragments (see section 4.3). Thus, only a syntactic reconstruction which reproduces the content of the host clause is compatible with the semantics of VRAs. However, this kind of reconstruction is most often impossible on syntactic grounds.
(12) a. Non, tu te trompes! Bien que beaucoup de mammiferes,` [dont les balei- nes sont un exemple], aient des oreilles apparentes, les baleines n’en ont pas. ‘No, you’re wrong! While many mammals, of which whales are a exam- ple, do have apparent ears, whales do not have any.’
11 b. Non, tu te trompes! Bien que beaucoup de mammiferes,` [dont les balei- nes], aient des oreilles apparentes, les baleines n’en ont pas. ‘No, you’re wrong! While many mammals, whales among others, do have apparent ears, whales do not have any.’
Another problem faced by the elliptical account is that it predicts that some VRAs should be well-formed, while they are ill-formed for semantic reasons. This is so because the elliptical account assumes that the partitive semantics of VRAs comes from the elided verbal predicate rather than from the fronted phrase. In Ro- manian, the preposition dintre cannot cooccur with a body containing a referential noun phrase such as a definite one. However, a verb form can be reconstructed without difficulty yielding a well-formed relative clause (13).
(13) Au venit mai multe persoane, [dintre care {*Maria | o amintim pe Maria}]. ‘Many people have come, among which {Mary | one mentions Mary}.’
2.4 Non-elliptical alternative Non-elliptical analyses differ from elliptical ones in that they do not link form constraints on clusters (such as those exhibited by clusters of type I) to the presence of a syntactic head in the structure. As a result, they make no prediction on the distribution of clusters. We assume that the body of VRAs has exactly the structure it seems to have at first sight: it has a flat structure and has no syntactic head. As for VRAs as a whole, we argue in favor of an analysis in which the body is the head and the fronted phrase is a functor phrase. The selection properties of VRAs are best attributed to the body. This is so because most of the phrases functioning as the body in VRAs can also function as incidental adjuncts alone with a similar semantics. This is especially the case of those which contain adverbials such as notamment ‘notably’ (14a) or are coordi- nated structures (14b).
(14) a. De nombreuses especes,` [(dont) notamment les oursins], ont souffert de la pollution. ‘Many species, (among which) notably urchins, have suffered from pol- lution.’
b. Plusieurs personnes, [une hier et deux ce matin], se sont plaintes de l’organisation. ‘Several people, one yesterday and two this morning, complained about the organization.’
The selection properties of VRAs are distinct from those of the fronted phrase. Apart from the French form dont whose category is unclear, the fronted constituent is always a prepositional phrase. A preposition like parmi ‘among’ in French in- troduces a semantic relation between two arguments, one of which is typically
12 realized as a complement (e.g. lesquelles in (15)). The other argument (the exter- nal argument) is usually not realized within the preposition phrase itself. Rather it is selected for by the prepositional phrase. In VRAs, the external argument of the fronted phrase is realized within the body (e.g. Jean in (15)). Thus, it is reasonable to assume that the selection properties of the fronted phrase and those of the VRA are distinct and therefore not to analyze the fronted phrase as the head.
(15) plusieurs personnes, [[parmi lesquelles] Jean] several people.FEM.PL [among which.FEM.PL John(MASC.SG)] ‘several people, among which John’
Following Van Eynde 2003, we use the function functor, which replaces the functions specifier, marker and (prehead) adjunct. Functors select a head and are able to mark it, hence modifying its distribution. The fronted phrase is best ana- lyzed as a functor because (I) it must appear before the body (unlike other adjuncts, such as notamment ‘notably’), (II) it displays selection properties, and (III) (at least in certain cases) it modifies the distribution of the phrase it combines with.
3 Function in the host phrase
VRAs are incidental adjuncts. They are linearized within a host phrase which must contain a noun phrase introducing a sum-denoting entity. That noun phrase is labeled the licenser. The syntactic relation between the VRA and its licenser is submitted to both linear order and locality constraints.
3.1 Properties of the licenser The licenser must denote a plural entity whose subparts are accessible (i.e. an entity which can be described as a sum of entities) (Lasersohn 1995). This does not entail that the licenser of a VRA will always have plural morphology (16). In most corpus examples however, the licenser is a plural indefinite.
(16) Un important volume de gaz s’est echapp´ e´ du cratere,` [dont environ 25% de dioxyde de soufre]. ‘A great volume of gaz has been released from the crater, including about 25% of sulfur dioxide.’
The licenser can be a dependent of a head of any category (17a) and assume any function including adjunct (17b).
(17) a. Des representants´ de plusieurs pays,[dont le Bresil],´ y assistent. ‘Representatives of several countries, among which Brasil, attend it.’
b. Je l’ai attendu plusieurs heures, [dont une sous la pluie]. ‘I have been waiting for him for several hours, one of which in the rain.’
13 In some cases, it might be tempting to describe a VRA as having two licensers or more (18). This analysis is only possible when the fronted phrase does not make it explicit which of the noun phrase is truly the licenser of the VRA. We will come back to the analysis of these examples later in section (4.3).
(18) J’ai offert des cadeaux a` plusieurs personnes,[dont un livre a` Marie]. ‘I have offered presents to many people, among which a book to Mary.’
3.2 Ordering and locality constraints
The constraints on the linearization of VRAs need to make reference to the relative linearization of their licenser, as well as to the syntactic structure of their host. VRAs must follow their licenser (19a). If the licenser is a dependent of a clause’s head, the VRA can be linearized anywhere in the clause after the licenser (19b). However, if the licenser is not a dependent of a clause’s head, the VRA needs to follow its licenser directly (19c).
(19) a. ([*Dont Marie]), plusieurs personnes sont venues, ([dont Marie]). ‘Several people have come, among which Mary.’
b. J’ai demande´ a` plusieurs personnes hier, [dont Marie], de m’aider un peu. ‘I asked several people yesterday, among which Mary, to help me a bit.’
c. Des representants´ de plusieurs pays, ([dont le Bresil])´ se sont reunis,´ ([*dont le Bresil]).´ ‘Representatives from several countries, among which Brasil, have met.’
The relation between a VRA and its licenser obeys the right roof constraint which generally applies on rightwards non-local relations such as extraposition or right dislocation.
(20) *Que deux personnes viennent ne m’a pas etonn´ e,´ [dont Marie]. ‘The fact that two people come, among which Mary, has not surprised me.’
Note that if a VRA contains a cluster of type I, the VRA must follow every phrase of the host clause which is paralleled in the cluster.
(21) Plusieurs, ([*dont Marie un livre]), m’ont offert des cadeaux, ([dont Marie un livre]). ‘Several have offered me presents, of which Mary a book.’
VRAs are also at least compatible with ‘comma intonation’. Thus, they are a kind of incidental adjuncts, incidentality being defined as a syntactic property which correlates with both phonological and linearization properties (Bonami and Godard 2003).
14 4 Semantic Properties
VRAs can be associated with two distinct semantics. Although VRAs always intro- duce a subpart of the sum individual denoted by their licenser, the referential prop- erties of the introduced subpart are not always the same. VRAs are non-restrictive modifiers which, unlike most other non-restrictive modifiers, must scope under propositional attitude verbs. This is explained by the fact that VRAs are sentential fragments.
4.1 VRAs have a partitive semantics
VRAs always introduce an entity which must be interpreted as a subpart of the sum individual denoted by their licenser. Sum individuals are not always composed of atomic parts. As a result, a noun phrase containing a mass noun can function as the licenser of a VRA (16). Quantified noun phrases which do not denote sum individuals are not suitable licensers for VRAs (22).
(22) *Tout etudiant´ doit venir, [dont Marie]. ‘Every student must come, among which Mary.’
VRAs cannot introduce any other semantic relation between their licenser and the phrase introducing the subpart. Meronymy and possession, for instance, are ruled out (23).
(23) *Plusieurs personnes sont venues, [dont leur chien]. ‘Several people have come, among them their dog.’
VRAs can have two distinct semantics depending on the referential properties of the introduced subpart. Exemplifying VRAs introduce a subpart which is referential (i.e. It can be identified independently of the fact that it is a subpart of a sum individual) (24a). This is the case of the noun phrases Marie and Jean in (24a). Partitioning VRAs, on the other hand, introduce a subpart which is not referential but can be defined within the sum individual as having some properties which are not shared by other subparts. The property in question can be a property of the entity which is the subpart or a property of the subevent in which the subpart takes part (24b). One of the most striking differences between exemplifying and partitioning VRAs is that only the latter can introduce a list of subparts which are coextensive with the sum individual denoted by the licenser (24b).
(24) a. {*Deux | trois} personnes sont venues, [dont Marie et Jean]. ‘{Two | three} people have come, among which Mary and John.’
b. Trois personnes sont venues, [dont une lundi et deux mardi]. ‘Three people have come, one on Monday and two on Tuesday.’
15 Within a noun phrase coordination, it is sufficient that one of the noun phrases be non referential in order for the partitioning semantics to be available (25).
(25) Prends deux objets, [dont cette bouteille et {un | *ce} couteau]. ‘Take two objects, that bottle and {a | that} knife.’
The semantics of the head of the fronted phrase also plays a role in the se- mantics of the VRA. In French, parmi is always exemplifying, while dont can be exemplifying or partitioning. In Romanian, dintre is always partitioning, while printre and ˆıntre can be both exemplifying or partitioning. These lexical properties can be observed in other uses of the prepositions as well (26).
(26) a. Avem {majoritatea | spionii} printre noi. ‘We have {the majority | spies} among us.’ (partitioning or exemplifying)
b. {majoritatea | *spionii} dintre copii ‘the {majority | spies} of the children’ (partitioning only)
4.2 VRAs have a non-restrictive semantics Restrictive modifiers have an intersective interpretation and therefore introduce an implicit ’contrast set’, which can be accessed by anaphors like the others (Arnold 2004). Non-restrictive modifiers, on the other hand, are not intersective modifiers and introduce no such ’contrast set’. VRAs are non-restrictive modifiers of their licenser as well as of their host. The VRA in (27) does not restrict the set of Mary’s friends to the one including John. Therefore, there is no possible antecedent for the other friends of Mary. Neither does the VRA in (27) restrict the event denoted by the host clause to the event such that Some friends of Mary came and John came as opposed to another event such that Some friends of Mary came but John didn’t.
(27) Certains amis de Marie, [parmi lesquels Jean], sont venus. #Les autres amis de Marie viendront demain. ‘Some friends of Mary’s have come, among which John. The others friends of Mary will come tomorrow.’
Unlike most non-restrictive modifiers, the content of VRAs is part of the as- serted content of the utterance containing them. As a result, VRAs which are lin- earized within a clause which is itself in the scope of a propositional attitude verb must be interpreted in the scope of that verb too. Thus, (28) entails that Peter be- lieves that vervain can heal ulcers but not that it is actually true or even that the speaker believes it.
(28) Pierre croit que certaines plantes, [dont la verveine], soignent les ulceres.` ‘Peter believes that some plants, vervain among them, can heal ulcers.’
16 4.3 VRAs are sentential fragments
VRAs are sentential fragments and describe a subevent of the event denoted by their host clause. This is especially clear in partitioning examples where the subpart is not defined by a property of the subpart itself but by a property of the subevent (24b). However, this is also true of exemplifying examples as can be seen by the ungrammaticality of (29) in which a VRA is embedded within a non-event for which no subevent can be defined.
(29) *Aucun representant´ de ces quatre pays, [dont le Bresil],´ n’a participe´ a` la conference.´ ‘No representative of those four countries, among which Brasil, has attended the conference’
VRAs can appear in declarative, imperative (25) or interrogative clauses. In interrogative clauses, they share their abstracted parameter with the host clause and thus can be used as a hint. In (30), if one can answer the question about France, then one can answer the question about countries of the OECD too. The hint is that knowledge about France is sufficient to answer the question.
(30) En quelle annee,´ plusieurs pays de l’OCDE, [dont la France], ont-ils signe´ ce traite?´ ‘In which year, several countries of the OECD, among which France, did sign this treaty?’
The fact that VRAs are sentential fragments allows one to account for cases where VRAs seem to have several licensers as cases where VRAs have only one licenser but also introduce a sum-subpart relation because the event they introduce is a subevent of the one introduced by the host. In that kind of implied sub-subpart relation, the corresponding element in the host does not need to denote a sum- individual at all (31).
(31) Paul a offert un cadeau a` plusieurs personnes, [dont un livre *(a` Marie)]. ‘Paul has offered a present to several people, among which a book (to Mary).’
5 HPSG Grammar Fragment
The properties of VRAs are best analyzed as resulting from the interaction of var- ious syntactic and semantic constraints applying on distinct linguistic objects. An HPSG grammar fragment accounting for the properties of VRAs minimally requires (I) a theory of fragments, (II) a theory of clusters, and (III) a theory of locality of selection. Several aspects of the grammar introduced here are constructional in na- ture. The analysis is couched in a constructional version of HPSG, namely SBCG (Sag 2007).
17 5.1 A theory of fragments Fragments are expressions which convey a semantic content which is not given by their form alone. Rather, the semantic content conveyed by a fragment is a function of (I) the type of the fragment, (II) the semantic content of the constituent(s) in the fragment, and (III) contextual information which can be linguistic in nature or not (Fernandez´ et al. 2007). A sentential fragment such as the short question when in (32a) is interpreted as having the same semantic content as the clause when she will come in (32b). This semantic content comes in part from the type of the fragment (short questions have the same content type as interrogative clauses, i.e. a propositional abstract), the semantic content of the constituent(s) in the fragment (when provides the para- mater for the propositional abstract), and contextual information (Mary will come functions as the antecedent of the fragment providing the proposition used to build the propositional abstract).
(32) a. Mary will come but nobody knows [when].
b. Mary will come but nobody knows [when she will come].
More generally, fragments are reminiscent of description anaphora, as opposed to instance anaphora. In instance anaphora, what is shared between an anaphoric expression and its antecedent(s) are indices. In description anaphora, what is shared is some aspects of the description of the antecedent(s) which apply to a new entity with a different index. This is the case with one anaphora in English (33) (Arnold and Borsley 2008).
(33) Here is a small red mugs with flowers and here is a bigger one.
In all of these cases, computing the semantics of the fragment can be achieved by expressing constraints between four semantic representations: two complete ones (the meaning of the antecedent and the meaning of the fragment) and two partial ones (the content which is anaphorically shared between the antecedent and the fragment, and the content which is given by the constituents in the fragment). We use MRS (Copestake et al. (2005)) to represent incomplete semantic rep- resentations as the underspecification of a complete semantic representation. In MRS, meaning is represented as bags of elementary predications. Connection be- tween these elementary predications is achieved through index-sharing and label- sharing. MRS representations are suitable to express partial meaning. For an example such as (34), we want to obtain the four bags of elementary predications in (35). The bags are related by two meta-constraints (bag unification and bag intersection) shown at the bottom of figure (35).
(34) Plusieurs personnes sont venues, [dont Marie hier]. ‘Several people have come, of which Mary yesterday.’
18 h i (35) HOOK IND 1 *" # come-rel " # " #+ ANTECEDENT A message-rel several-rel person-rel 0 RELS A ,IND 1 , , IND 1 IND 2 IND 2 ARG1 2 h i HOOK IND 5 *" # come-rel + VARIABLE B message-rel RELS B0 ,IND 5 5 IND ARG1 ind h i HOOK IND 3 *" # " # " #+ FRAGMENT C message-rel name-rel yesterday-rel C0 RELS , , IND 3 IND 4 IND 3 h i HOOK IND 3 *" # come-rel " # " #+ CONTENT D message-rel name-rel yesterday-rel 0 RELS D ,IND 3 , , IND 3 IND 4 IND 3 ARG1 4
• A0 ∩ D0 = B0 • B0 ∪ C0 = D0 Bag unification: Let A, B and C be bags of elementary predications. A is the unification of B and C iff each element R in A, Q in B and S in C appears in either: (I) a pair
h i (36) SEM RELS 1 h i C-SEM RELS 2 MTR h i ANTECEDENT RELS 3 FRAGMENT h i 4 VARIABLE RELS * + h i h i DTRS SEM RELS 5 , SEM RELS 6 , ...
19 • 3 ∩ 1 = 4 • 4 ∪ union 5 , 6 , 2 = 1
Additional constraints on the semantics of fragments come from the syntax- semantics interface. Constraints can be imposed on by clusters or by the VRA- construction. Incomplete meaning for the fragment (i.e. bags of elementary predi- cations that cannot be mapped into a connex graph) should be excluded as well.
5.2 A theory of clusters Clusters are sequences of phrases which are not related by functions but neverthe- less display constituent properties. Clusters are fragments but the reverse is not always true. The fragment when in (32a), for instance, is composed of a single phrase. Clusters do not always have the same distribution as a headed phrase with a similar semantic content. For French, it has been noticed that some items or constructions are compatible with clusters which are sentential fragments but not with clauses. This is the case of the conjunction ainsi que ‘as well as’, which may combine with a cluster but not with a finite clause (Abeille´ and Godard 1996, Mouret 2006).
(37) Paul offrira un livre a` Marie, ainsi qu’(*il offrira) un CD a` Paul. ‘Paul will offer a book to Mary, as well as (he will offer) a CD to Paul.’
Some cluster types are submitted to form constraints which instantiate lexi- cal knowledge about subcategorization properties of lexical items which are not realized within the cluster. That property leads Ginzburg and Sag 2000 to analyze sentential fragments as single daughters of a phrase with full clausal properties including the syntactic category VERB. This is problematic for VRAs because sentential fragments and clauses do not have the same distribution. We use the definition given by Mouret 2006 (38). The feature CLUSTER allows one to select or subcategorize for a sign which is a cluster.
(38) cluster-cxt ⇒ phrasal-cxt & " # cluster MTR SYN CAT CLUSTER 1 DTRS 1 list sign
Several constraints can be expressed on clusters, such as the fact that they must contain a noun phrase in the citation form or that their form instantiates knowledge about the subcategorization of a word which is not present in the structure (39).
20 (39) NP-cluster-cxt ⇒ cluster-cxt & " # noun CAT CASE nominative MTR SYN CAT CLUSTER containsSYN MRKG det
We will assume that VRAs with only one phrase in the body have a unary cluster body because it allows one to express generalizations in a simplified way.
5.3 A theory of locality of selection
The selection properties of VRAs and the selection properties of their fronted phrase are submitted to the same kind of locality constraints. When a VRA modifies a clause, the licenser of the VRA must be a direct dependent of the clause’s head. When a fronted phrase modifies a cluster, the phrase introducing a subpart of the plural entity denoted by the antecedent must be a direct daughter of the cluster. In order to state these locality constraints on selection, we introduce a set- valued feature ANCHORS which contains indices of semantic entities which are ac- cessible to adjunct selection. This feature was originally proposed by Kiss 2005 to analyze extraposed relative clauses. The two following constraints on the propaga- tion of anchors are introduced. Constraint (40) ensures that only direct dependents of a clause’s head are accessible via the anchor set on the level of the clause. In (40) and (41), c stands for the predicate contains which is a relation expressing that a list contains some element. In both constraints, the element contained in the antecedent of the constraint in universally quantified.
(40) word ⇒ anchor anchor MTR SEM ANCHORS cLABEL 1 DEPS cSEM HOOK LABEL 1 IND 2 IND 2
Constraint (41) ensures that only direct daughters of a cluster are accessible via the anchor set on the level of the clause. Another virtue of the anchor constraints is that they restrict semantic selection to material which is literally introduced in the cluster. Thus, reconstructed semantic relations are not available for semantic selection.
(41) cluster-cxt ⇒ anchor anchor MTR SEM ANCHORS cLABEL 1 DTRS cSEM HOOK LABEL 1 IND 2 IND 2
21 5.4 A theory of VRAs
We analyze VRA constructs as subtypes of the head-functor construct as defined in Van Eynde 2003 and Sag 2007. Functors have two main properties. They select the head sign and contribute a mark to the construct (42).
hd-func-cxt ⇒ hd-cxt & " # (42) VAL 1 MTR SYN 2 MRKG * h i + CAT SELECT 3 DTRS SYN , 3 MRKG 2 h i HD-DTR 3 SYN VAL 1
The VRA construct itself is defined in (43). The head of the construct is a sentential fragment. It is selected for by the fronted phrase which contributes a sum-subpart relation which is characteristic for the construction. The sum-subpart relation is assumed to have a partitioning subtype and an exemplifying subtype. The construction itself contributes a second subpart relation which links the event denoted by the host clause to that introduced by the sentential fragment. The con- struction also selects a nominal licenser. Notice the use of the set ANCHORS to express the locality of selection of the fronted phrase and of the VRA construct.
(43) VRA-cxt ⇒ phrasal-cxt & hd-fun-cxt & " # h i CAT 1SELECT SEM ANCHORS contains( IND 2 ) SYN VAL 3 h i MTR SEM HOOK IND 7 * sum-subpart-rel + C-SEM RELS SUBPART 7 event SUM 6 event CAT 1 SYN VAL 3hi MRKG none h i 7 HOOK IND SEM B HD-DTR 4 h i ANCHORS contains( IND 8 ) h i ANTECEDENT A HOOK IND 6 FRAGMENT VARIABLE C GAP {} h i HOOK LTOP 5 * sum-subpart-rel + DTRS SEM * +, 4 LABEL 5 RELS SUBPART 8 SUM 2
22 WH-VRA-construct is a subtype of VRA-construct. The fronted phrase of a VRA-construct contains a WH form which is coreferential with the nominal an- tecedent of the VRA-construct (44).
(44) WH-VRA-cxt ⇒ VRA-cxt & " # h i MTR SYN CAT SELECT SEM ANCHORS contains IND 1 * ( ) + h i DTRS WH HOOK IND 1 , sign
The prepositions functioning as the head of the fronted phrase of a WH-VRA- construct have the following lexical properties (45). They have an argument struc- ture containing two elements, one of which is an internal argument realized as the complement of the preposition. The other argument is the external argument of the preposition and is not realized as a dependent of the preposition. Rather the prepo- sition selects for a phrase which contains an anchor coindexed with its external argument. Finally the preposition has to introduce a sum-subpart relation between its two arguments: the internal argument denoting a sum and the external a subpart of that sum. Prepositions may differ regarding the exact type of sum-subpart rela- tion. Some introduce an exemplifying relation (Fr. parmi), some a partitioning one (Ro. dintre), some an underspecified one (Ro. printre and ˆıntre).
(45) PARMI-word ⇒ word & cat noun CAT noun * syn VAL hi SYN VAL hi + ARG-ST 1 MRKG det , 2 MRKG det h i h i 3 4 SEM HOOK IND SEM HOOK IND preposition 1 XARG CAT " # SYN h i SELECT SEM ANCHORS contains( IND 3 ) D E VAL 2 h i HOOK LTOP 5 exemplifying-sum-subpart-rel SEM LABEL 5 RELS SUBPART 3 SUM 4
We assume that dont is a marker which has no argument structure but intro- duces an underspecified sum-subpart relation, although it could also be introduced constructionally. Dont selects for a phrase which (I) contains an anchor for the sub- part argument of its sum-subpart relation and (II) selects for a phrase containing an anchor for the sum argument of its sum-subpart relation.
23 (46) DONT-word ⇒ word & h i SYN | CAT | SELECT | SEM | ANCHORS contains IND 4 CAT SELECT h i SEM | ANCHORS contains IND 3 SYN VAL hi MRKG dont h i 5 HOOK LTOP sum-subpart-rel SEM LABEL 5 RELS SUBPART 3 SUM 4
With the entry in (46) for dont, the only thing that must be stated in the subtype DONT-VRA-cxt is that the mother of the construct has a feature MRKG whose value is dont (47).
" # (47) DONT-VRA-cxt ⇒ VRA-cxt & h i MTR SYN MRKG dont
Subtypes of VRAs must include constraints on the syntax of the head. For example, French WHVRAs must constrain one of the phrases in the cluster to be a noun phrase and link the index of that noun phrase to the subpart feature of the sum-subpart relation expressed by the fronted phrase.
6 Conclusion
VRAs are incidental adjuncts. Like other incidental adjuncts, they are licensed within a phrase, as long as their selection properties are satisfied. Adjacency be- tween VRAs and their sum-denoting licenser is not always required, but locality constraints can be formulated nonetheless. VRAs are not elliptical relative clauses. They are sentential fragments which function as adjuncts with two different kinds of partitive semantics (exemplifying vs. partitioning). The partitive semantics is enforced by the presence of a fronted phrase which displays selection properties regarding the phrase it combines with to form the VRA. The body of a VRA can contain a cluster of phrases. Clusters are submitted to internal form constraints which cannot be explained by reconstruction because their distributional proper- ties are distinct from those of their alleged source. Instead, the instanciation of subcategorization knowledge without the realization of a head must be allowed under description anaphora.
References
Abeille,´ A., Clement,´ L. and Toussenel, F. 2003. Building a treebank for French. In A. Abeille´ (ed.), Treebanks, Dordrecht: Kluwer.
24 Abeille,´ A. and Godard, D. 1996. La complementation´ des auxiliaires en franc¸ais. Langages 122, 32–61.
Arnold, D. 2004. Non-Restrictive Relative Clauses in Construction Based HPSG. In St. Muller¨ (ed.), Proceedings of the HPSG04 Conference, pages 27–47, Stan- ford: CSLI Publications.
Arnold, D. and Borsley, R. D. 2008. Non restrictive relative clauses, ellipsis and anaphora. In St. Muller¨ (ed.), Proceedings of the HPSG08 Conference, pages 5–25, Stanford: CSLI Publications.
Bonami, O. and Godard, D. 2003. Incidental adjuncts: an overlooked type of ad- junction. Presentation given at the 10th International Conference on HPSG. East Lansing: Michigan State University.
Copestake, A., Flickinger, D., Pollard, C. and Sag, I. A. 2005. Minimal Recur- sion Semantics: An Introduction. Research on Language and Computation 3(4), 281–332.
Fernandez,´ R., Ginzburg, J. and Lappin, Sh. 2007. Classifying Non-Sentential Ut- terances in Dialogue: A Machine Learning Approach. Computational Linguis- tics 33(3), 397–427.
Gheorghe, M. 2004. Propozit¸ia relativa˘. Bucharest: Editura Paralela 45.
Gheorghe, M. 2005. Construct¸ii cu propozit¸ii relative. In Gramatica limbii romaneˆ , volume II, Bucharest: Editura Academiei Romane.ˆ
Ginzburg, J. and Sag, I. A. 2000. Interrogative Investigations: the form, meaning and use of English interrogatives. Stanford: CSLI Publications.
Godard, D. 1988. Syntaxe des relatives en franc¸ais. Paris: CNRS Editions.
Grevisse, M. 1993. Le Bon Usage. Boeck Duculot, 13. edition.
Kiss, T. 2005. Semantic constraints on relative clause extraposition. Natural Lan- guage and Linguistic Theory 23(2), 281–334.
Lasersohn, P. 1995. Plurality, Conjunction and Events. Dordrecht: Kluwer.
Mouret, F. 2006. A Phrase Structure Approach to Argument Cluster Coordination. In St. Muller¨ (ed.), Proceedings of the HPSG06 Conference, pages 247–267, Stanford: CSLI Publications.
Sag, I. A. 2007. Sign-Based Construction Grammar: An informal synopsis. Manuscript. Stanford University.
Van Eynde, F. 2003. On the Notion ‘Determiner’. In St. Muller¨ (ed.), Proceedings of the HPSG03 Conference, pages 391–396, Stanford: CSLI Publications.
25 Inflectional periphrasis in Persian
Olivier Bonami Université Paris-Sorbonne and Laboratoire de Linguistique Formelle
Pollet Samvelian Université Paris 3 Sorbonne nouvelle and Mondes iranien et indien
Proceedings of the 16th International Conference on Head-Driven Phrase Structure Grammar
Georg-August-Universität Göttingen, Germany Stefan Müller (Editor) 2009 CSLI Publications pages 26–46 http://csli-publications.stanford.edu/HPSG/2009
Bonami, Olivier, & Samvelian, Pollet. 2009. Inflectional periphrasis in Persian. In Müller, Stefan (Ed.), Proceedings of the 16th International Conference on Head- Driven Phrase Structure Grammar, Georg-August-Universität Göttingen, Germany, 26–46. Stanford, CA: CSLI Publications. Abstract Modern Persian conjugation makes use of five periphrastic constructions. We contrast the properties of these five constructions and argue that they call for different analyses. We propose contrasting analyses relying on the combination of an HPSG approach to feature geometry and syntactic combi- nation, and an approach to paradigm organization and morphological expo- nence based on Paradigm Function Morphology. This combination of ana- lytic tools allows us to treat the whole array of periphrastic constructions as lexical in origin—no phrasal construction or multi-word lexical entry of any kind is required.
Grammars of Persian (e.g. Lazard et al., 2006) distinguish five conjugational periphrastic construction types. The passive construction is based on an inflected form of ˇsodan ‘become’ preceded by a perfect participle (1). So-called ‘perfect’ forms are based on an inflected form of budan ‘be’ preceded by a perfect participle (2). The auxiliary is a full word (2a) or a clitic, (2b) depending on tense and mood, and giving rise to different syntactic and semantic properties. The future is formed with a special present tense form of xastanˆ ‘want’ followed by a bare stem (3). Finally, the progressive is based on an inflected form of daˇstanˆ ‘have’ followed by a finite form (4).1
(1) In tˆablo foruxte mi-ˇsav-ad. this painting sold UNBD-become.S1-3SG ‘This painting is sold.’ (2) a. Maryam in tˆablo=rˆa foruxte bud. Maryam this painting=DDO sold be.S2.3SG ‘Maryam had sold this painting.’ b. Maryam in tˆablo=rˆa foruxe=ast. Maryam this painting=DDO sold=be.PRS.3SG ‘Maryam has sold this painting.’ (3) Maryam in tˆablo=rˆa xˆah-ad foruxt. Maryam this painting=DDO want.S1-3SG sell.S2 ‘Maryam will sell the painting’
†Aspects of this work have been presented at the D´ecembrettes 6 International Morphology Con- ference (December, 2008), at the HPSG Seminar at U. Paris Diderot (March, 2009), at a Morphology Meeting in Surrey (April, 2009), and at the HPSG 2009 Conference (G¨ottingen, July 2009). We thank for their comments and suggestions the audiences at these events, and in particular Anne Abeill´e, Gilles Boy´e, Dunstan Brown, Patricia Cabredo Hofherr, Greville Corbett, Berthold Crysmann, Ger- ald Gazdar, Stefan M¨uller, Ivan A. Sag, Gregory Stump, Jesse Tseng, and Gert Webelhuth. This work was supported by a grant from Agence Nationale de la Recherche and Deutsche Forschungsgemein- schaft to the Franco-German project ‘PER-GRAM: Theory and Implementation of a Head-driven Phrase Structure Grammar for Persian’. 1The glosses use the following abbreviations. BD: bounded aspect; DDO: definite direct object; EZ: Ezafe; NEG: negation; PAF: pronominal affix; PRF: perfect; PRS: present; PST: past; S1: first stem (a.k.a. the present stem); S2: second stem (a.k.a. the past stem); SBJV: subjunctive; UNBD: unbounded aspect.
27 (4) Maryam dˆar-ad in tˆablo=rˆa mi-foruˇs-ad. Maryma have.PRS-3SG this painting=DDO UNBD-sell.S1-3SG ‘Maryam is selling the painting.’
The differing properties of these five types of periphrasis stem from different origins as finite, infinitival or participial complements, and different degrees of grammaticalization, going from the quasi-analytic passive to the recently morphol- ogized present perfect, through truly periphrastic forms that need to be integrated into inflectional paradigms despite being multi-word expressions. We assume that the different properties call for different analyses. We propose five contrasting analyses relying on the combination of an HPSG approach to feature geometry and syntactic combination, and an approach to paradigm organization and morpholog- ical exponence based on Paradigm Function Morphology (PFM; Stump, 2001). Interestingly, this combination of analytic tools allows us to treat the whole ar- ray of periphrastic constructions as lexical in origin—no phrasal construction or multi-word lexical entry of any kind is required.
1 Synthetic conjugation in HPSG/PFM
Before we address the analysis of periphrastic forms, we start with an account of synthetic conjugation. (5) lists the synthetic subparadigms of the lexeme xaridan ‘buy’, using the positive 2SG form as an illustration.
(5) a. Finite forms: i. Simple present: mi-xar-i ii. Simple bounded past: xarid-i iii. Simple unbounded past: mi-xarid-i iv. Simple subjunctive: be-xar-i v. Imperative: be-xar b. Nonfinite forms: i. Infinitive: xarid-an ii. Present participle: xar-ande iii. Perfect participle: xarid-e iv. Gerund: xar-anˆ
Persian verbs exhibit a morphomic stem alternation (here xar vs. xarid). Nei- ther stem is predictable from the other in general, and both stems are used in a combination of contexts which do not form a natural class. Affixal exponents real- ize unbounded aspect in the indicative (mi-), irrealis mood (be-), negation (na- or ne-, not illustrated here), type of nonfinite form (-e vs. -ande vs. -an vs. -anˆ ), and subject agreement for finite forms. Within Paradigm Function Morphology, this rather simple position class system can be accounted for using the series of rule
28 III II I IV V na- mi- stem-selection -e -am ne- -ande -i/∅ be- an -ad/∅ -im -id -and
Table 1: Rule blocks for Persian synthetic conjugation blocks outlined in table 1. Remember that in PFM, realization rules are organized in successive blocks. When attempting to realize a given set of morphosyntactic feature, the most specific applicable rule within the block is chosen. (6) are sam- ple rules from block V, written in an attribute-value matrix format.2: while (6a) asks that finite verbs with a 2SG subject take the suffix -i, the more specific (6b) indicates that the suffix is dropped in the imperative.
PHON X PER 2 PHON X⊕i (6) a. , σ : −→ (block V) "LID Y # "NB sg# "LID Y #
PER 2 PHON X PHON X b. , σ :NB sg −→ (block V) "LID Y # "LID Y # MOOD imper Since the integration of HPSG and PFM will be essential to our account of periphrastic conjugation, it is important that we specify how we intend to do it. The task is not trivial, because of PFM’s reliance on comparisons of feature structure descriptions, which can not easily be formulated in existing description languages for HPSG grammars. Rather than attempting a direct integration, we propose to use a PFM grammar to further constrain the class of signs satisfying an HPSG theory. Specifically, we rely on a slight reorganization of the feature geometry for head values as in (7), where MORSYN groups features that get realized in inflection and LID assigns a specific index to each lexeme (Spencer, 2005; Sag, 2007). We then define a version of PFM that is exactly like that of (Stump, 2001) except for the fact that typed feature structures are used to model morphosyntactic feature bundles instead of category structures a` la (Gazdar et al., 1985). The meta-constraint in (8) then links the two grammars.
2Two different conventions are currently used to write PFM rules, defined respectively in (Stump, 2001) and (Ackerman and Stump, 2004). The AVM format we use here is meant to ease the integra- tion with HPSG, although the change is little more than syntactic sugar.
29 head (7) head→LID lexemic-index MORSYN morsyn (8) Morphology-syntax interface (preliminary version) PHON 1
A sign of type word meeting the description LID 3 is HEAD MORSYN 4 " # well-formed only if the PFM grammar licenses phonology 1 as a realiza- tion of the features 4 for the lexeme 3 .
2 The passive
The passive in Persian is a typical complex predicate construction, whose prop- erties are parallel to those of copula-predicative complement constructions. The auxiliary ˇsodan is clearly the head: all inflectional information, e.g. negation (9), is realized on the auxiliary. The participle-auxiliary sequence is syntactically flexi- ble: adverbs may intervene (10), the auxiliary may be scrambled over the participle (11), and long-distance fronting of the participle is possible (12).
(9) In tˆablo foruxte ne-mi-ˇsav-ad. this painting sold NEG-UNBD-become.S1-3.SG ‘This painting is not sold.’ (10) In tˆablo foruxte hatman ˇsod. this painting sold certainly become.S2 ‘This painting was certainly sold.’ (11) In tˆablo ˇsod robude va foruxte. this painting become.S2 stolen and sold ‘It is this painting which was stolen and sold.’ (12) Foruxte fekr mi-kon-am [ agar in tˆablo sold thought UNBD-do.S1-1SG if this painting be-ˇsav-ad, mi-tavˆan-im bˆa pul-aˇs yek SBJV-become.S1-3SG UNBD-can.S1-1PL with money-PAF.3SG a mˆaˇsin be-xar-im]. car SBJV-buy.S1-1PL ‘I think that if this painting is sold, we can buy a car with the money.’
To account for this we rely on an argument composition analysis in the spirit of (Hinrichs and Nakazawa, 1994) and subsequent work. Specifically we propose the lexical entry in (13) for the auxiliary lexeme ˇsodan, giving rise to analyses such as that in Figure 1. Under our analysis there is no passive participle, and subject demotion is effected directly in the auxiliary’s entry. This is appropriate
30 S H 1 NP 2 PP V[perf-part] V[prs,3sg] LID foruxtan LID ˇsodan-aux 3 LEX + SUBJ h 1 i in tˆablo be Maryam SUBJ hNPi V[perf-part] COMPS h 1 NP, 2 PPi LEX + COMPS 3 , 2 SUBJ hNPi * + 1 2 COMPS h , i foruxte miˇsavad
Figure 1: Analysis of a passive sentence because (i) perfect participles are always active except in the periphrastic pas- sive constructions—participial clauses with transitive head verbs take direct ob- jects (14), and (ii) for semantic reasons there is no hope of using the same lexical entry for the auxiliary ˇsodan and the full verb ˇsodan (contrary to what happens in languages where the passive auxiliary coincides with the copula). Moreover, we assume a flat structure, wich allows for an easy account of the free reordering of the participle, auxiliary and valents. The specification [VC −] on the participle inhibits the formation of a verbal complex—see below for a contrasting analysis of perfect periphrases.
HEAD LID ˇsodan-aux CONT h2 i FORM part PERFECT + (13) + POL ARG-ST 1 ,CONT 2 ⊕ L * + ARG-ST hNP, 1 i⊕ L LEX + VC − (14) Maryam tˆablo=rˆa xarid-e va be Omid dˆad. Maryam painting=DDO buy.S2-PRP and to Omid give.S2 ‘Having bought the painting, Maryam gave it to Omid.’
Notice that under our analysis voice is not an inflectional category in Persian: the active-passive opposition is dealt with entirely within syntax.
31 3 Two sets of forms based on budan
There are five different subparadigms based on budan, illustrated here in (15). These contrast in two independent ways.
(15) a. Complex present: xaride=i b. Complex bounded past: xaride bud-i c. Complex unbounded past: mixaride=i d. Complex subjunctive: xaride baˇs-iˆ e. Compex perfect: xaride bude=i
3.1 Morphologized vs. truly periphrastic forms In the complex present and the complex unbounded past, the perfect participle combines with the present clitic form of the auxiliary, which is homophonous with the exponent of subject agreement except for 3SG (there is also a nonclitic form of present budan, but it may not be used in this construction). In the complex bounded past and complex subjunctive, the perfect participle combines respectively with the bounded past and subjunctive forms of the auxiliary. Finally the complex perfect cumulates two forms of the auxiliary: the participle bude and the present form clitic (here =i). There is strong evidence that the forms historically based on the clitic auxiliary have undergone morphologization in contemporary Persian. First, the sequence cannot be interrupted in any way; in particular, adverbs are excluded (16), as is participle fronting (17). Second, the distribution of the unbounded aspect marker mi- is otherwise unexplainable: it is the full construction, not the participle, that is unbounded. Finally, colloquial Persian allows a form of vowel reduction in the 3SG that is peculiar to these forms (18a): comparable contructions where the clitic auxiliary combines with an adjective do not give rise to the same pattern (18b).3
(16) *Rafte hatman=ast. left certainly=be.S1.3SG ‘(S)he has certainly left.’ (17) *Ne-mi-rafte sˆalhˆa Maryam be madrase=ast. NEG-UNBD-gone years Maryam to school=be.S1.3SG ‘For years, Maryam didn’t go to school’ (18) a. mord"e=ast → mord"e: died=be.S1.3SG ‘(S)he has died.’
3The only piece of evidence pointing in the other direction is the possibility for the auxiliary to have wide scope over a coordination of participles. However the existence of sublexical coordination in numerous languages calls into question whether this is a strong argument against a morphological analysis. We leave this issue for future research.
32 b. mord"e=ast → mord"ast corpse=be.S1.3SG ‘It is a corpse.’
Compare now the situation of forms that are based on a nonclitic auxiliary. The participle-auxiliary combination is more constrained than it is in the passive; in particular, neither adverbs (19) nor pronominal affixes (20) can occur between the two verb forms, and negation must be realized on the participle (21). In addition, scrambling is excluded (22). However, the combination is not lexical, since the participle can be extracted (23).
(19) * Maryam dide hatman bud-aˇs Maryam seen certainly be.S2-PAF.3SG (20) a. Maryam dide budaˇs. Maryam seen be.S2-PAF.3SG ‘Maryam had seen him.’ b. * Maryam dide-aˇs bud. Maryam seen-PAF.3SG be.S2 (21) Maryam Omid-rˆa na-dide bud. Maryam Omid-DDO NEG-seen be.S2 ‘Maryam hadn’t seen Omid.’ (22) * Maryam Omid-rˆa bud dide. Maryam Omid-DDO be.S2 seen (23) Foruxte fekr ne-mi-kon-am [ bˆaˇs-ad in sold thought NEG-UNBD-do.S1-1SG be.SBJV-3SG this tˆablo=rˆa ]. painting=DDO ‘I don’t think that s/he has sold this painting.’
3.2 Morphosyntactic import The use of a form based on budan may realize two distinct morphosyntactic fea- tures. The complex bounded past (24) and complex subjunctive (25) express re- spectively the past perfect and the subjunctive perfect. The complex unbounded past however does not express perfectivity at all. Rather, it has an evidential value (Windfuhr, 1982; Lazard, 1985; Jahani, 2000). Whereas the simple bounded past is used when the speaker has direct evidence for what she is asserting, the complex bounded past is used in contexts where the evidence is only indirect, as in (26).
(24) Qabl az inke Omid be-res-ad, Maryam birun rafte bud. before from that Omid SBJV-arrive.S1-3SG Maryam out gone be.S2 ‘Maryam had left (before Omid arrived).’
33 (25) Fekr mi-kon-am Maryam mariz bude baˇs-ad. thought UNBD-do.S1-1SG Maryam sick been be.SBJV-3SG ‘I think Maryam has been sick.’ (26) (Banˆabar gofte-ye Omid) Maryam dar sˆal-e 1950 in xˆane-rˆa According to-EZ Omid Maryam in year-EZ 1950 this house-DDO mi-sˆaxte=ast. UNBD-built=be.S1.3SG ‘According to Omid, Maryam was building this house in 1950.’
The complex present is ambiguous between a perfect and an evidential value: it can be interpreted either as a present perfect (27a) or as a bounded past with in- direct evidentiality (27b). Finally, the complex perfect expresses both perfectivity and indirect evidentiality: it is the indirect evidential equivalent of the complex bounded past (28). Note that this corresponds transparently to the fact the the com- plex perfect includes two realizations of the copula.
(27) a. Maryam tˆaze reside=ast. Maryam new arrived=be.S1.3SG ‘Maryam has just arrived.’ b. (Banˆabar gofte-ye Omid) Maryam in xˆane-rˆa dar sˆal-e 1950 According to-EZ Omid) Maryam this house-DDO in year-EZ 1950 xaride=ast. bought=be.S1.3SG ‘According to Omid, Maryam bought this house in 1950.’ (28) (Az qarˆar), qabl az inke Omid be-res-ad, Maryam birun apparently before from that Omid SBJV-arrive.S1-3SG, Maryam out rafte bude=ast. gone been=be.S1.3SG ‘Apparently, Maryam had left before Omid arrived.’
As can be seen in Table 3.2, if the present perfect is ignored, morphosyntactic properties align nicely with morphologized vs. syntactic combinations: the mor- phologized forms are used for indirect evidentiality, as stated by rules (29); while the truly periphrastic forms are used to express the perfect. The fact that the present perfect is unexpectedly synthetic calls for an paradigmatic analysis: this seems to be a standard case of syncretism, where the exponents used to realize a certain feature set (here indirect bounded past) are reused in some unrelated part of the paradigm. Specifically one should assume a rule of referral along the lines of (30). The rule states that any present perfect form of a lexeme Y will be identical to the indirect bounded past form of Y with the same specifications for all features except tense, aspect and evidentiality (here, the relevant remaining features are person, number and polarity).4
4This is a portmanteau rule of referral covering blocks I to V, thus bypassing completely synthetic exponence. σ \ τ is the description that is identical to σ except where the features mentioned in
34 PAST PRESENT DIR. EV. IND. EV. SBJV bounded complex BD *** past present simple simple unbd cpl. unbd. sbjv UNBD present past past complex complex complex complex PRF present bnd. past perfect sbjv
Table 2: Morphosyntactic features expressed by Persian subparadigms
PHON X PHON X⊕e (29) a. , σ : EVID indir −→ (block IV) "LID Y # "LID Y # h i EVID indir PHON X PHON X⊕ast b. , σ :PER 3 −→ (block V) "LID Y # "LID Y # NB sg PHON X TNS prst σ : (30) Y , −→ "LID # "PRF + # TNS pst PHON X ASP bnd σ PHON refer Y , \ , I-V "LID # PRF − (blocks I-V) EVID ind LID Y 4 Analyzing the perfect periphrases
We construct the analysis of perfect periphrases in two steps. First we present a syntactic analysis of perfect forms based on argument composition, and show what is unsatisfactory with such an approach. Next we present a way of arriving at the same syntactic analysis by inflectional means. Finally we discuss alternatives and potential problems.
τ differ from those in σ. The function refer takes as arguments an indexed phonological form, a morphosyntactic specification and a rule block sequence, and outputs the result of applying to this indexed phonological form and this morphosyntactic specification the restriction of the PFM grammar to these rule blocks. The motivation for deriving the present perfect from the indirect bounded past rather than the other way around is the economy of paradigms: this allows us to state the rules of exponence realizing suffixes -e and -ast in a natural way, as applying to all and only evidential forms. Notice that the orientation of the rule of referral might not correspond to the directionality of the diachronic morphologization process.
35 4.1 A failed analysis based on argument composition As a first step, we present an analysis that is a variation of the analysis presented above for the passive. (31) is a candidate entry for the present form of the auxiliary bud. This states that the auxiliary is a past perfect form which takes a perfect participle complement and inherits the participle’s arguments. Because the past participle is marked as [VC +], the auxiliary and participle form a verbal complex, as indicated in figure 2 and thus can not be seperated by elements that are not allowed to occur inside a verbal complex. Rigid word order is a consequence of the LP rule in (32). In addition, since the participle is an argument of the auxiliary, this analysis will allow for the extraction of the participle within any HPSG approach to extraction.
PHON bud
LID budan-aux tns pst PRF + HEAD MORSYN PER 3 AGR "NB sg# POL + (31) verb FORM part HEAD + PRF ARG-ST 1 , POL + ⊕ L * + LEX + + VC ARG-ST h 1 i⊕ L verb FORM part (32) HEAD ≺ [ ] PRF + + VC While this analysis is appropriate as far as syntax is concerned, its integration with the analysis of synthetic conjugation is problematic. First, the perfect auxil- iary must be stipulated to be defective for all nonperfect forms, and to have perfect forms that are homonymous to the nonperfect forms of the ordinary copula; thus the purported perfect auxiliary is inflectionally deeply abnormal. Second, we need to derive the fact that there is no present form of the perfect auxiliary (remem- ber that the present perfect is a morphologized form). There are two ways this could be done. We could further stipulate that the perfect auxiliary is defective for
36 S SUBJ hi COMPS hi H 1 NP 3 NP V′ SUBJ h 1 i Maryam in tˆablo COMPS h 3 i H V[perf-part] V[prf,bnd,pst,3sg] 2 LEX + SUBJ h 1 i VC + COMPS h 2 , 3 i foruxte bud
Figure 2: The syntactic structure of perfect periphrases the present, despite the fact that the ordinary copula it derives from has perfectly good present forms (in fact, two sets of such forms: clitic and nonclitic ones). Or we could assume that some form of competition between morphology and syntax is taking place (Poser, 1992)—but the postulation of such competitions is notori- ously difficult to state precisely, and quite alien to the design properties of HPSG. Finally, we need to find a way of stating that the passive auxiliary can not take the perfect auxiliary as its complement: while (33a) is well-formed, (33b) is not.
(33) a. In tˆablo foruxte ˇsode bud. this painting sold become be.PST.3SG ‘This painting had been sold.’ b. * In tˆablo foruxte bude ˇsod. this painting sold be become.PST.3SG
While these problems can definitely be circumvented by specifying an appro- priately complex inflectional paradigm for the perfect auxiliary, it is striking that many conterintuitive stipulations are needed just because it is not possible to state that the periphrastic perfect is part of the inflectional paradigm of the main verb. The next subsection attempts to modify the framework in a way that allows for the formulation of such an analysis.
4.2 An alternative solution: exponence as valence As the last subsection stressed, what we need is a way to treat perfect forms as part of the inflectional paradigm (Ackerman and Stump, 2004), while allowing for the fact that they correspond to a combination of two words, one of which may be extracted. The solution we explore here can be stated informally as follows: a perfect form of a lexeme Y is a word whose phonology is borrowed from that of a form of the lexeme budan, but which subcategorizes for a perfect participle of this
37 same lexeme Y . For instance, the 3SG positive complex bounded past of xaridan meets the description in (34), which is exactly like (31) except for the fact that it is an instance of the lexeme xaridan.
PHON bud
LID xaridan tns pst HEAD PRF + MORSYN 2 AGR POL + (34) verb FORM part HEAD + PRF ARG-ST 1 , POL + ⊕ L * + LEX + VC + 1 L ARG-ST h i⊕ The challenge now is to derive (34) in a principled way, while integrating it within an inflectional system where perfect forms may be realized either synthet- ically or periphrastically. The approach we propose is based on an extension of the power of realization rules in the spirit of (Spencer, 2005). In classical PFM, realization rules relate phonology-lexemic index pairs to phonology-lexemic index pairs. We propose that valence lists be added to the picture: realization rules now relate triplets of a phonological representation, a lexemic index, and an argument structure specification. The meta-constraint in (8) is updated as in (35), so that argument structure is examined at the morphology-syntax interface.5
(35) Morphology-syntax interface (preliminary version) PHON 1 ARG-ST 2 A sign of type word meeting the description LID 3 HEAD 4 "MORSYN # is well-formed only if the PFM grammar licenses phonology 1 and argu- ments 2 as a realization of the features 4 for the lexeme 3 .
The rule licensing (34) is given in (36). To realize a feature structure σ verify- ing [PRF +], one should refer the phonology to that of the corresponding bounded
5The formulation of this constraint presupposes that the HPSG grammar says nothing about in- dividual lexical entries, and that most of the usual HPSG theory of the lexicon is recast as part of the morphological component.
38 positive nonperfect form of budan, and add to the argument list a requirement for a form of Y realizing the same feature set except for the fact that it is a participle.
PHON X Y σ : + (36) LID , PRF −→ VAL Z h i PHON X PRF − σ PHON referLID budan, \ASP bnd, I-V Z + VAL POL Y LID + LEX VC + VAL Z⊕ Y LID *HEAD + σ MORSYN \ FORM part h i The proposed analysis makes the following correct predictions. First, negation is handled correctly: the phonology of the head word is constrained to be that of a positive form of budan, whereas the participle shares its polarity value with that of the head word. Thus the head will never carry a negation prefix, but its nega- tive polarity value will be realized as a prefix on the participle it selects. Second, the complex perfect is predicted to exist without stipulation: because evidentiality is morphologized and available for all past forms, rule (36) will generate an indi- rect past perfect with the phonology of an indirect bounded past form of budan. Figure 3 illustrates the relevant analysis. Third, the analysis correctly predicts that (33a), and not (33b), is grammatical. This is because the passive auxiliary, as a lex- eme, can be put in the perfect; whereas there is no sense in which one can put the perfect auxiliary in the passive, because there is no such thing as a perfect auxiliary lexeme. The analysis of (33a) is shown in Figure 4. Finally we account straightforwardly for the nonexistence of a periphrastic present perfect. Since (36) is an inflectional realization rule, it interacts with the rule of referral in (30) under the logic of rule specificity: thus the existence of (30) overrides the application of (36). In this sense the current analysis of the pe- riphrastic perfect is syntactically reductionist: periphrasis is reduced to valence; no phrasal constructions or lexical entries are needed. We assume a notion of rule competition, but this competition is segregated to the inflectional component, where it is arguably needed for independent reasons. Thus no competition be- tween morphology and syntax (e.g. Poser, 1992; Bresnan, 2001) needs to be or- cherstrated.
4.3 Discussion The analysis of the Persian perfect outlined above attempts to capture the tradi- tional intuition of periphrastic inflection. While there are many ways one might
39 S SUBJ hi COMPS hi H 1 NP 3 NP V′ SUBJ h 1 i Maryam in tˆablo COMPS h 3 i H V[perf-part] V [bnd-past,indir,prf,3sg] LID foruxtan LID foruxtan 2 LEX + SUBJ h 1 i VC + COMPS h 2 , 3 i foruxte bude-ast Figure 3: Analysis of a sentence in the complex pefect: ‘Reportedly, Maryam had sold this painting.’
S SUBJ hi COMPS hi H 2 NP V[perf-part] V′ LID foruxtan SUBJ h 2 i in tˆablo LEX + COMPS h 3 i 3 SUBJ h 1 i H COMPS h 2 i V[perf-part] V [bnd-past,3sg] VC − LID ˇsodan-aux LID ˇsodan-aux foruxte SUBJ h 2 i SUBJ h 2 i 4 COMPS h 3 i COMPS h 4 , 3 i LEX + VC + ˇsode bud Figure 4: Perfect-passive interaction: analysis for (33a)
40 attempt to reach this goal in the context of HPSG (see in particular Ackerman and Webelhuth, 1998), the specific design goal here has been to devise an analysis that meets as much as possible both the analytical habits of HPSG syntax and of real- izational morphology. Thus as far as clausal syntax is concerned, our analysis is undistinguishable from an argument composition analysis, and we have attempted to account for all relevant syntactic features of the construction. On the other hand, the lexical analysis is as close as possible to standard realizational morphology; in particular it relies heavily on the architecture of rule blocks and rule competition to generate the correct lexical representations. While we fully assume this research strategy, alternatives are readily conceiv- able that meet different design goals but produce very similar analyses. For in- stance, turning the analysis into a standard HPSG analysis, with item-and-process morphology encoded via lexical rules, is easy: one just needs to recast rules such as (30) and (36) as lexical rules, and modify the morphosyntactic descriptions (using quite a bit of negation and disjunction and/or fine-tuning the type system) so as to make them mutually exclusive. The resulting system is more conservative from an HPSG perspective, although one may doubt that it is more perspicuous. A different issue raised by the current analysis is its interaction with the analy- sis of coordination. Coordination of participles is possible in the perfect in Persian, just as it is in the passive (37). This can not be treated as a simple instance of con- stituent coordination under our analysis: because we assume that the auxiliary is really an inflected form of the main verb, there is no single lexeme of which bud is the realization in (37). While this is definitely a problem, it is a familiar one, reminiscent of issues pertaining to coordinations of unlikes. We see two potential solutions. First, we could assume an ellipsis-based analysis of (37) along the lines of analyses proposed by (Yatabe, 2001; Crysmann, 2003; Beavers and Sag, 2004). Second, we could assume a richer ontology of LID values where a neutralized value common to both participles is assigned to the coordinate phrase in (37), extending work in the tradition of (Daniels, 2002; Levy and Pollard, 2002; Sag, 2003). This neutralized value could then serve as an appropriate input for rule (36).6 Whether these strategies prove fruitful will have to wait for future research, and in particular for a detailed empirical study of coordination in Persian. (37) Maryam tˆablo-rˆa pasandide va xaride bud. Maryam painting-DDO liked and bought be.PST ’Maryam had liked and bought the painting.’
5 The future
For the periphrastic future, a number of different analytic options are available. As in the case of the periphrastic perfect, the verb sequence can not be interrupted, and
6Notice that the postulation of neutralized LID values is needed anyway to allow for constituent coordination under the assumptions of (Sag, 2007). Thus the issue raised by our analysis is an issue that needs to be addressed anyway.
41 occurs in a rigid order.
(38) a. Maryam Omid=rˆa xˆah-ad did. Maryam Omid=DDO want.S1-3.SG see.S2 ‘Maryam will see Omid.’ b. *Maryam xˆah-ad Omid=rˆa did. Maryam want.S1-3.SG Omid=DDO see.S2 c. *Maryam Omid-rˆa did xˆah-ad. Maryam Omid-DDO see.S2 want.S1-3.SG
The periphrastic future does not enter into paradigmatic relations with syncretic inflection. Thus it could be accounted for entirely within syntax. On the other hand, syntactic rules do not manipulate portions of the periphrastic construction— notably, the nonauxiliary part of the future can not be fronted. Thus nothing pre- cludes either a purely morphological analysis. There is however one argument favouring a purely morphological analysis, al- though it is not a very strong one. The future auxiliary looks like a present tense form of xastanˆ ‘want’, except that it does not carry the unbounded auxiliary nor- mally found in the present. If we were to treat the future construction as phrasal, we would thus need to set up the grammar so that the morphology output supplemen- tary forms, the distribution of which we would then need to constrain drastically within syntax.7 We thus opt for a purely morphological analysis.8 We propose to use the rule in (39), which is a double portmanteau rule of referral. To find the
7The nonfinite form appears to be a bare past stem. Words homophonous to a bare past stem are used in two other contexts: in the bounded past with a 3SG subject, where the exponent of agreement is null; and in the impersonal complement of some modal verbs such as bˆayastan ‘must, be necessary’ (1). (1) a. Maryam (hatman) bˆay-ad be madrasa be-rav-ad. Maryam certainly must.S1-3SG to school IRR-go.S1-3SG ‘Maryam definitely has to go to school.’ b. (Hatman) bˆay-ad be madrase raft. certainly must.S1-3SG to school go.S2 ‘It is definitely necessary to go to school.’
8One could argue from the fact that object clitics can be realized either on the auxiliary (i) or on the nonfinite form (ii) that they should be treated as two distinct syntactic atoms; but since we treat object clitics as affixes anyway, the question is moot. In any case, the analysis in (39) can readily be extended to account for (i), but an account of (ii) will need to rely on more extensive revisions.
(i) Maryam xˆah-ad did-aˇs Maryam want.S1-3.SG see.S2-PAF.3.SG ‘Maryam will see her/him.’ (ii) Maryam xˆah-ad-aˇs did Maryam want.S1-3.SG-PAF.3.SG see.S2 ‘Maryam will see her/him.’
42 phonology of a future form, one needs to concatenate the output of block IV on the form xahˆ with a bare past stem of the lexeme being realized.
PHON X (39) LID Y , σ : TNS fut −→ Z ARG-ST h i PHON xˆah PHON referLID Y , σ,V Z ARG-ST L X PHON referLID Y , σ\ TNS pst , I Z ARG-ST h i LID Y Z ARG-ST 6 The progressive
All unbounded forms may give rise to a progressive interpretation, but that inter- pretation can also be forced by using the periphrastic construction illustrated in (4). Unlike the ones we discussed so far, this construction results from the grammat- icalization of a finite complement clause construction, and all relevant evidence points to the fact that an embedded clausal structure is still present.9 The nonaux- iliary verb is unmistakably a finite form; it occurs on the right of the auxiliary, as finite complement clauses occur on the right of their head. No complementizer can be used, but complementizers are optional for finite complements (40). Comple- ments normally occur between the two verbs; they can scramble to the left of the auxiliary, but this is also possible with clausal complements (41). Finally, object clitic pronouns must be realized on the nonauxiliary verb, and cannot climb to the auxiliary (42).
(40) a. Maryam dˆar-ad (*ke) ketˆab mi-xˆan-ad Maryam have.S1-3SG COMP book UNBD-read.S1-3SG ‘Maryam is reading a book.’ b. Maryam mi-xˆah-ad (ke) bˆa Omid har ruz be sinemˆa Maryam UNBD-want.S1-3SG COMP with Omid every day to theatre be-rav-ad SBJV-go.S1-3SG ‘Maryam wants to go to theatre with Omid everyday.’
9Persian raising and control constructions normally rely on a finite unsaturated complement clause. Infinitival complements are available only in a very formal register.
43 (41) a. Maryam in ketˆab=rˆa dˆar-ad mi-xˆan-ad Maryam this book=DDO have.S1-3SG UNBD-read.S1-3SG ‘Maryam is reading this book.’ b. Maryam bˆa Omid mi-xˆah-ad (ke) har ruz be sinemˆa Maryam with Omid UNBD-want.S1-3SG COMP every day to theatre be-rav-ad SBJV-go.S1-3SG ‘Maryam wants to go to theatre with Omid everyday.’ (42) a. Maryam dˆar-ad mi-xˆan-ad=aˇs Maryam have.S1-3.SG UNBD-read.S1-3SG-=3SG ‘Maryam is reading it.’ b. * Maryam dˆar-ad=aˇs mi-xˆan-ad Maryam have.S1-3SG=3SG UNBD-read.S1-3SG
This data can be accounted for by assuming a slightly idiosyncratic lexemic entry for the auxiliary daˇstanˆ . This entry assumes that prog is a subtype of the ASPECT value unbd (unbounded). As a result of its lexeme-level specification, this auxiliary is defective for all subparadigms except the present, the unbounded past and the complex unbounded past, in accordance with the facts. The subject of the complement is constrained to be an nc-pro, the type of pro-dropped subjects, and coindexed with the auxiliary’s subject. The analysis is illustrated in Figure 5.
LID daˇstan-auxˆ HEAD MORSYN 1 ASP prog CONT 2 h i 1 MORSYN (43) MARKING none CONT 2 ARG-ST IND 3 , nc-pro * + h iSUBJ *"IND 3 #+ COMPS hi
References
Ackerman, Farrell and Stump, Gregory T. 2004. Paradigms and Periphrastic Ex- pression. In Louisa Sadler and Andrew Spencer (eds.), Projecting Morphology, pages 111–157, Stanford: CSLI Publications.
Ackerman, Farrell and Webelhuth, Gert. 1998. A Theory of Predicates. Stanford: CSLI Publications.
44 S
H
1 NP V S LID daˇstan-aux MORSYN 0 MARKING none Maryam TNS prs MORSYN 0 1 ASP prog nc-pro " # SUBJ 3 IND 4 * " #+ SUBJ 1 IND 4 COMPS hi h i COMPS h 2 i H dˆarad 5 NP V LID foruxtan 0 in tˆablo rˆa MORSYN MARKING none SUBJ h 3 i COMPS h 5 i
miforuˇsad
Figure 5: Analysis for (4) ‘Maryam is selling the painting.’
Beavers, John and Sag, Ivan A. 2004. Coordinate Ellipsis and Apparent Non- Constituent Coordination. In Proceedings of the HPSG04 Conference, Stanford: CSLI Publications.
Bresnan, Joan. 2001. Lexical-functional syntax. Oxford: Basil Blackwell.
Crysmann, Berthold. 2003. An Asymmetric Theory of Peripheral Sharing in HPSG: Conjunction Reduction and Coordination of Unlikes. In Proceedings of Formal Grammar 2003.
Daniels, Michael W. 2002. On a Type-Based Analysis of Feature Neutrality and the Coordination of Unlikes. In Frank Van Eynde, Lars Hellan and Dorothee Beer- mann (eds.), Proceedings of the 8th International Conference on Head-Driven Phrase Structure Grammar, pages 137–147, Stanford: CSLI Publications.
Gazdar, Gerald, Klein, Ewan, Pullum, Geoffrey K. and Sag, Ivan A. 1985. Gener- alized Phrase Structure Grammar. Harvard: Harvard University Press.
Hinrichs, Erhard and Nakazawa, Tsuneko. 1994. Linearizing AUXs in German ver- bal complexes. In John Nerbonne, Klaus Netter and Carl Pollard (eds.), German in HPSG, pages 11–37, Stanford: CSLI Publications.
45 Jahani, Carina. 2000. Expressions of indirectivity in spoken Modern Persian. In Lars Johanson and Bo Utas (eds.), Evidentials: Turkic, Iranian and Neighbour- ing Languages, pages 185–207, Berlin: Mouton de Gruyter.
Lazard, Gilbert. 1985. L’inf´erentiel ou pass´edistanci´e en persan. Studia Iranica 14, 27–42.
Lazard, Gilbert, Richard, Yann, Hechmati, Rokhsareh and Samvelian, Pollet. 2006. Grammaire du persan contemporain. Tehran: IFRI and Farhag-e Moaser.
Levy, Roger and Pollard, Carl. 2002. Coordination and Neutralization in HPSG. In Frank Van Eynde, Lars Hellan and Dorothee Beermann (eds.), Proceedings of the 8th International Conference on Head-Driven Phrase Structure Grammar, pages 221–230, Stanford: CSLI Publications.
Poser, William J. 1992. Blocking of phrasal constructions by lexical items. In Ivan A. Sag and Anna Szabolcsi (eds.), Lexical matters, pages 111–130, Stan- ford: CSLI Publications.
Sag, Ivan A. 2003. Coordination and underspecification. In Proceedings of the 9th HPSG Conference, pages 267–291, Stanford: CSLI Publications.
Sag, Ivan A. 2007. SIgn-Based Construction Grammar: An Informal Synopsis, ms., Stanford University.
Spencer, Andrew. 2005. Inflecting clitics in Generalized Paradigm Function Mor- phology. Lingue e Linguaggio 4, 179–193.
Stump, Gregory T. 2001. Inflectional Morphology. A Theory of Paradigm Structure. Cambridge: Cambridge University Press.
Windfuhr, Gernot. 1982. The verbal category of inference in Persian. In Momentum Georg Morgenstierne II, pages 263–287, Leiden: E. G. Brill.
Yatabe, Shˆuichi. 2001. The Syntax and Semantics of Left-Node Raising in Japanese. In Dan Flickinger and Andreas Kathol (eds.), Proceedings of the 2001 HPSG Conference, Stanford: CSLI Publications.
46 Construction-based Cumulation and Adjunct Extraction
Rui P. Chaves University at Buffalo, SUNY
Proceedings of the 16th International Conference on Head-Driven Phrase Structure Grammar
Georg-August-Universität Göttingen, Germany Stefan Müller (Editor) 2009 CSLI Publications pages 47–67 http://csli-publications.stanford.edu/HPSG/2009
Chaves, Rui P. 2009. Construction-based Cumulation and Adjunct Extraction. In Müller, Stefan (Ed.), Proceedings of the 16th International Conference on Head- Driven Phrase Structure Grammar, Georg-August-Universität Göttingen, Germany, 47–67. Stanford, CA: CSLI Publications. Abstract Previous HPSG accounts of extraction blur the distinction between valents and adjuncts by allowing verbs to lexically control the modifiers that com- bine with their phrasal projections. However, assuming that adjuncts are valents runs into various difficulties. This paper argues that the distinction between complements and adjuncts can be maintained, and that certain se- mantic phenomena that challenge traceless theories of extraction can be seen as an instance of a more general process. Finally, this paper also discusses a uniform mechanism for case assignment to valents and adverbial nominals.
1 Introduction
Pollard and Sag (1994) and others have noted that certain verbal adjuncts can be extracted, as in (1). Although extractable, these phrases behave like adjuncts in many other aspects (they are not semantic arguments of the verb that they modify, are optional, can be iterated, are canonically VP-final, pass the do-so test, and have a freer distribution than true arguments).
(1) a. [Yesterday], it seems that [Kim arrived home very early ]. b. [(On) that day], I think that [Kim went home very late ]. c. [How often] do you think that [Fred was late this week ]? d. It was [with a stick] that [we killed the snake ].
The distinction between adjuncts and complements also appears to be blurred cross-linguistically, in extraction pathway marking phenomena (see Clements et al. (1983); McCloskey (1979); Hukari and Levine (1995) inter alia), in case assign- ment to adverbial nominals (see Maling (1989, 1993) inter alia), and in adverbial scope (van Noord and Bouma, 1994). Thus, it can be argued that at least some verbal adjuncts are selected or controlled by the verbal head. This poses various puzzles, and runs counter the standard distinction between arguments and adjuncts. §2 discusses previous accounts of these phenomena, and §3 proposes a new analy- sis that allows for a simpler view of the adverbial argument-adjunct puzzle.
2 Previous accounts
There are two main approaches to adjunct extraction that have been proposed within HPSG. One is lexical (the lexical entry of the verb hosts adverbial gaps in SLASH), and another is phrasal (adverbial gaps are introduced syntactically). Both run into problems in the presence of conjunction, as discussed below.
†I thank the audience of the HPSG09 conference and reviewers for their comments and criticism. A very special dept is owed to Doug Arnold. I am also grateful to Olivier Bonami, Philip Hofmeister, EunHee Lee, Stefan Muller,¨ and Ivan Sag. I am the sole responsible for any errors or omissions.
48 2.1 Lexical gap addition Pollard and Sag (1994, 387) propose the lexical rule in (2), which adds one adjunct to the SLASH feature of verbs that subcategorize for clauses.
(2) COMPS h... 2 S... i
COMPS h... S... i MOD 2 V → V SLASH XP "SLASH {} # "INDEX 3 # CONT | SOA-ARG 3 But as Hukari and Levine (1996) note, this account is problematic on various fronts. Not only adverbs can be extracted when verbs take VP complements, but there are a number of languages for which valent and modifier extraction triggers exactly the same morphophonological processes. This is the case of Kikuyu downstep suppression (Clements et al., 1983) and Irish complementizers (McCloskey, 1979), among many others. Since extraction pathway marking for valents and (some) modifiers triggers the same phenomena in these languages, van Noord and Bouma (1994) and Bouma et al. (2001) propose that such modifiers are in fact lexically selected by the verbal head, as dependents. Thus, adjunct extraction can be handled in the same way as valent extraction. The account in Bouma et al. is given in (3).
(3) ARGUMENT STRUCTURE EXTENSION
HEAD 1 ARG-ST 2 1 verb ⇒SYNSEM HEAD DEPS 2 ⊕ list MOD CONT | KEY 3 " # 3 CONT | KEY However, giving up adjunction is problematic, as Levine (2003) notes. First, further assumptions are needed in order to account for cases that would otherwise be taken care as standard VP adjunction, as in (4).
(4) Nobody can [[drink four beers and eat two hotdogs] [under fifteen seconds]].
If the PP is a complement, then something else must be assumed in order to capture this sentence. For example, one would have to assume that the PP is extraposed ATB or Right-Node Raised. But, as Levine and Hukari (2006) argue, these hy- potheses are at odds with the semantic interpretation that the PP obtains, which ranges over the total time interval denoted by the two conjuncts. In contrast, this reading is trivially obtained if the PP simply adjoins to the conjoined VP. It should be clear that plurality-forming conjunction operates beyond NPs, and can form event pluralities (Bach, 1986; Lasersohn, 1995; Link, 1998). For exam- ple, the sentence in (5a), adapted from Oehrle (1987), can describe the frequency of two joint event-types rather than independent frequencies of occurrence.
49 (5) a. Often, [[I go to the beach]e1 and [you go to the city]e2 ]e1+e2 .
b. Sue [[[got dressed]e1 and [dried her hair]e2 ]e1+e2 , [in exactly twenty sec- onds]].
c. You can’t simultaneously [[drink]e1 and [drive]e2 ]e1+e2 .
Levine (2003) also points out that the cumulative reading can occur even when the adjunct is extracted, as in (6). Here, In how many seconds flat predicates the total of three events denoted by the embedded coordinate VP, not each conjunct.
(6) In how many seconds flat do you think that [Robin found a chair, sat down and took off her logging boots]?
This utterance is a query about the total time occupied by the occurrence of three (possibly overlapping) events. Such a reading suggests that the extracted con- stituent is not a complement of anything in the sentence. If it were, then the adjunct should be predicating over each of the conjuncts separately, not the higher VP co- ordination node. To address this problem, Sag (2005) proposes that the extracted phrase in how many seconds flat is a complement that can semantically outscope the verb structure that it modifies. In a coordinate structure, the PP extracted ATB is naturally required to simultaneously outscope each of the verbs heading the con- juncts, thus obtaining wide scope over the entire coordination. However, there is no evidence that the modifier in X seconds is semantically scope-bearing. Compare the unambiguous examples in (7) with the examples in (8). Only the latter contain scope-bearing modifiers and trigger an ambiguity with respect to the wide or narrow scope interpretation of the indefinite NP.
(7) a. Someone died in the arena yesterday / under twelve seconds flat. b. Kim sang a song yesterday / in twelve seconds flat.
(8) a. Someone probably / usually died in the arena. b. Kim probably / usually / often sang a song.
Scope cannot in general solve the cumulation problem, but in §3 I will argue that the challenging phenomenon in (6) is the consequence of other semantic aspects.
2.2 Syntactic gap addition Assuming that adjuncts are modifiers, Levine (2003) proposes that extraction paths are terminated by traces. Thus, a modifier can instantiate the trace in (9) and adjoin to VP as usual. When it does, it creates an unbounded dependency that is percolated and linked to a filler, like any other unbounded dependency. Cumulative readings arise as a consequence of adjuncts being able to adjoin to VPs, coordinate or not.
50 (9) PHON hi LOC 1 SYNSEM SLASH 1 n o In order to deal with adverbial case assignment and extraction pathway marking, Levine and Hukari (2006) introduce a new feature ADJS, which allows the lexical entry of a verb to list adjuncts realized in arbitrarily high positions. This list is lexically underspecified, and becomes instantiated at the phrasal level, when mod- ifiers adjoin to a VP projected by that verb. The verb can thus lexically access any gaps that may reside in an adjunct located in a higher position in the syntactic tree (see also Sato and Tam (2008)). With regard to scope ambiguities in Dutch that have been argued to follow from an adjuncts-as-complements analysis, Levine (2003) proposes that these can be captured with direct access to the relevant parts of semantic representation.1 There are however some concerns with the adjunct extraction account in Levine (2003). First, nothing prevents the adverb trace in (9) from being adjoined to each of the VP conjuncts, instead of the coordinate mother VP. Given the standard as- sumption that SLASH values are structure-shared between daughters and mother in coordination, then one would obtain an impossible interpretation where each con- junct event is the same. This is shown schematically in Figure 1; notice that both VPs become structure-shared in 1 ). The problem is related to the one discussed in Levine and Hukari (2006, 159), where structure-sharing slashed event-modifying adjuncts originating in each conjunct yield a description that no sign can satisfy.
VP SLASH { 2 }
VP VP SLASH { 2 } SLASH { 2 }
1 VP: e 2 Adv Conj VP 1 [MOD :e] SLASH { 2 } t 1 VP: e 2 Adv [MOD 1 :e]
t
Figure 1: ATB adverbial extraction and impossible descriptions
1This can be achieved in a number of different ways, for example, if KEY corresponds to the predication φ(v1, ..., vn) semantically heading a phrase, then other heads can access the predicate’s argument slots. Thus, an adverb like often can either predicate the verb heading the phrase that it adjoins to, or the scopal argument of that verb. See also Crysmann (2004) and Sato and Tam (2008).
51 This is not the same as a distributive interpretation of the adverb. For example, yesterday is distributive in the sense that when applying to a sum of events such as e1+e2, it does not yield a collective predication yesterday(e1+e2), but rather, a dis- tributive one: ∀e(e ≤ e1+e2 → yesterday(e)). In the latter, the adverb predicates over each mereological part e of the event sum e1+e2. A modifier like for how long on the other hand, can apply collectively to the entire sum. Which adverbs are distributive, which are collective, and which are ambiguous is a matter of lexi- cal specification, similar to how verbs like smile, meet, and hire can interpret their pluralic NP arguments in different ways. This is what seems to be happening in the ambiguous example from Levine and Hukari (2006,186), shown below. Although the PP attaches to the higher VP coordination, it can either apply distributively to each event in the sum e1+e2, or apply collectively to the entire sum.
(10) Robin [stands on his head and falls off his chair]e1+e2 in order to attract people’s attention.
A second concern pertains to the assumption that extraction is terminated by traces. I will side with Sag and Fodor (1994) and Sag (2000) in assuming that extraction can be modelled without resorting to traces. The question is, of course, how can this be achieved parcimoniously. One possible alternative is explored in Muller¨ (1999, 108–109,447) and Chaves (2007,Ch.7), who show that it is possible to for- mulate unary-branching traceless extraction rules. Consider (11), based on Chaves (2007), which allows an adjunct to become a member of the head’s SLASH.
(11) ADJUNCT EXTRACTION RULE:
LOC 1 SYNSEM SLASH 2 ∪ XP MOD 3 h i adj-extr-phr ⇒HD-DTR 4 LOC 1 DTRS 4 SYNSEM 3 2 * "SLASH #+ Assuming a Ginzburg and Sag (2000) framework, the GENERALIZED HEAD FEA- TURE PRINCIPLE would ensure that valence and head features are percolated in the tree structure. The value of SLASH is percolated as dictated by the rule in (11), overriding the default percolation of the GENERALIZED HEAD FEATURE PRINCI- PLE. Since (11) is independent from coordination, and adverbs are lexically select VPs, gap insertion can also arise in non-coordinate VPs. In other languages ad- verbs may select S nodes or even to V nodes instead. This analysis can also be augmented with the ADJS feature, so that whenever the rule in (11) applies, the gap adjunct is identified with a member of the head’s ADJS list. This way, a verb can access adjuncts higher up in the tree, for gap threading purposes. The account is illustrated in Figure 2. The VP daughter is
52 modified by an adverbial phrase located in SLASH. Thus, yesterday modifies the event plurality that the VP conjunction yields.
S SLASH {} h i 1 AP S SLASH { 1 } Yesterday h i NP VP SLASH { 1 } I h i V S SLASH { 1 } think h i
NP VP
Kim SLASH 1 AP MOD 2 " # h i 2 VP SLASH {} h i
came too early and left too late
Figure 2: A modifier adjoining into SLASH
The ATB adjunct extraction in (12) can also be easily addressed. In Copestake et al. (2006), existentially quantification of events is often omitted for simplifica- tion purposes, but if one makes such quantification explicit – as in ∃e left(e, kim) – then the adjunct extraction rule cannot apply to each VP conjunct because the resulting semantic structure is ill-formed. Consider the parse in (12).
(12) [Under how many seconds flat] did Kim [pack and escape ]?
Each adjunct is located in the SLASH value of each VP conjunct, and predi- cates over the respective event. The modifiers must be one and the same at the coordination level, because the coordination rule imposes identity of SYN values (as for example, in Beavers and Sag (2004)). Thus, the adverbial phrase filler has to predicate the very same event across conjuncts, and has to be simultaneously located under the scope of each existential quantifier. This yields an ill-formed MRS structure because the underspecified representation cannot describe a tree structure, as depicted in Figure 3. Arrows denote underspecified semantic subor- dination constraints. Here, k is the shared subject index of Kim, and e′ = e1 = e2 . This solution follows from the coordination rule, and is valid for extraction accounts with or without traces.
53 ∃ e ( e = e1 + e2 ∧ ∃ e1 (...) ∧ ∃ e2 (...)
pack( e1 , k ) escape( e2 , k )
seconds-of-duration(s, e′ )
Figure 3: ATB adjunct extraction yielding an illegal MRS representation
2.3 Taking stock All of the extraction accounts discussed so far – the adjuncts as complements analy- sis and the phrasal adjunction analyses – allow verbs to select for in situ adjuncts, blurring the distinction between adjuncts and valents. In one case this is done lex- ically, and in the other, via a special valence feature ADJS, so that gap threading phenomena can be dealt with. However, there is in my view no semantic evidence that such adjuncts are com- plements. In fact, the semantic evidence observed in coordination indicates that adjuncts semantically combine with phrase structures. On the other hand, the fea- ture ADJS seems to lack independent motivation because it is only relevant for a particular class of post-verbal adjuncts, and not, for example, adnominal modifiers. In what follows I propose a simpler and more general analysis that maintains a strong distinction between adjuncts and valents, and dispenses the need for traces, new constructions, and ADJS. As in Pollard and Sag (1994), adjunct gaps start out lexically so that heads can only detect extracted adjuncts, not in situ ones, and cumulative phenomena are handled semantically in ways reminiscent of Sag (2005). By viewing the cumulative readings of extracted elements as an instance of a more general phenomenon, we will also be able to deal with other cases that arise beyond adjunct extraction.
3 A coordination-based proposal
Let us assume that there are no traces, and that adjuncts are not valents but rather modifiers in the usual sense. It could be that the cumulation of extracted adver- bials is an instance of a more general phenomenon where the plurality-formation operation triggered by conjunction ‘bleeds over’ to certain unrealized dependents. Consider the data in (13), from Postal (1998, 136,160) and Kehler (2002, 125). Here, an extracted NP can denote a plurality composed of two individuals, each being linked to each verb in each conjunct. These data are relevant because each conjunct contains a different gap, and the two gaps are not fused together as a sin- gle entity. Rather, they might be cumulated into a complex entity (a conjoined NP) and permitted to percolate independently.
54 (13) a. [How many frogs]i and [how many toads]j did respectively Greg capture i and Lucille train j?
b. [[Which pilot]i and [which sailor]j] will Joan invite i and Greta enter- tain j (respectively)?
c. [[What book]i and [what magazine]j] did John buy i and Bill read j respectively?
This is somewhat unexpected, because coordination is known to not allow different conjuncts to host different gaps, as shown in (14).
(14) *[A violin this well crafted]i, even [the most difficult sonata]j will be easy to write j and to play it on i.
One can argue that in (13) there is only one gap at the coordination level, and that this gap is linked to a pluralic filler. Each member of the plurality is predicated by a different VP conjunct. In (14) however, there are two fillers and thus each VP conjunct would have to contain a different gap. The latter is correctly ruled out if one assumes that SLASH values of conjuncts and mother node are structure- shared. Thus, it seems that as conjunction forms a plurality from the indices of the conjoined heads, the extracted dependents can be pluralized in a similar fashion. The cumulation of gaps is illustrated in Figure 4.2
S[SLASH {[INDEX x + y]}]
S[SLASH {[INDEX x]}] S[SLASH {[INDEX y]}]
Figure 4: Conjunct nominal gap sharing with cumulation
If this analysis is on the right track, then one would expect to find the same phe- nomenon in constructions without respectively. However, the detection of such data is not easy because the gaps in such examples are preferentially interpreted non-cumulatively. In (15) I provide such data. These sentences can be interpreted as conveying that the plural NP filler corresponds to the union X ∪ Y, where X and Y are the entities extracted from different conjuncts.
(15) a. Setting aside illegal poaching for a moment, how many sharksX+Y do you estimate [[ X died naturally] and [ Y were killed recreationally]]?
b. The [shipsx+y that [[a U-boat destroyed x] and [a kamikaze blew up not insured ]]]] were . y the Laconia and the Callaghan 2Chaves (2009) proposes a direct account of respectively readings that does not resort to any form of ‘conjunction reduction’, syntactically or semantically. Further research is forthcoming.
55 c. The housesX+Y [[the fire reduced to ash X ] and [the flood leveled down Y ]] were near each other.
Now, the adjunct cumulation cases in Levine (2003) might be due to the same kind of phenomenon. This is illustrated in Figure 5. Each conjunct has a different adverbial gap, the index of each adverb matches the event that it modifies, and conjunction allows the indices of the adverbial gaps to be cumulated.
S[SLASH { }]
1 Adv S[SLASH { 1 }]
NP VP[SLASH { 1 [INDEX e1 + e2]}]
VP[SLASH {[INDEX e1]}] VP[SLASH {[INDEX e2]}]
sat down and took off her logging boots
Figure 5: Conjunct adverbial gap sharing with cumulation
Put more in more general terms, in a coordinate structure with n displaced struc- tures with indices α1,...,αn, these can either be combined into one and the same entity α1 = ... = αn or combined cumulatively into a complex entity, for exam- ple, a Linkean sum: α1 + ... + αn. In other words, the plurality-formation bleeds over to certain dependents. Because this mechanism is observed beyond adjunct extraction, it begs for a general account. In this view where shared gaps in coordination can be cumulated, adjunction can operate as usual, and the only adjuncts that verbs need to have control over are the ones in SLASH, because of extraction pathway marking phenomena.
3.1 Other possible cases of cumulation in conjunction Perhaps the cumulation phenomena observed above also arises in other kinds of dependents. For example, Vergnaud (1974), Abbott (1976), Jackendoff (1977) and others have noted a phenomenon where the same structure is cumulatively con- nected to different clauses. For example, in (16a) the plural NP very different opponents denotes a set of individuals some of which were defeated by John and some of which beat Mary. Similarly, in (16b) we do not know how much Fred spent nor how much Mia lost, although we know the total amount that Fred spent and Mia lost. In other words, these sentences are not equivalent to their counterparts in
56 which the ‘right node raised’ NP is in situ after the first verb.3
(16) a. John DEFEATED and Mary LOST TO very different opponents. b. Fred SPENT and Mia LOST a total of $10.000.
This process is sometimes not possible or highly marked. For example, reflexive expressions cannot be pluralized in this fashion:
(17) *John LOVES and Mary HATES themselves / each other.
This process is unique to plurality-forming conjunction. As Beavers and Sag (2004,66) note, disjunction does not allow cumulative readings:
(18) a. *John DEFEATED i or Mary LOST TO j [very different opponents]i+j.
b. *Either Fred SPENT i or Mia LOST j [a total of $10.000]i+j.
Yatabe (2003) argues that there are two kinds of RNR. One is prosodic and has no semantic effect, and another which is linearization-based and has a semantic effect. This distinction may be responsible for the contrast shown in (19) and (20). Prosodic RNR can be long-distance and can apply in non-coordinate contexts (as in (19a) and (20a); see Chaves and Sag (2007) for a recent discussion), whereas linearization RNR cannot be long-distance as shown in (19b) and (20b), and is restricted to conjunction.
(19) a. One man said that he LIKED i and another even boasted that he ADORED i [the woman in the commercial]i.
b. ?*One man said that he LIKED i, and another even boasted that he ADORED j [different women]i+j.
(20) a. One man said that he HATED i just because some other had boasted that he ADORED i [the woman in the commercial]i.
b. ?*One man said that he HATED i just because some other had boasted that he ADORED i [different women]i+j.
Finally, it could be that the same cumulation phenomenon also occurs in extra- position phenomena, in the form of split antecedent relative clauses. Consider the data in (21), based in Ross and Perlmutter (1970). The relative clause is seman- tically linked to both subject NPs. It is unlikely that these are instances of RNR of the extraposed clause (e.g. [S RelC] & [S RelC]) because no conjunct-final prosodic contrast is needed for examples like (21a), and because reflexives can be linked to the pluralized noun, as shown in (21b) (confront with (16)).
3Postal (1998) argues that cases like ?the pilot claimed that the first nurse and the sailor proved that the second nurse were spies also exhibit some form of summation/cumulation of the ‘right-node raised’ verbal structure. However, judgments are gradient and vary significantly, which lead Beavers and Sag (2004) to argue that these are quasi-sentences that result from performance effects. My account could in principle be extended to these data, along the lines pioneered by Yatabe (2002).
57 (21) a. A man entered the room and a woman left who were similar. b. A man came into the store and a woman left right after him who used be in love with each other. c.*A man entered the room or a woman left who were similar.
These cases can be accounted as follows. The nominal head that each extraposed relative clause is modifying is cumulated at the coordination level. One way to achieve this result is illustrated in Figure 6. Alternatively, one could also state this analysis in a more elegant way in terms of anchors (Kiss, 2005; Crysmann, 2004).
S[EXTRA hRelC[MOD x + y]i]
S[EXTRA hRelC[MOD x]i] S[EXTRA hRelC[MOD y]i]
Figure 6: Conjunct relativized head sharing with cumulation
The remainder of the paper will flesh out an account of extraction that is compatible with gap cumulation, as well as with the cumulation of other delayed/displaced dependents such as RNR phrases and extraposed relative clauses.
3.2 HPSG formalization The semantics of conjunction is a complex topic which I cannot address here fully, and so I will make the minimal assumptions needed for the purpose of this paper.4 I start by allowing individual and event indices to be either Linkean sums ‘+’ or atomic elements. The type hierarchy and signature are provided in (22).
(22) index
evt ref
event ind evt-sum ind-sum ARG ref ARG1 evt 1 ARG2 evt ARG2 ref Next, I define a non-deterministic relation ‘≈’ called Integration that maps a pair of lists onto one list. It allows for two different cases: in (23a) the arguments of the relation are simply structure-shared, and in (23b) we obtain the cumulation of two signs, by forming a new sign that denotes a plurality.
4For a more comprehensive semantic account of conjunction see Chaves (2007, Ch.5).
58 (23) a. Direct Integration (structure-sharing equality)
( 1 ≈ 1 ) := 1 list b. Cumulative Integration (summation):
SYN | LOC 0 SYN | LOC 0 SYN | LOC 0 ≈ := ARG1 1 *"SEM | INDEX 1 #+ *"SEM | INDEX 2 #+ * SEM INDEX + ARG2 2 " # If we adopt the extraction account Ginzburg and Sag (2000), then we need only extend the conjunction with gap Integration. Since adjunct gaps reside in SLASH we can dispense ADJS and obtain the cumulation phenomena straight away. The rule is given below, in (24). For reasons of uniformity to be discussed below, the set-valued SLASH is replaced by a list-valued GAP feature, and I will use the feature geometry of Sag (2001). I also omit the full specification of KEY due to lack of space, but I am assuming basically the same as Bouma et al. (2001).
(24) CONJUNCTION CONSTRUCTION
SYN | GAP 3 ≈ 4
MOTHER ARG1 1 SEM INDEX ARG2 2 " # conj-cx ⇒ SYN | GAP 4 SYN | GAP 3 DTRS , SEM | INDEX 2 1 *"SEM | INDEX # + CRD-MRK conj I assume that the coordination construction is more general than the conjunc- tion construction: the former requires LOCAL identity and has nothing to say about semantics, whereas the latter requires a right-marked conjunct with and, and yields a pluralic index from the indices of the conjuncts. Based on Beavers and Sag (2004), I assume coordination is binary branching and resorts to a feature CRD- MRK that identifies the coordination type (i.e. conj, disj, etc.). We can, however, revise Ginzburg and Sag (2000) so that adjuncts are not complements. First, the SLASH-AMALGAMATION CONSTRAINT and the ARGU- MENT STRUCTURE EXTENSION (see §2.1 above and Ginzburg and Sag 2000,169) are blended into a unique condition, in (25). Further constraints should pinpoint the exact class of extractable adjuncts. Then it will follow that h has to be verb if the list of adjunct gaps is non-empty, ruling out adnominal adjunct extraction.
59 (25) GAP AMALGAMATION AND EXTENSION CONDITION
LOC | HEAD h SYN | L | HEAD h SYN GAP ⊕( 1 ⊕...⊕ n )⊖ 0 ⊕list MOD *"SEM | KEY k #+ word ⇒ S 0 BIND k SEM | KEY ARG-ST SYN | GAP 1 ,..., SYN | GAP n h i h i The relation ⊕ allows the amalgamation of gaps: ⊕( 1 2 1 )= 1 ⊕ 2 . This operation takes as input a list, it splits the list nondeterministically into three sub- S S lists (two of which are structure-shared as 1 ) and yields the append of the two remaining distinct lists. This allows some, all, or none of the gaps to be unified. Non-scope bearing verb-modifying adjuncts possess an event index which is structure-shared with the event index of the verb they intersectively combine with. This is illustrated in (26a). Conjunction then is able to combine two such adjunct gaps into one gap with a summed event index. The cumulation of an ATB extracted PP adjunct is shown in Figure 7. I assume that the MRS representation of verbs includes an existentially quantified index, as discussed in §2.2. This cumulation process occurs cross-categorially. With nominal gaps we ob- tain a sum of individual indices and with non-scope bearing verbal modifier gaps we obtain an event sum. As for extractable scope-bearing adverbs like often, I as- sume along with Sag (2005) that the adverb lexically outscopes the verb it modifies, as illustrated in (26b). Thus, in ATB extraction each verb heading each conjunct must be outscoped by the adjunct gap. See Bonami and Godard (2007) for other is- sues pertaining to scope surface order and scope ambiguities, as well as Crysmann
(2004) for a more elaborate MRS account of intersective modification in German.