Distributed Morphology As a Regular Relation Marina Ermolaeva University of Chicago, [email protected]

Proceedings of the Society for Computation in Linguistics Volume 1 Article 20 2018 Distributed Morphology as a regular relation Marina Ermolaeva University of Chicago, [email protected] Daniel Edmiston University of Chicago, [email protected] Follow this and additional works at: https://scholarworks.umass.edu/scil Part of the Computational Linguistics Commons Recommended Citation Ermolaeva, Marina and Edmiston, Daniel (2018) "Distributed Morphology as a regular relation," Proceedings of the Society for Computation in Linguistics: Vol. 1 , Article 20. DOI: https://doi.org/10.7275/R51834PC Available at: https://scholarworks.umass.edu/scil/vol1/iss1/20 This Extended Abstract is brought to you for free and open access by ScholarWorks@UMass Amherst. It has been accepted for inclusion in Proceedings of the Society for Computation in Linguistics by an authorized editor of ScholarWorks@UMass Amherst. For more information, please contact [email protected]. Distributed Morphology as a regular relation Marina Ermolaeva and Daniel Edmiston University of Chicago {mermolaeva,danedmiston}@uchicago.edu 1 Introduction structure (FS). An FS F is defined as a pair M, E , h i where the feature bundle M = feat(F ) is a sub- This research reorganizes the Distributed Morphol- set of some finite set of features (including syntac- ogy (DM, (Halle and Marantz, 1993)) framework tic category labels) and the phonological exponent to work over strings. That the morphological mod- E = exp(F ) (Σ None, ), where Σ is a fi- ule should operate over strings is desirable, since it ∈ ∪ { } nite set of phonemes. The default exponent of each is assumed that most (arguably all) morphological syntactic unit is None, a place-holder to be replaced processes can be modelled with regular languages by vocabulary insertion (VI) in the morphological (Karttunen et al. 1992). As is, DM typically op- module; None is distinct from the null exponent . erates on binary trees, with the syntax- morphol- An example derivation and its yield serving as in- ogy interface implicitly treated as a tree-transducer. put to the morphology are shown in (1); we assume We contend that using (binary) trees is overpowered, that word boundaries (denoted by #) are part of the and predicts patterns which are unattested in natu- linearized representation. ral language (e.g. iterable nested dependencies). If however, we restrict the morphological component (1) > to working on strings, we correctly predict that morphology can be modelled with regular string lan- D, JOHN, 3, SG { None } < guages, and so we treat the morphological compo- D E T, PAST, 3, SG nent as a finite-state string-transducer, i.e. as a regu- { } > lar relation. None D E Assuming the Y-model, standard DM operating V, SLEEP { } on trees presumes that the flattening of the derivation None D E for PF takes place post-morphology. We push this flattening of the structure to above the morphologi- ⇓ D, JOHN, 3, SG # V, SLEEP # T, PAST, 3, SG cal module, between the syntax and morphology. { None } { None } { None } D E D E D E 2 Syntax-morphology interface 3 Morphology as regular relations We use a modified version of Minimalist Grammars As in standard DM, our operations are context- (Stabler, 1997) as our syntactic framework. We en- dependent. Context-dependent rewriting rules have hance the formalism with the notion of node strength the form A B/C D, where A, B, C, D are → imported from Mirror Theory (Brody, 1997; Kobele, regular expressions over some alphabet. With a few 2002) to implement both Head Movement and Low- caveats, such rules (and ordered finite sequences ering as mechanisms for constructing morphological thereof) have been shown to define regular relations words. Following (Trommer, 1999), we assume that on strings (Kaplan and Kay, 1994). Assuming that the basic unit of syntactic computation is a feature morphology can be modelled by regular relations, it 178 Proceedings of the Society for Computation in Linguistics (SCiL) 2018, pages 178-181. Salt Lake City, Utah, January 4-7, 2018 should be possible to state morphological rules in a 4 Cyclicity and Rewriting way compatible with this format. We take A, B to be sequences of FSs and C, D to be regular expres- As we have proposed a move away from tree- sions over FSs. As a proof of concept, we define VI structures and toward strings, it is worth revisiting and Readjustment as follows. VI rewrites a single one of the primary motivations behind tree struc- node as a sequence of FSs which can be thought of tures in the literature, namely Cyclicity/Rewriting as morphophonemes and can be further manipulated (Bobaljik, 2000). by Readjustment rules: (6) Cyclicity: VI starts at the root and proceeds outwards (2) A rule r of the form A B/C D is a VI rule → Rewriting: VI deletes morphosyntactic fea- iff: tures it expresses A = 1, B 1; | | | | ≥ exp(A ) = None for 1 i A ; i ≤ ≤ | | The two constraints in (6) make strong predictions exp(B ) = None for 1 j B ; j 6 ≤ ≤ | | concerning possible contexts for VI rules. In par- 1 i A feat(Ai) = 1 j B feat(Bj) ≤ ≤| | (r is≤ feature-preserving≤| | ); ticular, Cyclicity predicts that phonologically condi- S S tioned allomorphy can only be inward-sensitive, and feat(A1) = ... = feat(A A ) = feat(B1) = | | ... = feat(B B ) (r is set-preserving). Rewriting only allows outward-sensitivity for mor- | | phosyntactic features. One motivating example be- (3) A rule r of the form A B/C D is a Read- → hind Cyclicity comes from Itelmen, where it appears justment rule iff: that dependencies are sensitive to embedding-depth exp(A ) = None for 1 i A ; i 6 ≤ ≤ | | exp(B ) = None for 1 j B ; rather than linear ordering. Consider the follow- j 6 ≤ ≤ | | r is feature-preserving and set-preserving. ing schematic of an Itelmen example adapted from Bobaljik 2000. For instance, the root alternation in sleep/slept can (7) be captured by the VI and Readjustment rules in (4), represented as rewriting rules in (5), with ? as short- A, SUBJ-AGR { } hand for “any exponent”: None D E B, OBJ-AGR (4)[ SLEEP] sli:p { None } → i: E /X Y[PAST], where X Y {CREEP, D E → ∈ SLEEP, WEEP, ...} VERB C, CLASS(II) {None} { None } SLEEP SLEEP SLEEP SLEEP SLEEP D E D E (5) { } { } { } { } None → { s } l i: p In (7), the surface form B is dependent upon the D SLEEP E D SLEEPED EDSLEEP ED∗ PAST E { } { } / { } { } morphosyntantic features of A, and the the form of C i: → E ? ? D E D E hD Ei D E is dependent upon the features of B and A. Given the Importantly, morphological rules – both in tradi- tree structure above, there is a clear pattern of more- tional DM and in our formalism – are stated over embedded items being dependent on less-embedded underspecified feature structures: they apply to any items morphosyntactically; however, such depen- item whose features match the structural descrip- dencies cannot be reduced to linear order. tion. Put in practice, each underspecified rule like Such effects in morphology are handled by ap- the ones in (5) can be converted, via a well-defined plying VI rules from the root outwards – which is procedure, into a set of instances – regular relations impossible after linearization. Fortunately, using over the alphabet of fully specified FSs. This alpha- transducer composition to build our morphology out bet is guaranteed to be finite, as the set of features of individual rules imposes a strict rule ordering. and Σ are finite by definition. Relations encoding For this ordering, we can follow the standard or- rules are then composed into a single regular rela- der of projections, working from bottom to top (e.g. tion (transducer), following the procedure laid out V T C in the clausal domain). In this way, we re- → → in (Kaplan and Kay, 1994). produce Cyclicity effects over strings. For example, 179 with cases of phonologically conditioned allomor- (12) > phy like English DPs of the form [DPD[NPN]], we can T, PRS > { } capture the a/an allomorphy by having rules that ap- None SPACE,D E ply to FSs containing N precede rules that target FSs { > TRANSLOC containing D. Likewise for examples like (7) above, None } ASP, P,D E { arranging VI in the C B A order yields the right < S-CLASS → → None } pattern with no need for tree structure. D E V, 3, SG Thus, trees are not necessary to produce Cyclic- { None } > ity effects. However, it is not even clear that D E FIND, µ, { { C-CLASS S-CLASS Cyclicity/Rewriting are desirable constraints for our None } None } D E D E grammar, as there are numerous apparent counter- The solution proposed by Deal and Wolf (2017) examples to Rewriting (Gribanova and Harizanov, involves weakening of both Rewriting and Cyclicity. 2017) and Cyclicity (Svenonius, 2012). The former is replaced with Monotonicity: instead An interesting case is Nez Perce (Deal and Wolf, of deleting morphosyntactic features, VI strictly 2017). The verb suffix denoted as µ exhibits al- adds information. The latter is restated in terms of lomorphy conditioned by the class feature of the cycles, roughly corresponding to syntactic phases. root, as exemplified by the so-called ‘S-class’ verb Within a cycle, lexical items can be inserted in any eat in (8) and the ‘C-class’ verb find in (9). This order, constrained only by their phonological con- is an instance of inward sensitivity and, since class text requirements. Cycles themselves are spelled out membership is a morphosyntactic feature, a counter- in an inside-out fashion, one at a time. As there is example to Rewriting: a phase boundary above Aspect, the realization of µ can depend on the phonological exponent of Aspect, (8) ’aw- ’ap- an’y- o’- qa 3OBJ- eat- µ- PROSP.ASP- PST but not of a morpheme belonging to the higher cycle.

Load more