Empty Category Detection Using Path Features and Distributed Case Frames

Total Page:16

File Type:pdf, Size:1020Kb

Empty Category Detection Using Path Features and Distributed Case Frames Empty Category Detection using Path Features and Distributed Case Frames Shunsuke Takeno†, Masaaki Nagata‡, Kazuhide Yamamoto† †Nagaoka University of Technology, 1603-1 Kamitomioka, Nagaoka, Niigata, 940-2188 Japan takeno, yamamoto @jnlp.org { } ‡NTT Communication Science Laboratories, NTT Corporation, 2-4 Hikaridai, Seika-cho, Soraku-gun, Kyoto, 619-0237 Japan [email protected] Abstract unlexicalized tree fragments from empty node to its antecedent. Dienes and Dubey (2003) proposed We describe an approach for machine a machine learning-based “trace tagger” as a pre- learning-based empty category detection process of parsing. Campbell (2004) proposed a that is based on the phrase structure analy- rule-based post-processing method based on lin- sis of Japanese. The problem is formal- guistically motivated rules. Gabbard et al. (2006) ized as tree node classification, and we replaced the rules with machine learning-based find that the path feature, the sequence of classifiers. Schmid (2006) and Cai et al. (2011) node labels from the current node to the integrated empty category detection with the syn- root, is highly effective. We also find that tactic parsing. the set of dot products between the word Empty category detection for pro (dropped pro- embeddings for a verb and those for case nouns or zero pronoun) has begun to receive at- particles can be used as a substitution for tention as the Chinese Penn Treebank (Xue et al., case frames. Experiments show that the 2005) has annotations for pro as well as PRO proposed method outperforms the previ- and trace. Xue and Yang (2013) formalized the ous state-of the art method by 68.6% to problem as classifying each pair of the location 73.2% in terms of F-measure. of empty category and its head word in the de- pendency structure. Wang et al. (2015) pro- 1 Introduction posed a joint embedding of empty categories and Empty categories are phonetically null elements their contexts on dependency structure. Xiang et that are used for representing dropped pro- al. (2013) formalized the problem as classifying nouns (“pro” or “small pro”), controlled elements each IP node (roughly corresponds to S and SBAR (“PRO” or “big pro”) and traces of movement in Penn Treebank) in the phrase structure. (“T” or “trace”), such as WH-questions and rela- In this paper, we propose a novel method for tive clauses. They are important for pro-drop lan- empty category detection for Japanese that uses guages such as Japanese, in particular, for the ma- conjunction features on phrase structure and word chine translation from pro-drop languages to non- embeddings. We use the Keyaki Treebank (But- pro-drop languages such as English. Chung and ler et al., 2012), which is a recent development. Gildea (2010) reported their recover of empty cat- As it has annotations for pro and trace, we show egories improved the accuracy of machine trans- our method has substantial improvements over the lation both in Korean and in Chinese. Kudo et state-of-the-art machine learning-based method al. (2014) showed that generating zero subjects in (Xiang et al., 2013) for Chinese empty category Japanese improved the accuracy of preordering- detection as well as linguistically-motivated man- based translation. ually written rule-based method similar to (Camp- State-of-the-art statistical syntactic parsers had bell, 2004). typically ignored empty categories. Although 2 Baseline systems Penn Treebank (Marcus et al., 1993) has annota- tions on PRO and trace, they provide only labeled The Keyaki Treebank annotates the phrase struc- bracketing. Johnson (2002) proposed a statistical ture with functional information for Japanese sen- pattern-matching algorithm for post-processing tences following a scheme adapted from the Anno- the results of syntactic parsing based on minimal tation manual for the Penn Historical Corpora and 1335 Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pages 1335–1340, Lisbon, Portugal, 17-21 September 2015. c 2015 Association for Computational Linguistics. IP-MAT IP-MAT:*pro*-SBJ@0 NP-SBJ PP NP-OB1 VB VB2 AXD PU PP VB VB2 AXD PU *pro* NP P *を * 連れ 戻し た 。 NP P 連れ 戻し た 。 tsure modoshi ta tsure modoshi ta brought-back . brought-back . IP-REL N を IP-REL:*T*-SBJ@0 N を wo wo NP-SBJ VB VB0 AXD 娘 VB VB0 AXD 娘 musume → musume a daughter a daughter *T* 家出 し た 家出 し た iede shi ta iede shi ta who ran-away-from-home ran-away-from-home Figure 1: An annotation example of 家出し た娘を 連れ戻し た。 (*pro* brought back a daughter who ran away from home.) in Keyaki Treebank. (The left tree is the original tree and the right tree is a converted tree based on Xiang et al.’s (2013) formalism) the PCEEC (Santorini, 2010). There are some ma- node, and ei be the empty category tag associated jor changes: the VP level of structure is typically with ti. The probability model of (Xiang et al., absent, function is marked on all clausal nodes 2013) is formulated as MaxEnt model: n (such as IP-REL and CP-THT) and all NPs that n i 1 P (e T )= P (e e − ,T ) are clause level constituents (such as NP-SBJ). 1 | i| 1 i=1 Disambiguation tags are also used for clarifying Yn exp(θ φ(e ,ei 1,T )) the functions of its immediately preceding node, i 1− = · i 1 (1) such as NP-OBJ *を *(wo) for PP, however, we re- Z(e1− ,T ) Yi=1 moved them in our experiment. where φ is a feature vector, θ is a weight vector to Keyaki Treebank has annotation for trace mark- φ and Z is normalization factor: ers of relative clauses (*T*) and dropped pronouns i 1 i 1 (*pro*), however, it deliberately has no annota- Z(e − ,T )= exp(θ φ(e,e − ,T )) 1 · 1 tion for control dependencies (PRO) (Butler et al., e X∈E 2015). It has also fine grained empty categories where represents the set of all empty category E of *pro* such as *speaker* and *hearer*, but we types to be detected. unified them into *pro* in our experiment. Xiang et al. (2013) grouped their features into HARUNIWA(Fang et al., 2014) is a Japanese four types: tree label features, lexical features, phrase structure parser trained on the treebank. It empty category features and conjunction features has a rule-based post-processor for adding empty as shown in Table 1. As the features for (Xiang et categories, which is similar to (Campbell, 2004). al., 2013) were developed for Chinese Penn Tree- We call it RULE in later sections and use it as one bank, we modify their features for Keyaki Tree- of two baselines. bank: First, the traversal order is changed from We also use Xiang et al’s (2013) model as an- post-order (bottom-up) to pre-order (top-down). other baseline. It formulates empty category de- As PROs are implicit in Keyaki Treebank, the de- tection as the classification of IP nodes. For ex- cisions on IPs in lower levels depend on those on ample, in Figure 1, empty nodes in the left tree higher levels in the tree. Second, empty category are removed and encoded as additional labels with features are extracted from ancestor IP nodes, not its position information to IP nodes in the right from descendant IP nodes, in accordance with the tree. As we can uniquely decode them from the first change. extended IP labels, the problem is to predict the Table 2 shows the accuracies of Japanese empty labels for the input tree that has no empty nodes. category detection, using the original and our Let T = t t t be the sequence of nodes modification of the (Xiang et al., 2013) with ab- 1 2 ··· n produced by the post-order traversal from root lation test. We find that the conjunction features 1336 Tree label features Head word feature (HEAD) is the surface form 1 current node label 2 parent node label of the lexical head of the current node. Child fea- 3 grand-parent node label ture (CHILD) is the set of labels for the children of 4 left-most child label or POS tag the current node. The label is augmented with the 5 right-most child label or POS tag 6 label or POS tag of the head child surface form of the rightmost terminal node if it is 7 the number of child nodes a function word. In the example of Figure 1, if the 8 one level CFG rule current node is IP-MAT, HEAD is 連れ (tsure) and 9 left-sibling label or POS tag (up to two siblings) 10 right-sibling label or POS tag (up to two siblings) CHILD includes: PP-を (wo), VB, VB2, AXD-た Lexical features (ta) and PU-。 . Empty category feature (EC) is a 11 left-most word under the current node set of empty categories detected in the ancestor IP 12 right-most word under the current node 13 word immediately left to the span of the current node nodes. For example in Figure 1, if the current node 14 word immediately right to the span of the current node is IP-REL, EC is *pro*. 15 head word of the current node 16 head word of the parent node We then combine the PATH with others. If the 17 is the current node head child of its parent? (binary) current node is the IP-MAT node in right-half of Empty category features Figure 1, the combination of PATH and HEAD 18 predicted empty categories of the left sibling is:IP-MAT 連れ (tsure) and the combinations 19* the set of detected empty categories of ancestor nodes × of PATH and CHILD are: IP-MAT PP-を (wo), Conjunction features × 20 current node label with parent node label IP-MAT VB, IP-MAT VB2, IP-MAT AXD-た × × × 21* current node label with features computed from ances- (ta) and IP-MAT PU-。 .
Recommended publications
  • 2008. Pruning Some Branches from 'Branching Spacetimes'
    CHAPTER 10 Pruning Some Branches from “Branching Spacetimes” John Earman* Abstract Discussions of branching time and branching spacetime have become com- mon in the philosophical literature. If properly understood, these concep- tions can be harmless. But they are sometimes used in the service of debat- able and even downright pernicious doctrines. The purpose of this chapter is to identify the pernicious branching and prune it back. 1. INTRODUCTION Talk of “branching time” and “branching spacetime” is wide spread in the philo- sophical literature. Such expressions, if properly understood, can be innocuous. But they are sometimes used in the service of debatable and even downright per- nicious doctrines. The purpose of this paper is to identify the pernicious branching and prune it back. Section 2 distinguishes three types of spacetime branching: individual branch- ing, ensemble branching, and Belnap branching. Individual branching, as the name indicates, involves a branching structure in individual spacetime models. It is argued that such branching is neither necessary nor sufficient for indeterminism, which is explicated in terms of the branching in the ensemble of spacetime mod- els satisfying the laws of physics. Belnap branching refers to the sort of branching used by the Belnap school of branching spacetimes. An attempt is made to sit- uate this sort of branching with respect to ensemble branching and individual branching. Section 3 is a sustained critique of various ways of trying to imple- ment individual branching for relativistic spacetimes. Conclusions are given in Section 4. * Department of History and Philosophy of Science, University of Pittsburgh, Pittsburgh, USA The Ontology of Spacetime II © Elsevier BV ISSN 1871-1774, DOI: 10.1016/S1871-1774(08)00010-7 All rights reserved 187 188 Pruning Some Branches from “Branching Spacetimes” 2.
    [Show full text]
  • Layers and Operators in Lakota1 Avelino Corral Esteban Universidad Autónoma De Madrid
    Kansas Working Papers in Linguistics, Vol. 36 (2015), 1-33 Layers and operators in Lakota1 Avelino Corral Esteban Universidad Autónoma de Madrid Abstract Categories covering the expression of grammatical information such as aspect, negation, tense, mood, modality, etc., are crucial to the study of language universals. In this study, I will present an analysis of the syntax and semantics of these grammatical categories in Lakota within the Role and Reference Grammar framework (hereafter RRG) (Van Valin 1993, 2005; Van Valin and LaPolla 1997), a functional approach in which elements with a purely grammatical function are treated as ´operators`. Many languages mark Aspect-Tense- Mood/Modality information (henceforth ATM) either morphologically or syntactically. Unlike most Native American languages, which exhibit an extremely complex verbal morphological system indicating this grammatical information, Lakota, a Siouan language with a mildly synthetic / partially agglutinative morphology, expresses information relating to ATM through enclitics, auxiliary verbs and adverbs, rather than by coding it through verbal affixes. 1. Introduction The organisation of this paper is as follows: after a brief account of the most relevant morpho- syntactic features exhibited by Lakota, Section 2 attempts to shed light on the distinction between lexical words, enclitics and affixes through evidence obtained in the study of this language. Section 3 introduces the notion of ´operator` and explores the ATM system in Lakota using RRG´s theory of operator system. After a description of each grammatical category, an analysis of the linear order exhibited by the Lakota operators with respect to the nucleus of the clause are analysed in Section 4, showing that this ordering reflects the scope relations between the grammatical categories conveyed by these operators.
    [Show full text]
  • LINGUISTICS 221 Lecture #3 DISTINCTIVE FEATURES Part 1. an Utterance Is Composed of a Sequence of Discrete Segments. Is the Segm
    LINGUISTICS 221 Lecture #3 DISTINCTIVE FEATURES Part 1. An utterance is composed of a sequence of discrete segments. Is the segment indivisible? Is the segment the smallest unit of phonological analysis? If it is, segments ought to differ randomly from one another. Yet this is not true: pt k prs What is the relationship between members of the two groups? p t k - the members of this set have an internal relationship: they are all voiceles stops. p r s - no such relationship exists p b d s bilabial bilabial alveolar alveolar stop stop stop fricative voiceless voiced voiced voiceless SIMILARITIES AND DIFFERENCES! Segments may be viewed as composed of sets of properties rather than indivisible entities. We can show the relationship by listing the properties of each segment. DISTINCTIVE FEATURES • enable us to describe the segments in the world’s languages: all segments in any language can be characterized in some unique combination of features • identifies groups of segments → natural segment classes: they play a role in phonological processes and constraints • distinctive features must be referred to in terms of phonetic -- articulatory or acoustic -- characteristics. 1 Requirements on distinctive feature systems (p. 66): • they must be capable of characterizing natural segment classes • they must be capable of describing all segmental contrasts in all languages • they should be definable in phonetic terms The features fulfill three functions: a. They are capable of describing the segment: a phonetic function b. They serve to differentiate lexical items: a phonological function c. They define natural segment classes: i.e. those segments which as a group undergo similar phonological processes.
    [Show full text]
  • Definiteness and Determinacy
    Linguistics and Philosophy manuscript No. (will be inserted by the editor) Definiteness and Determinacy Elizabeth Coppock · David Beaver the date of receipt and acceptance should be inserted later Abstract This paper distinguishes between definiteness and determinacy. Defi- niteness is seen as a morphological category which, in English, marks a (weak) uniqueness presupposition, while determinacy consists in denoting an individual. Definite descriptions are argued to be fundamentally predicative, presupposing uniqueness but not existence, and to acquire existential import through general type-shifting operations that apply not only to definites, but also indefinites and possessives. Through these shifts, argumental definite descriptions may become either determinate (and thus denote an individual) or indeterminate (functioning as an existential quantifier). The latter option is observed in examples like `Anna didn't give the only invited talk at the conference', which, on its indeterminate reading, implies that there is nothing in the extension of `only invited talk at the conference'. The paper also offers a resolution of the issue of whether posses- sives are inherently indefinite or definite, suggesting that, like indefinites, they do not mark definiteness lexically, but like definites, they typically yield determinate readings due to a general preference for the shifting operation that produces them. Keywords definiteness · descriptions · possessives · predicates · type-shifting We thank Dag Haug, Reinhard Muskens, Luca Crniˇc,Cleo Condoravdi, Lucas
    [Show full text]
  • Pronominal Deixis in East Caucasian Stability and Renewals
    PRONOMINAL DEIXIS IN EAST CAUCASIAN STABILITY AND RENEWALS Gilles Authier EPHE, PSL Online course on East Caucasian languages HSE University Linguistic Convergence Laboratory Deixis 1) person (speaker vs adressee), 2) space (distance, mono‐ or polycentric), 3) time ‘Did you write this (yesterday)?’ (Temporal deixis in East Caucasian is, like in European languages, marked by bound morphems on verbs, and adverbs) 2 locus of deixis: matter for a series of lectures Free words Personal pronouns I, you, we Demonstratives this / that (up/level/down) Adverbs there (up/level/down) Copula / predicatives is / enter / comes / goes (up/level/down) Bound elements Person markers go‐es verbal ‘satellites’ and affixes go in / en‐ter (up/level/down) Flagging (cases) on/at/under, etc 3 Outlines What I will speak about: Diachrony 1) demonstratives: grouping, branching based on innovations 2) personal pronouns: branching based on innovations, avoidance‐based replacements, (analogy), with a focus on clusivity What I will not have time to speak about: synchronic systems ‐ morphology ‐ grammaticalization ‐ morphosyntax ‐ syntax 4 1) E‐C demonstrative (pronouns) Adnominal vs pronominal demonstratives usually the same in EC (exception: Kryz pronoun u‐ / am / a‐ vs adnominal lu‐ / lam / la‐) The systems are not uniform across the family, with some languages or branches showing simple, horizontal systems (but at least three roots are used ‐ no ‘minimal’ systems). while others have elevational distinctions 5 a horizontal system: Nakh Chechen Ingush Tsova‐Tush / Batsbi PROX MED / 3 DIST PROX MED / 3 DIST PROX MED DIST / 3 hara i / iza dʕora jer ɨz dʕāra‐D‐ar e is o only distance / person oriented (grammars not always clear on this) higher: laqara lower: laxara 6 Horizontal systems are geographically (and phylogenetically) peripheral: Peripheral features = retentions..
    [Show full text]
  • Arxiv:2106.08037V1 [Cs.CL] 15 Jun 2021 Alternative Ways the World Could Be
    The Possible, the Plausible, and the Desirable: Event-Based Modality Detection for Language Processing Valentina Pyatkin∗ Shoval Sadde∗ Aynat Rubinstein Bar Ilan University Bar Ilan University Hebrew University of Jerusalem [email protected] [email protected] [email protected] Paul Portner Reut Tsarfaty Georgetown University Bar Ilan University [email protected] [email protected] Abstract (1) a. We presented a paper at ACL’19. Modality is the linguistic ability to describe b. We did not present a paper at ACL’20. events with added information such as how de- sirable, plausible, or feasible they are. Modal- The propositional content p =“present a paper at ity is important for many NLP downstream ACL’X” can be easily verified for sentences (1a)- tasks such as the detection of hedging, uncer- (1b) by looking up the proceedings of the confer- tainty, speculation, and more. Previous studies ence to (dis)prove the existence of the relevant pub- that address modality detection in NLP often p restrict modal expressions to a closed syntac- lication. The same proposition is still referred to tic class, and the modal sense labels are vastly in sentences (2a)–(2d), but now in each one, p is different across different studies, lacking an ac- described from a different perspective: cepted standard. Furthermore, these senses are often analyzed independently of the events that (2) a. We aim to present a paper at ACL’21. they modify. This work builds on the theoreti- b. We want to present a paper at ACL’21. cal foundations of the Georgetown Gradable Modal Expressions (GME) work by Rubin- c.
    [Show full text]
  • Serial Verb Constructions Revisited: a Case Study from Koro
    Serial Verb Constructions Revisited: A Case Study from Koro By Jessica Cleary-Kemp A dissertation submitted in partial satisfaction of the requirements for the degree of Doctor of Philosophy in Linguistics in the Graduate Division of the University of California, Berkeley Committee in charge: Associate Professor Lev D. Michael, Chair Assistant Professor Peter S. Jenks Professor William F. Hanks Summer 2015 © Copyright by Jessica Cleary-Kemp All Rights Reserved Abstract Serial Verb Constructions Revisited: A Case Study from Koro by Jessica Cleary-Kemp Doctor of Philosophy in Linguistics University of California, Berkeley Associate Professor Lev D. Michael, Chair In this dissertation a methodology for identifying and analyzing serial verb constructions (SVCs) is developed, and its application is exemplified through an analysis of SVCs in Koro, an Oceanic language of Papua New Guinea. SVCs involve two main verbs that form a single predicate and share at least one of their arguments. In addition, they have shared values for tense, aspect, and mood, and they denote a single event. The unique syntactic and semantic properties of SVCs present a number of theoretical challenges, and thus they have invited great interest from syntacticians and typologists alike. But characterizing the nature of SVCs and making generalizations about the typology of serializing languages has proven difficult. There is still debate about both the surface properties of SVCs and their underlying syntactic structure. The current work addresses some of these issues by approaching serialization from two angles: the typological and the language-specific. On the typological front, it refines the definition of ‘SVC’ and develops a principled set of cross-linguistically applicable diagnostics.
    [Show full text]
  • Branching Time and the Semantics of Future Contingents
    Branching Time and the Semantics of Future Contingents Jacek Wawer PhD dissertation prepared under the supervision of Prof. dr hab. Tomasz Placek Institute of Philosophy Jagiellonian University Kraków, Poland Acknowledgments First and foremost, I would like to express my gratitude to Professor Tomasz Placek, the supervisor of my dissertation. Over a decade ago, he introduced me to the world of branching and has helped me to travel it ever since. The assistance he provided during my work cannot be overestimated. The philosophical community was impoverished due to his dedication, but I benefited enormously. Also, thanks to his continuous sup- port, I had the best possible environment to develop my thought. I have never been an obedient student and often challenged, rather than developed his philosophical views. Nonetheless, he only doubled his support and helped me to express my views better than I could have done it myself. I count myself to be immensely lucky to be his student. I would like to thank Professors Fabrice Correia, John MacFarlane, and Thomas Müller who mentored me during my research visits in Geneva, Berkeley, and Konstanz respectively. Many of their helpful suggestions influenced my views. I am indebted to Alex Malpass, Leszek Wronski,´ Juliusz Doboszewski, Antje Rumberg, and Michał Marczyk for their feedback on various fragments of this work and many inspiring dis- cussions which helped me clarify my ideas and recognize new paths. I would have not accomplished this work, if not for the continuous support and encouragement of my wife, Karolina. My research was possible due to the financial, administrative, and scientific sup- port of many institutions.
    [Show full text]
  • 1 English Subjectless Tagged Sentences Paul Kay Department Of
    1 English subjectless tagged sentences Paul Kay Department of Linguistics University of California Berkeley, CA 94720 [email protected] 2 Abstract A colloquial English sentence like Fooled us, didn't they? contains a finite main verb but no expressed subject. The identity of the missing subject of fooled is recovered from the tag subject they: compare Fooled us, didn't she?, Fooled us, didn't you? This paper argues (1) that such subjectless tagged sentences (STSs) pose a problem for grammatical approaches based on movement and empty categories and (2) that STSs receive a revealing analysis as part of a finely articulated family of tagged sentence constructions when viewed within a non-derivational, constructional, multiple-inheritance-based approach.* *I would like to thank Peter Culicover, Liliane Haegeman, Charles Fillmore Andreas Kathol and Richard Oehrle for comments on previous versions of this paper, as well as an anonymous reviewer for Language. They have doubtless offered more good advice than I have accepted. 3 0. Introduction. It has been argued from several points of view that whatever can be done with empty categories (ecs) can be done without them (Ades and Steedman 1982, Gazdar et al. 1984, Kaplan and Zaenen 1989, Pollard and Sag 1994 chapter 9, Sag and Fodor 1994, Kay and Fillmore 1999, Sag 1999). It has also been argued that, because there is no hard evidence for their existence, linguistic theory would be better off dispensing with these unobservable entities (Pickering and Barry 1991, Sag and Fodor 1994, Sag 1999).1 The present paper purports to take the argument one step further by showing that there are things that can be done without empty categories that cannot be done with them, at least not with any of the ecs currently available.
    [Show full text]
  • The Syntax of Answers to Negative Yes/No-Questions in English Anders Holmberg Newcastle University
    The syntax of answers to negative yes/no-questions in English Anders Holmberg Newcastle University 1. Introduction This paper will argue that answers to polar questions or yes/no-questions (YNQs) in English are elliptical expressions with basically the structure (1), where IP is identical to the LF of the IP of the question, containing a polarity variable with two possible values, affirmative or negative, which is assigned a value by the focused polarity expression. (1) yes/no Foc [IP ...x... ] The crucial data come from answers to negative questions. English turns out to have a fairly complicated system, with variation depending on which negation is used. The meaning of the answer yes in (2) is straightforward, affirming that John is coming. (2) Q(uestion): Isn’t John coming, too? A(nswer): Yes. (‘John is coming.’) In (3) (for speakers who accept this question as well formed), 1 the meaning of yes alone is indeterminate, and it is therefore not a felicitous answer in this context. The longer version is fine, affirming that John is coming. (3) Q: Isn’t John coming, either? A: a. #Yes. b. Yes, he is. In (4), there is variation regarding the interpretation of yes. Depending on the context it can be a confirmation of the negation in the question, meaning ‘John is not coming’. In other contexts it will be an infelicitous answer, as in (3). (4) Q: Is John not coming? A: a. Yes. (‘John is not coming.’) b. #Yes. In all three cases the (bare) answer no is unambiguous, meaning that John is not coming.
    [Show full text]
  • Sequence-Of-Tense and the Features of Finite Tenses Karen Zagona University of Washington*
    Sequence-of-tense and the Features of Finite Tenses Karen Zagona University of Washington* Abstract Sequence-of-tense (SOT) is often described as a (past) tense verb form that does not correspond to a semantically interpretable tense. Since SOT clauses behave in other respects like finite clauses, the question arises as to whether the syntactic category Tense has to be distinguished from the functional category tense. I claim that SOT clauses do in fact contain interpretable PRESENT tense. The “past” form is analyzed as a manifestation of agreement with the (matrix past) controller of the SOT clause evaluation time. One implication of this analysis is that finite verb forms should be analyzed as representing features that correspond to functional categories higher in clause structure, including those of the clausal left periphery. SOT morphology then sheds light on the existence of a series of finer- grained functional heads that contribute to tense construal, and to verbal paradigms. These include Tense, Modality and Force. 1. Introduction The phenomenon of sequence-of-tense (SOT) poses several challenges for the standard assumption that a “past tense” verb form signals the presence of a functional category in clause structure with an interpretable ‘past’ value. SOT is illustrated by the ‘simultaneous’ reading of sentence (1): (1) Terry believed that Sue was pregnant. a. The time of Sue’s pregnancy precedes time of Terry’s belief (precedence) b. The time of Sue’ pregnancy overlaps time of Terry’s belief (simultaneity) For the ‘precedence’ reading in (1a), the embedded clause tense is semantically transparent in the sense that the past form was corresponds to a past ordering relation (relative to the time of Terry’s belief).
    [Show full text]
  • Topic Structures in Chinese Author(S): Xu Liejiong and D
    Linguistic Society of America Topic Structures in Chinese Author(s): Xu Liejiong and D. Terence Langendoen Source: Language, Vol. 61, No. 1 (Mar., 1985), pp. 1-27 Published by: Linguistic Society of America Stable URL: http://www.jstor.org/stable/413419 Accessed: 11/05/2009 22:37 Your use of the JSTOR archive indicates your acceptance of JSTOR's Terms and Conditions of Use, available at http://www.jstor.org/page/info/about/policies/terms.jsp. JSTOR's Terms and Conditions of Use provides, in part, that unless you have obtained prior permission, you may not download an entire issue of a journal or multiple copies of articles, and you may use content in the JSTOR archive only for your personal, non-commercial use. Please contact the publisher regarding any further use of this work. Publisher contact information may be obtained at http://www.jstor.org/action/showPublisher?publisherCode=lsa. Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission. JSTOR is a not-for-profit organization founded in 1995 to build trusted digital archives for scholarship. We work with the scholarly community to preserve their work and the materials they rely upon, and to build a common research platform that promotes the discovery and use of these resources. For more information about JSTOR, please contact [email protected]. Linguistic Society of America is collaborating with JSTOR to digitize, preserve and extend access to Language. http://www.jstor.org TOPIC STRUCTURES IN CHINESE Xu LIEJIONG D.
    [Show full text]