Pronoun Prediction with Linguistic Features and Example Weighing

Total Page:16

File Type:pdf, Size:1020Kb

Pronoun Prediction with Linguistic Features and Example Weighing Pronoun Prediction with Linguistic Features and Example Weighing Michal Novak´ Charles University in Prague, Faculty of Mathematics and Physics Institute of Formal and Applied Linguistics Malostranske´ nam´ estˇ ´ı 25, CZ-11800 Prague 1 [email protected] Abstract We take a machine learning approach to the problem and apply a linear classifier. Our ap- We present a system submitted to the proach combines features coming from the target WMT16 shared task in cross-lingual language model with features extracted from the pronoun prediction, in particular, to linguistically analyzed source and target texts. We the English-to-German and German-to- also introduce training example weighing, which English sub-tasks. The system is based aims at improving the prediction accuracy of less on a linear classifier making use of fea- populated target pronouns. All the source codes tures both from the target language model used to build the system are publicly available.3 and from linguistically analyzed source According to the WMT16 pronoun translation and target texts. Furthermore, we apply shared task results (Guillou et al., 2016), our best example weighing in classifier learning, German-to-English system ranks in the middle of which proved to be beneficial for recall the pack while our English-to-German systems in less frequent pronoun classes. Com- seem to be the poorest. However, after the shared pared to other shared task participants, our task submission deadline, we discovered an error best English-to-German system is able to in post-processing of the classifier predictions on rank just below the top performing sub- the evaluation set for the English-to-German di- missions. rection. After correcting this error, our system reaches the 2nd best result for this language di- 1 Introduction rection. Previous works concerning translation of pro- The paper is structured as follows. After intro- nouns1 have shown that unlike other words, pro- ducing the related work in Section 7, we describe nouns require a special treatment. Context and tar- three preprocessing components of our system that get language grammar influence pronoun transla- enrich the input data with additional information tion much more profoundly than the translation of in Section 2. Section 3 then presents features ex- parts-of-speech carrying lexical information. tracted from the data whereas Section 4 gives more This paper presents a system for the WMT16 details about the method used to train the model. shared task of cross-lingual pronoun prediction In Section 5, all our system configurations submit- (Guillou et al., 2016),2 the task that looks at the ted to the shared tasks are evaluated. Finally, we problem of pronoun translation in a more sim- examine the effect of individual features and ex- plified way. Here, the objective is to predict ample weighing in Section 6 before we conclude a target language pronoun from a set of pos- in Section 8. sible candidates, given source text, lemmatized and part-of-speech-tagged target text, and auto- 2 Preprocessing components matic word alignment. We address specifically the The preprocessing stage combines three compo- sub-tasks of English-to-German and German-to- nents, each of them enriching the input data with English pronoun prediction. additional information: a target language model, 1Summarized by Hardmeier (2014). an automatic linguistic analysis of the source sen- 2http://www.statmt.org/wmt16/pronoun-task. html 3https://github.com/ufal/wmt16-pronouns 602 Proceedings of the First Conference on Machine Translation, Volume 2: Shared Task Papers, pages 602–608, Berlin, Germany, August 11-12, 2016. c 2016 Association for Computational Linguistics tences, and a basic automatic analysis of the target of being anaphoric by the NADA tool (Bergsma sentences. and Yarowsky, 2011). 7 2.1 Target language model German. We utilized the MATE tools (Bjorkelund¨ et al., 2010) to perform part- For language modeling, we employed the KenLM of-speech tagging, morphological analysis Language Model Toolkit (Heafield et al., 2013), an (necessary to obtain grammatical categories efficient implementation of large language models such as gender or number), and transition-based with modified Kneser-Ney smoothing (Kneser and dependency parsing (Bohnet and Nivre, 2012; Ney, 1995). Seeker and Kuhn, 2012). Lemmatized 5-gram models for English and German have been supplied as a baseline system 2.3 Target language analysis by the organizers of the shared task. An integral In the data supplied by the task organizers, the for- part of the baseline system is a wrapper script4 mat of the target language sentences differs from performing necessary preprocessing before the ac- the source language format. Not only are the target tual probability estimation. For instance, it selects words to be predicted replaced by a placeholder, words which may possibly belong to the OTHER but all other tokens are also substituted with cor- class5 and it enables setting a penalty for pre- responding lemmas and coarse-grained part-of- ferring an empty word.6 We only adjusted the speech tags. wrapper script so that it fits into our processing For this reason, we needed to simplify the anal- pipeline, making no modifications to the estima- ysis of target texts. The parsers used for source tion machinery. texts do not accept the tagset used by the organiz- 2.2 Source language analysis ers. There are two possible solutions to fix this disagreement: either running a part-of-speech tag- In the input data supplied by the task organiz- ger producing tags that agree with the tagset re- ers, source text is represented as plain tokenized quired by the parser, or obtaining suitable part- sentences. We have processed the source texts of-speech tags by a transformation of the origi- with tools obtaining additional linguistic analy- nal tagset. However, both options are prone to er- sis. However, due to different availability of rors. In the former option, the tags produced in these tools for English and German, the depth of this way would definitely be of low quality as only the analysis differs. We describe both analysis a lemmatized text is available. This would cause pipelines separately in the following: problems especially for German. The latter option brings another problem. The original tagsets (12 English. English source texts have been ana- lyzed up to the level of deep syntax using the Treex tags in both English and German) are more coarse- framework (Popel and Zabokrtskˇ y,´ 2010) incor- grained than the tagsets required by the parsers (44 porating several external tools. The processing and 53 tags in English and German, respectively), pipeline consists of part-of-speech tagging with which makes the transformation in this direction the Morceˇ tool (Spoustova´ et al., 2007) depen- difficult. dency parsing conducted by the MST parser (Mc- Due to these obstacles, we decided to abandon Donald et al., 2005), semantic role labeling (Bo- any additional linguistic processing except for the jar et al., 2016), and coreference resolution ob- identification of noun genders. We consider gen- tained as a combination of Treex coreference mod- der and number information one of the most valu- ules and the Bart 2 toolkit (Versley et al., 2008; able inputs for correct pronoun translation. While Uryupina et al., 2012). Prior to the last step, all in- the number information is hard to reconstruct from stances of the pronoun it are assigned a probability a lemmatized text with part-of-speech tags having no indication of grammatical number, gender can 4https://bitbucket.org/yannick/discomt_ be reconstructed from a noun lemma itself quite baseline/src satisfactorily. In each of the languages, we ap- 5The OTHER class comprise words, not necessarily pro- nouns, that appear often enough in the context typical for pro- proached the task of obtaining gender for a given nouns to be resolved but not enough to form their own class. noun in a different way. Furthermore, it can be an empty word if the source pronoun has no target language counterpart. 7https://code.google.com/archive/p/ 6In all experiments, we used zero penalty. mate-tools/ 603 English. The gender information was obtained dependency tree, and combinations of the previous using the data collected by Bergsma and Lin features. As the analysis of English goes deeper (2006).8 They used paths in dependency trees to than the surface syntax, we include the semantic learn the likelihood of coreference between a pro- function of the source counterpart. If the counter- noun and a noun candidate and then applied them part is an instance of the pronoun it, we add the in a bootstrapping fashion on larger data to obtain anaphoricity probability estimated by the NADA a noun gender and number distribution in different detector, quantized in the same way as the proba- contexts. bilities coming from the KenLM model. For the sake of simplicity, we filtered their list only to single-word items. If we encounter a token Target language features. The lemma of a par- with a noun tag assigned in the target sentence, its ent verb of the target pronoun placeholder might lemma is looked up in the list and assigned the also be a valuable feature. Even though we have most probable gender, if any is found. Otherwise, not performed a syntactic analysis on the target the neuter gender is assumed. text (see Section 2.3), we are still able to approx- imate it in several ways. The easiest option is German. We run the MATE morphological to list all verb lemmas that appear in a relatively analysis separately for every lemma labeled as a small context surrounding the placeholder (1, 3, or noun. If no gender information is obtained, the 5 words). Another approach is to project the par- noun is assigned the neuter gender.
Recommended publications
  • Nouns, Adjectives, Verbs, and Adverbs
    Unit 1: The Parts of Speech Noun—a person, place, thing, or idea Name: Person: boy Kate mom Place: house Minnesota ocean Adverbs—describe verbs, adjectives, and other Thing: car desk phone adverbs Idea: freedom prejudice sadness --------------------------------------------------------------- Answers the questions how, when, where, and to Pronoun—a word that takes the place of a noun. what extent Instead of… Kate – she car – it Many words ending in “ly” are adverbs: quickly, smoothly, truly A few other pronouns: he, they, I, you, we, them, who, everyone, anybody, that, many, both, few A few other adverbs: yesterday, ever, rather, quite, earlier --------------------------------------------------------------- --------------------------------------------------------------- Adjective—describes a noun or pronoun Prepositions—show the relationship between a noun or pronoun and another word in the sentence. Answers the questions what kind, which one, how They begin a prepositional phrase, which has a many, and how much noun or pronoun after it, called the object. Articles are a sub category of adjectives and include Think of the box (things you have do to a box). the following three words: a, an, the Some prepositions: over, under, on, from, of, at, old car (what kind) that car (which one) two cars (how many) through, in, next to, against, like --------------------------------------------------------------- Conjunctions—connecting words. --------------------------------------------------------------- Connect ideas and/or sentence parts. Verb—action, condition, or state of being FANBOYS (for, and, nor, but, or, yet, so) Action (things you can do)—think, run, jump, climb, eat, grow A few other conjunctions are found at the beginning of a sentence: however, while, since, because Linking (or helping)—am, is, are, was, were --------------------------------------------------------------- Interjections—show emotion.
    [Show full text]
  • Pronouns: a Resource Supporting Transgender and Gender Nonconforming (Gnc) Educators and Students
    PRONOUNS: A RESOURCE SUPPORTING TRANSGENDER AND GENDER NONCONFORMING (GNC) EDUCATORS AND STUDENTS Why focus on pronouns? You may have noticed that people are sharing their pronouns in introductions, on nametags, and when GSA meetings begin. This is happening to make spaces more inclusive of transgender, gender nonconforming, and gender non-binary people. Including pronouns is a first step toward respecting people’s gender identity, working against cisnormativity, and creating a more welcoming space for people of all genders. How is this more inclusive? People’s pronouns relate to their gender identity. For example, someone who identifies as a woman may use the pronouns “she/her.” We do not want to assume people’s gender identity based on gender expression (typically shown through clothing, hairstyle, mannerisms, etc.) By providing an opportunity for people to share their pronouns, you're showing that you're not assuming what their gender identity is based on their appearance. If this is the first time you're thinking about your pronoun, you may want to reflect on the privilege of having a gender identity that is the same as the sex assigned to you at birth. Where do I start? Include pronouns on nametags and during introductions. Be cognizant of your audience, and be prepared to use this resource and other resources (listed below) to answer questions about why you are making pronouns visible. If your group of students or educators has never thought about gender-neutral language or pronouns, you can use this resource as an entry point. What if I don’t want to share my pronouns? That’s ok! Providing space and opportunity for people to share their pronouns does not mean that everyone feels comfortable or needs to share their pronouns.
    [Show full text]
  • Pronoun Verb Agreement Exercises
    Pronoun Verb Agreement Exercises hammercloth.Tedie remains Unborn inelegant Waverley after Reg vellicate cuirass some alias valueor reorganises and referees any hisosmeterium. crucian so Prussian stingingly! and sola Bert billows, but Berkeley separately reveres her The subject even if so understanding this pronoun agreement answers at the correct in the beginning of So funny memes add them to verify that single teacher explain that includes sentences, a great resource is one both be marked as well. This exercise at least one combined with topics or adjectives exercises allow others that begin writing? Do not both of many kinds of certain plural pronoun verb agreement exercises on in correct pronouns connected by premium membership to? One pronoun agreement exercise together your quiz! This is the currently selected item. Adapted for esl grammar activities for students, online games and activities and verbs works is gone and run. Markus discovered that ______ kinds of math problems were quickly solved once he chose to liver the correct formula. Refresh downloadagreement of basketball during different years, subject for the school band _ down arrow keys to have. To determine whether to use a singular or plural verb with an indefinite pronoun, consider the noun that the pronoun would refer to. Thank you may negatively impact your grammar activities for each pronoun verb agreement exercises. Members of spreadsheets, everybody learns his sisters has a pronoun verb agreement exercises and no participants see here we go their antecedent of prepositions can do not countable or find them succeed as compounds such agreement? Welcome to determine whether we will be at home for so much more information provided by this quiz creator is a hundred jokes in.
    [Show full text]
  • Read Each Sentence and Rewrite the Underlined Word Or Words Using a Possessive Pronoun Or a Possessive Pronoun and a Noun
    Grammar: Possessive Pronouns Name • A possessive pronoun takes the place of a possessive noun. It does not have an apostrophe. • A possessive pronoun must match the noun it replaces in number and gender. Read each sentence and rewrite the underlined word or words using a possessive pronoun or a possessive pronoun and a noun. 1. My uncle’s ranch is in Texas. 2. I left the book that belongs to me at home. 3. Lisa’s telephone is broken. 4. The mechanic said that the car’s wheels are brand new. 5. The bedroom that belongs to you and me needs to be cleaned. 6. We watched the tiger’s cubs as they played. 7. May I borrow the necklace that belongs to you? 8. Maya and Lucy’s parrot says “Hello.” 9. The foot that belongs to me is hurting. Copyright © The McGraw-Hill Companies, Inc. Companies, McGraw-Hill The © Copyright 10. The woman’s dream was to become a lawyer. Grammar • Grade 4 • Unit 4 • Week 4 91 Grammar: Stand-Alone Possessive Pronouns Name • A possessive pronoun takes the place of a possessive noun and does not include an apostrophe. It must match the noun it replaces in number and gender. • A stand-alone possessive pronoun is not used directly before a noun. • Add -s to most pronouns that are used before a noun to form a stand- alone possessive pronoun, such as yours, hers, ours, and theirs. • Mine is a stand-alone possessive pronoun. His and its are the same when used before a noun or as a stand-alone possessive pronoun.
    [Show full text]
  • Minimal Pronouns1
    1 Minimal Pronouns1 Fake Indexicals as Windows into the Properties of Bound Variable Pronouns Angelika Kratzer University of Massachusetts at Amherst June 2006 Abstract The paper challenges the widely accepted belief that the relation between a bound variable pronoun and its antecedent is not necessarily submitted to locality constraints. It argues that the locality constraints for bound variable pronouns that are not explicitly marked as such are often hard to detect because of (a) alternative strategies that produce the illusion of true bound variable interpretations and (b) language specific spell-out noise that obscures the presence of agreement chains. To identify and control for those interfering factors, the paper focuses on ‘fake indexicals’, 1st or 2nd person pronouns with bound variable interpretations. Following up on Kratzer (1998), I argue that (non-logophoric) fake indexicals are born with an incomplete set of features and acquire the remaining features via chains of local agreement relations established in the syntax. If fake indexicals are born with an incomplete set of features, we need a principled account of what those features are. The paper derives such an account from a semantic theory of pronominal features that is in line with contemporary typological work on possible pronominal paradigms. Keywords: agreement, bound variable pronouns, fake indexicals, meaning of pronominal features, pronominal ambiguity, typologogy of pronouns. 1 . I received much appreciated feedback from audiences in Paris (CSSP, September 2005), at UC Santa Cruz (November 2005), the University of Saarbrücken (workshop on DPs and QPs, December 2005), the University of Tokyo (SALT XIII, March 2006), and the University of Tromsø (workshop on decomposition, May 2006).
    [Show full text]
  • Pronouns: Types and Definitions from Warriners English Grammar and Composition, Complete Course ©1982
    Pronouns: Types and Definitions From Warriners English Grammar and Composition, Complete Course ©1982 I. A pronoun is a word used in place of a noun or of more than one noun. Example: Anglers complained about the weather forecast. They said it had not warned them of the storm. [The pronouns they and them take the place of a noun anglers. The pronoun it takes the place of the noun forecast.] Sometimes a pronoun takes the place of another pronoun. Example: One of the film projectors is broken. It has been sent out for repair. [The pronoun it takes the place of the pronoun it.] The word to which a pronoun refers (whose place it takes) is the antecedent of the pronoun. II. Types a. Personal Pronouns take the place of nouns – either common or proper – that refer to people. I We You You He/she/it They b. Possessive pronouns always act as adjectives because they give information that limits the noun that follows. My, mine Our, ours Your, yours Your, yours His, her/hers, its Their/theirs c. Reflexive pronouns are possessive pronouns combined with –self or –selves. i. Can be used to refer back to an acting subject (Example: Carmen hurt herself.) ii. Can be used to intensify or emphasize a preceding noun or pronoun (Example: Jeff himself was not hurt.) Myself Ourselves Yourself Yourselves Himself/herself/itself Themselves d. Relative pronouns are used to introduce subordinate clauses. (Remember that subordinate clauses are groups of words with both a subject and a verb that are not complete sentences because they do not express a complete thought.) i.
    [Show full text]
  • PARTS of SPEECH ADJECTIVE: Describes a Noun Or Pronoun; Tells
    PARTS OF SPEECH ADJECTIVE: Describes a noun or pronoun; tells which one, what kind or how many. ADVERB: Describes verbs, adjectives, or other adverbs; tells how, why, when, where, to what extent. CONJUNCTION: A word that joins two or more structures; may be coordinating, subordinating, or correlative. INTERJECTION: A word, usually at the beginning of a sentence, which is used to show emotion: one expressing strong emotion is followed by an exclamation point (!); mild emotion followed by a comma (,). NOUN: Name of a person, place, or thing (tells who or what); may be concrete or abstract; common or proper, singular or plural. PREPOSITION: A word that connects a noun or noun phrase (the object) to another word, phrase, or clause and conveys a relation between the elements. PRONOUN: Takes the place of a person, place, or thing: can function any way a noun can function; may be nominative, objective, or possessive; may be singular or plural; may be personal (therefore, first, second or third person), demonstrative, intensive, interrogative, reflexive, relative, or indefinite. VERB: Word that represents an action or a state of being; may be action, linking, or helping; may be past, present, or future tense; may be singular or plural; may have active or passive voice; may be indicative, imperative, or subjunctive mood. FUNCTIONS OF WORDS WITHIN A SENTENCE: CLAUSE: A group of words that contains a subject and complete predicate: may be independent (able to stand alone as a simple sentence) or dependent (unable to stand alone, not expressing a complete thought, acting as either a noun, adjective, or adverb).
    [Show full text]
  • Chapter 3 Noun Phrases Pronouns
    Chapter 3 Noun Phrases Now that we have established something about the structure of verb phrases, let's move on to noun phrases (NPs). A noun phrase is a noun or pronoun head and all of its modifiers (or the coordination of more than one NP--to be discussed in Chapter 6). Some nouns require the presence of a determiner as a modifier. Most pronouns are typically not modified at all and no pronoun requires the presence of a determiner. We'll start with pronouns because they are a relatively simple closed class. Pronouns English has several categories of pronouns. Pronouns differ in the contexts they appear in and in the grammatical information they contain. Pronouns in English can contrast in person, number, gender, and case. We've already discussed person and number, but to review: 1. English has three persons o first person, which is the speaker or the group that includes the speaker; o second person, which is the addressee or the group of addressees; o third person, which is anybody or anything else 2. English has two numbers o singular, which refers to a singular individual or undifferentiated group or mass; o plural, which refers to more than one individual. The difference between we and they is a difference in person: we is first person and they is third person. The difference between I and we is a difference in number: I is singular and we is plural. The other two categories which pronouns mark are gender and case. Gender is the system of marking nominal categories.
    [Show full text]
  • Plurality in a Classifier Language Author(S): Yen-Hui Audrey Li Source: Journal of East Asian Linguistics, Vol
    Plurality in a Classifier Language Author(s): Yen-Hui Audrey Li Source: Journal of East Asian Linguistics, Vol. 8, No. 1 (Jan., 1999), pp. 75-99 Published by: Springer Stable URL: http://www.jstor.org/stable/20100755 Accessed: 24/08/2010 19:01 Your use of the JSTOR archive indicates your acceptance of JSTOR's Terms and Conditions of Use, available at http://www.jstor.org/page/info/about/policies/terms.jsp. JSTOR's Terms and Conditions of Use provides, in part, that unless you have obtained prior permission, you may not download an entire issue of a journal or multiple copies of articles, and you may use content in the JSTOR archive only for your personal, non-commercial use. Please contact the publisher regarding any further use of this work. Publisher contact information may be obtained at http://www.jstor.org/action/showPublisher?publisherCode=springer. Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission. JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms of scholarship. For more information about JSTOR, please contact [email protected]. Springer is collaborating with JSTOR to digitize, preserve and extend access to Journal of East Asian Linguistics. http://www.jstor.org YEN-HUI AUDREY LI PLURALITY IN A CLASSIFIER LANGUAGE* This work argues that -men inMandarin Chinese is best analyzed as a plural morpheme realized on an element in Determiner, in contrast to a regular plural on an element in N, such as the English -s.
    [Show full text]
  • Parts of Speech
    PARTS OF SPEECH NOUN: Name of a person, place, thing or quality. (examples: Billy, Chicago, pencil, courage). PRONOUN: Word used in place of a noun. (examples: he, she, it, they, we). ADJECTIVE: Word that describes or limits a noun or pronoun. (examples: big, blue, mean, pretty). VERB: Word expressing action or state of being. (examples: write, kiss, is, feels). ADVERB: Word used to modify the meaning of a verb, adjective, or another adverb. (examples: always, once, quickly, too, there). PREPOSITION: Word used to show the relation between two or more things. (examples: to, at, under, into, between). CONJUNCTION: Word used to join a word or group of words with another word or group of words. INTERJECTION: An exclamation. (examples: oh!, wow!). GRAMMAR Subject: The something or someone talked about or doing the action in a sentence. (ex.: President Johnson disliked his portrait.) Predicate: The verb plus all its modifiers and complements. (ex.: President Johnson disliked his portrait.) Fused Sentence: Two sentences run together with no punctuation between them. (ex.: President Johnson disliked his portrait he felt it made him look too old.) Comma Splice: Two sentences separated only by a comma. (ex.: President Johnson disliked his portrait, he felt it made him look too old.) Fragment: A group of words presented as a sentence but which lacks the elements of a sentence. (ex.: Once upon a time.) Sentence: A group of words containing a subject, a verb, and expressing a complete thought. (ex.: Once upon a time, Jack and Jill tumbled down the hill.) OVER HELPING VERBS be will being would been must am do is did are does was have were has may had might having can shall could should PREPOSITIONS about beyond over above by since across down through after during to against for toward along from under among in underneath around inside until at into up before like upon behind near with below of within beneath off without besides on between onto 1.
    [Show full text]
  • Nine Parts of Speech in Teaching English to Electrical Engineering Students
    NINE PARTS OF SPEECH IN TEACHING ENGLISH TO ELECTRICAL ENGINEERING STUDENTS Naely Muchtar Electrical Engineering Department Politeknik Negeri Ujung Pandang [email protected] Abstract Prior to the ASEAN Economic Community, one of the specific soft skills to be mastered for Indonesian engineers to be stand out from the other engineers of other ASEAN countries is mastering English language skills. Engineering students’ backgrounds are mostly from Vocational High School with a variety of English skills. It is quite complicated for the lecturer in teaching English for Engineering as English for Specific Purpose (ESP) subject with students’ various level of English skills. English is taught based on the Electrical Engineering content which made the lecturer’s need to analyze the students’ skill in basic words of English. This paper aims at presenting and offering the importance of teaching English through nine parts of speech which consists of noun, pronoun, adjective, verb, adverb, preposition, conjunction, and interjection, as the fundamental words of English. By mastering nine parts of speech, engineering students will easily conduct sentences into paragraph, reading English text, listening while watching tutorial video, and speaking for presenting their idea. Nine parts of speech are very practical for teaching ESP. Keywords: Parts of speech, English for engineering Introduction ASEAN Economic Community era provides the real challenges and opportunities for lecturer in teaching ESP which is aimed at evolving students’ professional skills in communication by using English which is depending on the area of professional field. It means that teaching and learning process must be interconnected to students’particular specialization. Hence, English for specific purpose includes specialized programmes which are designed to develop the communicative use of English in a specialized field of science, work or technology.
    [Show full text]
  • Pragmatic Person Features in Pronominal and Clausal Speech Act Phrases Hailey Hyekyeong Ceong* Abstract. This Paper Proposes
    2021. Proc Ling Soc Amer 6(1). 484–498. https://doi.org/10.3765/plsa.v6i1.4984. Pragmatic person features in pronominal and clausal speech act phrases Hailey Hyekyeong Ceong* Abstract. This paper proposes the necessity of pragmatic person features (Ritter and Wiltschko 2018) in pronominal and clausal speech act phrases in Korean, giving three main arguments for such necessity: (i) pragmatic person [ADDRESSEE] is needed for hearsay mye which expresses the meaning of you told me without the lexical verb of saying, (ii) pragmatic person [SPEAKER] is needed for the unequal distribution of first-person plural pronouns with exhortative ca ‘let us’, and (iii) pragmatic persons [SPEAKER], and [ADDRESSEE] are needed for the asymmetric distribution of a dative goal argument in secondhand exhortatives. Based on the compatibility and incompatibility of exhortative ca- and secondhand exhortative ca- mye clauses with a first-person pronoun (e.g., na ‘I’, ce ‘I’, wuli ‘we’, and cehuy ‘we’), I argue that pragmatic person features are needed in syntax to account for their distribution. Keywords. person; formality; clusivity; speech act phrases; hearsay; Korean 1. Introduction. This paper investigates pragmatic person features in pronominal and clausal speech act phrases focusing on the properties of first-and secondhand exhortatives in Korean. Based largely on evidence from a survey of variable pronominal paradigms across languages, Ritter and Wiltschko (2018) state that some languages lexicalize the distinction between prag- matic and grammatical person features. In this paper, I argue, on the basis of the distribution and interpretation of first-and secondhand exhortative markers, first-person pronouns, and the (dis-) agreement with the head of exhortatives and a first-person pronoun, that the distinction is lexical- ized in Korean as well.
    [Show full text]