An Integrated Model for Anaphora Resolution

Total Page:16

File Type:pdf, Size:1020Kb

An Integrated Model for Anaphora Resolution AN INTEGRATED MODEL FOR ANAPHORA RESOLUTION Ruslan Mitkov Institute of Mathematics Acad. G. Bonchev str. bl.8, 1113 Sofia, Bulgaria ABSTRACT "distributed architecture", but their ideas a) do not seem to capture enough The paper discusses a new knowledge- discourse and heuristical knowledge and based and sublanguage-oriented model b) do not concentrate on and investigate a for anaphora resolution, which integrates concrete domain, and thus risk being too syntactic, semantic, discourse, domain general. We have tried nevertheless k) and heuristical knowledge for the incorporate some of their ideas into our sublanguage of computer science. Special proposal. attention is paid to a new approach for tracking the center throughout a discourse segment, which plays an imtx~rtant role in THE ANAPttORA RESOLUTION proposing the most likely antecedent to MODEL the anaphor in case of ambiguity. Our anaphora resolution model integrates modules containing different types of INTRODUCTION knowledge - syntactic, semantic, domain, discourse and heuristical knowledge. All Anaphora resolution is a complicated the modules share a common problem in computational linguistics. representation of the cunent discourse. Considerable research has been done by computational linguists ([Carbonell & The syntactic module, for example, Brown 88], IDahl & Ball 90], knows that the anaphor and antecedent [Frederking & Gchrke 87], [Hayes 81], must agree in number, gender and [Hobbs 78], [lngria & Stallard 89], person. It checks if the c-command [Preug et al. 9411, [Rich & LuperFoy 88[, constraints hold and establishes disjoint [Robert 89]), but no complete theory has reference. In cases of syntactic emerged which offers a resolution parallelism, it prefers the noun phrase procedure with success guaranteed. All with the same syntactic role as the approaches developed - even if we restrict anaphor, as the most probable antecedent. our attention to pronominal anaphora, It knows when cataphora is possible and which we will do throughout this paper - can indicate syntactically topicalized noun from purely syntactic ones to highly phrases, which are more likely to be semantic and pragmatic ones, only antecedents than non-topicalized ones. provide a partial treatment of the problem. The semantic module checks for semantic Given the complexity of the problem, we consistency between the anaphor and the think that to secure a comparatively possible antecedent. It filters out successful handling of anaphora semantically incompatible candidates resolution one should adhere to the following the cun-ent verb semantics or following principles: l) restriction to a the animacy of the candidate. In cases of domain (sublanguage) rather than focus semantic parallelism, it prefers the noun on a particular natural language as a phrase, having the same semantic role as whole; 2) maximal use of linguistic the anaphor, as a most likely antecedent. information integrating it into a uniform Finally, it generates a set of possible architecture by means of partial theories. antecedents whenever necessary. Some more recent treatments of anaphora ([Carbonell & Brown 88], [Preug et al. The domain knowlcdge module is 941, [Rich & LuperFoy 8811) do express practically a knowlcdge basc of the the idea of "multi-level approach", or concepts of the domain considered and 1170 thc discourse knowledge module knows implementcd; its development, however, how to track the center throughout the is envisaged for later stages of the project. current discourse segment. The syntactic and semantic modules The heuristical knowledge module can usually filter the possible candidates and solnetimes bc helpful in assigning the do not propose an antecedent (with the antecedent. It has a set of useful rules exception of syntactic and semantic (e.g. the antecedent is to be located parallelism). Usually the proposal for an preferably in thc current sentence or in the antecedent comes from the domain, previous one) and can forestall certain heuristical, and discourse modules. The impractical search procedures. latter plays an important role in tracking the center and proposes it in many cases The use of colnmon sense and world as the most probable candidate for an knowledge is in general commendable, antecedent. but it requires a huge knowledge base and set of inference rules. The present version Figurc 1 illustrates the general structure of of the model does not have this mcxtule our anaphom resolution model. IIIiURISTICAI, 1)OMAIN I)|SCOURSI ~, KNOWI ,I ';lX;t i KNOWI ,El X]! i KNOW I ,l ilX]l'~ l)omain lleuristics l)omain Concept Rating Rules '1'racking Center Ktlowledgc 1~ase Recency [ Rl:ilqlilil iNTIA], l ihJ ANAPIIORA ANTI iCI il)ENT ANAPIIOR-~--~.I ixptaisstoN P,I {SOl ,VI {R SYNTACTIC KNOW] ,t il )(}l i Number Agrccmenl St,',MANTI( Gender Agfeelncnl KNOW1,1 '~I)GE PCI'SOll A~l'Celllellt Semm~tic Consistency l)isjoim Reference Case Roles (~-(~ommaud Constraints Semantic Parallelism Cataphora Animacy Syntactic Paralldislll Set Generalion Syntactic Topicalization Figure 1: Anaphora resolution model THE NEED FOR DISCOURSE criteria are therefore needed. CRITERIA As an illustration, considerthe following Although the syntactic and semantic text. criteria for the selection of an antecedent are already very strong, they are not Chapter 3 discusses these additional or always sufficient to discriminate alnong a auxiliary storage devices, wlfieh mc set of possible candidates. Moreover, similar to our own domestic tape they serve more as filters to eliminate cassettes and record discs. Figure 2 unsuitable candidates than as protx)sers of illustrates lheir connection to the main the most likely candidate. Additional cenlral memory. 1171 anaphora resolution. Usually a center is In this discourse segment neither the the most likely candidate for syntactic, nor the semantic constraints can pronominalization. eliminate the ambiguity between "storage devices", "tape cassettes" or "record There are different views in literature discs" as antecedents for "their", and thus regarding the preferred candidate for a cannot turn up a plausible antecedent from center (focus). Sidner's algorithm among these candidates. A human reader ([Sidner 811), which is based on thematic would be in a better position since he roles, prefers the theme o[ the previous would be able to identify the central sentence as the focus of the current concept, which is a primary candidate for sentence. This view, in general, is pronominalization. Correct identification advocated also in ([Allen87]). PUNDIT, of the antecedent is possible on the basis in its current implementation, considers of the pronominal reference hypothesis: in the entire previous utterance to be the every sentence which contains one or potential focus ([Dahl&Ball 901). Finally, more pronouns must have one of its in the centering literature ([Brennan et al. pronouns refer to the center 1 of the 87]), the subject is generally considered previous sentence. Therefore, whenever to be preferred. We have found, we have to find a referent of a pronoun however, that there are many additional which is alone in the sentence, we have to interrelated factors which influence upon look for the centered clement in the the location of the center. previous sentence. Wc studied the "behaviour" of center in Following this hypothesis, and various computer science texts (30 recognizing "storage devices" as the different sources totally exceeding 1000 center, an anaphora resolution model pages) and the empirical observations would not have problems in picking up enabled us to develop efficient the center of the previous sentence sublanguage-dependent heuristics for ("storage devices") as antecedent for tracking the center in the sublanguage of "their". computer science. We summarize the most important conclusions as follows: We see now that the main problem which arises is the tracking of the center 1) Consider the primary candidates throughout the discourse segment. for center from the priority list: Certain ideas and algorithms for tracking subject, object, verb phrase. focus or center (e.g. [Brennan et al.87]) have been proposed, provided that one 2) Prefer the NP, representing a knows the focus or center of the first domain concept to the NPs, which sentence in the segment. However, they are not domain concepts. do not try to identify this center. Our approach determines the most probable 3) If the verb is a member of the center of the first sentence, and then Verb set = {discuss, present, tracks it all the way through the segment, illustrate, summarize, examine, correcting the proposed algorithm at each describe, define, show, check, step. develop, review, report, outline, consider, investigate, explore, assess, analyze, synthesize, study, TRACKING THE CENTER IN "['HE survey, deal, cover}, then consider SUBLANGUAGE OF COMPUTER the object as a most probable SCIENCE center. Identifying center can be very helpful in 4) If a verbal adjective is a member of" the Adj set = {defined, called, so-called}, then consider the NP 1Though "center" is ml uncrancc specific notion, they refer to as the probable center we refer to "sentence center", because in many cases the centers of the uttermmes a senlence may of the subsequent clause/current consist of, coincide. In a complex sentence, sentence. however, we distinguish also "clause centers" 1172 5) If the subject is "chapter", proposer module which identifies the "section", "table", or a personal most likely center. We must point out that pronoun - 'T', "we", "you",
Recommended publications
  • Minimal Pronouns, Logophoricity and Long-Distance Reflexivisation in Avar
    Minimal pronouns, logophoricity and long-distance reflexivisation in Avar* Pavel Rudnev Revised version; 28th January 2015 Abstract This paper discusses two morphologically related anaphoric pronouns inAvar (Avar-Andic, Nakh-Daghestanian) and proposes that one of them should be treated as a minimal pronoun that receives its interpretation from a λ-operator situated on a phasal head whereas the other is a logophoric pro- noun denoting the author of the reported event. Keywords: reflexivity, logophoricity, binding, syntax, semantics, Avar 1 Introduction This paper has two aims. One is to make a descriptive contribution to the crosslin- guistic study of long-distance anaphoric dependencies by presenting an overview of the properties of two kinds of reflexive pronoun in Avar, a Nakh-Daghestanian language spoken natively by about 700,000 people mostly living in the North East Caucasian republic of Daghestan in the Russian Federation. The other goal is to highlight the relevance of the newly introduced data from an understudied lan- guage to the theoretical debate on the nature of reflexivity, long-distance anaphora and logophoricity. The issue at the heart of this paper is the unusual character of theanaphoric system in Avar, which is tripartite. (1) is intended as just a preview with more *The present material was presented at the Utrecht workshop The World of Reflexives in August 2011. I am grateful to the workshop’s audience and participants for their questions and comments. I am indebted to Eric Reuland and an anonymous reviewer for providing valuable feedback on the first draft, as well as to Yakov Testelets for numerous discussions of anaphora-related issues inAvar spanning several years.
    [Show full text]
  • Relative Clause Attachment and Anaphora: a Case for Short Binding
    Relative Clause Attachment and Anaphora: A Case for Short Binding Rodolfo Delmonte Ca' Garzoni-Moro, San Marco 3417, Università "Ca Foscari", 30124 - VENEZIA E-mail: [email protected] Abstract Relative clause attachment may be triggered by binding requirements imposed by a short anaphor contained within the relative clause itself: in case more than one possible attachment site is available in the previous structure, and the relative clause itself is extraposed, a conflict may arise as to the appropriate s/c-structure which is licenced by grammatical constraints but fails when the binding module tries to satisfy the short anaphora local search for a bindee. 1 Introduction It is usually the case that anaphoric and pronominal binding take place after the structure building phase has been successfully completed. In this sense, c-structure and f-structure in the LFG framework - or s-structure in the chomskian one - are a prerequisite for the carrying out of binding processes. In addition, they only interact in a feeding relation since binding would not be possibly activated without a complete structure to search, and there is no possible reversal of interaction, from Binding back into s/c-structure level seen that they belong to two separate Modules of the Grammar. As such they contribute to each separate level of representation with separate rules, principles and constraints which need to be satisfied within each Module in order for the structure to be licensed for the following one. However we show that anaphoric binding requirements may cause the parser to fail because the structure is inadequate. We propose a solution to this conflict by anticipating, for anaphors only the though, the agreement matching operations between binder and bindee and leaving the coindexation to the following module.
    [Show full text]
  • Antar Solhy Abdellah Publication Date: 2007 Source: CDELT (Centre for Developing English Language Teaching) Occasional Papers, January (2007) [Egypt]
    Title: “English Majors’ errors in translating Arabic Endophora; Analysis and Remedy” Author: Antar Solhy Abdellah Publication date: 2007 Source: CDELT (Centre for Developing English Language Teaching) Occasional Papers, January (2007) [Egypt]. ENGLISH MAJORS' ERRORS IN TRANSLATING ARABIC ENDOPHORA: ANALYSIS AND REMEDY Antar Solhy Abdellah Lecturer in TEFL Qena Faculty of Education, South Valley University- Egypt Abstract Egyptian English majors in the faculty of Education, South Valley university tend to mistranslate the plural inanimate Arabic pronoun with the singular inanimate English pronoun. A diagnostic test was designed to analyze this error. Results showed that a large number of students (first year and fourth year students) make this error, that the error becomes more common if the pronoun is cataphori rather than anaphori, and that the further the pronoun is from its antecedent the more students are apt to make the error. On the basis of these results, sources of the error are identified and remedial procedures are suggested. Abstract in Arabic تقوم الدراسة الحالية بتحليل أخطاء طﻻب شعبة اللغة اﻹنجليزية )الفرقة اﻷولى والرابعة( في ترجمة ضمير جمع غير العاقل من العربية إلى اﻹنجليزية؛حيث يميل الطﻻب إلى استخدام ضمير غير العاقل المفرد في اﻹنجليزية بدﻻ من ضمير الجمع. تستخدم الدراسة اختبارا تشخيصيا يسعى للكشف عن نسبة شيوع الخطأ ومن ثم تحليله. أظهرت النتائج أن عددا كبيرا من طﻻب الفرقتين يرتكبون هذا الخطأ، وأن الخطأ يزداد إذا كان الضمير في موضع المتقدم أكثر مما إذا كان في موضع المتأخر، وأن الخطأ يزداد كلما بعد الضمير عن عائده. ثم تناولت الدراسة تحليﻻ لمصدر الخطأ وقدمت مقترحات لعﻻجه. INTRODUCTION 62 Students whose major is English in faculties of Education are faced with translation problems from the very start of their study.
    [Show full text]
  • Inalienable Possession in Swedish and Danish – a Diachronic Perspective 27
    FOLIA SCANDINAVICA VOL. 23 POZNAŃ 20 17 DOI: 10.1515/fsp - 2017 - 000 5 INALIENABLE POSSESSI ON IN SWEDISH AND DANISH – A DIACHRONIC PERSP ECTIVE 1 A LICJA P IOTROWSKA D OMINIKA S KRZYPEK Adam Mickiewicz University in Poznań A BSTRACT . In this paper we discuss the alienability splits in two Mainland Scandinavian language s, Swedish and Danish, in a diachronic context. Although it is not universally acknowledged that such splits exist in modern Scandinavian languages, many nouns typically included in inalienable structures such as kinship terms, body part nouns and nouns de scribing culturally important items show different behaviour from those considered alienable. The differences involve the use of (reflexive) possessive pronouns vs. the definite article, which differentiates the Scandinavian languages from e.g. English. As the definite article is a relatively new arrival in the Scandinavian languages, we look at when the modern pattern could have evolved by a close examination of possessive structures with potential inalienables in Old Swedish and Old Danish. Our results re veal that to begin with, inalienables are usually bare nouns and come to be marked with the definite article in the course of its grammaticalization. 1. INTRODUCTION One of the striking differences between the North Germanic languages Swedish and Danish on the one hand and English on the other is the possibility to use definite forms of nouns without a realized possessive in inalienable possession constructions. Consider the following examples: 1 The work on this paper was funded by the grant Diachrony of article systems in Scandi - navian languages , UMO - 2015/19/B/HS2/00143, from the National Science Centre, Poland.
    [Show full text]
  • A Multi-Modal Analysis of Anaphora and Ellipsis
    University of Pennsylvania Working Papers in Linguistics Volume 5 Issue 2 Current Work in Linguistics Article 2 1998 A Multi-Modal Analysis of Anaphora and Ellipsis Gerhard Jaeger Follow this and additional works at: https://repository.upenn.edu/pwpl Recommended Citation Jaeger, Gerhard (1998) "A Multi-Modal Analysis of Anaphora and Ellipsis," University of Pennsylvania Working Papers in Linguistics: Vol. 5 : Iss. 2 , Article 2. Available at: https://repository.upenn.edu/pwpl/vol5/iss2/2 This paper is posted at ScholarlyCommons. https://repository.upenn.edu/pwpl/vol5/iss2/2 For more information, please contact [email protected]. A Multi-Modal Analysis of Anaphora and Ellipsis This working paper is available in University of Pennsylvania Working Papers in Linguistics: https://repository.upenn.edu/pwpl/vol5/iss2/2 A Multi-Modal Analysis of Anaphora and Ellipsis Gerhard J¨ager 1. Introduction The aim of the present paper is to outline a unified account of anaphora and ellipsis phenomena within the framework of Type Logical Categorial Gram- mar.1 There is at least one conceptual and one empirical reason to pursue such a goal. Firstly, both phenomena are characterized by the fact that they re-use semantic resources that are also used elsewhere. This issue is discussed in detail in section 2. Secondly, they show a striking similarity in displaying the characteristic ambiguity between strict and sloppy readings. This supports the assumption that in fact the same mechanisms are at work in both cases. (1) a. John washed his car, and Bill did, too. b. John washed his car, and Bill waxed it.
    [Show full text]
  • Donkey Anaphora Is In-Scope Binding∗
    Semantics & Pragmatics Volume 1, Article 1: 1–46, 2008 doi: 10.3765/sp.1.1 Donkey anaphora is in-scope binding∗ Chris Barker Chung-chieh Shan New York University Rutgers University Received 2008-01-06 = First Decision 2008-02-29 = Revised 2008-03-23 = Second Decision 2008-03-25 = Revised 2008-03-27 = Accepted 2008-03-27 = Published 2008- 06-09 Abstract We propose that the antecedent of a donkey pronoun takes scope over and binds the donkey pronoun, just like any other quantificational antecedent would bind a pronoun. We flesh out this idea in a grammar that compositionally derives the truth conditions of donkey sentences containing conditionals and relative clauses, including those involving modals and proportional quantifiers. For example, an indefinite in the antecedent of a conditional can bind a donkey pronoun in the consequent by taking scope over the entire conditional. Our grammar manages continuations using three independently motivated type-shifters, Lift, Lower, and Bind. Empirical support comes from donkey weak crossover (*He beats it if a farmer owns a donkey): in our system, a quantificational binder need not c-command a pronoun that it binds, but must be evaluated before it, so that donkey weak crossover is just a special case of weak crossover. We compare our approach to situation-based E-type pronoun analyses, as well as to dynamic accounts such as Dynamic Predicate Logic. A new ‘tower’ notation makes derivations considerably easier to follow and manipulate than some previous grammars based on continuations. Keywords: donkey anaphora, continuations, E-type pronoun, type-shifting, scope, quantification, binding, dynamic semantics, weak crossover, donkey pronoun, variable-free, direct compositionality, D-type pronoun, conditionals, situation se- mantics, c-command, dynamic predicate logic, donkey weak crossover ∗ Thanks to substantial input from Anna Chernilovskaya, Brady Clark, Paul Elbourne, Makoto Kanazawa, Chris Kennedy, Thomas Leu, Floris Roelofsen, Daniel Rothschild, Anna Szabolcsi, Eytan Zweig, and three anonymous referees.
    [Show full text]
  • Arxiv:1805.11824V1 [Cs.CL] 30 May 2018
    Artificial Intelligence Review manuscript No. (will be inserted by the editor) Anaphora and Coreference Resolution: A Review Rhea Sukthanker · Soujanya Poria · Erik Cambria · Ramkumar Thirunavukarasu Received: date / Accepted: date Abstract Entity resolution aims at resolving repeated references to an entity in a document and forms a core component of natural language processing (NLP) research. This field possesses immense potential to improve the performance of other NLP fields like machine translation, sentiment analysis, paraphrase detection, summarization, etc. The area of entity resolution in NLP has seen proliferation of research in two separate sub-areas namely: anaphora resolution and coreference resolution. Through this review article, we aim at clarifying the scope of these two tasks in entity resolution. We also carry out a detailed analysis of the datasets, evaluation metrics and research methods that have been adopted to tackle this NLP problem. This survey is motivated with the aim of providing the reader with a clear understanding of what constitutes this NLP problem and the issues that require attention. Keywords Entity Resolution · Coreference Resolution · Anaphora Resolution · Natural Language Processing · Sentiment Analysis · Deep Learning 1 Introduction A discourse is a collocated group of sentences which convey a clear understanding only when read together. The etymology of anaphora is ana (Greek for back) and pheri (Greek for to bear), which in simple terms means repetition. In computational linguistics, anaphora is typically defined as references to items mentioned earlier in the discourse or \pointing back" reference as described by (Mitkov, 1999). The most prevalent type of anaphora in natural language is the pronominal anaphora (Lappin and Leass, 1994).
    [Show full text]
  • Donkey Sentences 763 Creating Its Institutions of Laws, Religion, and Learning
    Donkey Sentences 763 creating its institutions of laws, religion, and learning. many uneducated speakers to restructure their plural, It was the establishment of viceroyalties, convents so that instead of the expected cotas ‘coasts’, with -s and a cathedral, two universities – the most notable denoting plurality, they have created a new plural being Santo Toma´s de Aquino – and the flourishing of with -se,asinco´ tase. arts and literature during the 16th and early 17th Dominican syntax tends to prepose pronouns in century that earned Hispaniola the title of ‘Athena interrogative questions. As an alternative to the stan- of the New World.’ The Spanish language permeated dard que´ quieres tu´ ? ‘what do you want?’, carrying an those institutions from which it spread, making obligatory, postverbal tu´ ‘you’, speakers say que´ tu´ Hispaniola the cradle of the Spanish spoken in the quieres?. The latter sentence further shows retention Americas. of pronouns, which most dialects may omit. Fre- Unlike the Spanish of Peru and Mexico, which quently found in Dominican is the repetition of dou- co-existed with native Amerindian languages, ble negatives for emphatic purposes, arguably of Dominican Spanish received little influence from the Haitian creole descent. In responding to ‘who did decimated Tainos, whose Arawak-based language that?’, many speakers will reply with a yo no se´ no disappeared, leaving a few recognizable words, such ‘I don’t know, no’. as maı´z ‘maize’ and barbacoa ‘barbecue’. The 17th Notwithstanding the numerous changes to its century saw the French challenge Spain’s hegemony grammatical system, and the continuous contact by occupying the western side of the island, which with the English of a large immigrant population they called Saint Domingue and later became the residing in the United States, Dominican Spanish has Republic of Haiti.
    [Show full text]
  • The Bound Variable Hierarchy and Donkey Anaphora in Mandarin Chinese
    The Bound Variable Hierarchy and Donkey Anaphora in Mandarin Chinese Haihua Pan and Yan Jiang City University of Hong Kong / London University Cheng and Huang (1996) argue that both unselective binding and E-type pro- noun strategies are necessary for the interpretation of natural language sentences and claim that there exists a correspondence between two sentence types in Chinese and the two strategies, namely that the interpretation of the “wh … wh” construction (which they call “bare conditional”) employs the unselective binding strategy, while the ruguo ‘if’ and dou ‘all’ conditionals use the E-type pronoun strategy. They also suggest that there is a complementary distribution between bare conditionals and ruguo/dou conditionals in the sense that the lat- ter allows all the NP forms, e.g. (empty) pronouns and definite NPs, except for wh-phrases in their consequent clauses, and can even have a consequent clause with no anaphoric NP in it, while the former permits only the same wh-phrase appearing in both the antecedent clause and the consequent clause. Although we agree with Cheng and Huang on the necessity of the two strategies in natural language interpretation, we see apparent exceptions to the correspondence between sentence types and interpretation strategies and the complementary distribution between wh-phrases and other NPs in bare conditionals and ruguo/dou conditionals. We think that the claimed correspondence and comple- mentary distribution are the default or preferred patterns, or a special case of a more general picture, namely that (i) bare conditionals prefer the unselective binding strategy and the ruguo ‘if’ and dou ‘all’ conditionals, the E-type pronoun strategy; and (ii) wh-phrases are more suitable for being a bound variable, and pronouns are more suitable for being the E-type pronoun.
    [Show full text]
  • Long-Distance Reflexivization and Logophoricity in the Dargin Language Muminat Kerimova Florida International University
    Florida International University FIU Digital Commons MA in Linguistics Final Projects College of Arts, Sciences & Education 2017 Long-Distance Reflexivization and Logophoricity in the Dargin Language Muminat Kerimova Florida International University Follow this and additional works at: https://digitalcommons.fiu.edu/linguistics_ma Part of the Linguistics Commons Recommended Citation Kerimova, Muminat, "Long-Distance Reflexivization and Logophoricity in the Dargin Language" (2017). MA in Linguistics Final Projects. 3. https://digitalcommons.fiu.edu/linguistics_ma/3 This work is brought to you for free and open access by the College of Arts, Sciences & Education at FIU Digital Commons. It has been accepted for inclusion in MA in Linguistics Final Projects by an authorized administrator of FIU Digital Commons. For more information, please contact [email protected]. FLORIDA INTERNATIONAL UNIVERSITY Miami, Florida LONG-DISTANCE REFLEXIVIZATION AND LOGOPHORICITY IN THE DARGIN LANGUAGE A thesis submitted in partial fulfillment of the requirements for the degree of MASTER OF ARTS in LINGUISTICS by Muminat Kerimova 2017 ABSTRACT OF THE THESIS LONG-DISTANCE REFLEXIVIZATION AND LOGOPHORICITY IN THE DARGIN LANGUAGE by Muminat Kerimova Florida International University, 2017 Miami, Florida Professor Ellen Thompson, Major Professor The study of anaphora challenges us to determine the conditions under which the pronouns of a language are associated with possible antecedents. One of the theoretical questions is whether the distribution of pronominal forms is best explained by a syntactic, semantic or discourse level analysis. A more practical question is how we distinguish between anaphoric elements, e.g. what are the borders between the notions of pronouns, locally bound reflexives and long-distance reflexives? The study analyzes the anaphora device saj in Dargin that is traditionally considered to be a long-distance reflexivization language.
    [Show full text]
  • Anaphora: Text-Based Or Discourse-Dependent? Functionalist Vs
    Anaphora: Text-based or discourse-dependent? Functionalist vs. formalist accounts Francis Cornish To cite this version: Francis Cornish. Anaphora: Text-based or discourse-dependent? Functionalist vs. formalist accounts. Functions of Language, John Benjamins Publishing, 2010, 17 (2), pp.207-241. 10.1075/fol.17.2.03cor. hal-00966398 HAL Id: hal-00966398 https://hal-univ-tlse2.archives-ouvertes.fr/hal-00966398 Submitted on 26 Mar 2014 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. 1 ANAPHORA: TEXT-BASED OR DISCOURSE-DEPENDENT? FUNCTIONALIST VS. FORMALIST ACCOUNTS* (Published in Functions of Language 17(2), 2010, pp. 207-241. DOI: 10.1075/fol.17.2.03cor) Francis Cornish, CLLE-ERSS, CNRS UMR 5263 and Université de Toulouse-Le-Mirail, Département Etudes du Monde Anglophone, 5, Allées Antonio Machado, 31058 Toulouse Cedex 09, France Email address: [email protected] 2 Abstract The traditional definition of anaphora in purely co-textual terms as a relation between two co-occurring expressions is in wide currency in theoretical and descriptive studies of the phenomenon. Indeed, it is currently adopted in on-line psycholinguistic experiments on the interpretation of anaphors, and is the basis for all computational approaches to automatic anaphor resolution (see Mitkov, 2002).
    [Show full text]
  • 2 Linguistic Fundamentals of Anaphors and Anaphora
    2 Linguistic fundamentals of anaphors and anaphora 2.1 Basic definitions The word anaphora originates from Greek ana- (“back”) and pherein (“to bear”) and entered English via Latin transmission (cf. “Anaphora” 2010). In English, it is documented for the first time in 1589 (cf. Simpson & Weiner 1989: 436-437): Anaphora, or the Figure of Report. Repetition in the firſt degree we call the figure of Report according to the Greeke originall, and is when we make one word begin, and as they are wont to ſay, lead the daunce to many verſes in ſute, as thus. To thinke on death it is a miſerie, To think on life it is a vanitie: To thinke on the world verily it is, To thinke that heare man hath no perfit bliſſe. (Puttenham 1589: 165) “Anaphora” here denotes the rhetoric figure of repetition. The first written evi- dence of a use in grammar is not found until 1933, when the term appeared in Bloomfield’s work Language: [W]hen we say Ask that policeman, and he will tell you, the substitute he means, among other things, that the singular male substantive expression which is replaced by he, has been recently uttered. A substitute which implies this, is an anaphoric or dependent sub- stitute, and the recently-uttered replaced form is the antecedent. (Bloomfield 1984: 249) Later he gives another example: The word one […] replaces a with anaphora of the noun […] when no other modifier is pre- sent (Here are some apples; take one); […] it is the anaphoric substitute for nouns after an adjective, and in this use forms a plural, ones (the big box and the small one, these boxes and the ones in the kitchen […]).
    [Show full text]