Text Data Processing Language Reference Guide Company

Total Page:16

File Type:pdf, Size:1020Kb

Text Data Processing Language Reference Guide Company PUBLIC SAP Data Services Document Version: 4.2 Support Package 14 (14.2.14.0) – 2021-01-08 Text Data Processing Language Reference Guide company. All rights reserved. All rights company. affiliate THE BEST RUN 2020 SAP SE or an SAP SE or an SAP SAP 2020 © Content 1 Text Data Processing Language Reference Guide....................................4 2 Overview of the Language Reference Guide........................................5 2.1 Who Should Read This Guide.....................................................5 3 Overview of Text Data Processing Functionality.....................................6 3.1 Character Encodings.......................................................... 9 3.2 About Linguistic Analysis....................................................... 9 3.3 About Entity Extraction.........................................................9 About Customizing Extraction.................................................10 3.4 About Fact Extraction.........................................................10 About Sentiment Analysis ....................................................11 About Public Sector Fact Extraction.............................................11 About Enterprise Fact Extraction...............................................11 4 Linguistic Analysis ..........................................................13 4.1 Word Segmentation...........................................................14 Whitespace Languages......................................................14 Nonwhitespace Languages...................................................30 4.2 Case Normalization.......................................................... 39 4.3 Stemming.................................................................40 Language-Specific Stemming Examples..........................................41 Stemming Unknown Words...................................................81 4.4 Compound Analysis.......................................................... 81 Language-Specific Compound Analysis Examples...................................81 4.5 Part-of-Speech Tagging........................................................87 Part-of-Speech Naming Conventions............................................88 Part-of-Speech and Stem Disambiguation ........................................88 Guessing Part-of-Speech for Unknown Words......................................89 Language-Specific Unknown Words Guessing Examples..............................89 Language-Specific Part-of-Speech Tagging Examples................................93 5 Entity Extraction.......................................................... 208 5.1 Predefined Entity Types by Language.............................................208 5.2 Language-specific Entity Type Examples...........................................214 5.3 Definitions of Entity Types.....................................................295 5.4 Entity Normalization.........................................................308 6 Fact Extraction............................................................314 Text Data Processing Language Reference Guide 2 PUBLIC Content 6.1 Sentiment Analysis Fact Extraction...............................................314 Sentiment Extraction......................................................315 Emoticon Extraction.......................................................329 Request Extraction....................................................... 340 Profanity Extraction.......................................................346 6.2 Public Sector Fact Extraction...................................................346 Common Entity Extraction..................................................348 Action Event Extraction.................................................... 353 Military Unit Extraction.....................................................355 Organizational Information Extraction..........................................356 Person: Alias Extraction.................................................... 357 Person: Appearance Extraction...............................................359 Person: Attribute Extraction................................................. 361 Person: Relationship Extraction...............................................362 Spatial Reference Extraction.................................................364 Travel Event Extraction.....................................................365 6.3 Enterprise Fact Extraction.....................................................367 Management Change Event Extraction..........................................368 Membership Information Extraction............................................371 Merger Information Extraction................................................373 Organizational Information Extraction.......................................... 374 Product Release Event Extraction............................................. 376 6.4 Entity Type Names and Subentity Type Names For Fact Extraction.........................378 Text Data Processing Language Reference Guide Content PUBLIC 3 1 Text Data Processing Language Reference Guide Text Data Processing provides a suite of analyzers that enable developers to construct applications that analyze unstructured data, enabling your application to perform extraction and natural-language processing of text. This guide provides information about the linguistic extraction features, and describes the behavior of the supported language modules. Text Data Processing Language Reference Guide 4 PUBLIC Text Data Processing Language Reference Guide 2 Overview of the Language Reference Guide Welcome to the Language Reference Guide. This guide addresses the variety of languages that can be processed by the text analytics engine: a suite of natural-language processing capabilities based on linguistic, statistical and machine-learning algorithms that model and structure the information content of textual sources in multiple languages. This technology forms the foundation for advanced text processing for a range of applications including search, business intelligence or exploratory data analysis. The major areas of text analytics functionality are: Linguistic Analysis is the most fundamental form of text analytics - tokenization, stemming and part-of- speech tagging. These operations allow for optimized full-text index building because they guarantee both high precision and recall. Entity Extraction is the identification of named entities (persons, organizations, and so on), which eliminates the 'noise' in textual data by highlighting salient information. This process transforms unstructured text into structured information. Fact Extraction is a higher-level semantic processing that links entities as "facts" in domain-specific applications. For example, "Voice of the Customer" classifies sentiments with their corresponding topics. 2.1 Who Should Read This Guide Users of this guide may need to enhance extraction in their text analytics application and should understand text data processing extraction concepts. However, users of this guide are not expected to understand or be familiar with the natural languages of the text being processed by the software. Similarly, users are not required to be familiar with linguistic principles. This document assumes the following: ● You are an application developer or consultant working on enhancing text data processing extraction. ● You understand your organization's text data processing extraction needs. Text Data Processing Language Reference Guide Overview of the Language Reference Guide PUBLIC 5 3 Overview of Text Data Processing Functionality The software includes language modules for the languages supported. Each language module consists of a set of files that include system dictionaries containing words to support the language processing operations for the given natural language. It is the language modules that enable linguistic analysis and extraction of unstructured text in a given language. Language modules use the following language processing technologies: ● Linguistic analysis to handle natural language processing ● Entity extraction to handle named entity extraction and normalization ● Fact extraction to handle sentiment analysis, public sector events, and enterprise facts Linguistic analysis provides ● Word segmentation ● Stemming ● Part of speech tagging ● Tagged stemming (part of speech in context) Extraction provides the following levels: ● Entity Extraction ○ Core Extraction – for named entities (PERSON, DATE, LOCALITY, and so on.) ● Fact Extraction ○ Sentiment Analysis – for positive and negative sentiment and topic extraction ○ Public Sector - for public-sector-specific information. ○ Enterprise - for business-specific information There is an additional language module for the "neutral" language. The neutral language module offers basic tokenization, stemming and part-of-speech tagging in any language other than the 33 currently covered by SAP Text Analysis. The neutral language works for whitespace languages, such as Afrikaans or Bengali. Customization is also possible through dictionaries and rules. Refer to the Extraction Customization Guide for more details about customization. The following table shows the languages which are explicitly supported for Text Data Processing. Entity Extraction Fact Extraction Linguistic Language Analysis Custom Dic­ Sentiment Public Sec­ Custom Predefined tionary Analysis tor Enterprise Rules Arabic (AR) X X X X X Catalan (CA) X X X Chinese, sim­ X X X* X X plified (ZH) Text Data
Recommended publications
  • Transparency in Language a Typological Study
    Transparency in language A typological study Published by LOT phone: +31 30 253 6111 Trans 10 3512 JK Utrecht e-mail: [email protected] The Netherlands http://www.lotschool.nl Cover illustration © 2011: Sanne Leufkens – image from the performance ‘Celebration’ ISBN: 978-94-6093-162-8 NUR 616 Copyright © 2015: Sterre Leufkens. All rights reserved. Transparency in language A typological study ACADEMISCH PROEFSCHRIFT ter verkrijging van de graad van doctor aan de Universiteit van Amsterdam op gezag van de Rector Magnificus prof. dr. D.C. van den Boom ten overstaan van een door het college voor promoties ingestelde commissie, in het openbaar te verdedigen in de Agnietenkapel op vrijdag 23 januari 2015, te 10.00 uur door Sterre Cécile Leufkens geboren te Delft Promotiecommissie Promotor: Prof. dr. P.C. Hengeveld Copromotor: Dr. N.S.H. Smith Overige leden: Prof. dr. E.O. Aboh Dr. J. Audring Prof. dr. Ö. Dahl Prof. dr. M.E. Keizer Prof. dr. F.P. Weerman Faculteit der Geesteswetenschappen i Acknowledgments When I speak about my PhD project, it appears to cover a time-span of four years, in which I performed a number of actions that resulted in this book. In fact, the limits of the project are not so clear. It started when I first heard about linguistics, and it will end when we all stop thinking about transparency, which hopefully will not be the case any time soon. Moreover, even though I might have spent most time and effort to ‘complete’ this project, it is definitely not just my work. Many people have contributed directly or indirectly, by thinking about transparency, or thinking about me.
    [Show full text]
  • Equative Constructions in World-Wide Perspective
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by ZENODO 1 Equative constructions in world-wide perspective Martin Haspelmath & the Leipzig Equative Constructions Team1 Abstract: In this paper, we report on a world-wide study of equative constructions (‘A is as big as B’) in a convenience sample of 119 languages. From earlier work, it has been known that European languages often have equative constructions based on adverbial relative pronouns that otherwise express degree or manner (‘how’, ‘as’), but we find that this type is rare outside Europe. We divide the constructions that we found into six primary types, four of which have closely corresponding types of comparative constructions (‘A is bigger than B’). An equative construction often consists of five components: a comparee (‘A’), a degree-marker (‘as’), a parameter (‘is big’), a standard-marker (‘as’), and a standard (‘B’). Most frequently, the parameter is the main predicate and the equative sense is expressed by a special standard-marker. But many languages also have a degree-marker, so that we get a construction of the English and French type. Another possibility is for the equality sense to be expressed by a transitive ‘equal’ (or ‘reach’) verb, which may be the main predicate or a secondary predicate. And finally, since the equative construction is semantically symmetrical, it is also possible to “unify” the parameter and the standard in the subject position (‘A and B are equally tall’, or ‘A and B are equal in height’). But no language has only a degree-marker, leaving the standard unmarked.
    [Show full text]
  • Introduction to Sumerian Grammar
    INTRODUCTION TO SUMERIAN GRAMMAR DANIEL A FOXVOG LECTURER IN ASSYRIOLOGY (RETIRED) UNIVERSITY OF CALIFORNIA AT BERKELEY Revised September 2008 CONTENTS INTRODUCTION 3 THE SUMERIAN WRITING SYSTEM 4 TABLE OF SYLLABIC SIGN VALUES 16 PHONOLOGY 18 NOUNS AND ADJECTIVES 23 THE NOMINAL CHAIN 28 PRONOUNS AND DEMONSTRATIVES 31 SUMMARY OF PERSONAL PRONOUN FORMS 38 THE ADNOMINAL CASES: GENITIVE AND EQUATIVE 39 THE COPULA 46 ADVERBS AND NUMERALS 51 THE ADVERBAL CASES 54 INTRODUCTION TO THE VERB 61 DIMENSIONAL PREFIXES 1: INTRODUCTION 69 DIMENSIONAL PREFIXES 2: DATIVE 73 DIMENSIONAL PREFIXES 3: COMITATIVE, ABLATIVE-INSTRUMENTAL, TERMINATIVE 78 CORE PREFIXES: ERGATIVE, LOCATIVE-TERMINATIVE, LOCATIVE 83 THE VENTIVE ELEMENT 90 RELATIVE CLAUSES: THE NOMINALIZING SUFFIX -a 95 PREFORMATIVES (MODAL PREFIXES) 102 THE IMPERATIVE 109 IMPERFECTIVE FINITE VERBS 117 PARTICIPLES AND THE INFINITIVE 127 APPENDIX: CHART OF VERBAL PREFIX CHAIN ELEMENTS 150 INDEX 151 EXERCISES 152 2 INTRODUCTION Entia non sunt multiplicanda praeter necessitatem William of Ockham This grammar is intended primarily for use in the first year of university study under the guidance of a teacher who can describe the classic problems in greater detail, add current alternative explanations for phenomena, help the student parse and understand the many textual illustrations found throughout, and provide supplementary information about the history of the language and the culture of early Mesopotamia. A few exercises have been provided to accompany study of the lessons, some artificial, others drawn from actual texts. Both require vocabulary lookup from the companion Elementary Sumerian Glossary or its equivalent. Upon completing this introduction, the student will be well prepared to progress to sign learning and reading of texts.
    [Show full text]
  • Sumerian Grammar-Society of Biblical Literature
    HDO1-71-edzard.qxd 09/04/2003 10:03 Page i SUMERIAN GRAMMAR HDO1-71-edzard.qxd 09/04/2003 10:03 Page ii HANDBOOK OF ORIENTAL STUDIES HANDBUCH DER ORIENTALISTIK SECTION ONE THE NEAR AND MIDDLE EAST EDITED BY H. ALTENMÜLLER · B. HROUDA · B.A. LEVINE · R.S. O’FAHEY K.R. VEENHOF · C.H.M. VERSTEEGH VOLUME SEVENTY-ONE HDO1-71-edzard.qxd 09/04/2003 10:03 Page iii SUMERIAN GRAMMAR BY DIETZ OTTO EDZARD BRILL LEIDEN • BOSTON 2003 HDO1-71-edzard.qxd 09/04/2003 10:03 Page iv This book is printed on acid-free paper. Library of Congress Cataloging-in-Publication Data Edzard, Dietz Otto. Sumerian grammar / by Dietz Otto Edzard. p. cm. – (Handbook of Oriental studies. Section 1, Near and Middle East ; v. 71 = Handbuch der Orientalistik) Includes bibliographical references and index. ISBN 90-04-12608-2 1. Sumerian language–Grammar. I. Title. II. Handbuch der Orientalistik. Erste Abteilung, Nahe und der Mittlere Osten ; 71. Bd. PJ4013.E38 2003 499'.955-dc21 2003049567 ISSN 0169-9423 ISBN 90 04 12608 2 © Copyright 2003 by Koninklijke Brill NV, Leiden, The Netherlands All rights reserved. No part of this publication may be reproduced, translated, stored in a retrieval system, or transmitted in any form or by any means, electronic, mechanical, photocopying, recording or otherwise, without prior written permission from the publisher. Authorization to photocopy items for internal or personal use is granted by Brill provided that the appropriate fees are paid directly to The Copyright Clearance Center, 222 Rosewood Drive, Suite 910, Danvers MA 01923, USA.
    [Show full text]
  • Introduction to Sumerian Grammar”
    Cuneiform Digital Library Preprints <http://cdli.ucla.edu/?q=cuneiform-digital-library-preprints> Hosted by the Cuneiform Digital Library Initiative (<http://cdli.ucla.edu>) Editor: Bertrand Lafont (CNRS, Nanterre) Number 2 Title: “Introduction to Sumerian Grammar” Author: Daniel A. Foxvog Posted to web: 4 January 2016 INTRODUCTION TO SUMERIAN GRAMMAR DANIEL A FOXVOG LECTURER IN ASSYRIOLOGY (RETIRED) UNIVERSITY OF CALIFORNIA AT BERKELEY Revised January 2016 CONTENTS PREFACE 4 THE SUMERIAN WRITING SYSTEM 5 PHONOLOGY 17 NOUNS AND ADJECTIVES 22 THE NOMINAL CHAIN 27 PRONOUNS AND DEMONSTRATIVES 30 SUMMARY OF PERSONAL PRONOUN FORMS 37 THE ADNOMINAL CASES: GENITIVE AND EQUATIVE 38 THE COPULA 44 ADVERBS AND NUMERALS 49 THE ADVERBAL CASES 53 INTRODUCTION TO THE VERB 60 DIMENSIONAL PREFIXES 1: INTRODUCTION 68 DIMENSIONAL PREFIXES 2: DATIVE 73 DIMENSIONAL PREFIXES 3: COMITATIVE, ABLATIVE-INSTRUMENTAL, TERMINATIVE 78 CORE PREFIXES: ERGATIVE, LOCATIVE-TERMINATIVE, LOCATIVE 84 THE VENTIVE ELEMENT 91 RELATIVE CLAUSES: THE NOMINALIZING SUFFIX -a 97 PREFORMATIVES (MODAL PREFIXES) 104 THE IMPERATIVE 111 IMPERFECTIVE FINITE VERBS 119 PARTICIPLES AND THE INFINITIVE 130 APPENDIX 1: CHART OF VERBAL PREFIX CHAIN ELEMENTS 155 TABLE OF SYLLABIC SIGN VALUES 156 APPENDIX 2: THE EMESAL DIALECT 158 INDEX 159 EXERCISES 161 3 PREFACE Entia non sunt multiplicanda praeter necessitatem William of Ockham This grammar is intended primarily for use in the first year of university study under the guidance of a teacher who can describe the classic problems in greater detail, add current alternative explanations for phenomena, help the student parse and understand the many textual illustrations found throughout, and provide supplementary information about the history of the language and the culture of early Mesopotamia.
    [Show full text]
  • Clarifying 'Blur': Paradigms, Defaults and Inflectional Classes
    Inflectional Identity OXFORD STUDIES IN THEORETICAL LINGUISTICS GENERAL EDITORS:DavidAdger,Queen Mary, University of London;Hagit Borer, University of Southern California. ADVISORY EDITORS:StephenAnderson, Yale University; Daniel Büring, University of California, Los Angeles;NomiErteschik-Shir,Ben-Gurion University; Donka Farkas, University of California, Santa Cruz;AngelikaKratzer,Univer- sity of Massachusetts, Amherst;AndrewNevins,Harvard University;Christopher Potts, University of Massachusetts, Amherst;BarrySchein,University of Southern California;PeterSvenonius,University of Tromsø;MoiraYip,University College London. RECENT TITLES 10 The Syntax of Aspect: Deriving Thematic and Aspectual Interpretation edited by Nomi Erteschik-Shir and Tova Rapoport 11 Aspects of the Theory of Clitics by Stephen Anderson 12 Canonical Forms in Prosodic Morphology by Laura J. Downing 13 Aspect and Reference Time by Olga Borik 14 Direct Compositionality edited by Chris Barker and Pauline Jacobson 15 A Natural History of Infixation by Alan C. L. Yu 16 Phi Theory: Phi-Features Across Modules and Interfaces edited by Daniel Harbour, David Adger, and Susana Béjar 17 French Dislocation: Interpretation, Syntax, Acquisition by Cécile De Cat 18 Infectional Identity edited by Asaf Bachrach and Andrew Nevins PUBLISHED IN ASSOCIATION WITH THE SERIES The Oxford Handbook of Linguistic Interfaces edited by Gillian Ramchand and Charles Reiss For a complete list of titles published and in preparation for the series, see pp. 367. Inflectional Identity Edited by ASAF BACHRACH
    [Show full text]
  • Evolution of Case in Ossetic*
    Iran and the Caucasus 14 (2010) 287-322 Evolution of Case in Ossetic * Oleg Belyaev Moscow State University Abstract Ossetic sets itself apart from the other New Iranian languages by having a relatively elaborate system of nine cases. Since most of them are relatively late innovations, and only four cases (Nom., Gen, Abl., and Iness.) can be traced back to Proto-Iranian, many scholars tend to ascribe the development of the case system to Caucasian in- fluence. The exact nature of this influence, however, has never been demonstrated. The aim of this paper is, first, to not only reconstruct the etymologies of Ossetic cases, but also to provide a chronology of how the case system developed. The sec- ond aim pursued here is to give a systematic comparison of the case system of Os- setic with those of the neighbouring languages and to determine if there is any ex- ternal influence on the case system and, if so, what languages this influence came from. I conclude that Ossetic developed from a case system identical to those of Khotanese and Sogdian towards the present state under the influence of contact with Georgian and, later, with Turkic and Vaynakh languages. In the process of the discussion, I also argue that two new cases, the Directive and Regressive, are under- going grammaticalisation in contemporary Ossetic. Keywords Ossetic, North-East Iranian languages, Georgian, Vaynakh Languages, Case, Internal Reconstruction, Language Contact, Areal Linguistics §1. INTRODUCTION Compared to other modern Eastern Iranian languages, which usually have case inventories ranging from two to four cases, Ossetic has a strikingly elaborate system of nine cases (eight in Digor).
    [Show full text]
  • Unimorph Schema)
    The Composition and Use of the Universal Morphological Feature Schema (UniMorph Schema) John Sylak-Glassman Center for Language and Speech Processing Johns Hopkins University [email protected] June 2, 2016 c John Sylak-Glassman ** Working Draft, v. 2 ** Contents 1 Introduction 3 2 Overview 4 3 Construction Methodology 5 3.1 Guiding Principles . .5 3.2 Constructing the Schema . .6 4 Annotation Formatting Guidelines 7 5 Dimensions of Meaning and Features 8 5.1 Aktionsart . .8 5.2 Animacy . 10 5.3 Argument Marking . 12 5.4 Aspect . 13 5.5 Case . 15 5.5.1 Core Case . 15 5.5.2 Non-Core, Non-Local Case . 16 5.5.3 Local Case . 18 5.6 Comparison . 20 5.7 Definiteness . 21 5.8 Deixis . 22 5.8.1 Distance . 22 5.8.2 Reference Point . 22 5.8.3 Visibility . 23 5.8.4 Verticality . 23 5.8.5 Summary . 24 1 5.9 Evidentiality . 24 5.10 Finiteness . 26 5.11 Gender and Noun Class . 27 5.12 Information Structure . 28 5.13 Interrogativity . 29 5.14 Language-Specific Features . 29 5.15Mood ............................................ 30 5.16 Number . 34 5.17 Part of Speech . 36 5.18 Person . 40 5.19 Polarity . 42 5.20 Politeness . 42 5.20.1 Speaker-Referent Axis . 43 5.20.2 Speaker-Addressee Axis . 44 5.20.3 Speaker-Bystander Axis . 44 5.20.4 Speaker-Setting Axis . 45 5.20.5 Politeness Features . 45 5.21 Possession . 46 5.22 Switch-Reference . 49 5.23 Tense . 53 5.24 Valency . 55 5.25 Voice .
    [Show full text]
  • Creole Genesis and Universality: Case, Word Order, and Agreement Gerald Taylor Snow Brigham Young University
    Brigham Young University BYU ScholarsArchive All Theses and Dissertations 2017-03-01 Creole Genesis and Universality: Case, Word Order, and Agreement Gerald Taylor Snow Brigham Young University Follow this and additional works at: https://scholarsarchive.byu.edu/etd Part of the Linguistics Commons BYU ScholarsArchive Citation Snow, Gerald Taylor, "Creole Genesis and Universality: Case, Word Order, and Agreement" (2017). All Theses and Dissertations. 6338. https://scholarsarchive.byu.edu/etd/6338 This Thesis is brought to you for free and open access by BYU ScholarsArchive. It has been accepted for inclusion in All Theses and Dissertations by an authorized administrator of BYU ScholarsArchive. For more information, please contact [email protected], [email protected]. Creole Genesis and Universality: Case, Word Order, and Agreement Gerald Taylor Snow A thesis submitted to the faculty of Brigham Young University in partial fulfillment of the requirements for the degree of Master of Arts Deryle W. Lonsdale, Chair Dallin D. Oaks Heather Willson-Sturman Department of Linguistics and English Language Brigham Young University Copyright © 2017 Gerald Taylor Snow All Rights Reserved ABSTRACT Creole Genesis and Universality: Case, Word Order, and Agreement Gerald Taylor Snow Department of Linguistics and English Language, BYU Master of Arts The genesis of creole languages is important to the field of linguistics for at least two reasons. As newly emerging languages, creoles provide a unique window on the human language faculty and on the development of language generally (Veenstra 2008). They also offer insight into what are arguably universal linguistic structures. Two opposing theories have been in contention in the literature with respect to creole genesis: (1) that creoles owe their origin to the lexifier and substrate languages of their speech community and to other environmental influences (McWhorter 1997); and alternatively, (2) that universal innate linguistic structures or principles are the generative source of creole grammar (Bickerton 1981).
    [Show full text]
  • Common Declensions and Cases
    COMMON DECLENSIONS AND CASES: Modern English is an analytic language. It primarily makes meaning by word order. To show that a word is in the nominative case, (i.e., the word functions as the subject of a clause), modern English speakers put that word in front of the verb. To show that a word is in the accusative case (functioning as a direct objective), modern English speakers put that word after the verb. For instance, The teacher (nominative) graded the tests (accusative/direct object). Word order thus becomes very important in analytic languages. It makes no sense in modern English to write, "The tests graded the teacher." However, Anglo-Saxon (Old English), Latin, Greek, and many other languages are or were synthetic. These languages require that the case or function of each word be visibly marked through inflections or declensions. An inflection, generally speaking, is any change in a word--whether that word is a noun, adjective, verb, adverb, or whatnot. A noun might inflect to show a singular or plural meaning, for instance. Inflections in verbs fall into patterns called conjugations. When you conjugate a verb, you show all its inflections for a particular tense and mood, for instance. Declensions are inflections to words that show what their case is. (The term case simply means what part of speech a word and its modifiers are in--for instance, subject, object, etc.). NB: You always conjugate verbs. You always decline nouns and their modifiers. Grammarians will make fun of you if you talk about "conjugating nouns" or "declining verbs." In synthetic languages, word order does not matter.
    [Show full text]
  • Functive Phrases in Typological and Diachronic Perspective
    John Benjamins Publishing Company This is a contribution from Studies in Language 38:3 © 2014. John Benjamins Publishing Company This electronic file may not be altered in any way. The author(s) of this article is/are permitted to use this PDF file to generate printed copies to be used by way of offprints, for their personal use only. Permission is granted by the publishers to post this file on a closed server which is accessible only to members (students and faculty) of the author’s/s’ institute. It is not permitted to post this PDF on the internet, or to share it on sites such as Mendeley, ResearchGate, Academia.edu. Please see our rights policy on https://benjamins.com/#authors/rightspolicy For any other use of this material prior written permission should be obtained from the publishers or through the Copyright Clearance Center (for USA: www.copyright.com). Please contact [email protected] or consult our website: www.benjamins.com Functive phrases in typological and diachronic perspective Denis Creissels University of Lyon This paper is concerned with a particular semantic type of noun phrases in secondary predicate function, illustrated by as a negotiator in He was sent as a negotiator. It explores the cross-linguistic regularities in the polysemy patterns characteristic of the markers and constructions having the expression of this meaning as one of their possible uses, and discusses the grammaticalization paths in which they are involved. Keywords: functive, essive, grammaticalization, secondary predicate, polysemy 1. Introduction The combination of a noun phrase or adposition phrase with a verb is not always designed to express a participant in an event encoded by the verb.
    [Show full text]
  • Comparative Constructions: an Introduction
    Comparative Constructions: An Introduction Yvonne Treis CNRS-LLACAN 1. Preliminaries Comparison is a mental act by which two or more items are examined in order to assess similarities and differences between them. The comparison can be made with regard to a certain gradable, one- dimensional property, and the items are then assigned a position on a predicative scale. This mental act of comparison finds its linguistic encoding in comparison constructions, especially comparative constructions for the expression of comparison of inequality or equative constructions for the expression of comparison of equality. The assessment that is made with regard to a multi- faceted notion (e.g. manner) finds its linguistic correlate in similative constructions. The linguistic literature has especially been concerned with comparison of inequality and comparative constructions, such as the English sentences in (1). (1) Mary is tall -er than Peter Comparee Parameter Parameter/Degree Marker Standard Marker Standard Mary is more intelligent than Peter Comparee Parameter/Degree Marker Parameter Standard Marker Standard Comparative constructions are also the focus of this special issue, which evolved out of a two-day workshop titled “Comparative and Superlative Constructions: Typology and Diachrony” convened on 16-17 June 2015 at the Vrije Universiteit Amsterdam and organised by Lourens de Vries (Vrije Universiteit Amsterdam) and Katarzyna I. Wojtylak (James Cook University). The workshop aimed at taking a look at comparative constructions from a different angle – namely from the perspective of field linguistics. When attempting to analyse comparative constructions in a little- known minority language, a fieldworker who has built up a multi-purpose corpus for a grammatical description of a language may, firstly, be confronted with difficulties in finding correlates to comparative constructions in other languages – often because the languages does not have dedicated, grammaticaliZed constructions for the expression of comparison.
    [Show full text]