Dictionary of Paraphrases of Czech Complex Predicates with Light Verbs

Total Page:16

File Type:pdf, Size:1020Kb

Dictionary of Paraphrases of Czech Complex Predicates with Light Verbs ParaDi: Dictionary of Paraphrases of Czech Complex Predicates with Light Verbs Petra Barancˇ´ıkova´ and Vaclava´ Kettnerova´ Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, Charles University, Malostranske´ nam´ estˇ ´ı 25, 118 00, Praha, Czech Republic, [email protected], [email protected] Abstract kiss’ vs. va´snivˇ e/nˇ eˇznˇ e/letmo/*manˇ zelsky/*mˇ a-´ jove/*smrtelnˇ eˇ pol´ıbit ‘kiss passionately/tender- We present a new freely available dic- ly/fleetingly/*marriagely/*Mayly/?fatally’. Easier tionary of paraphrases of Czech complex modification of CPs is usually considered as the predicates with light verbs, ParaDi. Can- main motivation for their widespread use (Brinton didates for single predicative paraphrases and Akimoto, 1999). of selected complex predicates have been In this paper, we present ParaDi, a dictionary extracted automatically from large mono- of single predicative verb paraphrases of Czech lingual data using word2vec. They have CPs. We restricted the dictionary only to CPs been manually verified and further refined. that consist of light verbs and predicative nouns, We demonstrate one of many possible ap- which represent the most frequent and central type plications of ParaDi in an experiment with of CPs in the Czech language. improving machine translation quality. ParaDi was built on a semi-automatic basis. First, candidates for single verb paraphrases of 1 Introduction selected CPs have been automatically identified Multiword expressions (MWEs) pose a serious in large monolingual data using word2vec, a shal- challenge for both foreign speakers and many NLP low neural network. The list of these candidates tasks (Sag et al., 2002). From various multiword has been then manually checked and further re- expressions, those that involve verbs are of great fined. In many cases, if CPs are to be correctly significance as verbs represent the syntactic center paraphrased by the identified single predicative of a sentence. verbs, these verbs require certain semantic and/or In this paper, we focus on one particular type syntactic modifications. of Czech multiword expressions – on complex It has been widely acknowledged that many predicates with light verbs (CPs). CPs consist NLP applications – let us mention, e.g. informa- of a light verb and another predicative element tion retrieval (Wallis, 1993), question answering, – a predicative noun, an adjective, an adverb or machine translation (Madnani and Dorr (2013); a verb; the pairs function as single predicative Callison-Burch et al. (2006); Marton et al. (2009)) units. As such, most CPs have their single pred- or machine translation evaluation (Kauchak and icative counterparts by which they can be para- Barzilay (2006); Zhou et al. (2006); Barancˇ´ıkova´ phrased, e.g. the CPs dat´ polibek and dat´ pusu et al. (2014)) – can benefit from paraphrases. ‘give a kiss’ can be both paraphrased by pol´ıbit Here we show how the dictionary providing ‘to kiss’. high quality data can be integrated into an ex- In contrast to their single predicative para- periment with improving statistical machine trans- phrases, CPs manifest much greater flexibility lation quality. If translated separately, CPs of- in their modification, c.f. adjectival modifiers ten cause errors in machine translation. In our of the CP dat´ polibek ‘give a kiss’ and the cor- experiment, we use the dictionary to simplify responding adverbial modifiers of its single verb Czech source sentences before translation by re- paraphrase pol´ıbit ‘to kiss’ in dat´ va´snivˇ y/n´ eˇz-ˇ placing CPs with their respective single predica- ny/letm´ y/man´ zelskˇ y/m´ ajov´ y/smrt´ ´ıc´ı polibek ‘give tive verb paraphrases. Human annotators have a passionate/tender/fleeting/marriage/May/fatal evaluated quality of the translated simplified sen- 1 Proceedings of the 13th Workshop on Multiword Expressions (MWE 2017), pages 1–10, Valencia, Spain, April 4. c 2017 Association for Computational Linguistics tences higher than of the original sentences con- combined with information on syntactic and/or tain CPs. semantic properties of CPs are employed (e.g. This paper is structured as follows. First, related Bannard (2007), Fazly et al. (2005)). The auto- work on CPs generally and on their paraphrases is matic detection benefits especially from parallel introduced (Section 2). Second, the paraphrasing corpora representing valuable sources of data in model for CPs is thoroughly described, especially which CPs can be automatically recognized via the selection of CPs, an automatic extraction of word alignment, see e.g. (Chen et al., 2015), candidates for their paraphrases and their manual (de Medeiros Caseli et al., 2010), (Sinha, 2009), evaluation (Section 3). Third, the resulting data (Zarrießand Kuhn, 2009). and the structure of the lexical space of the dictio- Work on paraphrasing CPs is still not exten- nary are discussed (Section 4). Finally, in order to sive. A paraphrasing model has been proposed present one of many practical applications of this within the Meaning Text Theory(Zolkovskijˇ and ↔ dictionary, a random sample of paraphrases from Mel’cuk,ˇ 1965). Its representation of CPs by the ParaDi dictionary is used in a machine trans- means of lexical functions and rules applied in lation experiment (Section 5). the paraphrasing model are thoroughly described in (Alonso Ramos, 2007). Further, Fujita et al. 2 Related Work (2004) present a paraphrasing model which takes A theoretical research on CPs with light verbs has advantage of semantic representation of CPs by a long history, which can be traced back to Jes- lexical conceptual structures. Similarly as our pro- persen (1965). An ample literature devoted to this posed dictionary of paraphrases, this model also language phenomenon so far is characterized by takes into account changes in the grammatical cat- an enormous diversity in used terms and analyses, egory of voice and changes in morphological cases see esp. (Amberber et al., 2010) and (Alsina et al., of arguments, which have appeared to be highly 1997). Here we use the term CP with the light verb relevant for the paraphrasing task. for a collocation within which the verb – not re- taining its full semantic content – provides rather 3 Paraphrase Model grammatical functions (incl. syntactic structure) In this section, the process of paraphrase extrac- and to which individual semantic properties are tion is described in detail. First, we present the se- primarily contributed by the noun (Algeo, 1995). lection of CPs (Section 3.1). For their paraphras- The information on CPs is a part of several ing, we had initially intended to use some of exist- lexical resources containing manually annotated ing sources of paraphrases, however, they turned data. For instance, CPs are represented in syn- out to be completely unsatisfactory for our task.4 tactically rich annotated corpora from the family Word2vec is a group of shallow neural networks of the Prague Dependency Treebanks: the Prague generating representations of words in a continu- 1 Dependency Treebank 3.0 (PDT) and the Prague ous vector space depending on contexts they ap- 2 Czech-English Dependency Treebank 2.0 , see pear in (Mikolov et al., 2013). In line with dis- (Bejcekˇ et al., 2013) and (Hajicˇ et al., 2012). Fur- tributional hypothesis (Harris, 1954), semantically ther, the PropBank3 project has been recently en- hanced with the information on CPs; the anno- 4We used the ParaPhrase DataBase (PPDB), (Ganitke- tation scheme of CPs in PropBank is thoroughly vitch and Callison-Burch, 2014; Ganitkevitch et al., 2013) the largest paraphrase database available for the Czech lan- described in (Hwang et al., 2010). Finally, the guage. PPDB has been created automatically from large par- Hungarian corpus of CPs based on the data from allel data and it comes in several sizes ranging from S to the Szeged Treebank has been built (Vincze and XXL. However, the bigger its size, the bigger the amount of noise. We chose the size L as a reasonable trade-off between Csirik, 2010). quality and quantity. We combined the phrasal paraphrases, At present, one of trending topics in NLP many-to-one and one-to-many. We lemmatized and tagged the collection of PPDB using the state-of-the-art POS tagger community is an automatic identification of CPs. Morphodita (Strakova´ et al., 2014). Even though this collec- In this task, various statistical measures often tion contains almost 400k lemmatized paraphrases in total, it contained only 54 candidates for single predicative verb para- 1 http://ufal.mff.cuni.cz/pdt3.0 phrases of CP. Only 2 of these 45 candidates these candidates 2http://ufal.mff.cuni.cz/pcedt2.0/en/ have been detected correctly, the rest was noise in PPDB. As index.html a result, we chose not to use parallel data in our task but we 3https://verbs.colorado.edu/˜mpalmer/ have adopted another approach applying word2vec, a neural projects/ace.html network based model to large monolingual data. 2 similar words are mapped close to each other CPs Verbs Nouns (measured by the cosine similarity) so we can ex- First dataset 726 49 612 pect CPs and their single verb paraphrases to have Second dataset 1640 126 699 similar vector space distribution. Union 2257 154 1061 Word2vec computes vectors for single tokens. As CPs represent MWEs, their preprocessing was Table 1: The number of unique CPs, light verbs necessary: CPs have to be first identified and con- and predicative nouns from two datasets. Their nected into a single token (Section 3.2). union has been used in the paraphrase extraction Particular settings of our model for an auto- task. matic extraction of candidates for single predica- Corpus Sentences Tokens tive verb paraphrases are presented in Section 3.3. CNK2000 2.78 121.81 Finally, a manual evaluation of the extracted can- CNK2005 7.95 122.99 didates, including their further annotation with CNK2010 8.18 122.48 semantic and syntactic information, is described Czeng 1.0 14.83 206.05 (Section 3.4).
Recommended publications
  • Cross-Disciplinary Collaboration for Teaching Colors in Menominee
    The Complexity of Simple Things: Cross-disciplinary Collaboration for Teaching Colors in Menominee Monica Macaulay Linguistics Department Rita MacDonald School of Education 3/9/17 UNIVERSITY OF WISCONSIN 1 Who we are • Language documentation, • Applied linguistics, SLA description, analysis and TESOL • Language teacher & teacher educator Rita Monica Menominee Language and Culture Commission 3/9/17 UNIVERSITY OF WISCONSIN 2 3/9/17 UNIVERSITY OF WISCONSIN 3 Menominee Language • Algonquian language of Wisconsin • Documented 1921-1949 by Leonard Bloomfield • MM: working with community since 1998 3/9/17 UNIVERSITY OF WISCONSIN 4 Menominee Language Revitalization Current status: • Fewer than 5 L1 speakers, all elderly • Small number of proficient L2 speakers • No external communiMes of speakers • 2016-present: Tribal program to train teachers 1. to speak Menominee (14 months) 2. to become teachers for pre-school immersion 3/9/17 UNIVERSITY OF WISCONSIN 5 Menominee Language Revitalization Current status: • Fewer than 5 L1 speakers, all elderly • Small number of proficient L2 speakers • No external communiMes of speakers • 2016-present: Tribal efforts to train teachers 1. to speak Menominee (14 months) 2. to become teachers for pre-school immersion COLORS!! 3/9/17 UNIVERSITY OF WISCONSIN 6 Past Attempt: CL’s lesson § doctoral student in Included sentences like these Curriculum and Instruction for me to translate: § interested in intergenerational • What color do you see? transmission of language • I see orange. § no training in linguistics • What’s your favorite color? § some SLA training • My favorite color is blue. § had idea of lesson on colors as • Touch someone wearing red. sample lesson for teachers • Touch someone wearing a red shirt.
    [Show full text]
  • The Japanese Government Project for Machine
    THE JAPANESE GOVERNMENTPROJECT FOR MACHINE TRANSLATION Makoto Nagao, Jun-ichi Tsujii, and Jun-ichi Nakamura Department of Electrical Engineering Kyoto University Sakyou-ku, Kyoto, Japan 606 1 OUTLINE OF THE PROJECT post-editing, access to grammar rules, and dictionary maintenance. The project is funded by a grant from the Agency of The project is not primarily concerned with the devel- Science and Technology through the Special Coordi- opment of a final practical system; that will be developed nation Funds for the Promotion of Science and Technol- by private industry using the results of this project. ogy, and was started in fiscal 1982. The formal title of Technical know-how is already being transferred gradu- the project is "Research on Fast Information Services ally to private enterprise through the participation in the between Japanese and English for Scientific and Engi- project of people from industry. Software and linguistic neering Literature". The purpose is to demonstrate the data are also being transferred in part. Finally, complete feasibility of machine translation of abstracts of scientific technical transfer will be done under the proper condi- and engineering papers between the two languages, and tions. as a result, to establish a fast information exchange The Japanese source texts being used are abstracts of system for these papers. The project term was initially scientific and technical papers published in the monthly scheduled as three years from the fiscal year of 1982 JICST journal d Current Bibliography of Science and with a budget of about seven hundred million yen, but, Technology. At present, the project is only processing due to the present financial pressures on the government, texts in the electronics, electrical engineering, and the term has been extended to four years, up to 1986.
    [Show full text]
  • Development of Copula and Auxiliary BE in Children with Specific Language Impairment and Younger Unaffected Controls*
    F;rsr Langnoge, 22 (2002). 137-172. Printed in England Development of copula and auxiliary BE in children with Specific Language Impairment and younger unaffected controls* KATE L. JOSEPH LUDOVICA SERRATRICE University of Mancltester GINA CONTI-RAMSDEN I ABSTRACT Children with Specific Language Impairment (SLI) have well- known difficulties with the obligatory use of verbal tense-marking morphology which they use optionally for a longer period of time than typically developing children. Copula BE and auxiliary BE are two tense markers that have been shown to be problematic for children with SLI, but their status as independent allomorphs is somewhat controversial in the literature. In the present study we argue for copula BE and auxiliary BE as two separate tense markers showing different developmental curves both for children with SLI and for younger unaffected controls. It is suggested that copular versus auxiliary status, morphological form, and the frequency of the lexical construction containing BE all affect children’s provision of copular and auxiliary forms. Implications for identifying variables constraining optionality are discussed. INTRODUCTION Children with Specific Language Impairment (SLI) have long been * This research was supported by the Economic and Social Research Council, ESRC grant to Gina Conti-Ramsden (grant number R000237767). We would like to thank Rachel F. Hick for help with data collection, Brian Faragher for help with statistical analyses, and Kevin Durkin, Stan Kuczaj and an anonymous reviewer for their helpful comments on a previous draft. Finally, our special thanks go to the three children with SLI and their families who participated in the study. Address for correspondence: Human Communication and Deafness, School of Education, University of Manchester, Oxford Road, Manchester M13 9PL, UK.
    [Show full text]
  • English Predication and Semantic Roles
    DISCOURSE AS A FORM OF MULTICULTURALISM IN LITERATURE AND COMMUNICATION SECTION: LANGUAGE AND DISCOURSE ARHIPELAG XXI PRESS, TÎRGU MUREȘ, 2015, ISBN: 978-606-8624-21-1 ENGLISH PREDICATION AND SEMANTIC ROLES Claudia Leah Assist. Prof., PhD, University of Oradea Abstract: This paper aims at highlighting some aspects regarding the English predication and the semantic roles, seen from complexity perspective. In doing so, it starts from verbs and types of predicates and continues with the semantic roles involved in a verbal relation. A clear distinction between different roles is required as each type is worth not only mentioning, but also debating and describing. Keywords: relations, semantics, predication, role As in any other language, the English predicate is considered to be the main part of the sentence that says something about the subject, showing what, who or how the subject is. Birds fly./ John is a doctor. John is happy. According to its abilities and characteristics the predicate may be: verbal or nominal. In its turn, verbal predicate is of two kinds: simple verbal predicate, expressed by a verb or a phrasal verb in a personal mood: The child eats an apple every day. They were taken care of by John. compound modal verbal predicate, made of: a semiauxiliary modal verb (to express a modal action feature: necessity, possibility, desire, etc.) and a basic verb, usually an infinitive: You must see a doctor. They can give the answer. We could have come if they had been invited. The most common semiauxiliary modal verbs are: can, could, may, might, must, to have to etc.
    [Show full text]
  • Subject-Predicate Sentence Structure
    International Journal of Applied Linguistics & English Literature ISSN 2200-3592 (Print), ISSN 2200-3452 (Online) Vol. 5 No. 7; December 2016 [Special Issue on Language and Literature] Flourishing Creativity & Literacy Australian International Academic Centre, Australia The Chinese and Kazakh Languages Comparative Study: Subject-Predicate Sentence Structure Nurhalyk Abdurakyn Al-Farabi Kazakh National University, Kazakhstan Alina Nurzhayeva Al-Farabi Kazakh National University, Kazakhstan Anar Mustafayeva (Corresponding author) Al-Farabi Kazakh National University, Kazakhstan E-mail: [email protected] Dariga Kokeyeva Al-Farabi Kazakh National University, Kazakhstan Kaldybay Kydyrbayev Al-Farabi Kazakh National University, Kazakhstan Received: 14-08-2016 Accepted: 17-10-2016 Advance Access Published: November 2016 Published: 10-12-2016 doi:10.7575/aiac.ijalel.v.5n.7p.197 URL: http://dx.doi.org/10.7575/aiac.ijalel.v.5n.7p.197 Abstract This article compares subject-predicate sentence and subject-predicate-object sentence forms with extended sentence of the Kazakh language. It compares Chinese and Kazakh sentences with subject-predicate sentence structure and studies differences of verb-predicate sentences word order. Detailed comparative study of Chinese and Kazakh language differences and characteristics. Morphology structure of Chinese language belongs to radical language, and Kazakh language belongs to the type of adhesive language, syntax of Chinese and Kazakh Languages in the same syntactic structure often used in different ways, the morphological role in Chinese and Kazakh language is also not the same, and even the same kind of grammatical means in Chinese and Kazakh languages are various. According to the analysis of languages, the different positions of word order, the function of words and grammatical word order are different too.
    [Show full text]
  • More Thoughts on the Communicative Function of the English Verb
    JAN FIRBAS MORE THOUGHTS ON THE COMMUNICATIVE FUNCTION OF THE ENGLISH VERB The present treatise forms the second instalment of a study dealing with some aspects of the shift from verbal to nominal expression within the structure of the English language. The first instalment entitled Thoughts on the Communicative Function of the Verb in English, German and Czech (Brno Studies in English I, Prague 1959, pp. 39—63) has provided a wider setting for the problems to be dealt with in the present paper. It has shown that in all the three examined languages the verb ranks below the noun in that it displays a definitely lower frequency as conveyer of the rheme proper.1 In all the three languages this detracts from the communicative value of the verb and promotes the shift towards nominal expres­ sion. The present study sets out to inquire into the shift from verbal to nominal expression in greater detail. It focusses its attention on English, constantly com­ paring it with Czech. This comparison will make it possible to ascertain the qualitative differences the two languages display in regard to the shift towards nominal expression. As the starting, or rather zero, point from which the shift may be traced, it is proposed to regard the highest degree of communicative value a verb can possibly attain. The degree of communicative value of the verb depends on the total amount of functions the verb can perform in a sentence at the given moment of communi­ cation (spoken or written).2 These functions can be specified as (i) grammatical, (ii) semantic, and as (iii) those performed within functional sentence per­ spective.3 Any disengagement of the verb from a function it could as a verb perform weakens its communicative value and contributes towards the shift away from verbal expression.
    [Show full text]
  • I. Introduction
    I. INTRODUCTION The aim of this work is to analyze the differences and similarities between English and Czech predications with regards to their level of semantic dynamism. First of all, I will concentrate on the whole concept of dynamism in semantics. This includes the general characteristics as well as the concrete realization of the category in English and Czech. I will then describe the characteristic features of predicates and their categorical division according to the grammatical and semantic criteria. This division was made presumably on the ground of the book A Functional Analysis of Present Day English on a General Linguistic Basis by Vilém Mathesius. I have tried to describe the characteristics of predications in English and compare them to predications in Czech. In this linguistic analysis of English and Czech predications I have concentrated especially on the major differences between them as English and Czech are not closely related language. This analysis will serve as the basis for my further study. I will then explain other differences between the two languages as I will concentrate in more detail on the specific features of English that are not typical for Czech, especially sentence condensation and the tendency of English to nominal expression of reality. Another major difference between English and Czech is their system of tenses that I will describe later. English has a very complex system that combine the present, past and future tenses existing in Czech with progressive and perfect aspects. The choice of a concrete type of tense always depends on the dynamism of a concrete situation.
    [Show full text]
  • Two Types of Verb Particle Constructions Mikael Vinka
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by ScholarlyCommons@Penn University of Pennsylvania Working Papers in Linguistics Volume 6 Issue 1 Proceedings of the 23rd Annual Penn Article 24 Linguistics Colloquium 1-1-1999 Two Types of Verb Particle Constructions Mikael Vinka This paper is posted at ScholarlyCommons. http://repository.upenn.edu/pwpl/vol6/iss1/24 For more information, please contact [email protected]. Two Types of Verb Particle Constructions This working paper is available in University of Pennsylvania Working Papers in Linguistics: http://repository.upenn.edu/pwpl/vol6/ iss1/24 Two Types of Verb Particle Constructions* Mikael Vinka 1 Introduction Swedish Verb Particle Constructions (VPCs) are often claimed to be constrained in such a way that the particle must precede the object (Taraldsen 1991, Holmberg & Platzack 1995, Svenonius 1996 etc.). Therefore, (la) and (2a) are well formed, whereas (lb) and (2b) are ill formed. (1) a Kallesatte pa TVn. V Prt Obj Kalle switched on TV.the 'Kalle switched on the TV.' b *Kallesatte TVn pa. *V Obj Prt Kalle switched TV.the on 'Kalle switched the TV on/ (2) a Kalle smutsade ner trojan V Prt Obj Kalle dirtied down shirt.the 'Kalle made the shirt dirty.* b *Kalle smutsade trojan ner *V Obj Prt Kalle dirtied shirt.the down 'Kalle made the shirt dirty.' However, closer examination reveals that (1) and (2) differ in various fine points of syntactic distribution. In (3) and (4) the full DP objects of (1) and (2) are replaced with pronouns. In contrast to (1 b), (3b) is well formed, even though the object precedes the particle.
    [Show full text]
  • Verb Valency Patterns in Academic Register: Syntactic Approach Jolita Horbačauskienė, Saulė Petronienė Abstract
    ISSN 1648-2824 KALBŲ STUDIJOS. 2013. 22 NR. * STUDIES ABOUT LANGUAGES. 2013. NO. 22 Verb Valency Patterns in Academic Register: Syntactic Approach Jolita Horbačauskienė, Saulė Petronienė http://dx.doi.org/10.5755/j01.sal.0.22.4297 Abstract. The article analyses the main characteristics of verb valency patterns and behaviour of verbs in syntactic structures in clauses or sentences. Herbst claims that valency, as well as collocations, can be considered as an area for errors for foreign language learners as they involve idiosyncratic knowledge that has to be learned (2010, p. 225). That is why the studies of the verb valency patterns from the syntactic point of view have an obvious applicability in foreign language studies as it allows an observation of the most common patterns of syntactic verb relations. The paper focuses on verb valency patterns in syntactic structures and aims to analyse these patterns in the academic register. Five types of verb valency patterns (monotransitive, copular, intransitive, complex transitive and ditransitive) employed in the research have been identified according to the number of complements that a predicative verb can take as well as to the type of the compliments. The paper emphasises the awareness of possible types of verb complementation which is significant and useful in the studies of languages since it enables a learner to produce a well articulated speech with grammatical clauses fully conveying the intended meaning. Key words : verb valency patterns, academic register, syntactic approach, passive voice. Introduction sentences constituting predicates which in turn, are “the semantic and structural centre of the clause” (Valeika and The concept of verb valency corresponds to the Butkienė, 2006, p.
    [Show full text]
  • Serbo-Croatian Word Order: a Logical Approach
    Serbo-Croatian Word Order: A Logical Approach Dissertation Presented in Partial Fulfillment of the Requirements for the Degree Doctor of Philosophy in the Graduate School of The Ohio State University By Vedrana Mihaliˇcek,B.A., M.A. Graduate Program in Department of Linguistics The Ohio State University 2012 Dissertation Committee: Carl Pollard, Advisor Brian Joseph Michael White Table of Contents Page List of Tables . iv 1. Introduction . 1 2. Framework . 2 2.1 Background . 2 2.2 Phenogrammar . 5 2.2.1 Terms and Types . 5 2.2.2 Functions . 7 2.3 Tectogrammar . 14 2.3.1 Preliminaries . 14 2.3.2 Representing Inflectional Features . 16 2.3.3 N and NP type families . 17 2.3.4 S type family . 17 2.3.5 ( types . 18 2.3.6 ∏ types . 19 2.4 Semantics . 21 2.4.1 Preliminaries . 21 2.4.2 Entailment . 21 2.4.3 Types . 22 2.5 Putting it all together . 24 2.5.1 Signs . 24 2.5.2 Rules . 26 i 3. Basic Word Order . 28 3.1 Introduction . 28 3.2 Data . 29 3.2.1 Lexical Noun Phrases . 29 3.2.2 Phrasal Noun Phrases . 36 3.2.3 Adverbial Modifiers . 42 3.3 Analysis . 45 3.3.1 Lexical Noun Phrases . 45 3.3.2 Phrasal Noun Phrases . 53 3.3.3 Adverbial Modifiers . 69 3.4 Conclusion . 75 4. Embedding, Predicative and Control . 79 4.1 Introduction . 79 4.2 Embedded Declarative Clauses . 80 4.2.1 Data . 80 4.2.2 Analysis . 83 4.3 Predicatives .
    [Show full text]
  • A Grammar Research Guide for Ngwi Languages
    Language and Culture DigitalResources Documentation and Description 33 A Grammar Research Guide for Ngwi Languages Eric B. Drewry A Grammar Research Guide for Ngwi Languages Eric B. Drewry Azusa Pacific University in cooperation with SIL International—East Asia Group SIL International 2016 SIL Language and Culture Documentation and Description 33 ©2016 SIL International® ISSN 1939-0785 Fair Use Policy Documents published in the Language and Culture Documentation and Description series are intended for scholarly research and educational use. You may make copies of these publications for research or instructional purposes (under fair use guidelines) free of charge and without further permission. Republication or commercial use of a Language and Culture Documentation and Description or the documents contained therein is expressly prohibited without the written consent of the copyright holder. Managing Editor Eric Kindberg Series Editor Lana Martens Content Editor Lynn Frank Copy Editor Sue McQuay Compositor Bonnie Waswick Abstract This grammar research guide describes the range of syntactic variety found in a representative group of well-described Ngwi languages. This overview of syntactic variety should make the guide useful for field linguists preparing to describe any of the forty-eight Ngwi languages that were recognized for the first time in the sixteenth edition of the Ethnologue (Lewis 2009). This is done by giving examples of where and how widely the languages in this group vary even within the typical categories of the Ngwi languages, including sentence introducers, conjunctions, noun types, compounding, derivation, noun particles, postnominal clausal particles, classifiers and numerals, negation, adjectives, pronouns, adverbs, verb types, verb concatenations, preverbal and postverbal slots, verb particles, clause-final and sentence- final particles, simple sentences, compound sentences, and complex sentences.
    [Show full text]
  • Researching of the Relative Syntagmas in the Languages of Different System at the Contemporary Stage (Azerbaijani and English)
    International Journal of Humanities and Social Science Vol. 5, No. 11; November 2015 Researching of the Relative Syntagmas in the Languages of Different System at the Contemporary Stage (Azerbaijani and English) Hashim L. Akbarov PhD candidate Department of Foreign Languages Lankaran State University Jill, Lankaran, Azerbaijan Abstract The article deals with the investigation of syntagma and relative syntagmas in the Azerbaijani and English languages. They are researched as syntactic units of linguistics, classified according to their types and spoken widely of their meanings in the article. This unit consisting of two members of the combined words is used in the functions of determining and determined. It is explained as a semantic-syntactic event, studied and developed its semantic features. It should be noted that signs perform the system of approaches in the form of syntagmatics and paradigmatics. Syntagmatic approaches are based on distributive potentials of signs, their valence, but paradigmatic approaches are based on the selection of definite element of paradigm signs, namely for that reason Saussure considers the morphology as the “sphere of paradigmatics”, but the syntax – the “sphere of syntagmatics”. But a notion of “verticality” and “horizontality” exists even in the row of language signs of classic linguistics. So syntagmatics is explained as a “horizontality”, but paradigmatics as“verticality”. This work is one of the first attempts in this area. Thus, the theoretical and methodological basis of the origin and development of combinatorial linguistics as a fundamental discipline in which we investigate the relative syntagmas (RS), it should be regarded as theoretical works of Azerbaijani and foreign (Western European and American) linguists.
    [Show full text]