Aorist Hindiurdu Sept 001 Épr
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Noun Group and Verb Group Identification for Hindi
Noun Group and Verb Group Identification for Hindi Smriti Singh1, Om P. Damani2, Vaijayanthi M. Sarma2 (1) Insideview Technologies (India) Pvt. Ltd., Hyderabad (2) Indian Institute of Technology Bombay, Mumbai, India [email protected], [email protected], [email protected] ABSTRACT We present algorithms for identifying Hindi Noun Groups and Verb Groups in a given text by using morphotactical constraints and sequencing that apply to the constituents of these groups. We provide a detailed repertoire of the grammatical categories and their markers and an account of their arrangement. The main motivation behind this work on word group identification is to improve the Hindi POS Tagger’s performance by including strictly contextual rules. Our experiments show that the introduction of group identification rules results in improved accuracy of the tagger and in the resolution of several POS ambiguities. The analysis and implementation methods discussed here can be applied straightforwardly to other Indian languages. The linguistic features exploited here are drawn from a range of well-understood grammatical features and are not peculiar to Hindi alone. KEYWORDS : POS tagging, chunking, noun group, verb group. Proceedings of COLING 2012: Technical Papers, pages 2491–2506, COLING 2012, Mumbai, December 2012. 2491 1 Introduction Chunking (local word grouping) is often employed to reduce the computational effort at the level of parsing by assigning partial structure to a sentence. A typical chunk, as defined by Abney (1994:257) consists of a single content word surrounded by a constellation of function words, matching a fixed template. Chunks, in computational terms are considered the truncated versions of typical phrase-structure grammar phrases that do not include arguments or adjuncts (Grover and Tobin 2006). -
1: Scope and Important of Urdu: Urdu Is a Standardized Register of the Hindustani Language It Is the National Language and Ling
1: Scope And Important Of Urdu: Urdu Is A Standardized Register Of The Hindustani Language It Is The National Language And Lingua Franca Of Pakistan, And An Official Language Of Six States Of India. It Is Also One Of The 22 Official Languages Recognized In The Constitution Of India. Urdu Is Historically Associated With The Muslims Of The Northern Indian Subcontinent. Apart From Specialized Vocabulary, Urdu Is Mutually Intelligible With Standard Hindi, Another Recognized Register Of Hindustani. The Urdu Variant Received Recognition And Patronage Under British Rule When The British Replaced The Persian And Local Official Languages With The Urdu And English Languages In The North Indian Regions Of Jammu and Kashmir In 1846 And Punjab In 1849. Urdu, Like Hindi, Is A Form Of Hindustani. It Evolved From The Medieval (6th To 13th Century) Apabhraṃśa Register Of The Preceding Shauraseni Language, A Middle Indo-Aryan Language That Is Also The Ancestor Of Other Modern Languages, Including The Punjabi Dialects. Urdu Developed Under The Influence Of The Persian And Arabic Languages, Both Of Which Have Contributed A Significant Amount Of Vocabulary To Formal Speech around 99% Of Urdu Verbs Have Their Roots In Sanskrit And Prakrit. Although The Word Urdu Itself Is Derived From The Turkic Word Ordu Or Orda, From Which English Horde Is Also Derived, Turkic Borrowings In Urdu Are Minimal And Urdu Is Not Genetically Related To The Turkic Languages. Urdu Words Originating From Chagatai And Arabic Were Borrowed Through Persian And Hence Are Personalized Versions Of The Original Words. For Instance, The Arabic Ta' Marbuta Changes To He Or Te . -
Linguistics Development Team
Development Team Principal Investigator: Prof. Pramod Pandey Centre for Linguistics / SLL&CS Jawaharlal Nehru University, New Delhi Email: [email protected] Paper Coordinator: Prof. K. S. Nagaraja Department of Linguistics, Deccan College Post-Graduate Research Institute, Pune- 411006, [email protected] Content Writer: Prof. K. S. Nagaraja Prof H. S. Ananthanarayana Content Reviewer: Retd Prof, Department of Linguistics Osmania University, Hyderabad 500007 Paper : Historical and Comparative Linguistics Linguistics Module : Indo-Aryan Language Family Description of Module Subject Name Linguistics Paper Name Historical and Comparative Linguistics Module Title Indo-Aryan Language Family Module ID Lings_P7_M1 Quadrant 1 E-Text Paper : Historical and Comparative Linguistics Linguistics Module : Indo-Aryan Language Family INDO-ARYAN LANGUAGE FAMILY The Indo-Aryan migration theory proposes that the Indo-Aryans migrated from the Central Asian steppes into South Asia during the early part of the 2nd millennium BCE, bringing with them the Indo-Aryan languages. Migration by an Indo-European people was first hypothesized in the late 18th century, following the discovery of the Indo-European language family, when similarities between Western and Indian languages had been noted. Given these similarities, a single source or origin was proposed, which was diffused by migrations from some original homeland. This linguistic argument is supported by archaeological and anthropological research. Genetic research reveals that those migrations form part of a complex genetical puzzle on the origin and spread of the various components of the Indian population. Literary research reveals similarities between various, geographically distinct, Indo-Aryan historical cultures. The Indo-Aryan migrations started in approximately 1800 BCE, after the invention of the war chariot, and also brought Indo-Aryan languages into the Levant and possibly Inner Asia. -
Diachrony of Ergative Future
• THE EVOLUTION OF THE TENSE-ASPECT SYSTEM IN HINDI/URDU: THE STATUS OF THE ERGATIVE ALGNMENT Annie Montaut INALCO, Paris Proceedings of the LFG06 Conference Universität Konstanz Miriam Butt and Tracy Holloway King (Editors) 2006 CSLI Publications http://csli-publications.stanford.edu/ Abstract The paper deals with the diachrony of the past and perfect system in Indo-Aryan with special reference to Hindi/Urdu. Starting from the acknowledgement of ergativity as a typologically atypical feature among the family of Indo-European languages and as specific to the Western group of Indo-Aryan dialects, I first show that such an evolution has been central to the Romance languages too and that non ergative Indo-Aryan languages have not ignored the structure but at a certain point went further along the same historical logic as have Roman languages. I will then propose an analysis of the structure as a predication of localization similar to other stative predications (mainly with “dative” subjects) in Indo-Aryan, supporting this claim by an attempt of etymologic inquiry into the markers for “ergative” case in Indo- Aryan. Introduction When George Grierson, in the full rise of language classification at the turn of the last century, 1 classified the languages of India, he defined for Indo-Aryan an inner circle supposedly closer to the original Aryan stock, characterized by the lack of conjugation in the past. This inner circle included Hindi/Urdu and Eastern Panjabi, which indeed exhibit no personal endings in the definite past, but only gender-number agreement, therefore pertaining more to the adjectival/nominal class for their morphology (calâ, go-MSG “went”, kiyâ, do- MSG “did”, bola, speak-MSG “spoke”). -
Lith. Mir̃ti / Latv. Mirt ‘To Die’ and Lith
YOKO YAMAZAKI Stockholm University, University of Zurich Fields of research: Baltic linguistics, historical linguistics, Indo-European comparative linguistics. DOI: doi.org/10.35321/all83-01 LITH. MIR̃TI / LATV. MIRT ‘TO DIE’ AND LITH. MIR̃ŠTI / LATV. MÌRST ‘TO FORGET’ IN EAST BALTIC1 Liet. miti / latv. mirt ‘mirti’ ir liet. mišti / latv. mrst ‘pamiršti’ rytų baltų kalbose ANNOTATION The verbs Lith. miti / Latv. mirt ‘to die’ and Lith. (-)mišti / Latv. (-)mrst ‘to forget’ share several features in historical morphology: both take sta-present stem, in spite of their Indo-European cognates in the *-ye/o- present stem; the root-aorist in the middle voice inflection can be reconstructed in PIE; and both are also semantically middle. However, they are contrastive in the past tense in Baltic, taking different preterit stems, i.e., Lith. mrė / Latv. miru(ē) and Lith. mišo / Latv. mrsu(ā). This article will investigate what led them to choose the different preterit stems by comparing their semantic and phonological properties, and will contribute to the reconstruction of the entire prehistory of the Baltic preterit system. In this article, it will be proposed that Lith. mrė / Latv. miru(ē) is probably descended from the older imperfect, while its aoristic nature led Lith. mišo / Latv. mrsu(ā) to inherit the older aorist stem, and this historical difference may be reflected in their different preterit stems. 1 This article presents some of the results of a project entitled The prehistory of the Baltic preterit system – diachronic changes and morphosemantics (reg. Nr. 2018-00473), financed by the Swedish Research Council (Vetenskapsrådet). I am thankful to generous financial support from the council, and to Dr. -
Indo-European Linguistics: an Introduction Indo-European Linguistics an Introduction
This page intentionally left blank Indo-European Linguistics The Indo-European language family comprises several hun- dred languages and dialects, including most of those spoken in Europe, and south, south-west and central Asia. Spoken by an estimated 3 billion people, it has the largest number of native speakers in the world today. This textbook provides an accessible introduction to the study of the Indo-European proto-language. It clearly sets out the methods for relating the languages to one another, presents an engaging discussion of the current debates and controversies concerning their clas- sification, and offers sample problems and suggestions for how to solve them. Complete with a comprehensive glossary, almost 100 tables in which language data and examples are clearly laid out, suggestions for further reading, discussion points and a range of exercises, this text will be an essential toolkit for all those studying historical linguistics, language typology and the Indo-European proto-language for the first time. james clackson is Senior Lecturer in the Faculty of Classics, University of Cambridge, and is Fellow and Direc- tor of Studies, Jesus College, University of Cambridge. His previous books include The Linguistic Relationship between Armenian and Greek (1994) and Indo-European Word For- mation (co-edited with Birgit Anette Olson, 2004). CAMBRIDGE TEXTBOOKS IN LINGUISTICS General editors: p. austin, j. bresnan, b. comrie, s. crain, w. dressler, c. ewen, r. lass, d. lightfoot, k. rice, i. roberts, s. romaine, n. v. smith Indo-European Linguistics An Introduction In this series: j. allwood, l.-g. anderson and o.¨ dahl Logic in Linguistics d. -
Grammar of Urdu Or Hindustani.Pdf
OF THE URDU OR HINDUSTANI LANGUAGE. BY THE SAME AUTHOR. Crown 8vo, cloth, price 2s. 6d. HINDUSTANI EXERCISES. A Series of Passages and Extracts adapted for Translation into Hindustani. Crown 8vo, cloth, price 7s. IKHWANU-9 SAFA, OR BROTHERS OF PURITY. Translated from the Hindustani. "It has been the translator's object to adhere as closely as possible to the original text while rendering the English smooth and intelligible to the reader, and in this design he has been throughout successful." Saturday Review. GRAMMAR URDU OR HINDUSTANI LANGUAGE. JOHN DOWSON, M.R.A.S., LAT5 PROCESSOR OF HINDCSTANI, STAFF COLLEGE. Cfjtrfl (SBitton. LONDON : KEGAN PAUL, TRENCH, TRUBNER & CO. L DRYDEN HOUSE, GERRARD STREET, W. 1908. [All riijhts reserved.] Printed by BALLANTYNK, HANSOM &* Co. At the Ballantyne Press, Edinburgh TABLE OF CONTENTS. PACK PREFACE . ix THE ALPHABET 1 Pronunciation . .5,217 Alphabetical Notation or Abjad . 1 7 Exercise in Reading . 18 THE AKTICLE 20 THE Nora- 20 Gender 21 Declension. ...... 24 Izafat 31 THE ADJECTIVE 32 Declension ...... 32 Comparison ... 33 PRONOUNS Personal. ...... 37 Demonstrative ...... 39 Respectful 40 Reflexive 41 Possessive 41 Relative and Correlative . .42 Interrogative . 42 Indefinite ....... 42 Partitive . 43 Compound. .43 VERB 45 Substantive and Auxiliary 46 Formation of . 46 Conjugation of Neuter Verbs . .49 Active Verbs . 54 Irregulars . .57 Hona 58 Additional Tenses . 60 2004670 CONTENTS. VERB (continued) Passive Verb ....... 62 Formation of Actives and Causals ... 65 Nominals 69 Intensives ..... 70 Potentials Completives . .72 Continuatives .... Desideratives ..... 73 Frequentatives .... 74 Inceptives . 75 Permissives Acquisitives . .76 Reiteratives ..... 76 ADVKRBS ......... 77 PREPOSITIONS 83 CONJUNCTIONS 90 INTERJECTIONS ....... 91 NUMERALS ......... 91 Cardinal 92 Ordinal 96 Aggregate 97 Fractional 97 Ralcam 98 Arabic 99 Persian 100 DERIVATION 101 Nouns of Agency . -
Development and Analysis of Verb Frame Lexicon for Hindi
Linguistics and Literature Studies 5(1): 1-22, 2017 http://www.hrpub.org DOI: 10.13189/lls.2017.050101 Development and Analysis of Verb Frame Lexicon for Hindi Rafiya Begum*, Dipti Misra Sharma Language Technology and Research Center, India Copyright©2017 by authors, all rights reserved. Authors agree that this article remains permanently open access under the terms of the Creative Commons Attribution License 4.0 International License Abstract A verb frame (VF) captures various syntactic relatively flexible word order [2,3]. There is a debate in the distributions where a verb can be expected to occur in a literature whether the notions subject and object can at all be language. The argument structure of Hindi verbs (for various defined for ILs [4]. Behavioral properties are the only criteria senses) is captured in the verb frames (VFs). The Hindi verbs based on which one can confidently identify grammatical were also classified based on their argument structure. The functions in Hindi [5]; Marking semantic properties such as main objective of this work is to create a linguistic resource thematic roles as dependency relations is problematic too. of Hindi verb frames which would: (i)Help the annotators in Thematic roles are abstract notions and require higher the annotation of the dependency relations for various verbs; semantic features which are difficult to formulate and to (ii)Prove to be useful in parsing and for other Natural extract. Therefore, a grammatical model which can account Language Processing (NLP) applications; (iii)Be helpful for for most of the linguistic phenomena in ILs and would also scholars interested in the linguistic study of the Hindi verbs. -
The Formation of Modern Hindi As Demonstrated in Early 'Hindi' Dictionaries
The formation of modern Hindi as demonstrated in early 'Hindi' dictionaries BY STUART MCGREGOR zooo GONDA LECTURE Stuart McGregor is Emeritus Reader in Hindi at the University of Cambridge. He has written many studies on Hindi and Hindi Literature, e.g. OutlineojHindi Grammar (Oxford 1972., 3rd revised and enlarged edition, Oxford 1995), Hindi Litcmt11roof the NilleleetJtband Ear& Twentieth Centuries (Wiesbaden, 1974) and The OxjordHindi-E11glisbDictioNary (ed.; Oxford 1993; electronic version, z.ooa,-.Com mittee on Institutional Co-operation, Champaign, Illinois) 2000 G01 OA I.ECTURU Eighth Gonda lecture, held on 23 November 2ooo on the premises of the Royal Netherlands Academy of Arts and Sciences The formation of modern Hindi as demonstrated in early 'Hindi' dictiOnaries BY STUART MCGREGOR ROYAL NETHERLANDS ACADEMY OF ARTS AND SCIENCES Amsterdam, zooi THE FORMATION OF MODERN HIND! AS DEMONSTRATED IN EARLY 'HIND!' DICTIONARIES Modern Hindi has been widely seen as a new language that came into existence at the beginning of the nineteenth century, the product of new political condi tions in north India: a language to be distinguished historically from forms of north Indian language current before it. Research on these languages provides various illustrations, however, of the extent to which modern Hindi shares fea tures of lexical repertoire as well as grammatical structure with them, and of the depth and strength of its roots in pre-existing language usage. It is my intention today to look at lexical and grammatical connections be tween older language and modernHindi, making use of evidence from early dic tionaries. In the course of working in Hindi lexicography over a considerable number of years I came to realise the great interest of the early dictionaries, not only as documenting the history of Hindi lexicography, but also for the in formation relevant to my present subject that they proved to contain. -
Swahili-English Dictionary, the First New Lexical Work for English Speakers
S W AHILI-E N GLISH DICTIONARY Charles W. Rechenbach Assisted by Angelica Wanjinu Gesuga Leslie R. Leinone Harold M. Onyango Josiah Florence G. Kuipers Bureau of Special Research in Modern Languages The Catholic University of Americ a Prei Washington. B. C. 20017 1967 INTRODUCTION The compilers of this Swahili-English dictionary, the first new lexical work for English speakers in many years, hope that they are offering to students and translators a more reliable and certainly a more up-to-date working tool than any previously available. They trust that it will prove to be of value to libraries, researchers, scholars, and governmental and commercial agencies alike, whose in- terests and concerns will benefit from a better understanding and closer communication with peoples of Africa. The Swahili language (Kisuiahili) is a Bantu language spoken by perhaps as many as forty mil- lion people throughout a large part of East and Central Africa. It is, however, a native or 'first' lan- guage only in a nnitp restricted area consisting of the islands of Zanzibar and Pemba and the oppo- site coast, roughly from Dar es Salaam to Mombasa, Outside this relatively small territory, elsewhere in Kenya, in Tanzania (formerly Tanganyika), Copyright © 1968 and to a lesser degree in Uganda, in the Republic of the Congo, and in other fringe regions hard to delimit, Swahili is a lingua franca of long standing, a 'second' (or 'third' or 'fourth') language enjoy- ing a reasonably well accepted status as a supra-tribal or supra-regional medium of communication. THE CATHOLIC UNIVERSITY OF AMERICA PRESS, INC. -
1- UNIVERSITY of IOWA Asian & Slavic Languages and Literatures
1- UNIVERSITY OF IOWA Asian & Slavic Languages and Literatures eHindi SOAS:2101 First-Year Hindi-Urdu: First Semester, 5 s.h. Reading, writing, speaking. Offered fall semesters of odd years. GE: Foreign Language (NEW GE: World Languages); First Level Proficiency. SOAS:2102 First-Year Hindi-Urdu: Second Semester, 5 s.h. Continuation of SOAS:2101. Offered spring semesters of even years. Prerequisites: SOAS:2101. Requirements: undergraduate standing. GE: Foreign Language (NEW GE: World Languages); Second Level Proficiency. SOAS:3101 Second-Year Hindi-Urdu: First Semester, 4 s.h. Conversation, reading of folktales and modern short stories. Offered fall semesters of even years. Prerequisites: SOAS:2102. Requirements: undergraduate standing. GE: Foreign Language (NEW GE: World Languages); Second Level Proficiency. SOAS:3102 Second-Year Hindi-Urdu: Second Semester, 4 s.h. Continuation of SOAS:3101. Offered spring semesters of odd years. Prerequisites: SOAS:3101. Requirements: undergraduate standing. GE: Foreign Language (NEW GE: World Languages); Fourth Level Proficiency. SOAS:4101 Third-Year Hindi-Urdu: First Semester, 3 s.h. Advanced level Hindi texts; speaking, writing. Offered fall semesters. Prerequisites: SOAS:3102. SOAS:4102 Third-Year Hindi-Urdu: Second Semester, 3 s.h. Continuation of SOAS:4101. Offered spring semesters. Prerequisite: SOAS:4101. 1 Asian & Slavic Languages and Literatures College of Liberal Arts and Sciences Chair Russell Ganim Locati 111A PH (Phillips Hall) on Phone 319-335-2151 Email [email protected] Websi http://clas.uiowa.edu/dwllc/asll/ te Gener al http://www.registrar.uiowa.edu/registrar/catalog/liberalartsandsciences/asianandslaviclangu Catalo agesandliteratures/ g 6 courses found, displaying all courses. Course # Title SOAS:2102:0001 Course Title is also known as(039:124:001) First-Year Hindi-Urdu: Second Semester Prerequisites: SOAS:2101 (039:123). -
Repor I Resumes
REPOR IRESUMES ED 015 469 48 AL 000 894 A BRIEF :1INDI REFERENCE GRAMMAR. PRELIMINARY VERSION. BY- GUMPERZ, JOHN J. MISRA, VIDYA NI WAS CALIFORNIA UNIV., BERKELEY REPORT NUMBER NDEA-VI-.215 PUB DATE. 63 CONTRACT OEC-SAE-8825 EDRS PRICEMF-$9.25 HC-$2.36 57P. DESCRIPTORS- *GRAMMAR, *HINDI, *REFERENCE MATERIALS, SECOND LANGUAGE LEARNING, DESCRIPTIVE LINGUISTICS, *STRUCTURAL ANALYSIS, DISTINCTIVE FEATURES, NOMINALS, ADJECTIVES, VERBS, FORM CLASSES (LANGUAGES), SENTENCE STRUCTURE, PHRASE STRUCTURE, PHONOLOGY, THIS BRIEF OUTLINE OF HINDI PHONOLOGY AND GRAMMAR IS INTENDED FOR FIRST AND SECOND YEAR STUDENTS OF HINDI WHO HAVE SOME PREVIOUS KNOWLEDGE OF THE ORAL AND WRITTEN LANGUAGE BUT WHO MAY HAVE HAD NO PREVIOUS TRAINING IN LINGUISTIC TERMINOLOGY. THE AUTHORS HAVE THEREFORE EMPHASIZED SIMPLICITY AND READABILITY RATHER THAN EXHAUSTIVENESS OR ORIGINALITY OF ANALYSIS. ALTHOUGH NOT A LANGUAGE TEXTBOOK, THIS GRAMMAR MAY BE USED TO SUPPLEMENT A CLASSROOM TEXT AS A REFERENCE GUIDE FOR INDIVIDUAL READING OR FOR GRAMMAR REVIEW. THE BRIEF INTRODUCTION TRACES THE HISTORY AND CURRENT USE OF HINDI-URDU IN MODERN INDIA. FOLLOWING CHAPTERS INCLUDE A DESCRIPTION OF THE PHONOLOGY, GENERAL SENTENCE STRUCTURE, PHRASES, FORM CLASSES, AND VERBS AND VERB CONSTRUCTIONS. A ROMAN TRANSLITERATION OF THE DEVANAGARI SCRIPT IS USED THROUGHOUT. (JD) f/4f/7 if-ultexuA.-,o, Contract SAE-8825 os LrN 1.4 A BRIEF HINDI REFERENCE GRAMMAR C) Wry veFsion7)-- by John J. Gumperz and Vidya Niwas Misra The University of California Berkeley 1963 The research reported herein was performed pursuent to a contract with the U. So Office of Education, Department of Health, Education, and Welfare under provisions of Section 602, Title VI, of the National Defense Education Act HEALTH, EDUCATION &WELFARE U.S.