Morphological Analyzer in the Development of Bilingual Dictionary (Kokborok-English) - an Analysis for Appropriate Method and Approach Partha Sarkar, Dr
Total Page:16
File Type:pdf, Size:1020Kb

Load more
Recommended publications
-
Dept of English Tripura University Faculty Profile Name
Dept of English Tripura University Faculty Profile Name: Kshetrimayum Premchandra (PhD) Designation: Assistant Professor Academic Qualification: BA (Hindu College, Delhi University), MA, PhD (Manipur University) Subjects Taught: Northeast (Indian) Literatures, Drama (Performance Theory), Folklore Mail: [email protected], [email protected] Landline/Intercom: +91 381237-9445 Fax: 03812374802 Mobile: +91 9436780234 1. List of publications as chapters, proceedings and articles in books and journals S. ISBN/ISSN No Title with page nos. Book/Journal No . Law of the Mother: The Mothers of Maya Spectrum, June, 2014, Vol. 2, ISSN 2319-6076 1 Diip, , pp. 12-18 Issue 1 Blood, Bullet, Bomb, and Voices in the Spectrum, June, 2015, Vol. 3, ISSN 2319-6076 2 The Sound of Khongji, pp. 11-18 Issue 1, The Hanndmaid’s Tale: The Curse of Spectrum, July- Vol. 3, Issue 2, ISSN 2319-6076 3 Being ‘Fertile’, pp. 19-27 Dec., 2015, Transcript, Bodoland 4 Place of Ougri in Meitei Lai Haraoba ISSN 2347-1743 University. Vol III, pp. 81-89 Of Pilgrims and Savages in the Heart of ISSN 2319-6076 5 Spectrum Darkness Journal of Literature and Incest and Gender Bias in the Tripuri Cultural Studies, Mizoram 6 ISSN 2348-1188 Folktale “Chethuang” University. Vol V, Issue II, Dec. 2018. Pp. 87-98. Nationalist Thought and the ISBN 978-93- “Death Journey: The Nigger of the Colonial World, 7 81142-92-9 ‘Narcissus’”, pp. 126-136. Ram Sharma et. al., New Delhi: Mangalam Publications Manipuri Dance and Culture: An Anthology. The Notions of Manipuri Identity, ISBN 978-81- 8 Ed. Adhikarimayum pp. 160-181. -
LCSH Section K
K., Rupert (Fictitious character) Motion of K stars in line of sight Ka-đai language USE Rupert (Fictitious character : Laporte) Radial velocity of K stars USE Kadai languages K-4 PRR 1361 (Steam locomotive) — Orbits Ka’do Herdé language USE 1361 K4 (Steam locomotive) UF Galactic orbits of K stars USE Herdé language K-9 (Fictitious character) (Not Subd Geog) K stars—Galactic orbits Ka’do Pévé language UF K-Nine (Fictitious character) BT Orbits USE Pévé language K9 (Fictitious character) — Radial velocity Ka Dwo (Asian people) K 37 (Military aircraft) USE K stars—Motion in line of sight USE Kadu (Asian people) USE Junkers K 37 (Military aircraft) — Spectra Ka-Ga-Nga script (May Subd Geog) K 98 k (Rifle) K Street (Sacramento, Calif.) UF Script, Ka-Ga-Nga USE Mauser K98k rifle This heading is not valid for use as a geographic BT Inscriptions, Malayan K.A.L. Flight 007 Incident, 1983 subdivision. Ka-houk (Wash.) USE Korean Air Lines Incident, 1983 BT Streets—California USE Ozette Lake (Wash.) K.A. Lind Honorary Award K-T boundary Ka Iwi National Scenic Shoreline (Hawaii) USE Moderna museets vänners skulpturpris USE Cretaceous-Paleogene boundary UF Ka Iwi Scenic Shoreline Park (Hawaii) K.A. Linds hederspris K-T Extinction Ka Iwi Shoreline (Hawaii) USE Moderna museets vänners skulpturpris USE Cretaceous-Paleogene Extinction BT National parks and reserves—Hawaii K-ABC (Intelligence test) K-T Mass Extinction Ka Iwi Scenic Shoreline Park (Hawaii) USE Kaufman Assessment Battery for Children USE Cretaceous-Paleogene Extinction USE Ka Iwi National Scenic Shoreline (Hawaii) K-B Bridge (Palau) K-TEA (Achievement test) Ka Iwi Shoreline (Hawaii) USE Koro-Babeldaod Bridge (Palau) USE Kaufman Test of Educational Achievement USE Ka Iwi National Scenic Shoreline (Hawaii) K-BIT (Intelligence test) K-theory Ka-ju-ken-bo USE Kaufman Brief Intelligence Test [QA612.33] USE Kajukenbo K. -
Morphological Analyzer for Kokborok
Morphological Analyzer for Kokborok Khumbar Debbarma1 Braja Gopal Patra2 Dipankar Das3 Sivaji Bandyopadhyay2 (1) TRIPURA INSTITUTE OF TECHNOLOGY, Agartala, India (2) JADAVPUR UNIVERSITY, Kolkata, India (3) NATIONAL INSTITUTE_OF TECHNOLOGY, Meghalaya, India [email protected], [email protected], [email protected],[email protected] ABSTRACT Morphological analysis is concerned with retrieving the syntactic and morphological properties or the meaning of a morphologically complex word. Morphological analysis retrieves the grammatical features and properties of an inflected word. However, this paper introduces the design and implementation of a Morphological Analyzer for Kokborok, a resource constrained and less computerized Indian language. A database driven affix stripping algorithm has been used to design the Morphological Analyzer. It analyzes the Kokborok word forms and produces several grammatical information associated with the words. The Morphological Analyzer for Kokborok has been tested on 56732 Kokborok words; thereby an accuracy of 80% has been obtained on a manual check. KEYWORDS : Morphology Analyzer, Kokborok, Dictionary, Stemmer, Prefix, Suffix. Proceedings of the 3rd Workshop on South and Southeast Asian Natural Language Processing (SANLP), pages 41–52, COLING 2012, Mumbai, December 2012. 41 1 Introduction Kokborok is the native language of Tripura and is also spoken in the neighboring states like Assam, Manipur, Mizoram as well as the countries like Bangladesh, Myanmar etc., comprising of more than 2.5 millions1 of people. Kokborok belongs to the Tibeto-Burman (TB) language falling under the Sino language family of East Asia and South East Asia2. Kokborok shares the genetic features of TB languages that include phonemic tone, widespread stem homophony, subject- object-verb (SOV) word order, agglutinative verb morphology, verb derivational suffixes originating from the semantic bleaching of verbs, duplication or elaboration. -
Waromung an Ao Naga Village, Monograph Series, Part VI, Vol-I
@ MONOGRAPH CENSUS OF INDIA 1961 No. I VOLUME-I MONOGRAPH SERIES Part VI In vestigation Alemchiba Ao and Draft Research design, B. K. Roy Burman Supervision and Editing Foreword Asok Mitra Registrar General, InOla OFFICE OF THE REGISTRAR GENERAL, INDIA WAROMUNG MINISTRY OF HOME AFFAIRS (an Ao Naga Village) NEW DELHI-ll Photographs -N. Alemchiba Ao K. C. Sharma Technical advice in describing the illustrations -Ruth Reeves Technical advice in mapping -Po Lal Maps and drawings including cover page -T. Keshava Rao S. Krishna pillai . Typing -B. N. Kapoor Tabulation -C. G. Jadhav Ganesh Dass S. C. Saxena S. P. Thukral Sudesh Chander K. K. Chawla J. K. Mongia Index & Final Checking -Ram Gopal Assistance to editor in arranging materials -T. Kapoor (Helped by Ram Gopal) Proof Reading - R. L. Gupta (Final Scrutiny) P. K. Sharma Didar Singh Dharam Pal D. C. Verma CONTENTS Pages Acknow ledgement IX Foreword XI Preface XIII-XIV Prelude XV-XVII I Introduction ... 1-11 II The People .. 12-43 III Economic Life ... .. e • 44-82 IV Social and Cultural Life •• 83-101 V Conclusion •• 102-103 Appendices .. 105-201 Index .... ... 203-210 Bibliography 211 LIST OF MAPS After Page Notional map of Mokokchung district showing location of the village under survey and other places that occur in the Report XVI 2 Notional map of Waromung showing Land-use-1963 2 3 Notional map of Waromung showing nature of slope 2 4 (a) Notional map of Waromung showing area under vegetation 2 4 (b) Notional map of Waromung showing distribution of vegetation type 2 5 (a) Outline of the residential area SO years ago 4 5 (b) Important public places and the residential pattern of Waromung 6 6 A field (Jhurn) Showing the distribution of crops 58 liST OF PLATES After Page I The war drum 4 2 The main road inside the village 6 3 The village Church 8 4 The Lower Primary School building . -
AO and PROTO-TIBETO-BURMAN – the RIMES* Daniel Bruhn University of California, Berkeley
Format adapted from the LTBA stylesheet UCB Linguistics Qualifying Paper #2 Revised: 27 October 2010 UNEARTHING THE ROOTS: AO AND PROTO-TIBETO-BURMAN – THE RIMES* Daniel Bruhn University of California, Berkeley Keywords: Tibeto-Burman, historical, linguistics, Naga, Mongsen, Chungli, Ao, reflexes, roots, cognate sets, sound correspondences, sound change Table of Contents 1. INTRODUCTION ......................................................................................................... 2 1.1. Purpose ................................................................................................................. 2 1.2. Background ........................................................................................................... 3 1.3. Phonology ............................................................................................................. 4 1.3.1. Chungli ........................................................................................................... 4 1.3.2. Mongsen ......................................................................................................... 7 1.4. Methodology ......................................................................................................... 9 2. RIMES ....................................................................................................................... 11 2.1. *-a- ..................................................................................................................... 11 2.1.1. *-a > Chungli -a/-u/-i, Mongsen -a ........................................................... -
Dimasa Kachari of Assam
ETHNOGRAPHIC STUDY NO·7II , I \ I , CENSUS OF INDIA 1961 VOLUME I MONOGRAPH SERIES PART V-B DIMASA KACHARI OF ASSAM , I' Investigation and Draft : Dr. p. D. Sharma Guidance : A. M. Kurup Editing : Dr. B. K. Roy Burman Deputy Registrar General, India OFFICE OF THE REGISTRAR GENERAL, INDIA MINISTRY OF HOME AFFAIRS NEW DELHI CONTENTS FOREWORD v PREFACE vii-viii I. Origin and History 1-3 II. Distribution and Population Trend 4 III. Physical Characteristics 5-6 IV. Family, Clan, Kinship and Other Analogous Divisions 7-8 V. Dwelling, Dress, Food, Ornaments and Other Material Objects distinctive qfthe Community 9-II VI. Environmental Sanitation, Hygienic Habits, Disease and Treatment 1~ VII. Language and Literacy 13 VIII. Economic Life 14-16 IX. Life Cycle 17-20 X. Religion . • 21-22 XI. Leisure, Recreation and Child Play 23 XII. Relation among different segments of the community 24 XIII. Inter-Community Relationship . 2S XIV Structure of Soci141 Control. Prestige and Leadership " 26 XV. Social Reform and Welfare 27 Bibliography 28 Appendix 29-30 Annexure 31-34 FOREWORD : fhe Constitution lays down that "the State shall promote with special care the- educational and economic hterest of the weaker sections of the people and in particular of the Scheduled Castes and Scheduled Tribes and shall protect them from social injustice and all forms of exploitation". To assist States in fulfilling their responsibility in this regard, the 1961 Census provided a series of special tabulations of the social and economic data on Scheduled Castes and Scheduled Tribes. The lists of Scheduled Castes and Scheduled Tribes are notified by the President under the Constitution and the Parliament is empowered to include in or exclude from the lists, any caste or tribe. -
The Naga Language Groups Within the Tibeto-Burman Language Family
TheNaga Language Groups within the Tibeto-Burman Language Family George van Driem The Nagas speak languages of the Tibeto-Burman fami Ethnically, many Tibeto-Burman tribes of the northeast ly. Yet, according to our present state of knowledge, the have been called Naga in the past or have been labelled as >Naga languages< do not constitute a single genetic sub >Naga< in scholarly literature who are no longer usually group within Tibeto-Burman. What defines the Nagas best covered by the modern more restricted sense of the term is perhaps just the label Naga, which was once applied in today. Linguistically, even today's >Naga languages< do discriminately by Indo-Aryan colonists to all scantily clad not represent a single coherent branch of the family, but tribes speaking Tibeto-Burman languages in the northeast constitute several distinct branches of Tibeto-Burman. of the Subcontinent. At any rate, the name Naga, ultimately This essay aims (1) to give an idea of the linguistic position derived from Sanskrit nagna >naked<, originated as a titu of these languages within the family to which they belong, lar label, because the term denoted a sect of Shaivite sadhus (2) to provide a relatively comprehensive list of names and whose most salient trait to the eyes of the lay observer was localities as a directory for consultation by scholars and in that they went through life unclad. The Tibeto-Burman terested laymen who wish to make their way through the tribes labelled N aga in the northeast, though scantily clad, jungle of names and alternative appellations that confront were of course not Hindu at all. -
THE LANGUAGES of MANIPUR: a CASE STUDY of the KUKI-CHIN LANGUAGES* Pauthang Haokip Department of Linguistics, Assam University, Silchar
Linguistics of the Tibeto-Burman Area Volume 34.1 — April 2011 THE LANGUAGES OF MANIPUR: A CASE STUDY OF THE KUKI-CHIN LANGUAGES* Pauthang Haokip Department of Linguistics, Assam University, Silchar Abstract: Manipur is primarily the home of various speakers of Tibeto-Burman languages. Aside from the Tibeto-Burman speakers, there are substantial numbers of Indo-Aryan and Dravidian speakers in different parts of the state who have come here either as traders or as workers. Keeping in view the lack of proper information on the languages of Manipur, this paper presents a brief outline of the languages spoken in the state of Manipur in general and Kuki-Chin languages in particular. The social relationships which different linguistic groups enter into with one another are often political in nature and are seldom based on genetic relationship. Thus, Manipur presents an intriguing area of research in that a researcher can end up making wrong conclusions about the relationships among the various linguistic groups, unless one thoroughly understands which groups of languages are genetically related and distinct from other social or political groupings. To dispel such misconstrued notions which can at times mislead researchers in the study of the languages, this paper provides an insight into the factors linguists must take into consideration before working in Manipur. The data on Kuki-Chin languages are primarily based on my own information as a resident of Churachandpur district, which is further supported by field work conducted in Churachandpur district during the period of 2003-2005 while I was working for the Central Institute of Indian Languages, Mysore, as a research investigator. -
Kh. Dhiren Singha Mob.: 09435173346 Associate Professor [email protected] Department of Linguistics, Dhiren [email protected] Assam University, Silchar
Kh. Dhiren Singha Mob.: 09435173346 Associate Professor [email protected] Department of Linguistics, [email protected] Assam University, Silchar AREAS OF SPECIALIZATION: Phonology, Morpho-syntax, Language Typology and Tibeto-Burman Linguistics. MAJOR PROJECTS Presently working as a Principal Investigator (PI) for the project on ‘MEYOR LANGUAGE’of Arunachal Pradesh under the SCHEME FOR PROTECTION AND PRESERVATION OF ENDANGERED LANGUAGESof Central Institute of Indian Languages (CIIL), Mysore, from October, 2013 onwards. Submitted project report in the topic:A PEDAGOGICAL GRAMMAR OF DIMASA under the project DEVELOPMENT OF DIMASA LANGUAGE IN ASSAM of Central Institute of Indian Languages, Mysore in 2007. PROFESSIONAL ACTIVITIES Editorial Service: Journal Editor (co-editor) INDIAN LANGUAGE REVIEW (International Journal of Linguistics), 2014, July onwards. Member of Editorial Board of the JOURNAL OF ASSAM UNIVERSITY (Humanities and Social Science) 2008. Conferences/workshops organized: 1. Organised TRAINING PROGRAMME CUM WORKSHOP ON LANGUAGE TEACHING, TESTING AND EVALUATIONfor the college teachers from 23rd to 26th February 2011 in the Premises of the Department of Linguistics, Assam University Silchar in collaboration with Central Institute of Indian Languages, Ministry of HRD, Mysore, as a co- ordinator (with Ganesh Baskaran). 2. Organised NATIONAL WORKSHOP ON TESTING AND EVALUATION OF MOTHER TONGUE EDUCATION (HIGH SCHOOL LEVEL) IN NORTHEAST INDIA WITH SPECIAL REFERENCE TO ASSAM from 1stto 5th December, 2011 in the Premises of the Department of Linguistics, Assam University Silchar in collaboration with Central Institute of Indian Languages, Ministry of HRD, Mysore, as a co-ordinator (with Ganesh Baskaran). PROFESSIONAL ORGANIZATIONS I. Life member of Dravidian Linguistics Association, St. Xavier’s College, P. O. -
Tone Systems of Dimasa and Rabha: a Phonetic and Phonological Study
TONE SYSTEMS OF DIMASA AND RABHA: A PHONETIC AND PHONOLOGICAL STUDY By PRIYANKOO SARMAH A DISSERTATION PRESENTED TO THE GRADUATE SCHOOL OF THE UNIVERSITY OF FLORIDA IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF DOCTOR OF PHILOSOPHY UNIVERSITY OF FLORIDA 2009 1 © 2009 Priyankoo Sarmah 2 To my parents and friends 3 ACKNOWLEDGMENTS The hardships and challenges encountered while writing this dissertation and while being in the PhD program are no way unlike anything experienced by other Ph.D. earners. However, what matters at the end of the day is the set of people who made things easier for me in the four years of my life as a Ph.D. student. My sincere gratitude goes to my advisor, Dr. Caroline Wiltshire, without whom I would not have even dreamt of going to another grad school to do a Ph.D. She has been a great mentor to me. Working with her for the dissertation and for several projects broadened my intellectual horizon and all the drawbacks in me and my research are purely due my own markedness constraint, *INTELLECTUAL. I am grateful to my co-chair, Dr. Ratree Wayland. Her knowledge and sharpness made me see phonetics with a new perspective. Not much unlike the immortal Sherlock Holmes I could often hear her echo: One's ideas must be as broad as Nature if they are to interpret Nature. I am indebted to my committee member Dr. Andrea Pham for the time she spent closely reading my dissertation draft and then meticulously commenting on it. Another committee member, Dr. -
Non-Structural Case Marking in Tibeto-Burman and Artificial Languages*
Non-structural Case Marking in Tibeto-Burman and Artificial Languages* Alexander R. Coupe Sander Lestrade Nanyang Technological University Radboud University This paper discusses the diachronic development of non-structural case marking in Tibeto- Burman and in computer simulations of language evolution. It is shown how case marking is initially motivated by the pragmatic need to disambiguate between two core arguments, but eventually may develop flagging functions that even extend into the intransitive domain. Also, it is shown how different types of case-marking may emerge from various underlying disambiguation strategies. 1. Introduction This paper was motivated by the authors’ serendipitous discovery that a computer simulation of the emergence of case marking produced results that are uncannily similar to attested grammaticalization pathways and patterns of core case marking observed in the grammars of many Tibeto-Burman (henceforth TB) languages with pragmatically-motivated case marking. When it manifests, core case marking in both the natural and artificial languages appears to be initially motivated by the same fundamental need to disambiguate semantic roles. In other words, whenever some meaning does not follow inherently from the semantics of the dependency relationship holding between a head and its arguments, it must be encoded explicitly in the grammars of both the simulated language and the natural languages. Disambiguating case marking is most likely to appear in clauses with two core arguments when either one may potentially fulfill the role of the agent. Under these circumstances, some grammatical means of distinguishing an A argument from an O argument may be required to clarify meaning. 1 Conversely, if semantic roles can be unambiguously assigned to core arguments because of the semantic nature of their referents, then there may be no overt marking, even in bivalent clauses, so the use of core case marking is observed to have a pragmatic basis. -
Sino-Tibetan Numeral Systems: Prefixes, Protoforms and Problems
Sino-Tibetan numeral systems: prefixes, protoforms and problems Matisoff, J.A. Sino-Tibetan Numeral Systems: Prefixes, Protoforms and Problems. B-114, xii + 147 pages. Pacific Linguistics, The Australian National University, 1997. DOI:10.15144/PL-B114.cover ©1997 Pacific Linguistics and/or the author(s). Online edition licensed 2015 CC BY-SA 4.0, with permission of PL. A sealang.net/CRCL initiative. PACIFIC LINGUISTICS FOUNDING EDITOR: Stephen A. Wunn EDITORIAL BOARD: Malcolm D. Ross and Darrell T. Tryon (Managing Editors), Thomas E. Dutton, Nikolaus P. Himmelmann, Andrew K. Pawley Pacific Linguistics is a publisher specialising in linguistic descriptions, dictionaries, atlases and other material on languages of the Pacific, the Philippines, Indonesia and southeast Asia. The authors and editors of Pacific Linguistics publications are drawn from a wide range of institutions around the world. Pacific Linguistics is associated with the Research School of Pacific and Asian Studies at the Australian National University. Pacific Linguistics was established in 1963 through an initial grant from the Hunter Douglas Fund. It is a non-profit-making body financed largely from the sales of its books to libraries and individuals throughout the world, with some assistance from the School. The Editorial Board of Pacific Linguistics is made up of the academic staff of the School's Department of Linguistics. The Board also appoints a body of editorial advisors drawn from the international community of linguists. Publications in Series A, B and C and textbooks in Series D are refereed by scholars with re levant expertise who are normally not members of the editorial board.