HISTORICAL SYNTAX the Syntax of Sanskrit Compounds John J
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Studies on Language Change. Working Papers in Linguistics No. 34
DOCUMENT RESUME ED 286 382 FL 016 932 AUTHOR Joseph, Brian D., Ed. TITLE Studies on Language Change. Working Papers in Linguistics No. 34. INSTITUTION Ohio State Univ., Columbus. Dept. of Linguistics. PUB DATE Dec 86 NOTE 171p. PUB TYPE Reports - Evaluative/Feasibility (142) -- Collected Works - General (020) EDRS PRICE MF01/PC07 Plus Postage. DESCRIPTORS Arabic; Diachronic Linguistics; Dialects; *Diglossia; English; Estonian; *Etymology; Finnish; Foreign Countries; Language Variation; Linguistic Borrowing; *Linguistic Theory; *Morphemes; *Morphology (Languages); Old English; Sanskrit; Sociolinguistics; Syntax; *Uncommonly Taught Languages; Word Frequency IDENTIFIERS Saame ABSTRACT A collection of papers relevant to historical linguistics and description and explanation of language change includes: "Decliticization and Deaffixation in Saame: Abessive 'taga'" (Joel A. Nevis); "Decliticization in Old Estonian" (Joel A. Nevis); "On Automatic and Simultaneous Syntactic Changes" (Brian D. Joseph); "Loss of Nominal Case Endings in the Modern Arabic Sedentary Dialects" (Ann M. Miller); "One Rule or Many? Sanskrit Reduplication as Fragmented Affixation" (Richard D. Janda, Brian D. Joseph); "Fragmentation of Strong Verb Ablaut in Old English" (Keith Johnson); "The Etymology of 'bum': Mere Child's Play" (Mary E. Clark, Brian D. Joseph); "Small Group Lexical Innovation: Some Examples" (Christopher Kupec); "Word Frequency and Dialect Borrowing" (Debra A. Stollenwerk); "Introspection into a Stable Case of Variation in Finnish" (Riitta Valimaa-Blum); -
A Way with Words: Recent Advances in Lexical Theory and Analysis: a Festschrift for Patrick Hanks
A Way with Words: Recent Advances in Lexical Theory and Analysis: A Festschrift for Patrick Hanks Gilles-Maurice de Schryver (editor) (Ghent University and University of the Western Cape) Kampala: Menha Publishers, 2010, vii+375 pp; ISBN 978-9970-10-101-6, e59.95 Reviewed by Paul Cook University of Toronto In his introduction to this collection of articles dedicated to Patrick Hanks, de Schryver presents a quote from Atkins referring to Hanks as “the ideal lexicographer’s lexicogra- pher.” Indeed, Hanks has had a formidable career in lexicography, including playing an editorial role in the production of four major English dictionaries. But Hanks’s achievements reach far beyond lexicography; in particular, Hanks has made numerous contributions to computational linguistics. Hanks is co-author of the tremendously influential paper “Word association norms, mutual information, and lexicography” (Church and Hanks 1989) and maintains close ties to our field. Furthermore, Hanks has advanced the understanding of the relationship between words and their meanings in text through his theory of norms and exploitations (Hanks forthcoming). The range of Hanks’s interests is reflected in the authors and topics of the articles in this Festschrift; this review concentrates more on those articles that are likely to be of most interest to computational linguists, and does not discuss some articles that appeal primarily to a lexicographical audience. Following the introduction to Hanks and his career by de Schryver, the collection is split into three parts: Theoretical Aspects and Background, Computing Lexical Relations, and Lexical Analysis and Dictionary Writing. 1. Part I: Theoretical Aspects and Background Part I begins with an unfinished article by the late John Sinclair, in which he begins to put forward an argument that multi-word units of meaning should be given the same status in dictionaries as ordinary headwords. -
Finetuning Pre-Trained Language Models for Sentiment Classification of COVID19 Tweets
Technological University Dublin ARROW@TU Dublin Dissertations School of Computer Sciences 2020 Finetuning Pre-Trained Language Models for Sentiment Classification of COVID19 Tweets Arjun Dussa Technological University Dublin Follow this and additional works at: https://arrow.tudublin.ie/scschcomdis Part of the Computer Engineering Commons Recommended Citation Dussa, A. (2020) Finetuning Pre-trained language models for sentiment classification of COVID19 tweets,Dissertation, Technological University Dublin. doi:10.21427/fhx8-vk25 This Dissertation is brought to you for free and open access by the School of Computer Sciences at ARROW@TU Dublin. It has been accepted for inclusion in Dissertations by an authorized administrator of ARROW@TU Dublin. For more information, please contact [email protected], [email protected]. This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 4.0 License Finetuning Pre-trained language models for sentiment classification of COVID19 tweets Arjun Dussa A dissertation submitted in partial fulfilment of the requirements of Technological University Dublin for the degree of M.Sc. in Computer Science (Data Analytics) September 2020 Declaration I certify that this dissertation which I now submit for examination for the award of MSc in Computing (Data Analytics), is entirely my own work and has not been taken from the work of others save and to the extent that such work has been cited and acknowledged within the test of my work. This dissertation was prepared according to the regulations for postgraduate study of the Technological University Dublin and has not been submitted in whole or part for an award in any other Institute or University. -
AN EXPERIENCE of TRANSLATING NEPALI GRAMMAR INTO ENGLISH Govinda Raj Bhattarai and Bal Ram Adhikari [email protected]; [email protected]
AN EXPERIENCE OF TRANSLATING NEPALI GRAMMAR INTO ENGLISH Govinda Raj Bhattarai and Bal Ram Adhikari [email protected]; [email protected] 1. Introduction Translating a grammar from one language to another sounds impossible because two languages differ basically in terms of grammar which stands for different components from sound to discourse units. Despite this fact, the world has translated grammar between languages, as it is the most authentic door to enter into another language, its structure, form and function. Greek grammars were translated into Latin and it is from Latin that many western grammars have been reshaped, some in translation, some in adaptation and others in considerable degree of borrowing as well. The grammars of all young languages undergo such a process in course of evolution. This applies to Nepali as well. Pande's Chandrika Gorkha Bhasa Vyakaran (1912) and Sharma's Madhya Chandrika (1919) (quoted in Schmidt and Dahal 1993: x) are some seminal works in Nepali grammar. These works in their attempt to codify and describe the Nepali language adopted the model of Sanskrit grammar. The modern Nepali grammar has evolved through a long and exacting process. Whitney's Sanskrit Grammar (1879) is a best example of translation of classical grammar into contemporary languages. There have been some efforts made in Nepali too. Some works like Ayton (1820), Turnbull (1888) Kilgour and Duncan (1922), Turner (1931), Clark (1963) and Matthews (1984) border between translation and restructuring. Many other foreign scholars have contributed towards writing Nepali grammar till date. They have laid the foundation stone of Nepali grammar in English. -
Why Are Sanskrit Play Titles Strange?
STEPHAN HILLYER LEVITT WHY ARE SANSKRIT PLAY TITLES STRANGE? 1. Introduction Many Sanskrit play titles generally have presented problems to translators and lexicographers. On the one hand, we have such titles as Bhavabhæti’s Målatœmådhava, which is taken to refer jointly to the play’s hero Mådhava and the play’s heroine Målatœ, and which is translated, “Målatœ and Mådhava”. The translation appears to be supported by Viƒvanåtha Kaviråja’s treatise on dramaturgy, the Såhityadarpa∫a, in Såhityadarpa∫a 6.142-143 1. Or we have a title such as Kålidåsa’s Målavikågnimitra, which is also standardly taken to refer to the play’s hero Agnimitra and its heroine Målavikå. This is translated, “Målavikå and Agnimitra”. Alternately, we have such titles as Bha™™a Nåråya∫a’s Ve∫œsaμhåra, which is understood to be a Sanskrit compound mean- ing, “The Binding (saμhåra) of the Braid of Hair (ve∫œ)”. It refers to an incident in the Mahåbhårata in which Draupadœ is humiliated and vows never to braid her hair again until her humiliation has been avenged. Or we have Bhavabhæti’s Uttararåmacarita, which is under- 1. The Mirror of Composition, A Treatise on Poetical Composition, Being an English Translation of the Såhitya-Darpa∫a of Viƒwanåtha Kaviråja, transl. by J. R. Ballantyne and P. D. Mitra, Calcutta, 1875, p. 225, nos. 427-429; Såhityadarpa∫a of Viƒvanåtha Kaviråja, ed. by D. Dviveda, 1922, rpt. New Delhi, 1982, p. 330. 196 Stephan Hillyer Levitt stood to be a Sanskrit compound meaning, “The Later (uttara) Deeds (carita) of Råma (råma)”. This play is based on the last book of the Råmåya∫a and deals with events that occur after Råma returns to Ayodhyå as king. -
Lexical Analysis
From words to numbers Parsing, tokenization, extract information Natural Language Processing Piotr Fulmański Lecture goals • Tokenizing your text into words and n-grams (tokens) • Compressing your token vocabulary with stemming and lemmatization • Building a vector representation of a statement Natural Language Processing in Action by Hobson Lane Cole Howard Hannes Max Hapke Manning Publications, 2019 Terminology Terminology • A phoneme is a unit of sound that distinguishes one word from another in a particular language. • In linguistics, a word of a spoken language can be defined as the smallest sequence of phonemes that can be uttered in isolation with objective or practical meaning. • The concept of "word" is usually distinguished from that of a morpheme. • Every word is composed of one or more morphemes. • A morphem is the smallest meaningful unit in a language even if it will not stand on its own. A morpheme is not necessarily the same as a word. The main difference between a morpheme and a word is that a morpheme sometimes does not stand alone, but a word, by definition, always stands alone. Terminology SEGMENTATION • Text segmentation is the process of dividing written text into meaningful units, such as words, sentences, or topics. • Word segmentation is the problem of dividing a string of written language into its component words. Text segmentation, retrieved 2020-10-20, https://en.wikipedia.org/wiki/Text_segmentation Terminology SEGMENTATION IS NOT SO EASY • Trying to resolve the question of what a word is and how to divide up text into words we can face many "exceptions": • Is “ice cream” one word or two to you? Don’t both words have entries in your mental dictionary that are separate from the compound word “ice cream”? • What about the contraction “don’t”? Should that string of characters be split into one or two “packets of meaning?” • The single statement “Don’t!” means “Don’t you do that!” or “You, do not do that!” That’s three hidden packets of meaning for a total of five tokens you’d like your machine to know about. -
Olga Tribulato Ancient Greek Verb-Initial Compounds
Olga Tribulato Ancient Greek Verb-Initial Compounds Olga Tribulato - 9783110415827 Downloaded from PubFactory at 08/03/2016 10:10:53AM via De Gruyter / TCS Olga Tribulato - 9783110415827 Downloaded from PubFactory at 08/03/2016 10:10:53AM via De Gruyter / TCS Olga Tribulato Ancient Greek Verb-Initial Compounds Their Diachronic Development Within the Greek Compound System Olga Tribulato - 9783110415827 Downloaded from PubFactory at 08/03/2016 10:10:53AM via De Gruyter / TCS ISBN 978-3-11-041576-6 e-ISBN (PDF) 978-3-11-041582-7 e-ISBN (EPUB) 978-3-11-041586-5 Library of Congress Cataloging-in-Publication Data A CIP catalog record for this book has been applied for at the Library of Congress. Bibliografische Information der Deutschen Nationalbibliothek The Deutsche Nationalbibliothek lists this publication in the Deutsche Nationalbibliographie; detailed bibliographic data are available in the Internet at http://dnb.dnb.de. © 2015 Walter de Gruyter GmbH, Berlin/Boston Umschlagabbildung: Paul Klee: Einst dem Grau der Nacht enttaucht …, 1918, 17. Aquarell, Feder und Bleistit auf Papier auf Karton. 22,6 x 15,8 cm. Zentrum Paul Klee, Bern. Typesetting: Dr. Rainer Ostermann, München Printing: CPI books GmbH, Leck ♾ Printed on acid free paper Printed in Germany www.degruyter.com Olga Tribulato - 9783110415827 Downloaded from PubFactory at 08/03/2016 10:10:53AM via De Gruyter / TCS This book is for Arturo, who has waited so long. Olga Tribulato - 9783110415827 Downloaded from PubFactory at 08/03/2016 10:10:53AM via De Gruyter / TCS Olga Tribulato - 9783110415827 Downloaded from PubFactory at 08/03/2016 10:10:53AM via De Gruyter / TCS Preface and Acknowledgements Preface and Acknowledgements I have always been ὀψιανθής, a ‘late-bloomer’, and this book is a testament to it. -
Loukota 2019
UNIVERSITY OF CALIFORNIA Los Angeles The Goods that Cannot Be Stolen: Mercantile Faith in Kumāralāta’s Garland of Examples Adorned by Poetic Fancy A dissertation submitted in partial satisfaction of the requirements for the degree Doctor of Philosophy in Asian Languages and Cultures by Diego Loukota Sanclemente 2019 © Copyright by Diego Loukota Sanclemente 2019 ABSTRACT OF THE DISSERTATION The Goods that Cannot Be Stolen: Mercantile Faith in Kumāralāta’s Garland of Examples Adorned by Poetic Fancy by Diego Loukota Sanclemente Doctor of Philosophy in Asian Languages and Cultures University of California, Los Angeles, 2019 Professor Gregory Schopen, Co-chair Professor Stephanie J. Watkins, Co-chair This dissertation examines the affinity between the urban mercantile classes of ancient India and contemporary Buddhist faith through an examination of the narrative collection Kalpanāmaṇḍitikā Dṛṣṭāntapaṅkti (“Garland of Examples,” henceforth Kumāralāta’s Garland) by the 3rd Century CE Gandhāran monk Kumāralāta. The collection features realistic narratives that portray the religious sensibility of those social classes. I contend that as Kumāralāta’s 3rd Century was one of crisis for cities and for trade in the Indian world, his work reflects an urgent statement of the core values of ii Buddhist urban businesspeople. Kumāralāta’s stories emphasize both religious piety and the pursuit of wealth, a concern for social respectability, a strong work ethic, and an emphasis on rational decision-making. These values inform Kumāralāta’s religious vision of poverty and wealth. His vision of religious giving conjugates economic behavior and religious doctrine, and the outcome is a model that confers religious legitimation to the pursuit of wealth but also an economic outlet for religious fervor and a solid financial basis for the monastic establishment, depicted by Kumāralāta in close interdependence with the laity and, most importantly, within the same social class. -
To Be Read by Dr Malhar Kulkarni, IIT Bombay)
Friday, 10 t h Dec, 2010 Venue: Auditorium, School of Arts & Aesthetics, JNU Saturda, January 9, 2010 0830 - 0930: Breakfast & Registration 0930 - 1130: Session 1: Inaugural session • Dr Girish Nath Jha , Special Center for Sanskrit Studies, JNU , Welcome address, Seminar theme • Prof Bal Ram Singh , University of Massachusetts Dartmouth, USA, Scientifying the Sanskrit vs. Sanskritizing the Science(inaugural address) • Prof Ramanath Sharma , University of Hawaii, USA, Rule interaction, blocking and derivation in Pāini (Keynote speech) • Prof V. K. Jain , Registrar, JNU, Presidential address and release of seminar proceedings • Chairperson , Special Center for Sanskrit Studies, JNU, Vote of Thanks Observer: Dr Maureen P. Hall , University of Massachusetts. Dartmouth, USA 1130-1200: Tea 1200 - 1300 : Session 2: Plenary Talk 1: Chair: Prof Bal Ram Singh , University of Massachusetts, Dartmouth, USA Speaker: Prof Saroja Bhate , Bhandarkar Oriental Research Institute (BORI) Topic: Anubandha s of P āini and Exegetics of Sanskrit grammar (to be read by Dr Malhar Kulkarni, IIT Bombay) Coordinator: Dr. Girish Nath Jha , JNU Observer: Manji Bhadra , research student, JNU 1300-1400 : Lunch 1400-1600 : Session 3: Lexical Resources Chair: Manoj Jain , Scientist “E”, TDIL, MCIT • Abhinandan S P and Shrisha Rao , I.I.I.T. Bangalore o Citation Matching in Sanskrit Corpora Using Local Alignment • Diwakar Mani , Jawaharlal Nehru University o RDBMS based Lexical Resource for Indian Heritage: the case of Mah ābh ārata • Sivaja S. Nair and Amba Kulkarni , University of Hyderabad o The Knowledge Structure in Amarako śa • Malhar Kulkarni, Irawati Kulkarni, Chaitali Dangarikar and Pushpak Bhattacharyya , I.I.T. Bombay o Gloss in Sanskrit Wordnet Coordinator: Dr. -
Lexical Sense Labeling and Sentiment Potential Analysis Using Corpus-Based Dependency Graph
mathematics Article Lexical Sense Labeling and Sentiment Potential Analysis Using Corpus-Based Dependency Graph Tajana Ban Kirigin 1,* , Sanda Bujaˇci´cBabi´c 1 and Benedikt Perak 2 1 Department of Mathematics, University of Rijeka, R. Matejˇci´c2, 51000 Rijeka, Croatia; [email protected] 2 Faculty of Humanities and Social Sciences, University of Rijeka, SveuˇcilišnaAvenija 4, 51000 Rijeka, Croatia; [email protected] * Correspondence: [email protected] Abstract: This paper describes a graph method for labeling word senses and identifying lexical sentiment potential by integrating the corpus-based syntactic-semantic dependency graph layer, lexical semantic and sentiment dictionaries. The method, implemented as ConGraCNet application on different languages and corpora, projects a semantic function onto a particular syntactical de- pendency layer and constructs a seed lexeme graph with collocates of high conceptual similarity. The seed lexeme graph is clustered into subgraphs that reveal the polysemous semantic nature of a lexeme in a corpus. The construction of the WordNet hypernym graph provides a set of synset labels that generalize the senses for each lexical cluster. By integrating sentiment dictionaries, we introduce graph propagation methods for sentiment analysis. Original dictionary sentiment values are integrated into ConGraCNet lexical graph to compute sentiment values of node lexemes and lexical clusters, and identify the sentiment potential of lexemes with respect to a corpus. The method can be used to resolve sparseness of sentiment dictionaries and enrich the sentiment evaluation of Citation: Ban Kirigin, T.; lexical structures in sentiment dictionaries by revealing the relative sentiment potential of polysemous Bujaˇci´cBabi´c,S.; Perak, B. Lexical Sense Labeling and Sentiment lexemes with respect to a specific corpus. -
Indo-European Linguistics: an Introduction Indo-European Linguistics an Introduction
This page intentionally left blank Indo-European Linguistics The Indo-European language family comprises several hun- dred languages and dialects, including most of those spoken in Europe, and south, south-west and central Asia. Spoken by an estimated 3 billion people, it has the largest number of native speakers in the world today. This textbook provides an accessible introduction to the study of the Indo-European proto-language. It clearly sets out the methods for relating the languages to one another, presents an engaging discussion of the current debates and controversies concerning their clas- sification, and offers sample problems and suggestions for how to solve them. Complete with a comprehensive glossary, almost 100 tables in which language data and examples are clearly laid out, suggestions for further reading, discussion points and a range of exercises, this text will be an essential toolkit for all those studying historical linguistics, language typology and the Indo-European proto-language for the first time. james clackson is Senior Lecturer in the Faculty of Classics, University of Cambridge, and is Fellow and Direc- tor of Studies, Jesus College, University of Cambridge. His previous books include The Linguistic Relationship between Armenian and Greek (1994) and Indo-European Word For- mation (co-edited with Birgit Anette Olson, 2004). CAMBRIDGE TEXTBOOKS IN LINGUISTICS General editors: p. austin, j. bresnan, b. comrie, s. crain, w. dressler, c. ewen, r. lass, d. lightfoot, k. rice, i. roberts, s. romaine, n. v. smith Indo-European Linguistics An Introduction In this series: j. allwood, l.-g. anderson and o.¨ dahl Logic in Linguistics d. -
The Birth of Grammar in Greece 3
CHAPTE! THI!TY"TH!EE !e Birth of Grammar in Greece Andreas U. Schmidhauser Grammar as one understands it today gives an account of the system of rules govern- ing the construction of syllables, words, and sentences in a certain language. !e sci- ence thus called was independently—and very di"erently—developed at about the same time in ancient India and Greece: Sanskrit grammar is the work of Pā!ini ( !. c. $%% BC); Greek grammar is the creation of Chrysippus of Soli ( !. &$% BC). Both Pā!ini and Chrysippus not only inaugurate a new 'eld but also represent the culmi- nation of centuries of linguistic thought: what distinguishes them from their prede- cessors is that they, for the 'rst time, integrated the results obtained into one theory. !e term “grammar” itself is of Greek origin: literally, ἡ γραμματική (or in full: ἡ γραμματικὴ τέχνη) is the skill, expertise, or knowledge belonging to a person consid- ered γραμματικός; and the adjective γραμματικός is derived from the noun γράμμα [letter], which in turn derives from the verb γράφειν [write, draw]. Over time, the meaning of γραμματικός and thus also of γραμματική changed. One can distinguish four stages: i. In the fourth century BC, when γραμματικός 'rst appears, it is used to describe someone who knows the “letters”: a person versed in grammar, that is, knows how to read and write, can set apart vowels, consonants, and semiconsonants, and suchlike (e.g. Pl. Cra. $;<e; Phlb. <=d; Soph. &>;a). ii. From the third century BC, γραμματική comes to be used for what one would now call philology and criticism (e.g.