An Analytical Study of Synonymy in Assamese Language Using Worldnet: Classification and Structure
Total Page:16
File Type:pdf, Size:1020Kb
An Analytical Study of Synonymy in Assamese Language Using WorldNet: Classification and Structure Shikhar Kr. Sarma Himadri Bharali Mayashree Mahanta Gauhati University Gauhati University Gauhati University Guwahati,Assam,India. Guwahati,Assam,India. Guwahati,Assam,India. [email protected] [email protected] [email protected] Utpal Saikia Dibyajyoti Sarmah Gauhati University Gauhati University Guwahati,Assam,India. Guwahati,Assam,India. [email protected] [email protected] with completely identical meaning. It is general- Abstract ly accepted that complete synonym is rare in nat- ural language. The discussion of synonyms comes under the study of lexical relation. Lexical relation analyses the meaning of the words in the The present paper aims to categorize different language which have related meanings. The idea types of synonymous words and also to high- of synonym is not only applied to lexical items, light their synonymic pattern as well as gram- but also idioms, larger expressions, of course. A matical categories found in Wordnet of As- samese language. Synonymy is an important lexicographer builds a synonym dictionary de- component of vocabulary of the language. It pending on the words which share the same se- establishes lexical relation between words. In mantic features in a given language. fact, the term ‘synonymy’ is applied to the two The present paper deals with lexical synonyms of or more words which share the same semantic the same word class, not with the phrasal syno- features. WorldNet is a lexical database con- nyms. We will categorize the synonymous words sisting of synsets. A synset is constructed by considering the semantic features of the words assembling a set of synonyms that together de- they share based on Assamese Wordnet. Besides, fine a unique sense and synset is the basic it is also an attempt to point out the synonymic foundation of Wordnet. Assamese language is pattern and the grammatical categories of synsets rich in synonyms. In Assamese WorldNet, more than 20,000 synsets are entered under the in Wordnet. categories of Noun, Verb, Adverb and Adjec- tive. These synsets can of different types ac- 2 A rapid sketch of Assamese Language cording to their semantic similarity, connota- Assamese is the easternmost New Indo-Aryan tion, denotation, stylistic variations etc. language spoken in the Brahmaputra valley com- prising at present six districts with Lakhimpur in 1 Introduction the extreme east and Goalpara in the west. Synonym is an important feature of the vocabu- Tibeto-Burman and the Khasi are the important lary of any language. But it is very difficult to ones. According to the 1991 census report, the give a clear, precise and correct definition of number of speakers of the language is almost synonymy. There are various approaches with 100000 billions. However, it is spoken as a se- numerous definitions of synonym and types of cond language by a considerable number of synonyms. Linguistically, two or more words in speakers of Tibeto-Burman languages like Bodo, the same language with very closely related Mising and Karbi. Traditionally, it has served as meaning are called synonyms. It is to be men- the lingua-franca or pidgin in the neighbouring tioned here that synonyms does not mean the states of Nagaland and Arunachal Pradesh. ‘sameness of meaning’ as there is no two terms The word ‘Assamese’ is an English one based and ‘desaja’. But his ‘Desaja’ words are shown on the anglicized form ‘Assam’ from the native as loan words in which maximum words are word ‘Asam’. The word Assam was connected Perso-Arabic words (Pathak, 2004). Therefore, with the Shan invaders of the Brahmaputra val- his vocabulary classification cannot be taken as ley during the 13th century. In modern Assamese, valid. On the other hand, though Banikanta Shan invaders of the 13th century are termed as Kakati’s classification of Assamese vocabulary ‘Ahoms’. covers almost all the aspects, yet his classifica- Presence of Assamese language dated back to tion also cannot be regarded as valid one. the literatures of Charyapadas, written by It is interesting to note here that there are a Budhist scholars. The Assamese language pre- large amount of loan words in Assamese lan- sent in charyapadas reflects its evolutionary stag- guage. In day-to-day life these loan words have es in initial state. Literatures with distinct As- been used extremely to express feelings, ideas samese language are found from the Kavyas of etc. Moreover, it is seen that Perso-Arabic words the pre Sankari era. This was in 13th century have been used in Assamese language. These AD. From that time onwards pure Assamese lan- words occupy a significant status in Assamese guage with its structured forms evolved language. (Goswami, 1983). Assamese script is derived Assamese vocabulary can be divided into the from Brahmi script. It played a vital role in the following heads (Sarma et al., 2012): evolution of the Indian script. The rock inscrip- tion and copper plate from 5th to 9th century 1. Aryan or words of Sanskrit origin showed the evolution of Assamese script. There a. Tatsama are eight vowel phonemes in Assamese. There b. Semi-tatsama are twenty-one consonant and two semi-vowel c. Tadbhava phonemes in the Standard Colloquial Assamese 2. Non-Aryan words (Kakati, 2008). a. Austro-Asiatic b. Tibeto-Burmese 2 A Brief Discussion of Assamese Vo- c. Tai-Ahom cabulary d. Dravidian 3. Loan words The scope of Assamese vocabulary is very vast. a. Words coming from N.I.A. lan- It consists of words of Sanskrit origin, Non- guages Aryan words, dialect oriented words. Besides b. Foreign Words Assamese socio-cultural influences are also per- i.Persian ceived in the vocabulary of the language. It is to ii.Arabic be noted here that Assamese still lacks a com- iii.Portugese mon vocabulary dictionary in the language. iv.English Moreover, no dictionary was found in the early c. Loan translations and the middle ages. The selected modern dic- i.Translated words tionaries are – ‘A Dictionary in Assamese and ii.Terminology English by Miles Bronson’ (1867); ‘Hemkosh’ 4. Unclassified words (1900), by Hemchandra Barua and later it is th a. Hybrid compiled by Debananda Barua (the 14 edition) b. Onomatopoetic which included 1, 54,428 words; ‘Chandrakanta c. Compound Abhidhan’ (2004, 3rd edition) ‘Adhunik Asomiya Sabdakosh’ (2007, 9th edition), ‘Asamiya Jatiya 3 Wordnet and Synonym Sets Building Abhidhan’ (2010) and many other vocabulary in Assamese Language dictionaries are available in Assamese language. No common standard vocabulary dictionary has Wordnet is a repository of words of a language. been made till today. Many critics have prepared It is basically a synonymous lexical database. vocabulary lists in their own way. Earlier philol- The words are classed together according to their ogists like Kaliram Medhi and Banikanta kakati similarity of meanings. Vocabulary plays a main had classified vocabulary list in their own style. role in building Wordnet. The task of Assamese Kaliram Medhi in ‘Asomiya Byakaran aru Wordnet building is almost ready to provide us Bhasatattva’ has provided a classification As- with all the lexical words. Though Assamese samese vocabulary such as ‘tatsama’, ‘tatbhava’ wordnet tries to cover all the Assamese word forms, yet there are still many words in the lan- pending on its resemblance of meaning, distribu- guage those need to be entered (Sarma et al., tion, style, form etc. 2010) 3.2 Near Synonymy 3.1 Classification of Synonymy in As- Near synonyms are those words whose meaning samese is relatively close or more or less similar, but not Assamese language is rich in synonyms. We can fully intersubstitutable. They vary in terms of classify synonyms under the following three their shades of denotation, connotation, heads: implicature, emphasis or register. Near syno- 1. Absolute synonymy: nyms are extensively found in Assamese. For Words can be called absolutely synonymous if example: they share the complete semantic features in all ভাল ‘bhaal’ (good): সজ্জন,সত্ ‘sajjan, sat’ contexts of occurrences. However, it is generally All these words denote the quality of goodness. recognized that absolute synonyms are almost But they differ from one another in respect to non-existent. Though it is very rare, it certainly their denotational meaning. The word exists in Assamese languages. It is limited most- ভাল ly to dialectical variations and technical or insti- ‘bhaal’ is a generic term, whereas সজ্জন ‘sajjan’ tutional terms. For example: is more particular applicable only to human be- বিদ্যালয় ‘bidyaaloi’ (school): পঢাশালী, পাঠশালা ing. Besides, সত্ ‘sat’ conforms to both animate ‘parhaashaalii, haathshaalaa’ and inanimate things. The usages of these synsets খিৰ ‘khabar’ (news): িাতবৰ, সংিাদ, সংিাদ পত্ৰ, are shown below- িাতবৰ কাকত ‘baatari, sangbaad, sangbaad hatra, ভাল ‘bhaal’ - ভাল / কাম/ বকতাপ ‘bhaal baatari kaakati’ byakti/kaam/kitaap’ (good person/work/book) 2. Stylistic synonymy: সজ্জন ‘sajjan’- সজ্জন / *কাম/ *বকতাপ Stylistic synonyms are words conveying the ‘sat’- / / * same concept but differing in stylistic connota- সত্ সত্ কাম বকতাপ tions. Stylistic synonymy is very common in As- Near-synonyms can vary as follows- samese language. For example- Type of varia- মৃত্যু ‘mrityu’ (death): Examples tion মৰণ, প্রয়াণ, প্রাণত্যাগ, মহাপ্রয়াণ, বিকু প্রয়াণ, বতৰৰাধান, কমম, চাকবৰ ‘karma, chaakari’ , , , , ‘maran, Collocational বতৰৰাভাৱ কাল ললাকান্তৰ কালগ্রাম লদহাৱসান (work) prayaan, praantyaag, mahaaprayaan,boikuntha prayaan, tirodhaan, tirobhaabh, kaal, lokaantar, Stylistic, for- সানীয়, মাননীয়, মািৰ kaalagraam, dehaawasaan’ mality ‘sanmaniiya, maananiiya, maanyabar’ (honourable) সুন্দৰ ‘sundar’ (beautiful): ধুনীয়া,লদখবনয়াৰ,쇂পহ, Stylistic, dhansha, patan’ লমাহনীয়,নয়নাবভৰাম,চকুত লগা, চকু জুৰৰাৱা, নয়ন ধ্বংস, পতন forced (destruction) জুৰৰাৱা, বিৰতাপন,চকুত চমক লৰগাৱা ‘dhuniaa, dekhaniyaar, rupah, mohaniiya, nayanaabhiraam, Expressed atti- ক্ষীণ, লাহী, শুকান khin, laahii, sakut lagaa, saku juruwaa, nayan juruwaa, sakut tude sukaan thin samak lagowaa’ 3.