Optimal Word Segmentation for Neural Machine Translation Into Dravidian Languages

Prajit Dhar, Arianna Bisazza, Gertjan van Noord
University of Groningen
{p.dhar, a.bisazza, g.j.m.van.noord}@rug.nl

Proceedings of the 8th Workshop on Asian Translation, pages 181–190, Bangkok, Thailand (online), August 5-6, 2021. ©2021 Association for Computational Linguistics

Abstract

Dravidian languages, such as Kannada and Tamil, are notoriously difficult to translate by state-of-the-art neural models. This stems from the fact that these languages are morphologically very rich as well as being low-resourced. In this paper, we focus on subword segmentation and evaluate Linguistically Motivated Vocabulary Reduction (LMVR) against the more commonly used SentencePiece (SP) for the task of translating from English into four different Dravidian languages. Additionally we investigate the optimal subword vocabulary size for each language. We find that SP is the overall best choice for segmentation, and that larger subword vocabulary sizes lead to higher translation quality.

1 Introduction

Dravidian languages are an important family of languages spoken by about 250 million people primarily located in Southern India and Sri Lanka (Steever, 2019). Kannada (KN), Malayalam (MA), Tamil (TA) and Telugu (TE) are the four most spoken Dravidian languages with approximately 47, 34, 71 and 79 million native speakers, respectively. Together, they account for 93% of all Dravidian language speakers. While Kannada, Malayalam and Tamil are classified as South Dravidian languages, Telugu belongs to the South-Central Dravidian languages. All four languages are SOV (Subject-Object-Verb) languages with free word order. They are highly agglutinative and inflectionally rich languages. Additionally, each language has a different writing system. Table 1 presents an English sentence example and its Dravidian-language translations.

EN  He was born in Thirukkuvalai village in Nagapattinam District on 3rd June, 1924.
KN  ಅವರು ಗಪಟ ಣಂ ಯ ರುಕು ವಲ ಮದ 1924ರ ಜೂ 3ರಂದು ಜದ ರು.
    avaru nāgapatṭaṇ aṃ jilleya tirukkuvalay grāmadalli 1924ra jūn 3randu janisiddaru.
ML  1924ല ് നാഗപണം ജിയിെല തിരുുവൈള ഗാമിലാണ് അേഹം ജനിത്
    1924l nāgapatṭaṇ aṃ jillayile tirukkuvalaị grāmattilān ̣addēham janiccat.
TA  நாகபன மாவட வைள ராம அவ 1924-ஆ ஆ ஜூ மாத 3-ஆ ேத றதா.
    nāgappatṭinaṃ māvatṭaṃ tirukkuvalaiḳ kirāmattil avar 1924-ām ānṭ ụ jūn mātam 3-ām tēti pirantār.
TE  ఆయన గపటణం ౖ మం 1924 3న జం.
    āyana nāgapatṭaṇ aṃ jillā tirukkuvālai grāmanlō 1924 jūn 3na janmincāru.

Table 1: Example sentence in English along with its translation and transliteration in the four Dravidian languages.

The highly complex morphology of the Dravidian languages under study is illustrated if we compare translated sentence pairs. The analysis of our parallel datasets (Section 4.1, Table 3) shows for instance that an average English sentence contains almost ten times as many words as its Kannada equivalent. For the other three languages, the ratio is a bit smaller but the difference with English remains considerable. This indicates why it is important to consider word segmentation algorithms as part of the translation system.

In this paper we describe our work on Neural Machine Translation (NMT) from English into the Dravidian languages Kannada, Malayalam, Tamil and Telugu. We investigated the optimal translation settings for the pairs and in particular looked at the effect of word segmentation. The aim of the paper is to answer the following research questions:

• Does LMVR, a linguistically motivated word segmentation algorithm, outperform the purely data-driven SentencePiece?

• What is the optimal subword dictionary size for translating from English into these Dravidian languages?

In what follows, we review the relevant previous work (Sect. 2), introduce the two segmenters (Sect. 3), describe the experimental setup (Sect. 4), and present our answers to the above research questions (Sect. 5).

2 Previous Work

2.1 Translation Systems

Statistical Machine Translation One of the earliest automatic translation systems for English into a Dravidian language was the English→Tamil system by Germann (2001). They trained a hybrid rule-based/statistical machine translation system on only 5k English-Tamil parallel sentences. Ramasamy et al. (2012) created SMT systems (phrase-based and hierarchical) which were trained on a dataset of 190k parallel sentences (henceforth referred to as UFAL). They also reported that applying pre-processing steps involving morphological rules based on Tamil suffixes improved the BLEU score of the baseline model to a small extent (from 9.42 to 9.77). For the Indic languages multilingual tasks of WAT-2018, the Phrasal-based SMT system of Ojha et al. (2018) achieved a BLEU score of 30.53.

Subsequent papers also focused on SMT systems for Malayalam and Telugu, with some notable work including (Anto and Nisha, 2016; Sreelekha and Bhattacharyya, 2017, 2018) for Malayalam and (Lingam et al., 2014; Yadav and Lingam, 2017) for Telugu.

Neural Machine Translation On the neural machine translation (NMT) side, there have been a handful of NMT systems trained on English→Tamil. On the aforementioned Indic languages multilingual tasks of WAT-2018, Sen et al. (2018) and Dabre et al. (2018) reported only 11.88 and 18.60 BLEU scores, respectively, for English→Tamil. The poor performance of these systems compared to the 30.53 BLEU score of the SMT system (Ojha et al., 2018) showed that those NMT systems were not yet suitable for translating into the morphologically rich Tamil.

However, the following year, Philip et al. (2019) outperformed Ramasamy et al. (2012) on the UFAL dataset with a BLEU score of 13.05 (the previous best score on this test set was 9.77). They report that techniques such as domain adaptation and back-translation can make training NMT systems on low-resource languages possible. Similar findings were also reported by Ramesh et al. (2020) for Tamil and Dandapat and Federmann (2018) for Telugu.

To the best of our knowledge and as of 2021, there has not been any scientific publication involving translation to and from Kannada, except for Chakravarthi et al. (2019). One possible reason for this could be the fact that sizeable corpora involving Kannada (i.e. in the order of magnitude of at least a thousand sentences) have been readily available only since 2019, with the release of the JW300 Corpus (Agić and Vulić, 2019).

Multilingual NMT Since 2018 several studies have presented multilingual NMT systems that can handle English→Malayalam, Tamil and Telugu translation (Dabre et al., 2018; Choudhary et al., 2020; Ojha et al., 2018; Sen et al., 2018; Yu et al., 2020; Dabre and Chakrabarty, 2020). In particular, Sen et al. (2018) presented results where the BLEU score improved when comparing monolingual and multilingual models. Conversely, Yu et al. (2020) found that NMT systems that were multi-way (Indic↔Indic) performed worse than English↔Indic systems.

To our knowledge, no work so far has explored the effect of the segmentation algorithm and dictionary size on the four languages: Kannada, Malayalam, Tamil and Telugu.

3 Subword Segmentation Techniques

Prior to the emergence of subword segmenters, translation systems were plagued with the issue of out-of-vocabulary (OOV) tokens. This was particularly an issue for translations involving agglutinative languages such as Turkish (Ataman and Federico, 2018) or Malayalam (Manohar et al., 2020). Various segmentation algorithms were brought forward to circumvent this issue and, in turn, improve translation quality.

Perhaps the most widely used algorithm in NMT to date is the language-agnostic Byte Pair Encoding (BPE) by Sennrich et al. (2016). Initially proposed by Gage (1994), BPE was repurposed by Sennrich et al. (2016) for the task of subword segmentation, and is based on a simple principle whereby pairs of character sequences that are frequently observed in a corpus get merged iteratively until a predetermined dictionary size is attained. In this paper we use a popular implementation of BPE, called SentencePiece (SP) (Kudo and Richardson, 2018).

While purely statistical algorithms are able to segment any token into smaller segments, there is [...]

To address this, Ataman et al. (2017) proposed a modification of Morfessor FlatCat (Grönroos et al., 2014), called Linguistically Motivated Vocabulary Reduction (LMVR). Specifically, LMVR imposes an extra condition on the cost function of Morfessor FlatCat so as to favour vocabularies of the desired size. In a comparison of LMVR to BPE, Ataman et al. (2017) reported a +2.3 BLEU improvement on the English-Turkish translation task of WMT18.

Given the encouraging results reported on the agglutinative Turkish language, we hypothesise that translation into Dravidian languages may also benefit from a linguistically motivated segmenter, and evaluate LMVR against SP across varying vocabulary sizes.

4 Experimental Setup

4.1 Training Corpora

The parallel training data is mostly taken from the datasets available for the MultiIndicMT task from WAT 2021.

Name  Domain  Available in: Kannada  Malayalam  Tamil  Telugu
Bible  Religion  18  1  14
ELRC  COVID-19  <1  <1  <1
GNOME  Technical  <1  <1  <1  <1
JW300  Religion  70  45  52  45
KDE  Technical  1  <1  <1  <1
NLPC  General  <1
OpenSubtitles  Cinema  26  3  3
CVIT-PIB  Press  5  10  10
PMIndia  Politics  10  4  3  8
Tanzil  Religion  18  9
Tatoeba  General  <1  <1  <1  <1
Ted2020  General  <1  <1  <1  1
TICO-19  COVID-19  <1
Ubuntu  Technical  <1  <1  <1  <1
UFAL  Mixed  11
Wikimatrix  General  <1  10  18
Wikititles  General  1

Table 2: Composition of training corpora. The numbers indicate the relative size (in percentages) of the corresponding part for that language.
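
As an illustration of the BPE merge principle described in Section 3, the following is a minimal toy sketch in Python. It is not the SentencePiece implementation used in the experiments. The function names (learn_bpe, merge_pair, segment), the toy word counts and the tie-breaking rule are assumptions made for this example, and the number of merges stands in for the predetermined dictionary size.

```python
from collections import Counter


def merge_pair(symbols, pair):
    """Replace every adjacent occurrence of `pair` in `symbols` by one merged symbol."""
    merged = []
    i = 0
    while i < len(symbols):
        if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == pair:
            merged.append(symbols[i] + symbols[i + 1])
            i += 2
        else:
            merged.append(symbols[i])
            i += 1
    return tuple(merged)


def learn_bpe(word_freqs, num_merges):
    """Learn up to `num_merges` merge operations from a {word: count} dictionary."""
    # Every word starts out as a sequence of single characters.
    vocab = Counter({tuple(word): freq for word, freq in word_freqs.items()})
    merges = []
    for _ in range(num_merges):
        # Count adjacent symbol pairs, weighted by word frequency.
        pairs = Counter()
        for symbols, freq in vocab.items():
            for left, right in zip(symbols, symbols[1:]):
                pairs[(left, right)] += freq
        if not pairs:
            break  # nothing left to merge
        # Most frequent pair; ties are broken by lexicographic order
        # (an arbitrary choice made here only for determinism).
        best = max(pairs, key=lambda p: (pairs[p], p))
        merges.append(best)
        new_vocab = Counter()
        for symbols, freq in vocab.items():
            new_vocab[merge_pair(symbols, best)] += freq
        vocab = new_vocab
    return merges


def segment(word, merges):
    """Segment a word by replaying the learned merges in order."""
    symbols = tuple(word)
    for pair in merges:
        symbols = merge_pair(symbols, pair)
    return list(symbols)


toy_counts = {"low": 5, "lower": 2, "newest": 6, "widest": 3}
toy_merges = learn_bpe(toy_counts, 3)
print(toy_merges)                     # [('s', 't'), ('e', 'st'), ('o', 'w')]
print(segment("lowest", toy_merges))  # ['l', 'ow', 'est']
```

In this sketch, a larger number of merges yields a larger subword inventory and longer segments, which is the quantity varied when comparing vocabulary sizes.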