Punjabi Language During British Rule
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Named Entity Recognition System for Kashmiri Language Iamir Bashir Malik, Iikhushboo Bansal Istudent, M.Tech, Iiassistant Professor I,Iidept
ISSN : 2347 - 8446 (Online) International Journal of Advanced Research in ISSN : 2347 - 9817 (Print) Vol. 3, Issue 2 (Apr. - Jun. 2015) Computer Science & Technology (IJARCST 2015) Named Entity Recognition System for Kashmiri Language IAmir Bashir Malik, IIKhushboo Bansal IStudent, M.Tech, IIAssistant Professor I,IIDept. of CSE, Desh Bhagat University, Mandi Gobindgarh, Punjab, India Abstract Named Entity Recognition (NER) is a task which helps in finding out Persons name, Location names, Organization names, Place, Date, Time etc. and classifies them into predefined different categories. Named Entity Recognition plays a major role in various Natural Language Processing (NLP) fields like Information Extraction, Machine Translations and Question Answering. Unfortunately Kashmiri language which is a scarce resourced language has not been taken into account. This paper describes the problems of NER in the context of Kashmiri Language and provides relevant solutions. Keywords Named Entity, Named Entity Recognition, Natural language process, Kashmiri language text. I. Introduction is as follows. The term Named Entity (NE) was evolved during the sixth (1) “Micromax”represent anorganization and “ Dec19, 2014” Message Understanding Conference (MUC -6, 1995).Named represent dateand “smartphone” represent entity and “had Entity Recognition (NER) is also knows as entity identification is a launched its on” represent others. subtask of information extraction (IE). NER extracts and classifies The named entities may be of any type such as given below -
Phd, MS/Mphil BS/Bsc (Hons) 2021-22 GCU
PhD, MS/MPhil BS/BSc (Hons) GCU GCU To Welcome 2021-22 A forward-looking institution committed to generating and disseminating cutting- GCUedge knowledge! Our vision is to provide students with the best educational opportunities and resources to thrive on and excel in their careers as well as in shaping the future. We believe that courage and integrity in the pursuit of knowledge have the power to influence and transform the world. Khayaali Production Government College University Press All Rights Reserved Disclaimer Any part of this prospectus shall not be reproduced in any form or by any means without permission from Government CONTENTS College University Press Lahore. University Rules, Regulations, Policies, Courses of Study, Subject Combinations and University Dues etc., mentioned in this Prospectus may be withdrawn or amended by the University authorities at any time without any notice. The students shall have to follow the amended or revised Rules, Regulations, Policies, Syllabi, Subject Combinations and pay University Dues. Welcome To GCU 2 Department of History 198 Vice Chancellor’s Message 6 Department of Management Studies 206 Our Historic Old Campus 8 Department of Philosophy and Interdisciplinary Studies 214 GCU’s New Campus 10 Department of Political Science 222 Department of Sociology 232 (Located at Kala Shah Kaku) 10 Journey from Government College to Government College Faculty of Languages, Islamic and Oriental Learning University, Lahore 12 Department of Arabic and Islamic Studies 242 Legendary Alumni 13 Department of -
(And Potential) Language and Linguistic Resources on South Asian Languages
CoRSAL Symposium, University of North Texas, November 17, 2017 Existing (and Potential) Language and Linguistic Resources on South Asian Languages Elena Bashir, The University of Chicago Resources or published lists outside of South Asia Digital Dictionaries of South Asia in Digital South Asia Library (dsal), at the University of Chicago. http://dsal.uchicago.edu/dictionaries/ . Some, mostly older, not under copyright dictionaries. No corpora. Digital Media Archive at University of Chicago https://dma.uchicago.edu/about/about-digital-media-archive Hock & Bashir (eds.) 2016 appendix. Lists 9 electronic corpora, 6 of which are on Sanskrit. The 3 non-Sanskrit entries are: (1) the EMILLE corpus, (2) the Nepali national corpus, and (3) the LDC-IL — Linguistic Data Consortium for Indian Languages Focus on Pakistan Urdu Most work has been done on Urdu, prioritized at government institutions like the Center for Language Engineering at the University of Engineering and Technology in Lahore (CLE). Text corpora: http://cle.org.pk/clestore/index.htm (largest is a 1 million word Urdu corpus from the Urdu Digest. Work on Essential Urdu Linguistic Resources: http://www.cle.org.pk/eulr/ Tagset for Urdu corpus: http://cle.org.pk/Publication/papers/2014/The%20CLE%20Urdu%20POS%20Tagset.pdf Urdu OCR: http://cle.org.pk/clestore/urduocr.htm Sindhi Sindhi is the medium of education in some schools in Sindh Has more institutional backing and consequent research than other languages, especially Panjabi. Sindhi-English dictionary developed jointly by Jennifer Cole at the University of Illinois Urbana- Champaign and Sarmad Hussain at CLE (http://182.180.102.251:8081/sed1/homepage.aspx). -
Resource, Valuable Archive on Social and Economic History in Western India
H-Asia Resource, Valuable archive on social and economic history in Western India Discussion published by Sumit Guha on Friday, September 2, 2016 Note on a valuable new resource: Haribhakti Collection Department of History, Faculty of Arts The Maharaja Sayajirao University of Baroda, Vadodara, Gujarat-INDIA Foundation: 1949 Eighteenth Century Baroda in Gujarat has not only evidenced the emergence of political potentates in Gaekwads but also the pecuniary mainstays amongst citizens. The foremost were the Haribhaktis’[i]; who are remembered for business success in areas such as money-lending/indigenous banking, coin- changing, traders in private capacity and banking; formation of Gaekwad’s State financial policy- which stimulated rural resources and commercial economy that benefitted in the making of urban Gujarat during the 18th and 19th centuries; and as philanthropists in individual capability. The business acumen and continuous support to Gaekwad fetched honours and titles like Nagar‘ Seth’ and ‘Raj Ratan' ‘Raj Mitra’ ‘Chiranjiva’&c to them by rulers and citizens. Their firm building in Vadodara dates back to last quarter of 19th century; and its location is near Mandvi darwaza in Ghadiali pol popularly known as Haribhakti ni Haveli “…made up of red and yellow wood and …stands as grandeur of 200 years past”. This family as state bankers were Kamvisadars, traders and Nagarseths of Gaekwad`s of Baroda. Their multifunctional role is apparent as we have more than 1000bahis/ account books and around 10,000 loose sheets of correspondence and statements;kundlis, astrological charts, receipts of transactions related to religious donations, grants for educational and health infrastructure, greetings, invitations, admiration and condolence letters etc. -
Tai Lü / ᦺᦑᦟᦹᧉ Tai Lùe Romanization: KNAB 2012
Institute of the Estonian Language KNAB: Place Names Database 2012-10-11 Tai Lü / ᦺᦑᦟᦹᧉ Tai Lùe romanization: KNAB 2012 I. Consonant characters 1 ᦀ ’a 13 ᦌ sa 25 ᦘ pha 37 ᦤ da A 2 ᦁ a 14 ᦍ ya 26 ᦙ ma 38 ᦥ ba A 3 ᦂ k’a 15 ᦎ t’a 27 ᦚ f’a 39 ᦦ kw’a 4 ᦃ kh’a 16 ᦏ th’a 28 ᦛ v’a 40 ᦧ khw’a 5 ᦄ ng’a 17 ᦐ n’a 29 ᦜ l’a 41 ᦨ kwa 6 ᦅ ka 18 ᦑ ta 30 ᦝ fa 42 ᦩ khwa A 7 ᦆ kha 19 ᦒ tha 31 ᦞ va 43 ᦪ sw’a A A 8 ᦇ nga 20 ᦓ na 32 ᦟ la 44 ᦫ swa 9 ᦈ ts’a 21 ᦔ p’a 33 ᦠ h’a 45 ᧞ lae A 10 ᦉ s’a 22 ᦕ ph’a 34 ᦡ d’a 46 ᧟ laew A 11 ᦊ y’a 23 ᦖ m’a 35 ᦢ b’a 12 ᦋ tsa 24 ᦗ pa 36 ᦣ ha A Syllable-final forms of these characters: ᧅ -k, ᧂ -ng, ᧃ -n, ᧄ -m, ᧁ -u, ᧆ -d, ᧇ -b. See also Note D to Table II. II. Vowel characters (ᦀ stands for any consonant character) C 1 ᦀ a 6 ᦀᦴ u 11 ᦀᦹ ue 16 ᦀᦽ oi A 2 ᦰ ( ) 7 ᦵᦀ e 12 ᦵᦀᦲ oe 17 ᦀᦾ awy 3 ᦀᦱ aa 8 ᦶᦀ ae 13 ᦺᦀ ai 18 ᦀᦿ uei 4 ᦀᦲ i 9 ᦷᦀ o 14 ᦀᦻ aai 19 ᦀᧀ oei B D 5 ᦀᦳ ŭ,u 10 ᦀᦸ aw 15 ᦀᦼ ui A Indicates vowel shortness in the following cases: ᦀᦲᦰ ĭ [i], ᦵᦀᦰ ĕ [e], ᦶᦀᦰ ăe [ ∎ ], ᦷᦀᦰ ŏ [o], ᦀᦸᦰ ăw [ ], ᦀᦹᦰ ŭe [ ɯ ], ᦵᦀᦲᦰ ŏe [ ]. -
Shahmukhi to Gurmukhi Transliteration System: a Corpus Based Approach
Shahmukhi to Gurmukhi Transliteration System: A Corpus based Approach Tejinder Singh Saini1 and Gurpreet Singh Lehal2 1 Advanced Centre for Technical Development of Punjabi Language, Literature & Culture, Punjabi University, Patiala 147 002, Punjab, India [email protected] http://www.advancedcentrepunjabi.org 2 Department of Computer Science, Punjabi University, Patiala 147 002, Punjab, India [email protected] Abstract. This research paper describes a corpus based transliteration system for Punjabi language. The existence of two scripts for Punjabi language has created a script barrier between the Punjabi literature written in India and in Pakistan. This research project has developed a new system for the first time of its kind for Shahmukhi script of Punjabi language. The proposed system for Shahmukhi to Gurmukhi transliteration has been implemented with various research techniques based on language corpus. The corpus analysis program has been run on both Shahmukhi and Gurmukhi corpora for generating statistical data for different types like character, word and n-gram frequencies. This statistical analysis is used in different phases of transliteration. Potentially, all members of the substantial Punjabi community will benefit vastly from this transliteration system. 1 Introduction One of the great challenges before Information Technology is to overcome language barriers dividing the mankind so that everyone can communicate with everyone else on the planet in real time. South Asia is one of those unique parts of the world where a single language is written in different scripts. This is the case, for example, with Punjabi language spoken by tens of millions of people but written in Indian East Punjab (20 million) in Gurmukhi script (a left to right script based on Devanagari) and in Pakistani West Punjab (80 million), written in Shahmukhi script (a right to left script based on Arabic), and by a growing number of Punjabis (2 million) in the EU and the US in the Roman script. -
This Document Serves As a Summary of the UC Berkeley Script Encoding Initiative's Recent Activities. Proposals Recently Submit
L2/11‐049 TO: Unicode Technical Committee FROM: Deborah Anderson, Project Leader, Script Encoding Initiative, UC Berkeley DATE: 3 February 2011 RE: Liaison Report from UC Berkeley (Script Encoding Initiative) This document serves as a summary of the UC Berkeley Script Encoding Initiative’s recent activities. Proposals recently submitted to the UTC that have involved SEI assistance include: Afaka (Everson) [preliminary] Elbasan (Everson and Elsie) Khojki (Pandey) Khudawadi (Pandey) Linear A (Everson and Younger) [revised] Nabataean (Everson) Woleai (Everson) [preliminary] Webdings/Wingdings Ongoing work continues on the following: Anatolian Hieroglyphs (Everson) Balti (Pandey) Dhives Akuru (Pandey) Gangga Malayu (Pandey) Gondi (Pandey) Hungarian Kpelle (Everson and Riley) Landa (Pandey) Loma (Everson) Mahajani (Pandey) Maithili (Pandey) Manichaean (Everson and Durkin‐Meisterernst) Mende (Everson) Modi (Pandey) Nepali script (Pandey) Old Albanian alphabets Pahawh Hmong (Everson) Pau Cin Hau Alphabet and Pau Cin Hau Logographs (Pandey) Rañjana (Everson) Siyaq (and related symbols) (Pandey) Soyombo (Pandey) Tani Lipi (Pandey) Tolong Siki (Pandey) Warang Citi (Everson) Xawtaa Dorboljin (Mongolian Horizontal Square script) (Pandey) Zou (Pandey) Proposals for unencoded Greek papyrological signs, as well as for various Byzantine Greek and Sumero‐Akkadian characters are being discussed. A proposal for the Palaeohispanic script is also underway. Deborah Anderson is encouraging additional participation from Egyptologists for future work on Ptolemaic signs. She has received funding from the National Endowment for the Humanities and support from Google to cover work through 2011. . -
Saraiki Suba Movement in the Punjab: Viability in Focus
Pakistan Perspectives Vol. 20, No.2, July-December 2015 Saraiki Suba Movement in the Punjab: Viability in Focus Akhtar Hussain Sandhu* Abstract The pre-partition politics which revolved around religion was shifted to the language and culture in the post-partition era. After independence many parties emerged and realigned themselves on the basis of language and culture. This research is an effort to analyze the viability of the demand for Saraiki suba from different perspectives. It argues that the Saraiki suba movement has neither sound reasons nor justifiable political strength. The Saraiki suba‟s leadership which never won elections throughout the political history of the region, claims areas of Punjab, KPK and Sindh. The demand to create Saraiki suba is fraught with „dangers‟ including enslavement of the people of south Punjab by feudal lords. The paper recommends some practical steps for the political resolution of this issue. ______ Historical background The majority of the Muslims of the Indian subcontinent once emerged as a united political entity on the basis of religion. After 1947, languages perceived as a symbol of unity motivated the separatist tendencies in Pakistan. Absence of Hindu threat loosened the strength of Muslim nationhood and regional nationalism or sub-nationalism appeared as a gigantic problem. The main cause behind this problem was the impotent and incompetent leadership who could not perform well in redressing the grievances of the people. Bengali, Pakhtoon, Baloch, Barohi, Saraiki, Sindhi, Hindko and other voices based on language and culture became an important element behind politics at least at the regional level. In Pakistan persistent economic problems, hardships and violation of basic rights are some of the main factors behind general discontent. -
Central European Journal of International & Security Studies
Vysoká škola veřejné správy a mezinárodních vztahů v Praze Volume 1 / Issue 2 / November 2007 Central European Journal of International & Security Studies Volume 1 Issue 2 November 2007 Contents Editor’s Note . 5 Special Report Daniel Kimmage and Kathleen Ridolfo / Iraqi Insurgent Media: The War of Images and Idea . 7 Research Articles Marketa Geislerova / The Role of Diasporas in Foreign Policy: The Case of Canada . .90 Atsushi Yasutomi and Jan Carmans / Security Sector Reform (SSR) in Post-Confl ict States: Challenges of Local Ownership . 109 Nikola Hynek / Humanitarian Arms Control, Symbiotic Functionalism and the Concept of Middlepowerhood . 132 Denis Madore / The Gratuitous Suicide by the Sons of Pride: On Honour and Wrath in Terrorist Attacks . 156 Shoghig Mikaelian / Israeli Security Doctrine between the Thirst for Exceptionalism and Demands for Normalcy . 177 Comment & Analysis Svenja Stropahl and Niklas Keller / Adoption of Socially Responsible Investment Practices in the Chinese Investment Sector – A Cost-Benefi t Approach . 193 Richard Lappin / Is Peace-Building Common Sense? . 199 Balka Kwasniewski / American Political Power: Hegemony on its Heels? . 201 Notes on Contributors . 207 CEJISS Contact Information. 208 5 Editor’s Note: CE JISS In readying the content of Volume 1 Issue 2 of CEJISS, I was struck by the growing support this journal has received within many scholarly and profes- sional quarters. Building on the success of the fi rst issue, CEJISS has man- aged to extend its readership to the universities and institutions of a number of countries both in the EU and internationally. It is truly a pleasure to watch this project take on a life of its own and provide its readers with cutting-edge analy- sis of current political affairs. -
California's A.Jnjabi- Mexican- Americans
CULTURE HERITAGE Amia&i-Mexicon-Americans California's A.Jnjabi Mexican Americans Ethnic choices made by the descendants of Punjabi pioneers and their Mexican wives by Karen Leonard he end of British colonial rule in India and the birth of two new nations-India and Pakistan-was celebrated in California in T 1947 by immigrant men from India's Punjab province. Their wives and children celebrated with them. With few exceptions, these wives were of Mexican ancestry and their children were variously called "Mexican-Hindus," "half and halves," or sim ply, like their fathers, "Hindus," an American misno mer for people from India. In a photo taken during the 1947 celebrations in the northern California farm town of Yuba City, all the wives of the "Hindus" are of Mexican descent, save two Anglo women and one woman from India. There were celebrations in Yuba City in 1988, too; the Sikh Parade (November 6) and the Old-Timers' Reunion Christmas Dance (November 12). Descend ants of the Punjabi-Mexicans might attend either or The congregation of the Sikh temple in Stockton, California, circa 1950. - -_ -=- _---=..~...;..:..- .. both of these events-the Sikh Parade, because most of the Punjabi pioneers were Sikhs, and the annual ChristmaB dance, because it began as a reunion for descendants of the Punjabi pioneers. Men from In dia's Punjab province came to California chiefly between 1900 and 1917; after that, immigration practices and laws discriminated against Asians and legal entry was all but impossible. Some 85 percent of the men who came during those years were Sikhs, 13 percent were Muslims, and only 2 percent were really Hindus. -
Devan¯Agar¯I for TEX Version 2.17.1
Devanagar¯ ¯ı for TEX Version 2.17.1 Anshuman Pandey 6 March 2019 Contents 1 Introduction 2 2 Project Information 3 3 Producing Devan¯agar¯ıText with TEX 3 3.1 Macros and Font Definition Files . 3 3.2 Text Delimiters . 4 3.3 Example Input Files . 4 4 Input Encoding 4 4.1 Supplemental Notes . 4 5 The Preprocessor 5 5.1 Preprocessor Directives . 7 5.2 Protecting Text from Conversion . 9 5.3 Embedding Roman Text within Devan¯agar¯ıText . 9 5.4 Breaking Pre-Defined Conjuncts . 9 5.5 Supported LATEX Commands . 9 5.6 Using Custom LATEX Commands . 10 6 Devan¯agar¯ıFonts 10 6.1 Bombay-Style Fonts . 11 6.2 Calcutta-Style Fonts . 11 6.3 Nepali-Style Fonts . 11 6.4 Devan¯agar¯ıPen Fonts . 11 6.5 Default Devan¯agar¯ıFont (LATEX Only) . 12 6.6 PostScript Type 1 . 12 7 Special Topics 12 7.1 Delimiter Scope . 12 7.2 Line Spacing . 13 7.3 Hyphenation . 13 7.4 Captions and Date Formats (LATEX only) . 13 7.5 Customizing the date and captions (LATEX only) . 14 7.6 Using dvnAgrF in Sections and References (LATEX only) . 15 7.7 Devan¯agar¯ıand Arabic Numerals . 15 7.8 Devan¯agar¯ıPage Numbers and Other Counters (LATEX only) . 15 1 7.9 Category Codes . 16 8 Using Devan¯agar¯ıin X E LATEXand luaLATEX 16 8.1 Using Hindi with Polyglossia . 17 9 Using Hindi with babel 18 9.1 Installation . 18 9.2 Usage . 18 9.3 Language attributes . 19 9.3.1 Attribute modernhindi . -
Pakistan Connection Diasporas @ BBC World Service Audience
Pakistan Connection Diasporas @ BBC World Service Audience Research Report Authors: Marie Gillespie, Sadaf Rivzi, Matilda Andersson, Pippa Virdee, Lucy Michael, Sophie West A BBC World Service / Open University Research Partnership 1 Foreword What does a shop owner in Bradford have in common with a graduate from Princeton USA, a construction worker in Bahrain and a banker in the City of London? All are users of the bbcurdu.com website, and all are part of the global Pakistani diaspora. The term diaspora is used to describe the global dispersion of migrant groups of various kinds. Diasporas are of growing economic, political and cultural significance. In a world where migration, geopolitical dynamics, communication technologies and transport links are continually changing, it is clear that culture and geography no longer map neatly onto one another. Understanding diaspora groups inside and outside its base at Bush House, London, is ever more important for the BBC World Service, too. Diaspora audiences are increasingly influencing the way the BBC conceives and delivers output. For example, over 60% of the weekly users of bbcurdu.com are accessing the site from outside the subcontinent, and this proportion is rising. The same trend has been observed for other BBC language websites. (See BBC World Agenda, September 2008.) The research presented here is based on a unique partnership between BBC World Service Marketing Communication and Audiences (MC&A) and The Open University. It was funded primarily by MC&A but it was also generously supported by the Arts and Humanities Research Council (AHRC) though the Diasporas, Migration and Identities Research Programme, which funded a project entitled ‘Tuning In: Diasporic Contact Zones at BBC World Service’ (Grant Award reference AH/ES58693/1).