Ideology, Language Attitudes, and Status of Punjabi In
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Named Entity Recognition System for Kashmiri Language Iamir Bashir Malik, Iikhushboo Bansal Istudent, M.Tech, Iiassistant Professor I,Iidept
ISSN : 2347 - 8446 (Online) International Journal of Advanced Research in ISSN : 2347 - 9817 (Print) Vol. 3, Issue 2 (Apr. - Jun. 2015) Computer Science & Technology (IJARCST 2015) Named Entity Recognition System for Kashmiri Language IAmir Bashir Malik, IIKhushboo Bansal IStudent, M.Tech, IIAssistant Professor I,IIDept. of CSE, Desh Bhagat University, Mandi Gobindgarh, Punjab, India Abstract Named Entity Recognition (NER) is a task which helps in finding out Persons name, Location names, Organization names, Place, Date, Time etc. and classifies them into predefined different categories. Named Entity Recognition plays a major role in various Natural Language Processing (NLP) fields like Information Extraction, Machine Translations and Question Answering. Unfortunately Kashmiri language which is a scarce resourced language has not been taken into account. This paper describes the problems of NER in the context of Kashmiri Language and provides relevant solutions. Keywords Named Entity, Named Entity Recognition, Natural language process, Kashmiri language text. I. Introduction is as follows. The term Named Entity (NE) was evolved during the sixth (1) “Micromax”represent anorganization and “ Dec19, 2014” Message Understanding Conference (MUC -6, 1995).Named represent dateand “smartphone” represent entity and “had Entity Recognition (NER) is also knows as entity identification is a launched its on” represent others. subtask of information extraction (IE). NER extracts and classifies The named entities may be of any type such as given below -
(And Potential) Language and Linguistic Resources on South Asian Languages
CoRSAL Symposium, University of North Texas, November 17, 2017 Existing (and Potential) Language and Linguistic Resources on South Asian Languages Elena Bashir, The University of Chicago Resources or published lists outside of South Asia Digital Dictionaries of South Asia in Digital South Asia Library (dsal), at the University of Chicago. http://dsal.uchicago.edu/dictionaries/ . Some, mostly older, not under copyright dictionaries. No corpora. Digital Media Archive at University of Chicago https://dma.uchicago.edu/about/about-digital-media-archive Hock & Bashir (eds.) 2016 appendix. Lists 9 electronic corpora, 6 of which are on Sanskrit. The 3 non-Sanskrit entries are: (1) the EMILLE corpus, (2) the Nepali national corpus, and (3) the LDC-IL — Linguistic Data Consortium for Indian Languages Focus on Pakistan Urdu Most work has been done on Urdu, prioritized at government institutions like the Center for Language Engineering at the University of Engineering and Technology in Lahore (CLE). Text corpora: http://cle.org.pk/clestore/index.htm (largest is a 1 million word Urdu corpus from the Urdu Digest. Work on Essential Urdu Linguistic Resources: http://www.cle.org.pk/eulr/ Tagset for Urdu corpus: http://cle.org.pk/Publication/papers/2014/The%20CLE%20Urdu%20POS%20Tagset.pdf Urdu OCR: http://cle.org.pk/clestore/urduocr.htm Sindhi Sindhi is the medium of education in some schools in Sindh Has more institutional backing and consequent research than other languages, especially Panjabi. Sindhi-English dictionary developed jointly by Jennifer Cole at the University of Illinois Urbana- Champaign and Sarmad Hussain at CLE (http://182.180.102.251:8081/sed1/homepage.aspx). -
Pakistani Migrants in the United States: the Interplay of Ethnic Identity and Ethnic Retention
American International Journal of Social Science Vol. 5, No. 4; August 2016 Pakistani Migrants in the United States: The Interplay of Ethnic Identity and Ethnic Retention Dr. Navid Ghani Five Towns College Professor of Sociology and History 305 N Service Rd, Dix Hills, NY 11746 United States Abstract This study is designed to explore the process of integration of first-generation Pakistani immigrants in the United States. There are two analytical themes that are the focus of this study. The first is the question of their integration into American society. What are the factors that have led to their maintenance of strong ethnic attachment, and their role in the shifting interplay of integration versus ethnic retention? The second issue is the factors that hinder their integration into American society, and how they perceive their cultural heritage versus mainstream norms and values. I rely on five benchmarks to assess first-generation immigrant integration: socioeconomic status, cultural heritage such as religious and social activities, perceptions, and experiences of discrimination, and gender relations. Based on ethnographic methods such as interviews and participant observations, one level of integration is explained. This level of integration is related to high ethnic identity and low integration, and is explained in terms of identity formation with strong ethnic characteristics but only a functional level of integration. Keywords: Immigrant, migration, ethnicity, assimilation, acculturation, socioeconomic status, gender, discrimination. 1. Introduction and Background My contribution to this discourse stems from my own background as a first-generation Pakistani immigrant, and now as a permanent resident of the United States. As such, I write from the perspective of an immigrant who has experienced the process of integration and adjustment of the Pakistani community in the United States. -
Population, Labour Force and Employment
Chapter 12 Population, Labour Force and Employment Persistent efforts to control population through However, this human resource is not being utilized family planning programs and improved education properly due to lack of human resource development facilities helped in controlling population growth programs. and resultantly, the world population growth slowed down. The comparison of population data published Population and Demographic Indicators by Population Reference Bureau shows that the The Crude Birth Rate (CBR) and Crude Death Rate world population growth rate reduced from 1.4% in (CDR) are main statistical values that can be utilized 2011 to 1% in 2012. Nevertheless the decreased to measure the trends in structure and growth of a growth rate added 71 million people in global population. Crude Birth Rate (CBR) or simply population, and the total world population crossed the figure of 7 billion at the end of June 2012. Each birth rate is the annual number of live births per year the number of human beings is on the rise, but one thousand persons. The Crude Birth Rate is the availability of natural resources, required to called "Crude" because it does not take into account sustain this population, to improve the quality of age or sex differences among the population. Crude human lives and to eliminate mass poverty remains Birth Rate of more than 30 per thousand are finite. considered high and rate of less than 18 per thousand are considered low. According to the World Resultantly, these resources are becoming scarce and Population Data Sheet, the Global Crude Birth Rate incapable of fulfilling ever increasing demand of in 2012 was 20 per thousand. -
L2/20-246 Teeth and Bellies: a Proposed Model for Encoding Book Pahlavi
L2/20-246 Teeth and bellies: a proposed model for encoding Book Pahlavi Roozbeh Pournader (WhatsApp) September 7, 2020 Background In Everson 2002, a proposal was made to encode a unified Avestan and Pahlavi script in the Unicode Standard. The proposal went through several iterations, eventually leading to a separate encoding of Avestan as proposed by Everson and Pournader 2007a, in which Pahlavi was considered non-unifiable with Avestan due to its cursive joining property. The non-cursive Inscriptional Pahlavi (Everson and Pournader 2007b) and the cursive Psalter Pahlavi (Everson and Pournader 2011) were later encoded too. But Book Pahlavi, despite several attempts (see the Book Pahlavi Topical Document list at https://unicode.org/L2/ topical/bookpahlavi/), remains unencoded. Everson 2002 is peculiar among earlier proposals by proposing six Pahlavi archigraphemes, including an ear, an elbow, and a belly. I remember from conversations with Michael Everson that he intended these to be used for cases when a scribe was just copying some text without understanding the underlying letters, considering the complexity of the script and the loss of some of its nuances to later scribes. They could also be used when modern scholars wanted to represent a manuscript as written, without needing to over-analyze potentially controversial readings. Meyers 2014 takes such a graphical model to an extreme, trying to encode pieces of the writing system, most of which have some correspondence to letters, but with occasional partial letters (e.g. PARTIAL SHIN and FINAL SADHE-PARTIAL PE). Unfortunately, their proposal rejects joining properties for Book Pahlavi and insists that “[t]he joining behaviour of the final stems of the characters in Book Pahlavi is more similar to cursive variants of Latin than to Arabic”. -
Pashto, Waneci, Ormuri. Sociolinguistic Survey of Northern
SOCIOLINGUISTIC SURVEY OF NORTHERN PAKISTAN VOLUME 4 PASHTO, WANECI, ORMURI Sociolinguistic Survey of Northern Pakistan Volume 1 Languages of Kohistan Volume 2 Languages of Northern Areas Volume 3 Hindko and Gujari Volume 4 Pashto, Waneci, Ormuri Volume 5 Languages of Chitral Series Editor Clare F. O’Leary, Ph.D. Sociolinguistic Survey of Northern Pakistan Volume 4 Pashto Waneci Ormuri Daniel G. Hallberg National Institute of Summer Institute Pakistani Studies of Quaid-i-Azam University Linguistics Copyright © 1992 NIPS and SIL Published by National Institute of Pakistan Studies, Quaid-i-Azam University, Islamabad, Pakistan and Summer Institute of Linguistics, West Eurasia Office Horsleys Green, High Wycombe, BUCKS HP14 3XL United Kingdom First published 1992 Reprinted 2004 ISBN 969-8023-14-3 Price, this volume: Rs.300/- Price, 5-volume set: Rs.1500/- To obtain copies of these volumes within Pakistan, contact: National Institute of Pakistan Studies Quaid-i-Azam University, Islamabad, Pakistan Phone: 92-51-2230791 Fax: 92-51-2230960 To obtain copies of these volumes outside of Pakistan, contact: International Academic Bookstore 7500 West Camp Wisdom Road Dallas, TX 75236, USA Phone: 1-972-708-7404 Fax: 1-972-708-7433 Internet: http://www.sil.org Email: [email protected] REFORMATTING FOR REPRINT BY R. CANDLIN. CONTENTS Preface.............................................................................................................vii Maps................................................................................................................ -
Role of the Muslim Anjumans for the Promotion of Education in the Colonial Punjab: a Historical Analysis
Bulletin of Education and Research December 2019, Vol. 41, No. 3 pp. 1-18 Role of the Muslim Anjumans for the Promotion of Education in the Colonial Punjab: A Historical Analysis Maqbool Ahmad Awan* __________________________________________________________________ Abstract This article highlightsthe vibrant role of the Muslim Anjumans in activating the educational revival in the colonial Punjab. The latter half of the 19th century, particularly the decade 1880- 1890, witnessed the birth of several Muslim Anjumans (societies) in the Punjab province. These were, in fact, a product of growing political consciousness and desire for collective efforts for the community-betterment. The Muslims, in other provinces, were lagging behind in education and other avenues of material prosperity. Their social conditions were also far from being satisfactory. Religion too had become a collection of rites and superstitions and an obstacle for their educational progress. During the same period, they also faced a grievous threat from the increasing proselytizing activities of the Christian Missionary societies and the growing economic prosperity of the Hindus who by virtue of their advancement in education, commerce and public services, were emerging as a dominant community in the province. The Anjumans rescued the Muslim youth from the verge of what then seemed imminent doom of ignorance by establishing schools and madrassas in almost all cities of the Punjab. The focus of these Anjumans was on both secular and religious education, which was advocated equally for both genders. Their trained scholars confronted the anti-Islamic activities of the Christian missionaries. The educational development of the Muslims in the Colonial Punjab owes much to these Anjumans. -
Languages of Kohistan. Sociolinguistic Survey of Northern
SOCIOLINGUISTIC SURVEY OF NORTHERN PAKISTAN VOLUME 1 LANGUAGES OF KOHISTAN Sociolinguistic Survey of Northern Pakistan Volume 1 Languages of Kohistan Volume 2 Languages of Northern Areas Volume 3 Hindko and Gujari Volume 4 Pashto, Waneci, Ormuri Volume 5 Languages of Chitral Series Editor Clare F. O’Leary, Ph.D. Sociolinguistic Survey of Northern Pakistan Volume 1 Languages of Kohistan Calvin R. Rensch Sandra J. Decker Daniel G. Hallberg National Institute of Summer Institute Pakistani Studies of Quaid-i-Azam University Linguistics Copyright © 1992 NIPS and SIL Published by National Institute of Pakistan Studies, Quaid-i-Azam University, Islamabad, Pakistan and Summer Institute of Linguistics, West Eurasia Office Horsleys Green, High Wycombe, BUCKS HP14 3XL United Kingdom First published 1992 Reprinted 2002 ISBN 969-8023-11-9 Price, this volume: Rs.300/- Price, 5-volume set: Rs.1500/- To obtain copies of these volumes within Pakistan, contact: National Institute of Pakistan Studies Quaid-i-Azam University, Islamabad, Pakistan Phone: 92-51-2230791 Fax: 92-51-2230960 To obtain copies of these volumes outside of Pakistan, contact: International Academic Bookstore 7500 West Camp Wisdom Road Dallas, TX 75236, USA Phone: 1-972-708-7404 Fax: 1-972-708-7433 Internet: http://www.sil.org Email: [email protected] REFORMATTING FOR REPRINT BY R. CANDLIN. CONTENTS Preface............................................................................................................viii Maps................................................................................................................. -
Folkloristic Understandings of Nation-Building in Pakistan
Folkloristic Understandings of Nation-Building in Pakistan Ideas, Issues and Questions of Nation-Building in Pakistan Research Cooperation between the Hanns Seidel Foundation Pakistan and the Quaid-i-Azam University Islamabad Islamabad, 2020 Folkloristic Understandings of Nation-Building in Pakistan Edited by Sarah Holz Ideas, Issues and Questions of Nation-Building in Pakistan Research Cooperation between Hanns Seidel Foundation, Islamabad Office and Quaid-i-Azam University Islamabad, Pakistan Acknowledgements Thank you to Hanns Seidel Foundation, Islamabad Office for the generous and continued support for empirical research in Pakistan, in particular: Kristóf Duwaerts, Omer Ali, Sumaira Ihsan, Aisha Farzana and Ahsen Masood. This volume would not have been possible without the hard work and dedication of a large number of people. Sara Gurchani, who worked as the research assistant of the collaboration in 2018 and 2019, provided invaluable administrative, organisational and editorial support for this endeavour. A big thank you the HSF grant holders of 2018 who were not only doing their own work but who were also actively engaged in the organisation of the international workshop and the lecture series: Ibrahim Ahmed, Fateh Ali, Babar Rahman and in particular Adil Pasha and Mohsinullah. Thank you to all the support staff who were working behind the scenes to ensure a smooth functioning of all events. A special thanks goes to Shafaq Shafique and Muhammad Latif sahib who handled most of the coordination. Thank you, Usman Shah for the copy editing. The research collaboration would not be possible without the work of the QAU faculty members in the year 2018, Dr. Saadia Abid, Dr. -
Persian Language
v course reference persian language r e f e r e n c e زبان فارسی The Persian Language 1 PERSIAN OR FARSI? In the U.S., the official language of Iran is language courses in “Farsi,” universities and sometimes called “Farsi,” but sometimes it is scholars prefer the historically correct term called “Persian.” Whereas U.S. government “Persian.” The term “Farsi” is better reserved organizations have traditionally developed for the dialect of Persian used in Iran. 2 course reference AN INDO-EUROPEAN LANGUAGE Persian is a member of the Indo-European Persian has three major dialects: Farsi, language family, which is the largest in the the official language of Iran, spoken by 50 world. percent of the population; Dari, spoken mostly in Afghanistan, and Tajiki, spoken Persian falls under the Indo-Iranian branch, in Tajikistan. Other languages in Iran are comprising languages spoken primarily Arabic, New Aramaic, Armenian, Georgian in Afghanistan, Iran, Pakistan, India, and Turkic dialects such as Azerbaidjani, Bangladesh, areas of Turkey and Iraq, and Khalaj, Turkemenian and Qashqa”i. some of the former Soviet Union. INDO-EUROPEAN LANGUAGES GERMANIC INDO-IRANIAN HELLENIC CELTIC ITALIC BALTO-SLAVIC Polish Russin Indic Greek Serbo-Crotin North Germnic Ltin Irnin Mnx Irish Welsh Old Norse Swedish Scottish Avestn Old Persin Icelndic Norwegin French Spnish Portuguese Itlin Middle Persin West Germnic Snskrit Rumnin Ctln Frsi Kurdish Bengli Urdu Gujrti Hindi Old High Germn Old Dutch Anglo-Frisin Middle High Germn Middle Dutch Old Frisin Old English Germn Flemish Dutch Afrikns Frisin Middle English Yiddish Modern English vi v Persian Language 3 ALPHABET: FROM PAHLAVI TO ARABIC History tells us that Iranians used the Pahlavi Unlike English, Persian is written from right writing system prior to the 7th Century. -
Considerations About Semitic Etyma in De Vaan's Latin Etymological Dictionary
applyparastyle “fig//caption/p[1]” parastyle “FigCapt” Philology, vol. 4/2018/2019, pp. 35–156 © 2019 Ephraim Nissan - DOI https://doi.org/10.3726/PHIL042019.2 2019 Considerations about Semitic Etyma in de Vaan’s Latin Etymological Dictionary: Terms for Plants, 4 Domestic Animals, Tools or Vessels Ephraim Nissan 00 35 Abstract In this long study, our point of departure is particular entries in Michiel de Vaan’s Latin Etymological Dictionary (2008). We are interested in possibly Semitic etyma. Among 156 the other things, we consider controversies not just concerning individual etymologies, but also concerning approaches. We provide a detailed discussion of names for plants, but we also consider names for domestic animals. 2018/2019 Keywords Latin etymologies, Historical linguistics, Semitic loanwords in antiquity, Botany, Zoonyms, Controversies. Contents Considerations about Semitic Etyma in de Vaan’s 1. Introduction Latin Etymological Dictionary: Terms for Plants, Domestic Animals, Tools or Vessels 35 In his article “Il problema dei semitismi antichi nel latino”, Paolo Martino Ephraim Nissan 35 (1993) at the very beginning lamented the neglect of Semitic etymolo- gies for Archaic and Classical Latin; as opposed to survivals from a sub- strate and to terms of Etruscan, Italic, Greek, Celtic origin, when it comes to loanwords of certain direct Semitic origin in Latin, Martino remarked, such loanwords have been only admitted in a surprisingly exiguous num- ber of cases, when they were not met with outright rejection, as though they merely were fanciful constructs:1 In seguito alle recenti acquisizioni archeologiche ed epigrafiche che hanno documen- tato una densità finora insospettata di contatti tra Semiti (soprattutto Fenici, Aramei e 1 If one thinks what one could come across in the 1890s (see below), fanciful constructs were not a rarity. -
Shahmukhi to Gurmukhi Transliteration System: a Corpus Based Approach
Shahmukhi to Gurmukhi Transliteration System: A Corpus based Approach Tejinder Singh Saini1 and Gurpreet Singh Lehal2 1 Advanced Centre for Technical Development of Punjabi Language, Literature & Culture, Punjabi University, Patiala 147 002, Punjab, India [email protected] http://www.advancedcentrepunjabi.org 2 Department of Computer Science, Punjabi University, Patiala 147 002, Punjab, India [email protected] Abstract. This research paper describes a corpus based transliteration system for Punjabi language. The existence of two scripts for Punjabi language has created a script barrier between the Punjabi literature written in India and in Pakistan. This research project has developed a new system for the first time of its kind for Shahmukhi script of Punjabi language. The proposed system for Shahmukhi to Gurmukhi transliteration has been implemented with various research techniques based on language corpus. The corpus analysis program has been run on both Shahmukhi and Gurmukhi corpora for generating statistical data for different types like character, word and n-gram frequencies. This statistical analysis is used in different phases of transliteration. Potentially, all members of the substantial Punjabi community will benefit vastly from this transliteration system. 1 Introduction One of the great challenges before Information Technology is to overcome language barriers dividing the mankind so that everyone can communicate with everyone else on the planet in real time. South Asia is one of those unique parts of the world where a single language is written in different scripts. This is the case, for example, with Punjabi language spoken by tens of millions of people but written in Indian East Punjab (20 million) in Gurmukhi script (a left to right script based on Devanagari) and in Pakistani West Punjab (80 million), written in Shahmukhi script (a right to left script based on Arabic), and by a growing number of Punjabis (2 million) in the EU and the US in the Roman script.