Punjabi (India and Pakistan) – Language Snapshot
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Named Entity Recognition System for Kashmiri Language Iamir Bashir Malik, Iikhushboo Bansal Istudent, M.Tech, Iiassistant Professor I,Iidept
ISSN : 2347 - 8446 (Online) International Journal of Advanced Research in ISSN : 2347 - 9817 (Print) Vol. 3, Issue 2 (Apr. - Jun. 2015) Computer Science & Technology (IJARCST 2015) Named Entity Recognition System for Kashmiri Language IAmir Bashir Malik, IIKhushboo Bansal IStudent, M.Tech, IIAssistant Professor I,IIDept. of CSE, Desh Bhagat University, Mandi Gobindgarh, Punjab, India Abstract Named Entity Recognition (NER) is a task which helps in finding out Persons name, Location names, Organization names, Place, Date, Time etc. and classifies them into predefined different categories. Named Entity Recognition plays a major role in various Natural Language Processing (NLP) fields like Information Extraction, Machine Translations and Question Answering. Unfortunately Kashmiri language which is a scarce resourced language has not been taken into account. This paper describes the problems of NER in the context of Kashmiri Language and provides relevant solutions. Keywords Named Entity, Named Entity Recognition, Natural language process, Kashmiri language text. I. Introduction is as follows. The term Named Entity (NE) was evolved during the sixth (1) “Micromax”represent anorganization and “ Dec19, 2014” Message Understanding Conference (MUC -6, 1995).Named represent dateand “smartphone” represent entity and “had Entity Recognition (NER) is also knows as entity identification is a launched its on” represent others. subtask of information extraction (IE). NER extracts and classifies The named entities may be of any type such as given below -
(And Potential) Language and Linguistic Resources on South Asian Languages
CoRSAL Symposium, University of North Texas, November 17, 2017 Existing (and Potential) Language and Linguistic Resources on South Asian Languages Elena Bashir, The University of Chicago Resources or published lists outside of South Asia Digital Dictionaries of South Asia in Digital South Asia Library (dsal), at the University of Chicago. http://dsal.uchicago.edu/dictionaries/ . Some, mostly older, not under copyright dictionaries. No corpora. Digital Media Archive at University of Chicago https://dma.uchicago.edu/about/about-digital-media-archive Hock & Bashir (eds.) 2016 appendix. Lists 9 electronic corpora, 6 of which are on Sanskrit. The 3 non-Sanskrit entries are: (1) the EMILLE corpus, (2) the Nepali national corpus, and (3) the LDC-IL — Linguistic Data Consortium for Indian Languages Focus on Pakistan Urdu Most work has been done on Urdu, prioritized at government institutions like the Center for Language Engineering at the University of Engineering and Technology in Lahore (CLE). Text corpora: http://cle.org.pk/clestore/index.htm (largest is a 1 million word Urdu corpus from the Urdu Digest. Work on Essential Urdu Linguistic Resources: http://www.cle.org.pk/eulr/ Tagset for Urdu corpus: http://cle.org.pk/Publication/papers/2014/The%20CLE%20Urdu%20POS%20Tagset.pdf Urdu OCR: http://cle.org.pk/clestore/urduocr.htm Sindhi Sindhi is the medium of education in some schools in Sindh Has more institutional backing and consequent research than other languages, especially Panjabi. Sindhi-English dictionary developed jointly by Jennifer Cole at the University of Illinois Urbana- Champaign and Sarmad Hussain at CLE (http://182.180.102.251:8081/sed1/homepage.aspx). -
Punjabi Language Characteristics and Role of Thesaurus in Natural
Dharam Veer Sharma et al, / (IJCSIT) International Journal of Computer Science and Information Technologies, Vol. 2 (4) , 2011, 1434-1437 Punjabi Language Characteristics and Role of Thesaurus in Natural Language processing Dharam Veer Sharma1 Aarti2 Department of Computer Science, Punjabi University, Patiala, INDIA Abstract---This paper describes an attempt to explain various 2.2 Characteristics of the Punjabi Language characteristics of Punjabi language. The origin and symbols of Modern Punjabi is a very tonal language, making use of Punjabi language are presents in this paper. Various relations various tones to differentiate words that would otherwise be exist in thesaurus and role of thesaurus in natural language identical. Three primary tones can be identified: high-rising- processing also has been elaborated in this paper. falling, mid-rising-falling, and low rising. Following are characteristics of Punjabi language [3] [4]. Keywords---Thesaurus, Punjabi, characteristics, relations 2.2.1 Morphological characteristics Morphologically, Punjabi is an agglutinative language. That 1. INTRODUCTION is to say, grammatical information is encoded by way of A thesaurus links semantically related words and helps in the affixation (largely suffixation), rather than via independent selection of most appropriate words for given contexts [1]. A freestanding morphemes. Punjabi nouns inflect for number thesaurus contains synonyms (words which have basically the (singular, plural), gender (masculine, feminine), and same meaning) and as such is an important tool for many declension class (absolute, oblique). The absolute form of a applications in NLP too. The purpose is twofold: For writers, noun is its default or uninflected form. This form is used as it is a tool - one with words grouped and classified to help the object of the verb, typically when inanimate, as well as in select the best word to convey a specific nuance of meaning, measure or temporal (point of time) constructions. -
Online Guides to Indian Languages with Particular Reference to Hindi, Punjabi, and Sanskrit
University of Nebraska - Lincoln DigitalCommons@University of Nebraska - Lincoln Library Philosophy and Practice (e-journal) Libraries at University of Nebraska-Lincoln 5-2012 Online Guides to Indian Languages with Particular Reference to Hindi, Punjabi, and Sanskrit Preeti Mahajan Panjab University, [email protected] Neeraj Kumar Singh Panjab University, [email protected] Follow this and additional works at: https://digitalcommons.unl.edu/libphilprac Part of the Library and Information Science Commons Mahajan, Preeti and Singh, Neeraj Kumar, "Online Guides to Indian Languages with Particular Reference to Hindi, Punjabi, and Sanskrit" (2012). Library Philosophy and Practice (e-journal). 749. https://digitalcommons.unl.edu/libphilprac/749 http://unllib.unl.edu/LPP/ Library Philosophy and Practice 2012 ISSN 1522-0222 Online Guides to Indian Languages with Particular Reference to Hindi, Punjabi, and Sanskrit Prof. Preeti Mahajan Department of Library and Information Science Panjab University Chandigarh, India Neeraj Kumar Singh Assistant Librarian A C Joshi Library Panjab University Chandigarh, India Introduction India is a multilingual country and the second most populated country on earth There are a quite a number of languages spoken in India. Some of these languages are accepted nationally while others are accepted as dialects of that particular region. The Indian languages belong to four language families namely Indo-European, Dravidian, Austroasiatic (Austric) and Sino-Tibetan. Majority of India's population are using Indo-European and Dravidian languages. The former are spoken mainly in northern and central regions and the latter in southern India. India has 22 officially recognised languages. But around 33 different languages and 2000 dialects have been identified in India. -
Volume 33 (1) November 2018
A Peer-reviewed Journal of Linguistic Society of Nepal Nepalese Linguistics Volume 33 (1) November 2018 Editor-in-Chief Kamal Poudel Editors Ram Raj Lohani Dr. Tikaram Poudel Office bearers for 2018-2020 President Bhim Narayan Regmi Vice President Krishna Prasad Chalise General Secretary Dr. Karnakhar Khatiwada Secretary (Office) Dr. Ambika Regmi Secretary (General) Dr. Tara Mani Rai Treasurer Ekku Maya Pun Member Dr. Narayan Prasad Sharma Member Dr. Ramesh Kumar Limbu Member Dr. Laxmi Raj Pandit Member Pratigya Regmi Member Shankar Subedi Editorial Board Editor-in-Chief Kamal Poudel Editors Ram Raj Lohani Dr. Tikaram Poudel Nepalese Linguistics is a peer-reviewed journal published by Linguistic Society of Nepal (LSN). LSN publishes articles related to the scientific study of languages, especially from Nepal. The authors are solely responsible for the views expressed in their articles. Published by: Linguistic Society of Nepal Kirtipur, Kathmandu Nepal Copies: 300 © Linguistic Society of Nepal ISSN 0259-1006 Price: NC 400/- (Nepal) IC 350/- (India) US$ 10/- The publication of this volume was supported by Nepal Academy. Editorial Linguistic Society of Nepal, since its inception in 1979, has been involved in preserving and promoting the languages of the Himalayan region through different activities such as organizing conferences, workshops and publications. As all our esteemed readers know that the journal Nepalese Linguistics is one of the major initiatives of the Society. The Board of Editors feels immense pleasure to bring out Volume 33.1 of Nepalese Linguistics in the eve of the 39th International Annual Conference of Linguistic Society of Nepal. The Society decided to peer-review the articles since this issue in order to ensure the quality of the journal. -
New Sufi Sounds of Pakistan: Arif Lohar with Arooj Aftab
Asia Society and CaravanSerai Present New Sufi Sounds of Pakistan: Arif Lohar with Arooj Aftab Saturday, April 28, 2012, 8:00 P.M. Asia Society 725 Park Avenue at 70th Street New York City This program is 2 hours with no intermission New Sufi Sounds of Pakistan Performers Arooj Afab lead vocals Bhrigu Sahni acoustic guitar Jorn Bielfeldt percussion Arif Lohar lead vocals/chimta Qamar Abbas dholak Waqas Ali guitar Allah Ditta alghoza Shehzad Azim Ul Hassan dhol Shahid Kamal keyboard Nadeem Ul Hassan percussion/vocals Fozia vocals AROOJ AFTAB Arooj Aftab is a rising Pakistani-American vocalist who interprets mystcal Sufi poems and contemporizes the semi-classical musical traditions of Pakistan and India. Her music is reflective of thumri, a secular South Asian musical style colored by intricate ornamentation and romantic lyrics of love, loss, and longing. Arooj Aftab restyles the traditional music of her heritage for a sound that is minimalistic, contemplative, and delicate—a sound that she calls ―indigenous soul.‖ Accompanying her on guitar is Boston-based Bhrigu Sahni, a frequent collaborator, originally from India, and Jorn Bielfeldt on percussion. Arooj Aftab: vocals Bhrigu Sahni: guitar Jorn Bielfeldt: percussion Semi Classical Music This genre, classified in Pakistan and North India as light classical vocal music. Thumri and ghazal forms are at the core of the genre. Its primary theme is romantic — persuasive wooing, painful jealousy aroused by a philandering lover, pangs of separation, the ache of remembered pleasures, sweet anticipation of reunion, joyful union. Rooted in a sophisticated civilization that drew no line between eroticism and spirituality, this genre asserts a strong feminine identity in folk poetry laden with unabashed sensuality. -
Folkloristic Understandings of Nation-Building in Pakistan
Folkloristic Understandings of Nation-Building in Pakistan Ideas, Issues and Questions of Nation-Building in Pakistan Research Cooperation between the Hanns Seidel Foundation Pakistan and the Quaid-i-Azam University Islamabad Islamabad, 2020 Folkloristic Understandings of Nation-Building in Pakistan Edited by Sarah Holz Ideas, Issues and Questions of Nation-Building in Pakistan Research Cooperation between Hanns Seidel Foundation, Islamabad Office and Quaid-i-Azam University Islamabad, Pakistan Acknowledgements Thank you to Hanns Seidel Foundation, Islamabad Office for the generous and continued support for empirical research in Pakistan, in particular: Kristóf Duwaerts, Omer Ali, Sumaira Ihsan, Aisha Farzana and Ahsen Masood. This volume would not have been possible without the hard work and dedication of a large number of people. Sara Gurchani, who worked as the research assistant of the collaboration in 2018 and 2019, provided invaluable administrative, organisational and editorial support for this endeavour. A big thank you the HSF grant holders of 2018 who were not only doing their own work but who were also actively engaged in the organisation of the international workshop and the lecture series: Ibrahim Ahmed, Fateh Ali, Babar Rahman and in particular Adil Pasha and Mohsinullah. Thank you to all the support staff who were working behind the scenes to ensure a smooth functioning of all events. A special thanks goes to Shafaq Shafique and Muhammad Latif sahib who handled most of the coordination. Thank you, Usman Shah for the copy editing. The research collaboration would not be possible without the work of the QAU faculty members in the year 2018, Dr. Saadia Abid, Dr. -
Internal Classification of Indo-European Languages: Survey
Václav Blažek (Masaryk University of Brno, Czech Republic) On the internal classification of Indo-European languages: Survey The purpose of the present study is to confront most representative models of the internal classification of Indo-European languages and their daughter branches. 0. Indo-European 0.1. In the 19th century the tree-diagram of A. Schleicher (1860) was very popular: Germanic Lithuanian Slavo-Lithuaian Slavic Celtic Indo-European Italo-Celtic Italic Graeco-Italo- -Celtic Albanian Aryo-Graeco- Greek Italo-Celtic Iranian Aryan Indo-Aryan After the discovery of the Indo-European affiliation of the Tocharian A & B languages and the languages of ancient Asia Minor, it is necessary to take them in account. The models of the recent time accept the Anatolian vs. non-Anatolian (‘Indo-European’ in the narrower sense) dichotomy, which was first formulated by E. Sturtevant (1942). Naturally, it is difficult to include the relic languages into the model of any classification, if they are known only from several inscriptions, glosses or even only from proper names. That is why there are so big differences in classification between these scantily recorded languages. For this reason some scholars omit them at all. 0.2. Gamkrelidze & Ivanov (1984, 415) developed the traditional ideas: Greek Armenian Indo- Iranian Balto- -Slavic Germanic Italic Celtic Tocharian Anatolian 0.3. Vladimir Georgiev (1981, 363) included in his Indo-European classification some of the relic languages, plus the languages with a doubtful IE affiliation at all: Tocharian Northern Balto-Slavic Germanic Celtic Ligurian Italic & Venetic Western Illyrian Messapic Siculian Greek & Macedonian Indo-European Central Phrygian Armenian Daco-Mysian & Albanian Eastern Indo-Iranian Thracian Southern = Aegean Pelasgian Palaic Southeast = Hittite; Lydian; Etruscan-Rhaetic; Elymian = Anatolian Luwian; Lycian; Carian; Eteocretan 0.4. -
RJSSER ISSN 2707-9015 (ISSN-L) Research Journal of Social DOI: Sciences & Economics Review ______
Research Journal of Social Sciences & Economics Review Vol. 1, Issue 4, 2020 (October – December) ISSN 2707-9023 (online), ISSN 2707-9015 (Print) RJSSER ISSN 2707-9015 (ISSN-L) Research Journal of Social DOI: https://doi.org/10.36902/rjsser-vol1-iss4-2020(411-417) Sciences & Economics Review ____________________________________________________________________________________ Interplay between Socio-Economic Factors and Language Shift: A Study of Saraiki Language in D.G. Khan * Ghulam Mujtaba Yasir, PhD Scholar (Corresponding Author) ** Prof. Dr. Mamuna Ghani, Ex-Dean __________________________________________________________________________________ Abstract Pakistan is among those very few multicultural and multilingual countries which are celebrated for their ethnic as well as linguistic diversity. From the coastal areas of Karachi to the mountainous terrain of Gilgit Baltistan six major and more than 70 minor languages are spoken in various parts of Pakistan. Urdu relishes the position of National Language whereas the official language of the country is English and is mostly used by the power-wielding strata of the country namely the government functionaries, corporate sector, and education sector. The purpose of the study was to find out the interplay between socioeconomic factors and the phenomenon of language shift. The present research is descriptive in which 300 Urdu speaking children of Saraiki families of D.G. Khan District were selected for data collection. A multiple-choice questionnaire was devised and administered to collect the required data. The results insinuated a strong interplay between socio- economic factors and the language shift. Keywords: Linguistic Diversity, Multilingualism, Socio-Economic Factors, Language Shift, Saraiki Language Introduction Language shift, also termed as language transfer, language replacement, and language assimilation, is such a situation where the members of one speech community functionally abandon one language and shift to another socially prestigious language, not necessarily by conscious choice (Garret 2006). -
THE PHONOLOGY of the VERBAL PHRASE in HINDKO Submitted By
THE PHONOLOGY OF THE VERBAL PHRASE IN HINDKO submitted by ELAHI BAKHSH AKHTAR AWAN FOR THE AWARD OF A DEGREE OF Ph.D. at the UNIVERSITY OF LONDON 1974 ProQuest Number: 10672820 All rights reserved INFORMATION TO ALL USERS The quality of this reproduction is dependent upon the quality of the copy submitted. In the unlikely event that the author did not send a com plete manuscript and there are missing pages, these will be noted. Also, if material had to be removed, a note will indicate the deletion. uest ProQuest 10672820 Published by ProQuest LLC(2017). Copyright of the Dissertation is held by the Author. All rights reserved. This work is protected against unauthorized copying under Title 17, United States C ode Microform Edition © ProQuest LLC. ProQuest LLC. 789 East Eisenhower Parkway P.O. Box 1346 Ann Arbor, Ml 48106- 1346 acknowledgment I wish to express my profound sense of gratitude and indebtedness to my supervisor Dr. R.K. Sprigg. But for his advice, suggestions, constructive criticism and above all unfailing kind ness at all times this thesis could not have been completed. In addition, I must thank my wife Riaz Begum for her help and encouragement during the many months of preparation. CONTENTS Introduction i Chapter I Value of symbols used in this thesis 1 1 .00 Introductory 1 1.10 I.P.A. Symbols 1 1.11 Other symbols 2 Chapter II The Verbal Phrase and the Verbal Word 6 2.00 Introductory 6 2.01 The Sentence 6 2.02 The Phrase 6 2.10 The Verbal Phra.se 7 2.11 Delimiting the Verbal Phrase 7 2.12 Place of the Verbal Phrase 7 2.121 -
Himalayan Languages and Linguistics: Studies in Phonology, Semantics, Morphology and Syntax by Mark Turin and Bettina Zeisler, Eds.; Reviewed by Elena Bashir
HIMALAYA, the Journal of the Association for Nepal and Himalayan Studies Volume 31 Number 1 Article 21 8-1-2012 Himalayan Languages and Linguistics: Studies in Phonology, Semantics, Morphology and Syntax by Mark Turin and Bettina Zeisler, Eds.; Reviewed by Elena Bashir Elena Bashir University of Chicago Follow this and additional works at: https://digitalcommons.macalester.edu/himalaya Recommended Citation Bashir, Elena. 2012. Himalayan Languages and Linguistics: Studies in Phonology, Semantics, Morphology and Syntax by Mark Turin and Bettina Zeisler, Eds.; Reviewed by Elena Bashir. HIMALAYA 31(1). Available at: https://digitalcommons.macalester.edu/himalaya/vol31/iss1/21 This work is licensed under a Creative Commons Attribution 3.0 License. This Review is brought to you for free and open access by the DigitalCommons@Macalester College at DigitalCommons@Macalester College. It has been accepted for inclusion in HIMALAYA, the Journal of the Association for Nepal and Himalayan Studies by an authorized administrator of DigitalCommons@Macalester College. For more information, please contact [email protected]. In Part II, Helen Plaisier’s article, “A key to four HIMALAYAN LANGUAGES transcription systems of Lepcha” (13 pp.), compares transcription systems proposed by four scholars, including AND LINGUISTICS: STUDIES herself, and argues that her own system “offers the user the most accurate way of transcribing Lepcha. The transliteration IN PHONOLOGY, SEMANTICS, is consistent with and faithful to the way text is written in the traditional Lepcha orthography, so it remains possible at all times to derive the original spelling from the transliteration” MORPHOLOGY AND SYNTAX (p. 51). Y ARK URIN AND ETTINA EISLER Hiroyuki Suzuki’s paper is on “Dialectal particularities B M T B Z , of Sogpho Tibetan - an introduction to the “Twenty-four EDS. -
English Language in Pakistan: Expansion and Resulting Implications
Journal of Education & Social Sciences Vol. 5(1): 52-67, 2017 DOI: 10.20547/jess0421705104 English Language in Pakistan: Expansion and Resulting Implications Syeda Bushra Zaidi ∗ Sajida Zaki y Abstract: From sociolinguistic or anthropological perspective, Pakistan is classified as a multilingual context with most people speaking one native or regional language alongside Urdu which is the national language. With respect to English, Pakistan is a second language context which implies that the language is institutionalized and enjoying the privileged status of being the official language. This thematic paper is an attempt to review the arrival and augmentation of English language in Pakistan both before and after its creation. The chronological description is carried out in order to identify the consequences of this spread and to link them with possible future developments and implications. Some key themes covered in the paper include a brief historical overview of English language from how it was introduced to the manner in which it has developed over the years especially in relation to various language and educational policies. Based on the critical review of the literature presented in the paper there seems to be no threat to English in Pakistan a prediction true for the language globally as well. The status of English, as it globally elevates, will continue spreading and prospering in Pakistan as well enriching the Pakistani English variety that has already born and is being studied and codified. However, this process may take long due to the instability of governing policies and law-makers. A serious concern for researchers and people in general, however, will continue being the endangered and indigenous varieties that may continue being under considerable pressure from the prestigious varieties.