A Survey of Languages Spoken in a Sample of PRP Target Schools
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
(And Potential) Language and Linguistic Resources on South Asian Languages
CoRSAL Symposium, University of North Texas, November 17, 2017 Existing (and Potential) Language and Linguistic Resources on South Asian Languages Elena Bashir, The University of Chicago Resources or published lists outside of South Asia Digital Dictionaries of South Asia in Digital South Asia Library (dsal), at the University of Chicago. http://dsal.uchicago.edu/dictionaries/ . Some, mostly older, not under copyright dictionaries. No corpora. Digital Media Archive at University of Chicago https://dma.uchicago.edu/about/about-digital-media-archive Hock & Bashir (eds.) 2016 appendix. Lists 9 electronic corpora, 6 of which are on Sanskrit. The 3 non-Sanskrit entries are: (1) the EMILLE corpus, (2) the Nepali national corpus, and (3) the LDC-IL — Linguistic Data Consortium for Indian Languages Focus on Pakistan Urdu Most work has been done on Urdu, prioritized at government institutions like the Center for Language Engineering at the University of Engineering and Technology in Lahore (CLE). Text corpora: http://cle.org.pk/clestore/index.htm (largest is a 1 million word Urdu corpus from the Urdu Digest. Work on Essential Urdu Linguistic Resources: http://www.cle.org.pk/eulr/ Tagset for Urdu corpus: http://cle.org.pk/Publication/papers/2014/The%20CLE%20Urdu%20POS%20Tagset.pdf Urdu OCR: http://cle.org.pk/clestore/urduocr.htm Sindhi Sindhi is the medium of education in some schools in Sindh Has more institutional backing and consequent research than other languages, especially Panjabi. Sindhi-English dictionary developed jointly by Jennifer Cole at the University of Illinois Urbana- Champaign and Sarmad Hussain at CLE (http://182.180.102.251:8081/sed1/homepage.aspx). -
Pashto, Waneci, Ormuri. Sociolinguistic Survey of Northern
SOCIOLINGUISTIC SURVEY OF NORTHERN PAKISTAN VOLUME 4 PASHTO, WANECI, ORMURI Sociolinguistic Survey of Northern Pakistan Volume 1 Languages of Kohistan Volume 2 Languages of Northern Areas Volume 3 Hindko and Gujari Volume 4 Pashto, Waneci, Ormuri Volume 5 Languages of Chitral Series Editor Clare F. O’Leary, Ph.D. Sociolinguistic Survey of Northern Pakistan Volume 4 Pashto Waneci Ormuri Daniel G. Hallberg National Institute of Summer Institute Pakistani Studies of Quaid-i-Azam University Linguistics Copyright © 1992 NIPS and SIL Published by National Institute of Pakistan Studies, Quaid-i-Azam University, Islamabad, Pakistan and Summer Institute of Linguistics, West Eurasia Office Horsleys Green, High Wycombe, BUCKS HP14 3XL United Kingdom First published 1992 Reprinted 2004 ISBN 969-8023-14-3 Price, this volume: Rs.300/- Price, 5-volume set: Rs.1500/- To obtain copies of these volumes within Pakistan, contact: National Institute of Pakistan Studies Quaid-i-Azam University, Islamabad, Pakistan Phone: 92-51-2230791 Fax: 92-51-2230960 To obtain copies of these volumes outside of Pakistan, contact: International Academic Bookstore 7500 West Camp Wisdom Road Dallas, TX 75236, USA Phone: 1-972-708-7404 Fax: 1-972-708-7433 Internet: http://www.sil.org Email: [email protected] REFORMATTING FOR REPRINT BY R. CANDLIN. CONTENTS Preface.............................................................................................................vii Maps................................................................................................................ -
ARTICLE Development of a Gold-Standard Pashto Dataset and a Segmentation App Yan Han and Marek Rychlik
ARTICLE Development of a Gold-standard Pashto Dataset and a Segmentation App Yan Han and Marek Rychlik ABSTRACT The article aims to introduce a gold-standard Pashto dataset and a segmentation app. The Pashto dataset consists of 300 line images and corresponding Pashto text from three selected books. A line image is simply an image consisting of one text line from a scanned page. To our knowledge, this is one of the first open access datasets which directly maps line images to their corresponding text in the Pashto language. We also introduce the development of a segmentation app using textbox expanding algorithms, a different approach to OCR segmentation. The authors discuss the steps to build a Pashto dataset and develop our unique approach to segmentation. The article starts with the nature of the Pashto alphabet and its unique diacritics which require special considerations for segmentation. Needs for datasets and a few available Pashto datasets are reviewed. Criteria of selection of data sources are discussed and three books were selected by our language specialist from the Afghan Digital Repository. The authors review previous segmentation methods and introduce a new approach to segmentation for Pashto content. The segmentation app and results are discussed to show readers how to adjust variables for different books. Our unique segmentation approach uses an expanding textbox method which performs very well given the nature of the Pashto scripts. The app can also be used for Persian and other languages using the Arabic writing system. The dataset can be used for OCR training, OCR testing, and machine learning applications related to content in Pashto. -
Finite State Morphology and Sindhi Noun Inflections
PACLIC 24 Proceedings 669 Finite State Morphology and Sindhi Noun Inflections Mutee U Rahman, Mohammad Iqbal Bhatti Department of Computer Science, Isra University, Hala Road, Hyderabad Sindh 71000, Pakistan [email protected], [email protected] Abstract. Sindhi is a morphologically rich language. Morphological construction include inflections and derivations. Sindhi morphology becomes more complex due to primary and secondary word types which are further divided into simple, complex and compound words. Sindhi nouns are marked by number gender and case. Finite state transducers (FSTs) quite reasonably represent the inflectional morphology of Sindhi nouns. The paper investigates Sindhi noun inflection rules and defines equivalent computational rules to be used by FSTs; corresponding FSTs are also given. Keywords. Sindhi, morphology, noun inflections, two-level morphology, finite state morphology. 1 Introduction Morphology deals with word formation rules in a language. Word structures of a language are defined by its morphological constructions. Morphology defines that how smaller meaning bearing units called morphemes are combined to make larger meaning bearing units of a language called words. Morphology also deals with word formation by variations in already existing words. The morphological changes are mostly done by suffix addition, subtraction and replacement phenomenon. In few words morphology can be defined as syntax of word formation. Models for computational analysis of morphology always remained challenge for computational linguists until early 1980’s when 4Ks* discovered the two level morphology (Kaplan, R. M. and M. Kay. 1981) the first general model for morphologically complex languages. This two level morphology represents a word at lexical level and surface level. Morphotactics or morpheme ordering model is used in between these two levels to incorporate morphological changes. -
THE PHONOLOGY of the VERBAL PHRASE in HINDKO Submitted By
THE PHONOLOGY OF THE VERBAL PHRASE IN HINDKO submitted by ELAHI BAKHSH AKHTAR AWAN FOR THE AWARD OF A DEGREE OF Ph.D. at the UNIVERSITY OF LONDON 1974 ProQuest Number: 10672820 All rights reserved INFORMATION TO ALL USERS The quality of this reproduction is dependent upon the quality of the copy submitted. In the unlikely event that the author did not send a com plete manuscript and there are missing pages, these will be noted. Also, if material had to be removed, a note will indicate the deletion. uest ProQuest 10672820 Published by ProQuest LLC(2017). Copyright of the Dissertation is held by the Author. All rights reserved. This work is protected against unauthorized copying under Title 17, United States C ode Microform Edition © ProQuest LLC. ProQuest LLC. 789 East Eisenhower Parkway P.O. Box 1346 Ann Arbor, Ml 48106- 1346 acknowledgment I wish to express my profound sense of gratitude and indebtedness to my supervisor Dr. R.K. Sprigg. But for his advice, suggestions, constructive criticism and above all unfailing kind ness at all times this thesis could not have been completed. In addition, I must thank my wife Riaz Begum for her help and encouragement during the many months of preparation. CONTENTS Introduction i Chapter I Value of symbols used in this thesis 1 1 .00 Introductory 1 1.10 I.P.A. Symbols 1 1.11 Other symbols 2 Chapter II The Verbal Phrase and the Verbal Word 6 2.00 Introductory 6 2.01 The Sentence 6 2.02 The Phrase 6 2.10 The Verbal Phra.se 7 2.11 Delimiting the Verbal Phrase 7 2.12 Place of the Verbal Phrase 7 2.121 -
Saraiki Suba Movement in the Punjab: Viability in Focus
Pakistan Perspectives Vol. 20, No.2, July-December 2015 Saraiki Suba Movement in the Punjab: Viability in Focus Akhtar Hussain Sandhu* Abstract The pre-partition politics which revolved around religion was shifted to the language and culture in the post-partition era. After independence many parties emerged and realigned themselves on the basis of language and culture. This research is an effort to analyze the viability of the demand for Saraiki suba from different perspectives. It argues that the Saraiki suba movement has neither sound reasons nor justifiable political strength. The Saraiki suba‟s leadership which never won elections throughout the political history of the region, claims areas of Punjab, KPK and Sindh. The demand to create Saraiki suba is fraught with „dangers‟ including enslavement of the people of south Punjab by feudal lords. The paper recommends some practical steps for the political resolution of this issue. ______ Historical background The majority of the Muslims of the Indian subcontinent once emerged as a united political entity on the basis of religion. After 1947, languages perceived as a symbol of unity motivated the separatist tendencies in Pakistan. Absence of Hindu threat loosened the strength of Muslim nationhood and regional nationalism or sub-nationalism appeared as a gigantic problem. The main cause behind this problem was the impotent and incompetent leadership who could not perform well in redressing the grievances of the people. Bengali, Pakhtoon, Baloch, Barohi, Saraiki, Sindhi, Hindko and other voices based on language and culture became an important element behind politics at least at the regional level. In Pakistan persistent economic problems, hardships and violation of basic rights are some of the main factors behind general discontent. -
The Rise of Dalit Peasants Kolhi Activism in Lower Sindh
The Rise of Dalit Peasants Kolhi Activism in Lower Sindh (Original Thesis Title) Kolhi-peasant Activism in Naon Dumbālo, Lower Sindh Creating Space for Marginalised through Multiple Channels Ghulam Hussain Mahesar Quaid-i-Azam University Department of Anthropology ii Islamabad - Pakistan Year 2014 Kolhi-Peasant Activism in Naon Dumbālo, Lower Sindh Creating Space for Marginalised through Multiple Channels Ghulam Hussain Thesis submitted to the Department of Anthropology, Quaid-i-Azam University Islamabad, in partial fulfillment of the degree of ‗Master of Philosophy in Anthropology‘ iii Quaid-i-Azam University Department of Anthropology Islamabad - Pakistan Year 2014 Formal declaration I hereby, declare that I have produced the present work by myself and without any aid other than those mentioned herein. Any ideas taken directly or indirectly from third party sources are indicated as such. This work has not been published or submitted to any other examination board in the same or a similar form. Islamabad, 25 March 2014 Mr. Ghulam Hussain Mahesar iv Final Approval of Thesis Quaid-i-Azam University Department of Anthropology Islamabad - Pakistan This is to certify that we have read the thesis submitted by Mr. Ghulam Hussain. It is our judgment that this thesis is of sufficient standard to warrant its acceptance by Quaid-i-Azam University, Islamabad for the award of the degree of ―MPhil in Anthropology‖. Committee Supervisor: Dr. Waheed Iqbal Chaudhry External Examiner: Full name of external examiner incl. title Incharge: Dr. Waheed Iqbal Chaudhry v ACKNOWLEDGEMENT This thesis is the product of cumulative effort of many teachers, scholars, and some institutions, that duly deserve to be acknowledged here. -
According to the Syllabus of University of Azad Jammu & Kashmir
LLB FIVE YEARS DEGREE PROGRAMME NOTES According to the Syllabus of University of Azad Jammu & Kashmir Muzaffarabad and Other Public, Private Sector Universities of Pakistan Prepared By Advocate Muhammad Adnan Masood Joja Sardar Javed Zahoor Khan (Advocate) CITI Law College Rawalakot 05824-442207, 444222, 0332-4573251, 051-4852737 Near CMH Rawalakot AJK Web:-www.clc.edu.pk Citi Law College, Near CMH, Rawalakot Ph: 05824-442207, 051-4852737 www.clc.edu.pk Page | 1 Citi Law College, Near CMH, Rawalakot Ph: 05824-442207, 051-4852737 www.clc.edu.pk Page | 2 Citi Law College, Near CMH, Rawalakot Ph: 05824-442207, 051-4852737 www.clc.edu.pk Page | 3 Citi Law College, Near CMH, Rawalakot Ph: 05824-442207, 051-4852737 www.clc.edu.pk Page | 4 Citi Law College, Near CMH, Rawalakot Ph: 05824-442207, 051-4852737 www.clc.edu.pk Page | 5 FUNCTIONAL ENGLISH Citi Law College, Near CMH, Rawalakot Ph: 05824-442207, 051-4852737 www.clc.edu.pk Page | 6 Parts of Speech NOUNS A noun is the word that refers to a person, thing or abstract idea. A noun can tell you who or what. There are several different types of noun: - There are common nouns such as dog, car, chair etc. Nouns that refer to things which can be counted (can be singular or plural) are countable nouns. Nouns that refer to some groups of countable nouns, substances, feelings and types of activity (can only be singular) are uncountable nouns. Nouns that refer to a group of people or things are collective nouns. Nouns that refer to people, organizations or places are proper nouns, only proper nouns are capitalized. -
Persian, Farsi, Dari, Tajiki: Language Names and Language Policies
University of Pennsylvania ScholarlyCommons Department of Anthropology Papers Department of Anthropology 2012 Persian, Farsi, Dari, Tajiki: Language Names and Language Policies Brian Spooner University of Pennsylvania, [email protected] Follow this and additional works at: https://repository.upenn.edu/anthro_papers Part of the Anthropological Linguistics and Sociolinguistics Commons, and the Anthropology Commons Recommended Citation (OVERRIDE) Spooner, B. (2012). Persian, Farsi, Dari, Tajiki: Language Names and Language Policies. In H. Schiffman (Ed.), Language Policy and Language Conflict in Afghanistan and Its Neighbors: The Changing Politics of Language Choice (pp. 89-117). Leiden, Boston: Brill. This paper is posted at ScholarlyCommons. https://repository.upenn.edu/anthro_papers/91 For more information, please contact [email protected]. Persian, Farsi, Dari, Tajiki: Language Names and Language Policies Abstract Persian is an important language today in a number of countries of west, south and central Asia. But its status in each is different. In Iran its unique status as the only official or national language continueso t be jealously guarded, even though half—probably more—of the population use a different language (mainly Azari/Azeri Turkish) at home, and on the streets, though not in formal public situations, and not in writing. Attempts to broach this exclusive status of Persian in Iran have increased in recent decades, but are still relatively minor. Persian (called tajiki) is also the official language ofajikistan, T but here it shares that status informally with Russian, while in the west of the country Uzbek is also widely used and in the more isolated eastern part of the country other local Iranian languages are now dominant. -
English Language in Pakistan: Expansion and Resulting Implications
Journal of Education & Social Sciences Vol. 5(1): 52-67, 2017 DOI: 10.20547/jess0421705104 English Language in Pakistan: Expansion and Resulting Implications Syeda Bushra Zaidi ∗ Sajida Zaki y Abstract: From sociolinguistic or anthropological perspective, Pakistan is classified as a multilingual context with most people speaking one native or regional language alongside Urdu which is the national language. With respect to English, Pakistan is a second language context which implies that the language is institutionalized and enjoying the privileged status of being the official language. This thematic paper is an attempt to review the arrival and augmentation of English language in Pakistan both before and after its creation. The chronological description is carried out in order to identify the consequences of this spread and to link them with possible future developments and implications. Some key themes covered in the paper include a brief historical overview of English language from how it was introduced to the manner in which it has developed over the years especially in relation to various language and educational policies. Based on the critical review of the literature presented in the paper there seems to be no threat to English in Pakistan a prediction true for the language globally as well. The status of English, as it globally elevates, will continue spreading and prospering in Pakistan as well enriching the Pakistani English variety that has already born and is being studied and codified. However, this process may take long due to the instability of governing policies and law-makers. A serious concern for researchers and people in general, however, will continue being the endangered and indigenous varieties that may continue being under considerable pressure from the prestigious varieties. -
Ethnicity and Ethnic Conflict in Pakistan Gulshan Majeed Abstract There Is Hardly Any State in the World, Which Is Not Ethnicall
Journal of Political Studies, Vol.1, Issue 2, Vol. 1, Issue 2, 51-63 Ethnicity and Ethnic Conflict in Pakistan Gulshan Majeed♣ Abstract There is hardly any state in the world, which is not ethnically plural. Pakistan is also no exception in this regard. Pakistan is a country with unique ethnic diversity. This present study focuses on the concept of ethnicity and different variables such as religion, language, territory and caste, which have potential to give birth violent conflicts among different ethnic identities of Pakistan. Process of national integration can be secured only when ethnic identities would be given adequate representation according to the constitution to decide their future themselves and opportunities to flourish their specific cultural identity. Ethnic identities come into conflict when they face imbalance in the society. Economic resources should be judiciously distributed among different ethnic identities. Political system should have capability to articulate social capital on the proper place according to their intellectual level. System should have capability to generate resources and to utilize these resources in the best interest of various segment of society. Political system should initiate various economic, social and political measures to curb ethnic conflict. Key words: Ethnic diversity, prescribed limits, imbalance society, judiciously distributed, articulate social capital Introduction Stratification and diversification with variegated dimensions and context, is an inbound mechanism of social fabric of a society and state. Different scholars studied this phenomenon with different tools and methodologies. Some name ethnicity as minority, group1, race, caste, class, inner and outer group. There are others who studied it in terms of insiders and outsiders, the others, nationalities, aborigines, and immigrants2. -
Classification of Eastern Iranian Languages
62 / 2014 Ľubomír Novák 1 QUESTION OF (RE)CLASSIFICATION OF EASTERN IRANIAN LANGUAGES STATI – ARTICLES – AUFSÄTZE – СТАТЬИ AUFSÄTZE – ARTICLES – STATI Abstract The Eastern Iranian languages are traditionally divided into two subgroups: the South and the North Eastern Iranian languages. An important factor for the determination of the North Eastern and the South Eastern Iranian groups is the presence of isoglosses that appeared already in the Old Iranian period. According to an analysis of isoglosses that were used to distinguish the two branches, it appears that most likely there are only two certain isoglosses that can be used for the division of the Eastern Iranian languages into the two branches. Instead of the North-South division of the Eastern Iranian languages, it seems instead that there were approximately four dialect nuclei forming mi- nor groups within the Eastern Iranian branch. Furthermore, there are some languages that geneti- cally do not belong to these nuclei. In the New Iranian period, several features may be observed that link some of the languages together, but such links often have nothing in common with a so-called genetic relationship. The most interesting issue is the position of the so-called Pamir languages with- in the Eastern Iranian group. It appears that not all the Pamir languages are genetically related; their mutual proximity, therefore, may be more sufficiently explained by later contact phenomena. Keywords Eastern Iranian languages; Pamir languages; language classification; linguistic genealogy. The Iranian languages are commonly divided into two main groups: the Eastern and Western Iranian languages. Each of the groups is subsequently divided in two other subgroups – the Northern and Southern1.