THE PHONOLOGY of the VERBAL PHRASE in HINDKO Submitted By
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
(And Potential) Language and Linguistic Resources on South Asian Languages
CoRSAL Symposium, University of North Texas, November 17, 2017 Existing (and Potential) Language and Linguistic Resources on South Asian Languages Elena Bashir, The University of Chicago Resources or published lists outside of South Asia Digital Dictionaries of South Asia in Digital South Asia Library (dsal), at the University of Chicago. http://dsal.uchicago.edu/dictionaries/ . Some, mostly older, not under copyright dictionaries. No corpora. Digital Media Archive at University of Chicago https://dma.uchicago.edu/about/about-digital-media-archive Hock & Bashir (eds.) 2016 appendix. Lists 9 electronic corpora, 6 of which are on Sanskrit. The 3 non-Sanskrit entries are: (1) the EMILLE corpus, (2) the Nepali national corpus, and (3) the LDC-IL — Linguistic Data Consortium for Indian Languages Focus on Pakistan Urdu Most work has been done on Urdu, prioritized at government institutions like the Center for Language Engineering at the University of Engineering and Technology in Lahore (CLE). Text corpora: http://cle.org.pk/clestore/index.htm (largest is a 1 million word Urdu corpus from the Urdu Digest. Work on Essential Urdu Linguistic Resources: http://www.cle.org.pk/eulr/ Tagset for Urdu corpus: http://cle.org.pk/Publication/papers/2014/The%20CLE%20Urdu%20POS%20Tagset.pdf Urdu OCR: http://cle.org.pk/clestore/urduocr.htm Sindhi Sindhi is the medium of education in some schools in Sindh Has more institutional backing and consequent research than other languages, especially Panjabi. Sindhi-English dictionary developed jointly by Jennifer Cole at the University of Illinois Urbana- Champaign and Sarmad Hussain at CLE (http://182.180.102.251:8081/sed1/homepage.aspx). -
Pashto, Waneci, Ormuri. Sociolinguistic Survey of Northern
SOCIOLINGUISTIC SURVEY OF NORTHERN PAKISTAN VOLUME 4 PASHTO, WANECI, ORMURI Sociolinguistic Survey of Northern Pakistan Volume 1 Languages of Kohistan Volume 2 Languages of Northern Areas Volume 3 Hindko and Gujari Volume 4 Pashto, Waneci, Ormuri Volume 5 Languages of Chitral Series Editor Clare F. O’Leary, Ph.D. Sociolinguistic Survey of Northern Pakistan Volume 4 Pashto Waneci Ormuri Daniel G. Hallberg National Institute of Summer Institute Pakistani Studies of Quaid-i-Azam University Linguistics Copyright © 1992 NIPS and SIL Published by National Institute of Pakistan Studies, Quaid-i-Azam University, Islamabad, Pakistan and Summer Institute of Linguistics, West Eurasia Office Horsleys Green, High Wycombe, BUCKS HP14 3XL United Kingdom First published 1992 Reprinted 2004 ISBN 969-8023-14-3 Price, this volume: Rs.300/- Price, 5-volume set: Rs.1500/- To obtain copies of these volumes within Pakistan, contact: National Institute of Pakistan Studies Quaid-i-Azam University, Islamabad, Pakistan Phone: 92-51-2230791 Fax: 92-51-2230960 To obtain copies of these volumes outside of Pakistan, contact: International Academic Bookstore 7500 West Camp Wisdom Road Dallas, TX 75236, USA Phone: 1-972-708-7404 Fax: 1-972-708-7433 Internet: http://www.sil.org Email: [email protected] REFORMATTING FOR REPRINT BY R. CANDLIN. CONTENTS Preface.............................................................................................................vii Maps................................................................................................................ -
ARTICLE Development of a Gold-Standard Pashto Dataset and a Segmentation App Yan Han and Marek Rychlik
ARTICLE Development of a Gold-standard Pashto Dataset and a Segmentation App Yan Han and Marek Rychlik ABSTRACT The article aims to introduce a gold-standard Pashto dataset and a segmentation app. The Pashto dataset consists of 300 line images and corresponding Pashto text from three selected books. A line image is simply an image consisting of one text line from a scanned page. To our knowledge, this is one of the first open access datasets which directly maps line images to their corresponding text in the Pashto language. We also introduce the development of a segmentation app using textbox expanding algorithms, a different approach to OCR segmentation. The authors discuss the steps to build a Pashto dataset and develop our unique approach to segmentation. The article starts with the nature of the Pashto alphabet and its unique diacritics which require special considerations for segmentation. Needs for datasets and a few available Pashto datasets are reviewed. Criteria of selection of data sources are discussed and three books were selected by our language specialist from the Afghan Digital Repository. The authors review previous segmentation methods and introduce a new approach to segmentation for Pashto content. The segmentation app and results are discussed to show readers how to adjust variables for different books. Our unique segmentation approach uses an expanding textbox method which performs very well given the nature of the Pashto scripts. The app can also be used for Persian and other languages using the Arabic writing system. The dataset can be used for OCR training, OCR testing, and machine learning applications related to content in Pashto. -
Internal Classification of Indo-European Languages: Survey
Václav Blažek (Masaryk University of Brno, Czech Republic) On the internal classification of Indo-European languages: Survey The purpose of the present study is to confront most representative models of the internal classification of Indo-European languages and their daughter branches. 0. Indo-European 0.1. In the 19th century the tree-diagram of A. Schleicher (1860) was very popular: Germanic Lithuanian Slavo-Lithuaian Slavic Celtic Indo-European Italo-Celtic Italic Graeco-Italo- -Celtic Albanian Aryo-Graeco- Greek Italo-Celtic Iranian Aryan Indo-Aryan After the discovery of the Indo-European affiliation of the Tocharian A & B languages and the languages of ancient Asia Minor, it is necessary to take them in account. The models of the recent time accept the Anatolian vs. non-Anatolian (‘Indo-European’ in the narrower sense) dichotomy, which was first formulated by E. Sturtevant (1942). Naturally, it is difficult to include the relic languages into the model of any classification, if they are known only from several inscriptions, glosses or even only from proper names. That is why there are so big differences in classification between these scantily recorded languages. For this reason some scholars omit them at all. 0.2. Gamkrelidze & Ivanov (1984, 415) developed the traditional ideas: Greek Armenian Indo- Iranian Balto- -Slavic Germanic Italic Celtic Tocharian Anatolian 0.3. Vladimir Georgiev (1981, 363) included in his Indo-European classification some of the relic languages, plus the languages with a doubtful IE affiliation at all: Tocharian Northern Balto-Slavic Germanic Celtic Ligurian Italic & Venetic Western Illyrian Messapic Siculian Greek & Macedonian Indo-European Central Phrygian Armenian Daco-Mysian & Albanian Eastern Indo-Iranian Thracian Southern = Aegean Pelasgian Palaic Southeast = Hittite; Lydian; Etruscan-Rhaetic; Elymian = Anatolian Luwian; Lycian; Carian; Eteocretan 0.4. -
RJSSER ISSN 2707-9015 (ISSN-L) Research Journal of Social DOI: Sciences & Economics Review ______
Research Journal of Social Sciences & Economics Review Vol. 1, Issue 4, 2020 (October – December) ISSN 2707-9023 (online), ISSN 2707-9015 (Print) RJSSER ISSN 2707-9015 (ISSN-L) Research Journal of Social DOI: https://doi.org/10.36902/rjsser-vol1-iss4-2020(411-417) Sciences & Economics Review ____________________________________________________________________________________ Interplay between Socio-Economic Factors and Language Shift: A Study of Saraiki Language in D.G. Khan * Ghulam Mujtaba Yasir, PhD Scholar (Corresponding Author) ** Prof. Dr. Mamuna Ghani, Ex-Dean __________________________________________________________________________________ Abstract Pakistan is among those very few multicultural and multilingual countries which are celebrated for their ethnic as well as linguistic diversity. From the coastal areas of Karachi to the mountainous terrain of Gilgit Baltistan six major and more than 70 minor languages are spoken in various parts of Pakistan. Urdu relishes the position of National Language whereas the official language of the country is English and is mostly used by the power-wielding strata of the country namely the government functionaries, corporate sector, and education sector. The purpose of the study was to find out the interplay between socioeconomic factors and the phenomenon of language shift. The present research is descriptive in which 300 Urdu speaking children of Saraiki families of D.G. Khan District were selected for data collection. A multiple-choice questionnaire was devised and administered to collect the required data. The results insinuated a strong interplay between socio- economic factors and the language shift. Keywords: Linguistic Diversity, Multilingualism, Socio-Economic Factors, Language Shift, Saraiki Language Introduction Language shift, also termed as language transfer, language replacement, and language assimilation, is such a situation where the members of one speech community functionally abandon one language and shift to another socially prestigious language, not necessarily by conscious choice (Garret 2006). -
Himalayan Languages and Linguistics: Studies in Phonology, Semantics, Morphology and Syntax by Mark Turin and Bettina Zeisler, Eds.; Reviewed by Elena Bashir
HIMALAYA, the Journal of the Association for Nepal and Himalayan Studies Volume 31 Number 1 Article 21 8-1-2012 Himalayan Languages and Linguistics: Studies in Phonology, Semantics, Morphology and Syntax by Mark Turin and Bettina Zeisler, Eds.; Reviewed by Elena Bashir Elena Bashir University of Chicago Follow this and additional works at: https://digitalcommons.macalester.edu/himalaya Recommended Citation Bashir, Elena. 2012. Himalayan Languages and Linguistics: Studies in Phonology, Semantics, Morphology and Syntax by Mark Turin and Bettina Zeisler, Eds.; Reviewed by Elena Bashir. HIMALAYA 31(1). Available at: https://digitalcommons.macalester.edu/himalaya/vol31/iss1/21 This work is licensed under a Creative Commons Attribution 3.0 License. This Review is brought to you for free and open access by the DigitalCommons@Macalester College at DigitalCommons@Macalester College. It has been accepted for inclusion in HIMALAYA, the Journal of the Association for Nepal and Himalayan Studies by an authorized administrator of DigitalCommons@Macalester College. For more information, please contact [email protected]. In Part II, Helen Plaisier’s article, “A key to four HIMALAYAN LANGUAGES transcription systems of Lepcha” (13 pp.), compares transcription systems proposed by four scholars, including AND LINGUISTICS: STUDIES herself, and argues that her own system “offers the user the most accurate way of transcribing Lepcha. The transliteration IN PHONOLOGY, SEMANTICS, is consistent with and faithful to the way text is written in the traditional Lepcha orthography, so it remains possible at all times to derive the original spelling from the transliteration” MORPHOLOGY AND SYNTAX (p. 51). Y ARK URIN AND ETTINA EISLER Hiroyuki Suzuki’s paper is on “Dialectal particularities B M T B Z , of Sogpho Tibetan - an introduction to the “Twenty-four EDS. -
Persian, Farsi, Dari, Tajiki: Language Names and Language Policies
University of Pennsylvania ScholarlyCommons Department of Anthropology Papers Department of Anthropology 2012 Persian, Farsi, Dari, Tajiki: Language Names and Language Policies Brian Spooner University of Pennsylvania, [email protected] Follow this and additional works at: https://repository.upenn.edu/anthro_papers Part of the Anthropological Linguistics and Sociolinguistics Commons, and the Anthropology Commons Recommended Citation (OVERRIDE) Spooner, B. (2012). Persian, Farsi, Dari, Tajiki: Language Names and Language Policies. In H. Schiffman (Ed.), Language Policy and Language Conflict in Afghanistan and Its Neighbors: The Changing Politics of Language Choice (pp. 89-117). Leiden, Boston: Brill. This paper is posted at ScholarlyCommons. https://repository.upenn.edu/anthro_papers/91 For more information, please contact [email protected]. Persian, Farsi, Dari, Tajiki: Language Names and Language Policies Abstract Persian is an important language today in a number of countries of west, south and central Asia. But its status in each is different. In Iran its unique status as the only official or national language continueso t be jealously guarded, even though half—probably more—of the population use a different language (mainly Azari/Azeri Turkish) at home, and on the streets, though not in formal public situations, and not in writing. Attempts to broach this exclusive status of Persian in Iran have increased in recent decades, but are still relatively minor. Persian (called tajiki) is also the official language ofajikistan, T but here it shares that status informally with Russian, while in the west of the country Uzbek is also widely used and in the more isolated eastern part of the country other local Iranian languages are now dominant. -
English Language in Pakistan: Expansion and Resulting Implications
Journal of Education & Social Sciences Vol. 5(1): 52-67, 2017 DOI: 10.20547/jess0421705104 English Language in Pakistan: Expansion and Resulting Implications Syeda Bushra Zaidi ∗ Sajida Zaki y Abstract: From sociolinguistic or anthropological perspective, Pakistan is classified as a multilingual context with most people speaking one native or regional language alongside Urdu which is the national language. With respect to English, Pakistan is a second language context which implies that the language is institutionalized and enjoying the privileged status of being the official language. This thematic paper is an attempt to review the arrival and augmentation of English language in Pakistan both before and after its creation. The chronological description is carried out in order to identify the consequences of this spread and to link them with possible future developments and implications. Some key themes covered in the paper include a brief historical overview of English language from how it was introduced to the manner in which it has developed over the years especially in relation to various language and educational policies. Based on the critical review of the literature presented in the paper there seems to be no threat to English in Pakistan a prediction true for the language globally as well. The status of English, as it globally elevates, will continue spreading and prospering in Pakistan as well enriching the Pakistani English variety that has already born and is being studied and codified. However, this process may take long due to the instability of governing policies and law-makers. A serious concern for researchers and people in general, however, will continue being the endangered and indigenous varieties that may continue being under considerable pressure from the prestigious varieties. -
Ethnicity and Ethnic Conflict in Pakistan Gulshan Majeed Abstract There Is Hardly Any State in the World, Which Is Not Ethnicall
Journal of Political Studies, Vol.1, Issue 2, Vol. 1, Issue 2, 51-63 Ethnicity and Ethnic Conflict in Pakistan Gulshan Majeed♣ Abstract There is hardly any state in the world, which is not ethnically plural. Pakistan is also no exception in this regard. Pakistan is a country with unique ethnic diversity. This present study focuses on the concept of ethnicity and different variables such as religion, language, territory and caste, which have potential to give birth violent conflicts among different ethnic identities of Pakistan. Process of national integration can be secured only when ethnic identities would be given adequate representation according to the constitution to decide their future themselves and opportunities to flourish their specific cultural identity. Ethnic identities come into conflict when they face imbalance in the society. Economic resources should be judiciously distributed among different ethnic identities. Political system should have capability to articulate social capital on the proper place according to their intellectual level. System should have capability to generate resources and to utilize these resources in the best interest of various segment of society. Political system should initiate various economic, social and political measures to curb ethnic conflict. Key words: Ethnic diversity, prescribed limits, imbalance society, judiciously distributed, articulate social capital Introduction Stratification and diversification with variegated dimensions and context, is an inbound mechanism of social fabric of a society and state. Different scholars studied this phenomenon with different tools and methodologies. Some name ethnicity as minority, group1, race, caste, class, inner and outer group. There are others who studied it in terms of insiders and outsiders, the others, nationalities, aborigines, and immigrants2. -
Map by Steve Huffman; Data from World Language Mapping System
Svalbard Greenland Jan Mayen Norwegian Norwegian Icelandic Iceland Finland Norway Swedish Sweden Swedish Faroese FaroeseFaroese Faroese Faroese Norwegian Russia Swedish Swedish Swedish Estonia Scottish Gaelic Russian Scottish Gaelic Scottish Gaelic Latvia Latvian Scots Denmark Scottish Gaelic Danish Scottish Gaelic Scottish Gaelic Danish Danish Lithuania Lithuanian Standard German Swedish Irish Gaelic Northern Frisian English Danish Isle of Man Northern FrisianNorthern Frisian Irish Gaelic English United Kingdom Kashubian Irish Gaelic English Belarusan Irish Gaelic Belarus Welsh English Western FrisianGronings Ireland DrentsEastern Frisian Dutch Sallands Irish Gaelic VeluwsTwents Poland Polish Irish Gaelic Welsh Achterhoeks Irish Gaelic Zeeuws Dutch Upper Sorbian Russian Zeeuws Netherlands Vlaams Upper Sorbian Vlaams Dutch Germany Standard German Vlaams Limburgish Limburgish PicardBelgium Standard German Standard German WalloonFrench Standard German Picard Picard Polish FrenchLuxembourgeois Russian French Czech Republic Czech Ukrainian Polish French Luxembourgeois Polish Polish Luxembourgeois Polish Ukrainian French Rusyn Ukraine Swiss German Czech Slovakia Slovak Ukrainian Slovak Rusyn Breton Croatian Romanian Carpathian Romani Kazakhstan Balkan Romani Ukrainian Croatian Moldova Standard German Hungary Switzerland Standard German Romanian Austria Greek Swiss GermanWalser CroatianStandard German Mongolia RomanschWalser Standard German Bulgarian Russian France French Slovene Bulgarian Russian French LombardRomansch Ladin Slovene Standard -
Map by Steve Huffman Data from World Language Mapping System 16
Tajiki Tajiki Tajiki Shughni Southern Pashto Shughni Tajiki Wakhi Wakhi Wakhi Mandarin Chinese Sanglechi-Ishkashimi Sanglechi-Ishkashimi Wakhi Domaaki Sanglechi-Ishkashimi Khowar Khowar Khowar Kati Yidgha Eastern Farsi Munji Kalasha Kati KatiKati Phalura Kalami Indus Kohistani Shina Kati Prasuni Kamviri Dameli Kalami Languages of the Gawar-Bati To rw al i Chilisso Waigali Gawar-Bati Ushojo Kohistani Shina Balti Parachi Ashkun Tregami Gowro Northwest Pashayi Southwest Pashayi Grangali Bateri Ladakhi Northeast Pashayi Southeast Pashayi Shina Purik Shina Brokskat Aimaq Parya Northern Hindko Kashmiri Northern Pashto Purik Hazaragi Ladakhi Indian Subcontinent Changthang Ormuri Gujari Kashmiri Pahari-Potwari Gujari Bhadrawahi Zangskari Southern Hindko Kashmiri Ladakhi Pangwali Churahi Dogri Pattani Gahri Ormuri Chambeali Tinani Bhattiyali Gaddi Kanashi Tinani Southern Pashto Ladakhi Central Pashto Khams Tibetan Kullu Pahari KinnauriBhoti Kinnauri Sunam Majhi Western Panjabi Mandeali Jangshung Tukpa Bilaspuri Chitkuli Kinnauri Mahasu Pahari Eastern Panjabi Panang Jaunsari Western Balochi Southern Pashto Garhwali Khetrani Hazaragi Humla Rawat Central Tibetan Waneci Rawat Brahui Seraiki DarmiyaByangsi ChaudangsiDarmiya Western Balochi Kumaoni Chaudangsi Mugom Dehwari Bagri Nepali Dolpo Haryanvi Jumli Urdu Buksa Lowa Raute Eastern Balochi Tichurong Seke Sholaga Kaike Raji Rana Tharu Sonha Nar Phu ChantyalThakali Seraiki Raji Western Parbate Kham Manangba Tibetan Kathoriya Tharu Tibetan Eastern Parbate Kham Nubri Marwari Ts um Gamale Kham Eastern -
A Case Study on the Language Situation in Northern Pakistan
multiethnica 61 Linguistic diversity, vitality and maintenance: a case study on the language situation in northern Pakistan HENRIK LILJEGREN AND FAKHRUDDIN AKHUNZADA The multilingual and multicultural region of northern ce and advocacy that have been carried out in recent Pakistan, which has approximately 30 distinct languages, years, particularly through the work of the Forum for Language lnitiatives (FLI) and its partner organizations is described and evaluated from the perspective of throughout the region. language vitality, revealing the diverse and complex interplay of language policies, community attitudes and generational transmission. Based on the experience The region: its people and languages of conscious language maintenance efforts carried out It is essential to point out from the start that the re in the area, some conclusions are offered concerning gion dealt with here is not a single geopolitical unit the particular effectiveness of regional networking and with generally agreed on boundaries. lnstead, it is roade up of several political units with varying status within non-governmental institution support to promote local today's Pakistan. In order to operationalize the descrip languages and sustain their vitality in times of great tion and decide what areas and languages to include change. or leave out, a somewhat artificial decision was roade to define northern Pakistan as that part of the country that is situated above the 34th parallel, or all Pakistan I ntrod uction held territory north of the city of Peshawar. The three Northem Pakistan's mountain region is characterized main units that makeup this region of 125,000 km2 by great linguistic and cultural diversity.