Pashto and Its Dialects, Descriptive Grammar Of
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
The English Language
The English Language Version 5.0 Eala ðu lareow, tæce me sum ðing. [Aelfric, Grammar] Prof. Dr. Russell Block University of Applied Sciences - München Department 13 – General Studies Winter Semester 2008 © 2008 by Russell Block Um eine gute Note in der Klausur zu erzielen genügt es nicht, dieses Skript zu lesen. Sie müssen auch die “Show” sehen! Dieses Skript ist der Entwurf eines Buches: The English Language – A Guide for Inquisitive Students. Nur der Stoff, der in der Vorlesung behandelt wird, ist prüfungsrelevant. Unit 1: Language as a system ................................................8 1 Introduction ...................................... ...................8 2 A simple example of structure ..................... ......................8 Unit 2: The English sound system ...........................................10 3 Introduction..................................... ...................10 4 Standard dialects ................................ ....................10 5 The major differences between German and English . ......................10 5.1 The consonants ................................. ..............10 5.2 Overview of the English consonants . ..................10 5.3 Tense vs. lax .................................. ...............11 5.4 The final devoicing rule ....................... .................12 5.5 The “th”-sounds ................................ ..............12 5.6 The “sh”-sound .................................. ............. 12 5.7 The voiced sounds / Z/ and / dZ / ...................................12 5.8 The -
Pashto, Waneci, Ormuri. Sociolinguistic Survey of Northern
SOCIOLINGUISTIC SURVEY OF NORTHERN PAKISTAN VOLUME 4 PASHTO, WANECI, ORMURI Sociolinguistic Survey of Northern Pakistan Volume 1 Languages of Kohistan Volume 2 Languages of Northern Areas Volume 3 Hindko and Gujari Volume 4 Pashto, Waneci, Ormuri Volume 5 Languages of Chitral Series Editor Clare F. O’Leary, Ph.D. Sociolinguistic Survey of Northern Pakistan Volume 4 Pashto Waneci Ormuri Daniel G. Hallberg National Institute of Summer Institute Pakistani Studies of Quaid-i-Azam University Linguistics Copyright © 1992 NIPS and SIL Published by National Institute of Pakistan Studies, Quaid-i-Azam University, Islamabad, Pakistan and Summer Institute of Linguistics, West Eurasia Office Horsleys Green, High Wycombe, BUCKS HP14 3XL United Kingdom First published 1992 Reprinted 2004 ISBN 969-8023-14-3 Price, this volume: Rs.300/- Price, 5-volume set: Rs.1500/- To obtain copies of these volumes within Pakistan, contact: National Institute of Pakistan Studies Quaid-i-Azam University, Islamabad, Pakistan Phone: 92-51-2230791 Fax: 92-51-2230960 To obtain copies of these volumes outside of Pakistan, contact: International Academic Bookstore 7500 West Camp Wisdom Road Dallas, TX 75236, USA Phone: 1-972-708-7404 Fax: 1-972-708-7433 Internet: http://www.sil.org Email: [email protected] REFORMATTING FOR REPRINT BY R. CANDLIN. CONTENTS Preface.............................................................................................................vii Maps................................................................................................................ -
ARTICLE Development of a Gold-Standard Pashto Dataset and a Segmentation App Yan Han and Marek Rychlik
ARTICLE Development of a Gold-standard Pashto Dataset and a Segmentation App Yan Han and Marek Rychlik ABSTRACT The article aims to introduce a gold-standard Pashto dataset and a segmentation app. The Pashto dataset consists of 300 line images and corresponding Pashto text from three selected books. A line image is simply an image consisting of one text line from a scanned page. To our knowledge, this is one of the first open access datasets which directly maps line images to their corresponding text in the Pashto language. We also introduce the development of a segmentation app using textbox expanding algorithms, a different approach to OCR segmentation. The authors discuss the steps to build a Pashto dataset and develop our unique approach to segmentation. The article starts with the nature of the Pashto alphabet and its unique diacritics which require special considerations for segmentation. Needs for datasets and a few available Pashto datasets are reviewed. Criteria of selection of data sources are discussed and three books were selected by our language specialist from the Afghan Digital Repository. The authors review previous segmentation methods and introduce a new approach to segmentation for Pashto content. The segmentation app and results are discussed to show readers how to adjust variables for different books. Our unique segmentation approach uses an expanding textbox method which performs very well given the nature of the Pashto scripts. The app can also be used for Persian and other languages using the Arabic writing system. The dataset can be used for OCR training, OCR testing, and machine learning applications related to content in Pashto. -
Member Directory
D DIRECTORY Member Directory ABOUT THE MOBILE MARKETING ASSOCIATION (MMA) Mobile Marketing Association Member Directory, Spring 2008 The Mobile Marketing Association (MMA) is the premier global non- profit association established to lead the development of mobile Mobile Marketing Association marketing and its associated technologies. The MMA is an action- 1670 Broadway, Suite 850 Denver, CO 80202 oriented organization designed to clear obstacles to market USA development, establish guidelines and best practices for sustainable growth, and evangelize the mobile channel for use by brands and Telephone: +1.303.415.2550 content providers. With more than 600 member companies, Fax: +1.303.499.0952 representing over forty-two countries, our members include agencies, [email protected] advertisers, handheld device manufacturers, carriers and operators, retailers, software providers and service providers, as well as any company focused on the potential of marketing via mobile devices. *Updated as of 31 May, 2008 The MMA is a global organization with regional branches in Asia Pacific (APAC); Europe, Middle East & Africa (EMEA); Latin America (LATAM); and North America (NA). About the MMA Member Directory The MMA Member Directory is the mobile marketing industry’s foremost resource for information on leading companies in the mobile space. It includes MMA members at the global, regional, and national levels. An online version of the Directory is available at http://www.mmaglobal.com/memberdirectory.pdf. The Directory is published twice each year. The materials found in this document are owned, held, or licensed by the Mobile Marketing Association and are available for personal, non-commercial, and educational use, provided that ownership of the materials is properly cited. -
New Language Resources for the Pashto Language
New language resources for the Pashto language Djamel Mostefa 1 , Khalid Choukri 1 , Sylvie Brunessaux 2 , Karim Boudahmane 3 1 Evaluation and Language resources Distribution Agency, France 2 CASSIDIAN, France 3 Direction Générale de l'Armement, France E-mail: [email protected], [email protected], [email protected], [email protected] Abstract This paper reports on the development of new language resources for the Pashto language, a very low-resource language spoken in Afghanistan and Pakistan. In the scope of a multilingual data collection project, three large corpora are collected for Pashto. Firstly a monolingual text corpus of 100 million words is produced. Secondly a 100 hours speech database is recorded and manually transcribed. Finally a bilingual Pashto-French parallel corpus of around 2 million is produced by translating Pashto texts into French. These resources will be used to develop Human Language Technology systems for Pashto with a special focus on Machine Translation. Keywords: Pashto, low-resource language, speech corpus, monolingual and multilingual text corpora, web crawling. other one being Dari) and one regional language in 1. Introduction Pakistan. There are very few corpora and Human Language The code assigned to the language by the ISO 639-3 Technology (HLT) services available for Pashto. No standard is [pus]. language resources for Pashto can be found in the According to the Ethnologue.com website, it is spoken by catalogues of LDC1 and ELRA2. around 20 million people and three main dialects are to be Pashto is a very low-resource language. Google doesn't considered: support Pashto in its search engine or translation services. -
Reconstructing Our Understanding of the Link Between Services and State Legitimacy Working Paper 87
Researching livelihoods and services affected by conflict Reconstructing our understanding of the link between services and state legitimacy Working Paper 87 Aoife McCullough, with Antoine Lacroix and Gemma Hennessey June 2020 Written by Aoife McCullough, with Antoine Lacroix and Gemma Hennessey SLRC publications present information, analysis and key policy recommendations on issues relating to livelihoods, basic services and social protection in conflict affected situations. This and other SLRC publications are available from www.securelivelihoods.org. Funded by UK aid from the UK Government, Irish Aid and the EC. Disclaimer: The views presented in this publication are those of the author(s) and do not necessarily reflect the UK Government’s official policies or represent the views of Irish Aid, the EC, SLRC or our partners. ©SLRC 2020. Readers are encouraged to quote or reproduce material from SLRC for their own publications. As copyright holder SLRC requests due acknowledgement. Secure Livelihoods Research Consortium Overseas Development Institute (ODI) 203 Blackfriars Road London SE1 8NJ United Kingdom T +44 (0)20 3817 0031 F +44 (0)20 7922 0399 E [email protected] www.securelivelihoods.org @SLRCtweet Cover photo: DFID/Russell Watkins. A doctor with the International Medical Corps examines a patient at a mobile health clinic in Pakistan. B About us The Secure Livelihoods Research Consortium (SLRC) is a global research programme exploring basic services, and social protection in fragile and conflict-affected situations. Funded by UK Aid from the UK Government (DFID), with complementary funding from Irish Aid and the European Commission (EC), SLRC was established in 2011 with the aim of strengthening the evidence base and informing policy and practice around livelihoods and services in conflict. -
February,February, 20052005
THETHE OFFICIALOFFICIAL PUBLICATIONPUBLICATION OFOF THETHE WOMENWOMEN SOARINGSOARING PILOTSPILOTS ASSOC.ASSOC. February,February, 20052005 Www.womensoaring.org IIN THIS ISSUE 2005 Seminar PAGE 2 July 11-15, 2005 at Air Sailing Gliderport Reno, NV From the President For contact: Terry Duncan by Lucy Anne McKoski [email protected] Badges by Helen D’Couto Phone 408 446-0847 From the Editor by Frauke Elber PAGE 3 Convention Report by Colleen Koenig PAGE 4 TuesdayTuesday FlightFlight byby CindyCindy BricknerBrickner PAGE 5 On Line Contest PAGE 6 Motorless Flight Sympo- sium,Facts and Trends, but Safety First by Roberta Fischer Malara PAGE 7 FOD Happens by Joel Cornell PAGE 8 Lindbergh Trophy Help needed InvitationInvitation toto WSPAWSPA Welcome new members PAGE 9 HOW TO DRESS FOR WINTER FLYING Hear Say The above picture shows Ginny Schweizer ready for a flight from Harris Hill in the winter of 1940 in the open cockpit Kirby Kite which required special clothing to keep warm. PAGE 10--11 Rather than trying to explain what she wore she had submitted this picture which was taken by Scholarships and Scholarship Hans Groenhoff (Another great soaring pioneer. Ed.) The photo and story was published in Applications Life Magazine Ginny said that being launched by auto tows in that open sailplane on a very cold day was thrilling and a lot of fun. She had no problem keeping warm with all those flying clothes which were also used for a New Year Day’s flight. Thanks to Kay Gertsen, editor of Ridgewind, newsletter of the Harris Hill Soaring. Corp page 2 February 2005 THE WOMEN SOARING PILOTS Badges through Silver Altitude ASSOCIATION (WSPA) WAS Cheryl Beckage FOUNDED IN 1986 AND IS Dec.15, 2005 Mary Cowie AFFILIATED WITH THE SOARING by Helen D’Couto Katherine (Kat) Haessler SOCIETY OF AMERICA Silver Distance ANNUAL DUES (JULY-JUNE) ARE $10. -
From Latin to Romance: Case Loss and Preservation in Pronominal Systems
FLORE Repository istituzionale dell'Università degli Studi di Firenze From Latin to Romance: case loss and preservation in pronominal systems Questa è la Versione finale referata (Post print/Accepted manuscript) della seguente pubblicazione: Original Citation: From Latin to Romance: case loss and preservation in pronominal systems / Manzini, MARIA RITA; Savoia, LEONARDO MARIA. - In: PROBUS. - ISSN 1613-4079. - STAMPA. - 26, 2(2014), pp. 217-248. Availability: This version is available at: 2158/891750 since: 2016-01-20T16:23:29Z Terms of use: Open Access La pubblicazione è resa disponibile sotto le norme e i termini della licenza di deposito, secondo quanto stabilito dalla Policy per l'accesso aperto dell'Università degli Studi di Firenze (https://www.sba.unifi.it/upload/policy-oa-2016-1.pdf) Publisher copyright claim: (Article begins on next page) 27 September 2021 Probus 2014; 26(2): 217 – 248 M. Rita Manzini* and Leonardo M. Savoia From Latin to Romance: case loss and preservation in pronominal systems Abstract: The evolution from Latin into Romance is marked by the loss of case in nominal declensions. In most Romance varieties, however, pronouns, specifi- cally in the 1st/2nd person singular, keep case differentiations. In some varieties 1st/2nd singular pronouns present a three-way case split, essentially the same re- constructed for proto-Romance (De Dardel and Gaeng 1992, Zamboni 1998). We document and analyze the current situation of Romance in the first part of the article (section 1). In the second part of the article we argue that the Dative Shifted distribution of loro in modern Italian, accounted for by means of the category of weak pronoun in Cardinaletti and Starke (1999), is best construed as a survival of oblique case in the 3rd person system (section 2). -
Possessive Pronouns As Oblique Dps: Linkers and Affix Stacking
19.3 (2018): 393-425 UDC 81’367.626.2=111 Original scientific article Received on 16.02. 2018 Accepted for publication on 05.11. 2018 M. Rita Manzini Università degli Studi di Firenze Possessive pronouns as oblique DPs: Linkers and affix stacking In many familiar European languages, e.g. German or Italian, possessive pro- nouns agree in φ-features with their head noun. We argue that they are geni- tive pronouns, endowed with an extra φ-features set. As such, they are part of a range of phenomena including case stacking and linkers unified under the historical-typological label of Suffixaufnahme. We express the formal basis for this unification as the Stacking Generalization (Section 1). We then apply our analysis to the narrower domain of facts involving possessive pronouns, specifically in Balkan and Romance languages. We further find that 1/2P pro- nouns present a richer stacking structure than their 3P counterparts (Section 2). We examine this latter fact in the context of a more general phenomenon, whereby the 1/2P vs 3P Person split not only tends to correlate with different case and agreement alignments – but seems to govern the morphological ex- pression of case and agreement itself, in terms of richer vs poorer content (Section 3). Key words: oblique case; genitive; possessives; pronouns; linkers; agree- ment; person. 1. Linkers and affix stacking This section aims at establishing the framework for the discussion of agreeing pos- sessive pronouns to be pursued in later sections. According to a well established historical-typological view (Plank 1995), modifier structures involving both free standing heads (linkers) and stacked affixation (case stacking), are to be unified on the basis of functional considerations. -
Pashto Alphabets
LEARNING PASHTO Intensive Elementary & Secondary Pashto for Military and other Professionals by Dawood Azami Visiting Scholar Email: [email protected] The Middle East Studies Center (MESC) The Ohio State University, Columbus August 2009 1 Aims of the Course: *To provide a thorough introductory course in basic Pashto with the accent on practical spoken Pashto, coverage of grammar, familiarity with Pashto pronunciation, and essential vocabulary. *Ability to communicate within a range of situations and to handle simple survival situations (e.g. finding lodging, food, transportation etc.) *Ability to read the simple Pashto texts dealing with a variety of social and basic needs. In addition to author’s own command and expertise, a number of sources (books, both published and unpublished, journals, websites, etc.) have been consulted while preparing this material. Word of thanks: The author would like to thank Dr. Alam Payind, Director, Middle East Studies Center (MESC), and Melinda McClimans, Assistant Director, MESC. Their cooperation and assistance certainly made my stay in Columbus easier and enjoyable. Copy Right: This course material is for teaching of Pashto language at The Ohio State University. The author holds the copy right for any other use. The author intends to publish a modified version of the material as a book in the future. Please contact the author for more information. (Email: [email protected] ) 2 Contents I. Pashto Alphabet .................................................................................................................. 5 A. Pashto Sounds Similar to English ............................................................................... 7 B. Pashto Sounds Different from English ........................................................................ 7 C. Two letters pronounced differently in major Pashto dialects: ...................................... 8 D. Arabic Letters/ Sounds in Pashto .................................................................................. 8 II. -
Special Inspector General for Afghanistan Reconstruction (SIGAR)
Special Inspector General for OCT 30 SIGAR Afghanistan Reconstruction 2018 QUARTERLY REPORT TO THE UNITED STATES CONGRESS The National Defense Authorization Act for FY 2008 (Pub. L. No. 110- 181) established the Special Inspector General for Afghanistan Reconstruction (SIGAR). SIGAR’s oversight mission, as dened by the legislation, is to provide for the independent and objective • conduct and supervision of audits and investigations relating to the programs and operations funded with amounts appropriated or otherwise made available for the reconstruction of Afghanistan. • leadership and coordination of, and recommendations on, policies designed to promote economy, efciency, and effectiveness in the administration of the programs and operations, and to prevent and detect waste, fraud, and abuse in such programs and operations. • means of keeping the Secretary of State and the Secretary of Defense fully and currently informed about problems and deciencies relating to the administration of such programs and operation and the necessity for and progress on corrective action. Afghanistan reconstruction includes any major contract, grant, agreement, or other funding mechanism entered into by any department or agency of the U.S. government that involves the use of amounts appropriated or otherwise made available for the reconstruction of Afghanistan. As required by the National Defense Authorization Act for FY 2018 (Pub. L. No. 115-91), this quarterly report has been prepared in accordance with the Quality Standards for Inspection and Evaluation issued by the Council of the Inspectors General on Integrity and Efciency. Source: Pub.L. No. 110-181, “National Defense Authorization Act for FY 2008,” 1/28/2008, Pub. L. No. -
Reproductions Supplied by EDRS Are the Best That Can Be Made from the Original Document
DOCUMENT RESUME ED 447 692 FL 026 310 AUTHOR Breathnech, Diarmaid, Ed. TITLE Contact Bulletin, 1990-1999. INSTITUTION European Bureau for Lesser Used Languages, Dublin (Ireland). SPONS AGENCY Commission of the European Communities, Brussels (Belgium). PUB DATE 1999-00-00 NOTE 398p.; Published triannually. Volume 13, Number 2 and Volume 14, Number 2 are available from ERIC only in French. PUB TYPE Collected Works Serials (022) LANGUAGE English, French JOURNAL CIT Contact Bulletin; v7-15 Spr 1990-May 1999 EDRS PRICE MF01/PC16 Plus Postage. DESCRIPTORS Ethnic Groups; Irish; *Language Attitudes; *Language Maintenance; *Language Minorities; Second Language Instruction; Second Language Learning; Serbocroatian; *Uncommonly Taught Languages; Welsh IDENTIFIERS Austria; Belgium; Catalan; Czech Republic;-Denmark; *European Union; France; Germany; Greece; Hungary; Iceland; Ireland; Italy; *Language Policy; Luxembourg; Malta; Netherlands; Norway; Portugal; Romania; Slovakia; Spain; Sweden; Ukraine; United Kingdom ABSTRACT This document contains 26 issues (the entire output for the 1990s) of this publication deaicated to the study and preservation of Europe's less spoken languages. Some issues are only in French, and a number are in both French and English. Each issue has articles dealing with minority languages and groups in Europe, with a focus on those in Western, Central, and Southern Europe. (KFT) Reproductions supplied by EDRS are the best that can be made from the original document. N The European Bureau for Lesser Used Languages CONTACT BULLETIN This publication is funded by the Commission of the European Communities Volumes 7-15 1990-1999 REPRODUCE AND PERMISSION TO U.S. DEPARTMENT OF EDUCATION MATERIAL HAS Office of Educational Research DISSEMINATE THIS and Improvement BEEN GRANTEDBY EDUCATIONAL RESOURCESINFORMATION CENTER (ERIC) This document has beenreproduced as received from the personor organization Xoriginating it.