Urdu Alphabet
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Owner's Manual
OWNER’S MANUAL M197WD M227WD M237WD Make sure to read the Safety Precautions before using the product. Keep the User's Guide(CD) in an accessible place for furture reference. See the label attached on the product and give the information to your dealer when you ask for service. Trade Mark of the DVB Digital Video Broadcasting Project (1991 to 1996) ID Number(s): 5741 : M227WD 5742 : M197WD 5890 : M237WD PREPARATION FRONT PANEL CONTROLS I This is a simplified representation of the front panel. The image shown may be somewhat different from your set. INPUT INPUT Button MENU MENU Button OK OK Button VOLUME VOL Buttons PROGRAMME PR Buttons Power Button Headphone Button 1 PREPARATION <M197WD/M227WD> BACK PANEL INFORMATION I This is a simplified representation of the back panel. The image shown may be somewhat different from your set. 1 2 3 4 5 6 7 COMPONENT AV-IN 3 AUDIO IN IN (RGB/DVI) AV 1 AV 2 OPTICAL Y DIGITAL AV 1 AV 2 AUDIO OUT VIDEO AUDIO 1 B P VIDEO HDMI RGB IN (PC) (MONO) AC IN 2 PR L DVI-D ANTENNA L IN AC IN SERVICE R AUDIO ONLY RS-232C IN (CONTROL & SERVICE) R S-VIDEO 8 9 10 11 12 13 14 1 PCMCIA (Personal Computer Memory Card 7 Audio/Video Input International Association) Card Slot Connect audio/video output from an external device (This feature is not available in all countries.) to these jacks. 2 Power Cord Socket 8 SERVICE ONLY PORT This set operates on AC power. The voltage is indicat- ed on the Specifications page. -
Roman to Urdu Transliteration Using Word List
Roman to Urdu Transliteration using word list Tafseer Ahmed Universitaet Konstanz [email protected] Abstract script is widely used when there comes a need of a language for the informal communication over internet. The paper discusses transliteration of Urdu words Unlike Urdu script, the roman script for Urdu does from roman script to Urdu script. We propose a word not have any standard for spelling the words. A word list based approach that gives a better transliteration can be written in various forms not only by distinct to the Persio-Arabic letters that have same or similar writers but also by the same writer at different sound but are written as different letter in Urdu script. occasions. Specially, there is no one to one mapping We give a rule based system for roman to Urdu between Urdu letters for vowel sounds and the transliteration. As the roman script for Urdu does not corresponding roman letters. The support of Urdu follow any standard and a single word can be written script on the computers is getting better by the usage of in multiple ways, the proposed rule based system Unicode character set and Open Type fonts. The covers the different ways of writing and give the best availability of Urdu script support demands for roman possible Urdu transliteration based on word list and to Urdu transliterator that can convert the text written roman to Urdu script mapping rules. in roman Urdu into proper Urdu script. This paper focuses on the issues related to the 1. Introduction transliteration of Urdu text from roman script to Urdu script. -
Technical Reference Manual for the Standardization of Geographical Names United Nations Group of Experts on Geographical Names
ST/ESA/STAT/SER.M/87 Department of Economic and Social Affairs Statistics Division Technical reference manual for the standardization of geographical names United Nations Group of Experts on Geographical Names United Nations New York, 2007 The Department of Economic and Social Affairs of the United Nations Secretariat is a vital interface between global policies in the economic, social and environmental spheres and national action. The Department works in three main interlinked areas: (i) it compiles, generates and analyses a wide range of economic, social and environmental data and information on which Member States of the United Nations draw to review common problems and to take stock of policy options; (ii) it facilitates the negotiations of Member States in many intergovernmental bodies on joint courses of action to address ongoing or emerging global challenges; and (iii) it advises interested Governments on the ways and means of translating policy frameworks developed in United Nations conferences and summits into programmes at the country level and, through technical assistance, helps build national capacities. NOTE The designations employed and the presentation of material in the present publication do not imply the expression of any opinion whatsoever on the part of the Secretariat of the United Nations concerning the legal status of any country, territory, city or area or of its authorities, or concerning the delimitation of its frontiers or boundaries. The term “country” as used in the text of this publication also refers, as appropriate, to territories or areas. Symbols of United Nations documents are composed of capital letters combined with figures. ST/ESA/STAT/SER.M/87 UNITED NATIONS PUBLICATION Sales No. -
Arabic Alphabet - Wikipedia, the Free Encyclopedia Arabic Alphabet from Wikipedia, the Free Encyclopedia
2/14/13 Arabic alphabet - Wikipedia, the free encyclopedia Arabic alphabet From Wikipedia, the free encyclopedia َأﺑْ َﺠ ِﺪﯾﱠﺔ َﻋ َﺮﺑِﯿﱠﺔ :The Arabic alphabet (Arabic ’abjadiyyah ‘arabiyyah) or Arabic abjad is Arabic abjad the Arabic script as it is codified for writing the Arabic language. It is written from right to left, in a cursive style, and includes 28 letters. Because letters usually[1] stand for consonants, it is classified as an abjad. Type Abjad Languages Arabic Time 400 to the present period Parent Proto-Sinaitic systems Phoenician Aramaic Syriac Nabataean Arabic abjad Child N'Ko alphabet systems ISO 15924 Arab, 160 Direction Right-to-left Unicode Arabic alias Unicode U+0600 to U+06FF range (http://www.unicode.org/charts/PDF/U0600.pdf) U+0750 to U+077F (http://www.unicode.org/charts/PDF/U0750.pdf) U+08A0 to U+08FF (http://www.unicode.org/charts/PDF/U08A0.pdf) U+FB50 to U+FDFF (http://www.unicode.org/charts/PDF/UFB50.pdf) U+FE70 to U+FEFF (http://www.unicode.org/charts/PDF/UFE70.pdf) U+1EE00 to U+1EEFF (http://www.unicode.org/charts/PDF/U1EE00.pdf) Note: This page may contain IPA phonetic symbols. Arabic alphabet ا ب ت ث ج ح خ د ذ ر ز س ش ص ض ط ظ ع en.wikipedia.org/wiki/Arabic_alphabet 1/20 2/14/13 Arabic alphabet - Wikipedia, the free encyclopedia غ ف ق ك ل م ن ه و ي History · Transliteration ء Diacritics · Hamza Numerals · Numeration V · T · E (//en.wikipedia.org/w/index.php?title=Template:Arabic_alphabet&action=edit) Contents 1 Consonants 1.1 Alphabetical order 1.2 Letter forms 1.2.1 Table of basic letters 1.2.2 Further notes -
Urdu Alphabet 1 Urdu Alphabet
Urdu alphabet 1 Urdu alphabet Urdu alphabet ﺍﺭﺩﻭ ﺗﮩﺠﯽ Example of writing in the Urdu alphabet: Urdu Type Abjad Languages Urdu, Balti, Burushaski, others Parent systems Proto-Sinaitic • Phoenician • Aramaic • Nabataean • Arabic • Perso-Arabic • Urdu alphabet ﺍﺭﺩﻭ ﺗﮩﺠﯽ [1] Unicode range U+0600 to U+06FF [2] U+0750 to U+077F [3] U+FB50 to U+FDFF [4] U+FE70 to U+FEFF Urdu alphabet ﮮ ﯼ ء ﮪ ﻩ ﻭ ﻥ ﻡ ﻝ ﮒ ﮎ ﻕ ﻑ ﻍ ﻉ ﻅ ﻁ ﺽ ﺹ ﺵ ﺱ ﮊ ﺯ ﮌ ﺭ ﺫ ﮈ ﺩ ﺥ ﺡ ﭺ ﺝ ﺙ ﭦ ﺕ ﭖ ﺏ ﺍ Extended Perso-Arabic script • History • Diacritics • Hamza • Numerals • Numeration The Urdu alphabet is the right-to-left alphabet used for the Urdu language. It is a modification of the Persian alphabet, which is itself a derivative of the Arabic alphabet. With 38 letters and no distinct letter cases, the Urdu alphabet is typically written in the calligraphic Nasta'liq script, whereas Arabic is more commonly in the Naskh style. Usually, bare transliterations of Urdu into Roman letters (called Roman Urdu) omit many phonemic elements that have no equivalent in English or other languages commonly written in the Latin script. The National Language Authority of Pakistan has developed a number of systems with specific notations to signify non-English sounds, but ﺥ ﻍ ﻁ these can only be properly read by someone already familiar with Urdu, Persian, or Arabic for letters such as Urdu alphabet 2 [citation needed].ﮌ and Hindi for letters such as ﻕ or ﺹ ﺡ ﻉ ﻅ ﺽ History The Urdu language emerged as a distinct register of Hindustani well before the Partition of India, and it is distinguished most by its extensive Persian influences (Persian having been the official language of the Mughal government and the most prominent lingua franca of the Indian subcontinent for several centuries prior to the solidification of British colonial rule during the 19th century). -
Shahmukhi to Gurmukhi Transliteration System: a Corpus Based Approach
Shahmukhi to Gurmukhi Transliteration System: A Corpus based Approach Tejinder Singh Saini1 and Gurpreet Singh Lehal2 1 Advanced Centre for Technical Development of Punjabi Language, Literature & Culture, Punjabi University, Patiala 147 002, Punjab, India [email protected] http://www.advancedcentrepunjabi.org 2 Department of Computer Science, Punjabi University, Patiala 147 002, Punjab, India [email protected] Abstract. This research paper describes a corpus based transliteration system for Punjabi language. The existence of two scripts for Punjabi language has created a script barrier between the Punjabi literature written in India and in Pakistan. This research project has developed a new system for the first time of its kind for Shahmukhi script of Punjabi language. The proposed system for Shahmukhi to Gurmukhi transliteration has been implemented with various research techniques based on language corpus. The corpus analysis program has been run on both Shahmukhi and Gurmukhi corpora for generating statistical data for different types like character, word and n-gram frequencies. This statistical analysis is used in different phases of transliteration. Potentially, all members of the substantial Punjabi community will benefit vastly from this transliteration system. 1 Introduction One of the great challenges before Information Technology is to overcome language barriers dividing the mankind so that everyone can communicate with everyone else on the planet in real time. South Asia is one of those unique parts of the world where a single language is written in different scripts. This is the case, for example, with Punjabi language spoken by tens of millions of people but written in Indian East Punjab (20 million) in Gurmukhi script (a left to right script based on Devanagari) and in Pakistani West Punjab (80 million), written in Shahmukhi script (a right to left script based on Arabic), and by a growing number of Punjabis (2 million) in the EU and the US in the Roman script. -
DRAFT Arabtex a System for Typesetting Arabic User Manual Version 4.00
DRAFT ArabTEX a System for Typesetting Arabic User Manual Version 4.00 12 Klaus Lagally May 25, 1999 1Report Nr. 1998/09, Universit¨at Stuttgart, Fakult¨at Informatik, Breitwiesenstraße 20–22, 70565 Stuttgart, Germany 2This Report supersedes Reports Nr. 1992/06 and 1993/11 Overview ArabTEX is a package extending the capabilities of TEX/LATEX to generate the Perso-Arabic writing from an ASCII transliteration for texts in several languages using the Arabic script. It consists of a TEX macro package and an Arabic font in several sizes, presently only available in the Naskhi style. ArabTEX will run with Plain TEXandalsowithLATEX2e. It is compatible with Babel, CJK, the EDMAC package, and PicTEX (with some restrictions); other additions to TEX have not been tried. ArabTEX is primarily intended for generating the Arabic writing, but the stan- dard scientific transliteration can also be easily produced. For languages other than Arabic that are customarily written in extensions of the Perso-Arabic script some limited support is available. ArabTEX defines its own input notation which is both machine, and human, readable, and suited for electronic transmission and E-Mail communication. However, texts in many of the Arabic standard encodings can also be processed. Starting with Version 3.02, ArabTEX also provides support for fully vowelized Hebrew, both in its private ASCII input notation and in several other popular encodings. ArabTEX is copyrighted, but free use for scientific, experimental and other strictly private, noncommercial purposes is granted. Offprints of scientific publi- cations using ArabTEX are welcome. Using ArabTEX otherwise requires a license agreement. There is no warranty of any kind, either expressed or implied. -
Grammatical Analysis of Nastalique Writing Style of Urdu
Grammatical Analysis of Nastalique Writing Style of Urdu 1 Historical Note Nastalique is one of the most intricate styles used for Arabic script, which makes it both beautiful and complex to model. This analysis has been conducted as part of the development process of Nafees Nastalique Font. The work has been conducted in 2002 and is being released for the general development of Nastalique writing style. The work has been supported by APDIP UNDP and IDRC. Authors Sarmad Hussain Shafiq ur Rahman Aamir Wali Atif Gulzar Syed Jamil ur Rahman December, 2002 2 Table of Contents HISTORICAL NOTE ............................................................................................................................................. 2 1. THE NASTALIQUE STYLE: AN INTRODUCTION .................................................................................... 5 2. URDU SCRIPT .................................................................................................................................................... 5 2.1. THE URDU ALPHABET .................................................................................................................................... 5 2.2. MAPPING BETWEEN NASTALIQUE AND URDU CHARACTERS ........................................................................... 6 2.3. BUILDING BLOCKS FOR AN URDU FONT ......................................................................................................... 7 2.3.1 Urdu Characters .................................................................................................................................... -
Learning Trilingual Dictionaries for Urdu – Roman Urdu – English
Learning Trilingual Dictionaries for Urdu – Roman Urdu – English Moiz Rauf and Sebastian Padó Institut für Maschinelle Sprachverarbeitung University of Stuttgart, Germany {moiz.rauf,pado}@ims.uni-stuttgart.de Abstract In this paper, we present an effort to generate a joint Urdu, Roman Urdu and English trilingual lexicon using automated methods. We make a case for using statistical machine translation approaches and parallel corpora for dictionary creation. To this purpose, we use word alignment tools on the corpus and evaluate translations using human evaluators. Despite different writing script and considerable noise in the corpus our results show promise with over 85% accuracy of Roman Urdu–Urdu and 45% English–Urdu pairs. 1 Introduction Bilingual lexicons serve an integral role in cross lingual information retrieval and bringing NLP to low resourced languages. The process of dictionary generation has greatly benefited from improvements in statistical translation methods. However, for low resourced languages the large parallel and monolingual corpora necessary to learn these dictionaries are hard to come by and remain a critical hurdle (Lam et al., 2015). In this paper, we have developed such a resource for Urdu, English and Roman Urdu (Urdu written in Latin script) language pairs. Urdu is an Indo-Aryan language with an extended Persio-Arabic script. It is the national language of Pakistan (Rasul, 2013), while English has been established as the medium used in educational and official settings in the country (Rafi, 2013; Muhammad Asghar and Mahmood, 2013). Roman Urdu despite not being an official script, plays an important role in communication and is widely popular on social media platforms (Bilal et al., 2017). -
Sequence to Sequence Networks for Roman-Urdu to Urdu Transliteration
20th International Multitopic Conference (INMIC’ 17) Sequence to Sequence Networks for Roman-Urdu to Urdu Transliteration Mehreen Alam, Sibt ul Hussain [email protected], [email protected] Reveal Lab, Computer Science Department, NUCES Islamabad, Pakistan Abstract— Neural Machine Translation models have natural language processing methods are being employed for replaced the conventional phrase based statistical translation Urdu language which limit the extent to which the methods since the former takes a generic, scalable, data-driven performance can be achieved. Their reliance on hand-crafted approach rather than relying on manual, hand-crafted features. features and manual annotations restricts not only their The neural machine translation system is based on one neural scalability for larger datasets but also their capacity to handle network that is composed of two parts, one that is responsible varying complexities of the language. In this paper, neural for input language sentence and other part that handles the machine translation based on deep learning has been applied desired output language sentence. This model based on to get transliteration from Roman-Urdu to Urdu with distributed representations of both the languages. encoder-decoder architecture also takes as input the distributed representations of the source language which enriches the Neural Machine Translation is based on the concept of learnt dependencies and gives a warm start to the network. In encoder-decoder where encoder and decoder are both based on this work, we transform Roman-Urdu to Urdu transliteration the sequence of multiple Recurrent Neural Network (RNN) into sequence to sequence learning problem. To this end, we cells [3]. -
Languages of New York State Is Designed As a Resource for All Education Professionals, but with Particular Consideration to Those Who Work with Bilingual1 Students
TTHE LLANGUAGES OF NNEW YYORK SSTATE:: A CUNY-NYSIEB GUIDE FOR EDUCATORS LUISANGELYN MOLINA, GRADE 9 ALEXANDER FFUNK This guide was developed by CUNY-NYSIEB, a collaborative project of the Research Institute for the Study of Language in Urban Society (RISLUS) and the Ph.D. Program in Urban Education at the Graduate Center, The City University of New York, and funded by the New York State Education Department. The guide was written under the direction of CUNY-NYSIEB's Project Director, Nelson Flores, and the Principal Investigators of the project: Ricardo Otheguy, Ofelia García and Kate Menken. For more information about CUNY-NYSIEB, visit www.cuny-nysieb.org. Published in 2012 by CUNY-NYSIEB, The Graduate Center, The City University of New York, 365 Fifth Avenue, NY, NY 10016. [email protected]. ABOUT THE AUTHOR Alexander Funk has a Bachelor of Arts in music and English from Yale University, and is a doctoral student in linguistics at the CUNY Graduate Center, where his theoretical research focuses on the semantics and syntax of a phenomenon known as ‘non-intersective modification.’ He has taught for several years in the Department of English at Hunter College and the Department of Linguistics and Communications Disorders at Queens College, and has served on the research staff for the Long-Term English Language Learner Project headed by Kate Menken, as well as on the development team for CUNY’s nascent Institute for Language Education in Transcultural Context. Prior to his graduate studies, Mr. Funk worked for nearly a decade in education: as an ESL instructor and teacher trainer in New York City, and as a gym, math and English teacher in Barcelona. -
Orthographic Transparency and the Ottoman Abjad Maithili Jais
Orthographic Transparency and the Ottoman Abjad Maithili Jais University of Florida Spring 2018 I. Introduction In 2014, the debate over whether Ottoman Turkish was to be taught in schools or not was once again brought to the forefront of Turkish society and the Turkish conscience, as Erdogan began to push for Ottoman Turkish to be taught in all high schools across the country (Yeginsu, 2014). This became an obsession of a news topic for media in the West as well as in Turkey. Turkey’s tumultuous history with politics inevitably led this proposal of teaching Ottoman Turkish in all high schools to become a hotbed of controversy and debate. For all those who are perfectly contented to let bygones be bygones, there are many who assert that the Ottoman Turkish alphabet is still relevant and important. In fact, though this may be a personal anecdote, there are still certainly people who believe that the Ottoman script is, or was, superior to the Latin alphabet with which modern Turkish is written. This thesis does not aim to undertake a task so grand as sussing out which of the two was more appropriate for Turkish. No, such a task would be a behemoth for this paper. Instead, it aims to answer the question, “How?” Rather, “How was the Arabic script moulded to fit Turkish and to what consequence?” Often the claim that one script it superior to another suggests inherent judgement of value, but of the few claims seen circulating Facebook on the efficacy of the Ottoman script, it seems some believe that it represented Turkish more accurately and efficiently.