Improving Search Via Named Entity Recognition in Morphologically Rich Languages – a Case Study in Urdu

Total Page:16

File Type:pdf, Size:1020Kb

Improving Search Via Named Entity Recognition in Morphologically Rich Languages – a Case Study in Urdu Improving Search via Named Entity Recognition in Morphologically Rich Languages – A Case Study in Urdu A DISSERTATION SUBMITTED TO THE FACULTY OF THE UNIVERSITY OF MINNESOTA BY Kashif H. Riaz IN PARTIAL FULLFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF DOCTOR OF PHILOSOPHY Adviser: Dr. Vipin Kumar Co-Adviser: Dr. Blake Howald Co-Adviser: Dr. Jeanette Gundel February 2018 This dissertation is copyrighted to Kashif H. Riaz Copyright © 2018 Dedication To my teachers, parents, and Komayl i Acknowledgments To my late grandfather Intizar Hussain, I will always cherish the time I spent scribing for him. The access he provided to his library was no less than the Book of Kells for me. I am thankful to my committee members and my advisers for their support on this journey. This work would not have been possible without your encouragement to pursue multidisciplinary research. I thank Dr. Vipin Kumar for his immense support, guidance, encouragement, and most of all that he was gracious despite my procrastination and supported me through the ups and downs of life on this journey. I immensely thank Dr. Blake Howald for his guidance and showing me the path through the fog of this multifaceted research. Most importantly, I am grateful for the generosity of his time, his encouraging comments, his feedback, and taking me through the finish line. My gratitude to Dr. Michael Steinbach for his gentle guidance, his feedback on this dissertation, and components of my research on this journey. He has always listened patiently and talked me through the challenges of my research. Thanks to Claudia Neuhauser, for teaching me how to look into complex societal problems in a multidisciplinary way and giving me hope that there is light at the end of the tunnel for the problems close to my heart. I thank Dr. Jeanette Gundel for her time, advice, and guidance through this journey. For her patience, as she explained linguistic concepts as we sat in her office going over my research. To the Information Retrieval, and Computational Linguistics research community who guided me in conferences and workshops by providing feedback during my presentations. Specifically, to the presenters at 6th ESSIR (European Summer School in Information Retrieval) in Glasgow in setting a direction to my research. Your guidance to research in Information Retrieval for resource scarce languages was invaluable. My special thanks for Stephen Robertson, and Leif Azzopardi who on multiple occasions encouraged and provided direction. To Karim Darwish, who explained in detail the complexity of writing a stemmer for Arabic. I am truly standing on the shoulders of giants. To my supervisors and managers who supported me through this journey. Without their support this journey could not have been possible. Most importantly, I am thankful to my family, who had to suffer through “wasted” snow days, missing fall colors, shortened spring breaks, and at times a messy house, as I was busy in research. We will make up the time! ii Contents Contents .............................................................................................................................. iii Table of Figures ................................................................................................................... x List of Tables ....................................................................................................................... xi 1 Synopsis....................................................................................................................... 1 1.1 Morphological Rich Languages............................................................................. 2 1.2 Proposition ........................................................................................................... 5 1.3 Named Entity Recognition (NER) ......................................................................... 6 1.4 Search ................................................................................................................... 8 1.4.1 Keyword Search ............................................................................................ 8 1.4.2 Concept-based Search .................................................................................. 9 1.5 Urdu .................................................................................................................... 10 1.6 Methodology ...................................................................................................... 11 1.6.1 Enabling Technologies ................................................................................ 12 1.6.2 Evaluation Measures ................................................................................... 12 1.6.3 Base-line for Urdu Search ........................................................................... 14 1.6.4 NER for Urdu ............................................................................................... 14 1.7 Other Language Evaluation ................................................................................ 16 1.8 Experiment Sketch and Results .......................................................................... 16 1.9 Analysis ............................................................................................................... 17 1.10 Roadmap ......................................................................................................... 18 2 Named Entity Recognition ........................................................................................ 19 2.1 Introduction........................................................................................................ 19 2.1.1 Named Entity .............................................................................................. 19 iii 2.1.2 The Name Entity Task (Message Understanding Conference) ................... 20 2.1.3 Applications of Name Recognition ............................................................. 20 2.2 General Challenges in NER ................................................................................. 23 2.2.1 Ambiguity of Proper Names ........................................................................ 24 2.2.2 NER as an Information Extraction task ....................................................... 25 2.2.3 Evaluation Issues in IE and in NER .............................................................. 26 2.2.4 Architecture of the NER System ................................................................. 27 2.3 Name Recognition Systems and their Approaches ............................................ 28 2.3.1 Nymble: a High Performance Learning Name Finder ................................ 29 2.3.2 NetOwl™ Extractor from IsoQuest ............................................................. 30 2.3.3 Nominator: IBM T.J. Watson Research Center ........................................... 32 2.3.4 Nationality Specific Methods ...................................................................... 32 2.4 Conditional Random Fields ................................................................................ 33 2.4.1 Background ................................................................................................. 34 2.4.2 Hidden Markov Models .............................................................................. 34 2.4.3 Conditional Models ..................................................................................... 36 2.4.4 Maximum Entropy Markov Model (MEMM) .............................................. 36 2.4.5 Conditional Random Fields (CRF) ................................................................ 38 2.4.6 Use of Condition Random Fields for NER ................................................... 41 2.5 Recent Trends in Statistical and Machine Learning Approaches ....................... 42 2.5.1 Word2vec .................................................................................................... 42 2.5.2 Deep Learning ............................................................................................. 43 2.6 Reflections on Deep Learning and Word Embeddings for Morphologically Rich Languages ..................................................................................................................... 44 iv 2.7 Summary ............................................................................................................ 45 3 Search – Information Retrieval ................................................................................. 46 3.1 Named Entities in Search ................................................................................... 47 3.1.1 Challenges of Names in Search ................................................................... 48 3.2 The Search Challenge ......................................................................................... 49 3.3 Keyword Based Search ....................................................................................... 51 3.3.1 Boolean Keyword Searching ....................................................................... 51 3.3.2 Ranked Retrieval ......................................................................................... 53 3.3.3 Probabilistic Retrieval ................................................................................. 53 3.3.4 Vector Space Model – a variant of Ranked Retrieval ................................. 54 3.3.5 Challenges of Keyword search .................................................................... 56 3.4 Traditional Concept Searching
Recommended publications
  • [2010] Manuscript Learnability Ver 2
    PERSPECTIVES ON KANO Copyright 2010 ©A Jama'ar Inuwar Kano (Kano Foundation) Initiative for kanoonline.com First Published in Nigeria December, 2010 lSBN: 978-8092-53-5 All rights reserved. No part of this book may be reproduced or. utilized in any form or by any means, electronic or mechanical, including photocopying and recording or by any information storage and retrieval system without written the or permission of the author. Printed & Bound by Tellettes Consulting Coy Ltd • 2 PERSPECTIVES ON KANO Contents CHAPTER 1 The Kano Physical Environment 7 CHAPTER2 Assessment of Water Demand Pattern in Sub-Saharan Africa: A Case Study of the Greater Kano Area, Nigeria 47 CHAPTER3 Aspects of Kano Cultural Tourism 57 CHAPTER4 The Structure of Kano Economy 83 CHAPTERS Introduction, Spread and Development of Islam in Kano Since 1350 A.D. 111 CHAPTER6 Manuscript Learnability and Indig;enous Knowledge for Development-The Kano Hausa Ajami in Historical Context 139 CHAPTER? The Influence of North African Arabs on Kano City 200 CHAPTERS Multiculturalism in Kano State of Nigeri.a- Processes and Dynamics 239 CHAPTER9 A Participatory Account of the Jihad in Kano 268 CHAPTER 10 Sarki Goma: The Sakkwato Jihad and the Transformation of the Hausa Built Environment 290 CHAPTER 11 Community Mobilization in Traditional Societies: A Contextual Appraisal of the Role of Oramedia in tl1e 1804 Usman Dan Fodio Jihad Movement 311 CHAPTER 12 The Hajj Exercise in Kano: Challenges, Constraints and Drawbacks 339 CHAPTER 13 Biography of Select Kano Merchants, 1853-1955 373 CHAPTER 14 Language Patterns, Etiquette and Address Forms in Kano Emir's Palace.
    [Show full text]
  • IJCNLP 2011 Proceedings of the Workshop on Advances in Text Input Methods (WTIM 2011)
    IJCNLP 2011 Proceedings of the Workshop on Advances in Text Input Methods (WTIM 2011) November 13, 2011 Shangri-La Hotel Chiang Mai, Thailand IJCNLP 2011 Proceedings of the Workshop on Advances in Text Input Methods (WTIM 2011) November 13, 2011 Chiang Mai, Thailand We wish to thank our sponsors Gold Sponsors www.google.com www.baidu.com The Office of Naval Research (ONR) Department of Systems Engineering and The Asian Office of Aerospace Research and Devel- Engineering Managment, The Chinese Uni- opment (AOARD) versity of Hong Kong Silver Sponsors Microsoft Corporation Bronze Sponsors Chinese and Oriental Languages Information Processing Society (COLIPS) Supporter Thailand Convention and Exhibition Bureau (TCEB) We wish to thank our sponsors Organizers Asian Federation of Natural Language National Electronics and Computer Technolo- Processing (AFNLP) gy Center (NECTEC), Thailand Sirindhorn International Institute of Technology Rajamangala University of Technology Lanna (SIIT), Thailand (RMUTL), Thailand Chiang Mai University (CMU), Thailand Maejo University, Thailand c 2011 Asian Federation of Natural Language Proceesing vii Preface Welcome to the IJCNLP Workshop on Advances in Text Input Methods (WTIM 2011)! Methods of text input have entered a new era. The number of people who have access to computers and mobile devices is skyrocketing in regions where people do not have a convenient method of inputting their native language. It has also become commonplace to input text not through a keyboard but through different modes such as voice and handwriting recognition. Even when people input text using a keyboard, it is done differently from only a few years ago – adaptive software keyboards, word auto- completion and prediction, and spell correction are just a few examples of such recent changes in text input experience.
    [Show full text]
  • Proposal to Encode Bosnian Arabic Characters
    Proposal to encode Bosnian Arabic characters Denis Moyogo Jacquerye <[email protected]> 2019-10-03 1. Introduction The Arabic alphabet has been used to write Bosnian from the 15th to the mid 20th century. In the early 20th century Mehmed Džemaludin Čaušević extended the alphabet, adding several letters including two letters that are proposed for encoding: • ARABIC LETTER NOON WITH INVERTED SMALL V for nj [ɲ], • ARABIC LETTER HAH WITH INVERTED SMALL V BELOW for ć [tɕ]. Other additional letters, which are already encoded, were used in this orthography: ,[For c [ts ڄ ARABIC LETTER DYEH 0684 • ,[For č [tʃ چ ARABIC LETTER TCHEH 0686 • ,[was used for both dž [dʒ] and đ [dʑ ج 062C ARABIC LETTER JEEM • ,[For lj [ʎ ڵ 06B5 ARABIC LETTER LAM WITH SMALL V • ,[For [u ۆ 06C6 ARABIC LETTER OE • .[For [o ۉ 06C9 ARABIC LETTER KIRGHIZ YU • Čaušević’s alphabet was used in publications from 1908 until the 1940s. That alphabet is referred to by several names: arebica, harfovica, matufovica, matufovača, mektebica. According to Pjanić (2009), several periodicals used arebica: Tarik (1908-1911), Muallim (1910-1913), Misbah (1912-1913) and Jeni Sabah (1914); and also the last publication using Arebica was Seid Serdarević Fikh ʾulʿibādāt (1941). Those two letters are mentioned in descriptions of Bosnian Arebica such as Bourgeois (1913), Lehfeldt (2001), Gažáková (2014). In recent years, there has been interest in Čaušević’s orthography. Notably in works by Al-Zubi and Cicak-Al-Zubi, in the Šegrt Suljica comics published in the Bosnian children magazine Elif. The 062C ARABIC LETTER is used for dž [dʒ] and another letter represents đ [dʑ], for ج JEEM example Al-Zubi and Cicak-Al-Zubi use 0620 ARABIC LETTER AIN WITH and a letter ARABIC LETTER HAH WITH SMALL V ڠ THREE DOTS ABOVE BELOW, which could potentially be encoded if in use, can be found in some character charts.
    [Show full text]
  • Urdu Zabta Takhti (UZT) 1.01 L2/02-004
    Ref: Proceedings of INMIC2001, Organised by IEEE & Lahore University of Management Sciences, Lahore, December 28-30, 2001, pp: 223-228 Urdu Computing Standards: Urdu Zabta Takhti (UZT) 1.01 Sarmad Hussain* and Muhammad Afzal** Introduction iv. Urdu aerab/diacritics (66 – 79, 123 – 126) Software development in Urdu has been v. Urdu characters (80 – 122) going on for more than three decades. However, vi. reserved control space (128 – 159, 255) until recently there were no industry standards for vii. special symbols (160 – 176, 192 – 199) coding in Urdu, similar to ASCII standard for viii. reserved expansion space (177 – 191, 200 – English. Therefore, all the individual or industry 207, 240 – 253) efforts were isolated and based on ad hoc mapping ix. vendor area (208 – 239) of Urdu characters on binary codes. Every x. toggle character (254) developer was using arbitrary code pages (character set and its mapping onto binary code), making it incompatible with other vendors. In addition, due to the competition in the industry these code pages have been well-guarded trade-secrets. Thus, this absence of a standard has been a significant hurdle to the development and propagation of Urdu software. Keeping in view the necessity and urgency of standardization needs, three years ago a national effort was initiated to get relevant people together and formulate a common standard for everybody to follow. This effort, detailed in another paper within this volume (Afzal and Hussain, 2001), resulted in the creation of Urdu Zabta Takhti (UZT) version 1.01. UZT 1.01 has been accepted by the Government of Pakistan (GoP) as the standard code page for Urdu.
    [Show full text]
  • The Impact of Arabic Orthography on Literacy and Economic Development in Afghanistan
    International Journal of Education, Culture and Society 2019; 4(1): 1-12 http://www.sciencepublishinggroup.com/j/ijecs doi: 10.11648/j.ijecs.20190401.11 ISSN: 2575-3460 (Print); ISSN: 2575-3363 (Online) The Impact of Arabic Orthography on Literacy and Economic Development in Afghanistan Anwar Wafi Hayat Department of Economics, Kabul University, Kabul, Afghanistan Email address: To cite this article: Anwar Wafi Hayat. The Impact of Arabic Orthography on Literacy and Economic Development in Afghanistan. International Journal of Education, Culture and Society . Vol. 4, No. 1, 2019, pp. 1-12. doi: 10.11648/j.ijecs.20190401.11 Received : October 15, 2018; Accepted : November 8, 2018; Published : January 31, 2019 Abstract: Currently, Pashto and Dari (Afghan Persian), the two official languages, and other Afghan languages are written in modified Arabic alphabets. Persian adopted the Arabic alphabets in the ninth century, and Pashto, in sixteenth century CE. This article looks at how the Arabic Orthography has hindered Literacy and Economic development in Afghanistan. The article covers a comprehensive analysis of Arabic Orthography adopted for writing Dari and Pashto, a study of the proposed Arabic Language reforms, and research conducted about reading and writing difficulty in Arabic script by Arab intellectuals. The study shows how adopting modified Latin alphabets for a language can improve literacy level which further plays its part in the economic development of a country. The article dives into the history of Romanization of languages in the Islamic World and its impact on Literacy and economic development in those countries. Romanization of the Afghan Official languages and its possible impact on Literacy, Economy, and Peace in Afghanistan is discussed.
    [Show full text]
  • Urdu Keypad Free Download
    Urdu keypad free download CLICK TO DOWNLOAD /6/8 · JoshPavety October 10, / Version: Urduayub Urdu Keyboard | By JoshPavetySubcategory: System Utilities. Download this app from Microsoft Store for Windows 10 Mobile, Windows Phone , Windows Phone 8. See screenshots, read the latest customer reviews, and compare ratings for Urdu Keyboard. easy urdu keyboard free download - Easy Urdu Keyboard - Urdu on Photos, Urdu and English Easy Keyboard, Easy Typing Urdu Keyboard Fonts And Themes, and many more programs. Urdu Keyboard app is free to download for everyone. Urdu keyboard App is very easy to use and easy Urdu typing. This is free android emojis keypad. Change the vibration and sound of typing in sound keyboard Urdu keyboard is one handed keyboard /5(). Free Download % CLEAN report malware Enhance your system with a native Urdu language phonetic keyboard by turning to this lightweight software solution that can be easily deployed. Urdu Paksign Keyboard for Windows 7, 8, 10 Free Download This package is just the keyboard from Pak Urdu Installer software. Urdu Paksign Keyboard is the best Urdu keyboard used for Urdu typing any where in Windows. Best available Urdu keyboard for free. /2/9 · Urdu Keypad Nastaleeq the typing system got a new version of English keyboard with Urdu keys in Urdu keypad english, Urdu keypad Phonetic Old type writer is remake due to the dire necessity of double language for local and international users for urduCategory: APP. Download computer urdu keyboard for free. System Utilities downloads - Phonetic Urdu Keyboard By Semanticsoft by TransUrdu and many more programs are available for instant and free download.
    [Show full text]
  • The World's 500 Most Influential Muslims, 2021
    PERSONS • OF THE YEAR • The Muslim500 THE WORLD’S 500 MOST INFLUENTIAL MUSLIMS • 2021 • B The Muslim500 THE WORLD’S 500 MOST INFLUENTIAL MUSLIMS • 2021 • i The Muslim 500: The World’s 500 Most Influential Chief Editor: Prof S Abdallah Schleifer Muslims, 2021 Editor: Dr Tarek Elgawhary ISBN: print: 978-9957-635-57-2 Managing Editor: Mr Aftab Ahmed e-book: 978-9957-635-56-5 Editorial Board: Dr Minwer Al-Meheid, Mr Moustafa Jordan National Library Elqabbany, and Ms Zeinab Asfour Deposit No: 2020/10/4503 Researchers: Lamya Al-Khraisha, Moustafa Elqabbany, © 2020 The Royal Islamic Strategic Studies Centre Zeinab Asfour, Noora Chahine, and M AbdulJaleal Nasreddin 20 Sa’ed Bino Road, Dabuq PO BOX 950361 Typeset by: Haji M AbdulJaleal Nasreddin Amman 11195, JORDAN www.rissc.jo All rights reserved. No part of this book may be repro- duced or utilised in any form or by any means, electronic or mechanic, including photocopying or recording or by any information storage and retrieval system, without the prior written permission of the publisher. Views expressed in The Muslim 500 do not necessarily reflect those of RISSC or its advisory board. Set in Garamond Premiere Pro Printed in The Hashemite Kingdom of Jordan Calligraphy used throughout the book provided courte- sy of www.FreeIslamicCalligraphy.com Title page Bismilla by Mothana Al-Obaydi MABDA • Contents • INTRODUCTION 1 Persons of the Year - 2021 5 A Selected Surveyof the Muslim World 7 COVID-19 Special Report: Covid-19 Comparing International Policy Effectiveness 25 THE HOUSE OF ISLAM 49 THE
    [Show full text]
  • Creating Standards
    Creating Standards Unauthenticated Download Date | 6/17/19 6:48 PM Studies in Manuscript Cultures Edited by Michael Friedrich Harunaga Isaacson Jörg B. Quenzer Volume 16 Unauthenticated Download Date | 6/17/19 6:48 PM Creating Standards Interactions with Arabic Script in 12 Manuscript Cultures Edited by Dmitry Bondarev Alessandro Gori Lameen Souag Unauthenticated Download Date | 6/17/19 6:48 PM ISBN 978-3-11-063498-3 e-ISBN (PDF) 978-3-11-063906-3 e-ISBN (EPUB) 978-3-11-063508-9 ISSN 2365-9696 This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. For details go to http://creativecommons.org/licenses/by-nc-nd/4.0/. Library of Congress Control Number: 2019935659 Bibliographic information published by the Deutsche Nationalbibliothek The Deutsche Nationalbibliothek lists this publication in the Deutsche Nationalbibliografie; detailed bibliographic data are available on the Internet at http://dnb.dnb.de. © 2019 Dmitry Bondarev, Alessandro Gori, Lameen Souag, published by Walter de Gruyter GmbH, Berlin/Boston Printing and binding: CPI books GmbH, Leck www.degruyter.com Unauthenticated Download Date | 6/17/19 6:48 PM Contents The Editors Preface VII Transliteration of Arabic and some Arabic-based Script Graphemes used in this Volume (including Persian and Malay) IX Dmitry Bondarev Introduction: Orthographic Polyphony in Arabic Script 1 Paola Orsatti Persian Language in Arabic Script: The Formation of the Orthographic Standard and the Different Graphic Traditions of Iran in the First Centuries of
    [Show full text]
  • Arabic Samaritan Yezidi
    The Unicode® Standard Version 14.0 – Core Specification To learn about the latest version of the Unicode Standard, see https://www.unicode.org/versions/latest/. Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks. Where those designations appear in this book, and the publisher was aware of a trade- mark claim, the designations have been printed with initial capital letters or in all capitals. Unicode and the Unicode Logo are registered trademarks of Unicode, Inc., in the United States and other countries. The authors and publisher have taken care in the preparation of this specification, but make no expressed or implied warranty of any kind and assume no responsibility for errors or omissions. No liability is assumed for incidental or consequential damages in connection with or arising out of the use of the information or programs contained herein. The Unicode Character Database and other files are provided as-is by Unicode, Inc. No claims are made as to fitness for any particular purpose. No warranties of any kind are expressed or implied. The recipient agrees to determine applicability of information provided. © 2021 Unicode, Inc. All rights reserved. This publication is protected by copyright, and permission must be obtained from the publisher prior to any prohibited reproduction. For information regarding permissions, inquire at https://www.unicode.org/reporting.html. For information about the Unicode terms of use, please see https://www.unicode.org/copyright.html. The Unicode Standard / the Unicode Consortium; edited by the Unicode Consortium. — Version 14.0. Includes index. ISBN 978-1-936213-29-0 (https://www.unicode.org/versions/Unicode14.0.0/) 1.
    [Show full text]
  • Proposal to Encode the Khwarezmian Script in Unicode
    L2/18-010R 2018-03-26 Proposal to encode the Khwarezmian script in Unicode Anshuman Pandey [email protected] March 26, 2018 1 Introduction This proposal is a substantial revision and expansion of the following: • L2/17-054R: “Proposal to encode the Khwarezmian script in Unicode” It incorporates comments provided by the UTC Script Ad Hoc Committee in: • L2/17-255: Recommendations to UTC #152 July-August 2017 on Script Proposals • L2/18-039: Recommendations to UTC #154 January 2018 on Script Proposals Major changes include: • Definition of the script as a cursive joining abjad • Details on joining properties, shaping behaviors, and usage of • Addition of a detached form of aleph and a vocalization sign for waw and yodh • Encoded representations of examples words and numbers • Additional specimens showing usage of the script 2 Background The proposed script was used between the 2nd and 9th centuries for writing Khwarezmian (ISO 639-3: xco), a now-extinct Eastern Iranian language that was spoken in the Oxus (Amu Darya) river delta. This region in Central Asia is known in the Avesta as ନରଌଭଆଓ hvâirizem (Yašt 10.5.14). The Achaemenids re- ferred to it as ΢κΠμπηΡρ uvârazmiš (XPh inscription, lines 21–22). It is called xvārazm in and this , خوارزمclassical Persian, which is transcribed as ‘Khwarezm’ in English. The Greeks knew it as Χορασμία hellenic form entered the English lexicon as ‘Chorasmia’. The territory lies across portions of present-day Uzbekistan, Kazakhstan, and Turkmenistan. Both ‘Khwarezm’ and ‘Chorasmia’ are used in modern English, along with the adjectives ‘Khwarezmian’ and ‘Chorasmian’, and the variant spellings ‘Khwarazmian’ and ‘Choresmian’.
    [Show full text]
  • Urdu Keyboard Label Instructions and Specifications
    Smart Keyboard Solutions 1855 E Southern Avenue, Suite #213 Mesa, AZ 85204 Phone: 877-477-1988 Visit our web site at: Buy this product online SmartKeyboardSolutions.com Urdu Keyboard Label Instructions and Specifications Table of Contents Product Description Configuring Windows 8 for Urdu Configuring Windows 7 and Vista for Urdu Configuring Windows XP for Urdu Configuring Microsoft Office for Urdu How to Install the Labels How to Use the Keyboard Layout in Windows 8 How to Use the Keyboard Layout in Windows 7, Vista, and XP How to Type Urdu Characters Product Features Product Description: The Urdu keyboard labels are clear labels with Urdu characters on the right side. This allows you to convert any keyboard to a bilingual Urdu keyboard. The labels are available in green (for light or beige colored keyboards) and white (for black keyboards). 1 Language Compatibility. The Urdu keyboard labels are compatible with the Windows keyboard layouts used in Pakistan. Windows Compatibility. The Urdu keyboard labels are compatible with the Urdu keyboard layout in Windows 8, 7, Vista, and XP. The labels might be compatible with other versions of Windows, but they have not been tested to ensure complete compatibility. Because the language uses right-to-left text entry, certain keys are handled differently depending upon the software. These keys, ( ) { } [ ] < >, appear in one way in Microsoft Office programs, but are different in other programs. We include stickers for both layouts so ensure the keyboard layout you install matches your software. Configuring Windows 8 for Urdu Instructions for a Touch Screen Windows 8.x 1. Swipe right to left on the screen and tap the Settings item.
    [Show full text]
  • Urdu Word Processor – Standards and Guidelines
    J. Appl. Environ. Biol. Sci. , 4(7S)185-192, 2014 ISSN: 2090-4274 Journal of Applied Environmental © 2014, TextRoad Publication and Biological Sciences www.textroad.com Urdu Word Processor – Standards and Guidelines Uzair Muhammad, S. Tahir Ali Jan, M. Shahid Sultan, M. Naeem Faculty of Computing, Riphah International University at Islamabad Received: September 1, 2014 Accepted: November 13, 2014 ABSTRACT Transliteration and word processing in Urdu is becoming under limelight in the last decade. Though there has been an immense addition of word processing features in contemporary word processors for English but as far as Urdu word processing is concerned, the researchers are seen mostly at the side of finding single aspect resolutions or optimization of Urdu language processing problems but tactlessly they give their integration (as one single text editing solution) a very less significance. Integrated solutions for the fulfillment of modern Urdu content editing needs are found to be very fewer and feeble at present. Because of having no compact word processing solution to retort present-day Urdu word processing needs, there is a great need for an Urdu word processor that could be able to justify the contemporary text editing needs of people who can read and write Urdu language.This paper presents features and requirements for a state of the art Unicode based Urdu word processor described as per the present day necessities. It also considers the comparative analysis of the present day word processing solutions, their blemishes, and their strengths over each other and what more can be done in this area to promote Urdu word processing would be described in this paper as well.
    [Show full text]