Dirêjî Kurdî: a Lexicographic Environment for Kurdish Language
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Ukrainian ASCII-Cyrillic
This is the ASCII-Cyrillic Home Page, PDF rendition. N.B. The bitmaps probably look best at 100% size! ASCII-Cyrillic and its converter email-ru.tex (beta version) A new faithful ASCII representation for Russian called ASCII-Cyrillic is presented here, one which permits accurate typing and reading of Russian where no Russian keyboard or font is available -- as often occurs outside of Russia. ASCII-Cyrillic serves the Russian and Ukrainian languages in parallel. This brief introduction is initially for Russian; but, further along, come the modifications needed to adapt to the Ukrainian alphabet. Here is a fragment of Russian email. As far as the email system was concerned, the email message was roughly a sequence of "octets" or "bytes" (each 8 zeros or ones); where each octet corresponds to a character according to some 8-bit encoding. As originally typed and sent, it is probably readable (using a 8-bit Russian screen font) on most computers in any country where a Cyrillic alphabet is indigenous --- but rarely beyond. (The GIF image you see here is widely readable, but at least 10 times as bulky, and somewhat hazy too.) The portability of 8-bit Cyrillic text is hampered by the frequent need to re-encode for another computer operating system. When the targeted encoding does not contain all the characters used, reencoding can become not just inconvenient but downright problematic. The utility "email-ru.tex" converts this 8-bit text to and from ASCII-Cyrillic, the new 7-bit ASCII transcription of Russian. This scheme was designed to be both typeable and readable on every computer worldwide: Na obratnom puti !Gardine obq'asnila mne, kak delath peresadku na metro. -
Technical Reference Manual for the Standardization of Geographical Names United Nations Group of Experts on Geographical Names
ST/ESA/STAT/SER.M/87 Department of Economic and Social Affairs Statistics Division Technical reference manual for the standardization of geographical names United Nations Group of Experts on Geographical Names United Nations New York, 2007 The Department of Economic and Social Affairs of the United Nations Secretariat is a vital interface between global policies in the economic, social and environmental spheres and national action. The Department works in three main interlinked areas: (i) it compiles, generates and analyses a wide range of economic, social and environmental data and information on which Member States of the United Nations draw to review common problems and to take stock of policy options; (ii) it facilitates the negotiations of Member States in many intergovernmental bodies on joint courses of action to address ongoing or emerging global challenges; and (iii) it advises interested Governments on the ways and means of translating policy frameworks developed in United Nations conferences and summits into programmes at the country level and, through technical assistance, helps build national capacities. NOTE The designations employed and the presentation of material in the present publication do not imply the expression of any opinion whatsoever on the part of the Secretariat of the United Nations concerning the legal status of any country, territory, city or area or of its authorities, or concerning the delimitation of its frontiers or boundaries. The term “country” as used in the text of this publication also refers, as appropriate, to territories or areas. Symbols of United Nations documents are composed of capital letters combined with figures. ST/ESA/STAT/SER.M/87 UNITED NATIONS PUBLICATION Sales No. -
Bulletin of the School of Oriental and African Studies Vowel Harmony In
Bulletin of the School of Oriental and African Studies http://journals.cambridge.org/BSO Additional services for Bulletin of the School of Oriental and African Studies: Email alerts: Click here Subscriptions: Click here Commercial reprints: Click here Terms of use : Click here Vowel harmony in nounandparticle words in the Tibetan of Baltistan R. K. Sprigg Bulletin of the School of Oriental and African Studies / Volume 43 / Issue 03 / October 1980, pp 511 519 DOI: 10.1017/S0041977X00137413, Published online: 24 December 2009 Link to this article: http://journals.cambridge.org/abstract_S0041977X00137413 How to cite this article: R. K. Sprigg (1980). Vowel harmony in nounandparticle words in the Tibetan of Baltistan. Bulletin of the School of Oriental and African Studies, 43, pp 511519 doi:10.1017/S0041977X00137413 Request Permissions : Click here Downloaded from http://journals.cambridge.org/BSO, IP address: 212.219.238.14 on 17 Oct 2012 VOWEL HARMONY IN NOUN-AND-PARTICLE WORDS IN THE TIBETAN OF BALTISTAN1 By R. K. SPEIGG I. Noun-and-definite-particle ivords In an earlier study, ' Assimilation, and the definite nominal particle in Balti Tibetan' (Sprigg, 1972), I dealt with vowel harmony in words in which the noun is colligated with the definite nominal particle, a particle that has the constant phonetic form po when in junction with the final syllable of a noun ending in a consonant (-Cpo), e.g. smcnpo ' the medicine ' strum, but variable phonetic features where the final syllable of the noun ends in a vowel: a share in the features of the resulting word-final long vowel, either -o: or -u: according to vowel harmony.2 The former of these two long vowels applies to the ' open ' type of junction, e.g. -
Micrometl SUBMITTAL
WEIGHT: DATE: Part Number: SUBMITTAL MicroMetl 204 lbs 8/11 0821-1725-DT SUBMITTED TO: JOB NAME: COMPANY: EQUIPMENT: DRAWN BY: NOTES: NEW CARRIER UNITS (CAR-537) 48/50DJ,HJ,TJ,TM,48TF 004-007; 50LJ,TFF,TFQ,TJQ 004-007; 50/48GJ,HE,48GJD,GJE,GJF 006; 48/50HC A04,B04; 48/50HE 003-005; 48/50HM,48HJ,50TJ 007; 48/50LC,50HCQ A04-A06; 48/50HJ,50QJ 004; 48LJ 005-007; 48/50TC A04-A07,B04-B07; 50HEQ 003-006; 50HJQ 004-007; 50LJQ 004-006; 50QJ 005-006; 50TCQ A04-A07; 48/50HC A05-A06,B05-B06 EXISTING UNIT (MMC-352) CARRIER: 48GH 036-060; 48GL 042-060; 48KH,KHA 036-060,136; 48KL 042-048,142-160; 48KLA 142-160 BRYANT: 579E,J 060; 585C,D,E,G,H,J 036100 ,036125,042-060 YORK: D1EC,D2EC 048-060 Features: • Fully assembled curb adaptor. • Includes internal duct transitions. • Internally insulated with 1” - 1.5” lb insulation. Gasketing package provided. • Adaptor pans and supports provided / field installed Curb Adaptor Information • To verify that the curb adaptor shown on this page is the correct adaptor for you job be sure the existing curb is the same size as the dimensions provided. (The bottom dimensions of curb adaptor are larger than existing curb outside dimension.) • Before new HVAC unit is set in place, inspect structural stability of existing curb and building’s roof load capability. Reinforce if required. • All curb adaptors will increase the systems external static pressure and must be included when calculating unit requirements. • Curb adaptors are designed to attach to an existing curb with side x side duct connections. -
Nastaleeq: a Challenge Accepted by Omega
Nastaleeq: A challenge accepted by Omega Atif Gulzar, Shafiq ur Rahman Center for Research in Urdu Language Processing, National University of Computer and Emerging Sciences, Lahore, Pakistan atif dot gulzar (at) gmail dot com, shafiq dot rahman (at) nu dot edu dot pk Abstract Urdu is the lingua franca as well as the national language of Pakistan. It is based on Arabic script, and Nastaleeq is its default writing style. The complexity of Nastaleeq makes it one of the world's most challenging writing styles. Nastaleeq has a strong contextual dependency. It is a cursive writing style and is written diagonally from right to left. The overlapping shapes make the nuqta (dots) and kerning problem even harder. With the advent of multilingual support in computer systems, different solu- tions have been proposed and implemented. But most of these are immature or platform-specific. This paper discuses the complexity of Nastaleeq and a solution that uses Omega as the typesetting engine for rendering Nastaleeq. 1 Introduction 1.1 Complexity of the Nastaleeq writing Urdu is the lingua franca as well as the national style language of Pakistan. It has more than 60 mil- The Nastaleeq writing style is far more complex lion speakers in over 20 countries [1]. Urdu writing than other writing styles of Arabic script{based lan- style is derived from Arabic script. Arabic script has guages. The salient features`r of Nastaleeq that many writing styles including Naskh, Sulus, Riqah make it more complex are these: and Deevani, as shown in figure 1. Urdu may be • Nastaleeq is a cursive writing style, like other written in any of these styles, however, Nastaleeq Arabic styles, but it is written diagonally from is the default writing style of Urdu. -
Sino-Tibetan Numeral Systems: Prefixes, Protoforms and Problems
Sino-Tibetan numeral systems: prefixes, protoforms and problems Matisoff, J.A. Sino-Tibetan Numeral Systems: Prefixes, Protoforms and Problems. B-114, xii + 147 pages. Pacific Linguistics, The Australian National University, 1997. DOI:10.15144/PL-B114.cover ©1997 Pacific Linguistics and/or the author(s). Online edition licensed 2015 CC BY-SA 4.0, with permission of PL. A sealang.net/CRCL initiative. PACIFIC LINGUISTICS FOUNDING EDITOR: Stephen A. Wunn EDITORIAL BOARD: Malcolm D. Ross and Darrell T. Tryon (Managing Editors), Thomas E. Dutton, Nikolaus P. Himmelmann, Andrew K. Pawley Pacific Linguistics is a publisher specialising in linguistic descriptions, dictionaries, atlases and other material on languages of the Pacific, the Philippines, Indonesia and southeast Asia. The authors and editors of Pacific Linguistics publications are drawn from a wide range of institutions around the world. Pacific Linguistics is associated with the Research School of Pacific and Asian Studies at the Australian National University. Pacific Linguistics was established in 1963 through an initial grant from the Hunter Douglas Fund. It is a non-profit-making body financed largely from the sales of its books to libraries and individuals throughout the world, with some assistance from the School. The Editorial Board of Pacific Linguistics is made up of the academic staff of the School's Department of Linguistics. The Board also appoints a body of editorial advisors drawn from the international community of linguists. Publications in Series A, B and C and textbooks in Series D are refereed by scholars with re levant expertise who are normally not members of the editorial board. -
On a Collection of Australian and Asiatic Bees
Utah State University DigitalCommons@USU Ca Bee Lab 3-27-1907 On a Collection of Australian and Asiatic Bees T. D. A. Cockerell University of Colorado Follow this and additional works at: https://digitalcommons.usu.edu/bee_lab_ca Part of the Entomology Commons Recommended Citation Cockerell, T. D. A., "On a Collection of Australian and Asiatic Bees" (1907). Ca. Paper 386. https://digitalcommons.usu.edu/bee_lab_ca/386 This Bulletin is brought to you for free and open access by the Bee Lab at DigitalCommons@USU. It has been accepted for inclusion in Ca by an authorized administrator of DigitalCommons@USU. For more information, please contact [email protected]. 59.57.9\J(\l-\) Article X.- ON A COLLECTIO~ OF .-\T.:STilALL\N AND ASIATIC BEES. I BY T. D. A. COCKERELL. l1. I The bees which form the subject of this paper bd_cnged t,) the collertion of the well-known entornologi~t Henry EdwarJs , and arc the prop erty of the .~rnerican JHu~~um_ of Natural History. :,.1r. Bemcnmi.iller informs roe that the Australian specimens were collected by '\V. ::\lack·ay, but the collectors of the othcrs are unfortuna tely unknmYn. Of peculiar inkrest I is the little series from Amboina. So far as I can ascertain, ten species of l bees have been recor<led from th at island; ele-..en are now added, making I a total of 21. The first records were three by Fabriciu s in 1S0!, based ,m i specimens obtained from Billardier. These Fabrician cpccir s were very i briefly described ; one of them, a Crocisa, was supposed to be ,Yell-kr.own, ' but a topotypical specimen now before me shows that the species has been misunderstood; the second, a Megach·ile, is not kno,,.-n to modern writers, unless it is identical with one later described by Smith - as is indeed prob able; the third, suppo ~ed to be an A nt!toplwra, is unknown to-day, b.ut the abdomen has a singi,~ con, picucus fulrnus band, which should make the insect easily recognizable. -
Oj Divchyno Krasna (Oh, Beautiful Maiden)
0088 Oj divchyno krasna (Oh, beautiful maiden) Alexander Jacobchuk Moderato p # j Soprano 1 ° 3 j œ œ œ ˙ œ œ œ œ œ œ ˙ œ œ œ œ & 4 œ œ œ J J J J Oj div- chy - no krаs - nа, vyj - dy, vyj - dy z khа - ty, rа - zom z nа - my, p Soprano 2 # 3 j j j j j j & 4 œ œ œ œ œ œ ˙ œ œ œ œ œ œ ˙ œ œ œ œ Oj div- chy - no krаs - nа, vyj - dy, vyj - dy z khа - ty, rа - zom z nа - my, p Alto 1 # 3 & 4 ˙™ ˙ œ œ ˙™ ˙ œ œ ˙ œ Oj kras - na, z kha - ty, vyj - p Alto 2 # 3 & 4 ˙™ ˙ œ œ ˙™ ˙ œ œ ˙ œ ¢ Oj kras - na, z kha - ty, vyj - 6 # j j S1 ° œ œ œ œ œ œ œ œ œ 2 ˙ 3 œ œ œ œ œ œ œ œ œ œ œ œ œ 2 & J J J J œ J 4 4 J J J J J J œ J 4 pа-rub-kа- my, zа - ko - lja - du - vа - ty, rа-zom znа - my, pа - rub-kа- my, zа - ko - lja - du - va- # j j j S2 œ œ j j j œ 2 3 j œ œ œ œ j j j œ 2 & J J œ œ œ œ œ œ œ 4 ˙ 4#œ œ J J œ nœ œ œ œ œ œ 4 pа-rub-kа- my, zа - ko - lja - du - vа - ty, rа-zom znа - my, pа - rub-kа- my, zа - ko - lja - du - va- A1 # 2 3 j j j j 2 & ˙ œ œ œ œ œ 4 œ œ 4 ˙ œ #œ œ œ nœ œ œ œ œ 4 de, ko - lja - du - va - ty, ra - zom z pa-rub-ka- my ko - lja - du - va - A2 # 2 3 j j j j 2 & ˙ œ œ œ œ œ 4 œ œ 4 ˙ œ #œ œ œ nœ œ œ œ œ 4 ¢ d.e, ko - lja - du - va - ty, ra - zom z pa-rub-ka- my ko - lja - du - va - 12 S1 # j j œ ° 2 ˙ 3 œ œ œ œ œ ˙ œ œ œ œ œ ˙ & 4 4 œ J J ty. -
Low Resolution Arabic Recognition with Multidimensional Recurrent Neural Networks
Low Resolution Arabic Recognition with Multidimensional Recurrent Neural Networks Sheikh Faisal Rashid Image Understanding and Pattern Recognition (IUPR) Technical University Kaiserslautern, Germany [email protected] Marc-Peter Schambach Jörg Rottland Stephan von der Nüll Siemens AG Siemens AG Siemens AG Bücklestraße 1-5, 78464 Bücklestraße 1-5, 78464 Bücklestraße 1-5, 78464 Konstanz, Germany Konstanz, Germany Konstanz, Germany marc-peter.schambach [email protected] stephan.vondernuell @siemens.com @siemens.com ABSTRACT OCR of multi-font Arabic text is difficult due to large vari- ations in character shapes from one font to another. It be- comes even more challenging if the text is rendered at very low resolution. This paper describes a multi-font, low reso- lution, and open vocabulary OCR system based on a multi- dimensional recurrent neural network architecture. For this work, we have developed various systems, trained for single- font/single-size, single-font/multi-size, and multi-font/multi- size data of the well known Arabic printed text image data- base (APTI). The evaluation tasks from the second Arabic Figure 1: Example of low resolution Arabic word text recognition competition, organized in conjunction with image. Font: Diwani Letter at 6 point. The figure 1 ICDAR 2013 , have been adopted. Ten Arabic fonts in six is rescaled for better visual appearance. font size categories are used for evaluation. Results show that the proposed method performs very well on the task of printed Arabic text recognition even for very low resolution and small font size images. Overall, the system yields above Arabic text due to various potential applications like mail 99% recognition accuracy at character and word level for sorting, electronic processing of bank checks, official forms most of the printed Arabic fonts. -
Middle East-I 9 Modern and Liturgical Scripts
The Unicode® Standard Version 13.0 – Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks. Where those designations appear in this book, and the publisher was aware of a trade- mark claim, the designations have been printed with initial capital letters or in all capitals. Unicode and the Unicode Logo are registered trademarks of Unicode, Inc., in the United States and other countries. The authors and publisher have taken care in the preparation of this specification, but make no expressed or implied warranty of any kind and assume no responsibility for errors or omissions. No liability is assumed for incidental or consequential damages in connection with or arising out of the use of the information or programs contained herein. The Unicode Character Database and other files are provided as-is by Unicode, Inc. No claims are made as to fitness for any particular purpose. No warranties of any kind are expressed or implied. The recipient agrees to determine applicability of information provided. © 2020 Unicode, Inc. All rights reserved. This publication is protected by copyright, and permission must be obtained from the publisher prior to any prohibited reproduction. For information regarding permissions, inquire at http://www.unicode.org/reporting.html. For information about the Unicode terms of use, please see http://www.unicode.org/copyright.html. The Unicode Standard / the Unicode Consortium; edited by the Unicode Consortium. — Version 13.0. Includes index. ISBN 978-1-936213-26-9 (http://www.unicode.org/versions/Unicode13.0.0/) 1. -
Iranian Contribution to the Art of Islamic Calligraphy
Electronic Articles Collections Prof. Dr. Amir H. Zekrgoo ZEKRGOO.COM Zekrgoo, Amir H. (1991). “Iranian’s Contribution to the Art of Islamic Calligraphy”, Journal of the Indian Museums. New Delhi: Museum Association of India, pp. 62-71. Iranian Contribution to the Art of Islamic Calligraphy The art of Islamic Calligraphy is considered to be one of the highest forms of artistic expression in the entire Muslim world. In view of its widespread use even-to-day, only second to the Roman Alphabet, the Arabic script which is the mother of all Islamic scripts, was developed at a much later date. If we compare the kind of script used by the nomadic Arabs of Hijaz before the appearance of Islam with the great progress it made in the wake of Islam and the revelation of the Holy Quran we discern on it the tremendous influence of Islam—the discipline, beauty and elegance which it lent to this script. A comparison of the two inscriptions written in Nabataean script (The Pre-Islamic script of Arabs) fig. I & 2 first dated 250 A.D. and the second dated 568 A.D. will reveal how little change has taken place in course of about 300 years until the birth of Islam. Immediately after the advent of Islam we witness the invention of the Kufic script. Unlike the primitive Nabataean script, the Kufic, an early Islamic script has a strong geometrical structure (Fig. 3). It has based mainly on long horizontal and short vertical lines with very minor curves. The Kufic later developed into various decorative styles. -
Cyrillic # Version Number
############################################################### # # TLD: xn--j1aef # Script: Cyrillic # Version Number: 1.0 # Effective Date: July 1st, 2011 # Registry: Verisign, Inc. # Address: 12061 Bluemont Way, Reston VA 20190, USA # Telephone: +1 (703) 925-6999 # Email: [email protected] # URL: http://www.verisigninc.com # ############################################################### ############################################################### # # Codepoints allowed from the Cyrillic script. # ############################################################### U+0430 # CYRILLIC SMALL LETTER A U+0431 # CYRILLIC SMALL LETTER BE U+0432 # CYRILLIC SMALL LETTER VE U+0433 # CYRILLIC SMALL LETTER GE U+0434 # CYRILLIC SMALL LETTER DE U+0435 # CYRILLIC SMALL LETTER IE U+0436 # CYRILLIC SMALL LETTER ZHE U+0437 # CYRILLIC SMALL LETTER ZE U+0438 # CYRILLIC SMALL LETTER II U+0439 # CYRILLIC SMALL LETTER SHORT II U+043A # CYRILLIC SMALL LETTER KA U+043B # CYRILLIC SMALL LETTER EL U+043C # CYRILLIC SMALL LETTER EM U+043D # CYRILLIC SMALL LETTER EN U+043E # CYRILLIC SMALL LETTER O U+043F # CYRILLIC SMALL LETTER PE U+0440 # CYRILLIC SMALL LETTER ER U+0441 # CYRILLIC SMALL LETTER ES U+0442 # CYRILLIC SMALL LETTER TE U+0443 # CYRILLIC SMALL LETTER U U+0444 # CYRILLIC SMALL LETTER EF U+0445 # CYRILLIC SMALL LETTER KHA U+0446 # CYRILLIC SMALL LETTER TSE U+0447 # CYRILLIC SMALL LETTER CHE U+0448 # CYRILLIC SMALL LETTER SHA U+0449 # CYRILLIC SMALL LETTER SHCHA U+044A # CYRILLIC SMALL LETTER HARD SIGN U+044B # CYRILLIC SMALL LETTER YERI U+044C # CYRILLIC