Internet Engineering Task Force (IETF) H. Alvestrand, Ed. Request for Comments: 5893 Google Category: Standards Track C
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Yiddish Diction in Singing
UNLV Theses, Dissertations, Professional Papers, and Capstones May 2016 Yiddish Diction in Singing Carrie Suzanne Schuster-Wachsberger University of Nevada, Las Vegas Follow this and additional works at: https://digitalscholarship.unlv.edu/thesesdissertations Part of the Language Description and Documentation Commons, Music Commons, Other Languages, Societies, and Cultures Commons, and the Theatre and Performance Studies Commons Repository Citation Schuster-Wachsberger, Carrie Suzanne, "Yiddish Diction in Singing" (2016). UNLV Theses, Dissertations, Professional Papers, and Capstones. 2733. http://dx.doi.org/10.34917/9112178 This Dissertation is protected by copyright and/or related rights. It has been brought to you by Digital Scholarship@UNLV with permission from the rights-holder(s). You are free to use this Dissertation in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s) directly, unless additional rights are indicated by a Creative Commons license in the record and/or on the work itself. This Dissertation has been accepted for inclusion in UNLV Theses, Dissertations, Professional Papers, and Capstones by an authorized administrator of Digital Scholarship@UNLV. For more information, please contact [email protected]. YIDDISH DICTION IN SINGING By Carrie Schuster-Wachsberger Bachelor of Music in Vocal Performance Syracuse University 2010 Master of Music in Vocal Performance Western Michigan University 2012 -
Technical Reference Manual for the Standardization of Geographical Names United Nations Group of Experts on Geographical Names
ST/ESA/STAT/SER.M/87 Department of Economic and Social Affairs Statistics Division Technical reference manual for the standardization of geographical names United Nations Group of Experts on Geographical Names United Nations New York, 2007 The Department of Economic and Social Affairs of the United Nations Secretariat is a vital interface between global policies in the economic, social and environmental spheres and national action. The Department works in three main interlinked areas: (i) it compiles, generates and analyses a wide range of economic, social and environmental data and information on which Member States of the United Nations draw to review common problems and to take stock of policy options; (ii) it facilitates the negotiations of Member States in many intergovernmental bodies on joint courses of action to address ongoing or emerging global challenges; and (iii) it advises interested Governments on the ways and means of translating policy frameworks developed in United Nations conferences and summits into programmes at the country level and, through technical assistance, helps build national capacities. NOTE The designations employed and the presentation of material in the present publication do not imply the expression of any opinion whatsoever on the part of the Secretariat of the United Nations concerning the legal status of any country, territory, city or area or of its authorities, or concerning the delimitation of its frontiers or boundaries. The term “country” as used in the text of this publication also refers, as appropriate, to territories or areas. Symbols of United Nations documents are composed of capital letters combined with figures. ST/ESA/STAT/SER.M/87 UNITED NATIONS PUBLICATION Sales No. -
Assessment of Options for Handling Full Unicode Character Encodings in MARC21 a Study for the Library of Congress
1 Assessment of Options for Handling Full Unicode Character Encodings in MARC21 A Study for the Library of Congress Part 1: New Scripts Jack Cain Senior Consultant Trylus Computing, Toronto 1 Purpose This assessment intends to study the issues and make recommendations on the possible expansion of the character set repertoire for bibliographic records in MARC21 format. 1.1 “Encoding Scheme” vs. “Repertoire” An encoding scheme contains codes by which characters are represented in computer memory. These codes are organized according to a certain methodology called an encoding scheme. The list of all characters so encoded is referred to as the “repertoire” of characters in the given encoding schemes. For example, ASCII is one encoding scheme, perhaps the one best known to the average non-technical person in North America. “A”, “B”, & “C” are three characters in the repertoire of this encoding scheme. These three characters are assigned encodings 41, 42 & 43 in ASCII (expressed here in hexadecimal). 1.2 MARC8 "MARC8" is the term commonly used to refer both to the encoding scheme and its repertoire as used in MARC records up to 1998. The ‘8’ refers to the fact that, unlike Unicode which is a multi-byte per character code set, the MARC8 encoding scheme is principally made up of multiple one byte tables in which each character is encoded using a single 8 bit byte. (It also includes the EACC set which actually uses fixed length 3 bytes per character.) (For details on MARC8 and its specifications see: http://www.loc.gov/marc/.) MARC8 was introduced around 1968 and was initially limited to essentially Latin script only. -
Arabic Alphabet - Wikipedia, the Free Encyclopedia Arabic Alphabet from Wikipedia, the Free Encyclopedia
2/14/13 Arabic alphabet - Wikipedia, the free encyclopedia Arabic alphabet From Wikipedia, the free encyclopedia َأﺑْ َﺠ ِﺪﯾﱠﺔ َﻋ َﺮﺑِﯿﱠﺔ :The Arabic alphabet (Arabic ’abjadiyyah ‘arabiyyah) or Arabic abjad is Arabic abjad the Arabic script as it is codified for writing the Arabic language. It is written from right to left, in a cursive style, and includes 28 letters. Because letters usually[1] stand for consonants, it is classified as an abjad. Type Abjad Languages Arabic Time 400 to the present period Parent Proto-Sinaitic systems Phoenician Aramaic Syriac Nabataean Arabic abjad Child N'Ko alphabet systems ISO 15924 Arab, 160 Direction Right-to-left Unicode Arabic alias Unicode U+0600 to U+06FF range (http://www.unicode.org/charts/PDF/U0600.pdf) U+0750 to U+077F (http://www.unicode.org/charts/PDF/U0750.pdf) U+08A0 to U+08FF (http://www.unicode.org/charts/PDF/U08A0.pdf) U+FB50 to U+FDFF (http://www.unicode.org/charts/PDF/UFB50.pdf) U+FE70 to U+FEFF (http://www.unicode.org/charts/PDF/UFE70.pdf) U+1EE00 to U+1EEFF (http://www.unicode.org/charts/PDF/U1EE00.pdf) Note: This page may contain IPA phonetic symbols. Arabic alphabet ا ب ت ث ج ح خ د ذ ر ز س ش ص ض ط ظ ع en.wikipedia.org/wiki/Arabic_alphabet 1/20 2/14/13 Arabic alphabet - Wikipedia, the free encyclopedia غ ف ق ك ل م ن ه و ي History · Transliteration ء Diacritics · Hamza Numerals · Numeration V · T · E (//en.wikipedia.org/w/index.php?title=Template:Arabic_alphabet&action=edit) Contents 1 Consonants 1.1 Alphabetical order 1.2 Letter forms 1.2.1 Table of basic letters 1.2.2 Further notes -
Shahmukhi to Gurmukhi Transliteration System: a Corpus Based Approach
Shahmukhi to Gurmukhi Transliteration System: A Corpus based Approach Tejinder Singh Saini1 and Gurpreet Singh Lehal2 1 Advanced Centre for Technical Development of Punjabi Language, Literature & Culture, Punjabi University, Patiala 147 002, Punjab, India [email protected] http://www.advancedcentrepunjabi.org 2 Department of Computer Science, Punjabi University, Patiala 147 002, Punjab, India [email protected] Abstract. This research paper describes a corpus based transliteration system for Punjabi language. The existence of two scripts for Punjabi language has created a script barrier between the Punjabi literature written in India and in Pakistan. This research project has developed a new system for the first time of its kind for Shahmukhi script of Punjabi language. The proposed system for Shahmukhi to Gurmukhi transliteration has been implemented with various research techniques based on language corpus. The corpus analysis program has been run on both Shahmukhi and Gurmukhi corpora for generating statistical data for different types like character, word and n-gram frequencies. This statistical analysis is used in different phases of transliteration. Potentially, all members of the substantial Punjabi community will benefit vastly from this transliteration system. 1 Introduction One of the great challenges before Information Technology is to overcome language barriers dividing the mankind so that everyone can communicate with everyone else on the planet in real time. South Asia is one of those unique parts of the world where a single language is written in different scripts. This is the case, for example, with Punjabi language spoken by tens of millions of people but written in Indian East Punjab (20 million) in Gurmukhi script (a left to right script based on Devanagari) and in Pakistani West Punjab (80 million), written in Shahmukhi script (a right to left script based on Arabic), and by a growing number of Punjabis (2 million) in the EU and the US in the Roman script. -
Techniques for Automatic Normalization of Orthographically Variant Yiddish Texts
City University of New York (CUNY) CUNY Academic Works All Dissertations, Theses, and Capstone Projects Dissertations, Theses, and Capstone Projects 2-2015 Techniques for Automatic Normalization of Orthographically Variant Yiddish Texts Yakov Peretz Blum Graduate Center, City University of New York How does access to this work benefit ou?y Let us know! More information about this work at: https://academicworks.cuny.edu/gc_etds/525 Discover additional works at: https://academicworks.cuny.edu This work is made publicly available by the City University of New York (CUNY). Contact: [email protected] TECHNIQUES FOR AUTOMATIC NORMALIZATION OF ORTHOGRAPHICALLY VARIANT YIDDISH TEXTS by YAKOV PERETZ BLUM A master's thesis submitted to the Graduate Faculty in Linguistics in partial fulfillment of the requirements for the degree of Master of Arts, The City University of New York 2015 2015 © Yakov Blum Some rights reserved. This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. http://creativecommons.org/licenses/by-nc-nd/4.0/ ii This manuscript has been read and accepted for the Graduate Faculty in Linguistics in satisfaction of the requirement for the degree of Master of Arts. Martin Chodorow ______________________ ______________________________________ Date Thesis Advisor Gita Martohardjono ______________________ ______________________________________ Date Executive Officer THE CITY UNIVERSITY OF NEW YORK iii Abstract TECHNIQUES FOR AUTOMATIC NORMALIZATION OF ORTHOGRAPHICALLY VARIANT YIDDISH TEXTS by YAKOV PERETZ BLUM Adviser: Professor Martin Chodorow Yiddish is characterized by a multitude of orthographic systems. A number of approaches to automatic normalization of variant orthography have been explored for the processing of historic texts of languages whose orthography has since been standardized. -
The Semitic Component in Yiddish and Its Ideological Role in Yiddish Philology
philological encounters � (�0�7) 368-387 brill.com/phen The Semitic Component in Yiddish and its Ideological Role in Yiddish Philology Tal Hever-Chybowski Paris Yiddish Center—Medem Library [email protected] Abstract The article discusses the ideological role played by the Semitic component in Yiddish in four major texts of Yiddish philology from the first half of the 20th century: Ysroel Haim Taviov’s “The Hebrew Elements of the Jargon” (1904); Ber Borochov’s “The Tasks of Yiddish Philology” (1913); Nokhem Shtif’s “The Social Differentiation of Yiddish: Hebrew Elements in the Language” (1929); and Max Weinreich’s “What Would Yiddish Have Been without Hebrew?” (1931). The article explores the ways in which these texts attribute various religious, national, psychological and class values to the Semitic com- ponent in Yiddish, while debating its ontological status and making prescriptive sug- gestions regarding its future. It argues that all four philologists set the Semitic component of Yiddish in service of their own ideological visions of Jewish linguistic, national and ethnic identity (Yiddishism, Hebraism, Soviet Socialism, etc.), thus blur- ring the boundaries between descriptive linguistics and ideologically engaged philology. Keywords Yiddish – loshn-koydesh – semitic philology – Hebraism – Yiddishism – dehebraization Yiddish, although written in the Hebrew alphabet, is predominantly Germanic in its linguistic structure and vocabulary.* It also possesses substantial Slavic * The comments of Yitskhok Niborski, Natalia Krynicka and of the anonymous reviewer have greatly improved this article, and I am deeply indebted to them for their help. © koninklijke brill nv, leiden, ���7 | doi �0.��63/�45�9�97-��Downloaded34003� from Brill.com09/23/2021 11:50:14AM via free access The Semitic Component In Yiddish 369 and Semitic elements, and shows some traces of the Romance languages. -
Orthography of Solomon Birnbaum
iQa;;&u*,"" , I iii The "Orthodox" Orthography of Solomon Birnbaum Kalman Weiser YORK UNIVERSITY Apart from the YIVO Institute's Standard Yiddish Orthography, the prevalent sys tem in the secular sector, two other major spelling codes for Yiddish are current today. Sovetisher oysteyg (Soviet spelling) is encountered chiefly among aging Yid dish speakers from the former Soviet bloc and appears in print with diminishing frequency since the end of the Cold War. In contrast, "maskiIic" spelling, the system haphazardly fashioned by 19th-century proponents of the Jewish Enlightenment (which reflects now obsolete Germanized conventions) continues to thrive in the publications of the expanding hasidic sector. As the one group that has by and large preserved and even encouraged Yiddish as its communal vernacular since the Holocaust, contemporary ultra-Orthodox (and primarily hasidic) Jews regard themselves as the custodians of the language and its traditions. Yiddish is considered a part of the inheritance of their pious East European ancestors, whose way of life they idealize and seek to recreate as much as possible. l The historic irony of this development-namely, the enshrinement of the work of secularized enlighteners, who themselves were bent on modernizing East European Jewry and uprooting Hasidism-is probably lost upon today's tens of thousands of hasidic Yiddish speakers. The irony, however, was not lost on Solomon (in German, Salomo; in Yiddish, Shloyme) Birnbaum (1898-1989), a pioneering Yiddish linguist and Hebrew pa leographer in the ultra-Orthodox sector. This essay examines Birnbaum's own "Or thodox" orthography for Yiddish in both its purely linguistic and broader ideolog ical contexts in interwar Poland, then home to the largest Jewish community in Europe and the world center of Yiddish culture. -
Adaptations of Hebrew Script -Mala Enciklopedija Prosvetq I978 [Small Prosveta Encyclopedia]
726 PART X: USE AND ADAPTATION OF SCRIPTS Series Minor 8) The Hague: Mouton SECTION 6I Ly&in, V. I t952. Drevnepermskij jazyk [The Old Pemic language] Moscow: Izdalel'slvo Aka- demii Nauk SSSR ry6r. Komi-russkij sLovar' [Komi-Russian diclionary] Moscow: Gosudarstvennoe Izda- tel'slvo Inostrannyx i Nacional'nyx Slovuej. Adaptations of Hebrew Script -MaLa Enciklopedija Prosvetq I978 [Small Prosveta encycloPedia]. Belgrade. Moll, T. A,, & P. I InEnlikdj t951. Cukotsko-russkij sLovaf [Chukchee-Russian dicrionary] Len- BENJAMIN HARY ingrad: Gosudtrstvemoe udebno-pedagogideskoe izdatel'stvo Ministerstva Prosveldenija RSFSR Poppe, Nicholas. 1963 Tatqr Manual (Indima Universily Publications, Uralic atrd Altaic Series 25) Mouton Bloomington: Indiana University; The Hague: "lagguages" rgjo. Mongolian lnnguage Handbook.Washington, D C.: Center for Applied Linguistics Jewish or ethnolects HerbertH Papet(Intema- Rastorgueva,V.S. 1963.A ShortSketchofTajikGrammar, fans anded It is probably impossible to offer a purely linguistic definition of a Jewish "language," tional Joumal ofAmerican Linguisticr 29, no part 2) Bloominglon: Indiana University; The - 4, as it is difficult to find many cornmon linguistic criteria that can apply to Judeo- Hague: Moulon. (Russiu orig 'Kratkij oderk grammatiki lad;ikskogo jzyka," in M. V. Rax- Arabic, Judeo-Spanish, and Yiddish, for example. Consequently, a sociolinguistic imi & L V Uspenskaja, eds,Tadiikskurusstj slovar' lTajik-Russian dictionary], Moscow: Gosudustvennoe Izdatel'stvo Inostrmyx i Nacional'tryx SIovarej, r954 ) definition with a more suitable term, such as ethnolect, is in order. An ethnolect is an Sjoberg, Andr€e P. t963. Uzbek StructuraL Grammar (Indiana University Publications, Uralic and independent linguistic entity with its own history and development that refers to a lan- Altaic Series r8). -
ISO/IEC JTC1/SC2/WG2 N 2029 Date: 1999-05-29
ISO INTERNATIONAL ORGANIZATION FOR STANDARDIZATION ORGANISATION INTERNATIONALE DE NORMALISATION --------------------------------------------------------------------------------------- ISO/IEC JTC1/SC2/WG2 Universal Multiple-Octet Coded Character Set (UCS) -------------------------------------------------------------------------------- ISO/IEC JTC1/SC2/WG2 N 2029 Date: 1999-05-29 TITLE: Repertoire additions for ISO/IEC 10646-1 - Cumulative List No.9 SOURCE: Bruce Paterson, project editor STATUS: Standing Document, replacing WG2 N 1936 ACTION: For review and confirmation by WG2 DISTRIBUTION: Members of JTC1/SC2/WG2 INTRODUCTION This working paper contains the accumulated list of additions to the repertoire of ISO/IEC 10646-1 agreed by WG2, up to meeting no.36 (Fukuoka). A summary of all allocations within the BMP is given in Annex 1. A list of additional Collections, Blocks, and character tables is given in Annex 2. All additions are assigned to a Character Category, in accordance with clause II of the document "Principles and Procedures for Allocation of New Characters and Scripts" WG2 N 1502. The column Cat. in the table below shows the category (A to G) assigned by WG2. An entry P in this column indicates that the characters are provisionally accepted by WG2. WG2 Cat. No of Code Character(s) Source Current Ballot mtg.res chars position(s) doc. ref. end date NEW/EXTENDED SCRIPTS 27.14 A 11172 AC00-D7A3 Hangul syllables (revision) N1158 AMD 5 - 28.2 A 31 0591-05AF Hebrew cantillation marks N1217 AMD 7 - +05C4 28.5 A 174 0F00-0FB9 Tibetan N1238 AMD 6 - 31.4 A 346 1200-137F Ethiopic N1420 AMD10 - 31.6 A 623 1400-167F Canadian Aboriginal Syllabics N1441 AMD11 - 31.7 B1 85 13A0-13AF Cherokee N1172 AMD12 - & N1362 32.14 A 6582 3400-4DBF CJK Unified Ideograph Exten. -
1 Working Draft for Supporting Blueberry Revision of XML
Working Draft for Supporting Blueberry Revision of XML Recommendation (1.0) with Particular Emphasis on Ethiopic Script and Writing System November 2001 This Version: http://www.digitaladdis.com/sk/ Ethiopics XML_Proposal_Version1.pdf Latest Ver.: http://www.digitaladdis.com/sk/ Ethiopics XML_Proposal_Version1.pdf Authors/Editors: Samuel Kinde Kassegne (Nanogen, UC San Diego, Engg Ext.) <[email protected]> Bibi Ephraim (Hewlett Packard) <[email protected]> Copyright ©2001, SKK, BE. Working Draft for Supporting Blueberry Revision of XML - Ethiopics Script 1 Abstract This document contains XML example codes, surveys and recommendations that support the need for Ethiopic script-based XML element type and attribute names in Amharic, Afaan Oromo, Tigrigna, Agewgna, Guragina, Hadiya, Harari, Sidama, Kembatigna and other Ethiopian languages that use Ethiopic script. Status of this Document This document is an initial working draft that is to be widely circulated for comment from the Ethiopic XML Interest Group and the wider public. The document is a work in progress. Table of Contents 1. Scope 2. Introduction 3. Objective 4. Review of Status of Native HTML and XML Applications in Ethiopic 4.1. Evolution of Ethiopic Content in HTML 4.2. Ethiopic and XML 5. Hybrid XML Document Markups in Ethiopic 5.1. Examples 5.2. Discussions 6. Fully-Native XML Document Markups in Ethiopic 6.1. Examples 6.2. Discussions 7. Conclusions and Recommendations 8. Appendix 9. References Working Draft for Supporting Blueberry Revision of XML - Ethiopics Script 2 1. Scope The work reported in this document contains XML example codes, surveys and recommendations that support the need for a revision of current XML standards to allow Ethiopic-based XML element type and attribute names in Amharic, Afaan Oromo, Tigrigna, Agewgna, Guragina, Hadiya, Harari, Sidama, Kembatigna and other Ethiopian languages that use Ethiopic script. -
Pronunciation
PRONUNCIATION Guide to the Romanized version of quotations from the Guru Granth Saheb. A. Consonants Gurmukhi letter Roman Word in Roman Word in Gurmukhi Meaning Letter letters using the letters using the relevant letter relevant letter from from the second the first column column S s Sabh sB All H h Het ihq Affection K k Krodh kroD Anger K kh Khayl Kyl Play G g Guru gurU Teacher G gh Ghar Gr House | ng Ngyani / gyani i|AwnI / igAwnI Possessing divine knowledge C c Cor cor Thief C ch Chaata Cwqw Umbrella j j Jahaaj jhwj Ship J jh Jhaaroo JwVU Broom \ ny Sunyi su\I Quiet t t Tap t`p Jump T th Thag Tg Robber f d Dar fr Fear F dh Dholak Folk Drum x n Hun hux Now q t Tan qn Body Q th Thuk Quk Sputum d d Den idn Day D dh Dhan Dn Wealth n n Net inq Everyday p p Peta ipqw Father P f Fal Pl Fruit b b Ben ibn Without B bh Bhagat Bgq Saint m m Man mn Mind X y Yam Xm Messenger of death r r Roti rotI Bread l l Loha lohw Iron v v Vasai vsY Dwell V r Koora kUVw Rubbish (n) in brackets, and (g) in brackets after the consonant 'n' both indicate a nasalised sound - Eg. 'Tu(n)' meaning 'you'; 'saibhan(g)' meaning 'by himself'. All consonants in Punjabi / Gurmukhi are sounded - Eg. 'pai-r' meaning 'foot' where the final 'r' is sounded. 3 Copyright Material: Gurmukh Singh of Raub, Pahang, Malaysia B.