Petit Manuel Unix®
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Armenian Secret and Invented Languages and Argots
Armenian Secret and Invented Languages and Argots The Harvard community has made this article openly available. Please share how this access benefits you. Your story matters Citation Russell, James R. Forthcoming. Armenian secret and invented languages and argots. Proceedings of the Institute of Linguistics of the Russian Academy of Sciences. Citable link http://nrs.harvard.edu/urn-3:HUL.InstRepos:9938150 Terms of Use This article was downloaded from Harvard University’s DASH repository, and is made available under the terms and conditions applicable to Open Access Policy Articles, as set forth at http:// nrs.harvard.edu/urn-3:HUL.InstRepos:dash.current.terms-of- use#OAP 1 ARMENIAN SECRET AND INVENTED LANGUAGES AND ARGOTS. By James R. Russell, Harvard University. Светлой памяти Карена Никитича Юзбашяна посвящается это исследование. CONTENTS: Preface 1. Secret languages and argots 2. Philosophical and hypothetical languages 3. The St. Petersburg Manuscript 4. The Argot of the Felt-Beaters 5. Appendices: 1. Description of St. Petersburg MS A 29 2. Glossary of the Ṙuštuni language 3. Glossary of the argot of the Felt-Beaters of Moks 4. Texts in the “Third Script” of MS A 29 List of Plates Bibliography PREFACE Much of the research for this article was undertaken in Armenia and Russia in June and July 2011 and was funded by a generous O’Neill grant through the Davis Center for Russian and Eurasian Studies at Harvard. For their eager assistance and boundless hospitality I am grateful to numerous friends and colleagues who made my visit pleasant and successful. For their generous assistance in Erevan and St. -
Guide to the Use of Character Set Standards in Europe
CEN TECHNICAL REPORT Draft 3 for CEN Trnnnn:1999 1999-07-23 Descriptors: Data processing, information interchange, text processing, text communication, graphic characters, character sets, representation of characters, coded character sets, architecture Information Technology - Guide to the use of character set standards in Europe This CEN Technical Report has been drawn up by CEN/TC 304 This CEN Technical Report was established by TC 304 in one official version (English). A version in any other language made by translation under the responsibility of a CEN member into its own lan- guage and notified to the Central Secretariat has the same status as the official version. CEN members are the national bodies of Austria, Belgium, the Czech Republic, Denmark, Finland, France, Germany, Greece, Iceland, Ireland, Italy, Luxembourg, Netherlands, Norway, Portugal, Spain, Sweden, Switzerland, and the United Kingdom. CEN European Committee for Standardization Comité Européen de Normalisation Europäisches Komitee für Normung Central Secretariat: rue de Stassart 36, B-1050 Brussels © CEN 1999 Copyright reserved to all CEN members Ref.No. TR xxxx:1999 E CEN TR nnnn : Draft 2 Guide to the use of character set standards in Europe ii Guide to the use of character set standards in Europe CEN TR nnnn : Draft 2 FOREWORD This report was produced by a CEN/TC 304 Project Team, set up in June, 1998, as one of several to carry out the funded work program of TC 304 (documented in CEN/TC 304 N 666 R2). A first draft was discussed at the TC meeting in Brussels in November, 1998. A revised draft was circulated for comments within the TC and thereafter discussed at the TC plenary meeting in April, 1999. -
Choosing Inscriptions Making Font for 'Armazuli' Aramaic Objectives Mark up of the Texts and Linked Data New Photo Document
@EAGLE 2016, Rome Epigraphic Corpus of Georgia Inscriptions found in Georgia are diverse in their typology, content and language. From among this range perhaps the most compelling examples are those inscriptions The Institute of Linguistic Research, in Aramaic and Old Greek (numbering more than 1000), and dating from V AC to XIX (T. Kaukhchishvili 2009.) None of these have been published online according to the EpiDoc guidelines. Thanks to a year’s funding from The Institute of Linguistic Research of the Ilia State University (ISU), the “Epigraphic Corpus of Georgia Project”, Ilia State University led by Prof. Nino Doborjginidze, began on March 1st 2015. It aims to make the first, key 30 inscriptions available to both a scholarly audience and to the general public. It presents an opportunity to question the strict practice of the “print-only” publishing of epigraphic materials among Georgian epigraphists. Objectives New photo documentation and web page - epigraphy.iliauni.edu.ge A desired outcome of the digital publishing of the inscriptions of Georgia is, on the one hand, to preserve those inscriptions and likewise to preserve editions of these inscriptions that were made by Georgian experts from 1930s onwards, although only a few of these were published in international scientific journals because of the So- viet restrictions. Thus the aims of the project can be summarized thus: • to protect these inscriptions as an element of our cultural heritage • to document the printed critical editions of the inscriptions, which likewise consti- tute a part of that cultural heritage • to demonstrate and illustrate common, historical-cultural contexts(e.g. -
Proposal for the Georgian Script Root Zone LGR
Proposal for the Georgian Script Root Zone LGR LGR Version 2 Date: 2016-11-24 Document version: 1.1 Authors: GEORGIAN SCRIPT GENERATION PANEL 1 General Information/ Overview/ Abstract The purpose of this document is to give an overview of the proposed Georgian LGR in the XML format and the rationale behind the design decisions taken. It includes a discussion of relevant features of the script, the communities or languages using it, the process and methodology used and information on the contributors. The formal specification of the LGR can be found in the accompanying XML document: • Proposed-LGR-Georgian-20160915.xml Labels for testing can be found in the accompanying text document: • Labels-GeorgianScript-20160915.txt 2 Script for which the LGR is proposed ISO 15924 Code: Geor ISO 15924 Key N°: 240 ISO 15924 English Name: Georgian Mkhedruli Latin transliteration of native script name: Mkhedruli Native name of the script: მხედრული Maximal Starting Repertoire (MSR) version: MSR-2 Proposal for a Georgian Script Root Zone LGR Georgian Script GP 3 Background on Script and Principal Languages Using It The Georgian scripts are the three writing systems used to write the Georgian language: Asomtavruli, Nuskhuri and Mkhedruli. Mkhedruli (Georgian: მხედრული) is the current Georgian script and is therefore the standard script for modern Georgian and its related Kartvelian languages, whereas Asomtavruli and Nuskhuri are used only in ceremonial religious texts and iconography. In the following, the term Georgian script is used synonymously with Mkhedruli. Like the two other scripts, Mkhedruli is purely unicameral. Mkhedruli first appears in the 10th century - the oldest Mkhedruli inscription found is dated back to 982 AD. -
Sylfaen : Foundations of Multiscript Typography
Sylfaen : Foundations of Multiscript Typography John Hudson, 2000 © INTRODUCTION In the autumn of 1997, I was hired by the Microsoft Typography Group to consult For a general introduction to OpenType, see David Lemon’s on the production of support materials for OpenType font developers working with article in the first issue of Type. For more detailed information, multiple scripts and languages. The project was given the working title Web Resource including the current version of the OpenType specification, for International Typography (WRIT). A year later, I had the pleasure of showing this see the Microsoft Typography work in progress at the 1998 ATypI Conference in Lyon, where it was well received website at: www.microsoft.com/typography by my colleagues in the type industry. Shortly after my return to Canada, Microsoft decided to put further development of this work on hold, so this paper, inevitably, lacks some of the enthusiasm I had for the project in 1998. It is difficult to remain excited about something which has been effectively cancelled, but at the same time I am happy to respond to the editors’ invitation to write something about the project for the Type journal, because I think the small team at Microsoft who worked on this project achieved something important. Perhaps I still hope that some day the project may be revived, even if in a different or more limited form. Certainly, I still believe that the ever increasing emphasis on internationalisation in software development, business and, of course, the Internet, requires a better level of support from font developers, and this in turn requires something like WRIT. -
Binary Codes
2 Binary Codes The present chapter is an extension of the previous chapter on number systems. In the previous chapter, beginning with some of the basic concepts common to all number systems and an outline on the familiar decimal number system, we went on to discuss the binary, the hexadecimal and the octal number systems. While the binary system of representation is the most extensively used one in digital systems, including computers, octal and hexadecimal number systems are commonly used for representing groups of binary digits. The binary coding system, called the straight binary code and discussed in the previous chapter, becomes very cumbersome to handle when used to represent larger decimal numbers. To overcome this shortcoming, and also to perform many other special functions, several binary codes have evolved over the years. Some of the better-known binary codes, including those used efficiently to represent numeric and alphanumeric data, and the codes used to perform special functions, such as detection and correction of errors, will be detailed in this chapter. 2.1 Binary Coded Decimal The binary coded decimal (BCD) is a type of binary code used to represent a given decimal number in an equivalent binary form. BCD-to-decimal and decimal-to-BCD conversions are very easy and straightforward. It is also far less cumbersome an exercise to represent a given decimal number in an equivalent BCD code than to represent it in the equivalent straight binary form discussed in the previous chapter. The BCD equivalent of a decimal number is written by replacing each decimal digit in the integer and fractional parts with its four-bit binary equivalent. -
UTF-8 and Unicode FAQ for Unix/Linux by Markus Kuhn
UTF-8 and Unicode FAQ for Unix/Linux by Markus Kuhn This text is a very comprehensive one-stop information resource on how you can use Unicode/UTF-8 on POSIX systems (Linux, Unix). You will find here both introductory information for every user, as well as detailed references for the experienced developer. Unicode has started to replace ASCII, ISO 8859 and EUC at all levels. It enables users to handle not only practically any script and language used on this planet, it also supports a comprehensive set of mathematical and technical symbols to simplify scientific information exchange. With the UTF-8 encoding, Unicode can be used in a convenient and backwards compatible way in environments that were designed entirely around ASCII, like Unix. UTF-8 is the way in which Unicode is used under Unix, Linux, and similar systems. It is now time to make sure that you are well familiar with it and that your software supportsUTF-8 smoothly. Contents • What are UCS and ISO 10646? • What are combining characters? • What are UCS implementation levels? • Has UCS been adopted as a national standard? • What is Unicode? • So what is the difference between Unicode and ISO 10646? • What is UTF-8? • Who invented UTF-8? • Where do I find nice UTF-8 example files? • What different encodings are there? • What programming languages support Unicode? • How should Unicode be used under Linux? • How do I have to modify my software? • C support for Unicode and UTF-8 • How should the UTF-8 mode be activated? • How do I get a UTF-8 version of xterm? • How much of Unicode -
Nill-1BER 97 the ORIGINS of CAUCASIAN CIVILIZATION: the CHRISTIAN COMPONENT R
Nill-1BER 97 THE ORIGINS OF CAUCASIAN CIVILIZATION: THE CHRISTIAN COMPONENT R. W. Thomson Harvard University Conference on "NATIONALISM AND SOCIAL CHA."iGE IN TRM-l"SCAUCASIA" Co-sponsored by Kennan Institute for Ad\'anced Russian Studies, The Wilson Center· and American Association for the Advancement of Slavic Studies April 24-25, 1980 ) R. W. Thomson Harvard University THE ORIGINS OF CAUCASIAN CIVILIZATION: THE CHRISTIAN C0f..1PONENT* The Caucasus has been home for members of the human race since the early Stone Age, yet it is only in comparatively recent times - in the last 1500 years or so - that the two most significant peoples of the area, the Armenians and the Georgians, have acquired truly individual cultures expressed in their own native languages. I shall not be concerned here with the origin of these peoples, with the thorny question of the mingling of newcomers and indigenous inhabitants, or with the development of social and material culture in the pre-Christian era. This paper will deal with Christian Armenia and .Georgia, for it was only with their conversion that the Armenians and Georgians developed their own scripts and native literatures. Not that they were illiterate and ignorant until then. Indeed, they had played a role in the Greco-Roman world, and in the Old Persian empire and the civilizations of the Near East that had preceded it. But with the development of their o~n literature and art in a Christian context the Armenians and Georgians set themselves apart from these other civilizations to a certain extent and moulded more truly national cultures out of the heritage they owed to Greece, Rome and Iran. -
'Links Between Scripts, Writing Systems, Orthographies, Fonts And
April 2019 Field School in Bova, Calabria, IT Fantastic Languages and where to find them ‘Links between scripts, writing systems, orthographies, fonts and ethnolinguistic identities’ Tomasz Wicherkiewicz [email protected] Adam Mickiewicz University in Poznań, Poland Dept. of Language Policy & Minority Studies Minority Languages, Major Opportunities. Collaborative Research, Community Engagement and Innovative Educational Tools Standardization & Literacy Since at least the rise of nationalism, Westerners have in large part judged languages by whether they are written and standardized. As the colonial era came to an end across much of the world in the 1960s, this tendency intermingled with the rising developmentist impulse: what would be the place of the local, languages all over the world in educational and political projects of the post-colonial states across the globe? This led to a still-flourishing of orthography development for a large number of languages. The process has been even strengthened with the worldwide wave of language revival of the 1990s-2010s The language revival, however, is often~still directly associated with the standardization of its written form Linguists have generally claimed that script is something that does not belong to language, it is something secondary, rather left to culture scientists or historians. Even those sociolinguists, who deal with interrelations between scripts, writing systems and orthographies from one side and users’ societies and speech communities from the other, claim that the former are fundamentally independent of languages. Yet, they absolutely are not independent of language ideologies. On the other hand, scripts are thought by their speakers to be intrinsically related to their languages. -
Revitalizing Endangered Languages
Downloaded from https://www.cambridge.org/core. IP address: 170.106.40.219, on 30 Sep 2021 at 21:18:31, subject to the Cambridge Core terms of use, available at https://www.cambridge.org/core/terms. https://www.cambridge.org/core/product/ADCBBA31190F259BA13525C769E92A9A Downloaded from https://www.cambridge.org/core. IP address: 170.106.40.219, on 30 Sep 2021 at 21:18:31, subject to the Cambridge Core terms of use, available at https://www.cambridge.org/core/terms. https://www.cambridge.org/core/product/ADCBBA31190F259BA13525C769E92A9A Revitalizing Endangered Languages Of the approximately 7,000 languages in the world, at least half may no longer be spoken by the end of the twenty-first century. Languages are endangered by a number of factors, including globalization, education pol- icies, and the political, economic, and cultural marginalization of minority groups. This guidebook provides ideas and strategies, as well as some background, to help with the effective revitalization of endangered languages. It covers a broad scope of themes including effective planning, benefits, well- being, economic aspects, attitudes, and ideologies. The chapter authors have hands-on experience of language revitalization in many countries around the world, and each chapter includes a wealth of examples, such as case studies from specific languages and language areas. Clearly and accessibly written, it is suitable for nonspecialists as well as for academic researchers and students interested in language revitalization. This book is also available as Open Access on Cambridge Core. justyna olko, director of the Center for Research and Practice in Cultural Continuity at the University of Warsaw, is engaged in revitalizing the Nahuatl language in Mexico and works with activists supporting other endangered languages, especially in the area of Poland. -
Section 7.9, Combining Marks, and Also Section 2.11, Combining Characters
The Unicode® Standard Version 13.0 – Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks. Where those designations appear in this book, and the publisher was aware of a trade- mark claim, the designations have been printed with initial capital letters or in all capitals. Unicode and the Unicode Logo are registered trademarks of Unicode, Inc., in the United States and other countries. The authors and publisher have taken care in the preparation of this specification, but make no expressed or implied warranty of any kind and assume no responsibility for errors or omissions. No liability is assumed for incidental or consequential damages in connection with or arising out of the use of the information or programs contained herein. The Unicode Character Database and other files are provided as-is by Unicode, Inc. No claims are made as to fitness for any particular purpose. No warranties of any kind are expressed or implied. The recipient agrees to determine applicability of information provided. © 2020 Unicode, Inc. All rights reserved. This publication is protected by copyright, and permission must be obtained from the publisher prior to any prohibited reproduction. For information regarding permissions, inquire at http://www.unicode.org/reporting.html. For information about the Unicode terms of use, please see http://www.unicode.org/copyright.html. The Unicode Standard / the Unicode Consortium; edited by the Unicode Consortium. — Version 13.0. Includes index. ISBN 978-1-936213-26-9 (http://www.unicode.org/versions/Unicode13.0.0/) 1. -
Proposal for Malayalam Script Root Zone Label Generation
Proposal for Malayalam Script Root Zone Label Generation Rules – Second Public Comment Proceeding Publication Date: 23 June 2020 Prepared By: Internationalized Domain Name (IDN) Program, ICANN Org Public Comment Proceeding Important Information Links Open Date: 7 May 2020 Close Date: 17 June 2020 Announcement Staff Report 23 June 2020 Public Comment Proceeding View Comments Submitted Due Date: Staff Contact: Pitinan Kooarmornpatana Email: [email protected] Section I: General Overview and Next Steps The Neo-Brahmi Generation Panel (NBGP) was formed by nine communities that use scripts derived from the Brahmi script. NBGP has developed Root Zone Label Generation Rules (LGR) proposals for Bangla, Devanagari, Gujarati, Gurmukhi, Kannada, Malayalam, Oriya, Tamil and Telugu scripts. The Malayalam LGR proposal (LGR and supporting documentation) is being updated to address an inconsistency involving the conjunct "nta" and adjust the cross-script variants with Myanmar and Georgian scripts. As per the LGR Procedure, the updated proposal was posted for Public Comments to allow those who have not participated in the NBGP to make their views known. Based on feedback received, the NBGP will finalize the proposal for its evaluation and re-integration into the Label Generation Rules for the Root Zone. Section II: Contributors At the time this report was prepared, a total of two (2) community submissions had been posted to the forum. The contributors are listed below with initials noted. To the extent that quotations are used in the foregoing narrative (Section III), such citations will reference the contributor’s initials. Individuals: Name Affiliation (if provided) Initials Wil Tan WT 梁海 (Liang Hai) LH Section III: Summary of Comments General Disclaimer: This section intends to summarize broadly and comprehensively the comments submitted to this public comment proceeding but does not address every specific position stated by each contributor.