Standards for Language Coding: the ISO 639 Family
Total Page:16
File Type:pdf, Size:1020Kb
Load more
										Recommended publications
									
								- 
												  EN 300 468 V1.3.1 (1997-09) European Standard (Telecommunications Series)Draft EN 300 468 V1.3.1 (1997-09) European Standard (Telecommunications series) Digital Video Broadcasting (DVB); Specification for Service Information (SI) in DVB systems European Broadcasting Union Union Européenne de Radio-Télévision EBU UER European Telecommunications Standards Institute 2 Draft EN 300 468 V1.3.1 (1997-09) Reference REN/JTC-00DVB-43 (4c000j0o.PDF) Keywords DVB, broadcasting, digital, video, MPEG, TV ETSI Secretariat Postal address F-06921 Sophia Antipolis Cedex - FRANCE Office address 650 Route des Lucioles - Sophia Antipolis Valbonne - FRANCE Tel.: +33 4 92 94 42 00 Fax: +33 4 93 65 47 16 Siret N° 348 623 562 00017 - NAF 742 C Association à but non lucratif enregistrée à la Sous-Préfecture de Grasse (06) N° 7803/88 X.400 c= fr; a=atlas; p=etsi; s=secretariat Internet [email protected] http://www.etsi.fr Copyright Notification No part may be reproduced except as authorized by written permission. The copyright and the foregoing restriction extend to reproduction in all media. © European Telecommunications Standards Institute 1997. © European Broadcasting Union 1997. All rights reserved. 3 Draft EN 300 468 V1.3.1 (1997-09) Contents Intellectual Property Rights................................................................................................................................5 Foreword ............................................................................................................................................................5 1 Scope........................................................................................................................................................6
- 
												  Standardisation Action Plan for ClarinStandardisation Action Plan for Clarin State: Proposal to CLARIN community Nuria Bel, Jonas Beskow, Lou Boves, Gerhard Budin, Nicoletta Calzolari, Khalid Choukri, Erhard Hinrichs, Steven Krauwer, Lothar Lemnitzer, Stelios Piperidis, Adam Przepiorkowski, Laurent Romary, Florian Schiel, Helmut Schmidt, Hans Uszkoreit, Peter Wittenburg August 2009 Summary This document describes a proposal for a Standardisation Action Plan (SAP) for the Clarin initiative in close synchronization with other relevant initiatives such as Flarenet, ELRA, ISO and TEI. While Flarenet is oriented towards a broader scope since it is also addressing standards that are typically used in industry, CLARIN wants to be more focussed in its statements to the research domain. Due to the overlap it is agreed that the Flarenet and CLARIN documents on standards need to be closely synchronized. This note covers standards that are generic (XML, UNICODE) as well as standards that are domain specific where naturally the LRT community has much more influence. This Standardization Action Plan wants to give an orientation for all practical work in CLARIN to achieve a harmonized domain of language resources and technology stepwise and therefore its core message is to overcome fragmentation. To meet these goals it wants to keep its message as simple as possible. A web-site will be established that will contain more information about examples, guidelines, explanations, tools, converters and training events such as summer schools. The organization of the document is as follows: • Chapter 1: Introduction to the topic. • Chapter 2: Recommended standards that CLARIN should endorse page 4 • Chapter 3: Standards that are emerging and relevant for CLARIN page 8 • Chapter 4: General guidelines that need to be followed page 12 • Chapter 5: Reference to community practices page 14 • Chapter 6: References This document tries to be short and will give comments, recommendations and discuss open issues for each of the standards.
- 
												  Technical Study Desktop InternationalizationTechnical Study Desktop Internationalization NIC CH A E L T S T U D Y [This page intentionally left blank] X/Open Technical Study Desktop Internationalisation X/Open Company Ltd. December 1995, X/Open Company Limited All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying, recording or otherwise, without the prior permission of the copyright owners. X/Open Technical Study Desktop Internationalisation X/Open Document Number: E501 Published by X/Open Company Ltd., U.K. Any comments relating to the material contained in this document may be submitted to X/Open at: X/Open Company Limited Apex Plaza Forbury Road Reading Berkshire, RG1 1AX United Kingdom or by Electronic Mail to: [email protected] ii X/Open Technical Study (1995) Contents Chapter 1 Internationalisation.............................................................................. 1 1.1 Introduction ................................................................................................. 1 1.2 Character Sets and Encodings.................................................................. 2 1.3 The C Programming Language................................................................ 5 1.4 Internationalisation Support in POSIX .................................................. 6 1.5 Internationalisation Support in the X/Open CAE............................... 7 1.5.1 XPG4 Facilities.........................................................................................
- 
												  Tags for Identifying Languages File:///C:/W3/International/Draft-Langtags/Draft-Phillips-LanTags for Identifying Languages file:///C:/w3/International/draft-langtags/draft-phillips-lan... Network Working Group A. Phillips, Ed. TOC Internet-Draft webMethods, Inc. Expires: October 7, 2004 M. Davis IBM April 8, 2004 Tags for Identifying Languages draft-phillips-langtags-02 Status of this Memo This document is an Internet-Draft and is in full conformance with all provisions of Section 10 of RFC2026. Internet-Drafts are working documents of the Internet Engineering Task Force (IETF), its areas, and its working groups. Note that other groups may also distribute working documents as Internet-Drafts. Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress." The list of current Internet-Drafts can be accessed at http://www.ietf.org/ietf/1id-abstracts.txt. The list of Internet-Draft Shadow Directories can be accessed at http://www.ietf.org/shadow.html. This Internet-Draft will expire on October 7, 2004. Copyright Notice Copyright (C) The Internet Society (2004). All Rights Reserved. Abstract This document describes a language tag for use in cases where it is desired to indicate the language used in an information object, how to register values for use in this language tag, and a construct for matching such language tags, including user defined extensions for private interchange. Table of Contents 1. Introduction 2. The Language Tag 2.1 Syntax 2.2 Language Tag Sources 2.2.1 Pre-Existing RFC3066 Registrations 1 of 20 08/04/2004 11:03 Tags for Identifying Languages file:///C:/w3/International/draft-langtags/draft-phillips-lan..
- 
												  API Dict 1 API DictAPI Dict 1 API Dict Get the list of available dictionaries Endpoint https:/ / api. pons. com/ v1/ dictionaries Building the request • We are expecting GET-Requests. • All other parameters have to be appended to the Endpoint-URL as request parameters. Name Type Description language Request-Parameter The language of the output (ISO 639-1 - two-letter codes). Supported languages are de,el,en,es,fr,it,pl,pt,ru,sl,tr,zh. Example using wget: wget -O - --no-check-certificate "https:/ / api. pons. com/ v1/ dictionaries?language=es" Response • The response is sent in JSON format (see below). • If an unsupported language was supplied, the response for the default language (english) will be delivered. Response content A list of available dictionaries: • key is the internal name of our dictionary. For two-language dictionaries, it should consist of the two languages ordered alphabetically. • simple_label is built this way: "[translated language1] «» [translated language2]" • directed_label should be used if there is a direction involved (for example when displaying search results). The direction is implied in the key (i.e. plde means pl » de). This applies only to some languages (see example - you could not use simple_label here) • languages is a list containing the languages of the dictionary. Please note that some dictionaries may have only one language (at the time of writing: dede, dedx). Example: [ { "key": "depl", "simple_label": "niemiecki «» polski", "directed_label": { "depl": "niemiecki » polski", "plde": "polski » niemiecki" }, "languages": [ "de", "pl" ] API Dict 2 }, [...] ] Query dictionary Endpoint https:/ / api. pons. com/ v1/ dictionary Building the request • We are expecting GET-Requests. • The request has to contain the credential (secret) in an HTTP-Header.
- 
												  Ethnologue: Languages of Honduras Twentieth Edition DataEthnologue: Languages of Honduras Twentieth edition data Gary F. Simons and Charles D. Fennig, Editors Based on information from the Ethnologue, 20th edition: Simons, Gary F. and Charles D. Fennig (eds.). 2017. Ethnologue: Languages of the World, Twentieth edition. Dallas, Texas: SIL International. Online: http://www.ethnologue.com. For personal use only Permission to distribute or reuse this work (in whole or in part) may be obtained through the Copyright Clearance Center at http://www.copyright.com. SIL International, 7500 West Camp Wisdom Road, Dallas, Texas 75236-5699 USA Web: www.sil.org, Phone: +1 972 708 7404, Email: [email protected] Ethnologue: Languages of Honduras 2 Contents List of Abbreviations 3 How to Use This Digest 4 Country Overview 6 Language Status Profile 7 Statistical Summaries 8 Alphabetical Listing of Languages 11 Language Map 14 Languages by Population 15 Languages by Status 16 Languages by Department 18 Languages by Family 19 Language Code Index 20 Language Name Index 21 Bibliography 22 Copyright © 2017 by SIL International All rights reserved. No part of this publication may be reproduced, redistributed, or transmitted in any form or by any means—electronic, mechanical, photocopying, recording, or otherwise—without the prior written permission of SIL International, with the exception of brief excerpts in articles or reviews. Ethnologue: Languages of Honduras 3 List of Abbreviations A Agent in constituent word order alt. alternate name for alt. dial. alternate dialect name for C Consonant in canonical syllable patterns CDE Convention against Discrimination in Education (1960) Class Language classification CPPDCE Convention on the Protection and Promotion of the Diversity of Cultural Expressions (2005) CSICH Convention for the Safeguarding of Intangible Cultural Heritage (2003) dial.
- 
												  Datacite Metadata KernelDataCite ‐ International Data Citation DataCite Metadata Schema for the Publication and Citation of Research Data Version 2.1 March 2011 doi:10.5438/0003 Members of the Metadata Working Group Joan Starr, California Digital Library (head of working group) Jan Ashton, British Library Jan Brase, TIB / DataCite Paul Bracke, Purdue University Angela Gastl, ETH Zürich Jacqueline Gillet, Inist Alfred Heller, DTU Library Birthe Krog, DTU Library Lynne McAvoy, CISTI Karen Morgenroth, CISTI Elizabeth Newbold, British Library Madeleine de Smaele, TU Delft Anja Wilde, GESIS Scott Yeadon, ANDS Wolfgang Zenk‐Möltgen, GESIS Frauke Ziedorn, TIB (Metadata Supervisor) Table of Contents 1 Introduction..................................................................................................................................... 3 1.1 The DataCite Consortium....................................................................................................... 3 1.2 The Metadata Schema ........................................................................................................... 3 1.3 A Note about DataCite DOI registration ................................................................................ 4 1.4 Final Thoughts........................................................................................................................4 1.5 Version 2.1 Update ................................................................................................................ 4 2 DataCite Metadata Properties .......................................................................................................
- 
												  ISO 639-3 Change Request 2010-020ISO 639-3 Registration Authority Request for Change to ISO 639-3 Language Code Change Request Number: 2010-020 (completed by Registration authority) Date: 2010-3-12 Primary Person submitting request: Eric Johnson Affiliation: SIL International, East Asia Group E-mail address: eric_johnson at sil dot org Names, affiliations and email addresses of additional supporters of this request: Postal address for primary contact person for this request (in general, email correspondence will be used): P O Box 307, Chiang Mai, Thailand 50000 PLEASE NOTE: This completed form will become part of the public record of this change request and the history of the ISO 639-3 code set and will be posted on the ISO 639-3 website. Types of change requests This form is to be used in requesting changes (whether creation, modification, or deletion) to elements of the ISO 639 Codes for the representation of names of languages — Part 3: Alpha-3 code for comprehensive coverage of languages . The types of changes that are possible are to 1) modify the reference information for an existing code element, 2) propose a new macrolanguage or modify a macrolanguage group; 3) retire a code element from use, including merging its scope of denotation into that of another code element, 4) split an existing code element into two or more new language code elements, or 5) create a new code element for a previously unidentified language variety. Fill out section 1, 2, 3, 4, or 5 below as appropriate, and the final section documenting the sources of your information. The process by which a change is received, reviewed and adopted is summarized on the final page of this form.
- 
												  Peopletools 8.52: Global TechnologyOracle's PeopleTools PeopleBook PeopleTools 8.52: Global Technology February 2012 PeopleTools 8.52: Global Technology SKU pt8.52tgbl-b0212 Copyright © 1988, 2012, Oracle and/or its affiliates. All rights reserved. Trademark Notice Oracle and Java are registered trademarks of Oracle and/or its affiliates. Other names may be trademarks of their respective owners. License Restrictions Warranty/Consequential Damages Disclaimer This software and related documentation are provided under a license agreement containing restrictions on use and disclosure and are protected by intellectual property laws. Except as expressly permitted in your license agreement or allowed by law, you may not use, copy, reproduce, translate, broadcast, modify, license, transmit, distribute, exhibit, perform, publish or display any part, in any form, or by any means. Reverse engineering, disassembly, or decompilation of this software, unless required by law for interoperability, is prohibited. Warranty Disclaimer The information contained herein is subject to change without notice and is not warranted to be error-free. If you find any errors, please report them to us in writing. Restricted Rights Notice If this is software or related documentation that is delivered to the U.S. Government or anyone licensing it on behalf of the U.S. Government, the following notice is applicable: U.S. GOVERNMENT END USERS: Oracle programs, including any operating system, integrated software, any programs installed on the hardware, and/or documentation, delivered to U.S. Government end users are "commercial computer software" pursuant to the applicable Federal Acquisition Regulation and agency- specific supplemental regulations. As such, use, duplication, disclosure, modification, and adaptation of the programs, including any operating system, integrated software, any programs installed on the hardware, and/or documentation, shall be subject to license terms and license restrictions applicable to the programs.
- 
												  Language ISO CodesLLAANNGGUUAAGGEE IISSOO CCOODDEESS http://www.tutorialspoint.com/html/language_iso_codes.htm Copyright © tutorialspoint.com The following is a draft list of language code correspondences between ISO codes, Microsoft codes, and Macintosh codes. Source of this information is Unicode Consortium. Language Codes: ISO 639, Microsoft Language ISO Windows Name Win Code Code Abkhazian ab Afar aa Afrikaans af LANG_AFRIKAANS 0x36 Albanian sq LANG_ALBANIAN 0x1c Amharic am noconstantdefined 0x5e Arabic ar LANG_ARABIC 0x01 Armenian hy LANG_ARMENIAN 0x2b Assamese as LANG_ASSAMESE 0x4d Aymara ay Azerbaijani az LANG_AZERI 0x2c Bashkir ba Basque eu LANG_BASQUE 0x2d Bengali Bangla bn LANG_BENGALI 0x45 Bhutani dz Bihari bh Bislama bi Breton br Bulgarian bg LANG_BULGARIAN 0x02 Burmese my noconstantdefined 0x55 Byelorussian be LANG_BELARUSIAN 0x23 Belarusian Cambodian km noconstantdefined 0x53 Catalan ca LANG_CATALAN 0x03 Cherokee noconstantdefined 0x5c Chewa Chinese Simplified zh LANG_CHINESE SUBLANGCHINESESIMPLIFIED 0x04 0x0804 Chinese zh LANG_CHINESE SUBLANGCHINESETRADITIONAL 0x04 0x0404 Traditional Corsican co Croatian hr LANG_CROATIAN 0x1a Czech cs LANG_CZECH 0x05 Danish da LANG_DANISH 0x06 Divehi LANG_DIVEHI 0x65 Dutch nl LANG_DUTCH 0x13 Edo noconstantdefined 0x66 English en LANG_ENGLISH 0x09 Esperanto eo Estonian et LANG_ESTONIAN 0x25 Faeroese fo LANG_FAEROESE 0x38 Farsi fa LANG_FARSI 0x29 Fiji fj Finnish fi LANG_FINNISH 0x0b Flemish LANG_DUTCH SUBLANGDUTCHBELGIAN 0x13 0x0813 French fr LANG_FRENCH 0x0c Frisian fy noconstantdefined 0x62 Fulfulde noconstantdefined
- 
												  ISO 639-3 Code Split Request TemplateISO 639-3 Registration Authority Request for Change to ISO 639-3 Language Code Change Request Number: 2019-063 (completed by Registration authority) Date: 2019 Aug 31 Primary Person submitting request: Kirk Miller E-mail address: kirkmiller at gmail dot com Names, affiliations and email addresses of additional supporters of this request: PLEASE NOTE: This completed form will become part of the public record of this change request and the history of the ISO 639-3 code set and will be posted on the ISO 639-3 website. Types of change requests Type of change proposed (check one): 1. Modify reference information for an existing language code element 2. Propose a new macrolanguage or modify a macrolanguage group 3. Retire a language code element from use (duplicate or non-existent) 4. Expand the denotation of a code element through the merging one or more language code elements into it (retiring the latter group of code elements) 5. Split a language code element into two or more new code elements 6. Create a code element for a previously unidentified language For proposing a change to an existing code element, please identify: Affected ISO 639-3 identifier: wuu Associated reference name: Wu Chinese 5. Split a language code element into two or more code elements (a) List the languages into which this code element should be split: Taihu Wu Chinese Taizhou Wu Chinese Wuzhou Wu Chinese (Jinqu Wu) Xuanzhou Wu Chinese Chuqu Wu Chinese (Shangli Wu) Oujiang Wu / Ouyu Chinese The highest priority is Oujiang. (b) Referring to the criteria given above, give the rationale for splitting the existing code element into two or more languages: The varieties of Wu are not mutually intelligible.
- 
												  ISO 639-2 Language Code LisISO 639-2 Language Code List - Codes for the representation of... h p://www.loc.gov/standards/iso639-2/php/code_list.php Library of Congress >> Standards IS0 639-2 HOME - Code List - Changes to Codes ISO 639 Joint Advisory Committee - ISO 639-1 Registration Authority Codes for the Representation of Names of Languages Codes arranged alphabetically by alpha-3/ISO 639-2 Code Note: ISO 639-2 is the alpha-3 code in Codes for the representation of names of languages-- Part 2. There are 21 languages that have alternative codes for bibliographic or terminology purposes. In those cases, each is listed separately and they are designated as "B" (bibliographic) or "T" (terminology). In all other cases there is only one ISO 639-2 code. Multiple codes assigned to the same language are to be considered synonyms. ISO 639-1 is the alpha-2 code. ISO 639-2 Code ISO 639-1 Code English name of French name of Language Language aar aa Afar afar abk ab Abkhazian abkhaze ace Achinese aceh ach Acoli acoli ada Adangme adangme ady Adyghe; Adygei adyghé afa Afro-Asiatic languages afro-asiatiques, langues afh Afrihili afrihili afr af Afrikaans afrikaans ain Ainu aïnou aka ak Akan akan akk Akkadian akkadien alb (B) sq Albanian albanais sqi (T) ale Aleut aléoute alg Algonquian languages algonquines, langues alt Southern Altai altai du Sud amh am Amharic amharique ang English, Old (ca.450-1100) anglo-saxon (ca.450-1100) anp Angika angika apa Apache languages apaches, langues ara ar Arabic arabe arc Official Aramaic (700-300 araméen d'empire BCE); Imperial Aramaic (700-300 BCE) (700-300 BCE) arg an Aragonese aragonais 1 von 15 15.01.2010 15:16 ISO 639-2 Language Code List - Codes for the representation of..