Iso/Iec Jtc 1/Sc 2 N 3566 Date: 2001-10-22

Total Page:16

File Type:pdf, Size:1020Kb

Iso/Iec Jtc 1/Sc 2 N 3566 Date: 2001-10-22 ISO/IEC JTC 1/SC 2 N 3566 DATE: 2001-10-22 ISO/IEC JTC 1/SC 2 Coded Character Sets Secretariat: Japan (JISC) DOC. TYPE Activity Report TITLE Report of SC 2/WG 3 to the SC 2 Plenary in Singapore, 2001-10-18/19 SOURCE Mr. Evangelos Melagrakis, WG 3 Convener PROJECT This document was reviewed at the Eleventh Plenary Meeting of SC 2 STATUS held in Singapore, 2001-10-18/19. ACTION ID FYI DUE DATE P, O and L Members of ISO/IEC JTC 1/SC 2 ; ISO/IEC JTC 1 DISTRIBUTION Secretariat; ISO/IEC ITTF ACCESS LEVEL Open ISSUE NO. 126 NAME 02n3566.pdf FILE SIZE (KB) PAGES 7 Secretariat ISO/IEC JTC 1/SC 2 - IPSJ/ITSCJ *(Information Processing Society of Japan/Information Technology Standards Commission of Japan) Room 308-3, Kikai-Shinko-Kaikan Bldg., 3-5-8, Shiba-Koen, Minato-ku, Tokyo 105-0011 Japan *Standard Organization Accredited by JISC Telephone: +81-3-3431-2808; Facsimile: +81-3-3431-6493; E-mail: [email protected] ISO/IEC JTC 1/SC 2 N 3566 ISO/IEC JTC 1/SC 2/WG 3 N 515 Date : 2001-10-15 ISO/IEC JTC 1/SC 2/WG 3 7-bit and 8-bit codes and their extension SECRETARIAT : ELOT DOC TYPE : Convener’s Report TITLE : Report of SC2/WG3 to the SC2 Plenary in Singapore SOURCE : E.Melagrakis, ISO JTC 1/SC 2/WG 3 Convenor PROJECT: STATUS : ACTION ID : FYI DUE DATE : ---- P, O and L Members of ISO/IEC JTC 1/SC 2 DISTRIBUTION : MEDIUM : NO OF PAGES : 5+1 Contact 1: Secretariat ISO/IEC JTC 1/SC 2/WG 3 ELOT Mrs K.Velli (acting) Acharnon 313, 111 45 Kato Patissia, ATHENS – GREECE Tel: +30 1 21 20 307 Fax : +30 1 22 86 219 E-mail : [email protected] Contact 2 : Convenor ISO/IEC JTC 1/SC 2/WG 3 Mr E.Melagrakis Acharnon 313, 111 45 Kato Patissia, ATHENS – GREECE Tel: +30 1 21 20 301 Fax : +30 1 22 86 219 E-mail: [email protected] ISO/IEC JTC 1/SC 2/WG 3 N 515 Report of SC2/WG3 to the SC2 Plenary in Singapore General Information: Since the last SC2 plenary, WG3 has had one meeting, in London, (September 1998). The resolutions can be found in SC2/WG3 N 474 (attached). Before and during this meeting most of the parts of ISO/IEC 8859 were finalized and sent to ITTF for final publication, namely the parts 1,2,3,4,5,6,8,9,10,13,14, and 15. The project editors have done their best to prepare the final texts and their effort was highly appreciated by WG3 (see Resolution WG3 M14.04). Program of Work: Under the program of work for SC2/WG3, the following situation exists: JTC 1/SC 2/WG3 -Programme of Work Updated: 2001-10-15 WG 3: 7 and 8 bit codes and extension techniques W SC 2 Title Curren Reference Editor Target Date (Top)/ G Project # t Stage Actual Date (Bottom) (JTC 1.02) WD CD FCD FDIS IS 3 01 Information technology - ISO 7- - ISO 646:1991 -- Published: 1991-12-01 (Third bit coded character set Edition) for information interchange 3 02 Information technology -- - ISO/IEC J. B. Published: 1994-12-01 Character code Structure and 2022: 1994 Paterson (Fourth Edition) extension techniques 3 02.00.01 Technical Corrigendum 1 to - ISO/IEC Published: 1999-04-15 ISO/IEC 2022 2022/Cor.1 SC2N3171 3 04.00.00 Data Processing -- Procedure for 3 SC2N3505 M. Everson 99- 99- registration of escape sequences 07 09 (Revision of ISO 2375: 1985) 99- 99- 07 11 3 08 ISO 8-bit code for information - ISO/IEC -- Published: 1991-12-15 interchange -- Structure and rules 4873: 1991 for implementation 3 13 Control functions for coded - ISO/IEC -- Published: 1992-12-15 character sets 6429: 1992 (Third Edition) 3 14.01/02 Code graphic character set for text 4 SC2N3522 K. Simonsen - - 99- 00- 01- communication (Revision of 98- 99- 09 11 03 ISO/IEC 6937: 1994) 10 05 00- 01- 05 04 3 14.03/04 Control functions for text - ISO/IEC -- Published: 1991-09-15 communication 10538: 1991 3 16 Registration of repertories of - ISO/IEC J. W. van Published: 1991-12-15 graphic characters from ISO 7350 1991 Wingen (Second Edition) 10367 3 19 Arabic 7-bit coded character set - ISO 9036: K. Sherif Published: 1987-04-15 for information interchange 1987 ISO/IEC JTC 1/SC 2/WG 3 N 515 3 20.01 8-bit single-byte coded graphic - ISO/IEC J. B. Published: 1998-04-15 character sets -- 8859-1: 1998 Paterson Part 1: Latin alphabet No.1 (SC2N2988) 3 20.02 8-bit single-byte coded graphic - ISO/IEC J. W. van Published: 1999-01-15 character sets -- 8859-2: 1999 Wingen Part2: Latin alphabet No. 2 (SC2N3092) 3 20.03 8-bit single-byte coded graphic - ISO/IEC J. W. van Published: 1999-01-15 character sets -- 8859-3: 1999 Wingen Part 3: Latin alphabet No. 3 (SC2N3093) 3 20.04 8-bit single-byte coded graphic - ISO/IEC8859 K. I. Published: 1998-07-01 character sets -- -4: 1998 Larsson Part 4: Latin alphabet No. 4 (SC2N2990) 3 20.05 8-bit single-byte coded graphic - ISO/IEC J. W. van Published: 1999-01-15 character sets -- 8859-5: 1999 Wingen Part 5: Latin/Cyrillic alphabet (SC2N3094) 3 20.06 8-bit single-byte coded graphic - ISO/IEC K. Sherif Published: 1999-01-15 character sets -- 8859-6: 1999 Part 6: Latin/Arabic alphabet (SC2N3096) 3 20.07 8-bit single-byte coded graphic 3 SC2N3329 E.Melagraki - - 98- 00- 01- character sets -- Part 7: s 99- 99- 04 11 03 Latin/Greek alphabet 05 10 99- 06 3 20.08 8-bit single-byte coded graphic - ISO/IEC S.Fuchs Published: 1999-01-15 character sets -- Part 8: 8859-8: 1999 Latin/Hebrew alphabet (SC2N3095) 3 20.09 8-bit single-byte coded graphic - ISO/IEC J. W. van Published: 1999-01-15 character sets -- 8859-9: 1999 Wingen Part 9: Latin alphabet No. 5 (SC2N3097) 3 20.10 8-bit single-byte coded graphic - ISO/IEC K. I. Published: 1998-07-15 character sets -- 8859-10: Larsson Part 10: Latin alphabet No. 6 1998 (SC2N2992) 3 20.11 8-bit single-byte coded graphic 4 SC2N3537 B.Surayuth --- 00- 01- character sets -- J. W. van 97- 99- 11 03 Part 11: Latin/Thai character set Wingen 02 06 01- 06 2 20.13 8-bit single-byte coded graphic - ISO/IEC I. Metra Published: 1998-10-15 character sets -- Part 13: Latin 8859-13: J. W. van alphabet No.7 1998 Wingen (SC2N3098) 3 20.14 8-bit single-byte coded graphic - ISO/IEC M. Everson Published: 1998-11-15 character sets -- Part 14: Latin 8859-14: alphabet No.8 (Celtic) 1998 (SC2N3101) 3 20.15 8-bit single-byte coded graphic - ISO/IEC A. LaBonte Published: 1999-03-15 character sets -- Part 15: Latin 8859-15: alphabet N. 9 1999 (SC2N3091) 3 20.16 8-bit single-byte coded graphic - ISO/IEC M. Everson Published: 2001-07-15 character sets -- Part 16: Latin 8859-16 alphabet N. 10 (SC2N3504) 3 22 Standardized coded graphic - ISO/IEC -- Published: 1991-12-15 character sets for use in 8-bit 10367: 1991 codes ISO/IEC JTC 1/SC 2/WG 3 N 515 3 22.00.00.0 Technical Corrigendum 1 to - ISO/IEC J. W. van Published: 2001-06-01 1 ISO/IEC 10367 10367/Cor. 1 Wingen SC 2 N 3499 3 26.01 Alphanumeric character sets for - ISO 1073- - Published: 1976 optical recognition -- Part 1: 1:1976 Character set OCR-A -- Shapes and dimensions of the printed image 3 26.02 Coded character sets -- - ISO 1073-2: K. I. Published: 1976-12-01 Alphanumeric character sets for 1976 Larsson optical recognition -- Part 2: Character set OCR-B -- Shapes and dimensions of the printed image 3 00962 Implementation of the 7- bit coded - ISO 962:1974 - Published: 1974 character set and its 7- bit and 8- bit extensions on 9- track 12,7 mm (0.5 in) magnetic tape 3 01113 Representation of the 7- bit coded - ISO - Published: 1979 character set on punched tape 1113:1979 3 01154 Punched paper tape -- Dimensions - ISO - Published: 1975 and location of feed holes and 1154:1975 code holes 3 01681 Unpunched paper cards -- - ISO - Published: 1973 Specification 1681:1973 3 01831 Printing specifications for optical - ISO - Published: 1980 character recognition 1831:1980 3 02033 Information processing -- Coding - ISO 2023: - Published: 1983 of machine readable characters 1983 (MICR and OCR) 3 02047 Graphical representations for the - ISO - Published: 1975 control characters of the 7- bit 2047:1975 coded character set 3 03275 Implementation of the 7- bit coded - ISO - Published: 1974 character set and its 7- bit and 8- 3275:1974 bit extensions on 3,81 mm magnetic cassette for data interchange 3 06586 Implementation of the ISO 7- bit - ISO - Published: 1980 and 8- bit coded character sets on 6586:1980 punched cards 3 06936 Information processing -- - ISO - Published: 1988 Conversion between the two 6936:1988 coded character sets of ISO 646 and ISO 6937-2 and the CCITT international telegraph alphabet No. 2 (ITA 2) 3 05426 Extension of the Latin alphabet - ISO - Published: 1983-11-01 coded character set for 5426:1983 bibliographic information interchange 3 05427 Extension of the Cyrillic alphabet - ISO - Published: 1984-09-15 coded character set for 5427:1984 bibliographic information interchange ISO/IEC JTC 1/SC 2/WG 3 N 515 3 05428 Greek alphabet coded character set - ISO 5428: - Published: 1984-11-01 for bibliographic information 1984 interchange 3 06438 Documentation - African coded - ISO - Published: 1983-08-01 character set for bibliographic 6438:1983 information interchange 3 06862 Information and documentation -- - ISO 6862: Published: 1996-10-01 Mathematical coded character set 1996 for bibliographical information interchange 3 10585 Information and documentation - - ISO - Published: 1996-12-15 Armenian alphabet coded 10585:1996 character set for bibliographic information interchange 3 10586 Information and documentation - - ISO - Published: 1996-12-15 Georgian alphabet coded character 10586:1996 set for bibliographic information interchange 3 11822 Information and documentation - - ISO - Published: 1996-12-15 Extension of the Arabic alphabet 11822:1996 coded character set for bibliographic information interchange The dark grey areas are considered to be finished work.
Recommended publications
  • Evitalia NORMAS ISO En El Marco De La Complejidad
    No. 7 Revitalia NORMAS ISO en el marco de la complejidad ESTEQUIOMETRIA de las relaciones humanas FRACTALIDAD en los sistemas biológicos Dirección postal Calle 82 # 102 - 79 Bogotá - Colombia Revista Revitalia Publicación trimestral Contacto [email protected] Web http://revitalia.biogestion.com.co Volumen 2 / Número 7 / Noviembre-Enero de 2021 ISSN: 2711-4635 Editor líder: Juan Pablo Ramírez Galvis. Consultor en Biogestión, NBIC y Gerencia Ambiental/de la Calidad. Globuss Biogestión [email protected] ORCID: 0000-0002-1947-5589 Par evaluador: Jhon Eyber Pazos Alonso Experto en nanotecnología, biosensores y caracterización por AFM. Universidad Central / Clúster NBIC [email protected] ORCID: 0000-0002-5608-1597 Contenido en este número Editorial p. 3 Estequiometría de las relaciones humanas pp. 5-13 Catálogo de las normas ISO en el marco de la complejidad pp. 15-28 Fractalidad en los sistemas biológicos pp. 30-37 Licencia Creative Commons CC BY-NC-ND 4.0 2 Editorial: “En armonía con lo ancestral” Juan Pablo Ramírez Galvis. Consultor en Biogestión, NBIC y Gerencia Ambiental/de la Calidad. [email protected] ORCID: 0000-0002-1947-5589 La dicotomía entre ciencia y religión proviene de la edad media, en la cual, los aspectos espirituales no podían explicarse desde el método científico, y a su vez, la matematización mecánica del universo era el único argumento que convencía a los investigadores. Sin embargo, más atrás en la línea del tiempo, los egipcios, sumerios, chinos, etc., unificaban las teorías metafísicas con las ciencias básicas para dar cuenta de los fenómenos en todas las escalas desde lo micro hasta lo macro.
    [Show full text]
  • ISO/IEC JTC1/SC2/WG2 N 2005 Date: 1999-05-29
    ISO INTERNATIONAL ORGANIZATION FOR STANDARDIZATION ORGANISATION INTERNATIONALE DE NORMALISATION --------------------------------------------------------------------------------------- ISO/IEC JTC1/SC2/WG2 Universal Multiple-Octet Coded Character Set (UCS) -------------------------------------------------------------------------------- ISO/IEC JTC1/SC2/WG2 N 2005 Date: 1999-05-29 TITLE: ISO/IEC 10646-1 Second Edition text, Draft 2 SOURCE: Bruce Paterson, project editor STATUS: Working paper of JTC1/SC2/WG2 ACTION: For review and comment by WG2 DISTRIBUTION: Members of JTC1/SC2/WG2 1. Scope This paper provides a second draft of the text sections of the Second Edition of ISO/IEC 10646-1. It replaces the previous paper WG2 N 1796 (1998-06-01). This draft text includes: - Clauses 1 to 27 (replacing the previous clauses 1 to 26), - Annexes A to R (replacing the previous Annexes A to T), and is attached here as “Draft 2 for ISO/IEC 10646-1 : 1999” (pages ii & 1 to 77). Published and Draft Amendments up to Amd.31 (Tibetan extended), Technical Corrigenda nos. 1, 2, and 3, and editorial corrigenda approved by WG2 up to 1999-03-15, have been applied to the text. The draft does not include: - character glyph tables and name tables (these will be provided in a separate WG2 document from AFII), - the alphabetically sorted list of character names in Annex E (now Annex G), - markings to show the differences from the previous draft. A separate WG2 paper will give the editorial corrigenda applied to this text since N 1796. The editorial corrigenda are as agreed at WG2 meetings #34 to #36. Editorial corrigenda applicable to the character glyph tables and name tables, as listed in N1796 pages 2 to 5, have already been applied to the draft character tables prepared by AFII.
    [Show full text]
  • Janusz S. Bień Kodowanie Tekstów Polskich W Systemach Komputerowych
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Biblioteka Cyfrowa KLF UW (Digital Library of the Formal... Multimedia w nauczaniu jêzyka rodzimego jako obcego Janusz S. Bieñ Kodowanie tekstów polskich w systemach komputerowych 1. Wstęp W niniejszym artykule przedstawiono podstawowe pojęcia kodowania tekstów oraz omówiono wszystkie aktualnie obowiązujące polskie normy dotyczące tej problematyki, a także najważniejsze standardy międzynaro- dowe. 2. Podstawowe pojęcia kodowania tekstów 2.1. Klasyfikacja tekstów Pojęcie tekstu traktujemy jako pierwotne, którego nie będziemy tutaj definiować, wyróżnimy natomiast dwa podstawowe typy tekstów: „teksty fizyczne” i „teksty elektroniczne”. Za charakterystyczną cechę tekstów elek- tronicznych uznajemy fakt, że nie zmieniają one swojej tożsamości przy zmianie ich nośnika fizycznego: ten sam tekst elektroniczny może być zapi- sany na dyskietce, twardym dysku komputera lub na płycie CD-ROM i mimo to pozostaje tym samym tekstem. Tymczasem w przypadku tekstów fizycz- nych każde skopiowanie, bez względu na zastosowaną technikę, wprowadza mniejsze lub większe zmiany i zniekształcenia – w rezultacie po wielo- krotnym kopiowaniu tekst może np. znacznie zmniejszyć swoją czytelność. Teksty fizyczne dzielimy m.in. na: — teksty mówione, — teksty pisane ręcznie, — teksty drukowane. 4 POSTSCRIPTUM 27-29 Do wykazu tego można dołączyć teksty gestykulowane, stosowane przez głuchoniemych. Zasadniczym jednak powodem przedstawienia tego podziału jest podkreślenie istotnej różnicy między tekstami pisanymi ręcznie a tekstami drukowanymi – co widać wyraźnie np. przy wprowa- dzaniu tekstu do komputera za pomocą optycznego rozpoznawania zna- ków (ang. Optical Character Recognition, OCR); są one przez wielu lingwistów zbyt pochopnie traktowane łącznie jako jednorodna klasa tekstów pisanych. Niektóre teksty elektroniczne są po prostu mniej lub bardziej wiernym zapisem pewnych tekstów fizycznych – możemy mówić wtedy o tekście pierwotnym (fizycznym) i wtórnym (elektronicznym).
    [Show full text]
  • Iso/Iec Jtc 1/Sc 2 N 4355
    ISO/IEC JTC 1/SC 2 N 4355 ISO/IEC JTC 1/SC 2 Coded character sets Secretariat: JISC (Japan) Document type: Secretariat Report Title: Secretariat Report to the 19th Plenary Meeting of ISO/IEC JTC 1/SC 2, Colombo, Sri Lanka, 2014-09-30, 10-03 Status: This document is circulated to the SC 2 members for information at the 19th Plenary Meeting to be held in Colombo, Sri Lanka. Date of document: 2014-09-08 Source: SC 2 Secretariat Expected action: INFO No. of pages: 8 Email of secretary: [email protected] Committee URL: http://isotc.iso.org/livelink/livelink/open/jtc1sc2 Secretariat Report to the 19th Plenary Meeting of ISO/IEC JTC 1/SC 2, Colombo, Sri Lanka, 2014-09-30, 10-03 Part I: Administration 1. Title Coded Character Sets 2. Scope Standardization of graphic character sets and their characteristics, including string ordering, associated control functions, their coded representation for information interchange and code extension techniques. Excluded: audio and picture coding. 3. Chairman and Secretariat Chairman: Prof. Yoshiki MIKAMI (re-appointed at the 2013 JTC 1 Perros-Guirec Plenary Meeting) Secretariat: JISC - Japan Attn.: Ayuko Nagasawa, IPSJ/ITSCJ 308-3, Kikai Shinko Kaikan Bldg. 3-5-8, Shibakoen, Minato-ku, Tokyo 105-0011 JAPAN TEL: +81 3 3431 2808/ FAX: +81 3 3431 6493 E-mail: [email protected] 4. Membership 4.1. P - Members (28) Austria (ASI); Canada (SCC); China (SAC); Egypt (EOS); Finland (SFS); France (AFNOR); Germany (DIN); Greece(ELOT); Hungary (MSZT); Iceland (IST); India (BIS); Indonesia (BSN); Ireland (NSAI); Japan (JISC); Korea, Democratic People's Republic (CSK); Korea, Republic of (KATS) ; Lithuania (LST); Mongolia (MASM); Norway (SN); Poland (PKN); Russian Federation (GOST R); Serbia (ISS) ; Sri Lanka (SLSI) ;Thailand (TISI); Tunisia (INNORPI); USA (ANSI) ; Ukraine (DSSU); United Kingdom (BSI) P-member NB added since the 18th plenary: none, removed: Romania (ASRO); Sweden (SIS) 4.2.
    [Show full text]
  • International Character Code Standard for the BE2
    °, , CMU-ITC-87-091 International Character Code Standard for the BE2 June 18, 1987 Tomas Centerlind Information Technology Center (ITC) Camegie Mellon University 1. Major problems with foreign languages All European languages have a set of unique characters, even Great Britain with their Pound sign. Most of these characters are static and do not change if they are in the end or in the middle of the word. The Greek sigma sign however is an example of a character that changes look depending on the position. If we move on to the non-Roman alphabets like Arabic, they have a more complex structure. A basic rule is that certain of the characters are written together if they follow each other but not otherwise. This complicates representation and requires look ahead. In addition to this many of the languages have leftwards or downwards writing. All together these properties makes it very difficult to integrate them with Roman languages. Lots of guidelines have to be established to solve these problems, and before any coding can be done a set of standards must be selected or defined. In this paper I intend to gather all that I can think of that must be considered before selecting standards. Only the basic level of the implementation will be designed, so therefore routines that for example display complex languages like Arabic must be written by the user. A basic method that dislpays Roman script correctly will be supported. 1. Standards 1.1 Existing standards Following is a list of currently existing and used standards. 1.1.1 ASCII, ISO646 The ASCII standard, that is in use ahnost anywhere, will probably have to rcmain as a basic part of the system, and without any doubt it must be possible to read and write ASCII coded documents in the foreseeable future.
    [Show full text]
  • Proach to Learning German and Mandarin Chinese Alexa
    Difficulties in Learning Western and Oriental Languages: A Comparative Ap- proach to Learning German and Mandarin Chinese Alexandra Mechsheryakova Internship Report Master in Intercultural Studies for Business Esta versão contém as críticas e sugestões dos elementos do júri SEPTEMBER – 2018 Difficulties in Learning Western and Oriental Languages: A Comparative Ap- proach to Learning German and Mandarin Chinese Alexandra Mechsheryakova Internship Report Presented to the Porto Accounting and Business School (ISCAP) to obtain a Master’s degree in Intercultural Studies for Business, under the supervision of Professors Yu Yong, and Clara Sarmento, Ph.D. Abstract The following internship report aims to analyse and compare the learning difficulties underlying western and oriental languages. In this case, the first group of languages is represented by German, whereas the second group is represented by Mandarin Chinese. Regarding the report structure, it is organized in six parts. Firstly, in the introduction section, a brief contextualization of the research topic is made and the research method- ologies are defined. Secondly, in the second chapter, some general theoretical back- ground on foreign language learning, the general difficulties related to it and the role of the language teacher are given. After that, in the third chapter, a detailed characteriza- tion of the studied languages is made. Finished the theoretical part, in the fourth chapter, all the practical work performed during the internship as a language teacher is described. Further, in the fifth chapter, the results concerning the learning difficulties are presented. As last part, in the conclusion section, a summing up of the main ideas and findings is made.
    [Show full text]
  • International Register of Coded Character Sets to Be Used with Escape Sequences for Information Interchange in Data Processing
    INTERNATIONAL REGISTER OF CODED CHARACTER SETS TO BE USED WITH ESCAPE SEQUENCES 1 Introduction 1.1 General This document is the ISO International Register of Coded Character Sets To Be Used With Escape Sequences for information interchange in data processing. It is compiled in accordance with the provisions of ISO/IEC 2022, "Code Extension Technique" and of ISO 2375 "Procedure for Registration of Escape Sequences". This International Register contains coded character sets which have been registered in accordance with procedures given in ISO 2375. Its purpose is to identify widely used coded character sets and associate with each a unique escape sequence by means of which it can be designated according to ISO/IEC 2022 and ISO/IEC 4873. The publication of this International Register should promote compatibility in international information interchange and avoid duplication of effort in developing application-oriented coded character sets. Registration provides an identification for a coded character set but implies nothing about its status; it may or may not be part of a standard of an international, national or a corporate body. However, if such a standard is published subsequently to the registration, it would be appropriate for the escape sequence identifying the character set to be specified in the standard. If it is desired to register a set, application should be made to the Registration Authority through an appropriate Sponsoring Authority as specified in ISO 2375. Any character set can be a candidate for registration if it meets the requirements of ISO 2375. The Registration Authority ascertains that the proposals received are formally in accordance with this International Standard, technically in accordance with ISO/IEC 2022, and, where applicable, with ISO/IEC 646 and ISO/IEC 4873, and meet the presentation practice of the Registration Authority.
    [Show full text]
  • Kodowanie Tekstów Polskich W Systemach Komputerowych∗
    Kodowanie tekstów polskich w systemach komputerowych∗ Janusz S. Bień† 4 stycznia 1999‡ 1 Wstęp W niniejszym artykule przedstawiono podstawowe pojęcia kodowania tek- stów oraz omówiono wszystkie aktualnie obowiązujące polskie normy doty- czące tej problematyki, a także najważniejsze standardy międzynarodowe. 2 Podstawowe pojęcia kodowania tekstów 2.1 Klasyfikacja tekstów Pojęcie tekstu traktujemy jako pierwotne, którego nie będziemy tutaj defi- niować, wyróżnimy natomiast dwa podstawowe typy tekstów: teksty fizyczne i teksty elektroniczne. Za charakterystyczną cechę tekstów elektronicznych ∗Referat wygłoszony na konferencji Multimedia w nauczaniu języka rodzime- go jako obcego (Szkoła Języka i Kultury Polskiej, Uniwersytet Śląski, Katowi- ce, 7-8 grudnia 1998) i opublikowany z niewielkimi skrótami w kwartalniku Post- sciptum nr 27–29 (jesień 1998 — wiosna 1999), s. 4-27 (ISSN 1427–0501). Za zgo- dą organizatorów i redakcji niniejszy pełny tekst referatu był dostępny pod adresem ftp://ftp.mimuw.edu.pl/pub/polszczyzna/ogonki/katow98.*, obecnie jest dostępny pod adresem http://www.mimuw.edu.pl/~jsbien/publ/Kodtp98/. †Dr hab. J. S. Bień, prof. UW jest pracownikiem Katedry Lingwistyki Formalnej Uni- wersytetu Warszawskiego. W latach 1998-2003 był kierownikiem Zakładu Zastosowań In- formatycznych Instytutu Orientalistycznego Uniwersytetu Warszawskiego. W Instytucie Informatyki UW, w którym pracował do czerwca 1998 r., prowadził m.in. wykłady mono- graficzne pt. Wybrane standardy przetwarzania tekstu. W latach 1992–1993 był członkiem Normalizacyjnej Komisji Problemowej ds. Informatyki, w latach 1997-1999 Normaliza- cyjnej Komisji Problemowej nr 242 ds. Informacji i Dokumentacji, a w latach 1999-2001 Normalizacyjnej Komisji Problemowej nr 170 ds. Terminologii Informatycznej i Kodowania Informacji Polskiego Komitetu Normalizacyjnego. Adres elektroniczny: [email protected] (lub [email protected]).
    [Show full text]
  • Network Working Group D. Spinellis Request for Comments: 1947 SENA S.A
    Network Working Group D. Spinellis Request for Comments: 1947 SENA S.A. Category: Informational May 1996 Greek Character Encoding for Electronic Mail Messages Status of This Memo This memo provides information for the Internet community. This memo does not specify an Internet standard of any kind. Distribution of this memo is unlimited. Overview and Rational This document describes a standard encoding for electronic mail [RFC822] containing Greek text and provides implementation guide- lines. The standard is based on MIME [RFC1521] and the ISO 8859-7 character encoding. Although the implementation of this standard is straightforward several non-standard but "functional" - though unlikely to inter-operate - alternatives are in common use. For this reason we highlight common implementation and mail user agent setup errors. Description In order to transfer Greek text via electronic mail the text is first translated into the ISO 8859-7 character set, and then encoded using either the Base64 (preferable for text that is mainly Greek) or the Quoted-Printable (justifiable in cases where some Greek words appear inside predominately Latin text) method, as defined in MIME. The following table provides most common Greek encodings (see also [RFC1345]): 0646 37 M7 51 MC 23 69 LG L1 G7 GO GC 28 97 Description ---- -- -- -- -- -- -- -- -- -- -- -- -- -- ----------- 0386 ea a2 86 cd 71 86 b6 Capital alpha with acute 0388 eb b8 8d ce 72 8d b8 Capital epsilon with acute 0389 ec b9 8f d7 73 8f b9 Capital eta with acute 038a ed ba 90 d8 75 90 ba Capital iota
    [Show full text]
  • The Unicode Standard, Version 3.0, Issued by the Unicode Consor- Tium and Published by Addison-Wesley
    The Unicode Standard Version 3.0 The Unicode Consortium ADDISON–WESLEY An Imprint of Addison Wesley Longman, Inc. Reading, Massachusetts · Harlow, England · Menlo Park, California Berkeley, California · Don Mills, Ontario · Sydney Bonn · Amsterdam · Tokyo · Mexico City Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks. Where those designations appear in this book, and Addison-Wesley was aware of a trademark claim, the designations have been printed in initial capital letters. However, not all words in initial capital letters are trademark designations. The authors and publisher have taken care in preparation of this book, but make no expressed or implied warranty of any kind and assume no responsibility for errors or omissions. No liability is assumed for incidental or consequential damages in connection with or arising out of the use of the information or programs contained herein. The Unicode Character Database and other files are provided as-is by Unicode®, Inc. No claims are made as to fitness for any particular purpose. No warranties of any kind are expressed or implied. The recipient agrees to determine applicability of information provided. If these files have been purchased on computer-readable media, the sole remedy for any claim will be exchange of defective media within ninety days of receipt. Dai Kan-Wa Jiten used as the source of reference Kanji codes was written by Tetsuji Morohashi and published by Taishukan Shoten. ISBN 0-201-61633-5 Copyright © 1991-2000 by Unicode, Inc. All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, electronic, mechanical, photocopying, recording or other- wise, without the prior written permission of the publisher or Unicode, Inc.
    [Show full text]
  • Uniedit Conversions
    UNIEDIT MULTILINGUAL TEXT EDITOR USER’S GUIDE HUMANITIES COMPUTING FACILITY DUKE UNIVERSITY Copyright Information © COPYRIGHT 1998 BY THE HUMANITIES COMPUTING FACILITY, DUKE UNIVERSITY. ALL RIGHTS RESERVED. NO PART OF THIS PUBLICATION MAY BE REPRODUCED, TRANSMITTED, OR TRANSCRIBED, STORED INTO A RETRIEVAL SYSTEM OR TRANSLATED INTO ANY LANGUAGE OR COMPUTER LANGUAGE, IN ANY FORM OR BY ANY MEANS, ELECTRONIC, MECHANICAL, MAGNETIC, OPTICAL, CHEMICAL, MANUAL OR OTHERWISE, WITHOUT THE PRIOR WRITTEN CONSENT OF DUKE UNIVERSITY. THE HUMANITIES COMPUTING FACILITY RESERVES THE RIGHT TO REVISE THIS PUBLICATION AND TO MAKE CHANGES FROM TIME TO TIME IN THE CONTENT OF THIS PUBLICATION WITHOUT OBLIGATION OF THE HUMANITIES COMPUTING FACILITY TO NOTIFY ANY PERSON OF SUCH REVISION. MICROSOFT IS A REGISTERED TRADEMARK OF MICROSOFT CORPORATION; VIDEO FOR WINDOWS, WINDOWS 3.1, WINDOWS NT AND MULTIMEDIA MOVIE PLAYER ARE TRADEMARKS OF MICROSOFT CORPORATION. IBM IS A REGISTERED TRADEMARK OF INTERNATIONAL BUSINESS MACHINES CORPORATION AND PC-AT, PS/2, AND M-MOTION ARE TRADEMARKS OF INTERNATIONAL BUSINESS MACHINES. UNICODE IS A REGISTERED TRADEMARK OF THE UNICODE CONSORTIUM. SEND YOUR COMPLETED UNIEDIT REGISTRATION FORM, OR DIRECT ANY QUESTIONS, SUGGESTIONS, PRODUCT ORDERS, AND COMMENTS TO: HUMANITIES COMPUTING FACILITY 319 NORTH BUILDING BOX 90269 DUKE UNIVERSITY DURHAM, NC 27708-0269 USA PHONE: (919) 660-3190 FAX: (919) 660-3191 E-MAIL: [email protected] Table of Contents iii Table of Contents COPYRIGHT INFORMATION .......................................................................................................................
    [Show full text]
  • Sources Bibliographiques
    ANNEXE B SOURCES BIBLIOGRAPHIQUES Annexe B Sources bibliographiques On trouvera ci-dessous les sources bibliographiques ayant servies à l’établissement de la norme ISO/CEI 10646 : 2000. Les sources utilisées pour la traduction française de cette norme sont également mentionnées. Pour obtenir une copie des normes internationales veuillez vous reporter au site réticulaire de l’Organisation internationale de normalisation <http://www.iso.ch/indexf.html>, vous y trouverez un catalogue de toutes les normes les plus récentes et des liens vers votre organisme national où vous pourrez vous procurer les normes nécessaires. B. 1.Normes internationales ISO 233 : 1984, Documentation — Translittération des caractères arabes en caractères latins. ISO/IEC 646 : 1991, Technologies de l’information — Jeu ISO de caractères codés à 7 éléments pour l’échange d’informations. ISO 2033 : 1983, Traitement de l’information — Codage des jeux de caractères pour reconnaissance automatisée (MICR et ROC). ISO 2047 : 1975, Traitement de l’information — Représentation graphique des caractères de commande du jeu de caractères codés à 7 éléments. ISO 5426 : 1983, Extension du jeu de caractères latins codés employé pour l'échange d’information bibliographique. ISO 5426-2 : 1996, Information et documentation —- Extension du jeu de caractères latins codés pour les échanges d'informations bibliographiques — Partie 2: Caractères latins utilisés dans les langues européennes peu répandues et typographie obsolète ISO 5427 : 1984, Extension du jeu de caractères cyrilliques codés employé pour l’échange d'information bibliographique. ISO 5428 : 1984, Jeu de caractères grecs codés employé pour l’échange d’information bibliographique. ISO 6438 : 1983, Documentation — Jeu de caractères africains codés pour l’échange d'informations bibliographiques.
    [Show full text]