2100–214F the Unicode Standard, Version 4.0 Disclaimer Fonts Terms

Total Page:16

File Type:pdf, Size:1020Kb

Load more

Letterlike Symbols Range: 2100–214F The Unicode Standard, Version 4.0 This file contains an excerpt from the character code tables and list of character names for The Unicode Standard, Version 4.0. Characters in this chart that are new for The Unicode Standard, Version 4.0 are shown in conjunction with any existing characters. For ease of reference, the new characters have been highlighted in the chart grid and in the names list. This file will not be updated with errata, or when additional characters are assigned to the Unicode Standard. See http://www.unicode.org/charts for access to a complete list of the latest character charts. Disclaimer These charts are provided as the on-line reference to the character contents of the Unicode Standard, Version 4.0 but do not provide all the information needed to fully support individual scripts using the Unicode Standard. For a complete understanding of the use of the characters contained in this excerpt file, please consult the appropriate sections of The Unicode Standard, Version 4.0 (ISBN 0-321-18578-1), as well as Unicode Standard Annexes #9, #11, #14, #15, #24 and #29, the other Unicode Technical Reports and the Unicode Character Database, which are available on-line. See http://www.unicode.org/Public/UNIDATA/UCD.html and http://www.unicode.org/unicode/reports A thorough understanding of the information contained in these additional sources is required for a successful implementation. Fonts The shapes of the reference glyphs used in these code charts are not prescriptive. Considerable variation is to be expected in actual fonts. The particular fonts used in these charts were provided to the Unicode Consortium by a number of different font designers, who own the rights to the fonts. See http://www.unicode.org/charts/fonts.html for a list. Terms of Use You may freely use these code charts for personal or internal business uses only. You may not incorporate them either wholly or in part into any product or publication, or otherwise distribute them without express written permission from the Unicode Consortium. However, you are welcome to provide links to these charts. The fonts and font data used in production of these Code Charts may NOT be extracted or otherwise used in any commercial product without permission or license granted by the typeface owner(s). The information in this file may be updated from time to time. The Unicode Consortium is not liable for errors or omissions in this excerpt file or the standard itself. Information on characters added to the Unicode Standard since the publication of Version 4.0 as well as on characters currently being considered for addition to the Unicode Standard can be found on the Unicode web site. See http://www.unicode.org/pending/pending.html and http://www.unicode.org/unicode/alloc/Pipeline.html. Copyright © 1991-2003 Unicode, Inc. All rights reserved. 2100 Letterlike Symbols 214F 210 211 212 213 214 0 ƛ Ư 2100 2110 2120 2130 2140 1 Ɯ ư 2101 2111 2121 2131 2141 2 Ɠ Ɲ ™ 2102 2112 2122 2132 2142 3 Ʋ 2103 2113 2123 2133 2143 4 ƕ Ʃ Ƴ 2104 2114 2124 2134 2144 5 ℅ Ơ ℵ 2105 2115 2125 2135 2145 6 № = 2106 2116 2126 2136 2146 7 > 2107 2117 2127 2137 2147 8 ℘ ƭ ? 2108 2118 2128 2138 2148 9 2109 2119 2129 2139 2149 A 210A 211A 212A 213A 214A B Å 210B 211B 212B 213B 214B C 210C 211C 212C D 210D 211D 212D 213D E ℞ ℮ 210E 211E 212E 213E F ƚ Ʈ 210F 211F 212F 213F 184 The Unicode Standard 4.0, Copyright © 1991–2003, Unicode, Inc. All rights reserved. 2100 Letterlike Symbols 2125 Letterlike symbols 2113 SCRIPT SMALL L = mathematical symbol 'ell' Some of the letterlike symbols are intended to complete the set of mathematical alphanumeric symbols starting = liter (traditional symbol) at U+1D400. • despite its character name, this symbol is 2100 ACCOUNT OF derived from a special italicized version of the small letter l 0061 a 002F / 0063 c 2101 • the SI recommended symbol for liter is ! ADDRESSED TO THE SUBJECT 006C l 0061 002F 0073 a / s → 1D4C1 mathematical script small l 2102 Ɠ DOUBLE-STRUCK CAPITAL C <font> 006C l latin small letter l = the set of complex numbers 2114 0043 L B BAR SYMBOL <font> C latin capital letter c = pounds 2103 $ DEGREE CELSIUS 2115 Ơ DOUBLE-STRUCK CAPITAL N = degrees Centigrade 00B0 0043 = natural number ° C <font> 004E N latin capital letter n 2104 ƕ CENTRE LINE SYMBOL 2116 № NUMERO SIGN = clone 004E N 006F o 2105 ℅ CARE OF 2117 0063 002F 006F SOUND RECORDING COPYRIGHT c / o = published 2106 ' CADA UNA = phonorecord sign 0063 002F 0075 c / u → 00A9 © copyright sign 2107 ) EULER CONSTANT 2118 ℘ SCRIPT CAPITAL P 0045 → E latin capital letter e = Weierstrass elliptic function 0190 * latin capital letter open e • actually this has the form of a lowercase 2108 + SCRUPLE calligraphic p, despite its name 2109 , DEGREE FAHRENHEIT 2119 DOUBLE-STRUCK CAPITAL P 00B0 ° 0046 F <font> 0050 P latin capital letter p 210A SCRIPT SMALL G 211A DOUBLE-STRUCK CAPITAL Q = real number symbol = the set of rational numbers <font> 0067 g latin small letter g <font> 0051 Q latin capital letter q 210B SCRIPT CAPITAL H 211B SCRIPT CAPITAL R = Hamiltonian function = Riemann Integral <font> 0048 H latin capital letter h <font> 0052 R latin capital letter r 210C BLACK-LETTER CAPITAL H 211C BLACK-LETTER CAPITAL R = Hilbert space = real part <font> 0048 H latin capital letter h <font> 0052 R latin capital letter r 210D DOUBLE-STRUCK CAPITAL H 211D DOUBLE-STRUCK CAPITAL R <font> 0048 H latin capital letter h = the set of real numbers 210E PLANCK CONSTANT <font> 0052 R latin capital letter r <font> 0068 h latin small letter h 211E ℞ PRESCRIPTION TAKE 210F ƚ PLANCK CONSTANT OVER TWO PI = recipe → 045B ћ cyrillic small letter tshe = cross ratio <font> 0127 2 latin small letter h with 211F RESPONSE stroke 2120 SERVICE MARK 2110 ƛ SCRIPT CAPITAL I <super> 0053 S 004D M <font> 0049 I latin capital letter i 2121 TELEPHONE SIGN 2111 Ɯ BLACK-LETTER CAPITAL I 0054 T 0045 E 004C L = imaginary part 2122 ™ TRADE MARK SIGN <font> 0049 I latin capital letter i <super> 0054 T 004D M 2112 Ɲ SCRIPT CAPITAL L 2123 VERSICLE = Laplace symbol 2124 Ʃ DOUBLE-STRUCK CAPITAL Z 004C <font> L latin capital letter l = the set of integers <font> 005A Z latin capital letter z 2125 OUNCE SIGN → 021D latin small letter yogh The Unicode Standard 4.0, Copyright © 1991–2003, Unicode, Inc. All rights reserved. 185 2126 Letterlike Symbols 2146 2126 B OHM SIGN Hebrew letterlike math symbols • SI unit of resistance, named after G. S. There are left-to-right characters. Ohm, German physicist 2135 • preferred representation is 03A9 Ω ℵ ALEF SYMBOL 03A9 = first transfinite cardinal (countable) ≡ Ω greek capital letter omega 05D0 < 2127 hebrew letter alef E INVERTED OHM SIGN 2136 = = MHO BET SYMBOL • archaic unit of conductance (= the SI unit = second transfinite cardinal (the continuum) siemens) 05D1 = • typographically a turned greek capital hebrew letter bet letter omega 2137 > GIMEL SYMBOL → 01B1 F latin capital letter upsilon = third transfinite cardinal (functions of a → 03A9 Ω greek capital letter omega real variable) 05D2 > 2128 ƭ BLACK-LETTER CAPITAL Z hebrew letter gimel 2138 ? <font> 005A Z latin capital letter z DALET SYMBOL 2129 = fourth transfinite cardinal G TURNED GREEK SMALL LETTER 05D3 ? IOTA hebrew letter dalet • unique element fulfilling a description Additional letterlike symbols (logic) 2139 → 03B9 ι greek small letter iota 5 INFORMATION SOURCE 20DD 212A I KELVIN SIGN • intended for use with 67 0069 ≡ 004B K latin capital letter k <font> i latin small letter i 213A 212B Å ANGSTROM SIGN 9 ROTATED CAPITAL Q • non SI length unit (=0.1 nm) named after • a binding signature mark A. J. Ångström, Swedish physicist 213B FACSIMILE SIGN • preferred representation is 00C5 Å → 2121 telephone sign ≡ 00C5 Å latin capital letter a with ring 0046 F 0041 A 0058 X above 213C " <reserved> 212C SCRIPT CAPITAL B 213D DOUBLE-STRUCK SMALL GAMMA = Bernoulli function <font> 03B3 γ greek small letter gamma <font> 0042 B latin capital letter b 213E DOUBLE-STRUCK CAPITAL GAMMA 212D BLACK-LETTER CAPITAL C <font> 0393 Γ greek capital letter gamma <font> 0043 C latin capital letter c 213F DOUBLE-STRUCK CAPITAL PI 212E ℮ ESTIMATED SYMBOL <font> 03A0 Π greek capital letter pi • used in European packaging → 0065 e latin small letter e Double-struck large operator 212F Ʈ SCRIPT SMALL E 2140 DOUBLE-STRUCK N-ARY = error SUMMATION <font> 0065 e latin small letter e <font> 2211 ∑ n-ary summation 2130 Ư SCRIPT CAPITAL E Additional letterlike symbols = emf (electromotive force) <font> 0045 E latin capital letter e 2141 TURNED SANS-SERIF CAPITAL G 2131 ư SCRIPT CAPITAL F = game = Fourier transform 2142 TURNED SANS-SERIF CAPITAL L <font> 0046 F latin capital letter f 2143 REVERSED SANS-SERIF CAPITAL L 2132 O TURNED CAPITAL F 2144 TURNED SANS-SERIF CAPITAL Y 0046 → F latin capital letter f Double-struck italic mathematical 2133 Ʋ SCRIPT CAPITAL M = M-matrix (physics) symbols = German Mark (not the current Deutsche These stylized mathematical symbols are used in some Mark) documents to distinguish special mathematical usages <font> 004D M latin capital letter m from ordinary variables. 2134 Ƴ SCRIPT SMALL O 2145 DOUBLE-STRUCK ITALIC CAPITAL D = order, of inferior order to • sometimes used for the differential <font> 006F o latin small letter o <font> 0044 D latin capital letter d 2146 DOUBLE-STRUCK ITALIC SMALL D • sometimes used for the differential <font> 0064 d latin small letter d 186 The Unicode Standard 4.0, Copyright © 1991–2003, Unicode, Inc.
Recommended publications
  • Assessment of Options for Handling Full Unicode Character Encodings in MARC21 a Study for the Library of Congress

    Assessment of Options for Handling Full Unicode Character Encodings in MARC21 a Study for the Library of Congress

    1 Assessment of Options for Handling Full Unicode Character Encodings in MARC21 A Study for the Library of Congress Part 1: New Scripts Jack Cain Senior Consultant Trylus Computing, Toronto 1 Purpose This assessment intends to study the issues and make recommendations on the possible expansion of the character set repertoire for bibliographic records in MARC21 format. 1.1 “Encoding Scheme” vs. “Repertoire” An encoding scheme contains codes by which characters are represented in computer memory. These codes are organized according to a certain methodology called an encoding scheme. The list of all characters so encoded is referred to as the “repertoire” of characters in the given encoding schemes. For example, ASCII is one encoding scheme, perhaps the one best known to the average non-technical person in North America. “A”, “B”, & “C” are three characters in the repertoire of this encoding scheme. These three characters are assigned encodings 41, 42 & 43 in ASCII (expressed here in hexadecimal). 1.2 MARC8 "MARC8" is the term commonly used to refer both to the encoding scheme and its repertoire as used in MARC records up to 1998. The ‘8’ refers to the fact that, unlike Unicode which is a multi-byte per character code set, the MARC8 encoding scheme is principally made up of multiple one byte tables in which each character is encoded using a single 8 bit byte. (It also includes the EACC set which actually uses fixed length 3 bytes per character.) (For details on MARC8 and its specifications see: http://www.loc.gov/marc/.) MARC8 was introduced around 1968 and was initially limited to essentially Latin script only.
  • Letterlike Symbols Range: 2100–214F

    Letterlike Symbols Range: 2100–214F

    Letterlike Symbols Range: 2100–214F This file contains an excerpt from the character code tables and list of character names for The Unicode Standard, Version 14.0 This file may be changed at any time without notice to reflect errata or other updates to the Unicode Standard. See https://www.unicode.org/errata/ for an up-to-date list of errata. See https://www.unicode.org/charts/ for access to a complete list of the latest character code charts. See https://www.unicode.org/charts/PDF/Unicode-14.0/ for charts showing only the characters added in Unicode 14.0. See https://www.unicode.org/Public/14.0.0/charts/ for a complete archived file of character code charts for Unicode 14.0. Disclaimer These charts are provided as the online reference to the character contents of the Unicode Standard, Version 14.0 but do not provide all the information needed to fully support individual scripts using the Unicode Standard. For a complete understanding of the use of the characters contained in this file, please consult the appropriate sections of The Unicode Standard, Version 14.0, online at https://www.unicode.org/versions/Unicode14.0.0/, as well as Unicode Standard Annexes #9, #11, #14, #15, #24, #29, #31, #34, #38, #41, #42, #44, #45, and #50, the other Unicode Technical Reports and Standards, and the Unicode Character Database, which are available online. See https://www.unicode.org/ucd/ and https://www.unicode.org/reports/ A thorough understanding of the information contained in these additional sources is required for a successful implementation.
  • The Unicode Standard 5.1 Code Charts

    The Unicode Standard 5.1 Code Charts

    Letterlike Symbols Range: 2100–214F The Unicode Standard, Version 5.1 This file contains an excerpt from the character code tables and list of character names for The Unicode Standard, Version 5.1. Characters in this chart that are new for The Unicode Standard, Version 5.1 are shown in conjunction with any existing characters. For ease of reference, the new characters have been highlighted in the chart grid and in the names list. This file will not be updated with errata, or when additional characters are assigned to the Unicode Standard. See http://www.unicode.org/errata/ for an up-to-date list of errata. See http://www.unicode.org/charts/ for access to a complete list of the latest character code charts. See http://www.unicode.org/charts/PDF/Unicode-5.1/ for charts showing only the characters added in Unicode 5.1. See http://www.unicode.org/Public/5.1.0/charts/ for a complete archived file of character code charts for Unicode 5.1. Disclaimer These charts are provided as the online reference to the character contents of the Unicode Standard, Version 5.1 but do not provide all the information needed to fully support individual scripts using the Unicode Standard. For a complete understanding of the use of the characters contained in this file, please consult the appropriate sections of The Unicode Standard, Version 5.0 (ISBN 0-321-48091-0), online at http://www.unicode.org/versions/Unicode5.0.0/, as well as Unicode Standard Annexes #9, #11, #14, #15, #24, #29, #31, #34, #38, #41, #42, and #44, the other Unicode Technical Reports and Standards, and the Unicode Character Database, which are available online.
  • Character Properties 4

    Character Properties 4

    The Unicode® Standard Version 14.0 – Core Specification To learn about the latest version of the Unicode Standard, see https://www.unicode.org/versions/latest/. Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks. Where those designations appear in this book, and the publisher was aware of a trade- mark claim, the designations have been printed with initial capital letters or in all capitals. Unicode and the Unicode Logo are registered trademarks of Unicode, Inc., in the United States and other countries. The authors and publisher have taken care in the preparation of this specification, but make no expressed or implied warranty of any kind and assume no responsibility for errors or omissions. No liability is assumed for incidental or consequential damages in connection with or arising out of the use of the information or programs contained herein. The Unicode Character Database and other files are provided as-is by Unicode, Inc. No claims are made as to fitness for any particular purpose. No warranties of any kind are expressed or implied. The recipient agrees to determine applicability of information provided. © 2021 Unicode, Inc. All rights reserved. This publication is protected by copyright, and permission must be obtained from the publisher prior to any prohibited reproduction. For information regarding permissions, inquire at https://www.unicode.org/reporting.html. For information about the Unicode terms of use, please see https://www.unicode.org/copyright.html. The Unicode Standard / the Unicode Consortium; edited by the Unicode Consortium. — Version 14.0. Includes index. ISBN 978-1-936213-29-0 (https://www.unicode.org/versions/Unicode14.0.0/) 1.
  • Letterlike Symbols Number Forms

    Letterlike Symbols Number Forms

    ISO/IEC JTC1/SC2/WG2 N2392 Title: A Report of Korean Script ad hoc group meeting on Oct. 15, 2001 Participants: Kim Kyongsok (ROK), Mun Hwang Ryong, Park Dong Ki, Yang Song Jin, Yun Chang Hwa (four from D P R of Korea), Kobayashi (not present when the report was written) Source : Korean script ad hoc group. Date: 2001-10-16 References: WG2 N2374, WG2 N2376, WG2 N2390, WG2 N2243 1. D P R of Korea nominated Mr. YANG, Song Jin as a co-chair from D P R of Korea. 2. Adding a 6th column to CJK and CJK Ext. A tables of ISO/IEC 10646-1:2000 [WG2 N2376] - D P R of Korea proposed that they would prepare a sample output of one page so that IRG and WG2 can review it, on the condition that IRG Rapporteur, IRG Technical Editor and Contributing Editor provide D P R of Korea with current CJK fonts and related software used to produce the current CJK tables. When the sample output proves acceptable, DPRK would prepare CJK and CJK Ext. A tables. - Detailed milestones can be discussed at WG2. 3. Adding 70 symbols [WG2 N2374, WG2 N2390] 3.1 For the following 47 characters, no issues were raised and propose that they be added to BMP. Letterlike Symbols Proposed Shape Character Name UCS LIMITED LIABILITY SIGN U+214C # 004C 0054 0044 PARTNERSHIP SIGN U+214D # 0050 0054 0045 FACSIMILE SIGN U+214E # 0046 0041 0058 Number Forms Proposed Shape Character Name UCS VULGAR FRACTION ONE HALF WITH U+2151 HORIZONTAL BAR # 00BD VULGAR FRACTION ONE THIRD WITH U+2184 HORIZONTAL BAR # 2153 VULGAR FRACTION TWO THIRDS WITH U+2185 HORIZONTAL BAR # 2154 VULGAR FRACTION
  • Unicode Characters in Proofpower Through Lualatex

    Unicode Characters in Proofpower Through Lualatex

    Unicode Characters in ProofPower through Lualatex Roger Bishop Jones Abstract This document serves to establish what characters render like in utf8 ProofPower documents prepared using lualatex. Created 2019 http://www.rbjones.com/rbjpub/pp/doc/t055.pdf © Roger Bishop Jones; Licenced under Gnu LGPL Contents 1 Prelude 2 2 Changes 2 2.1 Recent Changes .......................................... 2 2.2 Changes Under Consideration ................................... 2 2.3 Issues ............................................... 2 3 Introduction 3 4 Mathematical operators and symbols in Unicode 3 5 Dedicated blocks 3 5.1 Mathematical Operators block .................................. 3 5.2 Supplemental Mathematical Operators block ........................... 4 5.3 Mathematical Alphanumeric Symbols block ........................... 4 5.4 Letterlike Symbols block ..................................... 6 5.5 Miscellaneous Mathematical Symbols-A block .......................... 7 5.6 Miscellaneous Mathematical Symbols-B block .......................... 7 5.7 Miscellaneous Technical block .................................. 7 5.8 Geometric Shapes block ...................................... 8 5.9 Miscellaneous Symbols and Arrows block ............................. 9 5.10 Arrows block ........................................... 9 5.11 Supplemental Arrows-A block .................................. 10 5.12 Supplemental Arrows-B block ................................... 10 5.13 Combining Diacritical Marks for Symbols block ......................... 11 5.14
  • Character Repertoire of Symbola

    Character Repertoire of Symbola

    Symbola, version 8.00, October 2015, Unicode Fonts for Ancient Scripts, George Douros Symbola Basic Latin, IPA Extensions, Spacing Modifier Letters, Combining Diacritical Marks, Greek and Coptic, Cyrillic, Cyrillic Supplement, General Punctuation, Superscripts and Subscripts, Currency Symbols, Combining Diacritical Marks for Symbols, Letterlike Symbols, Number Forms, Arrows, Mathematical Operators, Miscellaneous Technical, Control Pictures, Optical Character Recognition, Enclosed Alphanumerics, Box Drawing, Block Elements, Geometric Shapes, Miscellaneous Symbols, Dingbats, Miscellaneous Mathematical Symbols-A, Supplemental Arrows-A, Braille Patterns, Supplemental Arrows-B, Miscellaneous Mathematical Symbols-B, Supplemental Mathematical Operators, Miscellaneous Symbols and Arrows, Supplemental Punctuation, Yijing Hexagram Symbols, Combining Half Marks, Specials, Aegean Numbers, Ancient Greek Numbers, Ancient Symbols, Phaistos Disc, Coptic Epact Numbers, Byzantine Musical Symbols, Musical Symbols, Ancient Greek Musical Notation, Tai Xuan Jing Symbols, Counting Rod Numerals, Mathematical Alphanumeric Symbols, Mahjong Tiles, Domino Tiles, Playing Cards, Enclosed Alphanumeric Supplement, Enclosed Ideographic Supplement, Miscellaneous Symbols and Pictographs, Emoticons, Ornamental Dingbats, Transport and Map Symbols, Alchemical Symbols, Geometric Shapes Extended, Supplemental Arrows-C, Supplemental Symbols and Pictographs, Symbols of occasional mathematical interest, et al. Symbola version 8.00 2015 Symbola is not a merchandise; it is free for
  • The Unicode Standard, Version 3.0, Issued by the Unicode Consor- Tium and Published by Addison-Wesley

    The Unicode Standard, Version 3.0, Issued by the Unicode Consor- Tium and Published by Addison-Wesley

    The Unicode Standard Version 3.0 The Unicode Consortium ADDISON–WESLEY An Imprint of Addison Wesley Longman, Inc. Reading, Massachusetts · Harlow, England · Menlo Park, California Berkeley, California · Don Mills, Ontario · Sydney Bonn · Amsterdam · Tokyo · Mexico City Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks. Where those designations appear in this book, and Addison-Wesley was aware of a trademark claim, the designations have been printed in initial capital letters. However, not all words in initial capital letters are trademark designations. The authors and publisher have taken care in preparation of this book, but make no expressed or implied warranty of any kind and assume no responsibility for errors or omissions. No liability is assumed for incidental or consequential damages in connection with or arising out of the use of the information or programs contained herein. The Unicode Character Database and other files are provided as-is by Unicode®, Inc. No claims are made as to fitness for any particular purpose. No warranties of any kind are expressed or implied. The recipient agrees to determine applicability of information provided. If these files have been purchased on computer-readable media, the sole remedy for any claim will be exchange of defective media within ninety days of receipt. Dai Kan-Wa Jiten used as the source of reference Kanji codes was written by Tetsuji Morohashi and published by Taishukan Shoten. ISBN 0-201-61633-5 Copyright © 1991-2000 by Unicode, Inc. All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, electronic, mechanical, photocopying, recording or other- wise, without the prior written permission of the publisher or Unicode, Inc.
  • Mathematical Alphanumeric Symbols Range: 1D400–1D7FF the Unicode Standard 3.1 Disclaimer Fonts Terms Of

    Mathematical Alphanumeric Symbols Range: 1D400–1D7FF the Unicode Standard 3.1 Disclaimer Fonts Terms Of

    Mathematical Alphanumeric Symbols Range: 1D400–1D7FF The Unicode Standard 3.1 This file contains an excerpt from the character code tables and list of character names for The Unicode Standard, Version 3.1. The characters in this file that are new for The Unicode Standard, Version 3.1 are shown in conjunction with characters that already exist in The Unicode Standard, Version 3.0. For ease of reference, the new characters have been highlighted in the charts and in the nameslist. This file will not be updated with errata or when additional characters are assigned by the Unicode Standard. See http://www.unicode.org/charts for access to a complete set of the latest character charts. Disclaimer The shapes of the reference glyphs used in these code charts are not prescriptive. Considerable variation is to be expected in actual fonts. For a complete understanding of the use of the characters contained in this excerpt file, please consult the appropriate sections of The Unicode Standard, Version 3.0 (ISBN 0-201-61633-5), as well as the Unicode Technical Reports and the Unicode Character Database, which are available online. See ftp://ftp.unicode.org/Public/UNIDATA/UnicodeCharacterDatabase.html and http://www.unicode.org/unicode/reports A thorough understanding of the information contained in these additional sources is required for a successful implementation. Fonts The fonts used in these charts were provided to the Unicode Consortium by a number of different font designers See http://www.unicode.org/unicode/uni2book/u2fonts.html for a list. Terms of Use These charts are provided as a convenient online reference to the character contents of the Unicode Standard, Version 3.1.
  • The Unicode Standard, Version 4.0--Online Edition

    The Unicode Standard, Version 4.0--Online Edition

    This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consor- tium and published by Addison-Wesley. The material has been modified slightly for this online edi- tion, however the PDF files have not been modified to reflect the corrections found on the Updates and Errata page (http://www.unicode.org/errata/). For information on more recent versions of the standard, see http://www.unicode.org/standard/versions/enumeratedversions.html. Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks. Where those designations appear in this book, and Addison-Wesley was aware of a trademark claim, the designations have been printed in initial capital letters. However, not all words in initial capital letters are trademark designations. The Unicode® Consortium is a registered trademark, and Unicode™ is a trademark of Unicode, Inc. The Unicode logo is a trademark of Unicode, Inc., and may be registered in some jurisdictions. The authors and publisher have taken care in preparation of this book, but make no expressed or implied warranty of any kind and assume no responsibility for errors or omissions. No liability is assumed for incidental or consequential damages in connection with or arising out of the use of the information or programs contained herein. The Unicode Character Database and other files are provided as-is by Unicode®, Inc. No claims are made as to fitness for any particular purpose. No warranties of any kind are expressed or implied. The recipient agrees to determine applicability of information provided. Dai Kan-Wa Jiten used as the source of reference Kanji codes was written by Tetsuji Morohashi and published by Taishukan Shoten.
  • Outline of the Course

    Outline of the Course

    Outline of the course Introduction to Digital Libraries (15%) Description of Information (30%) Access to Information (()30%) User Services (10%) Additional topics (()15%) Buliding of a (small) digital library Reference material: – Ian Witten, David Bainbridge, David Nichols, How to build a Digital Library, Morgan Kaufmann, 2010, ISBN 978-0-12-374857-7 (Second edition) – The Web FUB 2012-2013 Vittore Casarosa – Digital Libraries Part 7 -1 Access to information Representation of characters within a computer Representation of documents within a computer – Text documents – Images – Audio – Video How to store efficiently large amounts of data – Compression How to retrieve efficiently the desired item(s) out of large amounts of data – Indexing – Query execution FUB 2012-2013 Vittore Casarosa – Digital Libraries Part 7 -2 Representation of characters The “natural” wayyp to represent ( (palphanumeric ) characters (and symbols) within a computer is to associate a character with a number,,g defining a “coding table” How many bits are needed to represent the Latin alphabet ? FUB 2012-2013 Vittore Casarosa – Digital Libraries Part 7 -3 The ASCII characters The 95 printable ASCII characters, numbdbered from 32 to 126 (dec ima l) 33 control characters FUB 2012-2013 Vittore Casarosa – Digital Libraries Part 7 -4 ASCII table (7 bits) FUB 2012-2013 Vittore Casarosa – Digital Libraries Part 7 -5 ASCII 7-bits character set FUB 2012-2013 Vittore Casarosa – Digital Libraries Part 7 -6 Representation standards ASCII (late fifties) – AiAmerican
  • Dejavusansmono-Bold.Ttf [Dejavu Sans Mono Bold]

    Dejavusansmono-Bold.Ttf [Dejavu Sans Mono Bold]

    DejaVuSerif.ttf [DejaVu Serif] [DejaVu Serif] Basic Latin, Latin-1 Supplement, Latin Extended-A, Latin Extended-B, IPA Extensions, Phonetic Extensions, Phonetic Extensions Supplement, Spacing Modifier Letters, Modifier Tone Letters, Combining Diacritical Marks, Combining Diacritical Marks Supplement, Greek And Coptic, Cyrillic, Cyrillic Supplement, Cyrillic Extended-A, Cyrillic Extended-B, Armenian, Thai, Georgian, Georgian Supplement, Latin Extended Additional, Latin Extended-C, Latin Extended-D, Greek Extended, General Punctuation, Supplemental Punctuation, Superscripts And Subscripts, Currency Symbols, Letterlike Symbols, Number Forms, Arrows, Supplemental Arrows-A, Supplemental Arrows-B, Miscellaneous Symbols and Arrows, Mathematical Operators, Supplemental Mathematical Operators, Miscellaneous Mathematical Symbols-A, Miscellaneous Mathematical Symbols-B, Miscellaneous Technical, Control Pictures, Box Drawing, Block Elements, Geometric Shapes, Miscellaneous Symbols, Dingbats, Non- Plane 0, Private Use Area (plane 0), Alphabetic Presentation Forms, Specials, Braille Patterns, Mathematical Alphanumeric Symbols, Variation Selectors, Variation Selectors Supplement DejaVuSansMono.ttf [DejaVu Sans Mono] [DejaVu Sans Mono] Basic Latin, Latin-1 Supplement, Latin Extended-A, Latin Extended-B, IPA Extensions, Phonetic Extensions, Phonetic Extensions Supplement, Spacing Modifier Letters, Modifier Tone Letters, Combining Diacritical Marks, Combining Diacritical Marks Supplement, Greek And Coptic, Cyrillic, Cyrillic Supplement, Cyrillic Extended-A,