Quick viewing(Text Mode)

The Unicode Standard 5.0 Code Charts

The Unicode Standard 5.0 Code Charts

General Range: 2000–206F

This file contains an excerpt from the character code tables and list of character names for The Standard, Version 5.0.

This file may be changed at any time without notice to reflect errata or other updates to the Unicode Standard. See http://www.unicode.org/errata/ for an up-to-date list of errata.

See http://www.unicode.org/charts/ for access to a complete list of the latest character code charts. See http://www.unicode.org/charts/PDF/Unicode-5.0/ for charts showing only the characters added in Unicode 5.0. See http://www.unicode.org/Public/5.0.0/charts/ for a complete archived file of character code charts for Unicode 5.0.

Disclaimer These charts are provided as the on-line reference to the character contents of the Unicode Standard, Version 5.0 but do not provide all the information needed to fully support individual scripts using the Unicode Standard. For a complete understanding of the use of the characters contained in this file, please consult the appropriate sections of The Unicode Standard, Version 5.0 (ISBN 0-321-48091-0), online at http://www.unicode.org/versions/Unicode5.0.0/, as well as Unicode Standard Annexes #9, #11, #14, #15, #24, #29, #31, and #34, the other Unicode Technical Reports and Standards, and the Unicode Character Database, which are available on-line.

See http://www.unicode.org/ucd/ and http://www.unicode.org/reports/

A thorough understanding of the information contained in these additional sources is required for a successful implementation.

Fonts The shapes of the reference used in these code charts are not prescriptive. Considerable variation is to be expected in actual . The particular fonts used in these charts were provided to the by a number of different designers, who own the rights to the fonts.

See http://www.unicode.org/charts/fonts.html for a list.

Terms of Use You may freely use these code charts for personal or internal business uses only. You may not incorporate them either wholly or in part into any product or publication, or otherwise distribute them without express written permission from the Unicode Consortium. However, you may provide links to these charts.

The fonts and font data used in production of these Code Charts may NOT be extracted, or used in any other way in any product or publication, without permission or license granted by the owner(s).

The Unicode Consortium is not liable for errors or omissions in this file or the standard itself. Information on characters added to the Unicode Standard since the publication of the most recent version of the Unicode Standard, as well as on characters currently being considered for addition to the Unicode Standard can be found on the Unicode web site.

See http://www.unicode.org/pending/pending.html and http://www.unicode.org/alloc/Pipeline.html.

Copyright © 1991-2006 Unicode, Inc. All rights reserved. 2000 206F

200 201 202 203 204 205 206

0 È - † ‰   Ą

2000 2010 2020 2030 2040 2050 2060

1 É Ó ‡   ą

2001 2011 2021 2031 2041 2051 2061

2 Ê ‒ • ⁂  Ć

2002 2012 2022 2032 2042 2052 2062

3 Ë –    ˛ ć

2003 2013 2023 2033 2043 2053 2063

4 Ì —  ⁄ ˇ

2004 2014 2024 2034 2044 2054

5 Í    m

2005 2015 2025 2035 2045 2055

6 Î  …   –

2006 2016 2026 2036 2046 2056

7 Ï ‗   

2007 2017 2027 2037 2047 2057

8 Ð ‘ ä  —

2008 2018 2028 2038 2048 2058

9 Ñ ’ å ‹ “

2009 2019 2029 2039 2049 2059

A ‚  › ô

200A 201A 202A 203A 204A 205A 206A

B    ! Ò !

200B 201B 202B 203B 204B 205B 206B

C  “   " ⸏ "

200C 201C 202C 203C 204C 205C 206C

D  ”   # Û #

200D 201D 202D 203D 204D 205D 206D

E  „  ¯  Ù $

200E 201E 202E 203E 204E 205E 206E

F       %

200F 201F 202F 203F 204F 205F 206F

166 The Unicode Standard 5.0, Copyright © 1991-2006 Unicode, Inc. All rights reserved. 2000 General Punctuation 201C

For additional general punctuation characters see also Basic Latin, Latin-1, and CJK 2010 - Symbols and Punctuation. → 002D - hyphen-minus → 00AD  soft hyphen Spaces 2011 Ó NON-BREAKING HYPHEN 2000 È QUAD → 002D - hyphen-minus ≡ 2002 Ê en →  2001 É QUAD 00AD soft hyphen  2010 - = mutton quad ‒ FIGURE ≡ Ë 2012 2003 em space 2013 – EN DASH 2002 Ê EN SPACE 2014 — EM DASH = nut • may be used in pairs to offset parenthetical text • half an em → ー   30FC katakana-hiragana prolonged sound 0020 space mark 2003 Ë EM SPACE 2015  HORIZONTAL = mutton • = quotation dash nominally, a space equal to the type size in • long dash introducing quoted text points • may scale by the condensation factor of a font General punctuation  0020  space 2016  DOUBLE VERTICAL LINE 2004 Ì THREE-PER-EM SPACE • used in pairs to indicate norm of a matrix = thick space → 20E6 combining double vertical stroke  0020  space 2005 Í FOUR-PER-EM SPACE overlay → 2225  parallel to = mid space ‗ DOUBLE LOW LINE   2017 0020 space • 2006 Î SIX-PER-EM SPACE this is a spacing character → 005F _ low line • in computer typography sometimes equated to → 0333 ã combining double low line  0020  space  0020  0333 ã 2007 Ï 2018 ‘ LEFT SINGLE • space equal to tabular width of a font = single turned quotation mark • this is equivalent to the digit width of fonts • this is the preferred character (as opposed to with fixed-width digits 201B )  0020  → 0027 ' 2008 Ð PUNCTUATION SPACE → 02BB modifier letter turned comma • space equal to narrow punctuation of a font → 275B  heavy single turned comma quotation  0020  space mark ornament 2009 Ñ THIN SPACE 2019 ’ RIGHT SINGLE QUOTATION MARK • a fifth of an em (or sometimes a sixth) = single comma quotation mark  0020  space • this is the preferred character to use for 200A  HAIR SPACE apostrophe • thinner than a thin space → 0027 ' apostrophe • in traditional typography, the thinnest space → 02BC modifier letter apostrophe available → 275C  heavy single comma quotation mark  0020  space ornament 200B  ZERO WIDTH SPACE 201A ‚ SINGLE LOW-9 QUOTATION MARK • commonly abbreviated ZWSP = low single comma quotation mark • this character is intended for line break control; • used as opening single quotation mark in some it has no width, but its presence between two characters does not prevent increased letter 201B SINGLE HIGH-REVERSED-9 QUOTATION MARK spacing in justification = single reversed comma quotation mark • has same semantic as 2018 ‘ , but differs in Format characters appearance 200C  ZERO WIDTH NON-JOINER → 02BD modifier letter reversed comma • commonly abbreviated ZWNJ 201C “ LEFT DOUBLE QUOTATION MARK 200D  ZERO WIDTH JOINER = double turned comma quotation mark • commonly abbreviated ZWJ • this is the preferred character (as opposed to 200E  LEFT-TO-RIGHT MARK 201F  ) • commonly abbreviated LRM → 0022 " quotation mark 200F  RIGHT-TO-LEFT MARK → 275D  heavy double turned comma • commonly abbreviated RLM quotation mark ornament → 301D  reversed double quotation mark

The Unicode Standard 5.0, Copyright © 1991-2006 Unicode, Inc. All rights reserved. 167 201D General Punctuation 203C

201D ” RIGHT DOUBLE QUOTATION MARK 202F  NARROW NO-BREAK SPACE = double comma quotation mark • commonly abbreviated NNBSP → 0022 " quotation mark → 00A0  no-break space → 2033  double prime  0020  → 275E  heavy double comma quotation mark ornament General punctuation → 301E 〞 double prime quotation mark 2030 ‰ SIGN 201E „ DOUBLE LOW-9 QUOTATION MARK = permille, per thousand = low double comma quotation mark • used, for example, in measures of blood • used as opening double quotation mark in some alcohol content, salinity, etc. languages → 0025 % → 301F  low double prime quotation mark 2031  PER TEN THOUSAND SIGN 201F  DOUBLE HIGH-REVERSED-9 QUOTATION = permyriad MARK • percent of a percent, rarely used = double reversed comma quotation mark → 0025 % percent sign • has same semantic as 201C “ , but differs in 2032  PRIME appearance = minutes, feet 2020 † → 0027 ' apostrophe = obelisk, obelus, long cross → 00B4 ´ 2021 ‡ DOUBLE DAGGER → 02B9 ʹ modifier letter prime = diesis, double obelisk 2033  DOUBLE PRIME 2022 • = seconds, = black small circle → " → · 0022 quotation mark 00B7 middle → 02BA  modifier letter double prime → 2024  one dot leader → ” →  201D right double quotation mark 2219 bullet operator → 3003 〃 → 25D8 inverse bullet → 〞 → 301E double prime quotation mark 25E6 white bullet  2032  2032  2023 # TRIANGULAR BULLET 2034  TRIPLE PRIME → í 220E end of proof = lines (old measure, 1/12 of an ) →  25B8 black right-pointing small triangle  2032  2032  2032  2024  ONE DOT LEADER 2035  REVERSED PRIME • also used as an Armenian (mijaket) → 0060 ` → 00B7 · middle dot 2036  REVERSED DOUBLE PRIME → 2022 • bullet → 301D  reversed double prime quotation → 2219  bullet operator mark  002E .  2035  2035  2025 & TWO DOT LEADER 2037  REVERSED TRIPLE PRIME  002E . 002E .  2035  2035  2035  2026 … HORIZONTAL 2038  = three dot leader → 2303  up arrowhead → 22EE ( vertical ellipsis 2039 ‹ SINGLE LEFT-POINTING ANGLE QUOTATION → FE19  presentation form for vertical MARK horizontal ellipsis = left pointing single  002E . 002E . 002E . • usually opening, sometimes closing 2027 ) HYPHENATION → 003C < less-than sign → 2329 〈 left-pointing angle Format characters → 3008 〈 left angle bracket 2028 ä LINE SEPARATOR 203A › SINGLE RIGHT-POINTING ANGLE QUOTATION • may be used to represent this semantic MARK unambiguously = right pointing single guillemet 2029 å SEPARATOR • usually closing, sometimes opening • may be used to represent this semantic → 003E > greater-than sign unambiguously → 232A 〉 right-pointing angle bracket 202A  LEFT-TO-RIGHT EMBEDDING → 3009 〉 right angle bracket • commonly abbreviated LRE 203B  202B  RIGHT-TO-LEFT EMBEDDING = Japanese kome • commonly abbreviated RLE = Urdu paragraph separator 202C  POP DIRECTIONAL FORMATTING →  • 0FBF tibetan ku ru kha bzhi mig can commonly abbreviated PDF → 200AD G cjk unified ideograph-200AD 202D  LEFT-TO-RIGHT OVERRIDE • commonly abbreviated LRO Double punctuation for vertical text 202E  RIGHT-TO-LEFT OVERRIDE 203C  DOUBLE • commonly abbreviated RLO → 0021 ! exclamation mark  0021 ! 0021 !

168 The Unicode Standard 5.0, Copyright © 1991-2006 Unicode, Inc. All rights reserved. 203D General Punctuation 2061

General punctuation 2055 m FLOWER PUNCTUATION MARK = phul, puspika 203D * • → ! used as a punctuation mark with Syloti Nagri, 0021 exclamation mark Bengali and other Indic scripts → 003F ? →  203E ¯ 274B heavy eight teardrop-spoked propeller = spacing overscore  0020  0305 , Archaic punctuation 203F / UNDERTIE 2056 – THREE DOT PUNCTUATION = Greek enotikon → 2323 0 smile General punctuation 2040 1 CHARACTER 2057  QUADRUPLE PRIME = z notation sequence concatenation  2032  2032  2032  2032  → 2322 2 frown 2041 3 CARET INSERTION POINT • ’ Archaic punctuation proofreader s mark: insert here 2058 — FOUR DOT PUNCTUATION → 22CC right semidirect product 2059 “ FIVE DOT PUNCTUATION 2042 ⁂ = Greek pentonkion 2043 5 HYPHEN BULLET = quincunx 2044 ⁄ → 2684  die face-5 = solidus (in typography) 205A TWO DOT PUNCTUATION • for composing arbitrary • historically used to indicate the end of a → 002F / solidus → 7 sentence or change of speaker 2215 slash • extends from to 2045 8 LEFT SQUARE BRACKET WITH QUILL → ︰ 2046 9 RIGHT SQUARE BRACKET WITH QUILL FE30 presentation form for vertical two dot leader Double punctuation for vertical text → 1015B 𐅛 greek acrophonic epidaurean two 205B Ò FOUR DOT MARK 2047 DOUBLE QUESTION MARK •  003F ? 003F ? used by scribes in the as highlighter 2048 : QUESTION EXCLAMATION MARK mark • this is centered on the line, but extends beyond  003F ? 0021 ! 2049 ; EXCLAMATION QUESTION MARK top and bottom of the line 205C ⸏ DOTTED CROSS  ! ? 0021 003F • used by scribes in the margin as highlighter mark General punctuation 205D Û TRICOLON 204A ô TIRONIAN SIGN ET = Epidaurean acrophonic three • Irish Gaelic, Old English, ... → ( → & 22EE vertical ellipsis 0026 → 2AF6 ⫶ triple operator 204B @ REVERSED SIGN → FE19  presentation form for vertical → 00B6 ¶ pilcrow sign 204C A BLACK LEFTWARDS BULLET horizontal ellipsis 205E Ù VERTICAL FOUR DOTS 204D B BLACK RIGHTWARDS BULLET • 204E LOW ASTERISK used in dictionaries to indicate legal but undesirable word break → 002A * asterisk • extends the whole height of the line → 0359 ŧ combining asterisk below 204F REVERSED SEMICOLON Space → 003B ; semicolon 205F  MEDIUM MATHEMATICAL SPACE 2050 CLOSE UP • • abbreviated MMSP editing mark • four-eighteenths of an em 2051 TWO ALIGNED VERTICALLY   2052 COMMERCIAL MINUS SIGN 0020 space = abzüglich (German), med avdrag av (Swedish), Format character piska (Swedish, "whip") 2060 Ą WORD JOINER • a common glyph variant and fallback • commonly abbreviated WJ representation looks like ./. • • may also be used as a to indicate a zero width non-breaking space (only) correctness • intended for disambiguation of functions for • used in Finno-Ugric Phonetic Alphabet to byte order mark indicate a related borrowed form with different → FEFF zero width no-break space sound → 0025 % percent sign Invisible operators 2061 ą FUNCTION APPLICATION → 066A ŵ percent sign 2053 ˛ SWUNG DASH • contiguity operator indicating application of a → 007E ~ function 2054 ˇ INVERTED UNDERTIE

The Unicode Standard 5.0, Copyright © 1991-2006 Unicode, Inc. All rights reserved. 169 2062 General Punctuation 206F

2062 Ć INVISIBLE TIMES • contiguity operator indicating multiplication 2063 ć INVISIBLE SEPARATOR = invisible comma • contiguity operator indicating that adjacent mathematical symbols form a list, e.g. when no visible comma is used between multiple indices Deprecated 206A $ INHIBIT SYMMETRIC SWAPPING 206B % ACTIVATE SYMMETRIC SWAPPING 206C & INHIBIT ARABIC FORM SHAPING 206D ' ACTIVATE ARABIC FORM SHAPING 206E ( NATIONAL DIGIT SHAPES 206F ) NOMINAL DIGIT SHAPES

170 The Unicode Standard 5.0, Copyright © 1991-2006 Unicode, Inc. All rights reserved.