The Unicode Standard, Last Updated for the Unicode Standard, Version 4.1
Total Page:16
File Type:pdf, Size:1020Kb
General Punctuation Range: 2000–206F This file contains an excerpt from the character code tables and list of character names for the Unicode Standard, last updated for The Unicode Standard, Version 4.1. This file may be updated as necessary to reflect errata without notice. For an up-to-date list of errata, see http://www.unicode.org/errata/ See http://www.unicode.org/charts/PDF/Unicode-4.1/ for charts showing only the characters added in Unicode 4.1. See http://www.unicode.org/Public/4.1.0/charts/ for a complete archived file of character code charts for Unicode 4.1. Disclaimer These charts are provided as the on-line reference to the character contents of the Unicode Standard, Version 4.1 but do not provide all the information needed to fully support individual scripts using the Unicode Standard. For a complete understanding of the use of the characters contained in this excerpt file, please consult the appropriate sections of The Unicode Standard, Version 4.1, at http://www.unicode.org/versions/Unicode4.1.0/, including sections unchanged in The Unicode Standard, Version 4.0 (ISBN 0-321-18578-1), as well as Unicode Standard Annexes #9, #11, #14, #15, #24, #29, #31, and #34, the other Unicode Technical Reports and Standards, and the Unicode Character Database, which are available on-line. See http://www.unicode.org/ucd/ and http://www.unicode.org/reports/ A thorough understanding of the information contained in these additional sources is required for a successful implementation. Fonts The shapes of the reference glyphs used in these code charts are not prescriptive. Considerable variation is to be expected in actual fonts. The particular fonts used in these charts were provided to the Unicode Consortium by a number of different font designers, who own the rights to the fonts. See http://www.unicode.org/charts/fonts.html for a list. Terms of Use You may freely use these code charts for personal or internal business uses only. You may not incorporate them either wholly or in part into any product or publication, or otherwise distribute them without express written permission from the Unicode Consortium. However, you may provide links to these charts. The fonts and font data used in production of these Code Charts may NOT be extracted, or used in any other way in any product or publication, without permission or license granted by the typeface owner(s). The information in this file may be updated from time to time. The Unicode Consortium is not liable for errors or omissions in this excerpt file or the standard itself. Information on characters added to the Unicode Standard since the publication of the most recent version of the Unicode Standard, as well as on characters currently being considered for addition to the Unicode Standard can be found on the Unicode web site. See http://www.unicode.org/pending/pending.html and http://www.unicode.org/alloc/Pipeline.html. Copyright © 1991-2005 Unicode, Inc. All rights reserved. 2000 General Punctuation 206F 200 201 202 203 204 205 206 0 È - † ‰ Ą 2000 2010 2020 2030 2040 2050 2060 1 É Ó ‡ ą 2001 2011 2021 2031 2041 2051 2061 2 Ê ‒ • ⁂ Ć 2002 2012 2022 2032 2042 2052 2062 3 Ë – ˛ ć 2003 2013 2023 2033 2043 2053 2063 4 Ì — ⁄ ˇ 2004 2014 2024 2034 2044 2054 5 Í m 2005 2015 2025 2035 2045 2055 6 Î … – 2006 2016 2026 2036 2046 2056 7 Ï ‗ 2007 2017 2027 2037 2047 2057 8 Ð ‘ ä — 2008 2018 2028 2038 2048 2058 9 Ñ ’ å ‹ “ 2009 2019 2029 2039 2049 2059 A ‚ › 200A 201A 202A 203A 204A 205A 206A B ! Ò 200B 201B 202B 203B 204B 205B 206B C “ " ⸏ ! 200C 201C 202C 203C 204C 205C 206C D ” # Û " 200D 201D 202D 203D 204D 205D 206D E „ ¯ Ù # 200E 201E 202E 203E 204E 205E 206E F $ 200F 201F 202F 203F 204F 205F 206F 590 The Unicode Standard 4.1, Copyright © 1991–2005, Unicode, Inc. All rights reserved. 2000 General Punctuation 201B For additional general punctuation characters see Dashes also Basic Latin, Latin-1, Supplemental Punctuation - and CJK Symbols and Punctuation. 2010 HYPHEN → 002D - hyphen-minus Spaces → 00AD soft hyphen 2011 Ó NON-BREAKING HYPHEN 2000 È EN QUAD → 002D - hyphen-minus ≡ 2002 Ê en space → 00AD soft hyphen 2001 É EM QUAD <noBreak> 2010 - = mutton quad 2012 ‒ FIGURE DASH ≡ 2003 Ë em space 2013 – 2002 Ê EN SPACE EN DASH 2014 — EM DASH = nut • • half an em may be used in pairs to offset parenthetical text → ー 0020 space 30FC katakana-hiragana prolonged sound Ë mark 2003 EM SPACE = mutton 2015 HORIZONTAL BAR • = QUOTATION DASH nominally, a space equal to the type size in • points long dash introducing quoted text • may scale by the condensation factor of a font General punctuation 0020 space 2004 Ì THREE-PER-EM SPACE 2016 DOUBLE VERTICAL LINE = thick space • used in pairs to indicate norm of a matrix 0020 space → 20E6 combining double vertical stroke 2005 Í FOUR-PER-EM SPACE overlay → = mid space 2225 parallel to 0020 space 2017 ‗ DOUBLE LOW LINE • 2006 Î SIX-PER-EM SPACE this is a spacing character → _ • in computer typography sometimes equated to 005F low line thin space → 0333 combining double low line 0020 space 0020 0333 2007 Ï FIGURE SPACE 2018 ‘ LEFT SINGLE QUOTATION MARK • space equal to tabular width of a font = SINGLE TURNED COMMA QUOTATION • this is equivalent to the digit width of fonts MARK with fixed-width digits • this is the preferred glyph (as opposed to <noBreak> 0020 201B ) → ' 2008 Ð PUNCTUATION SPACE 0027 apostrophe → • space equal to narrow punctuation of a font 02BB modifier letter turned comma 0020 space → 275B heavy single turned comma quotation 2009 Ñ THIN SPACE mark ornament • a fifth of an em (or sometimes a sixth) 2019 ’ RIGHT SINGLE QUOTATION MARK 0020 space = SINGLE COMMA QUOTATION MARK • 200A HAIR SPACE this is the preferred character to use for apostrophe • thinner than a thin space • → 0027 ' apostrophe in traditional typography, the thinnest space → available 02BC modifier letter apostrophe 0020 space → 275C heavy single comma quotation mark 200B ZERO WIDTH SPACE ornament ‚ = ZWSP 201A SINGLE LOW-9 QUOTATION MARK • this character is intended for line break control; = LOW SINGLE COMMA QUOTATION it has no width, but its presence between two MARK characters does not prevent increased letter • used as opening single quotation mark in some spacing in justification languages 201B SINGLE HIGH-REVERSED-9 QUOTATION Formatting characters MARK 200C ZERO WIDTH NON-JOINER = SINGLE REVERSED COMMA QUOTATION MARK = ZWNJ • glyph variant of 2018 ‘ 200D ZERO WIDTH JOINER → 02BD modifier letter reversed comma = ZWJ 200E LEFT-TO-RIGHT MARK = LRM 200F RIGHT-TO-LEFT MARK = RLM The Unicode Standard 4.1, Copyright © 1991–2005, Unicode, Inc. All rights reserved. 591 201C General Punctuation 2039 201C “ LEFT DOUBLE QUOTATION MARK 2029 å PARAGRAPH SEPARATOR = DOUBLE TURNED COMMA QUOTATION • may be used to represent this semantic MARK unambiguously • this is the preferred glyph (as opposed to 202A LEFT-TO-RIGHT EMBEDDING 201F ) = LRE → 0022 " quotation mark 202B RIGHT-TO-LEFT EMBEDDING → 275D heavy double turned comma quotation = RLE mark ornament 202C POP DIRECTIONAL FORMATTING → 301D reversed double prime quotation = PDF mark 202D LEFT-TO-RIGHT OVERRIDE 201D ” RIGHT DOUBLE QUOTATION MARK = LRO = DOUBLE COMMA QUOTATION MARK 202E RIGHT-TO-LEFT OVERRIDE → 0022 " quotation mark = RLO → 2033 double prime 202F NARROW NO-BREAK SPACE → 275E heavy double comma quotation mark = NNBSP ornament → 00A0 no-break space → 301E 〞 double prime quotation mark <noBreak> 0020 201E „ DOUBLE LOW-9 QUOTATION MARK = LOW DOUBLE COMMA QUOTATION General punctuation MARK ‰ • 2030 PER MILLE SIGN used as opening double quotation mark in some = permille, per thousand languages • → used, for example, in measures of blood alcohol 301F low double prime quotation mark content, salinity, etc. 201F DOUBLE HIGH-REVERSED-9 QUOTATION → 0025 % percent sign MARK 2031 PER TEN THOUSAND SIGN = DOUBLE REVERSED COMMA = permyriad QUOTATION MARK • percent of a percent, rarely used • 201C “ glyph variant of → 0025 % percent sign 2020 † DAGGER 2032 PRIME = obelisk, obelus, long cross ‡ = minutes, feet 2021 DOUBLE DAGGER → 0027 ' apostrophe = diesis, double obelisk → 00B4 ´ acute accent • 2022 BULLET → 02B9 ʹ modifier letter prime = black small circle → · 2033 DOUBLE PRIME 00B7 middle dot = seconds, inches → 2024 one dot leader → " → 0022 quotation mark 2219 bullet operator → 02BA modifier letter double prime → 25D8 inverse bullet → ” → 201D right double quotation mark 25E6 white bullet → 3003 〃 ditto mark 2023 TRIANGULAR BULLET → 301E 〞 double prime quotation mark → í 220E end of proof 2032 2032 → 25B8 black right-pointing small triangle 2034 TRIPLE PRIME 2024 ONE DOT LEADER = lines (old measure, 1/12 of an inch) • also used as an Armenian semicolon (mijaket) 2032 2032 2032 → · 00B7 middle dot 2035 REVERSED PRIME → • 2022 bullet → 0060 ` grave accent → 2219 bullet operator . 2036 REVERSED DOUBLE PRIME 002E full stop → 301D reversed double prime quotation 2025 TWO DOT LEADER mark . 002E 002E 2035 2035 … 2026 HORIZONTAL ELLIPSIS 2037 REVERSED TRIPLE PRIME = three dot leader 2035 2035 2035 → 22EE vertical ellipsis 2038 CARET → FE19 presentation form for vertical → 2303 up arrowhead horizontal ellipsis ‹ 002E . 002E . 002E . 2039 SINGLE LEFT-POINTING ANGLE QUOTATION MARK 2027 HYPHENATION POINT = LEFT POINTING SINGLE GUILLEMET Formatting characters • usually opening, sometimes closing → 003C < less-than sign 2028 ä LINE SEPARATOR → 2329 〈 left-pointing angle bracket • may be used to represent this semantic → 3008 〈 left angle bracket unambiguously 592 The Unicode Standard 4.1, Copyright © 1991–2005, Unicode, Inc. All rights reserved. 203A General Punctuation 205E 203A › SINGLE RIGHT-POINTING ANGLE 2050 CLOSE UP QUOTATION MARK • editing mark = RIGHT POINTING SINGLE GUILLEMET 2051 TWO ASTERISKS ALIGNED VERTICALLY • usually closing, sometimes opening 2052 COMMERCIAL MINUS SIGN → 003E > greater-than sign = abzüglich (German), med avdrag av (Swedish), → 232A 〉 right-pointing angle bracket piska (Swedish, "whip") → 3009 〉 right angle bracket • a common glyph variant and fallback 203B REFERENCE MARK representation looks like ./.