
The Unicode Standard 5.1 Code Charts


C0 Controls and Basic Latin Range: 0000–007F

This file contains an excerpt from the character code tables and list of character names for The Unicode Standard, Version 5.1.

This file may be changed at any time without notice to reflect errata or other updates to the Unicode Standard. See http://www.unicode.org/errata/ for an up-to-date list of errata.

See http://www.unicode.org/charts/ for access to a complete list of the latest character code charts. See http://www.unicode.org/charts/PDF/Unicode-5.1/ for charts showing only the characters added in Unicode 5.1. See http://www.unicode.org/Public/5.1.0/charts/ for a complete archived file of character code charts for Unicode 5.1.

Disclaimer

These charts are provided as the online reference to the character contents of the Unicode Standard, Version 5.1 but do not provide all the information needed to fully support individual scripts using the Unicode Standard. For a complete understanding of the use of the characters contained in this file, please consult the appropriate sections of The Unicode Standard, Version 5.0 (ISBN 0-321-48091-0), online at http://www.unicode.org/versions/Unicode5.0.0/, as well as Unicode Standard Annexes #9, #11, #14, #15, #24, #29, #31, #34, #38, #41, #42, and #44, the other Unicode Technical Reports and Standards, and the Unicode Character Database, which are available online.

See http://www.unicode.org/ucd/ and http://www.unicode.org/reports/

A thorough understanding of the information contained in these additional sources is required for a successful implementation.

Fonts

The shapes of the reference glyphs used in these code charts are not prescriptive. Considerable variation is to be expected in actual fonts. The particular fonts used in these charts were provided to the Unicode Consortium by a number of different font designers, who own the rights to the fonts.

See http://www.unicode.org/charts/fonts.html for a list.

Terms of Use

You may freely use these code charts for personal or internal business uses only. You may not incorporate them either wholly or in part into any product or publication, or otherwise distribute them without express written permission from the Unicode Consortium. However, you may provide links to these charts.

The fonts and font data used in production of these code charts may NOT be extracted, or used in any other way in any product or publication, without permission or license granted by the typeface owner(s).

The Unicode Consortium is not liable for errors or omissions in this file or the standard itself. Information on characters added to the Unicode Standard since the publication of the most recent version of the Unicode Standard, as well as on characters currently being considered for addition to the Unicode Standard can be found on the Unicode web site.

See http://www.unicode.org/pending/pending.html and http://www.unicode.org/alloc/Pipeline.html.

Copyright © 1991-2008 Unicode, Inc. All rights reserved.

C0 Controls and Basic Latin: 0000–007F

     000  001  002  003  004  005  006  007
0    NUL  DLE  SP   0    @    P    `    p
1    SOH  DC1  !    1    A    Q    a    q
2    STX  DC2  "    2    B    R    b    r
3    ETX  DC3  #    3    C    S    c    s
4    EOT  DC4  $    4    D    T    d    t
5    ENQ  NAK  %    5    E    U    e    u
6    ACK  SYN  &    6    F    V    f    v
7    BEL  ETB  '    7    G    W    g    w
8    BS   CAN  (    8    H    X    h    x
9    HT   EM   )    9    I    Y    i    y
A    LF   SUB  *    :    J    Z    j    z
B    VT   ESC  +    ;    K    [    k    {
C    FF   FS   ,    <    L    \    l    |
D    CR   GS   -    =    M    ]    m    }
E    SO   RS   .    >    N    ^    n    ~
F    SI   US   /    ?    O    _    o    DEL

Each cell's code point is the column heading followed by the row digit; for example, column 004, row 1 is 0041 (A). The control characters, the space (SP), and DELETE (DEL) have no visible glyph and are shown by their customary abbreviations.
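The grid is indexed by a three-digit column heading and a one-digit row, both hexadecimal. As a minimal sketch of that lookup, the helper below (the name `chart_cell` is hypothetical, not part of any Unicode tooling) joins the two parts and converts the result to a character.

```python
def chart_cell(column: str, row: str) -> str:
    """Return the character at a chart cell, e.g. column '004', row '1'."""
    code_point = int(column + row, 16)  # '004' + '1' -> 0x0041
    if not 0x0000 <= code_point <= 0x007F:
        raise ValueError(f"U+{code_point:04X} is outside C0 Controls and Basic Latin")
    return chr(code_point)


print(chart_cell("004", "1"))        # A
print(repr(chart_cell("002", "0")))  # ' ' (SPACE)
print(repr(chart_cell("007", "F")))  # '\x7f' (DELETE)
```

The range check only reflects this one chart; other blocks use the same column-plus-row scheme with longer column headings.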


C0 controls

Alias names are those for ISO/IEC 6429:1992. Commonly used alternative aliases are also shown.

0000 = NULL
0001 = START OF HEADING
0002 = START OF TEXT
0003 = END OF TEXT
0004 = END OF TRANSMISSION
0005 = ENQUIRY
0006 = ACKNOWLEDGE
0007 = BELL
0008 = BACKSPACE
0009 = CHARACTER TABULATION
     = horizontal tabulation (HT), tab
000A = LINE FEED (LF)
     = new line (NL), end of line (EOL)
000B = LINE TABULATION
     = vertical tabulation (VT)
000C = FORM FEED (FF)
000D = CARRIAGE RETURN (CR)
000E = SHIFT OUT
     • known as LOCKING-SHIFT ONE in 8-bit environments
000F = SHIFT IN
     • known as LOCKING-SHIFT ZERO in 8-bit environments
0010 = DATA LINK ESCAPE
0011 = DEVICE CONTROL ONE
0012 = DEVICE CONTROL TWO
0013 = DEVICE CONTROL THREE
0014 = DEVICE CONTROL FOUR
0015 = NEGATIVE ACKNOWLEDGE
0016 = SYNCHRONOUS IDLE
0017 = END OF TRANSMISSION BLOCK
0018 = CANCEL
0019 = END OF MEDIUM
001A = SUBSTITUTE
     → FFFD replacement character
001B = ESCAPE
001C = INFORMATION SEPARATOR FOUR
     = file separator (FS)
001D = INFORMATION SEPARATOR THREE
     = group separator (GS)
001E = INFORMATION SEPARATOR TWO
     = record separator (RS)
001F = INFORMATION SEPARATOR ONE
     = unit separator (US)

ASCII punctuation and symbols

Based on ISO/IEC 646.

0020 SPACE
     • sometimes considered a control code
     • other space characters: 2000–200A
     → 00A0 no-break space
     → 200B zero width space
     → 2060 word joiner
     → 3000 ideographic space
     → FEFF zero width no-break space
0021 ! EXCLAMATION MARK
     = factorial
     = bang
     → 00A1 ¡ inverted exclamation mark
     → 01C3 ǃ latin letter retroflex click
     → 203C ‼ double exclamation mark
     → 203D ‽ interrobang
     → 2762 ❢ heavy exclamation mark ornament
0022 " QUOTATION MARK
     • neutral (vertical), used as opening or closing quotation mark
     • preferred characters in English for paired quotation marks are 201C “ & 201D ”
     → 02BA ʺ modifier letter double prime
     → 030B ◌̋ combining double acute accent
     → 030E ◌̎ combining double vertical line above
     → 2033 ″ double prime
     → 3003 〃 ditto mark
0023 # NUMBER SIGN
     = pound sign, hash, crosshatch, octothorpe
     → 2114 ℔ l b bar symbol
     → 266F ♯ music sharp sign
0024 $ DOLLAR SIGN
     = milreis, escudo
     • glyph may have one or two vertical bars
     • other currency symbol characters: 20A0 ₠ –20B5 ₵
     → 00A4 ¤ currency sign
0025 % PERCENT SIGN
     → 066A ٪ arabic percent sign
     → 2030 ‰ per mille sign
     → 2031 ‱ per ten thousand sign
     → 2052 ⁒ commercial minus sign
0026 & AMPERSAND
     → 204A ⁊ tironian sign et
     → 214B ⅋ turned ampersand


0027 ' APOSTROPHE
     = apostrophe-quote (1.0)
     = APL quote
     • neutral (vertical) glyph with mixed usage
     • 2019 ’ is preferred for apostrophe
     • preferred characters in English for paired quotation marks are 2018 ‘ & 2019 ’
     → 02B9 ʹ modifier letter prime
     → 02BC ʼ modifier letter apostrophe
     → 02C8 ˈ modifier letter vertical line
     → 0301 ◌́ combining acute accent
     → 2032 ′ prime
     → A78C ꞌ latin small letter saltillo
0028 ( LEFT PARENTHESIS
     = opening parenthesis (1.0)
0029 ) RIGHT PARENTHESIS
     = closing parenthesis (1.0)
     • see discussion on semantics of paired bracketing characters
002A * ASTERISK
     = star (on phone keypads)
     → 066D ٭ arabic five pointed star
     → 204E ⁎ low asterisk
     → 2217 ∗ asterisk operator
     → 26B9 ⚹ sextile
     → 2731 ✱ heavy asterisk
002B + PLUS SIGN
002C , COMMA
     = decimal separator
     → 060C ، arabic comma
     → 201A ‚ single low-9 quotation mark
     → 3001 、 ideographic comma
002D - HYPHEN-MINUS
     = hyphen or minus sign
     • used for either hyphen or minus sign
     → 2010 ‐ hyphen
     → 2011 ‑ non-breaking hyphen
     → 2012 ‒ figure dash
     → 2013 – en dash
     → 2212 − minus sign
     → 10191 𐆑 roman uncia sign
002E . FULL STOP
     = period, dot, decimal point
     • may be rendered as a raised decimal point in old style numbers
     → 06D4 ۔ arabic full stop
     → 3002 。 ideographic full stop
002F / SOLIDUS
     = slash, virgule
     → 01C0 ǀ latin letter dental click
     → 0338 ◌̸ combining long solidus overlay
     → 2044 ⁄ fraction slash
     → 2215 ∕ division slash

ASCII digits

0030 0 DIGIT ZERO
0031 1 DIGIT ONE
0032 2 DIGIT TWO
0033 3 DIGIT THREE
0034 4 DIGIT FOUR
0035 5 DIGIT FIVE
0036 6 DIGIT SIX
0037 7 DIGIT SEVEN
0038 8 DIGIT EIGHT
0039 9 DIGIT NINE

ASCII punctuation and symbols

003A : COLON
     → 0589 ։ armenian full stop
     → 05C3 ׃ hebrew punctuation sof pasuq
     → 2236 ∶ ratio
     → A789 ꞉ modifier letter colon
003B ; SEMICOLON
     • this, and not 037E ; , is the preferred character for 'Greek question mark'
     → 037E ; greek question mark
     → 061B ؛ arabic semicolon
     → 204F ⁏ reversed semicolon
003C < LESS-THAN SIGN
     → 2039 ‹ single left-pointing angle quotation mark
     → 2329 〈 left-pointing angle bracket
     → 27E8 ⟨ mathematical left angle bracket
     → 3008 〈 left angle bracket
003D = EQUALS SIGN
     • other related characters: 2241 ≁ –2263 ≣
     → 2260 ≠ not equal to
     → 2261 ≡ identical to
     → A78A ꞊ modifier letter short equals sign
     → 10190 𐆐 roman sextans sign
003E > GREATER-THAN SIGN
     → 203A › single right-pointing angle quotation mark
     → 232A 〉 right-pointing angle bracket
     → 27E9 ⟩ mathematical right angle bracket
     → 3009 〉 right angle bracket
003F ? QUESTION MARK
     → 00BF ¿ inverted question mark
     → 037E ; greek question mark
     → 061F ؟ arabic question mark
     → 203D ‽ interrobang
     → 2048 ⁈ question exclamation mark
     → 2049 ⁉ exclamation question mark
0040 @ COMMERCIAL AT
     = at sign

Uppercase Latin alphabet

0041 A LATIN CAPITAL LETTER A
0042 B LATIN CAPITAL LETTER B
     → 212C ℬ script capital b
0043 C LATIN CAPITAL LETTER C
     → 2102 ℂ double-struck capital c
     → 212D ℭ black-letter capital c
0044 D LATIN CAPITAL LETTER D
0045 E LATIN CAPITAL LETTER E
     → 2107 ℇ euler constant
     → 2130 ℰ script capital e
0046 F LATIN CAPITAL LETTER F
     → 2131 ℱ script capital f
     → 2132 Ⅎ turned capital f
0047 G LATIN CAPITAL LETTER G


0048 H LATIN CAPITAL LETTER H
     → 210B ℋ script capital h
     → 210C ℌ black-letter capital h
     → 210D ℍ double-struck capital h
0049 I LATIN CAPITAL LETTER I
     • Turkish and Azerbaijani use 0131 ı for lowercase
     → 0130 İ latin capital letter i with dot above
     → 0406 І cyrillic capital letter byelorussian-ukrainian i
     → 04C0 Ӏ cyrillic letter palochka
     → 2110 ℐ script capital i
     → 2111 ℑ black-letter capital i
     → 2160 Ⅰ roman numeral one
004A J LATIN CAPITAL LETTER J
004B K LATIN CAPITAL LETTER K
     → 212A K kelvin sign
004C L LATIN CAPITAL LETTER L
     → 2112 ℒ script capital l
004D M LATIN CAPITAL LETTER M
     → 2133 ℳ script capital m
004E N LATIN CAPITAL LETTER N
     → 2115 ℕ double-struck capital n
004F O LATIN CAPITAL LETTER O
0050 P LATIN CAPITAL LETTER P
     → 2119 ℙ double-struck capital p
0051 Q LATIN CAPITAL LETTER Q
     → 211A ℚ double-struck capital q
0052 R LATIN CAPITAL LETTER R
     → 211B ℛ script capital r
     → 211C ℜ black-letter capital r
     → 211D ℝ double-struck capital r
0053 S LATIN CAPITAL LETTER S
0054 T LATIN CAPITAL LETTER T
0055 U LATIN CAPITAL LETTER U
0056 V LATIN CAPITAL LETTER V
0057 W LATIN CAPITAL LETTER W
0058 X LATIN CAPITAL LETTER X
0059 Y LATIN CAPITAL LETTER Y
005A Z LATIN CAPITAL LETTER Z
     → 2124 ℤ double-struck capital z
     → 2128 ℨ black-letter capital z

ASCII punctuation and symbols

005B [ LEFT SQUARE BRACKET
     = opening square bracket (1.0)
     • other bracket characters: 27E6 ⟦ –27EB ⟫ , 2983 ⦃ –2998 ⦘ , 3008 〈 –301B 〛
005C \ REVERSE SOLIDUS
     = backslash
     → 20E5 ◌⃥ combining reverse solidus overlay
     → 2216 ∖ set minus
005D ] RIGHT SQUARE BRACKET
     = closing square bracket (1.0)
005E ^ CIRCUMFLEX ACCENT
     • this is a spacing character
     → 02C4 ˄ modifier letter up arrowhead
     → 02C6 ˆ modifier letter circumflex accent
     → 0302 ◌̂ combining circumflex accent
     → 2038 ‸ caret
     → 2303 ⌃ up arrowhead
005F _ LOW LINE
     = spacing underscore (1.0)
     • this is a spacing character
     → 02CD ˍ modifier letter low macron
     → 0331 ◌̱ combining macron below
     → 0332 ◌̲ combining low line
     → 2017 ‗ double low line
0060 ` GRAVE ACCENT
     • this is a spacing character
     → 02CB ˋ modifier letter grave accent
     → 0300 ◌̀ combining grave accent
     → 2035 ‵ reversed prime

Lowercase Latin alphabet

0061 a LATIN SMALL LETTER A
0062 b LATIN SMALL LETTER B
0063 c LATIN SMALL LETTER C
0064 d LATIN SMALL LETTER D
0065 e LATIN SMALL LETTER E
     → 212E ℮ estimated symbol
     → 212F ℯ script small e
0066 f LATIN SMALL LETTER F
0067 g LATIN SMALL LETTER G
     → 0261 ɡ latin small letter script g
     → 210A ℊ script small g
0068 h LATIN SMALL LETTER H
     → 04BB һ cyrillic small letter shha
     → 210E ℎ planck constant
0069 i LATIN SMALL LETTER I
     • Turkish and Azerbaijani use 0130 İ for uppercase
     → 0131 ı latin small letter dotless i
     → 1D6A4 𝚤 mathematical italic small dotless i
006A j LATIN SMALL LETTER J
     → 0237 ȷ latin small letter dotless j
     → 1D6A5 𝚥 mathematical italic small dotless j
006B k LATIN SMALL LETTER K
006C l LATIN SMALL LETTER L
     → 2113 ℓ script small l
     → 1D4C1 𝓁 mathematical script small l
006D m LATIN SMALL LETTER M
006E n LATIN SMALL LETTER N
     → 207F ⁿ superscript latin small letter n
006F o LATIN SMALL LETTER O
     → 2134 ℴ script small o
0070 p LATIN SMALL LETTER P
0071 q LATIN SMALL LETTER Q
0072 r LATIN SMALL LETTER R
0073 s LATIN SMALL LETTER S
0074 t LATIN SMALL LETTER T
0075 u LATIN SMALL LETTER U
0076 v LATIN SMALL LETTER V
0077 w LATIN SMALL LETTER W
0078 x LATIN SMALL LETTER X
0079 y LATIN SMALL LETTER Y
007A z LATIN SMALL LETTER Z
     → 01B6 ƶ latin small letter z with stroke

ASCII punctuation and symbols

007B { LEFT CURLY BRACKET
     = opening curly bracket (1.0)
     = left brace


007C | VERTICAL LINE
     = vertical bar
     • used in pairs to indicate absolute value
     → 01C0 ǀ latin letter dental click
     → 05C0 ׀ hebrew punctuation paseq
     → 2223 ∣ divides
     → 2758 ❘ light vertical bar
007D } RIGHT CURLY BRACKET
     = closing curly bracket (1.0)
     = right brace
007E ~ TILDE
     • this is a spacing character
     → 02DC ˜ small tilde
     → 0303 ◌̃ combining tilde
     → 2053 ⁓ swung dash
     → 223C ∼ tilde operator
     → FF5E ～ fullwidth tilde

Control character

007F = DELETE
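The arrows (→) in the list above point to related or easily confused characters. Only some of these are equivalent to the Basic Latin character under Unicode normalization, so a cross reference should not be read as an equivalence. The sketch below uses Python's standard `unicodedata` module to probe a few of them. It assumes the interpreter's bundled Unicode data, which is newer than Version 5.1, agrees with this chart for these characters. The helper name `fold` is hypothetical.

```python
import unicodedata


def fold(text: str, form: str) -> str:
    """Normalize text with the given form ('NFC', 'NFKC', ...)."""
    return unicodedata.normalize(form, text)


# Canonical singletons: NFC already maps these onto Basic Latin.
print(fold("\u037E", "NFC"))   # ;  (greek question mark)
print(fold("\u212A", "NFC"))   # K  (kelvin sign)

# Compatibility variants: NFC leaves them alone, NFKC folds them.
print(fold("\u210B", "NFC") == "\u210B")  # True (script capital h unchanged)
print(fold("\u210B", "NFKC"))  # H
print(fold("\uFF5E", "NFKC"))  # ~  (fullwidth tilde)

# Merely related: minus sign never becomes hyphen-minus.
print(fold("\u2212", "NFKC") == "-")      # False

# Spacing versus combining: 007E is a spacing character, 0303 combines.
print(unicodedata.combining("~"))         # 0
print(unicodedata.combining("\u0303"))    # 230
print(fold("n\u0303", "NFC") == "\u00F1") # True (n + combining tilde -> ñ)
```

Whether to apply NFC or NFKC before comparing text is an application decision; NFKC discards distinctions such as ℋ versus H that may matter in mathematical content.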

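The notes at 0049 and 0069 say that Turkish and Azerbaijani pair I with 0131 ı and i with 0130 İ. Python's built-in `str.lower()` and `str.upper()` apply the language-independent mappings, so they do not produce those pairs. The following is a minimal sketch of one way to handle it; the helpers `turkish_lower` and `turkish_upper` are hypothetical and cover only the dotted and dotless I, not full locale-aware casing.

```python
def turkish_lower(text: str) -> str:
    """Lowercase text, mapping I -> 0131 (dotless i) and 0130 -> i first."""
    return text.replace("\u0130", "i").replace("I", "\u0131").lower()


def turkish_upper(text: str) -> str:
    """Uppercase text, mapping i -> 0130 (I with dot above) and 0131 -> I first."""
    return text.replace("i", "\u0130").replace("\u0131", "I").upper()


print("I".lower())                # i  (default mapping)
print(turkish_lower("I"))         # ı  (0131)
print(turkish_upper("i"))         # İ  (0130)
print(turkish_upper("istanbul"))  # İSTANBUL
```

The replacements run before the default case conversion so that the built-in mapping never sees the two letters whose Turkish and Azerbaijani behavior differs.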