MSR-2-Annotated-Non-CJK-Tables-20150413
Total Page:16
File Type:pdf, Size:1020Kb
Maximal Starting Repertoire - MSR-2 Annotated Repertoire Tables, Non-CJK Integration Panel Date: 2015-4-13 How to read this file: This file shows all non-CJK characters that are included in the MSR-2 with a yellow background. The set of these code points matches the repertoire specified in the XML format of the MSR. Where present, annotations on individual code points indicate some or all of the languages a code point is used for. This file lists only those Unicode blocks containing non-CJK code points included in the MSR. Code points listed in this document, which are PVALID in IDNA2008 but excluded from the MSR for various reasons are shown with pinkish annotations indicating the primary rationale for excluding the code points, together with other information about usage background, where present. Code points shown with a white background are not PVALID in IDNA2008, or are ineligible for the root zone (digits, hyphen). Repertoire corresponding to the CJK Unified Ideographs: Main (4E00-9FFF), Extension-A (3400-4DBF), Extension B (20000-2A6DF), and Hangul Syllables (AC00-D7A3) are included in separate files. For links to these files see "Maximal Starting Repertoire - MSR-2: Overview and Rationale". How the repertoire was chosen: This file only provides a brief categorization of code points that are PVALID in IDNA2008 but excluded from the MSR. For a complete discussion of the principles and guidelines followed by the Integration Panel in creating the MSR, as well as links to the other files, please see “Maximal Starting Repertoire - MSR-2: Overview and Rationale”. Brief description of exclusion types: - Obsolete (historic, archaic), - Limited or declining use (educational, threatened, nearly extinct), - Symbol (characters classified as letters that are symbolic in nature), - Numeric (characters used in numerical context), - Punctuation (characters classified as letters that look like punctuation), - CONTEXTJ (context - join controls), - CONTEXTO (context - others), - Unstable (encoding model changed), - Deprecated (no longer in use, alternate code preferred), - Technical use (phonetic, poetry), - Religious use (annotation, cantillation), - Homoglyph (digraph of x y), - Deferred repertoire (7.0 repertoire not yet included in IANA tables), The optional parenthetical annotations provided give further information where appropriate or available. This page left intentionally blank 0000 C0 Controls and Basic Latin 007F 000 001 002 003 004 005 006 007 0 0 @ P ` p 0000 0010 0020 0030 0040 0050 0060 0070 1 ! 1 A Q a q 0001 0011 0021 0031 0041 0051 0061 0071 2 " 2 B R b r 0002 0012 0022 0032 0042 0052 0062 0072 3 # 3 C S c s 0003 0013 0023 0033 0043 0053 0063 0073 4 $ 4 D T d t 0004 0014 0024 0034 0044 0054 0064 0074 5 % 5 E U e u 0005 0015 0025 0035 0045 0055 0065 0075 6 & 6 F V f v 0006 0016 0026 0036 0046 0056 0066 0076 7 ' 7 G W g w 0007 0017 0027 0037 0047 0057 0067 0077 8 ( 8 H X h x 0008 0018 0028 0038 0048 0058 0068 0078 9 ) 9 I Y i y 0009 0019 0029 0039 0049 0059 0069 0079 A * : J Z j z 000A 001A 002A 003A 004A 005A 006A 007A B + ; K [ k { 000B 001B 002B 003B 004B 005B 006B 007B C , < L \ l | 000C 001C 002C 003C 004C 005C 006C 007C D - = M ] m } 000D 001D 002D 003D 004D 005D 006D 007D E . > N ^ n ~ 000E 001E 002E 003E 004E 005E 006E 007E F ! / ? O _ o # 000F 001F 002F 003F 004F 005F 006F 007F Printed: 13-Apr-2015 1 0000 C0 Controls and Basic Latin 0070 C0 controls 0039 9 DIGIT NINE 0000 <control> ASCII punctuation and symbols 0001 <control> 003A : COLON 0002 <control> 003B ; SEMICOLON 0003 <control> 003C < LESS-THAN SIGN 0004 <control> 003D = EQUALS SIGN 0005 <control> 003E > GREATER-THAN SIGN 0006 <control> 003F ? QUESTION MARK 0007 <control> 0040 @ COMMERCIAL AT 0008 <control> 0009 <control> Uppercase Latin alphabet 000A <control> 0041 A LATIN CAPITAL LETTER A 000B <control> 0042 B LATIN CAPITAL LETTER B 000C <control> 0043 C LATIN CAPITAL LETTER C 000D <control> 0044 D LATIN CAPITAL LETTER D 000E <control> 0045 E LATIN CAPITAL LETTER E 000F <control> 0046 F LATIN CAPITAL LETTER F 0010 <control> 0047 G LATIN CAPITAL LETTER G 0011 <control> 0048 H LATIN CAPITAL LETTER H 0012 <control> 0049 I LATIN CAPITAL LETTER I 0013 <control> 004A J LATIN CAPITAL LETTER J 0014 <control> 004B K LATIN CAPITAL LETTER K 0015 <control> 004C L LATIN CAPITAL LETTER L 0016 <control> 004D M LATIN CAPITAL LETTER M 0017 <control> 004E N LATIN CAPITAL LETTER N 0018 <control> 004F O LATIN CAPITAL LETTER O 0019 <control> 0050 P LATIN CAPITAL LETTER P 001A <control> 0051 Q LATIN CAPITAL LETTER Q 001B <control> 0052 R LATIN CAPITAL LETTER R 001C <control> 0053 S LATIN CAPITAL LETTER S 001D <control> 0054 T LATIN CAPITAL LETTER T 001E <control> 0055 U LATIN CAPITAL LETTER U 001F ! <control> 0056 V LATIN CAPITAL LETTER V 0057 W LATIN CAPITAL LETTER W ASCII punctuation and symbols 0058 X LATIN CAPITAL LETTER X 0020 SPACE 0059 Y LATIN CAPITAL LETTER Y 0021 ! EXCLAMATION MARK 005A Z LATIN CAPITAL LETTER Z 0022 " QUOTATION MARK 0023 # NUMBER SIGN ASCII punctuation and symbols 0024 $ DOLLAR SIGN 005B [ LEFT SQUARE BRACKET 0025 % PERCENT SIGN 005C \ REVERSE SOLIDUS 0026 & AMPERSAND 005D ] RIGHT SQUARE BRACKET 0027 ' APOSTROPHE 005E ^ CIRCUMFLEX ACCENT 0028 ( LEFT PARENTHESIS 005F _ LOW LINE 0029 ) RIGHT PARENTHESIS 0060 ` GRAVE ACCENT 002A * ASTERISK Lowercase Latin alphabet 002B + PLUS SIGN 0061 a LATIN SMALL LETTER A 002C , COMMA 0062 b LATIN SMALL LETTER B 002D - HYPHEN-MINUS 0063 c LATIN SMALL LETTER C 002E . FULL STOP 0064 d LATIN SMALL LETTER D 002F / SOLIDUS 0065 e LATIN SMALL LETTER E ASCII digits 0066 f LATIN SMALL LETTER F 0030 0 DIGIT ZERO 0067 g LATIN SMALL LETTER G 0031 1 DIGIT ONE 0068 h LATIN SMALL LETTER H 0032 2 DIGIT TWO 0069 i LATIN SMALL LETTER I 0033 3 DIGIT THREE 006A j LATIN SMALL LETTER J 0034 4 DIGIT FOUR 006B k LATIN SMALL LETTER K 0035 5 DIGIT FIVE 006C l LATIN SMALL LETTER L 0036 6 DIGIT SIX 006D m LATIN SMALL LETTER M 0037 7 DIGIT SEVEN 006E n LATIN SMALL LETTER N 0038 8 DIGIT EIGHT 006F o LATIN SMALL LETTER O 0070 p LATIN SMALL LETTER P 2 Printed: 13-Apr-2015 0071 C0 Controls and Basic Latin 007F 0071 q LATIN SMALL LETTER Q 0072 r LATIN SMALL LETTER R 0073 s LATIN SMALL LETTER S 0074 t LATIN SMALL LETTER T 0075 u LATIN SMALL LETTER U 0076 v LATIN SMALL LETTER V 0077 w LATIN SMALL LETTER W 0078 x LATIN SMALL LETTER X 0079 y LATIN SMALL LETTER Y 007A z LATIN SMALL LETTER Z ASCII punctuation and symbols 007B { LEFT CURLY BRACKET 007C | VERTICAL LINE 007D } RIGHT CURLY BRACKET 007E ~ TILDE Control character 007F # <control> Printed: 13-Apr-2015 3 0080 C1 Controls and Latin-1 Supplement 00FF 008 009 00A 00B 00C 00D 00E 00F 0 P Z D ° À Ð à ð 0080 0090 00A0 00B0 00C0 00D0 00E0 00F0 1 Q [ ¡ ± Á Ñ á ñ 0081 0091 00A1 00B1 00C1 00D1 00E1 00F1 2 R \ ¢ ² Â Ò â ò 0082 0092 00A2 00B2 00C2 00D2 00E2 00F2 3 S ] £ ³ Ã Ó ã ó 0083 0093 00A3 00B3 00C3 00D3 00E3 00F3 4 T ^ ¤ ´ Ä Ô ä ô 0084 0094 00A4 00B4 00C4 00D4 00E4 00F4 5 U _ ¥ µ Å Õ å õ 0085 0095 00A5 00B5 00C5 00D5 00E5 00F5 6 V ` ¦ ¶ Æ Ö æ ö 0086 0096 00A6 00B6 00C6 00D6 00E6 00F6 7 W a § · Ç × ç ÷ 0087 0097 00A7 00B7 00C7 00D7 00E7 00F7 8 X b ¨ ¸ È Ø è ø 0088 0098 00A8 00B8 00C8 00D8 00E8 00F8 9 Y c © ¹ É Ù é ù 0089 0099 00A9 00B9 00C9 00D9 00E9 00F9 A . > ª º Ê Ú ê ú 008A 009A 00AA 00BA 00CA 00DA 00EA 00FA B / ? « » Ë Û ë û 008B 009B 00AB 00BB 00CB 00DB 00EB 00FB C 0 @ ¬ ¼ Ì Ü ì ü 008C 009C 00AC 00BC 00CC 00DC 00EC 00FC D 1 A E ½ Í Ý í ý 008D 009D 00AD 00BD 00CD 00DD 00ED 00FD E 2 B ® ¾ Î Þ î þ 008E 009E 00AE 00BE 00CE 00DE 00EE 00FE F 3 C ¯ ¿ Ï ß ï ÿ 008F 009F 00AF 00BF 00CF 00DF 00EF 00FF 4 Printed: 13-Apr-2015 0080 C1 Controls and Latin-1 Supplement 00F0 C1 controls 00B9 ¹ SUPERSCRIPT ONE 0080 P <control> 00BA º MASCULINE ORDINAL INDICATOR 0081 Q <control> 00BB » RIGHT-POINTING DOUBLE ANGLE QUOTATION 0082 R <control> MARK 0083 S <control> 00BC ¼ VULGAR FRACTION ONE QUARTER 0084 T <control> 00BD ½ VULGAR FRACTION ONE HALF 0085 U <control> 00BE ¾ VULGAR FRACTION THREE QUARTERS 0086 V <control> 00BF ¿ INVERTED QUESTION MARK 0087 W <control> Letters 0088 X <control> 00C0 À LATIN CAPITAL LETTER A WITH GRAVE 0089 Y <control> 00C1 Á LATIN CAPITAL LETTER A WITH ACUTE 008A . <control> 00C2 Â LATIN CAPITAL LETTER A WITH CIRCUMFLEX 008B / <control> 00C3 Ã LATIN CAPITAL LETTER A WITH TILDE 008C 0 <control> 00C4 Ä LATIN CAPITAL LETTER A WITH DIAERESIS 008D 1 <control> 00C5 Å LATIN CAPITAL LETTER A WITH RING ABOVE 008E 2 <control> 00C6 Æ LATIN CAPITAL LETTER AE 008F 3 <control> 00C7 Ç LATIN CAPITAL LETTER C WITH CEDILLA 0090 Z <control> 00C8 È LATIN CAPITAL LETTER E WITH GRAVE 0091 [ <control> 00C9 É LATIN CAPITAL LETTER E WITH ACUTE 0092 \ <control> 00CA Ê LATIN CAPITAL LETTER E WITH CIRCUMFLEX 0093 ] <control> 00CB Ë LATIN CAPITAL LETTER E WITH DIAERESIS 0094 ^ <control> 00CC Ì LATIN CAPITAL LETTER I WITH GRAVE 0095 _ <control> 00CD Í LATIN CAPITAL LETTER I WITH ACUTE 0096 ` <control> 00CE Î LATIN CAPITAL LETTER I WITH CIRCUMFLEX 0097 a <control> 00CF Ï LATIN CAPITAL LETTER I WITH DIAERESIS 0098 b <control> 00D0 Ð LATIN CAPITAL LETTER ETH 0099 c <control> 00D1 Ñ LATIN CAPITAL LETTER N WITH TILDE 009A > <control> 00D2 Ò LATIN CAPITAL LETTER O WITH GRAVE 009B ? <control> 00D3 Ó LATIN CAPITAL LETTER O WITH ACUTE 009C @ <control> 00D4 Ô LATIN CAPITAL LETTER O WITH CIRCUMFLEX 009D A <control> 00D5 Õ LATIN CAPITAL LETTER O WITH TILDE 009E B <control> 00D6 Ö LATIN CAPITAL LETTER O WITH DIAERESIS 009F C <control> Mathematical operator Latin-1 punctuation and symbols 00D7 × MULTIPLICATION SIGN 00A0 D NO-BREAK SPACE Letters 00A1 ¡ INVERTED EXCLAMATION MARK 00A2