The Unicode Standard, Version 4.1 This File Contains an Excerpt from the Character Code Tables and List of Character Names for the Unicode Standard, Version 4.1

Total Page:16

File Type:pdf, Size:1020Kb

The Unicode Standard, Version 4.1 This File Contains an Excerpt from the Character Code Tables and List of Character Names for the Unicode Standard, Version 4.1 Phonetic Extensions Supplement Range: 1D80–1DBF The Unicode Standard, Version 4.1 This file contains an excerpt from the character code tables and list of character names for The Unicode Standard, Version 4.1. Characters in this chart that are new for The Unicode Standard, Version 4.1 are shown in conjunction with any existing characters. For ease of reference, the new characters have been highlighted in the chart grid and in the names list. This file will not be updated with errata, or when additional characters are assigned to the Unicode Standard. See http://www.unicode.org/charts/ for access to a complete list of the latest character code charts. See http://www.unicode.org/Public/4.1.0/charts/ for a complete archived file of character code charts for Unicode 4.1. Disclaimer These charts are provided as the on-line reference to the character contents of the Unicode Standard, Version 4.1 but do not provide all the information needed to fully support individual scripts using the Unicode Standard. For a complete understanding of the use of the characters contained in this excerpt file, please consult the appropriate sections of The Unicode Standard, Version 4.1, at http://www.unicode.org/versions/Unicode4.1.0/, including sections unchanged in The Unicode Standard, Version 4.0 (ISBN 0-321-18578-1), as well as Unicode Standard Annexes #9, #11, #14, #15, #24, #29, #31, and #34, the other Unicode Technical Reports and Standards, and the Unicode Character Database, which are available on-line. See http://www.unicode.org/ucd/ and http://www.unicode.org/reports/ A thorough understanding of the information contained in these additional sources is required for a successful implementation. Fonts The shapes of the reference glyphs used in these code charts are not prescriptive. Considerable variation is to be expected in actual fonts. The particular fonts used in these charts were provided to the Unicode Consortium by a number of different font designers, who own the rights to the fonts. See http://www.unicode.org/charts/fonts.html for a list. Terms of Use You may freely use these code charts for personal or internal business uses only. You may not incorporate them either wholly or in part into any product or publication, or otherwise distribute them without express written permission from the Unicode Consortium. However, you may provide links to these charts. The fonts and font data used in production of these Code Charts may NOT be extracted, or used in any other way in any product or publication, without permission or license granted by the typeface owner(s). The Unicode Consortium is not liable for errors or omissions in this excerpt file or the standard itself. Information on characters added to the Unicode Standard since the publication of Version 4.1 as well as on characters currently being considered for addition to the Unicode Standard can be found on the Unicode web site. See http://www.unicode.org/pending/pending.html and http://www.unicode.org/alloc/Pipeline.html. Copyright © 1991-2005 Unicode, Inc. All rights reserved. 1D80 Phonetic Extensions Supplement 1DBF 1D8 1D9 1DA 1DB 0 1D80 1D90 1DA0 1DB0 1 2 1D81 1D91 1DA1 1DB1 2 1D82 1D92 1DA2 1DB2 3 1D83 1D93 1DA3 1DB3 4 % 1D84 1D94 1DA4 1DB4 5 1D85 1D95 1DA5 1DB5 6 ' 7 1D86 1D96 1DA6 1DB6 7 8 1D87 1D97 1DA7 1DB7 8 1D88 1D98 1DA8 1DB8 9 1D89 1D99 1DA9 1DB9 A 1D8A 1D9A 1DAA 1DBA B 1D8B 1D9B 1DAB 1DBB C 1D8C 1D9C 1DAC 1DBC D 1D8D 1D9D 1DAD 1DBD E / 1D8E 1D9E 1DAE 1DBE F 1D8F 1D9F 1DAF 1DBF The Unicode Standard 4.1, Copyright © 1991–2005, Unicode, Inc. All rights reserved. 571 1D80 Phonetic Extensions Supplement 1DAB Latin letters with palatal hook 1D94 LATIN SMALL LETTER REVERSED OPEN E WITH RETROFLEX HOOK An additional letter with palatal hook is found in → 025D latin small letter reversed open e with another block. hook → 01AB latin small letter t with palatal hook 1D95 LATIN SMALL LETTER SCHWA WITH 1D80 LATIN SMALL LETTER B WITH PALATAL RETROFLEX HOOK HOOK → 025A ɚ latin small letter schwa with hook 1D81 LATIN SMALL LETTER D WITH PALATAL 1D96 LATIN SMALL LETTER I WITH HOOK RETROFLEX HOOK 1D82 LATIN SMALL LETTER F WITH PALATAL 1D97 LATIN SMALL LETTER OPEN O WITH HOOK RETROFLEX HOOK 1D83 LATIN SMALL LETTER G WITH PALATAL 1D98 LATIN SMALL LETTER ESH WITH HOOK RETROFLEX HOOK 1D84 LATIN SMALL LETTER K WITH PALATAL 1D99 LATIN SMALL LETTER U WITH HOOK RETROFLEX HOOK 1D85 LATIN SMALL LETTER L WITH PALATAL 1D9A LATIN SMALL LETTER EZH WITH HOOK RETROFLEX HOOK 1D86 LATIN SMALL LETTER M WITH PALATAL HOOK Modifier letters 1D87 LATIN SMALL LETTER N WITH PALATAL Other modifier letters can be found in the Spacing HOOK Modifer Letters, Phonetic Extensions, as well as 1D88 LATIN SMALL LETTER P WITH PALATAL Superscripts and Subscripts blocks. HOOK 1D9B MODIFIER LETTER SMALL TURNED 1D89 LATIN SMALL LETTER R WITH PALATAL ALPHA HOOK <super> 0252 1D8A # LATIN SMALL LETTER S WITH PALATAL 1D9C MODIFIER LETTER SMALL C HOOK <super> 0063 c 1D8B LATIN SMALL LETTER ESH WITH 1D9D MODIFIER LETTER SMALL C WITH CURL PALATAL HOOK <super> 0255 ɕ 1D8C LATIN SMALL LETTER V WITH PALATAL 1D9E MODIFIER LETTER SMALL ETH HOOK 00F0 ð 1D8D & <super> LATIN SMALL LETTER X WITH PALATAL 1D9F HOOK MODIFIER LETTER SMALL REVERSED 1D8E ' OPEN E LATIN SMALL LETTER Z WITH PALATAL 025C HOOK <super> 1DA0 MODIFIER LETTER SMALL F Latin letters with retroflex hook <super> 0066 f 1DA1 MODIFIER LETTER SMALL DOTLESS J IPA recommends transcribing vowels with r-coloring WITH STROKE (rhoticity) with the rhotic hook instead. <super> 025F → 02DE modifier letter rhotic hook 1DA2 MODIFIER LETTER SMALL SCRIPT G Additional letters with retroflex hook are found in <super> 0261 ɡ other blocks. 1DA3 MODIFIER LETTER SMALL TURNED H → 01AE latin capital letter t with retroflex <super> 0265 hook 1DA4 → 0256 MODIFIER LETTER SMALL I WITH latin small letter d with tail STROKE → 026D latin small letter l with retroflex hook <super> 0268 → 0273 latin small letter n with retroflex hook 1DA5 → 027B MODIFIER LETTER SMALL IOTA latin small letter turned r with hook <super> 0269 ɩ → 027D latin small letter r with tail 1DA6 → 0282 ʂ MODIFIER LETTER SMALL CAPITAL I latin small letter s with hook <super> 026A → 0285 latin small letter squat reversed esh 1DA7 MODIFIER LETTER SMALL CAPITAL I → 0288 latin small letter t with retroflex hook → 0290 WITH STROKE latin small letter z with retroflex hook <super> 1D7B ò → 02AF ʯ latin small letter turned h with 1DA8 MODIFIER LETTER SMALL J WITH fishhook and tail 1D8F CROSSED-TAIL LATIN SMALL LETTER A WITH <super> 029D RETROFLEX HOOK 1DA9 1D90 MODIFIER LETTER SMALL L WITH LATIN SMALL LETTER ALPHA WITH RETROFLEX HOOK RETROFLEX HOOK 026D 1D91 <super> LATIN SMALL LETTER D WITH HOOK 1DAA AND TAIL MODIFIER LETTER SMALL L WITH 1D92 PALATAL HOOK LATIN SMALL LETTER E WITH <super> 1D85 RETROFLEX HOOK 1DAB 1D93 MODIFIER LETTER SMALL CAPITAL L LATIN SMALL LETTER OPEN E WITH 029F RETROFLEX HOOK <super> 572 The Unicode Standard 4.1, Copyright © 1991–2005, Unicode, Inc. All rights reserved. 1DAC Phonetic Extensions Supplement 1DBF 1DAC MODIFIER LETTER SMALL M WITH HOOK <super> 0271 ɱ 1DAD MODIFIER LETTER SMALL TURNED M WITH LONG LEG <super> 0270 1DAE / MODIFIER LETTER SMALL N WITH LEFT HOOK <super> 0272 1DAF MODIFIER LETTER SMALL N WITH RETROFLEX HOOK <super> 0273 1DB0 MODIFIER LETTER SMALL CAPITAL N <super> 0274 1DB1 2 MODIFIER LETTER SMALL BARRED O <super> 0275 1DB2 MODIFIER LETTER SMALL PHI <super> 0278 ɸ 1DB3 MODIFIER LETTER SMALL S WITH HOOK <super> 0282 ʂ 1DB4 MODIFIER LETTER SMALL ESH <super> 0283 ʃ 1DB5 MODIFIER LETTER SMALL T WITH PALATAL HOOK <super> 01AB 1DB6 7 MODIFIER LETTER SMALL U BAR <super> 0289 # 1DB7 8 MODIFIER LETTER SMALL UPSILON <super> 028A ʊ 1DB8 MODIFIER LETTER SMALL CAPITAL U <super> 1D1C 1DB9 MODIFIER LETTER SMALL V WITH HOOK <super> 028B ʋ 1DBA MODIFIER LETTER SMALL TURNED V <super> 028C & 1DBB MODIFIER LETTER SMALL Z <super> 007A z 1DBC MODIFIER LETTER SMALL Z WITH RETROFLEX HOOK <super> 0290 1DBD MODIFIER LETTER SMALL Z WITH CURL <super> 0291 ʑ 1DBE MODIFIER LETTER SMALL EZH <super> 0292 ʒ 1DBF MODIFIER LETTER SMALL THETA <super> 03B8 θ The Unicode Standard 4.1, Copyright © 1991–2005, Unicode, Inc. All rights reserved. 573.
Recommended publications
  • Unicode Alphabets for L ATEX
    Unicode Alphabets for LATEX Specimen Mikkel Eide Eriksen March 11, 2020 2 Contents MUFI 5 SIL 21 TITUS 29 UNZ 117 3 4 CONTENTS MUFI Using the font PalemonasMUFI(0) from http://mufi.info/. Code MUFI Point Glyph Entity Name Unicode Name E262 � OEligogon LATIN CAPITAL LIGATURE OE WITH OGONEK E268 � Pdblac LATIN CAPITAL LETTER P WITH DOUBLE ACUTE E34E � Vvertline LATIN CAPITAL LETTER V WITH VERTICAL LINE ABOVE E662 � oeligogon LATIN SMALL LIGATURE OE WITH OGONEK E668 � pdblac LATIN SMALL LETTER P WITH DOUBLE ACUTE E74F � vvertline LATIN SMALL LETTER V WITH VERTICAL LINE ABOVE E8A1 � idblstrok LATIN SMALL LETTER I WITH TWO STROKES E8A2 � jdblstrok LATIN SMALL LETTER J WITH TWO STROKES E8A3 � autem LATIN ABBREVIATION SIGN AUTEM E8BB � vslashura LATIN SMALL LETTER V WITH SHORT SLASH ABOVE RIGHT E8BC � vslashuradbl LATIN SMALL LETTER V WITH TWO SHORT SLASHES ABOVE RIGHT E8C1 � thornrarmlig LATIN SMALL LETTER THORN LIGATED WITH ARM OF LATIN SMALL LETTER R E8C2 � Hrarmlig LATIN CAPITAL LETTER H LIGATED WITH ARM OF LATIN SMALL LETTER R E8C3 � hrarmlig LATIN SMALL LETTER H LIGATED WITH ARM OF LATIN SMALL LETTER R E8C5 � krarmlig LATIN SMALL LETTER K LIGATED WITH ARM OF LATIN SMALL LETTER R E8C6 UU UUlig LATIN CAPITAL LIGATURE UU E8C7 uu uulig LATIN SMALL LIGATURE UU E8C8 UE UElig LATIN CAPITAL LIGATURE UE E8C9 ue uelig LATIN SMALL LIGATURE UE E8CE � xslashlradbl LATIN SMALL LETTER X WITH TWO SHORT SLASHES BELOW RIGHT E8D1 æ̊ aeligring LATIN SMALL LETTER AE WITH RING ABOVE E8D3 ǽ̨ aeligogonacute LATIN SMALL LETTER AE WITH OGONEK AND ACUTE 5 6 CONTENTS
    [Show full text]
  • 1 Symbols (2286)
    1 Symbols (2286) USV Symbol Macro(s) Description 0009 \textHT <control> 000A \textLF <control> 000D \textCR <control> 0022 ” \textquotedbl QUOTATION MARK 0023 # \texthash NUMBER SIGN \textnumbersign 0024 $ \textdollar DOLLAR SIGN 0025 % \textpercent PERCENT SIGN 0026 & \textampersand AMPERSAND 0027 ’ \textquotesingle APOSTROPHE 0028 ( \textparenleft LEFT PARENTHESIS 0029 ) \textparenright RIGHT PARENTHESIS 002A * \textasteriskcentered ASTERISK 002B + \textMVPlus PLUS SIGN 002C , \textMVComma COMMA 002D - \textMVMinus HYPHEN-MINUS 002E . \textMVPeriod FULL STOP 002F / \textMVDivision SOLIDUS 0030 0 \textMVZero DIGIT ZERO 0031 1 \textMVOne DIGIT ONE 0032 2 \textMVTwo DIGIT TWO 0033 3 \textMVThree DIGIT THREE 0034 4 \textMVFour DIGIT FOUR 0035 5 \textMVFive DIGIT FIVE 0036 6 \textMVSix DIGIT SIX 0037 7 \textMVSeven DIGIT SEVEN 0038 8 \textMVEight DIGIT EIGHT 0039 9 \textMVNine DIGIT NINE 003C < \textless LESS-THAN SIGN 003D = \textequals EQUALS SIGN 003E > \textgreater GREATER-THAN SIGN 0040 @ \textMVAt COMMERCIAL AT 005C \ \textbackslash REVERSE SOLIDUS 005E ^ \textasciicircum CIRCUMFLEX ACCENT 005F _ \textunderscore LOW LINE 0060 ‘ \textasciigrave GRAVE ACCENT 0067 g \textg LATIN SMALL LETTER G 007B { \textbraceleft LEFT CURLY BRACKET 007C | \textbar VERTICAL LINE 007D } \textbraceright RIGHT CURLY BRACKET 007E ~ \textasciitilde TILDE 00A0 \nobreakspace NO-BREAK SPACE 00A1 ¡ \textexclamdown INVERTED EXCLAMATION MARK 00A2 ¢ \textcent CENT SIGN 00A3 £ \textsterling POUND SIGN 00A4 ¤ \textcurrency CURRENCY SIGN 00A5 ¥ \textyen YEN SIGN 00A6
    [Show full text]
  • The Brill Typeface User Guide & Complete List of Characters
    The Brill Typeface User Guide & Complete List of Characters Version 2.06, October 31, 2014 Pim Rietbroek Preamble Few typefaces – if any – allow the user to access every Latin character, every IPA character, every diacritic, and to have these combine in a typographically satisfactory manner, in a range of styles (roman, italic, and more); even fewer add full support for Greek, both modern and ancient, with specialised characters that papyrologists and epigraphers need; not to mention coverage of the Slavic languages in the Cyrillic range. The Brill typeface aims to do just that, and to be a tool for all scholars in the humanities; for Brill’s authors and editors; for Brill’s staff and service providers; and finally, for anyone in need of this tool, as long as it is not used for any commercial gain.* There are several fonts in different styles, each of which has the same set of characters as all the others. The Unicode Standard is rigorously adhered to: there is no dependence on the Private Use Area (PUA), as it happens frequently in other fonts with regard to characters carrying rare diacritics or combinations of diacritics. Instead, all alphabetic characters can carry any diacritic or combination of diacritics, even stacked, with automatic correct positioning. This is made possible by the inclusion of all of Unicode’s combining characters and by the application of extensive OpenType Glyph Positioning programming. Credits The Brill fonts are an original design by John Hudson of Tiro Typeworks. Alice Savoie contributed to Brill bold and bold italic. The black-letter (‘Fraktur’) range of characters was made by Karsten Lücke.
    [Show full text]
  • The Unicode Standard, Version 10.0
    Phonetic Extensions Supplement Range: 1D80–1DBF This file contains an excerpt from the character code tables and list of character names for The Unicode Standard, Version 10.0 This file may be changed at any time without notice to reflect errata or other updates to the Unicode Standard. See http://www.unicode.org/errata/ for an up-to-date list of errata. See http://www.unicode.org/charts/ for access to a complete list of the latest character code charts. See http://www.unicode.org/charts/PDF/Unicode-10.0/ for charts showing only the characters added in Unicode 10.0. See http://www.unicode.org/Public/10.0.0/charts/ for a complete archived file of character code charts for Unicode 10.0. Disclaimer These charts are provided as the online reference to the character contents of the Unicode Standard, Version 10.0 but do not provide all the information needed to fully support individual scripts using the Unicode Standard. For a complete understanding of the use of the characters contained in this file, please consult the appropriate sections of The Unicode Standard, Version 10.0, online at http://www.unicode.org/versions/Unicode10.0.0/, as well as Unicode Standard Annexes #9, #11, #14, #15, #24, #29, #31, #34, #38, #41, #42, #44, and #45, the other Unicode Technical Reports and Standards, and the Unicode Character Database, which are available online. See http://www.unicode.org/ucd/ and http://www.unicode.org/reports/ A thorough understanding of the information contained in these additional sources is required for a successful implementation.
    [Show full text]
  • The BU Phonetic Keyboarding System
    The BU Phonetic keyboarding System Albert Bickford, March 18, 2021 Updated August 03, 2020 1 Introduction The BU1 Phonetic Keyboard provides access to a wide range of characters for Latin-based scripts in Unicode 4.1.0 (www.unicode.org), including: • English, Spanish, French, German, and other major European languages2 • nearly complete set of IPA and Americanist phonetic symbols, include obscure and obsolete symbols • special characters commonly-used in typesetting • arrows • common mathematical, numeric, and currency symbols (It also works with non-Unicode applications, providing access to the standard Windows ANSI Latin-1 character set, also known as codepage 1252, using virtually the same keyboarding conventions as for the corresponding Unicode characters.) The BU Phonetic keyboard is one of the more extensive Unicode keyboards for Latin scripts available, although it still does not cover all of the hundreds of Latin characters in Unicode. I have tried to include those that are more likely to be used by linguists and others working with multiple languages.3 To ease the memory load, the keyboarding conventions use a relatively small set of conventions that are applied very broadly and generally. Once you learn the conventions, you should be able to guess the keyboarding sequence for many characters without looking them up.4 I have also tried to avoid using keystroke combinations that may be needed for other purposes, e.g. for shortcut commands in common application programs. 1 “BU” stands for “Bickford Unicode”. I named it after myself not for vainglory but simply as an easy way to distinguish it from other Unicode keyboards.
    [Show full text]
  • Latin Extended-B Range: 0180–024F
    Latin Extended-B Range: 0180–024F This file contains an excerpt from the character code tables and list of character names for The Unicode Standard, Version 14.0 This file may be changed at any time without notice to reflect errata or other updates to the Unicode Standard. See https://www.unicode.org/errata/ for an up-to-date list of errata. See https://www.unicode.org/charts/ for access to a complete list of the latest character code charts. See https://www.unicode.org/charts/PDF/Unicode-14.0/ for charts showing only the characters added in Unicode 14.0. See https://www.unicode.org/Public/14.0.0/charts/ for a complete archived file of character code charts for Unicode 14.0. Disclaimer These charts are provided as the online reference to the character contents of the Unicode Standard, Version 14.0 but do not provide all the information needed to fully support individual scripts using the Unicode Standard. For a complete understanding of the use of the characters contained in this file, please consult the appropriate sections of The Unicode Standard, Version 14.0, online at https://www.unicode.org/versions/Unicode14.0.0/, as well as Unicode Standard Annexes #9, #11, #14, #15, #24, #29, #31, #34, #38, #41, #42, #44, #45, and #50, the other Unicode Technical Reports and Standards, and the Unicode Character Database, which are available online. See https://www.unicode.org/ucd/ and https://www.unicode.org/reports/ A thorough understanding of the information contained in these additional sources is required for a successful implementation.
    [Show full text]
  • Proposed Mapping of Extipa and Modifier Phonetic Characters Kirk Miller, [email protected] 2020 April 14
    Proposed mapping of extIPA and modifier phonetic characters Kirk Miller, [email protected] 2020 April 14 Per Script Ad Hoc Committee, 10780–107BF could be allocated to Latin. Mike Everson would like to draw characters. We will need to send a font to Unicode. Latin Extended-D U+A7C0, C1 still available? ...0 ...1 ...2 ...3 ...4 ...5 ...6 ...7 ...8 ...9 ...A ...B ...C ...D ...E ...F Latin D block U+A7Cx U+A7Dx U+A7Ex U+A7Fx Latin E block U+AB6x Combining Diacritical Marks Extended U+1ACx ◌ ◌ ◌ ◌ U+1ADx U+1AEx U+1AFx Supplementary plain U+1078x U+1079x U+107Ax U+107Bx U+A7C0? LATIN LETTER SMALL CAPITAL TURNED L U+A7C1? or sup plane so turned U could join it? LATIN LETTER SMALL CAPITAL TURNED K LATIN SMALL LETTER O WITH RETROFLEX HOOK. Figure 15. LATIN SMALL LETTER I WITH STROKE AND RETROFLEX HOOK. Figure 16. LATIN SMALL LETTER TESH DIGRAPH WITH RETROFLEX HOOK. Figure 14. LATIN SMALL LETTER L WITH BELT AND PALATAL HOOK. Figure 23. LATIN SMALL LETTER ENG WITH PALATAL HOOK. Figure 24. LATIN SMALL LETTER TURNED R WITH PALATAL HOOK. Figures 17–19. ... LATIN SMALL LETTER R WITH FISHHOOK AND PALATAL HOOK. Figure 19. LATIN SMALL LETTER EZH WITH PALATAL HOOK. Figures 20–21. LATIN SMALL LETTER DEZH DIGRAPH WITH PALATAL HOOK. Figure 22. LATIN SMALL LETTER TESH DIGRAPH WITH PALATAL HOOK. Figure 22. LEFT SQUARE BRACKET WITH STROKE. Figures 11, 13. RIGHT SQUARE BRACKET WITH STROKE. Figures 11, 13. LEFT SQUARE BRACKET WITH DOUBLE STROKE. Figures 11, 14. RIGHT SQUARE BRACKET WITH DOUBLE STROKE.
    [Show full text]
  • Considerations in the Identification and Management of Variant Elements in Latin Script Tables for IDN Registration
    Considerations in the identification and management of variant elements in Latin script tables for IDN registration Cary Karp Swedish Museum of Natural History This Support Brief was contributed to the ICANN VIP initiative by the host of its Latin script study, the Internet Infrastructure Foundation, with the support of the Swedish Museum of Natural History. The code points available for use in IDNs are all taken from the Unicode Character Code Charts. The Latin script is divided there into nine blocks. The one headed “Basic Latin” restates the ASCII repertoire and therefore includes the familiar letter-digit-hyphen (“LDH”) array to which TLD registries previously restricted all second-level domain names. The TLD labels, themselves, were further restricted to the letters in that repertoire. Latin letters other than the ‘a–z’ encoded in ASCII, as well as diacritically marked and otherwise decorated forms are presented in supplemental and extended Latin blocks, with further Latin letters in blocks under the heading “Phonetic Symbols”. Many of the marked letters can be represented with differing series of code points. Other letters that are intrinsically different and have different code points may share the same glyph. Protocol constraint renders the first of these situations tractable. Contextual restriction on the use of certain code points is necessary for the second. Basic concepts and considerations in the protocol and contextual management of these conditions are discussed below, with specific regard to the local collation of permissible IDN character repertoires and the identification of variant relationships among the listed characters. The rubric “Support Brief” indicates the intention of this text serving as a source document for the study group’s deliberations and report, without being a structured work in itself.
    [Show full text]
  • Proposal to Encode Phonetic Symbols with Palatal Hook in the UCS
    Proposal to Encode Phonetic Symbols with Palatal Hook in the UCS Date: 2003-5-30 Author: Peter Constable, SIL International Address: 7500 W. Camp Wisdom Rd. Dallas, TX 75236 USA Tel: +1 972 708 7485 Email: [email protected] A. Administrative 1. Title Proposal to Encode Phonetic Symbols with Palatal Hook in the UCS 2. Requester’s name SIL International (contact: Peter Constable) 3. Requester type Expert contribution 4. Submission date 2003-05-30 5. Requester’s reference 6a. Completion This is a complete proposal 6b. More information to be Only as required for clarification. provided? B. Technical------General 1a. New Script? Name? No 1b. Addition of characters to existing block? Yes — Phonetic Extensions Name? 2. Number of characters in proposal 17 3. Proposed category A 4. Proposed level of implementation and 1 (no combining marks or jamo) rationale 5a. Character names included in proposal? Yes 5b. Character names in accordance with Yes guidelines? 5c. Character shapes reviewable? Yes 6a. Who will provide computerized font? SIL International 6b. Font currently available? Yes 6c. Font format? TrueType Proposal to Encode Phonetic Symbols with Palatal Hook in the UCS Page 1 of 12 Peter G. Constable May 30, 2003 Rev: 11 7a. Are references (to other character sets, Yes dictionaries, descriptive texts, etc.) provided? 7b. Are published examples (such as samples Yes from newspapers, magazines, or other sources) of use of proposed characters attached? 8. Does the proposal address other aspects of Yes, suggested character properties are included (see section E). character data processing? C. Technical------Justification 1. Has this proposal for addition of No character(s) been submitted before? 2a.
    [Show full text]
  • Phonetic Extensions Supplement Range: 1D80–1DBF
    Phonetic Extensions Supplement Range: 1D80–1DBF This file contains an excerpt from the character code tables and list of character names for The Unicode Standard, Version 14.0 This file may be changed at any time without notice to reflect errata or other updates to the Unicode Standard. See https://www.unicode.org/errata/ for an up-to-date list of errata. See https://www.unicode.org/charts/ for access to a complete list of the latest character code charts. See https://www.unicode.org/charts/PDF/Unicode-14.0/ for charts showing only the characters added in Unicode 14.0. See https://www.unicode.org/Public/14.0.0/charts/ for a complete archived file of character code charts for Unicode 14.0. Disclaimer These charts are provided as the online reference to the character contents of the Unicode Standard, Version 14.0 but do not provide all the information needed to fully support individual scripts using the Unicode Standard. For a complete understanding of the use of the characters contained in this file, please consult the appropriate sections of The Unicode Standard, Version 14.0, online at https://www.unicode.org/versions/Unicode14.0.0/, as well as Unicode Standard Annexes #9, #11, #14, #15, #24, #29, #31, #34, #38, #41, #42, #44, #45, and #50, the other Unicode Technical Reports and Standards, and the Unicode Character Database, which are available online. See https://www.unicode.org/ucd/ and https://www.unicode.org/reports/ A thorough understanding of the information contained in these additional sources is required for a successful implementation.
    [Show full text]
  • Considerations in the Use of the Latin Script in Variant Internationalized Top-Level Domains
    Considerations in the use of the Latin script in variant internationalized top-level domains Final report of the ICANN VIP Study Group for the Latin script Executive summary The study group examined all the characters in the Unicode Character Code Chart version 6.1.0 that are associated with the Latin script and valid under the IDNA2008 protocol. It identified several forms of “confusability” that might require careful consideration in the collation of a subset of the broader repertoire for local use. The resolution of such issues is, however, highly dependent on local orthographic conventions. These frequently treat the same characters in different manners. Strings that are confusingly similar in the context of one language may have no such connotations in another. Noting that the Latin script is used by a larger number of separate language communities than is any other single script, attempting to provide a comprehensive overview of the needs of all of them is an unrealistic endeavor. A summary attempt at doing so nonetheless would be culturally insensitive to communities that have yet to join the IDN discussion. The study group therefore finds no basis for the categorical treatment of any code point assigned to an element of the Latin script as being equivalent to any other such code point. Nor does it believe that any such basis exists beyond what is already incorporated in the IDNA2008 protocol. The ICANN TLD application process should not permit requests for multiple Latin strings under the premise that they are variants of each other. Careful scrutiny is required when evaluating proposed TLD labels for confusability but that does not make them variants in the focused sense of the VIP study.
    [Show full text]
  • Those Obscure Accents
    Those obscure accents . Karel Hor´ak Institute of Mathematics, Academy of Sciences, Praha horakk (at) math dot cas dot cz Abstract »A special shape of a háček, similar to an apostrophe, is used in Czech and Slovak with ď, ľ, Ľ and ť characters. It could be derived from the apostrophe or comma, but it should be more humble, smaller, and, importantly, narrower. Generally, the symbol should draw less attention than the comma. This special form could also take a straight shape similar to acute; this usually occupies less space than an apostrophe-like form and it does not cause as many problems in kerning. Vertically, the symbol is most often placed towards the ascender line, but its position does not necessarily have to be constant (with ť, it is often necessary to place the accent higher that with the other characters). With capital Ľ, it is desirable that the accent exceeds the height of the character. This is mostly equivalent with justifying the upper edge of the accent to the ascender line.« [DIACRITICS, a project by typo.cz and designiq.cz] An excursion into history with many examples of good, bad and ugly solutions. Briefly from the history It should be noticed that black letters (frak- tur) were widely used in those times for typesetting. The motto quoted in the abstract, which states in And for many years, types were often not created the condensed form the final lesson I learned during in the country but brought from abroad. Black let- the long (never finished) way to understand typo- ters were used in printing until the end of 18th cen- graphic quality, would be sufficient.
    [Show full text]