DL 3.4 Part 2: Report on Best Practice in handling of Unicode and Non-Unicode Data
ECP 2006 DILI 510049
ENRICH
Report on Best Practice in handling of Unicode and Non-Unicode Data
Deliverable number D-3.4
Dissemination level Public
Delivery date 30 October 2009
Status Draft
Author(s) James Cummings, Tomas Psohlavec
eContentplus This project is funded under the eContentplus programme1, a multiannual Community programme to make digital content in Europe more accessible, usable and exploitable.
1 OJ L 79, 24.3.2005, p. 1.
Document Version Control
Version | Date | Change Made (and if appropriate reason for change) | Initials of Commentator(s) or Author(s)
0.0 | 25 Oct 09 | Draft Deliverable | JC, OUCS and AIP
Document Review Reviewer Institution Date and result of the review
Approved By (signature) Date
Accepted by at European Commission Date (signature)
1. Executive Summary
The use of Unicode is rightly commonplace as a character encoding for electronic documents. While the choice to use Unicode is beneficial, there are many contexts in which there are valid needs for characters and glyphs not represented in Unicode. Any project using non-standard characters should document the necessary information about these characters and their use. The nature of medieval manuscripts and their descriptions, upon which the ENRICH project is founded, means that the ENRICH project is more likely than non-medieval projects to need non-standard characters. In addition, large European projects working in an internationalized context need not only to use Unicode but also to document carefully any departure from it and to provide appropriate fallbacks for rendering and presentation.

The ENRICH project uses the Text Encoding Initiative (TEI) P5 Guidelines recommendations on XML markup methods to record and document any non-Unicode characters or individual glyphs of interest to those creating an electronic resource. As a project ENRICH fully endorses and benefits from both Unicode and the TEI recommendations.

This report provides an introduction to character encoding which surveys the terminology and key concepts needed to understand the remaining discussion, the use of Unicode and non-Unicode characters in XML, and the normalization and standardization of non-standard characters. In addition, the representation of non-standard characters, both for markup and for annotation, is discussed before a final section on the use of Unicode in the ENRICH project and on the ENRICH gBank web frontend and web service API as developed. The development of the gBank is an added benefit of the method chosen by the ENRICH project and forms an additional software deliverable in its own right.

There are a number of clear recommendations that come out of the use of Unicode and non-standard characters in the ENRICH project.
1. Wherever possible projects should use a Unicode character encoding such as UTF-8.
2. Projects needing to reference or record non-standard characters should in preference adopt a system such as the TEI Gaiji module recommendations for documenting their use of non-standard characters and/or the Unicode Private Use Area. ENRICH strongly recommends use of the TEI Guidelines in preference for such undertakings.
3. Character normalization should be well-documented and consistently applied using standardized decomposed characters that have wide font support. Any mappings to such characters need to be clearly documented.
4. All transformations, migrations, indexing and search routines should use the same table of equivalences in searching for normalized fonts.
5. Although CSS3 web fonts provide a promising method to push fonts to users viewing a web page, this should not yet be recommended practice until consistently implemented across browsers.
TABLE OF CONTENTS
1. EXECUTIVE SUMMARY
2. INTRODUCTION
3. CHARACTER SETS AND ENCODING
3.1. TERMINOLOGY AND KEY CONCEPTS
3.2. UNICODE AND XML
3.3. NON-UNICODE CHARACTERS AND XML
3.4. NORMALIZATION AND STANDARDIZATION
4. REPRESENTATION OF NON-STANDARD CHARACTERS
4.1. DESCRIPTIVE INFORMATION FOR NON-STANDARD CHARACTERS
4.2. ANNOTATION OF NON-STANDARD CHARACTERS
5. THE ENRICH PROJECT AND NON-STANDARD CHARACTERS
5.1. THE ENRICH GBANK AND THE MEDIEVAL UNICODE FONT INITIATIVE
5.2. ENRICH GBANK IN THE MANUSCRIPTORIUM SYSTEM
5.3. GBANK END-USER INTERFACE
5.4. GBANK API INTERFACE
5.5. INDEXING AND SEARCHING WITH THE GBANK
5.5.1 Indexing with gBank Characters
5.5.2 Searching with gBank Characters
5.5.3 Advanced Search Features Using gBank Characters
5.6. SUPPORT OF GBANK IN THE PRESENTATION LAYER
5.6.1 Use of Images and Standardized Mappings
5.6.2 Use of TTF and CSS 3
6. CONCLUSIONS AND RECOMMENDATIONS
7. APPENDICES
7.1. APPENDIX A (A) STRUCTURAL LIGATURES
7.2. APPENDIX B (B) NON-STRUCTURAL LIGATURES
7.3. APPENDIX C SUBRANGE 2: SMALL CAPITALS
7.4. APPENDIX D SUBRANGE 3: ENLARGED MINUSCULES
7.5. APPENDIX E SUBRANGE 4: BASE-LINE ABBREVIATION CHARACTERS
7.6. APPENDIX F SUBRANGE 5: MODIFIED BASE-LINE ABBREVIATION CHARACTERS
7.7. APPENDIX G SUBRANGE 6: COMBINING MARKS
7.8. APPENDIX H SUBRANGE 7: COMBINING SUPERSCRIPT CHARACTERS
7.9. APPENDIX I SUBRANGE 8: PUNCTUATION MARKS
7.10. APPENDIX J SUBRANGE 9: CRITICAL AND EPIGRAPHICAL SIGNS
7.11. APPENDIX K SUBRANGE 10: METRICAL SYMBOLS
7.12. APPENDIX L SUBRANGE 11: ADDITIONAL NUMBER FORMS
7.13. APPENDIX M SUBRANGE 12: WEIGHT, CURRENCY AND MEASUREMENT
7.14. APPENDIX N SUBRANGE 13: MODIFIED BASE-LINE CHARACTERS
7.15. APPENDIX O SUBRANGE 15: CHARACTERS WITH MACRON OR OVERLINE
7.16. APPENDIX P SUBRANGE 16: CHARACTERS WITH ACUTE ACCENT
7.17. APPENDIX Q SUBRANGE 17: CHARACTERS WITH DOUBLE ACUTE ACCENT
7.18. APPENDIX R SUBRANGE 18: CHARACTERS WITH DOT ABOVE
7.19. APPENDIX S SUBRANGE 19: CHARACTERS WITH DOT BELOW
7.20. APPENDIX T SUBRANGE 20: CHARACTERS WITH DIAERESIS
7.21. APPENDIX U SUBRANGE 21: CHARACTERS WITH CURL ABOVE (REVERSED OGONEK)
7.22. APPENDIX V SUBRANGE 22: CHARACTERS WITH OGONEK
7.23. APPENDIX W SUBRANGE 23: CHARACTERS WITH BREVE
7.24. APPENDIX X SUBRANGE 24: CHARACTERS WITH BREVE BELOW
7.25. APPENDIX Y SUBRANGE 25: CHARACTERS WITH CIRCUMFLEX
7.26. APPENDIX Z SUBRANGE 26: CHARACTERS WITH RING ABOVE
7.27. APPENDIX AA SUBRANGE 27: CHARACTERS WITH RING BELOW
7.28. APPENDIX AB SUBRANGE 28: CHARACTERS WITH TILDE
7.29. APPENDIX AC SUBRANGE 29: CHARACTERS WITH CURLY BAR ABOVE
7.30. APPENDIX AD SUBRANGE 30: CHARACTERS WITH VERTICAL BAR ABOVE
7.31. APPENDIX AE SUBRANGE 31: CHARACTERS WITH SUPERSCRIPT LETTERS
7.32. APPENDIX AF SUBRANGE 32: CHARACTERS WITH ACUTE ACCENT AND DOT ABOVE
7.33. APPENDIX AG SUBRANGE 33: CHARACTERS WITH ACUTE ACCENT AND DOT BELOW
7.34. APPENDIX AH SUBRANGE 34: CHARACTERS WITH ACUTE ACCENT AND DIAERESIS
7.35. APPENDIX AI SUBRANGE 35: CHARACTERS WITH ACUTE ACCENT AND CURL ABOVE (REVERSED OGONEK)
7.36. APPENDIX AJ SUBRANGE 36: CHARACTERS WITH ACUTE ACCENT AND OGONEK
7.37. APPENDIX AK SUBRANGE 37: CHARACTERS WITH DOUBLE ACUTE ACCENT AND OGONEK
7.38. APPENDIX AL SUBRANGE 38: CHARACTERS WITH DOT ABOVE AND OGONEK
7.39. APPENDIX AM SUBRANGE 39: CHARACTERS WITH DOT BELOW AND OGONEK
7.40. APPENDIX AN SUBRANGE 40: CHARACTERS WITH DIAERESIS AND MACRON
7.41. APPENDIX AO SUBRANGE 41: CHARACTERS WITH DIAERESIS AND CIRCUMFLEX
7.42. APPENDIX AP SUBRANGE 42: CHARACTERS WITH DIAERESIS AND DOT BELOW
7.43. APPENDIX AQ SUBRANGE 43: CHARACTERS WITH OGONEK AND CURL ABOVE (REVERSED OGONEK)
7.44. APPENDIX AR SUBRANGE 44: CHARACTERS WITH OGONEK AND CIRCUMFLEX
7.45. APPENDIX AS SUBRANGE 45: CHARACTERS WITH RING ABOVE AND CIRCUMFLEX
7.46. APPENDIX AT SUBRANGE 46: CHARACTERS WITH MACRON AND BREVE
7.47. APPENDIX AU SUBRANGE 47: CHARACTERS WITH MACRON AND ACUTE ACCENT
7.48. APPENDIX AV SUBRANGE 48: CHARACTERS WITH OGONEK, DOT ABOVE AND ACUTE ACCENT
7.49. APPENDIX AW SUBRANGE 51: ALPHABETICAL LIST OF VARIANT LETTER FORMS
2. Introduction
In any electronic document the conventions by which characters are encoded are of critical importance, yet they are often glossed over in vague mentions or, now more frequently, with the statement that "we'll just use Unicode", made without a proper understanding of the implications and limitations of this decision. The choice to use Unicode is a good one but, especially in the context of medieval manuscript descriptions, it may impose limitations on the recording of certain characters. This is especially the case with respect to large European projects working in an internationalized context. To counter any such limitations, the Text Encoding Initiative (TEI) P5 Guidelines have developed markup methods to record and document any non-Unicode characters or individual glyphs of interest to those creating the electronic resource. As a project ENRICH fully endorses and benefits from Unicode and the TEI recommendations.

This report provides an introduction to character encoding which surveys the terminology and key concepts needed to understand the remaining discussion, the use of Unicode and non-Unicode characters in XML, and the normalization and standardization of non-standard characters. In addition, the representation of non-standard characters, both for markup and for annotation, is discussed before a final section on the use of Unicode in the ENRICH project and on the ENRICH gBank web frontend and web service API as developed. The conclusion contains a number of recommendations for best practice in this area.
3. Character Sets and Encoding
The basis of electronic documents is the representation of one thing by another in a consistent, systematic manner, ideally also in accordance with internationally recognised standards and recommendations. At the very base of such recommendations is their application to the smallest distinctive units in any particular writing system (e.g. characters or ideograms). The development of character sets and the problems which surround them are partly related to the historical development of technological representations of these characters and also to the identification, manipulation, and rendering of the characters of a natural language. Partly as an attempt to overcome many of these problems, the Unicode Standard (whose character repertoire is kept synchronized with ISO/IEC 10646) attempts to enable the consistent representation and manipulation of text in the majority of the world's existing and historical writing systems.
3.1. Terminology and Key Concepts
In order to understand any of the issues surrounding character encoding, XML's use of Unicode, and the markup of non-standard characters, it is necessary to introduce some key concepts and especially the terminology that relates to them. The term 'character' is a good example, as its use is wide and extremely varied in meaning. It can refer both to the visible symbol on the page and to the letter or ideograph represented by that symbol. These two aspects of a 'character' are important to keep distinct when discussing their representation in electronic form. A single 'character' may have multiple forms of representation. For example, the letter 'a' may appear as a single compartment or as a double compartment where the top ascender curls over. In the following figure a number of different 'a' characters are rendered with different fonts and thus appear different. Although we recognise them all as the same abstract 'character' in one sense, they are represented by different physical forms or 'glyphs' in particular textual instantiations.
An uppercase 'A' would be a different character and represented by a different set of glyphs. However, it is important to note that in one case above, with the 'Capitals Regular' font, the lowercase letter 'a' glyph has the appearance of a typical glyph for an uppercase 'A' character. This distinction between abstract characters as concepts and their instantiations as glyphs is fundamental to any discussion of electronic documents and their character encodings. A collection of abstract characters suitable for the representation of documents created in a particular writing system is termed a 'character set'. A document's character set is simply this collection of abstract characters, but the character set that a computer program or processing device knows how to deal with is a set of abstract characters which have been predefined to match a set of numbers or 'code points' through which the characters are represented internally to the underlying machine. This is a 'coded character set' because each of these abstract characters is given a unique processable code. A writing system is a particular script used to express a particular language through the use of a defined character set. (In addition, writing systems usually have understood rules and generally are representations of at least one spoken language.)
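The relationship between an abstract character and its code point in a coded character set can be inspected directly. The following is a minimal sketch using Python's standard `unicodedata` module; the helper name `describe` is purely illustrative and not part of any ENRICH software.

```python
import unicodedata


def describe(ch: str) -> str:
    """Return the code point and the Unicode name of one abstract character."""
    return f"U+{ord(ch):04X} {unicodedata.name(ch)}"


# Lowercase 'a' and uppercase 'A' are distinct abstract characters, each with
# its own code point, whatever glyph a given font draws for them.
print(describe("a"))  # U+0061 LATIN SMALL LETTER A
print(describe("A"))  # U+0041 LATIN CAPITAL LETTER A
print(describe("þ"))  # U+00FE LATIN SMALL LETTER THORN
```

Nothing in the output depends on a font: the code point identifies the abstract character, while the choice of glyph is left to the rendering system.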
Historically many different competing character sets have been created, and their sheer number caused much work in translating from one to another. The development of Unicode is an attempt to rationalise all of these into a single coded character set and to maintain and develop it actively through a public international consortium. In the Unicode standard, each abstract character is given a definition and assigned a unique code point. Unicode differs from earlier attempts at coded character sets partly by its current size and scope, the in-built provision for near-limitless expansion, and importantly the increasing provision by commercial providers (in fonts, hardware and software) to support each new release of the standard.
3.2. Unicode and XML
One of the important aspects of the XML standard is that it only requires that Unicode be supported internally by any XML processing system. This means that all abstract characters (not including markup) in an XML document must be treated as if they are in Unicode for purposes of internal parsing of the document. In practice many character encoding systems are used, but in most cases these are translated to one of the Unicode encoding forms (e.g. UTF-8 or UTF-16) by the processor before it processes the document. Such transformations (as long as the character encoding used is properly declared) are mostly invisible to users. However, in the case of ENRICH, UTF-8 should be seen as the recommended character encoding to use.

Characters in Unicode texts can be entered in a number of ways. These include entering the character directly (if the software and font manufacturers have provided a method to do so) or representing the character with the appropriate 'Numeric Character Reference' (NCR). If the character can be entered directly in a system that gives an appropriate rendering of it, then this is to be preferred. Otherwise using an NCR might be necessary. These take the form of '&#D;' where 'D' is an integer representing the code point of the abstract character in base 10, or '&#xH;' where 'H' is the same code point expressed in hexadecimal notation, both delimited by the ampersand and semicolon. Generally the hexadecimal form is to be preferred since this is easier to relate to the code point. For example, the lowercase 'thorn' character common in Middle English texts might be entered directly as 'þ', as a decimal NCR as '&#254;' or as a hexadecimal NCR as '&#xFE;'. This notation does not need special declaration or explanation to the XML processor, as all XML processors should be able to recognise NCRs and replace them with the required code point. This means that any Unicode character that cannot be represented by the hardware or software still has a method of entry. However, it is fairly unreadable by humans. DTD-based XML documents have a third option, that of declaring a named character entity with a replacement at the head of the document. This gives a standard way for that character entity to be referred to in that one document.
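As a small illustration of the equivalence between direct entry and the two NCR forms, the sketch below builds both references for the thorn character and lets a standard XML parser resolve them. The helper names `decimal_ncr` and `hex_ncr` and the element name `w` are illustrative assumptions, not taken from the ENRICH schema.

```python
import xml.etree.ElementTree as ET


def decimal_ncr(ch: str) -> str:
    """Decimal Numeric Character Reference for a single character."""
    return f"&#{ord(ch)};"


def hex_ncr(ch: str) -> str:
    """Hexadecimal Numeric Character Reference for a single character."""
    return f"&#x{ord(ch):X};"


thorn = "\u00fe"  # LATIN SMALL LETTER THORN

# One element holding the character three times: typed directly, as a decimal
# NCR and as a hexadecimal NCR.
document = f"<w>{thorn}{decimal_ncr(thorn)}{hex_ncr(thorn)}</w>"

# The parser replaces each NCR with the code point it names, so all three
# spellings end up as the same character in the parsed text.
parsed_text = ET.fromstring(document).text

print(decimal_ncr(thorn), hex_ncr(thorn))
print(parsed_text == thorn * 3)
```

No entity declaration is needed for this to work, which is the practical advantage of NCRs over named character entities.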
3.3. Non-Unicode Characters and XML
Although Unicode attempts to be comprehensive, there are instances of characters which are not included in the Unicode standard. In some cases this is because these characters have not been demonstrated to merit the status of an individual abstract character rather than a combination of two or more existing characters. When a character can be composed of an existing Unicode character and one or more combining characters, then that is usually considered the preferred route. However, if the character has different semantics from those implied by use of the existing and similar-looking characters, that might help qualify it for inclusion. That said, Unicode included many compromise or compatibility characters to begin with because of their historical existence in common character sets and their support by hardware and software manufacturers.

One of the most important aspects of Unicode character encoding is that it reserves over 137000 code points for private use. These are in the 'Private Use Area' (PUA) code point range from U+E000 to U+F8FF (U+F0000 to U+FFFFD and U+100000 to U+10FFFD are reserved as well, but they are less frequently used). The Unicode Consortium has agreed never to assign characters in these ranges, which means that they are free for use by individual projects, organizations, or commercial interests. There is no guarantee, however, that your PUA characters will not conflict with those chosen by others. Hence, it is best if PUA characters are either used solely internally or provided with some mechanism for normalization to an existing Unicode character.
3.4. Normalization and Standardization
The ENRICH project believes that it is necessary for every Unicode-based project dealing with any unusual characters to agree on, consistently implement and document a comprehensive and coherent normalization practice. This is necessary not only for proper long-term preservation and data migration, but also for interoperation of resources. Unicode can represent many characters in two different ways: as a precomposed single code point, or as a code point sequence consisting of a base character followed by one or more combining characters such as diacritics. Scripts more recently added to Unicode usually do not have this code-point duplication because Unicode attempts to introduce no new precomposed characters that could be created by the use of combining characters. Nonetheless there are numerous duplications that exist in the older or compatibility layers of the character set. For the ENRICH gBank, as discussed later, a straightforward normalization and standardization method of providing alternative basic ASCII characters was implemented. The exceptions to this are those characters which do not have acceptable equivalents in basic ASCII. In addition, because the gBank is accessible as a web service, any individual project can choose to override these normalizations at any point on a character-by-character basis. ENRICH normalization decomposes ligatures and attempts to choose the lowest common denominator canonical characters that are the closest semantic equivalents to the non-Unicode abstract character.
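The duplication between precomposed characters and combining sequences can be seen with the standard Unicode normalization forms. The sketch below uses Python's `unicodedata.normalize`; it illustrates Unicode canonical normalization in general and is not the ENRICH gBank mapping itself. The helper name `same_text` is illustrative.

```python
import unicodedata

precomposed = "\u00e9"  # LATIN SMALL LETTER E WITH ACUTE, one code point
decomposed = "e\u0301"  # 'e' followed by COMBINING ACUTE ACCENT, two code points

# The two strings render identically but are different code point sequences.
print(precomposed == decomposed)  # False

# NFC composes where a precomposed character exists; NFD decomposes.
print(unicodedata.normalize("NFC", decomposed) == precomposed)  # True
print(unicodedata.normalize("NFD", precomposed) == decomposed)  # True


def same_text(a: str, b: str) -> bool:
    """Compare two strings after bringing both to the same normalization form."""
    return unicodedata.normalize("NFC", a) == unicodedata.normalize("NFC", b)


print(same_text(precomposed, decomposed))  # True
```

Whichever form a project chooses, applying it consistently before comparison, indexing and migration is what makes otherwise identical text match.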
4. Representation of Non-Standard Characters
The ENRICH project recommends the methods described in the TEI Guidelines for the Representation of Non-standard Characters and Glyphs. This method allows for the markup of individual non-standard characters in any level of textual transcription or metadata, while also recording additional details concerning that character. The ENRICH gBank concentrates on characters rather than glyphs but the same methods can be used for analysis of glyphs by using a
4.1. Descriptive Information for Non-Standard Characters
If a document is using a Unicode character encoding, then the properties of that character are known to the text processing systems through various character encoding libraries. If the document includes non-standard characters, perhaps encoded using the Unicode PUA, then this information is not available and recommended practice is to provide it through some form of additional markup. The TEI provides a method to do this using its
Inside the
If the property is a 'Unicode Normative Property', then it is mandatory to have a
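The opening point of this section, that libraries know the properties of standard Unicode characters but have nothing to report for Private Use Area code points, can be illustrated with a short sketch. It uses Python's standard `unicodedata` module and the three private-use ranges defined by the Unicode Standard; the helper names `is_private_use` and `known_name` are illustrative only.

```python
import unicodedata
from typing import Optional

# The three private-use ranges defined by the Unicode Standard.
PUA_RANGES = [(0xE000, 0xF8FF), (0xF0000, 0xFFFFD), (0x100000, 0x10FFFD)]


def is_private_use(ch: str) -> bool:
    """True if the character's code point lies in a Private Use Area."""
    code_point = ord(ch)
    return any(low <= code_point <= high for low, high in PUA_RANGES)


def known_name(ch: str) -> Optional[str]:
    """Unicode character name, or None when the library has none to offer."""
    return unicodedata.name(ch, None)


thorn = "\u00fe"     # a standard Unicode character
pua_char = "\uefa3"  # a PUA code point; its meaning is project-specific

print(known_name(thorn), is_private_use(thorn))
print(known_name(pua_char), is_private_use(pua_char))
print(unicodedata.category(pua_char))  # 'Co', the general category for private use

# Total number of private-use code points across the three ranges.
print(sum(high - low + 1 for low, high in PUA_RANGES))
```

For the PUA code point the library can report only that it is private use, which is why a project has to supply the name, properties and mappings itself.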
4.2. Annotation of Non-Standard Characters
For the construction of the ENRICH gBank, a set of
2. So eld and hue hit hadde
3. So eld and hue hit hadde
4. So eld and hue hit hadde
5. The ENRICH Project and Non-Standard Characters
ENRICH has done its best to adhere to recommended Unicode practice and to use Unicode character encodings internally for its work. As most projects use Unicode whenever possible these days, to do otherwise is not best practice and may indicate work that is unlikely to be funded. However, there are many completely valid reasons for using characters that do not yet appear in Unicode, or that will never appear in Unicode because they go against Unicode principles: for example, precomposed characters of convenience used when studying a specific scribal variation. The ENRICH recommendation is to use TEI methods of description for these characters, to preserve information about their standardization and the reasons for their use.
5.1. The ENRICH gBank and the Medieval Unicode Font Initiative
To provide a usable service for the ENRICH project, and potentially for others, the project decided in its investigation of Unicode to create a gBank. This is named both for the
5.2. ENRICH gBank in the Manuscriptorium System
There are five applications of gBank in the ENRICH Manuscriptorium system. Each provides a different service in order to enable both end-users and content authors to work efficiently with documents that require the use of special characters and glyphs, often not supported by Unicode. The gBank is used:
1. as a database for the newly created standalone gBank end-user interface
2. as a database for the newly created standalone gBank API interface
3. to internally enhance indexing routines and the search and retrieval system
4. to enhance the presentation layer through display of the characters covered by gBank
5. to enhance searching of texts that originated without using gBank and the Gaiji module
5.3. gBank End-User Interface
The standalone online application is now available for use at http://beta.manuscriptorium.com/apps/gbank. The application presents the content of the gBank database to end-users interested in finding a particular non-standard character. The user-friendly interface displays characters ordered into sets, which can be further searched in order to find individual characters. As images are available for almost all of the characters, it is fairly straightforward to find a particular character. If an image is not available, a short description is displayed as a label.
The user can display a particular character description by clicking on the appropriate image. For the end-user's convenience, valid XML code is displayed; this code can be copied and pasted directly into the XML metadata of the particular digital document.
The
5.4. gBank API Interface
The gBank API interface performs one important task: it returns the properties of a selected character in response to a request that passes the character's ID. The format of the request is as follows:
http://beta.manuscriptorium.com/apps/gbank/char.php?id=eec6
The value of the id parameter identifies the particular character. The API then returns the full character description.
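As a minimal sketch of the request side only, the function below builds a request URL of the form shown above. The endpoint and the `id` parameter come from the example request; the helper name `gbank_char_url` and the check that an ID is a short hexadecimal string (as the IDs in the appendices appear to be) are assumptions made for illustration. No assumption is made here about the format of the response.

```python
import re
from urllib.parse import urlencode

# Endpoint taken from the example request in this section.
GBANK_CHAR_ENDPOINT = "http://beta.manuscriptorium.com/apps/gbank/char.php"


def gbank_char_url(char_id: str) -> str:
    """Build the request URL for one gBank character ID.

    Assumption: IDs are 4 to 6 hexadecimal digits, like the IDs listed
    in the appendices (e.g. 'eec6').
    """
    if not re.fullmatch(r"[0-9a-fA-F]{4,6}", char_id):
        raise ValueError(f"not a plausible gBank character ID: {char_id!r}")
    return f"{GBANK_CHAR_ENDPOINT}?{urlencode({'id': char_id})}"


print(gbank_char_url("eec6"))
```

A client would then fetch this URL with any HTTP library and process the returned character description.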
5.5. Indexing and Searching with the gBank
5.5.1 Indexing with gBank Characters
The gBank database is also used during indexing routines within the ENRICH Manuscriptorium system. The metadata in ENRICH Manuscriptorium is indexed into a special database which enables efficient on-line searching. As it is difficult to include the non-Unicode characters into the search database, the gBank is used to substitute the
We can then use the standardized mapping and replace the character with 'af' in the indexes.
5.5.2 Searching with gBank Characters
Users can do the same when they build their search query: they can simply use the standardized character mappings instead of the original character. This approach is very important for end-users, because the majority of the special characters, and even many of those covered by Unicode, are difficult to enter into search queries using common keyboards and fonts. Therefore, as a result of our analysis and tests, we provide standardized mappings to basic ASCII for each character in the gBank suite, using the principle that they should be easy to type on a common keyboard and be covered by all common fonts. The only exceptions to this are the medieval thorn 'þ' and eth 'ð' characters, which are included because there is no easy basic ASCII transliteration for them and, as they were present in extended ASCII, they are present in most fonts. In all standardization any ligatures have been decomposed into their component parts and any combining non-alphabetic modifiers (accents, etc.) have been removed. All combining alphabetic characters have been decomposed as separate characters.
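For characters that are in Unicode, the standardization rules just described (decompose ligatures, drop non-alphabetic combining modifiers, leave thorn and eth untouched) can be approximated with Unicode compatibility decomposition. This is only a sketch under that assumption: the gBank itself uses an explicit mapping per character, and the function name `standardize` is illustrative.

```python
import unicodedata


def standardize(text: str) -> str:
    """Approximate a typable form of text made of standard Unicode characters.

    NFKD splits precomposed characters and compatibility ligatures into their
    parts; combining marks (general categories starting with 'M') are dropped.
    Characters with no decomposition, such as thorn and eth, pass through.
    """
    decomposed = unicodedata.normalize("NFKD", text)
    return "".join(ch for ch in decomposed
                   if not unicodedata.category(ch).startswith("M"))


print(standardize("\u00e9"))      # e with acute      -> e
print(standardize("\ufb01"))      # fi ligature       -> fi
print(standardize("\u00feorn"))   # thorn is retained -> þorn
```

Characters in the Private Use Area have no Unicode decomposition at all, so for them a documented table of equivalences remains the only option.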
For instance, considering the indexing example above: given the standardized mapping 'af', the user can simply enter the string 'abcafdef' into the query line, and the search will find not only 'abcafdef' but also the same string written with the original ligature character in place of 'af'. Of course, the search result is wider when using this approach, but the number of superfluous records will not be significant because of the limited size of the gBank database. These features are implemented in ENRICH Manuscriptorium, but have not yet been extensively tested with real-world examples.
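The indexing and searching behaviour described in sections 5.5.1 and 5.5.2 can be sketched with a small table of equivalences applied to both the indexed text and the query. The three mappings are taken from Appendix B; the function names, the sample documents and the substring-based search are illustrative assumptions, not the Manuscriptorium implementation.

```python
# A few entries from Appendix B: PUA code point -> standardized replacement.
GBANK_MAPPINGS = {
    "\uefa3": "af",  # aflig, LATIN SMALL LIGATURE AF
    "\ueec5": "ct",  # ctlig, LATIN SMALL LIGATURE CT
    "\ueed9": "tt",  # ttlig, LATIN SMALL LIGATURE TT
}


def to_index_form(text: str, mappings=GBANK_MAPPINGS) -> str:
    """Replace every mapped character with its standardized replacement."""
    return "".join(mappings.get(ch, ch) for ch in text)


def search(query: str, documents: list) -> list:
    """Return the documents whose index form contains the query's index form."""
    key = to_index_form(query)
    return [doc for doc in documents if key in to_index_form(doc)]


documents = ["abc\uefa3def", "abcafdef", "abcdef"]

# A plain ASCII query finds both the ligature spelling and the ASCII spelling.
print(search("abcafdef", documents) == ["abc\uefa3def", "abcafdef"])

# A query typed with the ligature character finds the same two documents.
print(search("abc\uefa3def", documents) == ["abc\uefa3def", "abcafdef"])
```

Using one shared table for indexing and for queries is what keeps the two sides consistent.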
5.5.3 Advanced Search Features Using gBank Characters
There are many metadata sources that do not use TEI P5 to create their primary metadata. Therefore they do not use the
5.6. Support of gBank in the Presentation Layer
The final but significant task when implementing gBank in Manuscriptorium was to analyze and prepare the rendering and presentation of texts and descriptions within the end-user interface.
5.6.1 Use of Images and Standardized Mappings
The system again uses the incorporated gBank database and replaces the
5.6.2 Use of TTF and CSS 3
In rendering non-standard characters in the end-user's browser, it is possible to use a dedicated TrueType Font (TTF) which is capable of displaying the texts, in combination with CSS 3. In creating images for display, all the graphic files with example characters are based on the MUFI-compliant version 3 of Andreas Stötzner's Andron Scriptor Web font. They were converted from TTF to SVG using Apache Batik and then from SVG to PNG for online display.

Additionally, the CSS 3 recommendation enables the use of a web font rather than a font residing on the end-user's computer, using the @font-face rule. This way it would be possible to display texts using dedicated fonts that supply the correct characters for the Unicode Private Use Area. Unfortunately the browser support for TTF web fonts is currently rather low (supported by: Mozilla Firefox 3.5+, Opera 10, Safari 3.1, Safari 4; not supported by: IE (all versions, which support only Embedded OpenType), Opera 9, Google Chrome 3.0, Mozilla Firefox 2, Mozilla Firefox 3.0). Therefore this usage is not recommended at present, but as more browsers come to support TTF web fonts, the suggested CSS 3 approach could be successfully applied.

Another possible way of using TTF would be to let users install a dedicated font on their systems. To check whether the font is installed, a JavaScript detection of the font's presence in the system could be implemented. This would help to decide whether to use images or the dedicated font during presentation. Note: This approach is tested and ready for implementation if the end-users decide that they require it. However, bearing all this in mind, the use of standardized mappings as described above is the preferred recommendation at the moment.
6. Conclusions and Recommendations
The comprehensive implementation of gBank in the ENRICH Manuscriptorium system greatly increases the ability to create, retrieve and display documents using old languages or unusual characters. In particular, the web-service aspect of exposing any individual character's metadata through a simple and straightforward API is of great benefit to anyone working in this area. The implementation therefore provides significant added value far beyond the project and the original scope of the workpackage.

There are a number of clear recommendations that come out of the use of Unicode and non-standard characters in the ENRICH project.
1. Wherever possible projects should use a Unicode character encoding such as UTF-8.
2. Projects needing to reference or record non-standard characters should in preference adopt a system such as the TEI Gaiji module recommendations for documenting their use of non-standard characters and/or the Unicode Private Use Area. ENRICH strongly recommends use of the TEI Guidelines in preference for such undertakings.
3. Character normalization should be well-documented and consistently applied using standardized decomposed characters that have wide font support. Any mappings to such characters need to be clearly documented.
4. All transformations, migrations, indexing and search routines should use the same table of equivalences in searching for normalized fonts.
5. Although CSS3 web fonts provide a promising method to push fonts to users viewing a web page, this should not yet be recommended practice until consistently implemented across browsers.

The provision of the ENRICH gBank service far outstrips the original task for this workpackage. While the ENRICH gBank can be maintained for the life of the project, and perhaps on a best-effort basis afterwards, its continued maintenance and upkeep has not been funded as part of ENRICH. The most significant aspect of its upkeep would be the introduction and vetting of new characters (supplied by MUFI or the ENRICH community) and the removal of characters which are later accepted into the Unicode Standard.
7. Appendices
7.1. Appendix A (a) Structural ligatures
Character ID | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description
efa0 | aa | aacloselig | U+EFA0 | LATIN SMALL LIGATURE AA CLOSED FORM
f204 | ae | aeligred | U+F204 | LATIN SMALL LETTER AE WITH RIGHT UPPER LOOP
efae | AE | AnecklessElig | U+EFAE | LATIN CAPITAL LIGATURE NECKLESS A E
efa1 | ae | anecklesselig | U+EFA1 | LATIN SMALL LIGATURE NECKLESS A E
f205 | AO | AOligred | U+F205 | LATIN CAPITAL LIGATURE AO NECKLESS
f206 | ao | aoligred | U+F206 | LATIN SMALL LIGATURE AO NECKLESS
efa2 | av | anecklessvlig | U+EFA2 | LATIN SMALL LIGATURE NECKLESS AV
7.2. Appendix B (b) Non-structural ligatures
| Character ID | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description |
|---|---|---|---|---|
| efa3 | af | aflig | U+EFA3 | LATIN SMALL LIGATURE AF |
| efa4 | af | afinslig | U+EFA4 | LATIN SMALL LIGATURE A INSULAR F |
| efa5 | ag | aglig | U+EFA5 | LATIN SMALL LIGATURE AG |
| efa6 | al | allig | U+EFA6 | LATIN SMALL LIGATURE AL |
| efa7 | an | anlig | U+EFA7 | LATIN SMALL LIGATURE AN |
| efa8 | aN | anscaplig | U+EFA8 | LATIN SMALL LIGATURE A SMALL CAPITAL N |
| efa9 | ap | aplig | U+EFA9 | LATIN SMALL LIGATURE AP |
| efaa | ar | arlig | U+EFAA | LATIN SMALL LIGATURE AR |
| efab | aR | arscaplig | U+EFAB | LATIN SMALL LIGATURE A SMALL CAPITAL R |
| efac | aþ | athornlig | U+EFAC | LATIN SMALL LIGATURE A THORN |
| eec2 | bb | bblig | U+EEC2 | LATIN SMALL LIGATURE BB |
| eec3 | bg | bglig | U+EEC3 | LATIN SMALL LIGATURE BG |
| eec4 | ck | cklig | U+EEC4 | LATIN SMALL LIGATURE CK |
| eec5 | ct | ctlig | U+EEC5 | LATIN SMALL LIGATURE CT |
| eec6 | dd | drotdrotlig | U+EEC6 | LATIN SMALL LIGATURE D ROTUNDA D ROTUNDA |
| eec7 | ey | eylig | U+EEC7 | LATIN SMALL LIGATURE EY |
| eec8 | fa | faumllig | U+EEC8 | LATIN SMALL LIGATURE F A WITH DIAERESIS |
| eec9 | fj | fjlig | U+EEC9 | LATIN SMALL LIGATURE FJ |
| f1bc | fo | foumllig | U+F1BC | LATIN SMALL LIGATURE F O WITH DIAERESIS |
| eeca | fr | frlig | U+EECA | LATIN SMALL LIGATURE FR |
| eecb | ft | ftlig | U+EECB | LATIN SMALL LIGATURE FT |
| eecc | fu | fuumllig | U+EECC | LATIN SMALL LIGATURE F U WITH DIAERESIS |
| eecd | fy | fylig | U+EECD | LATIN SMALL LIGATURE FY |
| eece | fft | fftlig | U+EECE | LATIN SMALL LIGATURE FFT |
| eecf | ffy | ffylig | U+EECF | LATIN SMALL LIGATURE FFY |
| eed0 | fty | ftylig | U+EED0 | LATIN SMALL LIGATURE FTY |
| eed1 | gg | gglig | U+EED1 | LATIN SMALL LIGATURE GG |
| eed2 | gd | gdlig | U+EED2 | LATIN SMALL LIGATURE GD |
| eed3 | gd | gdrotlig | U+EED3 | LATIN SMALL LIGATURE G D ROTUNDA |
| eed4 | gð | gethlig | U+EED4 | LATIN SMALL LIGATURE G ETH |
| eede | go | golig | U+EEDE | LATIN SMALL LIGATURE GO |
| ead2 | gp | gplig | U+EAD2 | LATIN SMALL LIGATURE GP |
| ead0 | gr | grlig | U+EAD0 | LATIN SMALL LIGATURE GR |
| ead1 | qv | qvinslig | U+EAD1 | LATIN SMALL LIGATURE Q INSULAR V |
| e8c3 | hr | hrarmlig | U+E8C3 | LATIN SMALL LETTER H LIGATED WITH ARM OF LATIN SMALL LETTER R |
| e8c2 | Hr | Hrarmlig | U+E8C2 | LATIN CAPITAL LETTER H LIGATED WITH ARM OF LATIN SMALL LETTER R |
| e8c5 | kr | krarmlig | U+E8C5 | LATIN SMALL LETTER K LIGATED WITH ARM OF LATIN SMALL LETTER R |
| f4f9 | ll | lllig | U+F4F9 | LATIN SMALL LIGATURE LL |
| eed5 | Ns | nscapslonglig | U+EED5 | LATIN SMALL LIGATURE SMALL CAPITAL N LONG S |
| efad | oc | oclig | U+EFAD | LATIN SMALL LIGATURE OC |
| eedd | PP | PPlig | U+EEDD | LATIN CAPITAL LIGATURE PP |
| eed6 | pp | pplig | U+EED6 | LATIN SMALL LIGATURE PP |
| eed7 | pp | ppflourlig | U+EED7 | LATIN SMALL LIGATURE PP WITH FLOURISH |
| eba0 | sa | slongaumllig | U+EBA0 | LATIN SMALL LIGATURE LONG S A WITH DIAERESIS |
| f4fa | sch | slongchlig | U+F4FA | LATIN SMALL LIGATURE LONG S CH |
| eba1 | sh | slonghlig | U+EBA1 | LATIN SMALL LIGATURE LONG S H |
| eba2 | si | slongilig | U+EBA2 | LATIN SMALL LIGATURE LONG S I |
| f4fb | sj | slongjlig | U+F4FB | LATIN SMALL LIGATURE LONG S J |
| f4fc | sk | slongklig | U+F4FC | LATIN SMALL LIGATURE LONG S K |
| eba3 | sl | slongllig | U+EBA3 | LATIN SMALL LIGATURE LONG S L |
| eba4 | so | slongoumllig | U+EBA4 | LATIN SMALL LIGATURE LONG S O WITH DIAERESIS |
| eba5 | sp | slongplig | U+EBA5 | LATIN SMALL LIGATURE LONG S P |
| f4fd | ss | slongsslig | U+F4FD | LATIN SMALL LIGATURE LONG S S |
| eba6 | ss | slongslonglig | U+EBA6 | LATIN SMALL LIGATURE LONG S LONG S |
| eba7 | ssi | slongslongilig | U+EBA7 | LATIN SMALL LIGATURE LONG S LONG S I |
| f4fe | ssk | slongslongklig | U+F4FE | LATIN SMALL LIGATURE LONG S LONG S K |
| eba8 | ssl | slongslongllig | U+EBA8 | LATIN SMALL LIGATURE LONG S LONG S L |
| f4ff | sst | slongslongtlig | U+F4FF | LATIN SMALL LIGATURE LONG S LONG S T |
| eba9 | sti | slongtilig | U+EBA9 | LATIN SMALL LIGATURE LONG S TI |
| ebaa | str | slongtrlig | U+EBAA | LATIN SMALL LIGATURE LONG S TR |
| ebab | su | slonguumllig | U+EBAB | LATIN SMALL LIGATURE LONG S U WITH DIAERESIS |
| ebac | sv | slongvinslig | U+EBAC | LATIN SMALL LIGATURE LONG S INSULAR V |
| eada | st | slongdestlig | U+EADA | LATIN SMALL LIGATURE LONG S DESCENDING T |
| eed8 | tr | trlig | U+EED8 | LATIN SMALL LIGATURE TR |
| eed9 | tt | ttlig | U+EED9 | LATIN SMALL LIGATURE TT |
| eeda | tt | trottrotlig | U+EEDA | LATIN SMALL LIGATURE T ROTUNDA T ROTUNDA |
| eedb | ty | tylig | U+EEDB | LATIN SMALL LIGATURE TY |
| eedc | tz | tzlig | U+EEDC | LATIN SMALL LIGATURE TZ |
| e8c1 | þr | thornrarmlig | U+E8C1 | LATIN SMALL LETTER THORN LIGATED WITH ARM OF LATIN SMALL LETTER R |
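
The standardized replacement listed for each ligature in this subrange can serve as a plain-text fallback where no MUFI-aware font is available. The following is a minimal sketch only, assuming a small hand-copied subset of the entries; the names `LIGATURE_FALLBACK` and `to_fallback` are illustrative and are not part of MUFI, TEI or the ENRICH gBank.

```python
# Hypothetical subset of the ligature subrange:
# PUA code point -> standardized replacement string.
LIGATURE_FALLBACK = {
    0xEFA3: "af",  # aflig
    0xEEC5: "ct",  # ctlig
    0xF4F9: "ll",  # lllig
    0xEED9: "tt",  # ttlig
}


def to_fallback(text: str) -> str:
    """Replace known PUA ligatures with their standardized replacement.

    Characters that are not in the mapping are passed through unchanged.
    """
    return "".join(LIGATURE_FALLBACK.get(ord(ch), ch) for ch in text)


if __name__ == "__main__":
    sample = "a\uEEC5or"  # 'a' + ctlig (U+EEC5) + 'or'
    print(to_fallback(sample))
```

Such a substitution is lossy: once applied, the text no longer records that the source used a ligature, so it is better suited to display or search than to archival storage.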
7.3. Appendix C Subrange 2: Small capitals
| Character ID | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description |
|---|---|---|---|---|
| ef0c | Q | qscap | U+EF0C | LATIN LETTER SMALL CAPITAL Q |
| ef11 | X | xscap | U+EF11 | LATIN LETTER SMALL CAPITAL X |
| ef15 | Þ | thornscap | U+EF15 | LATIN LETTER SMALL CAPITAL THORN |

7.4. Appendix D Subrange 3: Enlarged minuscules
| Character ID | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description |
|---|---|---|---|---|
| eee0 | a | aenl | U+EEE0 | LATIN ENLARGED LETTER SMALL A |
| eaf0 | a | aenlacute | U+EAF0 | LATIN ENLARGED LETTER SMALL A WITH ACUTE |
| efdf | aa | aaligenl | U+EFDF | LATIN ENLARGED LETTER SMALL LIGATURE AA |
| eaf1 | ae | aeligenl | U+EAF1 | LATIN ENLARGED LETTER SMALL LIGATURE AE |
| efde | ao | aoligenl | U+EFDE | LATIN ENLARGED LETTER SMALL LIGATURE AO |
| eaf2 | ao | aenlosmalllig | U+EAF2 | LATIN LIGATURE ENLARGED LETTER SMALL A AND LATIN SMALL LETTER O |
| eee1 | b | benl | U+EEE1 | LATIN ENLARGED LETTER SMALL B |
| eee2 | c | cenl | U+EEE2 | LATIN ENLARGED LETTER SMALL C |
| eee3 | d | denl | U+EEE3 | LATIN ENLARGED LETTER SMALL D |
| eee4 | d | drotenl | U+EEE4 | LATIN ENLARGED LETTER D ROTUNDA |
| eee5 | ð | ethenl | U+EEE5 | LATIN ENLARGED LETTER SMALL ETH |
| eee6 | e | eenl | U+EEE6 | LATIN ENLARGED LETTER SMALL E |
| eaf3 | e | eogonenl | U+EAF3 | LATIN ENLARGED LETTER SMALL E WITH OGONEK |
| eee7 | f | fenl | U+EEE7 | LATIN ENLARGED LETTER SMALL F |
| eeff | f | finsenl | U+EEFF | LATIN ENLARGED LETTER SMALL INSULAR F |
| eee8 | g | genl | U+EEE8 | LATIN ENLARGED LETTER SMALL G |
| eee9 | h | henl | U+EEE9 | LATIN ENLARGED LETTER SMALL H |
| eeea | i | ienl | U+EEEA | LATIN ENLARGED LETTER SMALL I |
| eefd | i | inodotenl | U+EEFD | LATIN ENLARGED LETTER SMALL DOTLESS I |
| eeeb | j | jenl | U+EEEB | LATIN ENLARGED LETTER SMALL J |
| eefe | j | jnodotenl | U+EEFE | LATIN ENLARGED LETTER SMALL DOTLESS J |
| eeec | k | kenl | U+EEEC | LATIN ENLARGED LETTER SMALL K |
| eeed | l | lenl | U+EEED | LATIN ENLARGED LETTER SMALL L |
| eeee | m | menl | U+EEEE | LATIN ENLARGED LETTER SMALL M |
| eeef | n | nenl | U+EEEF | LATIN ENLARGED LETTER SMALL N |
| eef0 | o | oenl | U+EEF0 | LATIN ENLARGED LETTER SMALL O |
| efdd | oe | oeligenl | U+EFDD | LATIN ENLARGED LETTER SMALL LIGATURE OE |
| eef1 | p | penl | U+EEF1 | LATIN ENLARGED LETTER SMALL P |
| eef2 | q | qenl | U+EEF2 | LATIN ENLARGED LETTER SMALL Q |
| eef3 | r | renl | U+EEF3 | LATIN ENLARGED LETTER SMALL R |
| eef4 | s | senl | U+EEF4 | LATIN ENLARGED LETTER SMALL S |
| eedf | s | slongenl | U+EEDF | LATIN ENLARGED LETTER SMALL LONG S |
| eef5 | t | tenl | U+EEF5 | LATIN ENLARGED LETTER SMALL T |
| eef7 | u | uenl | U+EEF7 | LATIN ENLARGED LETTER SMALL U |
| eef8 | v | venl | U+EEF8 | LATIN ENLARGED LETTER SMALL V |
| eef9 | w | wenl | U+EEF9 | LATIN ENLARGED LETTER SMALL W |
| eefa | x | xenl | U+EEFA | LATIN ENLARGED LETTER SMALL X |
| eefb | y | yenl | U+EEFB | LATIN ENLARGED LETTER SMALL Y |
| eefc | z | zenl | U+EEFC | LATIN ENLARGED LETTER SMALL Z |
| eef6 | þ | thornenl | U+EEF6 | LATIN ENLARGED LETTER SMALL THORN |
7.5. Appendix E Subrange 4: Base-line abbreviation characters
| Character ID | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description |
|---|---|---|---|---|
| f1a5 | US | USbase | U+F1A5 | LATIN ABBREVIATION SIGN SPACING BASE-LINE CAPITAL US |
| f1a6 | us | usbase | U+F1A6 | LATIN ABBREVIATION SIGN SPACING BASE-LINE SMALL US |
| f142 | ET | ET | U+F142 | LATIN ABBREVIATION SIGN CAPITAL ET |
| f1a7 | ET | ETslash | U+F1A7 | LATIN ABBREVIATION SIGN CAPITAL ET WITH STROKE |
| f158 | et | etslash | U+F158 | LATIN ABBREVIATION SIGN SMALL ET WITH STROKE |
| f159 | de | de | U+F159 | LATIN ABBREVIATION SIGN SMALL DE |
| f1ac | ; | sem | U+F1AC | LATIN ABBREVIATION SIGN SEMICOLON |
7.6. Appendix F Subrange 5: Modified base-line abbreviation characters
| Character ID | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description |
|---|---|---|---|---|
| ebad | hs | hslonglig | U+EBAD | LATIN SMALL LIGATURE H AND LONG S |
| e7c7 | hs | hslongligbar | U+E7C7 | LATIN SMALL LIGATURE H AND LONG S WITH STROKE |
| ebae | ks | kslonglig | U+EBAE | LATIN SMALL LIGATURE K AND LONG S |
| e7c8 | ks | kslongligbar | U+E7C8 | LATIN SMALL LIGATURE K AND LONG S WITH STROKE |
| e8b3 | qr | q2app | U+E8B3 | LATIN SMALL LETTER Q LIGATED WITH R ROTUNDA |
| e8bf | qet | q3app | U+E8BF | LATIN SMALL LETTER Q LIGATED WITH FINAL ET |
| e8b4 | q | qcentrslstrok | U+E8B4 | LATIN SMALL LETTER Q WITH CENTRAL SLANTED STROKE |
| e7e4 | r | rdesstrok | U+E7E4 | LATIN SMALL LETTER R WITH LONG LEG AND STROKE THROUGH DESCENDER |
| e8b7 | s | slongflour | U+E8B7 | LATIN SMALL LETTER LONG S WITH FLOURISH |
| e8b8 | s | slongslstrok | U+E8B8 | LATIN SMALL LETTER LONG S WITH SLANTED DESCENDING STROKE |
| e8ba | v | vslash | U+E8BA | LATIN SMALL LETTER V WITH SHORT SLASH |
| e8bd | x | xslashula | U+E8BD | LATIN SMALL LETTER X WITH SHORT SLASH ABOVE |
| e8be | x | xslashlra | U+E8BE | LATIN SMALL LETTER X WITH SHORT SLASH BELOW |
| e337 | þ | THORNbarslash | U+E337 | LATIN CAPITAL LETTER THORN WITH DIAGONAL STROKE |
| f149 | þ | thornbarslash | U+F149 | LATIN SMALL LETTER THORN WITH DIAGONAL STROKE |
| e734 | þs | thornslonglig | U+E734 | LATIN SMALL LIGATURE THORN AND LONG S |
| e735 | þs | thornslongligbar | U+E735 | LATIN SMALL LIGATURE THORN AND LONG S WITH STROKE |
7.7. Appendix G Subrange 6: Combining marks
| Character ID | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description |
|---|---|---|---|---|
| f1c0 | | arbar | U+F1C0 | COMBINING ABBREVIATION MARK BAR ABOVE WITH DOT |
| f1c7 | | erang | U+F1C7 | COMBINING ABBREVIATION MARK ZIGZAG ABOVE ANGLE FORM |
| f1c8 | | ercurl | U+F1C8 | COMBINING ABBREVIATION MARK ZIGZAG ABOVE CURLY FORM |
| f1c1 | ra | rabar | U+F1C1 | COMBINING ABBREVIATION MARK SUPERSCRIPT RA OPEN A FORM WITH BAR ABOVE |
| f153 | ur | urrot | U+F153 | COMBINING ABBREVIATION MARK SUPERSCRIPT UR ROUND R FORM |
| f1c2 | ur | urlemn | U+F1C2 | COMBINING ABBREVIATION MARK SUPERSCRIPT UR LEMNISKATE FORM |
| f1c5 | | combcurlhigh | U+F1C5 | COMBINING CURL HIGH POSITION |
| f1ca | | combdothigh | U+F1CA | COMBINING DOT ABOVE HIGH POSITION |
| f1cc | | combcurlbar | U+F1CC | COMBINING CURLY BAR ABOVE |
| f1fc | | combtripbrevebl | U+F1FC | COMBINING TRIPLE BREVE BELOW |
7.8. Appendix H Subrange 7: Combining superscript characters
| Character ID | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description |
|---|---|---|---|---|
| f036 | an | anligsup | U+F036 | COMBINING LATIN SMALL LIGATURE AN |
| f03a | aN | anscapligsup | U+F03A | COMBINING LATIN SMALL LIGATURE A SMALL CAPITAL N |
| f038 | ar | arligsup | U+F038 | COMBINING LATIN SMALL LIGATURE AR |
| f130 | aR | arscapligsup | U+F130 | COMBINING LATIN SMALL LIGATURE A SMALL CAPITAL R |
| f012 | b | bsup | U+F012 | COMBINING LATIN SMALL LETTER B |
| f013 | B | bscapsup | U+F013 | COMBINING LATIN LETTER SMALL CAPITAL B |
| f016 | D | dscapsup | U+F016 | COMBINING LATIN LETTER SMALL CAPITAL D |
| f135 | e | eogonsup | U+F135 | COMBINING LATIN SMALL LETTER E WITH OGONEK |
| f136 | e | emacrsup | U+F136 | COMBINING LATIN SMALL LETTER E WITH MACRON |
| f017 | f | fsup | U+F017 | COMBINING LATIN SMALL LETTER F |
| f02f | i | inodotsup | U+F02F | COMBINING LATIN SMALL LETTER DOTLESS I |
| f030 | j | jsup | U+F030 | COMBINING LATIN SMALL LETTER J |
| f031 | j | jnodotsup | U+F031 | COMBINING LATIN SMALL LETTER DOTLESS J |
| f01c | k | kscapsup | U+F01C | COMBINING LATIN LETTER SMALL CAPITAL K |
| f13e | o | oogonsup | U+F13E | COMBINING LATIN SMALL LETTER O WITH OGONEK |
| f032 | o | oslashsup | U+F032 | COMBINING LATIN SMALL LETTER O WITH STROKE |
| f13f | o | omacrsup | U+F13F | COMBINING LATIN SMALL LETTER O WITH MACRON |
| f03e | or | orrotsup | U+F03E | COMBINING LATIN SMALL LETTER O R ROTUNDA |
| f03f | orum | orumsup | U+F03F | COMBINING LATIN SMALL LETTER O RUM |
| f025 | p | psup | U+F025 | COMBINING LATIN SMALL LETTER P |
| f033 | q | qsup | U+F033 | COMBINING LATIN SMALL LETTER Q |
| f040 | rum | rumsup | U+F040 | COMBINING LATIN SMALL LETTER RUM |
| f02a | T | tscapsup | U+F02A | COMBINING LATIN LETTER SMALL CAPITAL T |
| f03b | T | trotsup | U+F03B | COMBINING LATIN LETTER T ROTUNDA |
| f03c | w | wsup | U+F03C | COMBINING LATIN SMALL LETTER W |
| f02b | y | ysup | U+F02B | COMBINING LATIN SMALL LETTER Y |
| f03d | þ | thornsup | U+F03D | COMBINING LATIN SMALL LETTER THORN |
7.9. Appendix I Subrange 8: Punctuation marks
| Character ID | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description |
|---|---|---|---|---|
| f1f8 | . | hidot | U+F1F8 | DISTINCTIO |
| f1e2 | , | posit | U+F1E2 | COMMA POSITURA |
| f1e3 | , | ductsimpl | U+F1E3 | HIGH COMMA POSITURA (SIMPLEX DUCTUS) |
| f1ea | ; | punctvers | U+F1EA | PUNCTUS VERSUS |
| f1e4 | ., | punctposit | U+F1E4 | PUNCTUS WITH COMMA POSITURA |
| f1e5 | :, | colmidcomposit | U+F1E5 | COLON WITH MIDDLE COMMA POSITURA |
| f1f2 | ; | bidotscomposit | U+F1F2 | TWO DOTS OVER COMMA POSITURA |
| f1e6 | ; | tridotscomposit | U+F1E6 | THREE DOTS WITH COMMA POSITURA |
| f161 | ; | punctelev | U+F161 | PUNCTUS ELEVATUS |
| f1f0 | ; | punctelevdiag | U+F1F0 | PUNCTUS ELEVATUS DIAGONAL STROKE |
| f1fa | ; | punctelevhiback | U+F1FA | PUNCTUS ELEVATUS WITH HIGH BACK |
| f1fb | ; | punctelevhack | U+F1FB | PUNCTUS ELEVATUS WITH HACKLE |
| f1f5 | ; | punctflex | U+F1F5 | PUNCTUS FLEXUS |
| f1e7 | ! | punctexclam | U+F1E7 | PUNCTUS EXCLAMATIVUS |
| f160 | ? | punctinter | U+F160 | PUNCTUS INTERROGATIVUS |
| f1e8 | . | punctintertilde | U+F1E8 | PUNCTUS INTERROGATIVUS HORIZONTAL TILDE |
| f1f1 | . | punctinterlemn | U+F1F1 | PUNCTUS INTERROGATIVUS LEMNISKATE FORM |
| f1f9 | ~ | wavylin | U+F1F9 | WAVY LINE |
| f1e0 | , | medcom | U+F1E0 | MEDIEVAL COMMA |
| f1e1 | ¶ | parag | U+F1E1 | PARAGRAPHUS |
| f1ec | | renvoi | U+F1EC | SIGNE DE RENVOI |
| f1f4 | / | virgsusp | U+F1F4 | VIRGULA SUSPENSIVA |
| f1f7 | / | virgmin | U+F1F7 | SHORT VIRGULA |
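
Each MUFI entity name in these tables pairs with one PUA code point, so a set of XML entity declarations can be generated mechanically. The sketch below is illustrative only: it assumes a hand-copied subset of the punctuation entries, and the names `PUNCTUATION_ENTITIES` and `entity_declaration` are hypothetical helpers rather than anything defined by MUFI, TEI or ENRICH.

```python
# Hypothetical subset of the punctuation subrange:
# MUFI entity name -> PUA code point.
PUNCTUATION_ENTITIES = {
    "hidot": 0xF1F8,      # DISTINCTIO
    "posit": 0xF1E2,      # COMMA POSITURA
    "punctelev": 0xF161,  # PUNCTUS ELEVATUS
}


def entity_declaration(name: str, code_point: int) -> str:
    """Build a DTD internal entity declaration.

    The replacement text is a hexadecimal character reference,
    so the declaration itself stays pure ASCII.
    """
    return f'<!ENTITY {name} "&#x{code_point:04X};">'


if __name__ == "__main__":
    for name, code_point in PUNCTUATION_ENTITIES.items():
        print(entity_declaration(name, code_point))
```

Declarations of this kind only name the code point; they do not document the character, so they complement rather than replace a description of each non-standard character in the document header.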
7.10. Appendix J Subrange 9: Critical and epigraphical signs
| Character ID | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description |
|---|---|---|---|---|
| f1da | | midring | U+F1DA | MIDDLE RING |
7.11. Appendix K Subrange 10: Metrical symbols
| Character ID | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description |
|---|---|---|---|---|
| f70b | ' | metrancacute | U+F70B | METRICAL SYMBOL ANCEPS WITH ACUTE (PRIMARY STRESS) |
| f719 | " | metrancdblac | U+F719 | METRICAL SYMBOL ANCEPS WITH DOUBLE ACUTE (PRIMARY STRESS AND ALLITERATION) |
| f70c | ' | metrancgrave | U+F70C | METRICAL SYMBOL ANCEPS WITH GRAVE (PRIMARY STRESS) |
| f71a | " | metrancdblgrave | U+F71A | METRICAL SYMBOL ANCEPS WITH DOUBLE GRAVE (SECONDARY STRESS AND ALLITERATION) |
| f706 | ' | metrbreveacute | U+F706 | METRICAL SYMBOL BREVE WITH ACUTE (PRIMARY STRESS) |
| f717 | " | metrbrevedblac | U+F717 | METRICAL SYMBOL BREVE WITH DOUBLE ACUTE (PRIMARY STRESS AND ALLITERATION) |
| f707 | ' | metrbrevegrave | U+F707 | METRICAL SYMBOL BREVE WITH GRAVE (SECONDARY STRESS) |
| f718 | " | metrbrevedblgrave | U+F718 | METRICAL SYMBOL BREVE WITH DOUBLE GRAVE (SECONDARY STRESS AND ALLITERATION) |
| f704 | ' | metrmacracute | U+F704 | METRICAL SYMBOL LONGUM WITH ACUTE (PRIMARY STRESS) |
| f715 | " | metrmacrdblac | U+F715 | METRICAL SYMBOL LONGUM WITH DOUBLE ACUTE (SECONDARY STRESS) |
| f705 | ' | metrmacrgrave | U+F705 | METRICAL SYMBOL LONGUM WITH GRAVE (SECONDARY STRESS) |
| f716 | " | metrmacrdblgrave | U+F716 | METRICAL SYMBOL LONGUM WITH DOUBLE GRAVE (SECONDARY STRESS AND ALLITERATION) |
| f708 | ' | metrmacrbreveacute | U+F708 | METRICAL SYMBOL BREVE ABOVE LONGUM WITH ACUTE (SHORT OR LONG SYLLABLE WITH PRIMARY STRESS) |
| f709 | ' | metrmacrbrevegrave | U+F709 | METRICAL SYMBOL BREVE ABOVE LONGUM WITH GRAVE (SHORT OR LONG SYLLABLE WITH SECONDARY STRESS) |
| f71b | ' | metrdblbrevemacracute | U+F71B | METRICAL SYMBOL RESOLVED LIFT WITH ACUTE (PRIMARY STRESS) |
| f71c | " | metrdblbrevemacrdblac | U+F71C | METRICAL SYMBOL RESOLVED LIFT WITH DOUBLE ACUTE (PRIMARY STRESS AND ALLITERATION) |
7.12. Appendix L Subrange 11: Additional number forms
| Character ID | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description |
|---|---|---|---|---|
| f1bd | 0 | smallzero | U+F1BD | SMALL BASE LINE ZERO SIGN |
| f1be | V | Vmod | U+F1BE | MODIFIER CAPITAL LETTER V |
| f1bf | X | Xmod | U+F1BF | MODIFIER CAPITAL LETTER X |
7.13. Appendix M Subrange 12: Weight, currency and measurement
| Character ID | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description |
|---|---|---|---|---|
| f2e0 | | romaslibr | U+F2E0 | ROMAN AS LIBRALIS SIGN |
| f2e2 | x | romscapxbar | U+F2E2 | LATIN SMALL CAPITAL LETTER X WITH BAR (DENARIUS SIGN) |
| f2e3 | y | romscapybar | U+F2E3 | LATIN SMALL CAPITAL LETTER Y WITH BAR |
| f2e4 | D | romscapdslash | U+F2E4 | LATIN SMALL CAPITAL LETTER D WITH SLASH |
| f2e6 | 3 | dram | U+F2E6 | PHARMACEUTICAL DRAM SIGN |
| f2e7 | v | ecu | U+F2E7 | ECU SIGN |
| f2e8 | fl | florloop | U+F2E8 | FLOREN SIGN WITH LOOP |
| f2e9 | g | grosch | U+F2E9 | GROSCHEN SIGN |
| f2ea | £ | libradut | U+F2EA | DUTCH LIBRA SIGN |
| f2eb | £ | librafren | U+F2EB | FRENCH LIBRA SIGN |
| f2ec | £ | libraital | U+F2EC | ITALIAN LIBRA SIGN |
| f2ed | £ | libraflem | U+F2ED | FLEMISH LIBRA SIGN |
| f2ee | £ | liranuov | U+F2EE | LIRA NUOVA SIGN |
| f2ef | £ | lirasterl | U+F2EF | LIRA STERLINA SIGN |
| f2f0 | | markold | U+F2F0 | OLD MARK SIGN |
| f2f1 | | markflour | U+F2F1 | OLD FLOURISH MARK SIGN |
| f2f2 | m | msign | U+F2F2 | MARKED SMALL LETTER M SIGN |
| f2f3 | m | msignflour | U+F2F3 | FLOURISHED SMALL LETTER M SIGN |
| f2f4 | | obol | U+F2F4 | PHARMACEUTICAL OBOLUS SIGN |
| f2f5 | | penningar | U+F2F5 | PENNING SIGN |
| f2f6 | | reichtalold | U+F2F6 | OLD REICHSTALER SIGN |
| f2f7 | | schillgerm | U+F2F7 | GERMAN SCHILLING SIGN |
| f2f8 | | schillgermscript | U+F2F8 | GERMAN SCRIPT SCHILLING SIGN |
| f2f9 | | scudi | U+F2F9 | SCUDI SIGN |
| f2fd | oz | ouncescript | U+F2FD | SCRIPT OUNCE SIGN |
7.14. Appendix N Subrange 13: Modified base-line characters
| Character ID | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description |
|---|---|---|---|---|
| e7b2 | n | nbar | U+E7B2 | LATIN SMALL LETTER N WITH BAR |
| e74e | v | vbar | U+E74E | LATIN SMALL LETTER V WITH BAR |
| e77b | y | ybar | U+E77B | LATIN SMALL LETTER Y WITH BAR |
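
All of the code points listed in these appendices lie in the Private Use Area of the Basic Multilingual Plane (U+E000 to U+F8FF), so a document can be checked for characters that need documentation or a fallback with a simple range test. This is a minimal sketch under that assumption; `find_pua` is a hypothetical helper name, and the check does not tell a MUFI assignment apart from any other private use of the same code point.

```python
# The Private Use Area of the Basic Multilingual Plane.
PUA_START, PUA_END = 0xE000, 0xF8FF


def find_pua(text: str) -> list[tuple[int, str]]:
    """Return (index, 'U+XXXX') for every BMP private-use character in text."""
    return [
        (index, f"U+{ord(ch):04X}")
        for index, ch in enumerate(text)
        if PUA_START <= ord(ch) <= PUA_END
    ]


if __name__ == "__main__":
    # 'do' + nbar (U+E7B2, LATIN SMALL LETTER N WITH BAR) + 'a'
    print(find_pua("do\uE7B2a"))
```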
7.15. Appendix O Subrange 15: Characters with macron or overline
| Character ID | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description |
|---|---|---|---|---|
| f00a | | macrhigh | U+F00A | COMBINING HIGH MACRON WITH FIXED HEIGHT (PART-WIDTH) |
| f00b | | macrmed | U+F00B | COMBINING MEDIUM-HIGH MACRON WITH FIXED HEIGHT (PART-WIDTH) |
| f00c | | ovlhigh | U+F00C | COMBINING HIGH OVERLINE WITH FIXED HEIGHT (FULL-WIDTH) |
| f00d | | ovlmed | U+F00D | COMBINING MEDIUM-HIGH OVERLINE WITH FIXED HEIGHT (FULL-WIDTH) |
| e44d | | bovlmed | U+E44D | LATIN SMALL LETTER B WITH MEDIUM-HIGH OVERLINE (ACROSS ASCENDER) |
| f7b5 | C | Covlhigh = C + bar | U+F7B5 (U+F7B5 = 0043 + 0305) | LATIN CAPITAL LETTER C WITH HIGH OVERLINE (ABOVE CHARACTER) (= LATIN CAPITAL LETTER C + COMBINING OVERLINE) |
| f23f | C | romnumCrevovl = CONbase + bar | U+F23F (U+F23F = 2183 + 0305) | ROMAN NUMERAL REVERSED ONE HUNDRED WITH OVERLINE (= ROMAN NUMERAL REVERSED ONE HUNDRED + COMBINING OVERLINE) |
| f7b6 | D | Dovlhigh = D + bar | U+F7B6 (U+F7B6 = 0044 + 0305) | LATIN CAPITAL LETTER D WITH HIGH OVERLINE (ABOVE CHARACTER) (= LATIN CAPITAL LETTER D + COMBINING OVERLINE) |
| e491 | D | dovlmed | U+E491 | LATIN SMALL LETTER D WITH MEDIUM-HIGH OVERLINE (ACROSS ASCENDER) |
| e0bc | E | Eogonmacr = Eogon + combmacr | U+E0BC (U+E0BC = 0118 + 0304) | LATIN CAPITAL LETTER E WITH OGONEK AND MACRON (= LATIN CAPITAL LETTER E WITH OGONEK + COMBINING MACRON) |
| e4bc | e | eogonmacr = eogon + combmacr | U+E4BC (U+E4BC = 0119 + 0304) | LATIN SMALL LETTER E WITH OGONEK AND MACRON (= LATIN SMALL LETTER E WITH OGONEK + COMBINING MACRON) |
| e517 | h | hovlmed | U+E517 | LATIN SMALL LETTER H WITH MEDIUM-HIGH OVERLINE (ACROSS ASCENDER) |
| e150 | I | Iovlhigh = I + bar | U+E150 (U+E150 = 0049 + 0305) | LATIN CAPITAL LETTER I WITH HIGH OVERLINE (ABOVE CHARACTER) (= LATIN CAPITAL LETTER I + COMBINING OVERLINE) |
| e550 | i | iovlmed = i + bar | U+E550 (U+E550 = 0069 + 0305) | LATIN SMALL LETTER I WITH MEDIUM-HIGH OVERLINE (ABOVE CHARACTER) (= LATIN SMALL LETTER I + COMBINING OVERLINE) |
| e154 | J | Jmacrhigh = J + combmacr | U+E154 (U+E154 = 004A + 0304) | LATIN CAPITAL LETTER J WITH HIGH MACRON (ABOVE CHARACTER) (= LATIN CAPITAL LETTER J + COMBINING MACRON) |
| e152 | J | Jovlhigh = J + bar | U+E152 (U+E152 = 004A + 0305) | LATIN CAPITAL LETTER J WITH HIGH OVERLINE (ABOVE CHARACTER) (= LATIN CAPITAL LETTER J + COMBINING OVERLINE) |
| e554 | j | jmacrmed = j + combmacr | U+E554 (U+E554 = 006A + 0304) | LATIN SMALL LETTER J WITH MEDIUM-HIGH MACRON (ABOVE CHARACTER) (= LATIN SMALL LETTER J + COMBINING MACRON) |
| e552 | j | jovlmed = j + bar | U+E552 (U+E552 = 006A + 0305) | LATIN SMALL LETTER J WITH MEDIUM-HIGH OVERLINE (ABOVE CHARACTER) (= LATIN SMALL LETTER J + COMBINING OVERLINE) |
| e7c3 | k | kovlmed | U+E7C3 | LATIN SMALL LETTER K WITH MEDIUM-HIGH OVERLINE (ACROSS ASCENDER) |
| e5b1 | l | lovlmed | U+E5B1 | LATIN SMALL LETTER L WITH MEDIUM-HIGH OVERLINE (ACROSS ASCENDER) |
| f7b4 | L | Lovlhigh = L + bar | U+F7B4 (U+F7B4 = 004C + 0305) | LATIN CAPITAL LETTER L WITH HIGH OVERLINE (ABOVE CHARACTER) (= LATIN CAPITAL LETTER L + COMBINING OVERLINE) |
| e596 | l | lmacrhigh = l + combmacr | U+E596 (U+E596 = 006C + 0304) | LATIN SMALL LETTER L WITH HIGH MACRON (ABOVE CHARACTER) (= LATIN SMALL LETTER L + COMBINING MACRON) |
| e58c | l | lovlhigh = l + bar | U+E58C (U+E58C = 006C + 0305) | LATIN SMALL LETTER L WITH HIGH OVERLINE (ABOVE CHARACTER) (= LATIN SMALL LETTER L + COMBINING OVERLINE) |
| e1b8 | M | Mmacrhigh = M + combmacr | U+E1B8 (U+E1B8 = 004D + 0304) | LATIN CAPITAL LETTER M WITH HIGH MACRON (ABOVE CHARACTER) (= LATIN CAPITAL LETTER M + COMBINING MACRON) |
| e1d2 | M | Movlhigh = M + bar | U+E1D2 (U+E1D2 = 004D + 0305) | LATIN CAPITAL LETTER M WITH HIGH OVERLINE (ABOVE CHARACTER) (= LATIN CAPITAL LETTER M + COMBINING OVERLINE) |
| e5b8 | m | mmacrmed = m + combmacr | U+E5B8 (U+E5B8 = 006D + 0304) | LATIN SMALL LETTER M WITH MEDIUM-HIGH MACRON (ABOVE CHARACTER) (= LATIN SMALL LETTER M + COMBINING MACRON) |
| e5d2 | m | movlmed = m + bar | U+E5D2 (U+E5D2 = 006D + 0305) | LATIN SMALL LETTER M WITH MEDIUM-HIGH OVERLINE (ABOVE CHARACTER) (= LATIN SMALL LETTER M + COMBINING OVERLINE) |
| e1dc | N | Nmacrhigh = N + combmacr | U+E1DC (U+E1DC = 004E + 0304) | LATIN CAPITAL LETTER N WITH HIGH MACRON (ABOVE CHARACTER) (= LATIN CAPITAL LETTER N + COMBINING MACRON) |
| e5dc | n | nmacrmed = n + combmacr | U+E5DC (U+E5DC = 006E + 0304) | LATIN SMALL LETTER N WITH MEDIUM-HIGH MACRON (ABOVE CHARACTER) (= LATIN SMALL LETTER N + COMBINING MACRON) |
| e252 | O | Oslashmacr = Oslash + combmacr | U+E252 (U+E252 = 00D8 + 0304) | LATIN CAPITAL LETTER O WITH STROKE AND MACRON (= LATIN CAPITAL LETTER O WITH STROKE + COMBINING MACRON) |
| e652 | o | oslashmacr = oslash + combmacr | U+E652 (U+E652 = 00F8 + 0304) | LATIN SMALL LETTER O WITH STROKE AND MACRON (= LATIN SMALL LETTER O WITH STROKE + COMBINING MACRON) |
| e25d | OE | OEligmacr = OElig + combmacr | U+E25D (U+E25D = 0152 + 0304) | LATIN CAPITAL LIGATURE OE WITH MACRON (= LATIN CAPITAL LIGATURE OE + COMBINING MACRON) |
| e65d | oe | oeligmacr = oelig + combmacr | U+E65D (U+E65D = 0153 + 0304) | LATIN SMALL LIGATURE OE WITH MACRON (= LATIN SMALL LIGATURE OE + COMBINING MACRON) |
| e7cc | o | oopenmacr | U+E7CC | LATIN SMALL LETTER OPEN O WITH MACRON |
| e665 | p | pmacr = p + combmacr | U+E665 (U+E665 = 0070 + 0304) | LATIN SMALL LETTER P WITH MACRON (= LATIN SMALL LETTER P + COMBINING MACRON) |
| e681 | q | qmacr = q + combmacr | U+E681 (U+E681 = 0071 + 0304) | LATIN SMALL LETTER Q WITH MACRON (= LATIN SMALL LETTER Q + COMBINING MACRON) |
| e79e | s | slongovlmed | U+E79E | LATIN SMALL LETTER LONG S WITH MEDIUM-HIGH OVERLINE (ACROSS ASCENDER) |
| e34d | V | Vmacr = V + combmacr | U+E34D (U+E34D = 0056 + 0304) | LATIN CAPITAL LETTER V WITH MACRON (= LATIN CAPITAL LETTER V + COMBINING MACRON) |
| f7b2 | V | Vovlhigh = V + bar | U+F7B2 (U+F7B2 = 0056 + 0305) | LATIN CAPITAL LETTER V WITH HIGH OVERLINE (ABOVE CHARACTER) (= LATIN CAPITAL LETTER V + COMBINING OVERLINE) |
| e74d | v | vmacr = v + combmacr | U+E74D (U+E74D = 0076 + 0304) | LATIN SMALL LETTER V WITH MACRON (= LATIN SMALL LETTER V + COMBINING MACRON) |
| e357 | W | Wmacr = W + combmacr | U+E357 (U+E357 = 0057 + 0304) | LATIN CAPITAL LETTER W WITH MACRON (= LATIN CAPITAL LETTER W + COMBINING MACRON) |
| e757 | w | wmacr = w + combmacr | U+E757 (U+E757 = 0077 + 0304) | LATIN SMALL LETTER W WITH MACRON (= LATIN SMALL LETTER W + COMBINING MACRON) |
| f7b3 | X | Xovlhigh = X + bar | U+F7B3 (U+F7B3 = 0058 + 0305) | LATIN CAPITAL LETTER X WITH HIGH OVERLINE (ABOVE CHARACTER) (= LATIN CAPITAL LETTER X + COMBINING OVERLINE) |
| e7a2 | þ | thornovlmed | U+E7A2 | LATIN SMALL LETTER THORN WITH MEDIUM-HIGH OVERLINE (ACROSS ASCENDER) |
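
Many entries in this subrange give an equivalent sequence of a base character plus a standard combining mark, for example U+F7B5 = 0043 + 0305. Where such an equivalence is listed, a precomposed PUA character can be rewritten as ordinary Unicode. The following is a minimal sketch assuming a hand-copied subset of three entries; `DECOMPOSITION` and `decompose_pua` are hypothetical names, not part of MUFI or the ENRICH tooling.

```python
import unicodedata

# Hypothetical subset of the macron/overline subrange:
# PUA code point -> base character + combining mark, as listed in the table.
DECOMPOSITION = {
    0xF7B5: "\u0043\u0305",  # Covlhigh  = C + COMBINING OVERLINE
    0xE154: "\u004A\u0304",  # Jmacrhigh = J + COMBINING MACRON
    0xE665: "\u0070\u0304",  # pmacr     = p + COMBINING MACRON
}


def decompose_pua(text: str) -> str:
    """Replace known precomposed PUA characters with their listed sequence."""
    return "".join(DECOMPOSITION.get(ord(ch), ch) for ch in text)


if __name__ == "__main__":
    result = decompose_pua("\uF7B5")
    # Show the standard Unicode names of the resulting characters.
    print([unicodedata.name(ch) for ch in result])
```

The rewritten sequence no longer records the height distinction (high versus medium-high) that the PUA code point carried, so a project that cares about that distinction would need to keep it in markup.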
7.16. Appendix P Subrange 16: Characters with acute accent
| Character ID | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description |
|---|---|---|---|---|
| efe0 | AA | AAligacute = AAlig + combacute | U+EFE0 (U+EFE0 = EF90 + 0301) | LATIN CAPITAL LIGATURE AA WITH ACUTE (= LATIN CAPITAL LIGATURE AA + COMBINING ACUTE ACCENT) |
| efe1 | aa | aaligacute = aalig + combacute | U+EFE1 (U+EFE1 = EF91 + 0301) | LATIN SMALL LIGATURE AA WITH ACUTE (= LATIN SMALL LIGATURE AA + COMBINING ACUTE ACCENT) |
| efe2 | AO | AOligacute = AOlig + combacute | U+EFE2 (U+EFE2 = EF92 + 0301) | LATIN CAPITAL LIGATURE AO WITH ACUTE (= LATIN CAPITAL LIGATURE AO + COMBINING ACUTE ACCENT) |
| efe3 | ao | aoligacute = aolig + combacute | U+EFE3 (U+EFE3 = EF93 + 0301) | LATIN SMALL LIGATURE AO WITH ACUTE (= LATIN SMALL LIGATURE AO + COMBINING ACUTE ACCENT) |
| efe4 | AU | AUligacute = AUlig + combacute | U+EFE4 (U+EFE4 = EF94 + 0301) | LATIN CAPITAL LIGATURE AU WITH ACUTE (= LATIN CAPITAL LIGATURE AU + COMBINING ACUTE ACCENT) |
| efe5 | au | auligacute = aulig + combacute | U+EFE5 (U+EFE5 = EF95 + 0301) | LATIN SMALL LIGATURE AU WITH ACUTE (= LATIN SMALL LIGATURE AU + COMBINING ACUTE ACCENT) |
| efe6 | av | AVligacute = AVlig + combacute | U+EFE6 (U+EFE6 = EF96 + 0301) | LATIN CAPITAL LIGATURE AV WITH ACUTE (= LATIN CAPITAL LIGATURE AV + COMBINING ACUTE ACCENT) |
| efe7 | av | avligacute = avlig + combacute | U+EFE7 (U+EFE7 = EF97 + 0301) | LATIN SMALL LIGATURE AV WITH ACUTE (= LATIN SMALL LIGATURE AV + COMBINING ACUTE ACCENT) |
| ebb0 | AV | AVligslashacute = AVligslash + combacute | U+EBB0 (U+EBB0 = EF98 + 0301) | LATIN CAPITAL LIGATURE AV WITH STROKE AND ACUTE (= LATIN CAPITAL LIGATURE AV WITH STROKE + COMBINING ACUTE ACCENT) |
| ebb1 | av | avligslashacute = avligslash + combacute | U+EBB1 (U+EBB1 = EF99 + 0301) | LATIN SMALL LIGATURE AV WITH STROKE AND ACUTE (= LATIN SMALL LIGATURE AV WITH STROKE + COMBINING ACUTE ACCENT) |
| e044 | B | Bacute = B + combacute | U+E044 (U+E044 = 0042 + 0301) | LATIN CAPITAL LETTER B WITH ACUTE (= LATIN CAPITAL LETTER B + COMBINING ACUTE ACCENT) |
| e444 | b | bacute = b + combacute | U+E444 (U+E444 = 0062 + 0301) | LATIN SMALL LETTER B WITH ACUTE (= LATIN SMALL LETTER B + COMBINING ACUTE ACCENT) |
| e077 | D | Dacute = D + combacute | U+E077 (U+E077 = 0044 + 0301) | LATIN CAPITAL LETTER D WITH ACUTE (= LATIN CAPITAL LETTER D + COMBINING ACUTE ACCENT) |
| e477 | d | dacute = d + combacute | U+E477 (U+E477 = 0064 + 0301) | LATIN SMALL LETTER D WITH ACUTE (= LATIN SMALL LETTER D + COMBINING ACUTE ACCENT) |
| ebb2 | d | drotacute = drot + combacute | U+EBB2 (U+EBB2 = F109 + 0301) | LATIN SMALL LETTER D ROTUNDA WITH ACUTE (= LATIN SMALL LETTER D ROTUNDA + COMBINING ACUTE ACCENT) |
| e0f0 | F | Facute = F + combacute | U+E0F0 (U+E0F0 = 0046 + 0301) | LATIN CAPITAL LETTER F WITH ACUTE (= LATIN CAPITAL LETTER F + COMBINING ACUTE ACCENT) |
| e4f0 | f | facute = f + combacute | U+E4F0 (U+E4F0 = 0066 + 0301) | LATIN SMALL LETTER F WITH ACUTE (= LATIN SMALL LETTER F + COMBINING ACUTE ACCENT) |
| ebb3 | F | Finsacute = Fins + combacute | U+EBB3 (U+EBB3 = F10C + 0301) | LATIN CAPITAL LETTER INSULAR F WITH ACUTE (= LATIN CAPITAL LETTER INSULAR F + COMBINING ACUTE ACCENT) |
| ebb4 | f | finsacute = fins + combacute | U+EBB4 (U+EBB4 = F10D + 0301) | LATIN SMALL LETTER INSULAR F WITH ACUTE (= LATIN SMALL LETTER INSULAR F + COMBINING ACUTE ACCENT) |
| e116 | H | Hacute = H + combacute | U+E116 (U+E116 = 0048 + 0301) | LATIN CAPITAL LETTER H WITH ACUTE (= LATIN CAPITAL LETTER H + COMBINING ACUTE ACCENT) |
| e516 | h | hacute = h + combacute | U+E516 (U+E516 = 0068 + 0301) | LATIN SMALL LETTER H WITH ACUTE (= LATIN SMALL LETTER H + COMBINING ACUTE ACCENT) |
| e153 | J | Jacute = J + combacute | U+E153 (U+E153 = 004A + 0301) | LATIN CAPITAL LETTER J WITH ACUTE (= LATIN CAPITAL LETTER J + COMBINING ACUTE ACCENT) |
| e553 | j | jacute = j + combacute | U+E553 (U+E553 = 006A + 0301) | LATIN SMALL LETTER J WITH ACUTE (= LATIN SMALL LETTER J + COMBINING ACUTE ACCENT) |
| ebb5 | M | Muncacute = Munc + combacute | U+EBB5 (U+EBB5 = F11A + 0301) | LATIN CAPITAL LETTER UNCIAL M WITH ACUTE (= LATIN CAPITAL LETTER UNCIAL M + COMBINING ACUTE ACCENT) |
| ebb6 | M | muncacute = munc + combacute | U+EBB6 (U+EBB6 = F225 + 0301) | LATIN SMALL LETTER UNCIAL M WITH ACUTE (= LATIN SMALL LETTER UNCIAL M + COMBINING ACUTE ACCENT) |
| e259 | OE | OEligacute = OElig + combacute | U+E259 (U+E259 = 0152 + 0301) | LATIN CAPITAL LIGATURE OE WITH ACUTE (= LATIN CAPITAL LIGATURE OE + COMBINING ACUTE ACCENT) |
| e659 | oe | oeligacute = oelig + combacute | U+E659 (U+E659 = 0153 + 0301) | LATIN SMALL LIGATURE OE WITH ACUTE (= LATIN SMALL LIGATURE OE + COMBINING ACUTE ACCENT) |
| efe8 | OO | OOligacute = OOlig + combacute | U+EFE8 (U+EFE8 = F20A + 0301) | LATIN CAPITAL LIGATURE OO WITH ACUTE (= LATIN CAPITAL LIGATURE OO + COMBINING ACUTE ACCENT) |
| efe9 | oo | ooligacute = oolig + combacute | U+EFE9 (U+EFE9 = F20B + 0301) | LATIN SMALL LIGATURE OO WITH ACUTE (= LATIN SMALL LIGATURE OO + COMBINING ACUTE ACCENT) |
| ebb9 | r | rrotacute = rrot + combacute | U+EBB9 (U+EBB9 = F20E + 0301) | LATIN SMALL LETTER R ROTUNDA WITH ACUTE (= LATIN SMALL LETTER R ROTUNDA + COMBINING ACUTE ACCENT) |
| ebaf | s | slongacute = slong + combacute | U+EBAF (U+EBAF = 017F + 0301) | LATIN SMALL LETTER LONG S WITH ACUTE (= LATIN SMALL LETTER LONG S + COMBINING ACUTE ACCENT) |
| e2e2 | T | Tacute = T + combacute | U+E2E2 (U+E2E2 = 0054 + 0301) | LATIN CAPITAL LETTER T WITH ACUTE (= LATIN CAPITAL LETTER T + COMBINING ACUTE ACCENT) |
| e6e2 | t | tacute = t + combacute | U+E6E2 (U+E6E2 = 0074 + 0301) | LATIN SMALL LETTER T WITH ACUTE (= LATIN SMALL LETTER T + COMBINING ACUTE ACCENT) |
| e33a | V | Vacute = V + combacute | U+E33A (U+E33A = 0056 + 0301) | LATIN CAPITAL LETTER V WITH ACUTE (= LATIN CAPITAL LETTER V + COMBINING ACUTE ACCENT) |
| e73a | v | vacute = v + combacute | U+E73A (U+E73A = 0076 + 0301) | LATIN SMALL LETTER V WITH ACUTE (= LATIN SMALL LETTER V + COMBINING ACUTE ACCENT) |
| ebba | V | Vinsacute = Vins + combacute | U+EBBA (U+EBBA = F210 + 0301) | LATIN CAPITAL LETTER INSULAR V (VEND) WITH ACUTE (= LATIN CAPITAL LETTER INSULAR V (VEND) + COMBINING ACUTE ACCENT) |
| ebbb | v | vinsacute = vins + combacute | U+EBBB (U+EBBB = F211 + 0301) | LATIN SMALL LETTER INSULAR V (VEND) WITH ACUTE (= LATIN SMALL LETTER INSULAR V (VEND) + COMBINING ACUTE ACCENT) |
| e737 | þ | thornacute = thorn + combacute | U+E737 (U+E737 = 00FE + 0301) | LATIN SMALL LETTER THORN WITH ACUTE (= LATIN SMALL LETTER THORN + COMBINING ACUTE ACCENT) |
7.17. Appendix Q Subrange 17: Characters with double acute accent
| Character ID | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description |
|---|---|---|---|---|
| e025 | A | Adblac = A + combdblac | U+E025 (U+E025 = 0041 + 030B) | LATIN CAPITAL LETTER A WITH DOUBLE ACUTE (= LATIN CAPITAL LETTER A + COMBINING DOUBLE ACUTE ACCENT) |
| e425 | a | adblac = a + combdblac | U+E425 (U+E425 = 0061 + 030B) | LATIN SMALL LETTER A WITH DOUBLE ACUTE (= LATIN SMALL LETTER A + COMBINING DOUBLE ACUTE ACCENT) |
| efea | AA | AAligdblac = AAlig + combdblac | U+EFEA (U+EFEA = EF90 + 030B) | LATIN CAPITAL LIGATURE AA WITH DOUBLE ACUTE (= LATIN CAPITAL LIGATURE AA + COMBINING DOUBLE ACUTE ACCENT) |
| efeb | aa | aaligdblac = aalig + combdblac | U+EFEB (U+EFEB = EF91 + 030B) | LATIN SMALL LIGATURE AA WITH DOUBLE ACUTE (= LATIN SMALL LIGATURE AA + COMBINING DOUBLE ACUTE ACCENT) |
| e041 | AE | AEligdblac = AElig + combdblac | U+E041 (U+E041 = 00C6 + 030B) | LATIN CAPITAL LETTER AE WITH DOUBLE ACUTE (= LATIN CAPITAL LETTER AE + COMBINING DOUBLE ACUTE ACCENT) |
| e441 | ae | aeligdblac = aelig + combdblac | U+E441 (U+E441 = 00E6 + 030B) | LATIN SMALL LETTER AE WITH DOUBLE ACUTE (= LATIN SMALL LETTER AE + COMBINING DOUBLE ACUTE ACCENT) |
| ebc0 | AO | AOligdblac = AOlig + combdblac | U+EBC0 (U+EBC0 = EF92 + 030B) | LATIN CAPITAL LIGATURE AO WITH DOUBLE ACUTE (= LATIN CAPITAL LIGATURE AO + COMBINING DOUBLE ACUTE ACCENT) |
| ebc1 | ao | aoligdblac = aolig + combdblac | U+EBC1 (U+EBC1 = EF93 + 030B) | LATIN SMALL LIGATURE AO WITH DOUBLE ACUTE (= LATIN SMALL LIGATURE AO + COMBINING DOUBLE ACUTE ACCENT) |
| ebc2 | AV | AVligdblac = AVlig + combdblac | U+EBC2 (U+EBC2 = EF96 + 030B) | LATIN CAPITAL LIGATURE AV WITH DOUBLE ACUTE (= LATIN CAPITAL LIGATURE AV + COMBINING DOUBLE ACUTE ACCENT) |
| ebc3 | av | avligdblac = avlig + combdblac | U+EBC3 (U+EBC3 = EF97 + 030B) | LATIN SMALL LIGATURE AV WITH DOUBLE ACUTE (= LATIN SMALL LIGATURE AV + COMBINING DOUBLE ACUTE ACCENT) |
| e0d1 | E | Edblac = E + combdblac | U+E0D1 (U+E0D1 = 0045 + 030B) | LATIN CAPITAL LETTER E WITH DOUBLE ACUTE (= LATIN CAPITAL LETTER E + COMBINING DOUBLE ACUTE ACCENT) |
| e4d1 | e | edblac = e + combdblac | U+E4D1 (U+E4D1 = 0065 + 030B) | LATIN SMALL LETTER E WITH DOUBLE ACUTE (= LATIN SMALL LETTER E + COMBINING DOUBLE ACUTE ACCENT) |
| e143 | I | Idblac = I + combdblac | U+E143 (U+E143 = 0049 + 030B) | LATIN CAPITAL LETTER I WITH DOUBLE ACUTE (= LATIN CAPITAL LETTER I + COMBINING DOUBLE ACUTE ACCENT) |
| e543 | i | idblac = i + combdblac | U+E543 (U+E543 = 0069 + 030B) | LATIN SMALL LETTER I WITH DOUBLE ACUTE (= LATIN SMALL LETTER I + COMBINING DOUBLE ACUTE ACCENT) |
| e162 | J | Jdblac = J + combdblac | U+E162 (U+E162 = 004A + 030B) | LATIN CAPITAL LETTER J WITH DOUBLE ACUTE (= LATIN CAPITAL LETTER J + COMBINING DOUBLE ACUTE ACCENT) |
| e562 | j | jdblac = j + combdblac | U+E562 (U+E562 = 006A + 030B) | LATIN SMALL LETTER J WITH DOUBLE ACUTE (= LATIN SMALL LETTER J + COMBINING DOUBLE ACUTE ACCENT) |
| ebc6 | O | Oslashdblac = Oslash + combdblac | U+EBC6 (U+EBC6 = 00D8 + 030B) | LATIN CAPITAL LETTER O WITH STROKE AND DOUBLE ACUTE (= LATIN CAPITAL LETTER O WITH STROKE + COMBINING DOUBLE ACUTE ACCENT) |
| ebc7 | o | oslashdblac = oslash + combdblac | U+EBC7 (U+EBC7 = 00F8 + 030B) | LATIN SMALL LETTER O WITH STROKE AND DOUBLE ACUTE (= LATIN SMALL LETTER O WITH STROKE + COMBINING DOUBLE ACUTE ACCENT) |
| ebc8 | OE | OEligdblac = OElig + combdblac | U+EBC8 (U+EBC8 = 0152 + 030B) | LATIN CAPITAL LIGATURE OE WITH DOUBLE ACUTE (= LATIN CAPITAL LIGATURE OE + COMBINING DOUBLE ACUTE ACCENT) |
| ebc9 | oe | oeligdblac = oelig + combdblac | U+EBC9 (U+EBC9 = 0153 + 030B) | LATIN SMALL LIGATURE OE WITH DOUBLE ACUTE (= LATIN SMALL LIGATURE OE + COMBINING DOUBLE ACUTE ACCENT) |
| efec | OO | OOligdblac = OOlig + combdblac | U+EFEC (U+EFEC = F20A + 030B) | LATIN CAPITAL LIGATURE OO WITH DOUBLE ACUTE (= LATIN CAPITAL LIGATURE OO + COMBINING DOUBLE ACUTE ACCENT) |
| efed | oo | ooligdblac = oolig + combdblac | U+EFED (U+EFED = F20B + 030B) | LATIN SMALL LIGATURE OO WITH DOUBLE ACUTE (= LATIN SMALL LIGATURE OO + COMBINING DOUBLE ACUTE ACCENT) |
| e34b | V | Vdblac = V + combdblac | U+E34B (U+E34B = 0056 + 030B) | LATIN CAPITAL LETTER V WITH DOUBLE ACUTE (= LATIN CAPITAL LETTER V + COMBINING DOUBLE ACUTE ACCENT) |
| e74b | v | vdblac = v + combdblac | U+E74B (U+E74B = 0076 + 030B) | LATIN SMALL LETTER V WITH DOUBLE ACUTE (= LATIN SMALL LETTER V + COMBINING DOUBLE ACUTE ACCENT) |
| e350 | W | Wdblac = W + combdblac | U+E350 (U+E350 = 0057 + 030B) | LATIN CAPITAL LETTER W WITH DOUBLE ACUTE (= LATIN CAPITAL LETTER W + COMBINING DOUBLE ACUTE ACCENT) |
| e750 | w | wdblac = w + combdblac | U+E750 (U+E750 = 0077 + 030B) | LATIN SMALL LETTER W WITH DOUBLE ACUTE (= LATIN SMALL LETTER W + COMBINING DOUBLE ACUTE ACCENT) |
| e37c | Y | Ydblac = Y + combdblac | U+E37C (U+E37C = 0059 + 030B) | LATIN CAPITAL LETTER Y WITH DOUBLE ACUTE (= LATIN CAPITAL LETTER Y + COMBINING DOUBLE ACUTE ACCENT) |
| e77c | y | ydblac = y + combdblac | U+E77C (U+E77C = 0079 + 030B) | LATIN SMALL LETTER Y WITH DOUBLE ACUTE (= LATIN SMALL LETTER Y + COMBINING DOUBLE ACUTE ACCENT) |
| ebca | YY | YYligdblac = YYlig + combdblac | U+EBCA (U+EBCA = F212 + 030B) | LATIN CAPITAL LIGATURE YY WITH DOUBLE ACUTE (= LATIN CAPITAL LIGATURE YY + COMBINING DOUBLE ACUTE ACCENT) |
| ebcb | yy | yyligdblac = yylig + combdblac | U+EBCB (U+EBCB = F213 + 030B) | LATIN SMALL LIGATURE YY WITH DOUBLE ACUTE (= LATIN SMALL LIGATURE YY + COMBINING DOUBLE ACUTE ACCENT) |
7.18. Appendix R Subrange 18: Characters with dot above
Character ID | Unicode Character | Image of Character | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description
efee | | | AA | AAligdot = AAlig + combdot | U+EFEE (U+EFEE = EF90 + 0307) | LATIN CAPITAL LIGATURE AA WITH DOT ABOVE (= LATIN CAPITAL LIGATURE AA + COMBINING DOT ABOVE)
efef | | | aa | aaligdot = aalig + combdot | U+EFEF (U+EFEF = EF91 + 0307) | LATIN SMALL LIGATURE AA WITH DOT ABOVE (= LATIN SMALL LIGATURE AA + COMBINING DOT ABOVE)
e043 | | | AE | AEligdot = AElig + combdot | U+E043 (U+E043 = 00C6 + 0307) | LATIN CAPITAL LETTER AE WITH DOT ABOVE (= LATIN CAPITAL LETTER AE + COMBINING DOT ABOVE)
e443 | | | ae | aeligdot = aelig + combdot | U+E443 (U+E443 = 00E6 + 0307) | LATIN SMALL LETTER AE WITH DOT ABOVE (= LATIN SMALL LETTER AE + COMBINING DOT ABOVE)
eff0 | | | AY | AYligdot = AYlig + combdot | U+EFF0 (U+EFF0 = EF9A + 0307) | LATIN CAPITAL LIGATURE AY WITH DOT ABOVE (= LATIN CAPITAL LIGATURE AY + COMBINING DOT ABOVE)
eff1 | | | ay | ayligdot = aylig + combdot | U+EFF1 (U+EFF1 = EF9B + 0307) | LATIN SMALL LIGATURE AY WITH DOT ABOVE (= LATIN SMALL LIGATURE AY + COMBINING DOT ABOVE)
ebd0 | | | B | bscapdot = bscap + combdot | U+EBD0 (U+EBD0 = 0299 + 0307) | LATIN LETTER SMALL CAPITAL B WITH DOT ABOVE (= LATIN LETTER SMALL CAPITAL B + COMBINING DOT ABOVE)
ebd1 | | | d | drotdot = drot + combdot | U+EBD1 (U+EBD1 = F109 + 0307) | LATIN SMALL LETTER D ROTUNDA WITH DOT ABOVE (= LATIN SMALL LETTER D ROTUNDA + COMBINING DOT ABOVE)
ebd2 | | | D | dscapdot = dscap + combdot | U+EBD2 (U+EBD2 = 1D05 + 0307) | LATIN LETTER SMALL CAPITAL D WITH DOT ABOVE (= LATIN LETTER SMALL CAPITAL D + COMBINING DOT ABOVE)
ebd3 | | | F | Finsdot = Fins + combdot | U+EBD3 (U+EBD3 = F10C + 0307) | LATIN CAPITAL LETTER INSULAR F WITH DOT ABOVE (= LATIN CAPITAL LETTER INSULAR F + COMBINING DOT ABOVE)
ebd4 | | | f | finsdot = fins + combdot | U+EBD4 (U+EBD4 = F10D + 0307) | LATIN SMALL LETTER INSULAR F WITH DOT ABOVE (= LATIN SMALL LETTER INSULAR F + COMBINING DOT ABOVE)
ebd5 | | | f | finssemiclosedot = finssemiclose + combdot | U+EBD5 (U+EBD5 = F21B + 0307) | LATIN SMALL LETTER SEMI-CLOSED INSULAR F WITH DOT ABOVE (= LATIN SMALL LETTER SEMI-CLOSED INSULAR F + COMBINING DOT ABOVE)
ebd6 | | | f | finsclosedot = finsclose + combdot | U+EBD6 (U+EBD6 = F207 + 0307) | LATIN SMALL LETTER CLOSED INSULAR F WITH DOT ABOVE (= LATIN SMALL LETTER CLOSED INSULAR F + COMBINING DOT ABOVE)
ebd7 | | | F | fscapdot = fscap + combdot | U+EBD7 (U+EBD7 = EF05 + 0307) | LATIN LETTER SMALL CAPITAL F WITH DOT ABOVE (= LATIN LETTER SMALL CAPITAL F + COMBINING DOT ABOVE)
ef20 | | | G | gscapdot = gscap + combdot | U+EF20 (U+EF20 = 0262 + 0307) | LATIN LETTER SMALL CAPITAL G WITH DOT ABOVE (= LATIN LETTER SMALL CAPITAL G + COMBINING DOT ABOVE)
ebda | | | H | hscapdot = hscap + combdot | U+EBDA (U+EBDA = 029C + 0307) | LATIN LETTER SMALL CAPITAL H WITH DOT ABOVE (= LATIN LETTER SMALL CAPITAL H + COMBINING DOT ABOVE)
e15c | | | J | Jdot = J + combdot | U+E15C (U+E15C = 004A + 0307) | LATIN CAPITAL LETTER J WITH DOT ABOVE (= LATIN CAPITAL LETTER J + COMBINING DOT ABOVE)
e168 | | | K | Kdot = K + combdot | U+E168 (U+E168 = 004B + 0307) | LATIN CAPITAL LETTER K WITH DOT ABOVE (= LATIN CAPITAL LETTER K + COMBINING DOT ABOVE)
e568 | | | k | kdot = k + combdot | U+E568 (U+E568 = 006B + 0307) | LATIN SMALL LETTER K WITH DOT ABOVE (= LATIN SMALL LETTER K + COMBINING DOT ABOVE)
ebdb | | | k | kscapdot = kscap + combdot | U+EBDB (U+EBDB = 1D0B + 0307) | LATIN LETTER SMALL CAPITAL K WITH DOT ABOVE (= LATIN LETTER SMALL CAPITAL K + COMBINING DOT ABOVE)
e19e | | | L | Ldot = L + combdot | U+E19E (U+E19E = 004C + 0307) | LATIN CAPITAL LETTER L WITH DOT ABOVE (= LATIN CAPITAL LETTER L + COMBINING DOT ABOVE)
e59e | | | l | ldot = l + combdot | U+E59E (U+E59E = 006C + 0307) | LATIN SMALL LETTER L WITH DOT ABOVE (= LATIN SMALL LETTER L + COMBINING DOT ABOVE)
ebdc | | | L | lscapdot = lscap + combdot | U+EBDC (U+EBDC = 029F + 0307) | LATIN LETTER SMALL CAPITAL L WITH DOT ABOVE (= LATIN LETTER SMALL CAPITAL L + COMBINING DOT ABOVE)
ebdd | | | M | mscapdot = mscap + combdot | U+EBDD (U+EBDD = 1D0D + 0307) | LATIN LETTER SMALL CAPITAL M WITH DOT ABOVE (= LATIN LETTER SMALL CAPITAL M + COMBINING DOT ABOVE)
ef21 | | | N | nscapdot = nscap + combdot | U+EF21 (U+EF21 = 0274 + 0307) | LATIN LETTER SMALL CAPITAL N WITH DOT ABOVE (= LATIN LETTER SMALL CAPITAL N + COMBINING DOT ABOVE)
ebcd | | | O | Oslashdot = Oslash + combdot | U+EBCD (U+EBCD = 00D8 + 0307) | LATIN CAPITAL LETTER O WITH STROKE AND DOT ABOVE (= LATIN CAPITAL LETTER O WITH STROKE + COMBINING DOT ABOVE)
ebce | | | o | oslashdot = oslash + combdot | U+EBCE (U+EBCE = 00F8 + 0307) | LATIN SMALL LETTER O WITH STROKE AND DOT ABOVE (= LATIN SMALL LETTER O WITH STROKE + COMBINING DOT ABOVE)
ebcf | | | P | pscapdot = pscap + combdot | U+EBCF (U+EBCF = 1D18 + 0307) | LATIN LETTER SMALL CAPITAL P WITH DOT ABOVE (= LATIN LETTER SMALL CAPITAL P + COMBINING DOT ABOVE)
e282 | | | Q | Qdot = Q + combdot | U+E282 (U+E282 = 0051 + 0307) | LATIN CAPITAL LETTER Q WITH DOT ABOVE (= LATIN CAPITAL LETTER Q + COMBINING DOT ABOVE)
e682 | | | q | qdot = q + combdot | U+E682 (U+E682 = 0071 + 0307) | LATIN SMALL LETTER Q WITH DOT ABOVE (= LATIN SMALL LETTER Q + COMBINING DOT ABOVE)
ef22 | | | R | rscapdot = rscap + combdot | U+EF22 (U+EF22 = 0280 + 0307) | LATIN LETTER SMALL CAPITAL R WITH DOT ABOVE (= LATIN LETTER SMALL CAPITAL R + COMBINING DOT ABOVE)
ef23 | | | S | sscapdot = sscap + combdot | U+EF23 (U+EF23 = EF0E + 0307) | LATIN LETTER SMALL CAPITAL S WITH DOT ABOVE (= LATIN LETTER SMALL CAPITAL S + COMBINING DOT ABOVE)
ef24 | | | T | tscapdot = tscap + combdot | U+EF24 (U+EF24 = 1D1B + 0307) | LATIN LETTER SMALL CAPITAL T WITH DOT ABOVE (= LATIN LETTER SMALL CAPITAL T + COMBINING DOT ABOVE)
e315 | | | U | Udot = U + combdot | U+E315 (U+E315 = 0055 + 0307) | LATIN CAPITAL LETTER U WITH DOT ABOVE (= LATIN CAPITAL LETTER U + COMBINING DOT ABOVE)
e715 | | | u | udot = u + combdot | U+E715 (U+E715 = 0075 + 0307) | LATIN SMALL LETTER U WITH DOT ABOVE (= LATIN SMALL LETTER U + COMBINING DOT ABOVE)
e34c | | | V | Vdot = V + combdot | U+E34C (U+E34C = 0056 + 0307) | LATIN CAPITAL LETTER V WITH DOT ABOVE (= LATIN CAPITAL LETTER V + COMBINING DOT ABOVE)
e74c | | | v | vdot = v + combdot | U+E74C (U+E74C = 0076 + 0307) | LATIN SMALL LETTER V WITH DOT ABOVE (= LATIN SMALL LETTER V + COMBINING DOT ABOVE)
e3e7 | | | V | Vinsdot = Vins + combdot | U+E3E7 (U+E3E7 = F210 + 0307) | LATIN CAPITAL LETTER INSULAR V (VEND) WITH DOT ABOVE (= LATIN CAPITAL LETTER INSULAR V (VEND) + COMBINING DOT ABOVE)
e7e7 | | | v | vinsdot = vins + combdot | U+E7E7 (U+E7E7 = F211 + 0307) | LATIN SMALL LETTER INSULAR V (VEND) WITH DOT ABOVE (= LATIN SMALL LETTER INSULAR V (VEND) + COMBINING DOT ABOVE)
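The decompositions listed in this subrange can be applied mechanically to text that contains the PUA code points. The following is a minimal sketch in Python, assuming a hand-typed excerpt of the table; the names `DOT_ABOVE` and `decompose_pua` are hypothetical and are not part of MUFI, the TEI Guidelines or the gBank API.

```python
# Hypothetical excerpt of the dot-above subrange, keyed by MUFI PUA code point.
# Each value is the (base, combining mark) pair given in the
# "Unicode PUA Code Point" column of the table.
DOT_ABOVE = {
    0xE15C: (0x004A, 0x0307),  # Jdot = J + combdot
    0xE168: (0x004B, 0x0307),  # Kdot = K + combdot
    0xE568: (0x006B, 0x0307),  # kdot = k + combdot
    0xE315: (0x0055, 0x0307),  # Udot = U + combdot
    0xE715: (0x0075, 0x0307),  # udot = u + combdot
}


def decompose_pua(text, table=DOT_ABOVE):
    """Replace each PUA character found in `table` by its decomposed sequence."""
    pieces = []
    for ch in text:
        sequence = table.get(ord(ch))
        if sequence is None:
            pieces.append(ch)  # not in the table: leave the character untouched
        else:
            pieces.append("".join(chr(cp) for cp in sequence))
    return "".join(pieces)
```

Characters that are absent from the excerpt pass through unchanged, so the function can be run safely over mixed text.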
7.19. Appendix S Subrange 19: Characters with dot below
Character ID | Unicode Character | Image of Character | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description
eff2 | | | AA | AAligdotbl = AAlig + combdotbl | U+EFF2 (U+EFF2 = EF90 + 0323) | LATIN CAPITAL LIGATURE AA WITH DOT BELOW (= LATIN CAPITAL LIGATURE AA + COMBINING DOT BELOW)
eff3 | | | aa | aaligdotbl = aalig + combdotbl | U+EFF3 (U+EFF3 = EF91 + 0323) | LATIN SMALL LIGATURE AA WITH DOT BELOW (= LATIN SMALL LIGATURE AA + COMBINING DOT BELOW)
e036 | | | AE | AEligdotbl = AElig + combdotbl | U+E036 (U+E036 = 00C6 + 0323) | LATIN CAPITAL LETTER AE WITH DOT BELOW (= LATIN CAPITAL LETTER AE + COMBINING DOT BELOW)
e436 | | | ae | aeligdotbl = aelig + combdotbl | U+E436 (U+E436 = 00E6 + 0323) | LATIN SMALL LETTER AE WITH DOT BELOW (= LATIN SMALL LETTER AE + COMBINING DOT BELOW)
eff4 | | | AO | AOligdotbl = AOlig + combdotbl | U+EFF4 (U+EFF4 = EF92 + 0323) | LATIN CAPITAL LIGATURE AO WITH DOT BELOW (= LATIN CAPITAL LIGATURE AO + COMBINING DOT BELOW)
eff5 | | | ao | aoligdotbl = aolig + combdotbl | U+EFF5 (U+EFF5 = EF93 + 0323) | LATIN SMALL LIGATURE AO WITH DOT BELOW (= LATIN SMALL LIGATURE AO + COMBINING DOT BELOW)
eff6 | | | AU | AUligdotbl = AUlig + combdotbl | U+EFF6 (U+EFF6 = EF94 + 0323) | LATIN CAPITAL LIGATURE AU WITH DOT BELOW (= LATIN CAPITAL LIGATURE AU + COMBINING DOT BELOW)
eff7 | | | au | auligdotbl = aulig + combdotbl | U+EFF7 (U+EFF7 = EF95 + 0323) | LATIN SMALL LIGATURE AU WITH DOT BELOW (= LATIN SMALL LIGATURE AU + COMBINING DOT BELOW)
eff8 | | | AV | AVligdotbl = AVlig + combdotbl | U+EFF8 (U+EFF8 = EF96 + 0323) | LATIN CAPITAL LIGATURE AV WITH DOT BELOW (= LATIN CAPITAL LIGATURE AV + COMBINING DOT BELOW)
eff9 | | | av | avligdotbl = avlig + combdotbl | U+EFF9 (U+EFF9 = EF97 + 0323) | LATIN SMALL LIGATURE AV WITH DOT BELOW (= LATIN SMALL LIGATURE AV + COMBINING DOT BELOW)
effa | | | AY | AYligdotbl = AYlig + combdotbl | U+EFFA (U+EFFA = EF9A + 0323) | LATIN CAPITAL LIGATURE AY WITH DOT BELOW (= LATIN CAPITAL LIGATURE AY + COMBINING DOT BELOW)
effb | | | ay | ayligdotbl = aylig + combdotbl | U+EFFB (U+EFFB = EF9B + 0323) | LATIN SMALL LIGATURE AY WITH DOT BELOW (= LATIN SMALL LIGATURE AY + COMBINING DOT BELOW)
ef25 | | | B | bscapdotbl = bscap + combdotbl | U+EF25 (U+EF25 = 0299 + 0323) | LATIN LETTER SMALL CAPITAL B WITH DOT BELOW (= LATIN LETTER SMALL CAPITAL B + COMBINING DOT BELOW)
e066 | | | C | Cdotbl = C + combdotbl | U+E066 (U+E066 = 0043 + 0323) | LATIN CAPITAL LETTER C WITH DOT BELOW (= LATIN CAPITAL LETTER C + COMBINING DOT BELOW)
e466 | | | c | cdotbl = c + combdotbl | U+E466 (U+E466 = 0063 + 0323) | LATIN SMALL LETTER C WITH DOT BELOW (= LATIN SMALL LETTER C + COMBINING DOT BELOW)
ef26 | | | D | dscapdotbl = dscap + combdotbl | U+EF26 (U+EF26 = 1D05 + 0323) | LATIN LETTER SMALL CAPITAL D WITH DOT BELOW (= LATIN LETTER SMALL CAPITAL D + COMBINING DOT BELOW)
e08f | | | Ð | ETHdotbl = ETH + combdotbl | U+E08F (U+E08F = 00D0 + 0323) | LATIN CAPITAL LETTER ETH WITH DOT BELOW (= LATIN CAPITAL LETTER ETH + COMBINING DOT BELOW)
e48f | | | ð | ethdotbl = eth + combdotbl | U+E48F (U+E48F = 00F0 + 0323) | LATIN SMALL LETTER ETH WITH DOT BELOW (= LATIN SMALL LETTER ETH + COMBINING DOT BELOW)
e0ee | | | F | Fdotbl = F + combdotbl | U+E0EE (U+E0EE = 0046 + 0323) | LATIN CAPITAL LETTER F WITH DOT BELOW (= LATIN CAPITAL LETTER F + COMBINING DOT BELOW)
e4ee | | | f | fdotbl = f + combdotbl | U+E4EE (U+E4EE = 0066 + 0323) | LATIN SMALL LETTER F WITH DOT BELOW (= LATIN SMALL LETTER F + COMBINING DOT BELOW)
e3e5 | | | F | Finsdotbl = Fins + combdotbl | U+E3E5 (U+E3E5 = F10C + 0323) | LATIN CAPITAL LETTER INSULAR F WITH DOT BELOW (= LATIN CAPITAL LETTER INSULAR F + COMBINING DOT BELOW)
e7e5 | | | f | finsdotbl = fins + combdotbl | U+E7E5 (U+E7E5 = F10D + 0323) | LATIN SMALL LETTER INSULAR F WITH DOT BELOW (= LATIN SMALL LETTER INSULAR F + COMBINING DOT BELOW)
e101 | | | G | Gdotbl = G + combdotbl | U+E101 (U+E101 = 0047 + 0323) | LATIN CAPITAL LETTER G WITH DOT BELOW (= LATIN CAPITAL LETTER G + COMBINING DOT BELOW)
e501 | | | G | gdotbl = g + combdotbl | U+E501 (U+E501 = 0067 + 0323) | LATIN SMALL LETTER G WITH DOT BELOW (= LATIN SMALL LETTER G + COMBINING DOT BELOW)
ef27 | | | G | gscapdotbl = gscap + combdotbl | U+EF27 (U+EF27 = 0262 + 0323) | LATIN LETTER SMALL CAPITAL G WITH DOT BELOW (= LATIN LETTER SMALL CAPITAL G + COMBINING DOT BELOW)
e151 | | | J | Jdotbl = J + combdotbl | U+E151 (U+E151 = 004A + 0323) | LATIN CAPITAL LETTER J WITH DOT BELOW (= LATIN CAPITAL LETTER J + COMBINING DOT BELOW)
e551 | | | j | jdotbl = j + combdotbl | U+E551 (U+E551 = 006A + 0323) | LATIN SMALL LETTER J WITH DOT BELOW (= LATIN SMALL LETTER J + COMBINING DOT BELOW)
ef28 | | | L | lscapdotbl = lscap + combdotbl | U+EF28 (U+EF28 = 029F + 0323) | LATIN LETTER SMALL CAPITAL L WITH DOT BELOW (= LATIN LETTER SMALL CAPITAL L + COMBINING DOT BELOW)
ef29 | | | M | mscapdotbl = mscap + combdotbl | U+EF29 (U+EF29 = 1D0D + 0323) | LATIN LETTER SMALL CAPITAL M WITH DOT BELOW (= LATIN LETTER SMALL CAPITAL M + COMBINING DOT BELOW)
ef2a | | | N | nscapdotbl = nscap + combdotbl | U+EF2A (U+EF2A = 0274 + 0323) | LATIN LETTER SMALL CAPITAL N WITH DOT BELOW (= LATIN LETTER SMALL CAPITAL N + COMBINING DOT BELOW)
ebe0 | | | O | Oslashdotbl = Oslash + combdotbl | U+EBE0 (U+EBE0 = 00D8 + 0323) | LATIN CAPITAL LETTER O WITH STROKE AND DOT BELOW (= LATIN CAPITAL LETTER O WITH STROKE + COMBINING DOT BELOW)
ebe1 | | | o | oslashdotbl = oslash + combdotbl | U+EBE1 (U+EBE1 = 00F8 + 0323) | LATIN SMALL LETTER O WITH STROKE AND DOT BELOW (= LATIN SMALL LETTER O WITH STROKE + COMBINING DOT BELOW)
effc | | | OO | OOligdotbl = OOlig + combdotbl | U+EFFC (U+EFFC = F20A + 0323) | LATIN CAPITAL LIGATURE OO WITH DOT BELOW (= LATIN CAPITAL LIGATURE OO + COMBINING DOT BELOW)
effd | | | oo | ooligdotbl = oolig + combdotbl | U+EFFD (U+EFFD = F20B + 0323) | LATIN SMALL LIGATURE OO WITH DOT BELOW (= LATIN SMALL LIGATURE OO + COMBINING DOT BELOW)
e26d | | | P | Pdotbl = P + combdotbl | U+E26D (U+E26D = 0050 + 0323) | LATIN CAPITAL LETTER P WITH DOT BELOW (= LATIN CAPITAL LETTER P + COMBINING DOT BELOW)
e66d | | | P | pdotbl = p + combdotbl | U+E66D (U+E66D = 0070 + 0323) | LATIN SMALL LETTER P WITH DOT BELOW (= LATIN SMALL LETTER P + COMBINING DOT BELOW)
e288 | | | Q | Qdotbl = Q + combdotbl | U+E288 (U+E288 = 0051 + 0323) | LATIN CAPITAL LETTER Q WITH DOT BELOW (= LATIN CAPITAL LETTER Q + COMBINING DOT BELOW)
e688 | | | q | qdotbl = q + combdotbl | U+E688 (U+E688 = 0071 + 0323) | LATIN SMALL LETTER Q WITH DOT BELOW (= LATIN SMALL LETTER Q + COMBINING DOT BELOW)
ef2b | | | R | rscapdotbl = rscap + combdotbl | U+EF2B (U+EF2B = 0280 + 0323) | LATIN LETTER SMALL CAPITAL R WITH DOT BELOW (= LATIN LETTER SMALL CAPITAL R + COMBINING DOT BELOW)
e7c1 | | | r | rrotdotbl = rrot + combdotbl | U+E7C1 (U+E7C1 = F20E + 0323) | LATIN SMALL LETTER R ROTUNDA WITH DOT BELOW (= LATIN SMALL LETTER R ROTUNDA + COMBINING DOT BELOW)
ef2c | | | S | sscapdotbl = sscap + combdotbl | U+EF2C (U+EF2C = EF0E + 0323) | LATIN LETTER SMALL CAPITAL S WITH DOT BELOW (= LATIN LETTER SMALL CAPITAL S + COMBINING DOT BELOW)
e7c2 | | | s | slongdotbl = slong + combdotbl | U+E7C2 (U+E7C2 = 017F + 0323) | LATIN SMALL LETTER LONG S WITH DOT BELOW (= LATIN SMALL LETTER LONG S + COMBINING DOT BELOW)
ef2d | | | T | tscapdotbl = tscap + combdotbl | U+EF2D (U+EF2D = 1D1B + 0323) | LATIN LETTER SMALL CAPITAL T WITH DOT BELOW (= LATIN LETTER SMALL CAPITAL T + COMBINING DOT BELOW)
e3e6 | | | V | Vinsdotbl = Vins + combdotbl | U+E3E6 (U+E3E6 = F210 + 0323) | LATIN CAPITAL LETTER INSULAR V (VEND) WITH DOT BELOW (= LATIN CAPITAL LETTER INSULAR V (VEND) + COMBINING DOT BELOW)
e7e6 | | | v | vinsdotbl = vins + combdotbl | U+E7E6 (U+E7E6 = F211 + 0323) | LATIN SMALL LETTER INSULAR V (VEND) WITH DOT BELOW (= LATIN SMALL LETTER INSULAR V (VEND) + COMBINING DOT BELOW)
e39f | | | Þ | THORNdotbl = THORN + combdotbl | U+E39F (U+E39F = 00DE + 0323) | LATIN CAPITAL LETTER THORN WITH DOT BELOW (= LATIN CAPITAL LETTER THORN + COMBINING DOT BELOW)
e79f | | | þ | thorndotbl = thorn + combdotbl | U+E79F (U+E79F = 00FE + 0323) | LATIN SMALL LETTER THORN WITH DOT BELOW (= LATIN SMALL LETTER THORN + COMBINING DOT BELOW)
7.20. Appendix T Subrange 20: Characters with diaeresis
Character ID | Unicode Character | Image of Character | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description
effe | | | AA | AAliguml = AAlig + combuml | U+EFFE (U+EFFE = EF90 + 0308) | LATIN CAPITAL LIGATURE AA WITH DIAERESIS (= LATIN CAPITAL LIGATURE AA + COMBINING DIAERESIS)
efff | | | aa | aaliguml = aalig + combuml | U+EFFF (U+EFFF = EF91 + 0308) | LATIN SMALL LIGATURE AA WITH DIAERESIS (= LATIN SMALL LIGATURE AA + COMBINING DIAERESIS)
e042 | | | AE | AEliguml = AElig + combuml | U+E042 (U+E042 = 00C6 + 0308) | LATIN CAPITAL LETTER AE WITH DIAERESIS (= LATIN CAPITAL LETTER AE + COMBINING DIAERESIS)
e442 | | | ae | aeliguml = aelig + combuml | U+E442 (U+E442 = 00E6 + 0308) | LATIN SMALL LETTER AE WITH DIAERESIS (= LATIN SMALL LETTER AE + COMBINING DIAERESIS)
ebe2 | | | J | Juml = J + combuml | U+EBE2 (U+EBE2 = 004A + 0308) | LATIN CAPITAL LETTER J WITH DIAERESIS (= LATIN CAPITAL LETTER J + COMBINING DIAERESIS)
ebe3 | | | j | juml = j + combuml | U+EBE3 (U+EBE3 = 006A + 0308) | LATIN SMALL LETTER J WITH DIAERESIS (= LATIN SMALL LETTER J + COMBINING DIAERESIS)
ebe4 | | | OO | OOliguml = OOlig + combuml | U+EBE4 (U+EBE4 = F20A + 0308) | LATIN CAPITAL LIGATURE OO WITH DIAERESIS (= LATIN CAPITAL LIGATURE OO + COMBINING DIAERESIS)
ebe5 | | | oo | ooliguml = oolig + combuml | U+EBE5 (U+EBE5 = F20B + 0308) | LATIN SMALL LIGATURE OO WITH DIAERESIS (= LATIN SMALL LIGATURE OO + COMBINING DIAERESIS)
ebe6 | | | PP | PPliguml = PPlig + combuml | U+EBE6 (U+EBE6 = EEDD + 0308) | LATIN CAPITAL LIGATURE PP WITH DIAERESIS (= LATIN CAPITAL LIGATURE PP + COMBINING DIAERESIS)
ebe7 | | | pp | ppliguml = pplig + combuml | U+EBE7 (U+EBE7 = EED6 + 0308) | LATIN SMALL LIGATURE PP WITH DIAERESIS (= LATIN SMALL LIGATURE PP + COMBINING DIAERESIS)
e342 | | | V | Vuml = V + combuml | U+E342 (U+E342 = 0056 + 0308) | LATIN CAPITAL LETTER V WITH DIAERESIS (= LATIN CAPITAL LETTER V + COMBINING DIAERESIS)
e742 | | | v | vuml = v + combuml | U+E742 (U+E742 = 0076 + 0308) | LATIN SMALL LETTER V WITH DIAERESIS (= LATIN SMALL LETTER V + COMBINING DIAERESIS)
ebe8 | | | YY | YYliguml = YYlig + combuml | U+EBE8 (U+EBE8 = F212 + 0308) | LATIN CAPITAL LIGATURE YY WITH DIAERESIS (= LATIN CAPITAL LIGATURE YY + COMBINING DIAERESIS)
ebe9 | | | yy | yyliguml = yylig + combuml | U+EBE9 (U+EBE9 = F213 + 0308) | LATIN SMALL LIGATURE YY WITH DIAERESIS (= LATIN SMALL LIGATURE YY + COMBINING DIAERESIS)
e8d5 | | | a | adiaguml | U+E8D5 | LATIN SMALL LETTER A WITH DIAGONAL DIAERESIS
e8d7 | | | o | odiaguml | U+E8D7 | LATIN SMALL LETTER O WITH DIAGONAL DIAERESIS
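The "Unicode PUA Code Point" cells follow a regular notation, `U+XXXX = XXXX + XXXX`, which lends itself to automatic extraction. The sketch below, in Python, shows one possible way to read such a cell; the function name `parse_decomposition` and the regular expression are illustrative assumptions, not part of any ENRICH or MUFI tooling. Cells that carry no decomposition, such as those for adiaguml and odiaguml, yield `None`.

```python
import re

# Matches "U+EBE2 = 004A + 0308" and longer sums such as "U+EBF4 = 0041 + 0307 + 0301".
_DECOMPOSITION = re.compile(
    r"U\+([0-9A-Fa-f]{4,6})\s*=\s*"
    r"([0-9A-Fa-f]{4,6}(?:\s*\+\s*[0-9A-Fa-f]{4,6})*)"
)


def parse_decomposition(cell):
    """Return (pua_code_point, [component code points]) or None if absent."""
    match = _DECOMPOSITION.search(cell)
    if match is None:
        return None
    pua = int(match.group(1), 16)
    components = [int(part, 16) for part in re.split(r"\s*\+\s*", match.group(2))]
    return pua, components
```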
7.21. Appendix U Subrange 21: Characters with curl above (reversed ogonek)
Character ID | Unicode Character | Image of Character | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description
e033 | | | A | Acurl = A + combcurl | U+E033 (U+E033 = 0041 + 1DCE) | LATIN CAPITAL LETTER A WITH CURL (= LATIN CAPITAL LETTER A + COMBINING OGONEK ABOVE)
e433 | | | a | acurl = a + combcurl | U+E433 (U+E433 = 0061 + 1DCE) | LATIN SMALL LETTER A WITH CURL (= LATIN SMALL LETTER A + COMBINING OGONEK ABOVE)
ebea | | | AE | AEligcurl = AElig + combcurl | U+EBEA (U+EBEA = 00C6 + 1DCE) | LATIN CAPITAL LETTER AE WITH CURL (= LATIN CAPITAL LETTER AE + COMBINING OGONEK ABOVE)
ebeb | | | ae | aeligcurl = aelig + combcurl | U+EBEB (U+EBEB = 00E6 + 1DCE) | LATIN SMALL LETTER AE WITH CURL (= LATIN SMALL LETTER AE + COMBINING OGONEK ABOVE)
e0e9 | | | E | Ecurl = E + combcurl | U+E0E9 (U+E0E9 = 0045 + 1DCE) | LATIN CAPITAL LETTER E WITH CURL (= LATIN CAPITAL LETTER E + COMBINING OGONEK ABOVE)
e4e9 | | | e | ecurl = e + combcurl | U+E4E9 (U+E4E9 = 0065 + 1DCE) | LATIN SMALL LETTER E WITH CURL (= LATIN SMALL LETTER E + COMBINING OGONEK ABOVE)
e12a | | | I | Icurl = I + combcurl | U+E12A (U+E12A = 0049 + 1DCE) | LATIN CAPITAL LETTER I WITH CURL (= LATIN CAPITAL LETTER I + COMBINING OGONEK ABOVE)
e52a | | | i | icurl = i + combcurl | U+E52A (U+E52A = 0069 + 1DCE) | LATIN SMALL LETTER I WITH CURL (= LATIN SMALL LETTER I + COMBINING OGONEK ABOVE)
e163 | | | J | Jcurl = J + combcurl | U+E163 (U+E163 = 004A + 1DCE) | LATIN CAPITAL LETTER J WITH CURL (= LATIN CAPITAL LETTER J + COMBINING OGONEK ABOVE)
e563 | | | j | jcurl = j + combcurl | U+E563 (U+E563 = 006A + 1DCE) | LATIN SMALL LETTER J WITH CURL (= LATIN SMALL LETTER J + COMBINING OGONEK ABOVE)
e3d3 | | | O | Ocurl = O + combcurl | U+E3D3 (U+E3D3 = 004F + 1DCE) | LATIN CAPITAL LETTER O WITH CURL (= LATIN CAPITAL LETTER O + COMBINING OGONEK ABOVE)
e7d3 | | | o | ocurl = o + combcurl | U+E7D3 (U+E7D3 = 006F + 1DCE) | LATIN SMALL LETTER O WITH CURL (= LATIN SMALL LETTER O + COMBINING OGONEK ABOVE)
e3d4 | | | O | Oslashcurl = Oslash + combcurl | U+E3D4 (U+E3D4 = 00D8 + 1DCE) | LATIN CAPITAL LETTER O WITH STROKE AND CURL (= LATIN CAPITAL LETTER O WITH STROKE + COMBINING OGONEK ABOVE)
e7d4 | | | o | oslashcurl = oslash + combcurl | U+E7D4 (U+E7D4 = 00F8 + 1DCE) | LATIN SMALL LETTER O WITH STROKE AND CURL (= LATIN SMALL LETTER O WITH STROKE + COMBINING OGONEK ABOVE)
e331 | | | U | Ucurl = U + combcurl | U+E331 (U+E331 = 0055 + 1DCE) | LATIN CAPITAL LETTER U WITH CURL (= LATIN CAPITAL LETTER U + COMBINING OGONEK ABOVE)
e731 | | | u | ucurl = u + combcurl | U+E731 (U+E731 = 0075 + 1DCE) | LATIN SMALL LETTER U WITH CURL (= LATIN SMALL LETTER U + COMBINING OGONEK ABOVE)
e385 | | | Y | Ycurl = Y + combcurl | U+E385 (U+E385 = 0059 + 1DCE) | LATIN CAPITAL LETTER Y WITH CURL (= LATIN CAPITAL LETTER Y + COMBINING OGONEK ABOVE)
e785 | | | y | ycurl = y + combcurl | U+E785 (U+E785 = 0079 + 1DCE) | LATIN SMALL LETTER Y WITH CURL (= LATIN SMALL LETTER Y + COMBINING OGONEK ABOVE)
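The parenthesised part of each description is simply the sequence of standard Unicode character names for the decomposition, so it can be checked against the Unicode Character Database. A minimal sketch in Python, using the standard `unicodedata` module; the helper name `describe` and the placeholder text for unnamed code points are assumptions made for illustration.

```python
import unicodedata


def describe(code_points):
    """Join the Unicode names of a decomposition with ' + '.

    Code points without a standard name (for example PUA code points)
    are shown with a placeholder instead of raising an error.
    """
    names = [
        unicodedata.name(chr(cp), "<no standard name: U+%04X>" % cp)
        for cp in code_points
    ]
    return " + ".join(names)
```

For example, `describe([0x0041, 0x1DCE])` gives the description recorded for Acurl, which is a quick way to spot a mistyped code point in a row.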
7.22. Appendix V Subrange 22: Characters with ogonek
Character ID | Unicode Character | Image of Character | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description
e040 | | | AE | AEligogon = AElig + combogon | U+E040 (U+E040 = 00C6 + 0328) | LATIN CAPITAL LETTER AE WITH OGONEK (= LATIN CAPITAL LETTER AE + COMBINING OGONEK)
e440 | | | ae | aeligogon = aelig + combogon | U+E440 (U+E440 = 00E6 + 0328) | LATIN SMALL LETTER AE WITH OGONEK (= LATIN SMALL LETTER AE + COMBINING OGONEK)
ebf0 | | | AV | AVligogon = AVlig + combogon | U+EBF0 (U+EBF0 = EF96 + 0328) | LATIN CAPITAL LIGATURE AV WITH OGONEK (= LATIN CAPITAL LIGATURE AV + COMBINING OGONEK)
ebf1 | | | av | avligogon = avlig + combogon | U+EBF1 (U+EBF1 = EF97 + 0328) | LATIN SMALL LIGATURE AV WITH OGONEK (= LATIN SMALL LIGATURE AV + COMBINING OGONEK)
e076 | | | C | Cogon = C + combogon | U+E076 (U+E076 = 0043 + 0328) | LATIN CAPITAL LETTER C WITH OGONEK (= LATIN CAPITAL LETTER C + COMBINING OGONEK)
e476 | | | c | cogon = c + combogon | U+E476 (U+E476 = 0063 + 0328) | LATIN SMALL LETTER C WITH OGONEK (= LATIN SMALL LETTER C + COMBINING OGONEK)
e255 | | | O | Oslashogon = Oslash + combogon | U+E255 (U+E255 = 00D8 + 0328) | LATIN CAPITAL LETTER O WITH STROKE AND OGONEK (= LATIN CAPITAL LETTER O WITH STROKE + COMBINING OGONEK)
e655 | | | o | oslashogon = oslash + combogon | U+E655 (U+E655 = 00F8 + 0328) | LATIN SMALL LETTER O WITH STROKE AND OGONEK (= LATIN SMALL LETTER O WITH STROKE + COMBINING OGONEK)
e2ee | | | T | Togon = T + combogon | U+E2EE (U+E2EE = 0054 + 0328) | LATIN CAPITAL LETTER T WITH OGONEK (= LATIN CAPITAL LETTER T + COMBINING OGONEK)
e6ee | | | t | togon = t + combogon | U+E6EE (U+E6EE = 0074 + 0328) | LATIN SMALL LETTER T WITH OGONEK (= LATIN SMALL LETTER T + COMBINING OGONEK)
7.23. Appendix W Subrange 23: Characters with breve
Character ID | Unicode Character | Image of Character | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description
e03f | | | AE | AEligbreve = AElig + combbreve | U+E03F (U+E03F = 00C6 + 0306) | LATIN CAPITAL LETTER AE WITH BREVE
e43f | | | ae | aeligbreve = aelig + combbreve | U+E43F (U+E43F = 00E6 + 0306) | LATIN SMALL LETTER AE WITH BREVE
ebee | | | O | Oslashbreve = Oslash + combbreve | U+EBEE (U+EBEE = 00D8 + 0306) | LATIN CAPITAL LETTER O WITH STROKE AND BREVE
ebef | | | o | oslashbreve = oslash + combbreve | U+EBEF (U+EBEF = 00F8 + 0306) | LATIN SMALL LETTER O WITH STROKE AND BREVE (= LATIN SMALL LETTER O WITH STROKE + COMBINING BREVE)
e376 | | | Y | Ybreve = Y + combbreve | U+E376 (U+E376 = 0059 + 0306) | LATIN CAPITAL LETTER Y WITH BREVE
e776 | | | y | ybreve = y + combbreve | U+E776 (U+E776 = 0079 + 0306) | LATIN SMALL LETTER Y WITH BREVE
7.24. Appendix X Subrange 24: Characters with breve below
Character ID | Unicode Character | Image of Character | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description
e548 | | | i | ibrevinvbl = i + combbrevinvbl | U+E548 (U+E548 = 0069 + 032F) | LATIN SMALL LETTER I WITH INVERTED BREVE BELOW (= LATIN SMALL LETTER I + COMBINING INVERTED BREVE BELOW)
e727 | | | u | ubrevinvbl = u + combbrevinvbl | U+E727 (U+E727 = 0075 + 032F) | LATIN SMALL LETTER U WITH INVERTED BREVE BELOW (= LATIN SMALL LETTER U + COMBINING INVERTED BREVE BELOW)
7.25. Appendix Y Subrange 25: Characters with circumflex
Character ID | Unicode Character | Image of Character | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description
e5d7 | | | n | ncirc = n + combcirc | U+E5D7 (U+E5D7 = 006E + 0302) | LATIN SMALL LETTER N WITH CIRCUMFLEX (= LATIN SMALL LETTER N + COMBINING CIRCUMFLEX ACCENT)
e33b | | | V | Vcirc = V + combcirc | U+E33B (U+E33B = 0056 + 0302) | LATIN CAPITAL LETTER V WITH CIRCUMFLEX (= LATIN CAPITAL LETTER V + COMBINING CIRCUMFLEX ACCENT)
e73b | | | v | vcirc = v + combcirc | U+E73B (U+E73B = 0076 + 0302) | LATIN SMALL LETTER V WITH CIRCUMFLEX (= LATIN SMALL LETTER V + COMBINING CIRCUMFLEX ACCENT)
ebbd | | | ea | eacombcirc = e + a + combcircdbl | U+EBBD (U+EBBD = 0065 + 0061 + 1DCD) | LATIN SMALL LETTER EA WITH CIRCUMFLEX (= LATIN SMALL LETTER E + LATIN SMALL LETTER A + COMBINING DOUBLE CIRCUMFLEX ABOVE)
ebbe | | | eu | eucombcirc = e + u + combcircdbl | U+EBBE (U+EBBE = 0065 + 0075 + 1DCD) | LATIN SMALL LETTER EU WITH CIRCUMFLEX (= LATIN SMALL LETTER E + LATIN SMALL LETTER U + COMBINING DOUBLE CIRCUMFLEX ABOVE)
7.26. Appendix Z Subrange 26: Characters with ring above
Character ID | Unicode Character | Image of Character | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description
e8d1 | | | ae | aeligring = aelig + combring | U+E8D1 (U+E8D1 = 00E6 + 030A) | LATIN SMALL LETTER AE WITH RING ABOVE (= LATIN SMALL LETTER AE + COMBINING RING ABOVE)
e4cf | | | e | ering = e + combring | U+E4CF (U+E4CF = 0065 + 030A) | LATIN SMALL LETTER E WITH RING ABOVE (= LATIN SMALL LETTER E + COMBINING RING ABOVE)
e637 | | | o | oring = o + combring | U+E637 (U+E637 = 006F + 030A) | LATIN SMALL LETTER O WITH RING ABOVE (= LATIN SMALL LETTER O + COMBINING RING ABOVE)
e743 | | | v | vring = v + combring | U+E743 (U+E743 = 0076 + 030A) | LATIN SMALL LETTER V WITH RING ABOVE (= LATIN SMALL LETTER V + COMBINING RING ABOVE)
7.27. Appendix AA Subrange 27: Characters with ring below
Character ID | Unicode Character | Image of Character | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description
e5a4 | | | l | lringbl = l + combringbl | U+E5A4 (U+E5A4 = 006C + 0325) | LATIN SMALL LETTER L WITH RING BELOW (= LATIN SMALL LETTER L + COMBINING RING BELOW)
e5c5 | | | m | mringbl = m + combringbl | U+E5C5 (U+E5C5 = 006D + 0325) | LATIN SMALL LETTER M WITH RING BELOW (= LATIN SMALL LETTER M + COMBINING RING BELOW)
e5ee | | | n | nringbl = n + combringbl | U+E5EE (U+E5EE = 006E + 0325) | LATIN SMALL LETTER N WITH RING BELOW (= LATIN SMALL LETTER N + COMBINING RING BELOW)
e6a3 | | | r | rringbl = r + combringbl | U+E6A3 (U+E6A3 = 0072 + 0325) | LATIN SMALL LETTER R WITH RING BELOW (= LATIN SMALL LETTER R + COMBINING RING BELOW)
7.28. Appendix AB Subrange 28: Characters with tilde
Character ID | Unicode Character | Image of Character | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description
e68b | | | q | qbardestilde = qbardes + combtilde | U+E68B (U+E68B = A757 + 0303) | LATIN SMALL LETTER Q WITH STROKE THROUGH DESCENDER AND TILDE (= LATIN SMALL LETTER Q WITH STROKE THROUGH DESCENDER + COMBINING TILDE)
7.29. Appendix AC Subrange 29: Characters with curly bar above
Character ID | Unicode Character | Image of Character | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description
ebbf | | | u | ucurlbar = u + combcurlbar | U+EBBF (U+EBBF = 0075 + F1CC) | LATIN SMALL LETTER U WITH CURLY BAR ABOVE (= LATIN SMALL LETTER U + COMBINING CURLY BAR ABOVE)
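In this subrange the decomposition does not leave the Private Use Area, because the combining mark F1CC is itself a PUA code point. A short Python sketch of how such cases might be detected, using the general category `Co` reported by the standard `unicodedata` module; the function names `is_private_use` and `is_fully_standard` are hypothetical.

```python
import unicodedata


def is_private_use(code_point):
    """True if the code point has general category Co (private use)."""
    return unicodedata.category(chr(code_point)) == "Co"


def is_fully_standard(code_points):
    """True only if no component of the decomposition is a PUA code point."""
    return not any(is_private_use(cp) for cp in code_points)
```

A decomposition for which `is_fully_standard` returns `False` still needs a font or glyph description that covers the PUA component.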
7.30. Appendix AD Subrange 30: Characters with vertical bar above
Character ID | Unicode Character | Image of Character | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description
e324 | | | U | Uvertline = U + combvertline | U+E324 (U+E324 = 0055 + 030D) | LATIN CAPITAL LETTER U WITH VERTICAL LINE ABOVE (= LATIN CAPITAL LETTER U + COMBINING VERTICAL LINE ABOVE)
e724 | | | u | uvertline = u + combvertline | U+E724 (U+E724 = 0075 + 030D) | LATIN SMALL LETTER U WITH VERTICAL LINE ABOVE (= LATIN SMALL LETTER U + COMBINING VERTICAL LINE ABOVE)
7.31. Appendix AE Subrange 31: Characters with superscript letters
Character ID | Unicode Character | Image of Character | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description
e02c | | | Ae | Aesup = A + esup | U+E02C (U+E02C = 0041 + 0364) | LATIN CAPITAL LETTER A WITH LATIN SMALL LETTER E ABOVE (= LATIN CAPITAL LETTER A + COMBINING LATIN SMALL LETTER E)
e42c | | | ae | aesup = a + esup | U+E42C (U+E42C = 0061 + 0364) | LATIN SMALL LETTER A WITH LATIN SMALL LETTER E ABOVE (= LATIN SMALL LETTER A + COMBINING LATIN SMALL LETTER E)
e8e0 | | | ai | aisup = a + isup | U+E8E0 (U+E8E0 = 0061 + 0365) | LATIN SMALL LETTER A WITH LATIN SMALL LETTER I ABOVE (= LATIN SMALL LETTER A + COMBINING LATIN SMALL LETTER I)
e42d | | | ao | aosup = a + osup | U+E42D (U+E42D = 0061 + 0366) | LATIN SMALL LETTER A WITH LATIN SMALL LETTER O ABOVE (= LATIN SMALL LETTER A + COMBINING LATIN SMALL LETTER O)
e8e1 | | | au | ausup = a + usup | U+E8E1 (U+E8E1 = 0061 + 0367) | LATIN SMALL LETTER A WITH LATIN SMALL LETTER U ABOVE (= LATIN SMALL LETTER A + COMBINING LATIN SMALL LETTER U)
e42e | | | av | avsup = a + vsup | U+E42E (U+E42E = 0061 + 036E) | LATIN SMALL LETTER A WITH LATIN SMALL LETTER V ABOVE (= LATIN SMALL LETTER A + COMBINING LATIN SMALL LETTER V)
e0e1 | | | Ea | Easup = E + asup | U+E0E1 (U+E0E1 = 0045 + 0363) | LATIN CAPITAL LETTER E WITH LATIN SMALL LETTER A ABOVE (= LATIN CAPITAL LETTER E + COMBINING LATIN SMALL LETTER A)
e4e1 | | | ea | easup = e + asup | U+E4E1 (U+E4E1 = 0065 + 0363) | LATIN SMALL LETTER E WITH LATIN SMALL LETTER A ABOVE (= LATIN SMALL LETTER E + COMBINING LATIN SMALL LETTER A)
e8e2 | | | ee | eesup = e + esup | U+E8E2 (U+E8E2 = 0065 + 0364) | LATIN SMALL LETTER E WITH LATIN SMALL LETTER E ABOVE (= LATIN SMALL LETTER E + COMBINING LATIN SMALL LETTER E)
e4e2 | | | ei | eisup = e + isup | U+E4E2 (U+E4E2 = 0065 + 0365) | LATIN SMALL LETTER E WITH LATIN SMALL LETTER I ABOVE (= LATIN SMALL LETTER E + COMBINING LATIN SMALL LETTER I)
e8e3 | | | eo | eosup = e + osup | U+E8E3 (U+E8E3 = 0065 + 0366) | LATIN SMALL LETTER E WITH LATIN SMALL LETTER O ABOVE (= LATIN SMALL LETTER E + COMBINING LATIN SMALL LETTER O)
e4e3 | | | ev | evsup = e + vsup | U+E4E3 (U+E4E3 = 0065 + 036E) | LATIN SMALL LETTER E WITH LATIN SMALL LETTER V ABOVE (= LATIN SMALL LETTER E + COMBINING LATIN SMALL LETTER V)
e8e4 | | | ia | iasup = i + asup | U+E8E4 (U+E8E4 = 0069 + 0363) | LATIN SMALL LETTER I WITH LATIN SMALL LETTER A ABOVE (= LATIN SMALL LETTER I + COMBINING LATIN SMALL LETTER A)
e54a | | | ie | iesup = i + esup | U+E54A (U+E54A = 0069 + 0364) | LATIN SMALL LETTER I WITH LATIN SMALL LETTER E ABOVE (= LATIN SMALL LETTER I + COMBINING LATIN SMALL LETTER E)
e8e5 | | | io | iosup = i + osup | U+E8E5 (U+E8E5 = 0069 + 0366) | LATIN SMALL LETTER I WITH LATIN SMALL LETTER O ABOVE (= LATIN SMALL LETTER I + COMBINING LATIN SMALL LETTER O)
e8e6 | | | iu | iusup = i + usup | U+E8E6 (U+E8E6 = 0069 + 0367) | LATIN SMALL LETTER I WITH LATIN SMALL LETTER U ABOVE (= LATIN SMALL LETTER I + COMBINING LATIN SMALL LETTER U)
e54b | | | iv | ivsup = i + vsup | U+E54B (U+E54B = 0069 + 036E) | LATIN SMALL LETTER I WITH LATIN SMALL LETTER V ABOVE (= LATIN SMALL LETTER I + COMBINING LATIN SMALL LETTER V)
e8e7 | | | je | jesup = j + esup | U+E8E7 (U+E8E7 = 006A + 0364) | LATIN SMALL LETTER J WITH LATIN SMALL LETTER E ABOVE (= LATIN SMALL LETTER J + COMBINING LATIN SMALL LETTER E)
e8e8 | | | me | mesup = m + esup | U+E8E8 (U+E8E8 = 006D + 0364) | LATIN SMALL LETTER M WITH LATIN SMALL LETTER E ABOVE (= LATIN SMALL LETTER M + COMBINING LATIN SMALL LETTER E)
e643 | | | oa | oasup = o + asup | U+E643 (U+E643 = 006F + 0363) | LATIN SMALL LETTER O WITH LATIN SMALL LETTER A ABOVE (= LATIN SMALL LETTER O + COMBINING LATIN SMALL LETTER A)
e244 | | | Oe | Oesup = O + esup | U+E244 (U+E244 = 004F + 0364) | LATIN CAPITAL LETTER O WITH LATIN SMALL LETTER E ABOVE (= LATIN CAPITAL LETTER O + COMBINING LATIN SMALL LETTER E)
e644 | | | oe | oesup = o + esup | U+E644 (U+E644 = 006F + 0364) | LATIN SMALL LETTER O WITH LATIN SMALL LETTER E ABOVE (= LATIN SMALL LETTER O + COMBINING LATIN SMALL LETTER E)
e645 | | | oi | oisup = o + isup | U+E645 (U+E645 = 006F + 0365) | LATIN SMALL LETTER O WITH LATIN SMALL LETTER I ABOVE (= LATIN SMALL LETTER O + COMBINING LATIN SMALL LETTER I)
e8e9 | | | oo | oosup = o + osup | U+E8E9 (U+E8E9 = 006F + 0366) | LATIN SMALL LETTER O WITH LATIN SMALL LETTER O ABOVE (= LATIN SMALL LETTER O + COMBINING LATIN SMALL LETTER O)
e246 | | | Ou | Ousup = O + usup | U+E246 (U+E246 = 004F + 0367) | LATIN CAPITAL LETTER O WITH LATIN SMALL LETTER U ABOVE (= LATIN CAPITAL LETTER O + COMBINING LATIN SMALL LETTER U)
e646 | | | ou | ousup = o + usup | U+E646 (U+E646 = 006F + 0367) | LATIN SMALL LETTER O WITH LATIN SMALL LETTER U ABOVE (= LATIN SMALL LETTER O + COMBINING LATIN SMALL LETTER U)
e647 | | | ov | ovsup = o + vsup | U+E647 (U+E647 = 006F + 036E) | LATIN SMALL LETTER O WITH LATIN SMALL LETTER V ABOVE (= LATIN SMALL LETTER O + COMBINING LATIN SMALL LETTER V)
e8ea | | | re | resup = r + esup | U+E8EA (U+E8EA = 0072 + 0364) | LATIN SMALL LETTER R WITH LATIN SMALL LETTER E ABOVE (= LATIN SMALL LETTER R + COMBINING LATIN SMALL LETTER E)
e8eb | | | ua | uasup = u + asup | U+E8EB (U+E8EB = 0075 + 0363) | LATIN SMALL LETTER U WITH LATIN SMALL LETTER A ABOVE (= LATIN SMALL LETTER U + COMBINING LATIN SMALL LETTER A)
e32b | | | Ue | Uesup = U + esup | U+E32B (U+E32B = 0055 + 0364) | LATIN CAPITAL LETTER U WITH LATIN SMALL LETTER E ABOVE (= LATIN CAPITAL LETTER U + COMBINING LATIN SMALL LETTER E)
e72b | | | ue | uesup = u + esup | U+E72B (U+E72B = 0075 + 0364) | LATIN SMALL LETTER U WITH LATIN SMALL LETTER E ABOVE (= LATIN SMALL LETTER U + COMBINING LATIN SMALL LETTER E)
e72c | | | ui | uisup = u + isup | U+E72C (U+E72C = 0075 + 0365) | LATIN SMALL LETTER U WITH LATIN SMALL LETTER I ABOVE (= LATIN SMALL LETTER U + COMBINING LATIN SMALL LETTER I)
e32d | | | Uo | Uosup = U + osup | U+E32D (U+E32D = 0055 + 0366) | LATIN CAPITAL LETTER U WITH LATIN SMALL LETTER O ABOVE (= LATIN CAPITAL LETTER U + COMBINING LATIN SMALL LETTER O)
e72d | | | uo | uosup = u + osup | U+E72D (U+E72D = 0075 + 0366) | LATIN SMALL LETTER U WITH LATIN SMALL LETTER O ABOVE (= LATIN SMALL LETTER U + COMBINING LATIN SMALL LETTER O)
e8ec | | | uv | uvsup = u + vsup | U+E8EC (U+E8EC = 0075 + 036E) | LATIN SMALL LETTER U WITH LATIN SMALL LETTER V ABOVE (= LATIN SMALL LETTER U + COMBINING LATIN SMALL LETTER V)
e8ed | | | uw | uwsup = u + wsup | U+E8ED (U+E8ED = 0075 + F03C) | LATIN SMALL LETTER U WITH LATIN SMALL LETTER W ABOVE (= LATIN SMALL LETTER U + COMBINING LATIN SMALL LETTER W)
e781 | | | ye | yesup = y + esup | U+E781 (U+E781 = 0079 + 0364) | LATIN SMALL LETTER Y WITH LATIN SMALL LETTER E ABOVE (= LATIN SMALL LETTER Y + COMBINING LATIN SMALL LETTER E)
e8f0 | | | wa | wasup = w + asup | U+E8F0 (U+E8F0 = 0077 + 0363) | LATIN SMALL LETTER W WITH LATIN SMALL LETTER A ABOVE (= LATIN SMALL LETTER W + COMBINING LATIN SMALL LETTER A)
e353 | | | We | Wesup = W + esup | U+E353 (U+E353 = 0057 + 0364) | LATIN CAPITAL LETTER W WITH LATIN SMALL LETTER E ABOVE (= LATIN CAPITAL LETTER W + COMBINING LATIN SMALL LETTER E)
e753 | | | we | wesup = w + esup | U+E753 (U+E753 = 0077 + 0364) | LATIN SMALL LETTER W WITH LATIN SMALL LETTER E ABOVE (= LATIN SMALL LETTER W + COMBINING LATIN SMALL LETTER E)
e8f1 | | | wi | wisup = w + isup | U+E8F1 (U+E8F1 = 0077 + 0365) | LATIN SMALL LETTER W WITH LATIN SMALL LETTER I ABOVE (= LATIN SMALL LETTER W + COMBINING LATIN SMALL LETTER I)
e754 | | | wo | wosup = w + osup | U+E754 (U+E754 = 0077 + 0366) | LATIN SMALL LETTER W WITH LATIN SMALL LETTER O ABOVE (= LATIN SMALL LETTER W + COMBINING LATIN SMALL LETTER O)
e8f2 | | | wu | wusup = w + usup | U+E8F2 (U+E8F2 = 0077 + 0367) | LATIN SMALL LETTER W WITH LATIN SMALL LETTER U ABOVE (= LATIN SMALL LETTER W + COMBINING LATIN SMALL LETTER U)
e8f3 | | | wv | wvsup = w + vsup | U+E8F3 (U+E8F3 = 0077 + 036E) | LATIN SMALL LETTER W WITH LATIN SMALL LETTER V ABOVE (= LATIN SMALL LETTER W + COMBINING LATIN SMALL LETTER V)
7.32. Appendix AF Subrange 32: Characters with acute accent and dot above
Character ID | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description
ebf4 | A | Adotacute = A + combdot + combacute | U+EBF4 (U+EBF4 = 0041 + 0307 + 0301) | LATIN CAPITAL LETTER A WITH DOT ABOVE AND ACUTE (= LATIN CAPITAL LETTER A + COMBINING DOT ABOVE + COMBINING ACUTE ACCENT)
ebf5 | a | adotacute = a + combdot + combacute | U+EBF5 (U+EBF5 = 0061 + 0307 + 0301) | LATIN SMALL LETTER A WITH DOT ABOVE AND ACUTE (= LATIN SMALL LETTER A + COMBINING DOT ABOVE + COMBINING ACUTE ACCENT)
e0c8 | E | Edotacute = E + combdot + combacute | U+E0C8 (U+E0C8 = 0045 + 0307 + 0301) | LATIN CAPITAL LETTER E WITH DOT ABOVE AND ACUTE (= LATIN CAPITAL LETTER E + COMBINING DOT ABOVE + COMBINING ACUTE ACCENT)
e4c8 | e | edotacute = e + combdot + combacute | U+E4C8 (U+E4C8 = 0065 + 0307 + 0301) | LATIN SMALL LETTER E WITH DOT ABOVE AND ACUTE (= LATIN SMALL LETTER E + COMBINING DOT ABOVE + COMBINING ACUTE ACCENT)
ebf6 | I | Idotacute = I + combdot + combacute | U+EBF6 (U+EBF6 = 0049 + 0307 + 0301) | LATIN CAPITAL LETTER I WITH DOT ABOVE AND ACUTE (= LATIN CAPITAL LETTER I + COMBINING DOT ABOVE + COMBINING ACUTE ACCENT)
ebf7 | i | idotacute = i + combdot + combacute | U+EBF7 (U+EBF7 = 0069 + 0307 + 0301) | LATIN SMALL LETTER I WITH DOT ABOVE AND ACUTE (= LATIN SMALL LETTER I + COMBINING DOT ABOVE + COMBINING ACUTE ACCENT)
ebf8 | O | Odotacute = O + combdot + combacute | U+EBF8 (U+EBF8 = 004F + 0307 + 0301) | LATIN CAPITAL LETTER O WITH DOT ABOVE AND ACUTE (= LATIN CAPITAL LETTER O + COMBINING DOT ABOVE + COMBINING ACUTE ACCENT)
ebf9 | o | odotacute = o + combdot + combacute | U+EBF9 (U+EBF9 = 006F + 0307 + 0301) | LATIN SMALL LETTER O WITH DOT ABOVE AND ACUTE (= LATIN SMALL LETTER O + COMBINING DOT ABOVE + COMBINING ACUTE ACCENT)
ebfc | O | Oslashdotacute = Oslash + combdot + combacute | U+EBFC (U+EBFC = 00D8 + 0307 + 0301) | LATIN CAPITAL LETTER O WITH STROKE AND DOT ABOVE AND ACUTE (= LATIN CAPITAL LETTER O WITH STROKE + COMBINING DOT ABOVE + COMBINING ACUTE ACCENT)
ebfd | o | oslashdotacute = oslash + combdot + combacute | U+EBFD (U+EBFD = 00F8 + 0307 + 0301) | LATIN SMALL LETTER O WITH STROKE AND DOT ABOVE AND ACUTE (= LATIN SMALL LETTER O WITH STROKE + COMBINING DOT ABOVE + COMBINING ACUTE ACCENT)
ebfe | U | Udotacute = U + combdot + combacute | U+EBFE (U+EBFE = 0055 + 0307 + 0301) | LATIN CAPITAL LETTER U WITH DOT ABOVE AND ACUTE (= LATIN CAPITAL LETTER U + COMBINING DOT ABOVE + COMBINING ACUTE ACCENT)
ebff | u | udotacute = u + combdot + combacute | U+EBFF (U+EBFF = 0075 + 0307 + 0301) | LATIN SMALL LETTER U WITH DOT ABOVE AND ACUTE (= LATIN SMALL LETTER U + COMBINING DOT ABOVE + COMBINING ACUTE ACCENT)
e384 | Y | Ydotacute = Y + combdot + combacute | U+E384 (U+E384 = 0059 + 0307 + 0301) | LATIN CAPITAL LETTER Y WITH DOT ABOVE AND ACUTE (= LATIN CAPITAL LETTER Y + COMBINING DOT ABOVE + COMBINING ACUTE ACCENT)
e784 | y | ydotacute = y + combdot + combacute | U+E784 (U+E784 = 0079 + 0307 + 0301) | LATIN SMALL LETTER Y WITH DOT ABOVE AND ACUTE (= LATIN SMALL LETTER Y + COMBINING DOT ABOVE + COMBINING ACUTE ACCENT)
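The decompositions listed for this subrange can be applied mechanically. The following is a minimal sketch in Python, not part of MUFI or the ENRICH tooling: the names `PUA_TO_SEQUENCE`, `replace_pua` and `remaining_pua` are hypothetical, and only a few rows of the table are copied in for illustration.

```python
# Illustrative subset of the table: MUFI PUA code point -> standard sequence.
# The dictionary and function names are assumptions made for this sketch.
PUA_TO_SEQUENCE = {
    "\uebf4": "\u0041\u0307\u0301",  # Adotacute = A + combdot + combacute
    "\uebf5": "\u0061\u0307\u0301",  # adotacute = a + combdot + combacute
    "\ue0c8": "\u0045\u0307\u0301",  # Edotacute = E + combdot + combacute
    "\ue4c8": "\u0065\u0307\u0301",  # edotacute = e + combdot + combacute
}


def replace_pua(text):
    """Replace every known PUA character with its base + combining sequence."""
    return "".join(PUA_TO_SEQUENCE.get(ch, ch) for ch in text)


def remaining_pua(text):
    """List code points still in the BMP Private Use Area (U+E000..U+F8FF)."""
    return ["U+%04X" % ord(ch) for ch in text if 0xE000 <= ord(ch) <= 0xF8FF]
```

Running `remaining_pua` on the output of `replace_pua` gives a quick report of characters that have no entry in the mapping and therefore still need documentation or a fallback.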
7.33. Appendix AG Subrange 33: Characters with acute accent and dot below
Character ID | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description
e498 | e | edotblacute = e + combdotbl + combacute | U+E498 (U+E498 = 0065 + 0323 + 0301) | LATIN SMALL LETTER E WITH DOT BELOW AND ACUTE (= LATIN SMALL LETTER E + COMBINING DOT BELOW + COMBINING ACUTE ACCENT)
7.34. Appendix AH Subrange 34: Characters with acute accent and diaeresis
Character ID | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description
e62c | o | oumlacute = o + combuml + combacute | U+E62C (U+E62C = 006F + 0308 + 0301) | LATIN SMALL LETTER O WITH DIAERESIS AND ACUTE (= LATIN SMALL LETTER O + COMBINING DIAERESIS + COMBINING ACUTE ACCENT)
7.35. Appendix AI Subrange 35: Characters with acute accent and curl above (reversed ogonek)
Character ID | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description
ebb7 | O | Ocurlacute = O + combcurl + combacute | U+EBB7 (U+EBB7 = 004F + 1DCE + 0301) | LATIN CAPITAL LETTER O WITH CURL AND ACUTE (= LATIN CAPITAL LETTER O + COMBINING OGONEK ABOVE + COMBINING ACUTE ACCENT)
ebb8 | o | ocurlacute = o + combcurl + combacute | U+EBB8 (U+EBB8 = 006F + 1DCE + 0301) | LATIN SMALL LETTER O WITH CURL AND ACUTE (= LATIN SMALL LETTER O + COMBINING OGONEK ABOVE + COMBINING ACUTE ACCENT)
7.36. Appendix AJ Subrange 36: Characters with acute accent and ogonek
Character ID | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description
e004 | A | Aogonacute = A + combogon + combacute | U+E004 (U+E004 = 0041 + 0328 + 0301) | LATIN CAPITAL LETTER A WITH OGONEK AND ACUTE (= LATIN CAPITAL LETTER A + COMBINING OGONEK + COMBINING ACUTE ACCENT)
e404 | a | aogonacute = a + combogon + combacute | U+E404 (U+E404 = 0061 + 0328 + 0301) | LATIN SMALL LETTER A WITH OGONEK AND ACUTE (= LATIN SMALL LETTER A + COMBINING OGONEK + COMBINING ACUTE ACCENT)
e8d3 | ae | aeligogonacute = aelig + combogon + combacute | U+E8D3 (U+E8D3 = 00E6 + 0328 + 0301) | LATIN SMALL LETTER AE WITH OGONEK AND ACUTE (= LATIN SMALL LETTER AE + COMBINING OGONEK + COMBINING ACUTE ACCENT)
e099 | E | Eogonacute = E + combogon + combacute | U+E099 (U+E099 = 0045 + 0328 + 0301) | LATIN CAPITAL LETTER E WITH OGONEK AND ACUTE (= LATIN CAPITAL LETTER E + COMBINING OGONEK + COMBINING ACUTE ACCENT)
e499 | e | eogonacute = e + combogon + combacute | U+E499 (U+E499 = 0065 + 0328 + 0301) | LATIN SMALL LETTER E WITH OGONEK AND ACUTE (= LATIN SMALL LETTER E + COMBINING OGONEK + COMBINING ACUTE ACCENT)
e20c | O | Oogonacute = O + combogon + combacute | U+E20C (U+E20C = 004F + 0328 + 0301) | LATIN CAPITAL LETTER O WITH OGONEK AND ACUTE (= LATIN CAPITAL LETTER O + COMBINING OGONEK + COMBINING ACUTE ACCENT)
e60c | o | oogonacute = o + combogon + combacute | U+E60C (U+E60C = 006F + 0328 + 0301) | LATIN SMALL LETTER O WITH OGONEK AND ACUTE (= LATIN SMALL LETTER O + COMBINING OGONEK + COMBINING ACUTE ACCENT)
e257 | O | Oslashogonacute = Oslash + combogon + combacute | U+E257 (U+E257 = 00D8 + 0328 + 0301) | LATIN CAPITAL LETTER O WITH STROKE AND OGONEK AND ACUTE (= LATIN CAPITAL LETTER O WITH STROKE + COMBINING OGONEK + COMBINING ACUTE ACCENT)
e657 | o | oslashogonacute = oslash + combogon + combacute | U+E657 (U+E657 = 00F8 + 0328 + 0301) | LATIN SMALL LETTER O WITH STROKE AND OGONEK AND ACUTE (= LATIN SMALL LETTER O WITH STROKE + COMBINING OGONEK + COMBINING ACUTE ACCENT)
7.37. Appendix AK Subrange 37: Characters with double acute accent and ogonek
Character ID | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description
e0ea | E | Eogondblac = E + combogon + combdblac | U+E0EA (U+E0EA = 0045 + 0328 + 030B) | LATIN CAPITAL LETTER E WITH OGONEK AND DOUBLE ACUTE (= LATIN CAPITAL LETTER E + COMBINING OGONEK + COMBINING DOUBLE ACUTE ACCENT)
e4ea | e | eogondblac = e + combogon + combdblac | U+E4EA (U+E4EA = 0065 + 0328 + 030B) | LATIN SMALL LETTER E WITH OGONEK AND DOUBLE ACUTE (= LATIN SMALL LETTER E + COMBINING OGONEK + COMBINING DOUBLE ACUTE ACCENT)
ebc4 | O | Oogondblac = O + combogon + combdblac | U+EBC4 (U+EBC4 = 004F + 0328 + 030B) | LATIN CAPITAL LETTER O WITH OGONEK AND DOUBLE ACUTE (= LATIN CAPITAL LETTER O + COMBINING OGONEK + COMBINING DOUBLE ACUTE ACCENT)
ebc5 | o | oogondblac = o + combogon + combdblac | U+EBC5 (U+EBC5 = 006F + 0328 + 030B) | LATIN SMALL LETTER O WITH OGONEK AND DOUBLE ACUTE (= LATIN SMALL LETTER O + COMBINING OGONEK + COMBINING DOUBLE ACUTE ACCENT)
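Because each row states its decomposition twice, once as code points and once as character names, the two can be cross-checked automatically. The sketch below is an illustration only, assuming a hypothetical helper `describe_sequence` that takes the hexadecimal sequence as written in the code point column and returns the standard Unicode names, using Python's standard `unicodedata` module.

```python
import unicodedata


def describe_sequence(spec):
    """Return Unicode names for a sequence written like '0045 + 0328 + 030B'.

    Code points without a name in the Unicode database (for example PUA
    code points) are reported as '<unnamed>'.
    """
    names = []
    for part in spec.split("+"):
        code_point = int(part.strip(), 16)  # tolerant of uneven spacing
        names.append(unicodedata.name(chr(code_point), "<unnamed>"))
    return names
```

Comparing the returned names with the Description column is a cheap way to spot transcription slips in a hand-maintained table.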
7.38. Appendix AL Subrange 38: Characters with dot above and ogonek
Character ID | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description
e0eb | E | Eogondot = E + combogon + combdot | U+E0EB (U+E0EB = 0045 + 0328 + 0307) | LATIN CAPITAL LETTER E WITH OGONEK AND DOT ABOVE (= LATIN CAPITAL LETTER E + COMBINING OGONEK + COMBINING DOT ABOVE)
e4eb | e | eogondot = e + combogon + combdot | U+E4EB (U+E4EB = 0065 + 0328 + 0307) | LATIN SMALL LETTER E WITH OGONEK AND DOT ABOVE (= LATIN SMALL LETTER E + COMBINING OGONEK + COMBINING DOT ABOVE)
ebde | O | Oogondot = O + combogon + combdot | U+EBDE (U+EBDE = 004F + 0328 + 0307) | LATIN CAPITAL LETTER O WITH OGONEK AND DOT ABOVE (= LATIN CAPITAL LETTER O + COMBINING OGONEK + COMBINING DOT ABOVE)
ebdf | o | oogondot = o + combogon + combdot | U+EBDF (U+EBDF = 006F + 0328 + 0307) | LATIN SMALL LETTER O WITH OGONEK AND DOT ABOVE (= LATIN SMALL LETTER O + COMBINING OGONEK + COMBINING DOT ABOVE)
7.39. Appendix AM Subrange 39: Characters with dot below and ogonek
Character ID | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description
e0e8 | E | Eogondotbl = E + combogon + combdotbl | U+E0E8 (U+E0E8 = 0045 + 0328 + 0323) | LATIN CAPITAL LETTER E WITH OGONEK AND DOT BELOW (= LATIN CAPITAL LETTER E + COMBINING OGONEK + COMBINING DOT BELOW)
e4e8 | e | eogondotbl = e + combogon + combdotbl | U+E4E8 (U+E4E8 = 0065 + 0328 + 0323) | LATIN SMALL LETTER E WITH OGONEK AND DOT BELOW (= LATIN SMALL LETTER E + COMBINING OGONEK + COMBINING DOT BELOW)
e208 | O | Oogondotbl = O + combogon + combdotbl | U+E208 (U+E208 = 004F + 0328 + 0323) | LATIN CAPITAL LETTER O WITH OGONEK AND DOT BELOW (= LATIN CAPITAL LETTER O + COMBINING OGONEK + COMBINING DOT BELOW)
e608 | o | oogondotbl = o + combogon + combdotbl | U+E608 (U+E608 = 006F + 0328 + 0323) | LATIN SMALL LETTER O WITH OGONEK AND DOT BELOW (= LATIN SMALL LETTER O + COMBINING OGONEK + COMBINING DOT BELOW)
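When the two combining marks in a sequence belong to different canonical combining classes, as ogonek (class 202) and dot below (class 220) do, Unicode normalization sorts them into a fixed order, so the order in which they were typed does not affect comparison after normalization. The following sketch, using Python's standard `unicodedata` module, illustrates this; the variable and function names are chosen for the example only.

```python
import unicodedata

OGONEK = "\u0328"     # COMBINING OGONEK, canonical combining class 202
DOT_BELOW = "\u0323"  # COMBINING DOT BELOW, canonical combining class 220


def canonical(text):
    """Return the canonically decomposed and reordered form (NFD)."""
    return unicodedata.normalize("NFD", text)


# Order used in the table: base letter, ogonek, dot below.
table_order = "E" + OGONEK + DOT_BELOW
# The same marks entered in the opposite order.
typed_order = "E" + DOT_BELOW + OGONEK

# Both spellings normalize to the same sequence.
same_after_nfd = canonical(table_order) == canonical(typed_order)
```

Marks that share a combining class, such as dot above and acute accent (both class 230), are not reordered, so for those sequences the order given in the tables has to be preserved.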
7.40. Appendix AN Subrange 40: Characters with diaeresis and macron
Character ID | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description
e4cd | e | eumlmacr = e + combuml + combmacr | U+E4CD (U+E4CD = 0065 + 0308 + 0304) | LATIN SMALL LETTER E WITH DIAERESIS AND MACRON (= LATIN SMALL LETTER E + COMBINING DIAERESIS + COMBINING MACRON)
7.41. Appendix AO Subrange 41: Characters with diaeresis and circumflex
Character ID | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description
e41a | a | aumlcirc = a + combuml + combcirc | U+E41A (U+E41A = 0061 + 0308 + 0302) | LATIN SMALL LETTER A WITH DIAERESIS AND CIRCUMFLEX (= LATIN SMALL LETTER A + COMBINING DIAERESIS + COMBINING CIRCUMFLEX ACCENT)
e22d | O | Oumlcirc = O + combuml + combcirc | U+E22D (U+E22D = 004F + 0308 + 0302) | LATIN CAPITAL LETTER O WITH DIAERESIS AND CIRCUMFLEX (= LATIN CAPITAL LETTER O + COMBINING DIAERESIS + COMBINING CIRCUMFLEX ACCENT)
e62d | o | oumlcirc = o + combuml + combcirc | U+E62D (U+E62D = 006F + 0308 + 0302) | LATIN SMALL LETTER O WITH DIAERESIS AND CIRCUMFLEX (= LATIN SMALL LETTER O + COMBINING DIAERESIS + COMBINING CIRCUMFLEX ACCENT)
e317 | U | Uumlcirc = U + combuml + combcirc | U+E317 (U+E317 = 0055 + 0308 + 0302) | LATIN CAPITAL LETTER U WITH DIAERESIS AND CIRCUMFLEX (= LATIN CAPITAL LETTER U + COMBINING DIAERESIS + COMBINING CIRCUMFLEX ACCENT)
e717 | u | uumlcirc = u + combuml + combcirc | U+E717 (U+E717 = 0075 + 0308 + 0302) | LATIN SMALL LETTER U WITH DIAERESIS AND CIRCUMFLEX (= LATIN SMALL LETTER U + COMBINING DIAERESIS + COMBINING CIRCUMFLEX ACCENT)
7.42. Appendix AP Subrange 42: Characters with diaeresis and dot below
Character ID | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description
e41d | a | adotbluml = a + combuml + combdotbl | U+E41D (U+E41D = 0061 + 0308 + 0323) | LATIN SMALL LETTER A WITH DIAERESIS AND DOT BELOW (= LATIN SMALL LETTER A + COMBINING DIAERESIS + COMBINING DOT BELOW)
7.43. Appendix AQ Subrange 43: Characters with ogonek and curl above (reversed ogonek)
Character ID | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description
ebf2 | E | Eogoncurl = E + combogon + combcurl | U+EBF2 (U+EBF2 = 0045 + 0328 + 1DCE) | LATIN CAPITAL LETTER E WITH OGONEK AND CURL (= LATIN CAPITAL LETTER E + COMBINING OGONEK + COMBINING OGONEK ABOVE)
ebf3 | e | eogoncurl = e + combogon + combcurl | U+EBF3 (U+EBF3 = 0065 + 0328 + 1DCE) | LATIN SMALL LETTER E WITH OGONEK AND CURL (= LATIN SMALL LETTER E + COMBINING OGONEK + COMBINING OGONEK ABOVE)
e24f | O | Oogoncurl = O + combogon + combcurl | U+E24F (U+E24F = 004F + 0328 + 1DCE) | LATIN CAPITAL LETTER O WITH OGONEK AND CURL (= LATIN CAPITAL LETTER O + COMBINING OGONEK + COMBINING OGONEK ABOVE)
e64f | o | oogoncurl = o + combogon + combcurl | U+E64F (U+E64F = 006F + 0328 + 1DCE) | LATIN SMALL LETTER O WITH OGONEK AND CURL (= LATIN SMALL LETTER O + COMBINING OGONEK + COMBINING OGONEK ABOVE)
7.44. Appendix AR Subrange 44: Characters with ogonek and circumflex
Character ID | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description
e49f | e | eogoncirc = e + combogon + combcirc | U+E49F (U+E49F = 0065 + 0328 + 0302) | LATIN SMALL LETTER E WITH OGONEK AND CIRCUMFLEX (= LATIN SMALL LETTER E + COMBINING OGONEK + COMBINING CIRCUMFLEX ACCENT)
e60e | o | oogoncirc = o + combogon + combcirc | U+E60E (U+E60E = 006F + 0328 + 0302) | LATIN SMALL LETTER O WITH OGONEK AND CIRCUMFLEX (= LATIN SMALL LETTER O + COMBINING OGONEK + COMBINING CIRCUMFLEX ACCENT)
7.45. Appendix AS Subrange 45: Characters with ring above and circumflex
Character ID | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description
e41f | a | aringcirc = a + combring + combcirc | U+E41F (U+E41F = 0061 + 030A + 0302) | LATIN SMALL LETTER A WITH RING ABOVE AND CIRCUMFLEX (= LATIN SMALL LETTER A + COMBINING RING ABOVE + COMBINING CIRCUMFLEX ACCENT)
7.46. Appendix AT Subrange 46: Characters with macron and breve
Character ID | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description
e010 | A | Amacrbreve = A + combmacr + combbreve | U+E010 (U+E010 = 0041 + 0304 + 0306) | LATIN CAPITAL LETTER A WITH MACRON AND BREVE (= LATIN CAPITAL LETTER A + COMBINING MACRON + COMBINING BREVE)
e410 | a | amacrbreve = a + combmacr + combbreve | U+E410 (U+E410 = 0061 + 0304 + 0306) | LATIN SMALL LETTER A WITH MACRON AND BREVE (= LATIN SMALL LETTER A + COMBINING MACRON + COMBINING BREVE)
e03d | AE | AEligmacrbreve = AElig + combmacr + combbreve | U+E03D (U+E03D = 00C6 + 0304 + 0306) | LATIN CAPITAL LETTER AE WITH MACRON AND BREVE (= LATIN CAPITAL LETTER AE + COMBINING MACRON + COMBINING BREVE)
e43d | ae | aeligmacrbreve = aelig + combmacr + combbreve | U+E43D (U+E43D = 00E6 + 0304 + 0306) | LATIN SMALL LETTER AE WITH MACRON AND BREVE (= LATIN SMALL LETTER AE + COMBINING MACRON + COMBINING BREVE)
e0b7 | E | Emacrbreve = E + combmacr + combbreve | U+E0B7 (U+E0B7 = 0045 + 0304 + 0306) | LATIN CAPITAL LETTER E WITH MACRON AND BREVE (= LATIN CAPITAL LETTER E + COMBINING MACRON + COMBINING BREVE)
e4b7 | e | emacrbreve = e + combmacr + combbreve | U+E4B7 (U+E4B7 = 0065 + 0304 + 0306) | LATIN SMALL LETTER E WITH MACRON AND BREVE (= LATIN SMALL LETTER E + COMBINING MACRON + COMBINING BREVE)
e137 | I | Imacrbreve = I + combmacr + combbreve | U+E137 (U+E137 = 0049 + 0304 + 0306) | LATIN CAPITAL LETTER I WITH MACRON AND BREVE (= LATIN CAPITAL LETTER I + COMBINING MACRON + COMBINING BREVE)
e537 | i | imacrbreve = i + combmacr + combbreve | U+E537 (U+E537 = 0069 + 0304 + 0306) | LATIN SMALL LETTER I WITH MACRON AND BREVE (= LATIN SMALL LETTER I + COMBINING MACRON + COMBINING BREVE)
e21b | O | Omacrbreve = O + combmacr + combbreve | U+E21B (U+E21B = 004F + 0304 + 0306) | LATIN CAPITAL LETTER O WITH MACRON AND BREVE (= LATIN CAPITAL LETTER O + COMBINING MACRON + COMBINING BREVE)
e61b | o | omacrbreve = o + combmacr + combbreve | U+E61B (U+E61B = 006F + 0304 + 0306) | LATIN SMALL LETTER O WITH MACRON AND BREVE (= LATIN SMALL LETTER O + COMBINING MACRON + COMBINING BREVE)
[...] | [...] | [...] | [...] | [...] (= LATIN CAPITAL LETTER OE + COMBINING MACRON + COMBINING BREVE)
e660 | oe | oeligmacrbreve = oelig + combmacr + combbreve | U+E660 (U+E660 = 0153 + 0304 + 0306) | LATIN SMALL LIGATURE OE WITH MACRON AND BREVE (= LATIN SMALL LETTER OE + COMBINING MACRON + COMBINING BREVE)
e253 | O | Oslashmacrbreve = Oslash + combmacr + combbreve | U+E253 (U+E253 = 00D8 + 0304 + 0306) | LATIN CAPITAL LETTER O WITH STROKE AND MACRON AND BREVE (= LATIN CAPITAL LETTER O WITH STROKE + COMBINING MACRON + COMBINING BREVE)
e653 | o | oslashmacrbreve = oslash + combmacr + combbreve | U+E653 (U+E653 = 00F8 + 0304 + 0306) | LATIN SMALL LETTER O WITH STROKE AND MACRON AND BREVE (= LATIN SMALL LETTER O WITH STROKE + COMBINING MACRON + COMBINING BREVE)
e30b | U | Umacrbreve = U + combmacr + combbreve | U+E30B (U+E30B = 0055 + 0304 + 0306) | LATIN CAPITAL LETTER U WITH MACRON AND BREVE (= LATIN CAPITAL LETTER U + COMBINING MACRON + COMBINING BREVE)
e70b | u | umacrbreve = u + combmacr + combbreve | U+E70B (U+E70B = 0075 + 0304 + 0306) | LATIN SMALL LETTER U WITH MACRON AND BREVE (= LATIN SMALL LETTER U + COMBINING MACRON + COMBINING BREVE)
e375 | Y | Ymacrbreve = Y + combmacr + combbreve | U+E375 (U+E375 = 0059 + 0304 + 0306) | LATIN CAPITAL LETTER Y WITH MACRON AND BREVE (= LATIN CAPITAL LETTER Y + COMBINING MACRON + COMBINING BREVE)
e775 | y | ymacrbreve = y + combmacr + combbreve | U+E775 (U+E775 = 0079 + 0304 + 0306) | LATIN SMALL LETTER Y WITH MACRON AND BREVE (= LATIN SMALL LETTER Y + COMBINING MACRON + COMBINING BREVE)
7.47. Appendix AU Subrange 47: Characters with macron and acute accent
Character ID | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description
e00a | A | Amacracute = A + combmacr + combacute | U+E00A (U+E00A = 0041 + 0304 + 0301) | LATIN CAPITAL LETTER A WITH MACRON AND ACUTE (= LATIN CAPITAL LETTER A + COMBINING MACRON + COMBINING ACUTE)
e40a | a | amacracute = a + combmacr + combacute | U+E40A (U+E40A = 0061 + 0304 + 0301) | LATIN SMALL LETTER A WITH MACRON AND ACUTE (= LATIN SMALL LETTER A + COMBINING MACRON + COMBINING ACUTE)
e03a | AE | AEligmacracute = AElig + combmacr + combacute | U+E03A (U+E03A = 00C6 + 0304 + 0301) | LATIN CAPITAL LETTER AE WITH MACRON AND ACUTE (= LATIN CAPITAL LETTER AE + COMBINING MACRON + COMBINING ACUTE)
e43a | ae | aeligmacracute = aelig + combmacr + combacute | U+E43A (U+E43A = 00E6 + 0304 + 0301) | LATIN SMALL LETTER AE WITH MACRON AND ACUTE (= LATIN SMALL LETTER AE + COMBINING MACRON + COMBINING ACUTE)
e135 | I | Imacracute = I + combmacr + combacute | U+E135 (U+E135 = 0049 + 0304 + 0301) | LATIN CAPITAL LETTER I WITH MACRON AND ACUTE (= LATIN CAPITAL LETTER I + COMBINING MACRON + COMBINING ACUTE)
e535 | i | imacracute = i + combmacr + combacute | U+E535 (U+E535 = 0069 + 0304 + 0301) | LATIN SMALL LETTER I WITH MACRON AND ACUTE (= LATIN SMALL LETTER I + COMBINING MACRON + COMBINING ACUTE)
ebec | O | Oslashmacracute = Oslash + combmacr + combacute | U+EBEC (U+EBEC = 00D8 + 0304 + 0301) | LATIN CAPITAL LETTER O WITH STROKE AND MACRON AND ACUTE (= LATIN CAPITAL LETTER O WITH STROKE + COMBINING MACRON + COMBINING ACUTE)
ebed | o | oslashmacracute = oslash + combmacr + combacute | U+EBED (U+EBED = 00F8 + 0304 + 0301) | LATIN SMALL LETTER O WITH STROKE AND MACRON AND ACUTE (= LATIN SMALL LETTER O WITH STROKE + COMBINING MACRON + COMBINING ACUTE)
e309 | U | Umacracute = U + combmacr + combacute | U+E309 (U+E309 = 0055 + 0304 + 0301) | LATIN CAPITAL LETTER U WITH MACRON AND ACUTE (= LATIN CAPITAL LETTER U + COMBINING MACRON + COMBINING ACUTE)
e709 | u | umacracute = u + combmacr + combacute | U+E709 (U+E709 = 0075 + 0304 + 0301) | LATIN SMALL LETTER U WITH MACRON AND ACUTE (= LATIN SMALL LETTER U + COMBINING MACRON + COMBINING ACUTE)
e373 | Y | Ymacracute = Y + combmacr + combacute | U+E373 (U+E373 = 0059 + 0304 + 0301) | LATIN CAPITAL LETTER Y WITH MACRON AND ACUTE (= LATIN CAPITAL LETTER Y + COMBINING MACRON + COMBINING ACUTE)
e773 | y | ymacracute = y + combmacr + combacute | U+E773 (U+E773 = 0079 + 0304 + 0301) | LATIN SMALL LETTER Y WITH MACRON AND ACUTE (= LATIN SMALL LETTER Y + COMBINING MACRON + COMBINING ACUTE)
7.48. Appendix AV Subrange 48: Characters with ogonek, dot above and acute accent
Character ID | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description
e0ec | E | Eogondotacute = E + combogon + combdot + combacute | U+E0EC (U+E0EC = 0045 + 0328 + 0307 + 0301) | LATIN CAPITAL LETTER E WITH OGONEK AND DOT ABOVE AND ACUTE (= LATIN CAPITAL LETTER E + COMBINING OGONEK + COMBINING DOT ABOVE + COMBINING ACUTE ACCENT)
e4ec | e | eogondotacute = e + combogon + combdot + combacute | U+E4EC (U+E4EC = 0065 + 0328 + 0307 + 0301) | LATIN SMALL LETTER E WITH OGONEK AND DOT ABOVE AND ACUTE (= LATIN SMALL LETTER E + COMBINING OGONEK + COMBINING DOT ABOVE + COMBINING ACUTE ACCENT)
ebfa | O | Oogondotacute = O + combogon + combdot + combacute | U+EBFA (U+EBFA = 004F + 0328 + 0307 + 0301) | LATIN CAPITAL LETTER O WITH OGONEK AND DOT ABOVE AND ACUTE (= LATIN CAPITAL LETTER O + COMBINING OGONEK + COMBINING DOT ABOVE + COMBINING ACUTE ACCENT)
ebfb | o | oogondotacute = o + combogon + combdot + combacute | U+EBFB (U+EBFB = 006F + 0328 + 0307 + 0301) | LATIN SMALL LETTER O WITH OGONEK AND DOT ABOVE AND ACUTE (= LATIN SMALL LETTER O + COMBINING OGONEK + COMBINING DOT ABOVE + COMBINING ACUTE ACCENT)
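In a TEI P5 transcription, a character such as those tabulated above can be marked up with a `<g>` element that points to its documentation and carries a fallback as its content. The sketch below shows one way this might be generated in Python. It rests on assumptions: the helper `tei_g` is hypothetical, the identifier in `ref` is assumed to be the MUFI entity name, and the TEI namespace is omitted for brevity.

```python
import xml.etree.ElementTree as ET


def tei_g(entity_name, fallback):
    """Build a TEI <g> element referencing a documented character.

    entity_name: assumed here to match the xml:id of a <char> or <glyph>
                 declaration (the MUFI entity name is used for illustration).
    fallback:    text shown by software that cannot resolve the reference,
                 for example the Standardized Replacement from the table.
    """
    g = ET.Element("g", {"ref": "#" + entity_name})
    g.text = fallback
    return ET.tostring(g, encoding="unicode")


markup = tei_g("Eogondotacute", "E")
```

Using the standardized replacement as the element content keeps the text legible and searchable even where the reference cannot be resolved.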
7.49. Appendix AW Subrange 51: Alphabetical list of variant letter forms
Character ID | Standardized Replacement | MUFI Entity | Unicode PUA Code Point | Description
f13a | A | Asqu | U+F13A | LATIN CAPITAL LETTER A SQUARE FORM
f214 | a | aunc | U+F214 | LATIN SMALL LETTER A UNCIAL FORM
f201 | A | Ains | U+F201 | LATIN CAPITAL LETTER A INSULAR FORM
f200 | a | ains | U+F200 | LATIN SMALL LETTER A INSULAR FORM
f202 | a | aopen | U+F202 | LATIN SMALL LETTER OPEN A CAROLINGIAN FORM
f215 | a | aneckless | U+F215 | LATIN SMALL LETTER NECKLESS A
f203 | a | aclose | U+F203 | LATIN SMALL LETTER CLOSED A GOTHIC FORM
f106 | C | Csqu | U+F106 | LATIN CAPITAL LETTER C SQUARE FORM
f198 | c | ccurl | U+F198 | LATIN SMALL LETTER C WITH CURL
f193 | d | dcurl | U+F193 | LATIN SMALL LETTER D WITH CURL
f10a | E | Eunc | U+F10A | LATIN CAPITAL LETTER E UNCIAL FORM
f217 | E | Euncclose | U+F217 | LATIN CAPITAL LETTER CLOSED E UNCIAL FORM
f218 | e | eunc | U+F218 | LATIN SMALL LETTER E UNCIAL FORM
f219 | e | eext | U+F219 | LATIN SMALL LETTER E EXTENDED BAR FORM
f21a | e | etall | U+F21A | LATIN SMALL LETTER E TALL FORM
f21b | f | finssemiclose | U+F21B | LATIN SMALL LETTER SEMI-CLOSED INSULAR F
f21c | f | finsdothook | U+F21C | LATIN SMALL LETTER INSULAR F WITH DOTTED HOOKS
f207 | f | finsclose | U+F207 | LATIN SMALL LETTER CLOSED INSULAR F
f194 | f | fcurl | U+F194 | LATIN SMALL LETTER F WITH CURL
f10e | G | Gsqu | U+F10E | LATIN CAPITAL LETTER G SQUARE FORM
f196 | g | gcurl | U+F196 | LATIN SMALL LETTER G WITH CURL
f21d | g | gdivloop | U+F21D | LATIN SMALL LETTER G WITH SEPARATE LOOPS
f21e | g | glglowloop | U+F21E | LATIN SMALL LETTER CLOSED G WITH LARGE LOWER LOOP
f21f | g | gsmlowloop | U+F21F | LATIN SMALL LETTER CLOSED G WITH SMALL LOWER LOOP
f110 | H | Hunc | U+F110 | LATIN CAPITAL LETTER H UNCIAL FORM
f23a | h | hrdes | U+F23A | LATIN SMALL LETTER H WITH RIGHT DESCENDER
f220 | i | ilong | U+F220 | LATIN SMALL LETTER LONG I
f208 | k | kunc | U+F208 | LATIN SMALL LETTER K UNCIAL FORM
f221 | k | ksemiclose | U+F221 | LATIN SMALL LETTER K SEMI-CLOSED FORM
f209 | k | kclose | U+F209 | LATIN SMALL LETTER K CLOSED FORM
f195 | k | kcurl | U+F195 | LATIN SMALL LETTER K WITH CURL
f222 | l | ldes | U+F222 | LATIN SMALL LETTER L DESCENDING
f11a | M | Munc | U+F11A | LATIN CAPITAL LETTER M UNCIAL FORM
f224 | M | Muncdes | U+F224 | LATIN CAPITAL LETTER M UNCIAL FORM WITH RIGHT DESCENDER
f225 | m | munc | U+F225 | LATIN SMALL LETTER M UNCIAL FORM
f226 | m | muncdes | U+F226 | LATIN SMALL LETTER M UNCIAL FORM WITH RIGHT DESCENDER
f223 | m | mrdes | U+F223 | LATIN SMALL LETTER M WITH RIGHT DESCENDER
f229 | N | Nrdes | U+F229 | LATIN CAPITAL LETTER N WITH RIGHT DESCENDER
f228 | n | nrdes | U+F228 | LATIN SMALL LETTER N WITH RIGHT DESCENDER
f22a | N | nscaprdes | U+F22A | LATIN LETTER SMALL CAPITAL N WITH RIGHT DESCENDER
f22b | N | nscapldes | U+F22B | LATIN LETTER SMALL CAPITAL N WITH LEFT DESCENDER
f19a | n | nflour | U+F19A | LATIN SMALL LETTER N WITH FLOURISH
f22c | Q | Qstem | U+F22C | LATIN CAPITAL LETTER Q WITH STEM
f19b | r | rflour | U+F19B | LATIN SMALL LETTER R WITH FLOURISH
f126 | S | Sclose | U+F126 | LATIN CAPITAL LETTER S CLOSED FORM
f128 | s | sclose | U+F128 | LATIN SMALL LETTER S CLOSED FORM
f127 | s | slongdes | U+F127 | LATIN SMALL LETTER LONG S DESCENDING
f199 | t | tcurl | U+F199 | LATIN SMALL LETTER T WITH CURL
f232 | x | xldes | U+F232 | LATIN SMALL LETTER X WITH LEFT DESCENDER
f233 | y | yrgmainstrok | U+F233 | LATIN SMALL LETTER Y WITH RIGHT MAIN STROKE