Appendix C: Diacritics and Special Characters

Appendix C: Diacritics and Special Characters BACKGROUND: Diacritics are marks placed over, under, or through letters or combinations of letters that indicate differences in pronunciation and meaning between the marked letter and its unmarked counterpart. Common diacritics are the umlaut (for example, ü), the acute (for example, ó), the grave (for example, à), and the macron (for example, ). Double diacritics are two diacritics modifying a single letter. Special characters occupy a place of their own rather than modifying a letter. Common special characters include the Polish l ( ), the miagki znak ( ), the ayn ( ), and the alif ( ’ ). Superscript characters are letters, numbers, or symbols written or printed higher on a line of text; subscript characters are letters, numbers, or symbols written or printed lower on a line of text. PLACEMENT OF DIACRITICS IN LCDGT: In LCDGT diacritics and special characters are generally printed in accordance with either the LC romanization conventions for a particular language or with a conventional form found in reference sources. 1. Policy for established demographic group terms. Transcribe existing terms exactly as they have been established, using diacritics and special characters as indicated. 2. Establishing terms with or without diacritics. When, as a result of doing authority research, a choice must be made between a term without diacritics and the otherwise identical term with diacritics, generally prefer the term without diacritics if it is clearly the conventional English form. If there is no conventional English form, prefer the form of the term as found in the Encyclopaedia Britannica or in reference sources of the particular discipline or language, using diacritics to conform to LC’s transcription or romanization conventions. 3. Diacritic use and LC practice. If it is determined that an authorized term or reference is to have diacritics, use diacritics that conform to LC transcription or romanization conventions, even if reference sources, including BGN, indicate different diacritics. Use judgment in determining whether to make UF references from forms with variant diacritics that are spelled differently from the proposed term. (For forms that have variant diacritics but are spelled the same as the proposed heading, see sec. 4, below.) 4. Diacritics in authorized terms and references. Because some online systems cannot distinguish between a term without diacritics and the otherwise identically spelled term with diacritics, do not add 450 fields for variants of this type, even if the terms do not represent the same word in a particular language. 5. Greek or Hebrew letters. Romanize Greek or Hebrew letters when establishing terms. Do not use Greek or Hebrew letters in authorized terms or references, even though they appear as such in reference sources. Demographic Group Terms Manual Appendix C Page 1 December 2015 Appendix C: Diacritics and Special Characters 6. Ampersands. Use an ampersand only when establishing a demographic group term that is based on a name heading and the name heading includes an ampersand (e.g., members of a political party, see L 650). Spell out the word and when it is used to connect two elements in a qualifier. Example: Yaka speakers (Angolan and Congolese (Democratic Republic) language) 7. Common diacritics and special characters. a. Acute ( ´ ) Algerians UF Algériens Irish (Republic of Ireland) UF Éireannaigh Do not confuse this diacritic with the miagki znak ( ) and the apostrophe ( ’ ), which are special characters. b. Alif ( ’ ) c. Angstrom or Circle above letter ( ) Do not confuse with the dot above letter ( ⋅ ). d. Apostrophe ( ’ ) Do not confuse with the acute ( ´ ), the miagki znak ( ), the alif ( ’ ), the ayn ( ), or the grave ( ` ). e. Ayn ( ) Do not confuse with the acute ( ´ ), the miagki znak ( ), the alif ( ’ ), or the apostrophe ( ’ ) Demographic Group Terms Manual Appendix C Page 2 December 2015 Appendix C: Diacritics and Special Characters 7. Common diacritics and special characters. (Continued) f. Breve ( ) Do not confuse with the hacek ( ) g. Cedilla ( ¸ ) French UF Français Do not confuse with the left hook ( ). h. Circle below letter ( ) Do not confuse with the dot below letter ( . ). i. Circumflex ( ^ ) Ivoirians UF Côte-d'Ivoire residents j. Dot above letter or Superior dot ( . ) Do not confuse with the angstrom or circle above letter ( ). k. Dot below letter ( . ) Do not confuse with the circle below letter ( ). l. Double acute ( ″ ) This diacritic appears only in Hungarian. Do not confuse with the tverdyi znak ( ), a special character that follows the letter it modifies, the Umlaut ( ¨ ), or the standard quotation mark ( " ). Demographic Group Terms Manual Appendix C Page 3 December 2015 Appendix C: Diacritics and Special Characters 7. Common diacritics and special characters. (Continued) m. Grave ( ` ) Swiss Helvètes Do not confuse with the apostrophe ( ’ ). n. Hacek ( ) Do not confuse with the breve ( ). o. Left hook ( ) Do not confuse with the cedilla ( ¸ ). p. Ligature ( ) q. Macron ( B ) r. Miagki znak ( ) Do not confuse with the acute ( ´ ), the alif ( ’ ), the apostrophe ( ’ ), or the ayn ( ). s. Polish l ( ) Note that Polish also has a regular l. t. Slash o ( ø ) u. Tilde ( ~ ) Spanish speakers UF Español speakers Demographic Group Terms Manual Appendix C Page 4 December 2015 Appendix C: Diacritics and Special Characters 7. Common diacritics and special characters. (Continued) v. Turkish i ( ) Note that i is not dotted. If uppercased, render as I. Turkish also has a regular i. w. Tverdyi znak ( ) Do not confuse this special character with the double acute ( ″ ) of Hungarian, the umlaut ( ¨ ), or the standard quotation mark ( " ). x. Umlaut or diaeresis ( ¨ ) Haitians UF Haïtiens Albanian speakers UF Shqipëri speakers Do not confuse this diacritic with the double acute ( ″ ) of Hungarian, the tverdyi znak ( ), or the standard quotation mark ( " ). Demographic Group Terms Manual Appendix C Page 5 December 2015 .

Appendix C: Diacritics and Special Characters

BIBLIOGRAPHY N Most Diacritic Letters ( C, I, Ö, ˙ S, Etc.) Are

Ligature Modeling for Recognition of Characters Written in 3D Space Dae Hwan Kim, Jin Hyung Kim

Ffontiau Cymraeg

Form IT-2658-E:12/19:Certificate of Exemption from Partnership Or New

Combining Diacritical Marks Range: 0300–036F the Unicode Standard

Your Baby Has Hemoglobin E Or Hemoglobin O Trait for Parents

Medical Terminology Abbreviations Medical Terminology Abbreviations

Left Ventricular Diastolic Dysfunction in Ischemic Stroke: Functional and Vascular Outcomes

Typing in Greek Sarah Abowitz Smith College Classics Department

How to Edit IPA 1 How to Use SAMPA for Editing IPA 2 How to Use X

A Collection of Mildly Interesting Facts About the Little Symbols We Communicate With

The Selnolig Package: Selective Suppression of Typographic Ligatures*