Appendix C: Diacritics and Special Characters

Total Page:16

File Type:pdf, Size:1020Kb

Appendix C: Diacritics and Special Characters Appendix C: Diacritics and Special Characters BACKGROUND: Diacritics are marks placed over, under, or through letters or combinations of letters that indicate differences in pronunciation and meaning between the marked letter and its unmarked counterpart. Common diacritics are the umlaut (for example, ü), the acute (for example, ó), the grave (for example, à), and the macron (for example, ). Double diacritics are two diacritics modifying a single letter. Special characters occupy a place of their own rather than modifying a letter. Common special characters include the Polish l ( ), the miagki znak ( ), the ayn ( ), and the alif ( ’ ). Superscript characters are letters, numbers, or symbols written or printed higher on a line of text; subscript characters are letters, numbers, or symbols written or printed lower on a line of text. PLACEMENT OF DIACRITICS IN LCDGT: In LCDGT diacritics and special characters are generally printed in accordance with either the LC romanization conventions for a particular language or with a conventional form found in reference sources. 1. Policy for established demographic group terms. Transcribe existing terms exactly as they have been established, using diacritics and special characters as indicated. 2. Establishing terms with or without diacritics. When, as a result of doing authority research, a choice must be made between a term without diacritics and the otherwise identical term with diacritics, generally prefer the term without diacritics if it is clearly the conventional English form. If there is no conventional English form, prefer the form of the term as found in the Encyclopaedia Britannica or in reference sources of the particular discipline or language, using diacritics to conform to LC’s transcription or romanization conventions. 3. Diacritic use and LC practice. If it is determined that an authorized term or reference is to have diacritics, use diacritics that conform to LC transcription or romanization conventions, even if reference sources, including BGN, indicate different diacritics. Use judgment in determining whether to make UF references from forms with variant diacritics that are spelled differently from the proposed term. (For forms that have variant diacritics but are spelled the same as the proposed heading, see sec. 4, below.) 4. Diacritics in authorized terms and references. Because some online systems cannot distinguish between a term without diacritics and the otherwise identically spelled term with diacritics, do not add 450 fields for variants of this type, even if the terms do not represent the same word in a particular language. 5. Greek or Hebrew letters. Romanize Greek or Hebrew letters when establishing terms. Do not use Greek or Hebrew letters in authorized terms or references, even though they appear as such in reference sources. Demographic Group Terms Manual Appendix C Page 1 December 2015 Appendix C: Diacritics and Special Characters 6. Ampersands. Use an ampersand only when establishing a demographic group term that is based on a name heading and the name heading includes an ampersand (e.g., members of a political party, see L 650). Spell out the word and when it is used to connect two elements in a qualifier. Example: Yaka speakers (Angolan and Congolese (Democratic Republic) language) 7. Common diacritics and special characters. a. Acute ( ´ ) Algerians UF Algériens Irish (Republic of Ireland) UF Éireannaigh Do not confuse this diacritic with the miagki znak ( ) and the apostrophe ( ’ ), which are special characters. b. Alif ( ’ ) c. Angstrom or Circle above letter ( ) Do not confuse with the dot above letter ( ⋅ ). d. Apostrophe ( ’ ) Do not confuse with the acute ( ´ ), the miagki znak ( ), the alif ( ’ ), the ayn ( ), or the grave ( ` ). e. Ayn ( ) Do not confuse with the acute ( ´ ), the miagki znak ( ), the alif ( ’ ), or the apostrophe ( ’ ) Demographic Group Terms Manual Appendix C Page 2 December 2015 Appendix C: Diacritics and Special Characters 7. Common diacritics and special characters. (Continued) f. Breve ( ) Do not confuse with the hacek ( ) g. Cedilla ( ¸ ) French UF Français Do not confuse with the left hook ( ). h. Circle below letter ( ) Do not confuse with the dot below letter ( . ). i. Circumflex ( ^ ) Ivoirians UF Côte-d'Ivoire residents j. Dot above letter or Superior dot ( . ) Do not confuse with the angstrom or circle above letter ( ). k. Dot below letter ( . ) Do not confuse with the circle below letter ( ). l. Double acute ( ″ ) This diacritic appears only in Hungarian. Do not confuse with the tverdyi znak ( ), a special character that follows the letter it modifies, the Umlaut ( ¨ ), or the standard quotation mark ( " ). Demographic Group Terms Manual Appendix C Page 3 December 2015 Appendix C: Diacritics and Special Characters 7. Common diacritics and special characters. (Continued) m. Grave ( ` ) Swiss Helvètes Do not confuse with the apostrophe ( ’ ). n. Hacek ( ) Do not confuse with the breve ( ). o. Left hook ( ) Do not confuse with the cedilla ( ¸ ). p. Ligature ( ) q. Macron ( B ) r. Miagki znak ( ) Do not confuse with the acute ( ´ ), the alif ( ’ ), the apostrophe ( ’ ), or the ayn ( ). s. Polish l ( ) Note that Polish also has a regular l. t. Slash o ( ø ) u. Tilde ( ~ ) Spanish speakers UF Español speakers Demographic Group Terms Manual Appendix C Page 4 December 2015 Appendix C: Diacritics and Special Characters 7. Common diacritics and special characters. (Continued) v. Turkish i ( ) Note that i is not dotted. If uppercased, render as I. Turkish also has a regular i. w. Tverdyi znak ( ) Do not confuse this special character with the double acute ( ″ ) of Hungarian, the umlaut ( ¨ ), or the standard quotation mark ( " ). x. Umlaut or diaeresis ( ¨ ) Haitians UF Haïtiens Albanian speakers UF Shqipëri speakers Do not confuse this diacritic with the double acute ( ″ ) of Hungarian, the tverdyi znak ( ), or the standard quotation mark ( " ). Demographic Group Terms Manual Appendix C Page 5 December 2015 .
Recommended publications
  • BIBLIOGRAPHY N Most Diacritic Letters ( C, I, Ö, ˙ S, Etc.) Are
    BIBLIOGRAPHY Notes Most diacritic letters (ˇc,ı, ö, s, etc.) are alphabetized as separate letters after ˙ the base letter. The abbreviation “ms. NAN KR” stands for “manuscript in the Archives of the National Center for Manasology and Artistic Culture of the National Academy of Sciences of the Kyrgyz Republic,” followed by the archival shelf number. In the Commentary and footnotes, the form of some of the bibliographic citations of mid-nineteenth century Kirghiz epics (recorded by Radlof from anonymous bards, published under separate headings in his Obraztsy/ Proben [1885], and re-edited by Hatto in The Manas of Wilhelm Radlof [1990]) is nearly identical to the citations of analytical articles on the texts that Hatto published in various journals. These are the references to the Radlovian epic texts, which were coincidentally published by Hatto in his re-edition: “Almambet, Er Kökˇcöand Ak-Erkeˇc”;“Birth of Manas”; “Birth of Semetey”; “Bok-Murun”; “Köz-Kaman”; and “Semetey.” Hatto’s articles are cited as: Hatto, “Almambet, Er Kökˇcöand Ak Erkeˇc”;Hatto, “Birth of Manas”; Hatto, “Köz-Kaman”; Hatto, “Semetey.” Listed below by language are references to the main bibliographic entries of the dictionaries, grammars, and other aids used, sometimes without citation, in the preparation of this edition: Arabic Baranov, Arabsko–russkii slovar’ Steingass, A Learner’s Arabic–English Dictionary Baˇskir Akhmerov, Bashkirsko–russkii slovar’ Chaghatay Budagov, Sravnitel’nyi slovar’ turetsko–tatarskikh narechii Shcherbak, Grammatika starouzbekskogo iazyka Radlov, Opyt/Versuch Eastern Turki Jarring, An Eastern Turki–English Dialect Dictionary (Uyghur) Nadzhip, Uigursko–russkii slovar’ Kalmyk Ramstedt, Kalmückisches Wörterbuch Kirghiz Batmanov, Sovremennyi kirgizskii iazyk Hu and Imart, A Kirghiz Reader Imart, Le kirghiz Iudakhin, Kirgizsko–russkii slovar’ ———, Russko–kirgizskii slovar’ 374 bibliography Kirghiz (cont.) Muqambaev, Qır ˙gıztilinin dialektologiyalıq sözdügü, vol.
    [Show full text]
  • Ligature Modeling for Recognition of Characters Written in 3D Space Dae Hwan Kim, Jin Hyung Kim
    Ligature Modeling for Recognition of Characters Written in 3D Space Dae Hwan Kim, Jin Hyung Kim To cite this version: Dae Hwan Kim, Jin Hyung Kim. Ligature Modeling for Recognition of Characters Written in 3D Space. Tenth International Workshop on Frontiers in Handwriting Recognition, Université de Rennes 1, Oct 2006, La Baule (France). inria-00105116 HAL Id: inria-00105116 https://hal.inria.fr/inria-00105116 Submitted on 10 Oct 2006 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Ligature Modeling for Recognition of Characters Written in 3D Space Dae Hwan Kim Jin Hyung Kim Artificial Intelligence and Artificial Intelligence and Pattern Recognition Lab. Pattern Recognition Lab. KAIST, Daejeon, KAIST, Daejeon, South Korea South Korea [email protected] [email protected] Abstract defined shape of character while it showed high recognition performance. Moreover when a user writes In this work, we propose a 3D space handwriting multiple stroke character such as ‘4’, the user has to recognition system by combining 2D space handwriting write a new shape which is predefined in a uni-stroke models and 3D space ligature models based on that the and which he/she has never seen.
    [Show full text]
  • Ffontiau Cymraeg
    This publication is available in other languages and formats on request. Mae'r cyhoeddiad hwn ar gael mewn ieithoedd a fformatau eraill ar gais. [email protected] www.caerphilly.gov.uk/equalities How to type Accented Characters This guidance document has been produced to provide practical help when typing letters or circulars, or when designing posters or flyers so that getting accents on various letters when typing is made easier. The guide should be used alongside the Council’s Guidance on Equalities in Designing and Printing. Please note this is for PCs only and will not work on Macs. Firstly, on your keyboard make sure the Num Lock is switched on, or the codes shown in this document won’t work (this button is found above the numeric keypad on the right of your keyboard). By pressing the ALT key (to the left of the space bar), holding it down and then entering a certain sequence of numbers on the numeric keypad, it's very easy to get almost any accented character you want. For example, to get the letter “ô”, press and hold the ALT key, type in the code 0 2 4 4, then release the ALT key. The number sequences shown from page 3 onwards work in most fonts in order to get an accent over “a, e, i, o, u”, the vowels in the English alphabet. In other languages, for example in French, the letter "c" can be accented and in Spanish, "n" can be accented too. Many other languages have accents on consonants as well as vowels.
    [Show full text]
  • Form IT-2658-E:12/19:Certificate of Exemption from Partnership Or New
    Department of Taxation and Finance IT-2658-E Certificate of Exemption from Partnership (12/19) or New York S Corporation Estimated Tax Paid on Behalf of Nonresident Individual Partners and Shareholders Do not send this certificate to the Tax Department (see instructions below). Use this certificate for tax years 2020 and 2021; it will expire on February 1, 2022. First name and middle initial Last name Social Security number Mailing address (number and street or PO box) Telephone number ( ) City, village, or post office State ZIP code I certify that I will comply with the New York State estimated tax provisions and tax return filing requirements, to the extent that they apply to me, for tax years 2020 and 2021 (see instructions). Signature of nonresident individual partner or shareholder Date Instructions General information If you do not meet either of the above exceptions, you may still claim exemption from this estimated tax provision by Tax Law section 658(c)(4) requires the following entities filing Form IT-2658-E. that have income derived from New York sources to make estimated personal income tax payments on behalf of You qualify to claim exemption and file Form IT-2658-E by partners or shareholders who are nonresident individuals: certifying that you will comply in your individual capacity with • New York S corporations; all the New York State personal income tax and MCTMT estimated tax and tax return filing requirements, to the • partnerships (other than publicly traded partnerships as extent that they apply to you, for the years covered by this defined in Internal Revenue Code section 7704); and certificate.
    [Show full text]
  • Combining Diacritical Marks Range: 0300–036F the Unicode Standard
    Combining Diacritical Marks Range: 0300–036F The Unicode Standard, Version 4.0 This file contains an excerpt from the character code tables and list of character names for The Unicode Standard, Version 4.0. Characters in this chart that are new for The Unicode Standard, Version 4.0 are shown in conjunction with any existing characters. For ease of reference, the new characters have been highlighted in the chart grid and in the names list. This file will not be updated with errata, or when additional characters are assigned to the Unicode Standard. See http://www.unicode.org/charts for access to a complete list of the latest character charts. Disclaimer These charts are provided as the on-line reference to the character contents of the Unicode Standard, Version 4.0 but do not provide all the information needed to fully support individual scripts using the Unicode Standard. For a complete understanding of the use of the characters contained in this excerpt file, please consult the appropriate sections of The Unicode Standard, Version 4.0 (ISBN 0-321-18578-1), as well as Unicode Standard Annexes #9, #11, #14, #15, #24 and #29, the other Unicode Technical Reports and the Unicode Character Database, which are available on-line. See http://www.unicode.org/Public/UNIDATA/UCD.html and http://www.unicode.org/unicode/reports A thorough understanding of the information contained in these additional sources is required for a successful implementation. Fonts The shapes of the reference glyphs used in these code charts are not prescriptive. Considerable variation is to be expected in actual fonts.
    [Show full text]
  • Your Baby Has Hemoglobin E Or Hemoglobin O Trait for Parents
    NEW HAMPSHIRE NEWBORN SCREENING PROGRAM Your Baby Has Hemoglobin E or Hemoglobin O Trait For Parents All infants born in New Hampshire are screened for a panel of conditions at birth. A small amount of blood was collected from your baby’s heel and sent to the laboratory for testing. One of the tests looked at the hemoglobin in your baby’s blood. Your baby’s test found that your baby has either hemoglobin E trait or hemoglobin O trait. The newborn screen- ing test cannot tell the difference between hemoglobin E and hemoglobin O so we do not know which one your baby has. Both hemoglobin E trait and hemoglobin O trait are common and do not cause health problems. Hemoglobin E trait and hemoglobin O trait will never develop to disease. What is hemoglobin? Hemoglobin is the part of the blood that carries oxygen to all parts of the body. There are different types of hemoglobin. The type of hemoglobin we have is determined from genes that we inherit from our parents. Genes are the instructions for how our body develops and functions. We have two copies of each gene; one copy is inherited from our mother in the egg and one copy is inherited from our father in the sperm. What are hemoglobin E trait and hemoglobin O trait? The normal, and most common, type of hemoglobin is called hemoglobin A. Hemoglobin E trait is when a baby inherited one gene for hemoglobin A from one parent and one gene for hemoglobin E from the other parent.
    [Show full text]
  • Medical Terminology Abbreviations Medical Terminology Abbreviations
    34 MEDICAL TERMINOLOGY ABBREVIATIONS MEDICAL TERMINOLOGY ABBREVIATIONS The following list contains some of the most common abbreviations found in medical records. Please note that in medical terminology, the capitalization of letters bears significance as to the meaning of certain terms, and is often used to distinguish terms with similar acronyms. @—at A & P—anatomy and physiology ab—abortion abd—abdominal ABG—arterial blood gas a.c.—before meals ac & cl—acetest and clinitest ACLS—advanced cardiac life support AD—right ear ADL—activities of daily living ad lib—as desired adm—admission afeb—afebrile, no fever AFB—acid-fast bacillus AKA—above the knee alb—albumin alt dieb—alternate days (every other day) am—morning AMA—against medical advice amal—amalgam amb—ambulate, walk AMI—acute myocardial infarction amt—amount ANS—automatic nervous system ant—anterior AOx3—alert and oriented to person, time, and place Ap—apical AP—apical pulse approx—approximately aq—aqueous ARDS—acute respiratory distress syndrome AS—left ear ASA—aspirin asap (ASAP)—as soon as possible as tol—as tolerated ATD—admission, transfer, discharge AU—both ears Ax—axillary BE—barium enema bid—twice a day bil, bilateral—both sides BK—below knee BKA—below the knee amputation bl—blood bl wk—blood work BLS—basic life support BM—bowel movement BOW—bag of waters B/P—blood pressure bpm—beats per minute BR—bed rest MEDICAL TERMINOLOGY ABBREVIATIONS 35 BRP—bathroom privileges BS—breath sounds BSI—body substance isolation BSO—bilateral salpingo-oophorectomy BUN—blood, urea, nitrogen
    [Show full text]
  • Left Ventricular Diastolic Dysfunction in Ischemic Stroke: Functional and Vascular Outcomes
    Journal of Stroke 2016;18(2):195-202 http://dx.doi.org/10.5853/jos.2015.01669 Original Article Left Ventricular Diastolic Dysfunction in Ischemic Stroke: Functional and Vascular Outcomes Hong-Kyun Park,a* Beom Joon Kim,a* Chang-Hwan Yoon,b Mi Hwa Yang,a Moon-Ku Han,a Hee-Joon Baea aDepartment of Neurology and Cerebrovascular Center, Seoul National University Bundang Hospital, Seongnam, Korea bDivision of Cardiology, Department of Internal Medicine, Seoul National University Bundang Hospital, Seongnam, Korea Background and Purpose Left ventricular (LV) diastolic dysfunction, developed in relation to myo- Correspondence: Moon-Ku Han cardial dysfunction and remodeling, is documented in 15%-25% of the population. However, its role Department of Neurology and Cerebrovascular Center, Seoul National in functional recovery and recurrent vascular events after acute ischemic stroke has not been thor- University Bundang Hospital, oughly investigated. 82 Gumi-ro 173beon-gil, Bundang-gu, Seongnam 13620, Korea Methods In this retrospective observational study, we identified 2,827 ischemic stroke cases with Tel: +82-31-787-7464 adequate echocardiographic evaluations to assess LV diastolic dysfunction within 1 month after the Fax: +82-31-787-4059 index stroke. The peak transmitral filling velocity/mean mitral annular velocity during early diastole (E/ E-mail: [email protected] e’) was used to estimate LV diastolic dysfunction. We divided patients into 3 groups according to E/e’ Received: December 1, 2015 as follows: <8, 8-15, and ≥15. Recurrent vascular events and functional recovery were prospectively Revised: March 14, 2016 collected at 3 months and 1 year. Accepted: March 21, 2016 Results Among included patients, E/e’ was 10.6±6.4: E/e’ <8 in 993 (35%), 8-15 in 1,444 (51%), and *These authors contributed equally to this ≥15 in 378 (13%) cases.
    [Show full text]
  • Typing in Greek Sarah Abowitz Smith College Classics Department
    Typing in Greek Sarah Abowitz Smith College Classics Department Windows 1. Down at the lower right corner of the screen, click the letters ENG, then select Language Preferences in the pop-up menu. If these letters are not present at the lower right corner of the screen, open Settings, click on Time & Language, then select Region & Language in the sidebar to get to the proper screen for step 2. 2. When this window opens, check if Ελληνικά/Greek is in the list of keyboards on your ​ ​ computer under Languages. If so, go to step 3. Otherwise, click Add A New Language. Clicking Add A New Language will take you to this window. Look for Ελληνικά/Greek and click it. When you click Ελληνικά/Greek, the language will be added and you will return to the previous screen. 3. Now that Ελληνικά is listed in your computer’s languages, click it and then click Options. 4. Click Add A Keyboard and add the Greek Polytonic option. If you started this tutorial without the pictured keyboard menu in step 1, it should be in the lower right corner of your screen now. 5. To start typing in Greek, click the letters ENG next to the clock in the lower right corner of the screen. Choose “Greek Polytonic keyboard” to start typing in greek, and click “US keyboard” again to go back to English. Mac 1. Click the apple button in the top left corner of your screen. From the drop-down menu, choose System Preferences. When the window below appears, click the “Keyboard” icon.
    [Show full text]
  • How to Edit IPA 1 How to Use SAMPA for Editing IPA 2 How to Use X
    version July 19 How to edit IPA When you want to enter the International Phonetic Association (IPA) character set with a computer keyboard, you need to know how to enter each IPA character with a sequence of keyboard strokes. This document describes a number of techniques. The complete SAMPA and RTR mapping can be found in the attached html documents. The main html document (ipa96.html) comes in a pdf-version (ipa96.pdf) too. 1 How to use SAMPA for editing IPA The Speech Assessment Method (SAM) Phonetic Alphabet has been developed by John Wells (http://www.phon.ucl.ac.uk/home/sampa). The goal was to map 176 IPA characters into the range of 7-bit ASCII, which is a set of 96 characters. The principle is to represent a single IPA character by a single ASCII character. This table is an example for five vowels: Description IPA SAMPA script a ɑ A ae ligature æ { turned a ɐ 6 epsilon ɛ E schwa ə @ A visual represenation of a keyboard shows the mapping on screen. The source for the SAMPA mapping used is "Handbook of multimodal an spoken dialogue systems", D Gibbon, Kluwer Academic Publishers 2000. 2 How to use X-SAMPA for editing IPA The multi-character extension to SAMPA has also been developed by John Wells (http://www.phon.ucl.ac.uk/home/sampa/x-sampa.htm). The basic principle used is to form chains of ASCII characters, that represent a single IPA character, e.g. This table lists some examples Description IPA X-SAMPA beta β B small capital B ʙ B\ lower-case B b b lower-case P p p Phi ɸ p\ The X-SAMPA mapping is in preparation and will be included in the next release.
    [Show full text]
  • A Collection of Mildly Interesting Facts About the Little Symbols We Communicate With
    Ty p o g raph i c Factettes A collection of mildly interesting facts about the little symbols we communicate with. Helvetica The horizontal bars of a letter are almost always thinner than the vertical bars. Minion The font size is approximately the measurement from the lowest appearance of any letter to the highest. Most of the time. Seventy-two points equals one inch. Fridge256 point Cochin most of 50the point Zaphino time Letters with rounded bottoms don’t sit on the baseline, but slightly below it. Visually, they would appear too high if they rested on the same base as the squared letters. liceAdobe Caslon Bold UNITED KINGDOM UNITED STATES LOLITA LOLITA In Ancient Rome, scribes would abbreviate et (the latin word for and) into one letter. We still use that abbreviation, called the ampersand. The et is still very visible in some italic ampersands. The word ampersand comes from and-per-se-and. Strange. Adobe Garamond Regular Adobe Garamond Italic Trump Mediaval Italic Helvetica Light hat two letters ss w it cam gue e f can rom u . I Yo t h d. as n b ha e rt en ho a s ro n u e n t d it r fo w r s h a u n w ) d r e e m d a s n o r f e y t e t a e r b s , a b s u d t e d e e n m t i a ( n l d o b s o m a y r S e - d t w A i e t h h t t , h d e n a a s d r v e e p n t m a o f e e h m t e a k i i l .
    [Show full text]
  • The Selnolig Package: Selective Suppression of Typographic Ligatures*
    The selnolig package: Selective suppression of typographic ligatures* Mico Loretan† 2015/10/26 Abstract The selnolig package suppresses typographic ligatures selectively, i.e., based on predefined search patterns. The search patterns focus on ligatures deemed inappropriate because they span morpheme boundaries. For example, the word shelfful, which is mentioned in the TEXbook as a word for which the ff ligature might be inappropriate, is automatically typeset as shelfful rather than as shelfful. For English and German language documents, the selnolig package provides extensive rules for the selective suppression of so-called “common” ligatures. These comprise the ff, fi, fl, ffi, and ffl ligatures as well as the ft and fft ligatures. Other f-ligatures, such as fb, fh, fj and fk, are suppressed globally, while making exceptions for names and words of non-English/German origin, such as Kafka and fjord. For English language documents, the package further provides ligature suppression rules for a number of so-called “discretionary” or “rare” ligatures, such as ct, st, and sp. The selnolig package requires use of the LuaLATEX format provided by a recent TEX distribution, e.g., TEXLive 2013 and MiKTEX 2.9. Contents 1 Introduction ........................................... 1 2 I’m in a hurry! How do I start using this package? . 3 2.1 How do I load the selnolig package? . 3 2.2 Any hints on how to get started with LuaLATEX?...................... 4 2.3 Anything else I need to do or know? . 5 3 The selnolig package’s approach to breaking up ligatures . 6 3.1 Free, derivational, and inflectional morphemes .
    [Show full text]