Doc: a List of Character Name Equivalents

Total Page:16

File Type:pdf, Size:1020Kb

Doc: a List of Character Name Equivalents A List of Character Name Equivalents The names by which characters are known vary from source to source. This is especially true of diacritical marks, modified letters, and symbols. Below is a list of synonyms for some characters used in MARC21 that have been known by various names. Most of the names included come from three sources: Unicode, Z39.47 (ANSEL), and MARC21 documentation. Unicode names tend to be long and ungainly, so shortened forms are given here. The characters are divided in two categories here. Table 1 includes combining diacritics that always modify another letter and share its space when printed. Table 2 lists special characters that are independent of other characters and occupy their own space in a printed line. Table 1. Combining diacritics Unicode name Other names Comments and usage notes Ring above Circle above, Angstrom Diaeresis Umlaut Dot above Superior dot Caron Hacek Hook above Pseudo question, low rising tone Comma above High comma centered For Latvian g, use precomposed form ‘with cedilla’. Comma may appear inverted in source document Comma above right High comma off center Do not use. Use precomposed forms ‘with caron’. Ligature [halves Ligature [halves This must be stored in decomposed form, for example, encoded separately] encoded separately] T ligature left half S ligature right half. Double tilde [halves Double tilde [halves This must be stored in decomposed form, for example, encoded separately] encoded separately] n double tilde left half g double tilde right half. Ogonek Right hook Left half ring below Right cedilla Use only for romanized Thai. Comma below Left hook, Use this routinely only for Romanian S and T. For all comma below other cases, prefer precomposed forms ‘with cedilla’ even if the comma does not appear to touch the letter. Low line Underscore This entry refers only to a line under a letter, not to the spacing underscore used in e-mail addresses, etc. Double low line Double underscore Ring below Circle below Diaeresis below Double dot below Breve below Upadhmaniya, half circle below July 1, 2002 Harvard University Library Office for Information Systems page 1 of 2 Table 2. Special characters that occupy their own space. Unicode name Other names Comments and usage notes Modifier letter right Alif, aleph half ring Modifier letter turned Ayn, ayin comma Modifier letter prime Soft sign, miagkii znak Modifier letter double Hard sign, tverdii znak prime Middle dot Centered dot, dot in middle of line Vertical line Vertical bar, fill character O with horn O-hook, hook O U with horn U-hook, hook U D with stroke D with crossbar, slash D L with stroke L with crossbar, Polish L, slash L O with stroke O with crossbar, Scandinavian O Slash O Letter Æ Digraph AE, ligature AE July 1, 2002 Harvard University Library Office for Information Systems page 2 of 2.
Recommended publications
  • Technical Reference Manual for the Standardization of Geographical Names United Nations Group of Experts on Geographical Names
    ST/ESA/STAT/SER.M/87 Department of Economic and Social Affairs Statistics Division Technical reference manual for the standardization of geographical names United Nations Group of Experts on Geographical Names United Nations New York, 2007 The Department of Economic and Social Affairs of the United Nations Secretariat is a vital interface between global policies in the economic, social and environmental spheres and national action. The Department works in three main interlinked areas: (i) it compiles, generates and analyses a wide range of economic, social and environmental data and information on which Member States of the United Nations draw to review common problems and to take stock of policy options; (ii) it facilitates the negotiations of Member States in many intergovernmental bodies on joint courses of action to address ongoing or emerging global challenges; and (iii) it advises interested Governments on the ways and means of translating policy frameworks developed in United Nations conferences and summits into programmes at the country level and, through technical assistance, helps build national capacities. NOTE The designations employed and the presentation of material in the present publication do not imply the expression of any opinion whatsoever on the part of the Secretariat of the United Nations concerning the legal status of any country, territory, city or area or of its authorities, or concerning the delimitation of its frontiers or boundaries. The term “country” as used in the text of this publication also refers, as appropriate, to territories or areas. Symbols of United Nations documents are composed of capital letters combined with figures. ST/ESA/STAT/SER.M/87 UNITED NATIONS PUBLICATION Sales No.
    [Show full text]
  • Ffontiau Cymraeg
    This publication is available in other languages and formats on request. Mae'r cyhoeddiad hwn ar gael mewn ieithoedd a fformatau eraill ar gais. [email protected] www.caerphilly.gov.uk/equalities How to type Accented Characters This guidance document has been produced to provide practical help when typing letters or circulars, or when designing posters or flyers so that getting accents on various letters when typing is made easier. The guide should be used alongside the Council’s Guidance on Equalities in Designing and Printing. Please note this is for PCs only and will not work on Macs. Firstly, on your keyboard make sure the Num Lock is switched on, or the codes shown in this document won’t work (this button is found above the numeric keypad on the right of your keyboard). By pressing the ALT key (to the left of the space bar), holding it down and then entering a certain sequence of numbers on the numeric keypad, it's very easy to get almost any accented character you want. For example, to get the letter “ô”, press and hold the ALT key, type in the code 0 2 4 4, then release the ALT key. The number sequences shown from page 3 onwards work in most fonts in order to get an accent over “a, e, i, o, u”, the vowels in the English alphabet. In other languages, for example in French, the letter "c" can be accented and in Spanish, "n" can be accented too. Many other languages have accents on consonants as well as vowels.
    [Show full text]
  • Combining Diacritical Marks Range: 0300–036F the Unicode Standard
    Combining Diacritical Marks Range: 0300–036F The Unicode Standard, Version 4.0 This file contains an excerpt from the character code tables and list of character names for The Unicode Standard, Version 4.0. Characters in this chart that are new for The Unicode Standard, Version 4.0 are shown in conjunction with any existing characters. For ease of reference, the new characters have been highlighted in the chart grid and in the names list. This file will not be updated with errata, or when additional characters are assigned to the Unicode Standard. See http://www.unicode.org/charts for access to a complete list of the latest character charts. Disclaimer These charts are provided as the on-line reference to the character contents of the Unicode Standard, Version 4.0 but do not provide all the information needed to fully support individual scripts using the Unicode Standard. For a complete understanding of the use of the characters contained in this excerpt file, please consult the appropriate sections of The Unicode Standard, Version 4.0 (ISBN 0-321-18578-1), as well as Unicode Standard Annexes #9, #11, #14, #15, #24 and #29, the other Unicode Technical Reports and the Unicode Character Database, which are available on-line. See http://www.unicode.org/Public/UNIDATA/UCD.html and http://www.unicode.org/unicode/reports A thorough understanding of the information contained in these additional sources is required for a successful implementation. Fonts The shapes of the reference glyphs used in these code charts are not prescriptive. Considerable variation is to be expected in actual fonts.
    [Show full text]
  • Evaluation of the Orientation of 90° and 180° Reinforcing Bar Hooks In
    Research Note RN-2009-2 Evaluation of the Orientation of 90° and 180° Reinforcing Bar Hooks in Wide Members Introduction Longitudinal reinforcing steel bars in flexural members are often developed at discontinuous ends with a 90° or 180° standard hook. These hooks are usually oriented in the vertical direc- tion, which generally corresponds with being perpendicular (or normal) to the member’s major axis. Because standard hook sizes can be quite large, especially for large diameter bars, detailing of hooked bars can be critical. In some instances, such as the case of a shallow member that is Figure 1 ‒ Example of tilted reinforcing bar hooks heavily reinforced, the standard hook height in ac- shown at the edge of a cantilever balcony slab cordance with the ACI 318-11 Code (2011), plus the required concrete cover above and below the search Note bar, may exceed the member depth. In this case, one solution used in practice to satisfy the cover requirements and maintain the hook length is to rotate the bar along its longitudinal axis, such that the hook is no longer vertical. Hook tilting is illus- trated in Figures 1 and 2. Re This Research Note describes results of a study initiated as part of the CRSI Graduate Re- search Fellowship Program to evaluate the influ- ence of hook angle tilt on hook performance and behavior. The research also examined the poten- tial limitations of hook tilt in a concrete member, depicted in Figure 3. Figure 2 ‒ Recommended bar details for solid slabs (CRSI 2008) Background Hook Behavior Forces are transferred between 90° or 180° hooked bars in tension and the surrounding con- crete through bond along the bar surface and bearing of the bar on the enclosed concrete.
    [Show full text]
  • Basis Technology Unicode対応ライブラリ スペックシート 文字コード その他の名称 Adobe-Standard-Encoding A
    Basis Technology Unicode対応ライブラリ スペックシート 文字コード その他の名称 Adobe-Standard-Encoding Adobe-Symbol-Encoding csHPPSMath Adobe-Zapf-Dingbats-Encoding csZapfDingbats Arabic ISO-8859-6, csISOLatinArabic, iso-ir-127, ECMA-114, ASMO-708 ASCII US-ASCII, ANSI_X3.4-1968, iso-ir-6, ANSI_X3.4-1986, ISO646-US, us, IBM367, csASCI big-endian ISO-10646-UCS-2, BigEndian, 68k, PowerPC, Mac, Macintosh Big5 csBig5, cn-big5, x-x-big5 Big5Plus Big5+, csBig5Plus BMP ISO-10646-UCS-2, BMPstring CCSID-1027 csCCSID1027, IBM1027 CCSID-1047 csCCSID1047, IBM1047 CCSID-290 csCCSID290, CCSID290, IBM290 CCSID-300 csCCSID300, CCSID300, IBM300 CCSID-930 csCCSID930, CCSID930, IBM930 CCSID-935 csCCSID935, CCSID935, IBM935 CCSID-937 csCCSID937, CCSID937, IBM937 CCSID-939 csCCSID939, CCSID939, IBM939 CCSID-942 csCCSID942, CCSID942, IBM942 ChineseAutoDetect csChineseAutoDetect: Candidate encodings: GB2312, Big5, GB18030, UTF32:UTF8, UCS2, UTF32 EUC-H, csCNS11643EUC, EUC-TW, TW-EUC, H-EUC, CNS-11643-1992, EUC-H-1992, csCNS11643-1992-EUC, EUC-TW-1992, CNS-11643 TW-EUC-1992, H-EUC-1992 CNS-11643-1986 EUC-H-1986, csCNS11643_1986_EUC, EUC-TW-1986, TW-EUC-1986, H-EUC-1986 CP10000 csCP10000, windows-10000 CP10001 csCP10001, windows-10001 CP10002 csCP10002, windows-10002 CP10003 csCP10003, windows-10003 CP10004 csCP10004, windows-10004 CP10005 csCP10005, windows-10005 CP10006 csCP10006, windows-10006 CP10007 csCP10007, windows-10007 CP10008 csCP10008, windows-10008 CP10010 csCP10010, windows-10010 CP10017 csCP10017, windows-10017 CP10029 csCP10029, windows-10029 CP10079 csCP10079, windows-10079
    [Show full text]
  • Orthographies in Grammar Books
    Preprints (www.preprints.org) | NOT PEER-REVIEWED | Posted: 30 July 2018 doi:10.20944/preprints201807.0565.v1 Tomislav Stojanov, [email protected], [email protected] Institute of Croatian Language and Linguistic Republike Austrije 16, 10.000 Zagreb, Croatia Orthographies in Grammar Books – Antiquity and Humanism Summary This paper researches the as yet unstudied topic of orthographic content in antique, medieval, and Renaissance grammar books in European languages, as part of a wider research of the origin of orthographic standards in European languages. As a central place for teachings about language, grammar books contained orthographic instructions from the very beginning, and such practice continued also in later periods. Understanding the function, content, and orthographic forms in the past provides for a better description of the nature of the orthographic standard in the present. The evolution of grammatographic practice clearly shows the continuity of development of orthographic content from a constituent of grammar studies through the littera unit gradually to an independent unit, then into annexed orthographic sections, and later into separate orthographic manuals. 5 antique, 22 Latin, and 17 vernacular grammars were analyzed, describing 19 European languages. The research methodology is based on distinguishing orthographic content in the narrower sense (grapheme to meaning) from the broader sense (grapheme to phoneme). In this way, the function of orthographic description was established separately from the study of spelling. As for the traditional description of orthographic content in the broader sense in old grammar books, it is shown that orthographic content can also be studied within the grammatographic framework of a specific period, similar to the description of morphology or syntax.
    [Show full text]
  • 1St Rule: FANBOYS and Compound Sentences FANBOYS Is a Mnemonic Device, Which Stands for the Coordinating Conjunctions: For, And, Nor, But, Or, Yet, and So
    1st Rule: FANBOYS and Compound Sentences FANBOYS is a mnemonic device, which stands for the coordinating conjunctions: For, And, Nor, But, Or, Yet, and So. These words, when used to connect two independent clauses (two complete thoughts), must be preceded by a comma. A sentence is a complete thought, consisting of a Subject and a Verb. Conjunctions should not be confused with conjunctive adverbs, such as However, Therefore, and Moreover or with conjunctions, such as Because. 2nd Rule: Introductory Bits When using an introductory word, phrase, or clause to begin a sentence, it is important to place a comma between the introduction and the main sentence. The introduction is not a complete thought on its own; it simply introduces the main clause. This lets the reader know that the “meat” of the sentence is what follows the comma. You should be able to remove the part which comes before the comma and still have a complete thought. Examples: Generally, John is opposed to overt acts of affection. However, Lucy inspires him to be kind. If the hamster is launched fifty feet, there is too much pressure in the cannon. Although the previous sentence has little to do with John and Lucy, it is a good example of how to use commas with introductory clauses. 3rd Rule: Separating Items in a Series Commas belong between each item in a list. However, in a series of three items, the comma between the second and third item and the conjunction (generally and or or) is optional, whereas in a list of four items, the comma is necessary.
    [Show full text]
  • Alphabets, Letters and Diacritics in European Languages (As They Appear in Geography)
    1 Vigleik Leira (Norway): [email protected] Alphabets, Letters and Diacritics in European Languages (as they appear in Geography) To the best of my knowledge English seems to be the only language which makes use of a "clean" Latin alphabet, i.d. there is no use of diacritics or special letters of any kind. All the other languages based on Latin letters employ, to a larger or lesser degree, some diacritics and/or some special letters. The survey below is purely literal. It has nothing to say on the pronunciation of the different letters. Information on the phonetic/phonemic values of the graphic entities must be sought elsewhere, in language specific descriptions. The 26 letters a, b, c, d, e, f, g, h, i, j, k, l, m, n, o, p, q, r, s, t, u, v, w, x, y, z may be considered the standard European alphabet. In this article the word diacritic is used with this meaning: any sign placed above, through or below a standard letter (among the 26 given above); disregarding the cases where the resulting letter (e.g. å in Norwegian) is considered an ordinary letter in the alphabet of the language where it is used. Albanian The alphabet (36 letters): a, b, c, ç, d, dh, e, ë, f, g, gj, h, i, j, k, l, ll, m, n, nj, o, p, q, r, rr, s, sh, t, th, u, v, x, xh, y, z, zh. Missing standard letter: w. Letters with diacritics: ç, ë. Sequences treated as one letter: dh, gj, ll, rr, sh, th, xh, zh.
    [Show full text]
  • International Language Environments Guide
    International Language Environments Guide Sun Microsystems, Inc. 4150 Network Circle Santa Clara, CA 95054 U.S.A. Part No: 806–6642–10 May, 2002 Copyright 2002 Sun Microsystems, Inc. 4150 Network Circle, Santa Clara, CA 95054 U.S.A. All rights reserved. This product or document is protected by copyright and distributed under licenses restricting its use, copying, distribution, and decompilation. No part of this product or document may be reproduced in any form by any means without prior written authorization of Sun and its licensors, if any. Third-party software, including font technology, is copyrighted and licensed from Sun suppliers. Parts of the product may be derived from Berkeley BSD systems, licensed from the University of California. UNIX is a registered trademark in the U.S. and other countries, exclusively licensed through X/Open Company, Ltd. Sun, Sun Microsystems, the Sun logo, docs.sun.com, AnswerBook, AnswerBook2, Java, XView, ToolTalk, Solstice AdminTools, SunVideo and Solaris are trademarks, registered trademarks, or service marks of Sun Microsystems, Inc. in the U.S. and other countries. All SPARC trademarks are used under license and are trademarks or registered trademarks of SPARC International, Inc. in the U.S. and other countries. Products bearing SPARC trademarks are based upon an architecture developed by Sun Microsystems, Inc. SunOS, Solaris, X11, SPARC, UNIX, PostScript, OpenWindows, AnswerBook, SunExpress, SPARCprinter, JumpStart, Xlib The OPEN LOOK and Sun™ Graphical User Interface was developed by Sun Microsystems, Inc. for its users and licensees. Sun acknowledges the pioneering efforts of Xerox in researching and developing the concept of visual or graphical user interfaces for the computer industry.
    [Show full text]
  • Supplemental Punctuation Range: 2E00–2E7F
    Supplemental Punctuation Range: 2E00–2E7F This file contains an excerpt from the character code tables and list of character names for The Unicode Standard, Version 14.0 This file may be changed at any time without notice to reflect errata or other updates to the Unicode Standard. See https://www.unicode.org/errata/ for an up-to-date list of errata. See https://www.unicode.org/charts/ for access to a complete list of the latest character code charts. See https://www.unicode.org/charts/PDF/Unicode-14.0/ for charts showing only the characters added in Unicode 14.0. See https://www.unicode.org/Public/14.0.0/charts/ for a complete archived file of character code charts for Unicode 14.0. Disclaimer These charts are provided as the online reference to the character contents of the Unicode Standard, Version 14.0 but do not provide all the information needed to fully support individual scripts using the Unicode Standard. For a complete understanding of the use of the characters contained in this file, please consult the appropriate sections of The Unicode Standard, Version 14.0, online at https://www.unicode.org/versions/Unicode14.0.0/, as well as Unicode Standard Annexes #9, #11, #14, #15, #24, #29, #31, #34, #38, #41, #42, #44, #45, and #50, the other Unicode Technical Reports and Standards, and the Unicode Character Database, which are available online. See https://www.unicode.org/ucd/ and https://www.unicode.org/reports/ A thorough understanding of the information contained in these additional sources is required for a successful implementation.
    [Show full text]
  • Diacritics-ELL.Pdf
    Diacritics J.C. Wells, University College London Dkadvkxkdw avf ekwxkrhykwjkrh qavow axxadjfe xs pfxxfvw sg xjf aptjacfx, gsv f|aqtpf xjf adyxf addfrx sr xjf ‘ kr dag‘. M swx parhyahf svxjshvatjkfw cawfe sr xjf Laxkr aptjacfx qaof wsqf ywf sg ekadvkxkdw, aw kreffe es xjswf cawfe sr sxjfv aptjacfxw are {vkxkrh w}wxfqw. Tjf gsdyw sg xjkw avxkdpf kw sr xjf vspf sg ekadvkxkdw kr xjf svxjshvatj} sg parhyahfw {vkxxfr {kxj xjf Laxkr aptjacfx. Ireffe, xjf svkhkr sg wsqf pfxxfvw xjax avf rs{ a wxareave tavx sg xjf aptjacfx pkfw kr xjf ywf sg ekadvkxkdw. Tjf pfxxfv G {aw krzfrxfe kr Rsqar xkqfw aw a zavkarx sg C, ekwxkrhykwjfe c} xjf dvswwcav sr xjf ytwxvsof. Tjf pfxxfv J {aw rsx ekwxkrhykwjfe gvsq I, rsv U gvsq V, yrxkp xjf 16xj dfrxyv} (Saqtwsr 1985: 110). Tjf rf{ pfxxfv 1 kw sczksywp} a zavkarx sr r are ws dsype cf wffr aw krdsvtsvaxkrh a ekadvkxkd xakp. Dkadvkxkdw tvstfv, xjsyhj, avf wffr aw qavow axxadjfe xs a cawf pfxxfv. Ir xjkw wfrwf, m y 1 es rsx krzspzf ekadvkxkdw. Tjf f|xfrwkzf ywf sg ekadvkxkdw xs wyttpfqfrx xjf Laxkr aptjacfx kr dawfw {jfvf kx {aw wffr aw kraefuyaxf gsv xjf wsyrew sg sxjfv parhyahfw kw hfrfvapp} axxvkcyxfe xs xjf vfpkhksyw vfgsvqfv Jar Hyw (1369-1415), {js efzkwfe a vfgsvqfe svxjshvatj} gsv C~fdj krdsvtsvaxkrh 9addfrxfe: pfxxfvw wydj aw ˛ ¹ = > ?. M swx ekadvkxkdw avf tpadfe acszf xjf cawf pfxxfv {kxj {jkdj xjf} avf awwsdkaxfe. A gf{, js{fzfv, avf tpadfe cfps{ kx (aw “) sv xjvsyhj kx (aw B). 1 Laxkr pfxxfvw dsqf kr ps{fv-dawf are yttfv-dawf zfvwksrw.
    [Show full text]
  • List of Approved Special Characters
    List of Approved Special Characters The following list represents the Graduate Division's approved character list for display of dissertation titles in the Hooding Booklet. Please note these characters will not display when your dissertation is published on ProQuest's site. To insert a special character, simply hold the ALT key on your keyboard and enter in the corresponding code. This is only for entering in a special character for your title or your name. The abstract section has different requirements. See abstract for more details. Special Character Alt+ Description 0032 Space ! 0033 Exclamation mark '" 0034 Double quotes (or speech marks) # 0035 Number $ 0036 Dollar % 0037 Procenttecken & 0038 Ampersand '' 0039 Single quote ( 0040 Open parenthesis (or open bracket) ) 0041 Close parenthesis (or close bracket) * 0042 Asterisk + 0043 Plus , 0044 Comma ‐ 0045 Hyphen . 0046 Period, dot or full stop / 0047 Slash or divide 0 0048 Zero 1 0049 One 2 0050 Two 3 0051 Three 4 0052 Four 5 0053 Five 6 0054 Six 7 0055 Seven 8 0056 Eight 9 0057 Nine : 0058 Colon ; 0059 Semicolon < 0060 Less than (or open angled bracket) = 0061 Equals > 0062 Greater than (or close angled bracket) ? 0063 Question mark @ 0064 At symbol A 0065 Uppercase A B 0066 Uppercase B C 0067 Uppercase C D 0068 Uppercase D E 0069 Uppercase E List of Approved Special Characters F 0070 Uppercase F G 0071 Uppercase G H 0072 Uppercase H I 0073 Uppercase I J 0074 Uppercase J K 0075 Uppercase K L 0076 Uppercase L M 0077 Uppercase M N 0078 Uppercase N O 0079 Uppercase O P 0080 Uppercase
    [Show full text]