The Unicode Standard, Last Updated for the Unicode Standard, Version 4.1
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Supplementary Guide to UEB Reference Materials V.8.31.16
Supplementary Guide to UEB Reference Materials v.8.31.16 Unless otherwise indicated, page numbers refer to The Rules of Unified English Braille, 2013 For referenced BANA Guidances visit: www.brailleauthority.org * indicates definition of entry word A @ sign, 25 Caret, 24, 42 Abbreviations, 106, 152 Cent Sign ¢, 26 Accented letters, 42, 190 Chemistry, 89, 178, see BANA Guidance capitals, 80 Code switching, 199-210 in fully capped words, 89 how to use, 202-203 Acronyms, 106, 152 indicators Addition foreign language, 191-192, 195 non-technical materials, 31 IPA, 199, 207-208 technical materials, 169 music, 199, 208-209 Alphabetic wordsign, *7, 9, 15, 103-106, Nemeth code, 199, 209-210 164 non-UEB, 199, 203-208 Ampersand &, 21 Coinage, 26, 64 Anglicized words, 45, 158, 186, 189 Colored type, 11, 97 Apostrophe, 18, 69, 105, 107 Comma, 69 Arrows, 21, 174 numeric mode, 59 line mode, 219 Comparison, signs of, 169,31 Asterisk, 21 Compound words, bridging, 146 At sign @, 25 Computer material contractions in, 155 B email addresses, 155 Blank to be filled in, 73, 160 grade 1 indicators, 52 Boldface indicators, 91 Computer notation, 178 Brackets, opening and closing, 69, 78 Contracted (grade 2) braille, *7, 14 Braille grouping indicators, 23, 45, 172 usage cross-referenced, 14 Braille order, list of symbols, 275 Contractions summary, 9 Bullet, 24, 34, 37 Contractions, *7, 9, 103-168 abbreviations, 152 C acronyms, 152 Capitalization, 79-90 alphabetic wordsigns, *7, 9, 15, 103-106, grade 1, 55 164 indicators bridging, 146-152 choice of, 87 aspirated -
Assessment of Options for Handling Full Unicode Character Encodings in MARC21 a Study for the Library of Congress
1 Assessment of Options for Handling Full Unicode Character Encodings in MARC21 A Study for the Library of Congress Part 1: New Scripts Jack Cain Senior Consultant Trylus Computing, Toronto 1 Purpose This assessment intends to study the issues and make recommendations on the possible expansion of the character set repertoire for bibliographic records in MARC21 format. 1.1 “Encoding Scheme” vs. “Repertoire” An encoding scheme contains codes by which characters are represented in computer memory. These codes are organized according to a certain methodology called an encoding scheme. The list of all characters so encoded is referred to as the “repertoire” of characters in the given encoding schemes. For example, ASCII is one encoding scheme, perhaps the one best known to the average non-technical person in North America. “A”, “B”, & “C” are three characters in the repertoire of this encoding scheme. These three characters are assigned encodings 41, 42 & 43 in ASCII (expressed here in hexadecimal). 1.2 MARC8 "MARC8" is the term commonly used to refer both to the encoding scheme and its repertoire as used in MARC records up to 1998. The ‘8’ refers to the fact that, unlike Unicode which is a multi-byte per character code set, the MARC8 encoding scheme is principally made up of multiple one byte tables in which each character is encoded using a single 8 bit byte. (It also includes the EACC set which actually uses fixed length 3 bytes per character.) (For details on MARC8 and its specifications see: http://www.loc.gov/marc/.) MARC8 was introduced around 1968 and was initially limited to essentially Latin script only. -
The Sectionbreak Package∗
The sectionbreak package∗ Michal Hoftich 2019-03-11 Contents 1 The sectionbreak package 1 1.1 Package options . 1 1.2 Commands . 1 1.3 The tex4ht support . 2 2 Example 2 3 License 2 4 Changelog 3 1 The sectionbreak package This package provides LaTeX support for section breaks, used mainly in fiction books to signal changes in a story, like changes in time, location, etc. (Source: Wikipedia1). 1.1 Package options There are several package options: disable disable section break printing. mark characters printed in the center of the section break. It is space by default, common value might be *** for example. asterism request asterism2 as a section mark. preskip Amount of space printed before the section mark. It should be a dimension unit supported by TeX. postskip Similar to preskip, but it is a space printed after the section mark. skip Set both preskip and postskip to the same value. style LaTeX commands for font style change, like \bfseries, for example. ∗Version v0.1d 1<https://en.wikipedia.org/wiki/Section_(typography)> 2<https://en.wikipedia.org/wiki/Asterism_(typography)> 1 1.2 Commands \sectionbreak[<mark>] print the section break. Optional argument can contain content which should be printed in the break center. \sectionbreakmark{<mark>} set the content printed in the following \sectionbreak commands. \asterism print the asterism (***) symbol. 1.3 The tex4ht support The sectionbreak package has support for tex4ht, so it is possible to use this package in conversion to HTML or other formats supported by this conversion system. It provides the following configurations: \Configure{sectionbreak}{before section break }{after section} this configuration can be used for insertion of a box around the section break mark, which can be styled using CSS to print the blank space around the content. -
Supplemental Punctuation Range: 2E00–2E7F
Supplemental Punctuation Range: 2E00–2E7F This file contains an excerpt from the character code tables and list of character names for The Unicode Standard, Version 14.0 This file may be changed at any time without notice to reflect errata or other updates to the Unicode Standard. See https://www.unicode.org/errata/ for an up-to-date list of errata. See https://www.unicode.org/charts/ for access to a complete list of the latest character code charts. See https://www.unicode.org/charts/PDF/Unicode-14.0/ for charts showing only the characters added in Unicode 14.0. See https://www.unicode.org/Public/14.0.0/charts/ for a complete archived file of character code charts for Unicode 14.0. Disclaimer These charts are provided as the online reference to the character contents of the Unicode Standard, Version 14.0 but do not provide all the information needed to fully support individual scripts using the Unicode Standard. For a complete understanding of the use of the characters contained in this file, please consult the appropriate sections of The Unicode Standard, Version 14.0, online at https://www.unicode.org/versions/Unicode14.0.0/, as well as Unicode Standard Annexes #9, #11, #14, #15, #24, #29, #31, #34, #38, #41, #42, #44, #45, and #50, the other Unicode Technical Reports and Standards, and the Unicode Character Database, which are available online. See https://www.unicode.org/ucd/ and https://www.unicode.org/reports/ A thorough understanding of the information contained in these additional sources is required for a successful implementation. -
Special Characters in Aletheia
Special Characters in Aletheia Last Change: 28 May 2014 The following table comprises all special characters which are currently available through the virtual keyboard integrated in Aletheia. The virtual keyboard aids re-keying of historical documents containing characters that are mostly unsupported in other text editing tools (see Figure 1). Figure 1: Text input dialogue with virtual keyboard in Aletheia 1.2 Due to technical reasons, like font definition, Aletheia uses only precomposed characters. If required for other applications the mapping between a precomposed character and the corresponding decomposed character sequence is given in the table as well. When writing to files Aletheia encodes all characters in UTF-8 (variable-length multi-byte representation). Key: Glyph – the icon displayed in the virtual keyboard. Unicode – the actual Unicode character; can be copied and pasted into other applications. Please note that special characters might not be displayed properly if there is no corresponding font installed for the target application. Hex – the hexadecimal code point for the Unicode character Decimal – the decimal code point for the Unicode character Description – a short description of the special character Origin – where this character has been defined Base – the base character of the special character (if applicable), used for sorting Combining Character – combining character(s) to modify the base character (if applicable) Pre-composed Character Decomposed Character (used in Aletheia) (only for reference) Combining Glyph -
The Unicode Standard 5.2 Code Charts
C0 Controls and Basic Latin Range: 0000–007F This file contains an excerpt from the character code tables and list of character names for The Unicode Standard, Version 5.2. This file may be changed at any time without notice to reflect errata or other updates to the Unicode Standard. See http://www.unicode.org/errata/ for an up-to-date list of errata. See http://www.unicode.org/charts/ for access to a complete list of the latest character code charts. See http://www.unicode.org/charts/PDF/Unicode-5.2/ for charts showing only the characters added in Unicode 5.2. See http://www.unicode.org/Public/5.2.0/charts/ for a complete archived file of character code charts for Unicode 5.2. Disclaimer These charts are provided as the online reference to the character contents of the Unicode Standard, Version 5.2 but do not provide all the information needed to fully support individual scripts using the Unicode Standard. For a complete understanding of the use of the characters contained in this file, please consult the appropriate sections of The Unicode Standard, Version 5.2, online at http://www.unicode.org/versions/Unicode5.2.0/, as well as Unicode Standard Annexes #9, #11, #14, #15, #24, #29, #31, #34, #38, #41, #42, and #44, the other Unicode Technical Reports and Standards, and the Unicode Character Database, which are available online. See http://www.unicode.org/ucd/ and http://www.unicode.org/reports/ A thorough understanding of the information contained in these additional sources is required for a successful implementation. -
CJK Symbols and Punctuation Range: 3000–303F
CJK Symbols and Punctuation Range: 3000–303F This file contains an excerpt from the character code tables and list of character names for The Unicode Standard, Version 14.0 This file may be changed at any time without notice to reflect errata or other updates to the Unicode Standard. See https://www.unicode.org/errata/ for an up-to-date list of errata. See https://www.unicode.org/charts/ for access to a complete list of the latest character code charts. See https://www.unicode.org/charts/PDF/Unicode-14.0/ for charts showing only the characters added in Unicode 14.0. See https://www.unicode.org/Public/14.0.0/charts/ for a complete archived file of character code charts for Unicode 14.0. Disclaimer These charts are provided as the online reference to the character contents of the Unicode Standard, Version 14.0 but do not provide all the information needed to fully support individual scripts using the Unicode Standard. For a complete understanding of the use of the characters contained in this file, please consult the appropriate sections of The Unicode Standard, Version 14.0, online at https://www.unicode.org/versions/Unicode14.0.0/, as well as Unicode Standard Annexes #9, #11, #14, #15, #24, #29, #31, #34, #38, #41, #42, #44, #45, and #50, the other Unicode Technical Reports and Standards, and the Unicode Character Database, which are available online. See https://www.unicode.org/ucd/ and https://www.unicode.org/reports/ A thorough understanding of the information contained in these additional sources is required for a successful implementation. -
The Ultimate Guide to Style, Grammar, Punctuation, Usage
THE AMA HANDBOOK OF BUSINESS WRITING This page intentionally left blank The AMA Handbook of Business Writing The Ultimate Guide to Style, Grammar, Usage, Punctuation, Construction, and Formatting KEVIN WILSON and JENNIFER WAUSON AMERICAN MANAGEMENT ASSOCIATION New York • Atlanta • Brussels • Chicago • Mexico City • San Francisco Shanghai • Tokyo • Toronto • Washington, D. C. Bulk discounts available. For details visit: www.amacombooks.org/go/specialsales Or contact special sales: Phone: 800-250-5308 Email: [email protected] View all the AMACOM titles at: www.amacombooks.org This publication is designed to provide accurate and authoritative information in regard to the subject matter covered. It is sold with the understanding that the publisher is not engaged in rendering legal, accounting, or other professional service. If legal advice or other expert assistance is required, the services of a competent professional person should be sought. Library of Congress Cataloging-in-Publication Data AMA handbook of business writing : the ultimate guide to style, grammar, usage, punctuation, construction, and formatting / Kevin Wilson and Jennifer Wauson. p. cm. Includes bibliographical references and index. ISBN-13: 978-0-8144-1589-4 Isbn-10: 0-8144-1589-x 1. Commercial correspondence--Handbooks, manuals, etc. 2. Business writing— Handbooks, manuals, etc. 3. English language—Business English—Handbooks, manuals, etc. I. Wilson, K. (Kevin), 1958– II. Wauson, Jennifer. III. American Management Association. HF5726.A485 1996 808'.06665—dc22 2009050050 © 2010 Kevin Wilson and Jennifer Wauson. All rights reserved. Printed in the United States of America. This publication may not be reproduced, stored in a retrieval system, or transmitted in whole or in part, in any form or by any means, electronic, mechanical, photocopying, recording, or otherwise, without the prior written permission of AMACOM, a division of American Management Association, 1601 Broadway, New York, NY 10019. -
Unified English Braille (UEB) General Symbols and Indicators
Unified English Braille (UEB) General Symbols and Indicators UEB Rulebook Section 3 Published by International Council on English Braille (ICEB) space (see 3.23) ⠣ opening braille grouping indicator (see 3.4) ⠹ first transcriber‐defined print symbol (see 3.26) ⠫ shape indicator (see 3.22) ⠳ arrow indicator (see 3.2) ⠳⠕ → simple right pointing arrow (east) (see 3.2) ⠳⠩ ↓ simple down pointing arrow (south) (see 3.2) ⠳⠪ ← simple left pointing arrow (west) (see 3.2) ⠳⠬ ↑ simple up pointing arrow (north) (see 3.2) ⠒ ∶ ratio (see 3.17) ⠒⠒ ∷ proportion (see 3.17) ⠢ subscript indicator (see 3.24) ⠶ ′ prime (see 3.11 and 3.15) ⠶⠶ ″ double prime (see 3.11 and 3.15) ⠔ superscript indicator (see 3.24) ⠼⠡ ♮ natural (see 3.18) ⠼⠣ ♭ flat (see 3.18) ⠼⠩ ♯ sharp (see 3.18) ⠼⠹ second transcriber‐defined print symbol (see 3.26) ⠜ closing braille grouping indicator (see 3.4) ⠈⠁ @ commercial at sign (see 3.7) ⠈⠉ ¢ cent sign (see 3.10) ⠈⠑ € euro sign (see 3.10) ⠈⠋ ₣ French franc sign (see 3.10) ⠈⠇ £ pound sign (pound sterling) (see 3.10) ⠈⠝ ₦ naira sign (see 3.10) ⠈⠎ $ dollar sign (see 3.10) ⠈⠽ ¥ yen sign (Yuan sign) (see 3.10) ⠈⠯ & ampersand (see 3.1) ⠈⠣ < less‐than sign (see 3.17) ⠈⠢ ^ caret (3.6) ⠈⠔ ~ tilde (swung dash) (see 3.25) ⠈⠼⠹ third transcriber‐defined print symbol (see 3.26) ⠈⠜ > greater‐than sign (see 3.17) ⠈⠨⠣ opening transcriber’s note indicator (see 3.27) ⠈⠨⠜ closing transcriber’s note indicator (see 3.27) ⠈⠠⠹ † dagger (see 3.3) ⠈⠠⠻ ‡ double dagger (see 3.3) ⠘⠉ © copyright sign (see 3.8) ⠘⠚ ° degree sign (see 3.11) ⠘⠏ ¶ paragraph sign (see 3.20) -
JRAI STYLE SHEET 2021 General Punctuation, Spelling Punctuation
JRAI STYLE SHEET 2021 General Punctuation, Spelling Punctuation Dash – spaced en rule Hyphen – hyphen not en rule for linked personal names, kin relations etc. (‘parent-child relation’) Quotes – single outer, double inner; final punctuation outside closing quote Ellipsis – three-point unspaced (no brackets) regardless of punctuation in original material Series comma, i.e. comma before final and: ‘red, white, and blue’. Apostrophe after final s: Thomas’s, etc. (except classical names and where this sounds too awkward) Italics For foreign words or phrases, not institutions or political organizations (e.g. parties) Credit in quotes whether ‘original emphasis’ or ‘emphasis added’ Spelling UK –ize spelling (except if original quotations differ) Hyphenation northeast, southwest, etc. co-operate, co-ordinate, re-emerge NB: No hyphen after adverb, e.g. ‘religiously labelled forms’ Dates 12 February 2017 1950s (not fifties) etc. 1953-6 (not 1953-1956 or 1953-56) The twenty-first century, but twenty-first-century examples Numbers Written out under 100 except for statistical material, ages (even when under ten) Allow words for large general numbers ‘around two thousand’ Commas in numerals of four digits or more: 5,000, 50,000 50 million Abbreviations cf. (used in sense of ‘compare’ only), pers. comm. In parenthesis: ‘(chap. 1)’ ‘(Fig. 1.)’ p. 14 n. 6 USA Displayed material Prose quotes Quotes of 50+ words are displayed as extracts. Closing full point comes after author-date citation at the end of the extract. In a translated quote, any original terms should be included in parentheses Any added commentary in a quote should be in square brackets Epigraphs Source on new line after quotation. -
Character Properties 4
The Unicode® Standard Version 14.0 – Core Specification To learn about the latest version of the Unicode Standard, see https://www.unicode.org/versions/latest/. Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks. Where those designations appear in this book, and the publisher was aware of a trade- mark claim, the designations have been printed with initial capital letters or in all capitals. Unicode and the Unicode Logo are registered trademarks of Unicode, Inc., in the United States and other countries. The authors and publisher have taken care in the preparation of this specification, but make no expressed or implied warranty of any kind and assume no responsibility for errors or omissions. No liability is assumed for incidental or consequential damages in connection with or arising out of the use of the information or programs contained herein. The Unicode Character Database and other files are provided as-is by Unicode, Inc. No claims are made as to fitness for any particular purpose. No warranties of any kind are expressed or implied. The recipient agrees to determine applicability of information provided. © 2021 Unicode, Inc. All rights reserved. This publication is protected by copyright, and permission must be obtained from the publisher prior to any prohibited reproduction. For information regarding permissions, inquire at https://www.unicode.org/reporting.html. For information about the Unicode terms of use, please see https://www.unicode.org/copyright.html. The Unicode Standard / the Unicode Consortium; edited by the Unicode Consortium. — Version 14.0. Includes index. ISBN 978-1-936213-29-0 (https://www.unicode.org/versions/Unicode14.0.0/) 1. -
The Brill Typeface User Guide & Complete List of Characters
The Brill Typeface User Guide & Complete List of Characters Version 2.06, October 31, 2014 Pim Rietbroek Preamble Few typefaces – if any – allow the user to access every Latin character, every IPA character, every diacritic, and to have these combine in a typographically satisfactory manner, in a range of styles (roman, italic, and more); even fewer add full support for Greek, both modern and ancient, with specialised characters that papyrologists and epigraphers need; not to mention coverage of the Slavic languages in the Cyrillic range. The Brill typeface aims to do just that, and to be a tool for all scholars in the humanities; for Brill’s authors and editors; for Brill’s staff and service providers; and finally, for anyone in need of this tool, as long as it is not used for any commercial gain.* There are several fonts in different styles, each of which has the same set of characters as all the others. The Unicode Standard is rigorously adhered to: there is no dependence on the Private Use Area (PUA), as it happens frequently in other fonts with regard to characters carrying rare diacritics or combinations of diacritics. Instead, all alphabetic characters can carry any diacritic or combination of diacritics, even stacked, with automatic correct positioning. This is made possible by the inclusion of all of Unicode’s combining characters and by the application of extensive OpenType Glyph Positioning programming. Credits The Brill fonts are an original design by John Hudson of Tiro Typeworks. Alice Savoie contributed to Brill bold and bold italic. The black-letter (‘Fraktur’) range of characters was made by Karsten Lücke.