Inside ASCII

Total Page:16

File Type:pdf, Size:1020Kb

Inside ASCII By Ft. W. Bemer The data alphabet called ASCII [Figure 1. page '38. and STICKS «ll-T ; Reference 111, also has two other names—International ASCII. as a then code. is usually represented in 3 col- Standard 1545 {the 150 Code [Reference 2]} and Alphabet um ns of 16 positions. The row positions are (flit) through No. 5 ol CCI'I'T {the International Consultative Committee 1111. the tow-order 4 hits. C through 15 in decimal. The for Telephone and Telegraph}. It is used throughout the columns are CUE! through 111, the next higher 3 hits. Ci world. incorporated in billions ofdollars of equipment. through ? in decimal. For some reason, the developers But is it used correctly and wisely“? Not alwavs. There of ASCII found it convenient to refer to these eight col- are misinterpretations. and gaps in definition that per- urnns as ”sticks.” So shall we. Each position will he rep- mit nonstandard usage. This article (in three parts] will resented in this article lav its usual decimal represents- give you the background, peculiarities. preferred prac' tion. For example. capital A is position 4“. Figure 2 is a tices. and. new developments. for ASCII. You will find a representation of ASCII that is more convenient to those lot of information not too generally known or realized; it working in octal. rather than hexadecimal, notation. should help in the correct and safe usage of ASCII. For additional help. vou can reference the various national and international standards given in Table 1. Some other detailed articles are listed in References 3, A and S. HlflH DFIDER LE'I'II' CIHDE H BET-At. “h CHSTAL DIGETS EMGIT 1H “'5‘ I'll Fl” {1| I1. I1 “HIT it'll 9.1” HuL'oLE st- ' '1] than-Tma In“ I. IJJ-I‘IFF 'I :Eil1i “*3: "TI- LI tum TIN-4‘1! sot-loot t ‘ Eta-ruff? 5-1"! III-.W fifflittfi tn! W“ I? [LII-HF! In! IVE! {b51957 [Aorist-H use ETI‘. till-Follvt“ 1" "I! ILI-tu-I'HI- .15 Ila-LBJ Ill Int-linking. IT!" I.'§_F§ EDT ‘ lfl'i'fiwl [lei'il III I'll {Pl’lfif‘ll‘ ill-mini IL“- Em c-en nu on 1|. 11- HHA mitt-In to}: ACE 5TH ink [itli'lJEp'l HIM 1‘], “Jul-”hf. 5!. I'MLH “'11 F]..{Illt|.w “:11 ETE accentuate Ipfit 11 rg: 1m lrxmrn 3175 I'M “up-I- haul-Mil. III-I: can: an 4.! Ik3~l'?ri'l:l-l [lira-nor 5!: lb”! ‘ HICH CHEER LCIH' DHUEH {twitter “1'. lo!- M F I : .ill'll "1HEIFI are: ”re: III-twat NIH- I'J l-l- . II- “3"? this“ 1 15”.? H 'II'|-.i".i Ell-PH”? in! far if” 1-! I '“Wifl'l‘f'! twill“ “'5! J LEEEWII it. [SE - Internattonal Stendirds Greenfzatfofi L ECFA - European computer Manufacturers Assoctation ANSI - Airricln Rational Standards Institute H FIFE - Federal Interaction Processlog Standard EBA - Elnadtlfl Standards Associatice H 55 - Brittsh Standard D A5 - Australian Standard CEITT — Eonsultlttve Eennfttee loternattenet, Telcencne E Teardfedh l H5 - Jaaanese IfiGJstrfaL Standard Figure 2. ‘ BUST - USSR Standard Table 1. 95 INTERFACE AGE MA? WES The first positions of sticks it and E are respectively Also in sticks 4-? are three diacritical marl-ts. They are the “commercial at" and “accent grave.” Then the upper accent grave t“! in Etfl. circumflex It} in SIM. and tilde tau} and lower case Ftoman alphabets follow. This ottset of ene in ms. These are called secondary national usage posi- position is historical them the United Kingdom}. and ot tions. In some countries the tilde is a straight overline. no importance as long as you remember that it is so. But it is the circumflex where we have a lot of confu- Foitowihg the alphabet in both sticks 5 and T are three sion. Teletype first made it an “up arrow“ in an earlier positions each that one must be very cautious about. In version of ASCII. to serve as an exponentiation symbol. ASCII they are assigned as [. I". and ] in stick 5 — i. l". primarily for BASIC. But that doesn’t do very well. be- and I in stick T. Eiut in the ISO Code and CCIT’T versions cause the exponentiation for FflFtTHAN is a double they are reserved for national usage. Table It gives the asterisk! The FDRTHAH version is preferable in France. national use assignment tor these positions. Surely you certainly. because they use such words as crane. cote. remember that the Scandinavian alphabet has 29 letters. cout. and so on. not 26'? My friend Grier Heen in field is very protective of A companion problem exists in position bits. with the these positlons. He says ”If you Americans want to sell underscore. The underscore is neither national nor dia- computers and software abrcad. don't use the ASCII critical; all countries use it just as underscore {and for characters for these positions in your software.” typesetting it is a US. convention to indicate italics. but To be more precise. position-s E11. E12. 5i13. Till. in Italy it means beldface. except when it is the last THE. and 1113 {noted above] are called primary national character in a line!"l. But Teletype's early version of usage positions. Se is 4.13. where ASCII has the “com- ASCII used it as a ”left arrow” — probably for an assign- mercial at.” Honeywell. for example. uses the “at” in its ment symbol equivalent to := in ALGDL. The up and left timesharlng systems for deleting the previous character arrow have been carried over from Teletype into many upon entry. But this isn't too serious. because many na- video terminals. Ask your terminal manufacturer to cease tions also have the “at” in their primary sets. and desist and retrofit. It's not ASCII and will only cause trouble forever. currency tat T nah-mil 7 du- die [fl 3" rial-twat till The last character in sticks 4-? is the Delete. symbol at: an: axe 5H] site sets emf etc MT Hz F'r'lJ DEL. in position THT. It was put here because the binary Metrerlmtli-J - I ' . | "1 f I code is 1111111. which would be all punched holes in Australia I ‘ t . Hull mm all ‘ perforated {not always paper!) tape. and that is the only t'I" 4r'mafiy— A t. US way to make sure that it cannot be misread as some Japan *— other character- ASCII is a complete set; all positions Uh it C lllly -; I. I are assigned to have meaning. fimttlr lend ml- Fun“ ~ ill. :- HE‘SP i...__. 2311ch 2-3 . Heme r term — E These are usually called the sticks for digits and spe- Helium —H _._._.__ I when — B 1 It. cials. Remember that they are the “digits" El 1:! 9: not Emu-Ithaca EH numbers. not numerals. not anything but digits! They llfly— E l t IIII‘II'I Eva-fireman —»t: are in am through SIB so that the Iowverder it bits are the Hungary l fr representations for packed decimal. Griginally we con- it: Germany — B i“ t- It: Fm Svr-tnrllnd =-D r-gau—A- sidered the possibility of a special it+bit set for numeri- hug-.- Semen I ' I £‘—L-.-- entry in Table Ia}. but it Ftfiufid H cal applications [see the fifth a. Denmark h out that computer hardware became inexpensive "i‘ri‘niL‘ifilfii-II turned RIPE-ii Helm-y :ga-z-a-J-v- Slum mifieafififlfl‘nfififiz enough to not deprive ourselves of the extra capabitities of the Tabit and 3-bit sets. Table 2. H111“ TQM i‘NTEfiFACE AGE 9? hbfhlhi. hhnhabu mm 0001 [—J 1:] l3 T123 1131:} El '1 C] {—1 [Jr E“ 959 E3 [:1 SD CICI x = SC! IZJIZI D L] II E] D E! I] 3 LS] Hit: 1 TH“ H? pm mm in um sun In: mm unl- - 2 in: :unmr. 'r' prim-*1: nil-anal * fl‘ilfl.flld3iflml'¥ .‘ which if! ducrihnl my, I when fluid-d hr HEP 1hr Humilarvkm Win an [rum in the m hum Figural 1. 93 FM TEHFAGE AGE HAY 19H Position as is officially called "space.“ I don't and HANDPHIHTIHG FDR STICKS 2-? didn't like it. and would have prelerred ”blank.” Which is The classical contusion for many years was “between why the IBM community often uses a lower case "bee” the digit zero and the letter “oh.” but there are other with a slash through the vertical as its symbol. From the possibilities for contusion. American Standard 323.45 Univac side. the space has the official symbol “delta." specifies the handwritten character shapes shown in Having mentioned packed decimal. where two digits Figure 3. go into each 8-bit group (”byte” to the American. “octet" to the French}. a word of caution on the plus and minus signs — they are in stick 2. rather than stick 3 with the digits. But the low order 4 bits are distinct. and + should be used only as tbtt. — only as 1101. I mention this because the nonstandard code EBCD’IC permits multiple representations of + and — in packed decimal. And the ASCII representations are not even coincident NB CDEEEHII with any of these. with obvious dangers! Watch out for the “currency" positions, 33 and a4. They also have national variations. ln ASCII they are we tomarily it and S. but there are some things to be remem- bored: III is not “number sign” for many countries. most of which use ”lilo." or ”Mr." for that purpose. And when it is “number." it must precede the digits.
Recommended publications
  • End-Of-Line Hyphenation of Chemical Names (IUPAC Provisional
    Pure Appl. Chem. 2020; aop IUPAC Recommendations Albert J. Dijkstra*, Karl-Heinz Hellwich, Richard M. Hartshorn, Jan Reedijk and Erik Szabó End-of-line hyphenation of chemical names (IUPAC Provisional Recommendations) https://doi.org/10.1515/pac-2019-1005 Received October 16, 2019; accepted January 21, 2020 Abstract: Chemical names and in particular systematic chemical names can be so long that, when a manu- script is printed, they have to be hyphenated/divided at the end of a line. Many systematic names already contain hyphens, but sometimes not in a suitable division position. In some cases, using these hyphens as end-of-line divisions can lead to illogical divisions in print, as can also happen when hyphens are added arbi- trarily without considering the ‘chemical’ context. The present document provides recommendations and guidelines for authors of chemical manuscripts, their publishers and editors, on where to divide chemical names at the end of a line and instructions on how to avoid these names being divided at illogical places as often suggested by desk dictionaries. Instead, readability and chemical sense should prevail when authors insert optional hyphens. Accordingly, the software used to convert electronic manuscripts to print can now be programmed to avoid illogical end-of-line hyphenation and thereby save the author much time and annoy- ance when proofreading. The recommendations also allow readers of the printed article to determine which end-of-line hyphens are an integral part of the name and should not be deleted when ‘undividing’ the name. These recommendations may also prove useful in languages other than English.
    [Show full text]
  • ISO Basic Latin Alphabet
    ISO basic Latin alphabet The ISO basic Latin alphabet is a Latin-script alphabet and consists of two sets of 26 letters, codified in[1] various national and international standards and used widely in international communication. The two sets contain the following 26 letters each:[1][2] ISO basic Latin alphabet Uppercase Latin A B C D E F G H I J K L M N O P Q R S T U V W X Y Z alphabet Lowercase Latin a b c d e f g h i j k l m n o p q r s t u v w x y z alphabet Contents History Terminology Name for Unicode block that contains all letters Names for the two subsets Names for the letters Timeline for encoding standards Timeline for widely used computer codes supporting the alphabet Representation Usage Alphabets containing the same set of letters Column numbering See also References History By the 1960s it became apparent to thecomputer and telecommunications industries in the First World that a non-proprietary method of encoding characters was needed. The International Organization for Standardization (ISO) encapsulated the Latin script in their (ISO/IEC 646) 7-bit character-encoding standard. To achieve widespread acceptance, this encapsulation was based on popular usage. The standard was based on the already published American Standard Code for Information Interchange, better known as ASCII, which included in the character set the 26 × 2 letters of the English alphabet. Later standards issued by the ISO, for example ISO/IEC 8859 (8-bit character encoding) and ISO/IEC 10646 (Unicode Latin), have continued to define the 26 × 2 letters of the English alphabet as the basic Latin script with extensions to handle other letters in other languages.[1] Terminology Name for Unicode block that contains all letters The Unicode block that contains the alphabet is called "C0 Controls and Basic Latin".
    [Show full text]
  • The Stata Journal
    The Stata Journal View metadata, citation and similar papers at core.ac.uk brought to you by CORE Editor Executive Editor provided by Research Papers in Economics H. Joseph Newton Nicholas J. Cox Department of Statistics Department of Geography Texas A & M University University of Durham College Station, Texas 77843 South Road 979-845-3142; FAX 979-845-3144 Durham City DH1 3LE UK [email protected] [email protected] Associate Editors Christopher Baum J. Scott Long Boston College Indiana University Rino Bellocco Thomas Lumley Karolinska Institutet University of Washington, Seattle David Clayton Roger Newson Cambridge Inst. for Medical Research King’s College, London Mario A. Cleves Marcello Pagano Univ. of Arkansas for Medical Sciences Harvard School of Public Health Charles Franklin Sophia Rabe-Hesketh University of Wisconsin, Madison University of California, Berkeley Joanne M. Garrett J. Patrick Royston University of North Carolina MRC Clinical Trials Unit, London Allan Gregory Philip Ryan Queen’s University University of Adelaide James Hardin Mark E. Schaffer University of South Carolina Heriot-Watt University, Edinburgh Stephen Jenkins Jeroen Weesie University of Essex Utrecht University Jens Lauritsen Jeffrey Wooldridge Odense University Hospital Michigan State University Stanley Lemeshow Ohio State University Stata Press Production Manager Lisa Gilmore Copyright Statement: The Stata Journal and the contents of the supporting files (programs, datasets, and help files) are copyright c by StataCorp LP. The contents of the supporting files (programs, datasets, and help files) may be copied or reproduced by any means whatsoever, in whole or in part, as long as any copy or reproduction includes attribution to both (1) the author and (2) the Stata Journal.
    [Show full text]
  • Unicode and Code Page Support
    Natural for Mainframes Unicode and Code Page Support Version 4.2.6 for Mainframes October 2009 This document applies to Natural Version 4.2.6 for Mainframes and to all subsequent releases. Specifications contained herein are subject to change and these changes will be reported in subsequent release notes or new editions. Copyright © Software AG 1979-2009. All rights reserved. The name Software AG, webMethods and all Software AG product names are either trademarks or registered trademarks of Software AG and/or Software AG USA, Inc. Other company and product names mentioned herein may be trademarks of their respective owners. Table of Contents 1 Unicode and Code Page Support .................................................................................... 1 2 Introduction ..................................................................................................................... 3 About Code Pages and Unicode ................................................................................ 4 About Unicode and Code Page Support in Natural .................................................. 5 ICU on Mainframe Platforms ..................................................................................... 6 3 Unicode and Code Page Support in the Natural Programming Language .................... 7 Natural Data Format U for Unicode-Based Data ....................................................... 8 Statements .................................................................................................................. 9 Logical
    [Show full text]
  • SAS 9.3 UTF-8 Encoding Support and Related Issue Troubleshooting
    SAS 9.3 UTF-8 Encoding Support and Related Issue Troubleshooting Jason (Jianduan) Liang SAS certified: Platform Administrator, Advanced Programmer for SAS 9 Agenda Introduction UTF-8 and other encodings SAS options for encoding and configuration Other Considerations for UTF-8 data Encoding issues troubleshooting techniques (tips) Introduction What is UTF-8? . A character encoding capable of encoding all possible characters Why UTF-8? . Dominant encoding of the www (86.5%) SAS system options for encoding . Encoding – instructs SAS how to read, process and store data . Locale - instructs SAS how to present or display currency, date and time, set timezone values UTF-8 and other Encodings ASSCII (American Standard Code for Information Interchange) . 7-bit . 128 - character set . Examples (code point-char-hex): 32-Space-20; 63-?-3F; 64-@-40; 65-A-41 UTF-8 and other Encodings ISO 8859-1 (Latin-1) for Western European languages Windows-1252 (Latin-1) for Western European languages . 8-bit (1 byte, 256 character set) . Identical to asscii for the first 128 chars . Extended ascii chars examples: . 155-£-A3; 161- ©-A9 . SAS option encoding value: wlatin1 (latin1) UTF-8 and other Encodings UTF-8 and other Encodings Problems . Only covers English and Western Europe languages, ISO-8859-2, …15 . Multiple encoding is required to support national languages . Same character encoded differently, same code point represents different chars Unicode . Unicode – assign a unique code/number to every possible character of all languages . Examples of unicode points: o U+0020 – Space U+0041 – A o U+00A9 - © U+C3BF - ÿ UTF-8 and other Encodings UTF-8 .
    [Show full text]
  • The Not So Short Introduction to Latex2ε
    The Not So Short Introduction to LATEX 2ε Or LATEX 2ε in 139 minutes by Tobias Oetiker Hubert Partl, Irene Hyna and Elisabeth Schlegl Version 4.20, May 31, 2006 ii Copyright ©1995-2005 Tobias Oetiker and Contributers. All rights reserved. This document is free; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version. This document is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details. You should have received a copy of the GNU General Public License along with this document; if not, write to the Free Software Foundation, Inc., 675 Mass Ave, Cambridge, MA 02139, USA. Thank you! Much of the material used in this introduction comes from an Austrian introduction to LATEX 2.09 written in German by: Hubert Partl <[email protected]> Zentraler Informatikdienst der Universität für Bodenkultur Wien Irene Hyna <[email protected]> Bundesministerium für Wissenschaft und Forschung Wien Elisabeth Schlegl <noemail> in Graz If you are interested in the German document, you can find a version updated for LATEX 2ε by Jörg Knappen at CTAN:/tex-archive/info/lshort/german iv Thank you! The following individuals helped with corrections, suggestions and material to improve this paper. They put in a big effort to help me get this document into its present shape.
    [Show full text]
  • Basis Technology Unicode対応ライブラリ スペックシート 文字コード その他の名称 Adobe-Standard-Encoding A
    Basis Technology Unicode対応ライブラリ スペックシート 文字コード その他の名称 Adobe-Standard-Encoding Adobe-Symbol-Encoding csHPPSMath Adobe-Zapf-Dingbats-Encoding csZapfDingbats Arabic ISO-8859-6, csISOLatinArabic, iso-ir-127, ECMA-114, ASMO-708 ASCII US-ASCII, ANSI_X3.4-1968, iso-ir-6, ANSI_X3.4-1986, ISO646-US, us, IBM367, csASCI big-endian ISO-10646-UCS-2, BigEndian, 68k, PowerPC, Mac, Macintosh Big5 csBig5, cn-big5, x-x-big5 Big5Plus Big5+, csBig5Plus BMP ISO-10646-UCS-2, BMPstring CCSID-1027 csCCSID1027, IBM1027 CCSID-1047 csCCSID1047, IBM1047 CCSID-290 csCCSID290, CCSID290, IBM290 CCSID-300 csCCSID300, CCSID300, IBM300 CCSID-930 csCCSID930, CCSID930, IBM930 CCSID-935 csCCSID935, CCSID935, IBM935 CCSID-937 csCCSID937, CCSID937, IBM937 CCSID-939 csCCSID939, CCSID939, IBM939 CCSID-942 csCCSID942, CCSID942, IBM942 ChineseAutoDetect csChineseAutoDetect: Candidate encodings: GB2312, Big5, GB18030, UTF32:UTF8, UCS2, UTF32 EUC-H, csCNS11643EUC, EUC-TW, TW-EUC, H-EUC, CNS-11643-1992, EUC-H-1992, csCNS11643-1992-EUC, EUC-TW-1992, CNS-11643 TW-EUC-1992, H-EUC-1992 CNS-11643-1986 EUC-H-1986, csCNS11643_1986_EUC, EUC-TW-1986, TW-EUC-1986, H-EUC-1986 CP10000 csCP10000, windows-10000 CP10001 csCP10001, windows-10001 CP10002 csCP10002, windows-10002 CP10003 csCP10003, windows-10003 CP10004 csCP10004, windows-10004 CP10005 csCP10005, windows-10005 CP10006 csCP10006, windows-10006 CP10007 csCP10007, windows-10007 CP10008 csCP10008, windows-10008 CP10010 csCP10010, windows-10010 CP10017 csCP10017, windows-10017 CP10029 csCP10029, windows-10029 CP10079 csCP10079, windows-10079
    [Show full text]
  • JS Character Encodings
    JS � Character Encodings Anna Henningsen · @addaleax · she/her 1 It’s good to be back! 2 ??? https://travis-ci.org/node-ffi-napi/get-symbol-from-current-process-h/jobs/641550176 3 So … what’s a character encoding? People are good with text, computers are good with numbers Text List of characters “Encoding” List of bytes List of integers 4 So … what’s a character encoding? People are good with text, computers are good with numbers Hello [‘H’,’e’,’l’,’l’,’o’] 68 65 6c 6c 6f [72, 101, 108, 108, 111] 5 So … what’s a character encoding? People are good with text, computers are good with numbers 你好! [‘你’,’好’] ??? ??? 6 ASCII 0 0x00 <NUL> … … … 65 0x41 A 66 0x42 B 67 0x43 C … … … 97 0x61 a 98 0x62 b … … … 127 0x7F <DEL> 7 ASCII ● 7-bit ● Covers most English-language use cases ● … and that’s pretty much it 8 ISO-8859-*, Windows code pages ● Idea: Usually, transmission has 8 bit per byte available, so create ASCII-extending charsets for more languages ISO-8859-1 (Western) ISO-8859-5 (Cyrillic) Windows-1251 (Cyrillic) (aka Latin-1) … … … … 0xD0 Ð а Р 0xD1 Ñ б С 0xD2 Ò в Т … … … … 9 GBK ● Idea: Also extend ASCII, but use 2-byte for Chinese characters … … 0x41 A 0x42 B … … 0xC4 0xE3 你 0xC4 0xE4 匿 … … 10 https://xkcd.com/927/ 11 Unicode: Multiple encodings! 4d c3 bc 6c 6c (UTF-8) U+004D M “Müll” U+00FC ü 4d 00 fc 00 6c 00 6c 00 (UTF-16LE) U+006C l U+006C l 00 4d 00 fc 00 6c 00 6c (UTF-16BE) 12 Unicode ● New idea: Don’t create a gazillion charsets, and drop 1-byte/2-byte restriction ● Shared character set for multiple encodings: U+XXXX with 4 hex digits, e.g.
    [Show full text]
  • CONTENTS 1. Introduction...1 2. RPL Principles
    CONTENTS 1. Introduction...................................... 1 2. RPL Principles.................................... 2 2.1 Origins..................................... 2 2.2 Mathematical Control........................ 3 2.3 Formal Definitions ......................... 5 2.4 Execution................................... 6 2.4.1 EVAL............................... 8 2.4.2 Data Class Objects................. 9 2.4.3 Identifier Class Objects........... 9 2.4.4 Procedure Class Objects............ 10 2.4.5 Object Skipover and SEMI........... 10 2.4.6 RPL Pointers....................... 11 2.5 Memory Management........................... 11 2.6 User RPL and System RPL..................... 13 2.7 Programming in System RPL................... 14 2.8 Sample RPL Program.......................... 16 2.8.1 The Source File.................... 16 2.8.2 Compiling the Program.............. 18 3. Object Structures................................. 19 3.1 Object Types................................ 19 3.1.1 Identifier Object.................. 19 3.1.2 Temporary Identifier Object........ 19 3.1.3 ROM Pointer Object................. 20 3.1.4 Binary Integer Object.............. 20 3.1.5 Real Number Object................. 20 3.1.6 Extended Real Number Object........ 21 3.1.7 Complex Number Object.............. 22 3.1.8 Extended Complex Number Object............................. 22 3.1.9 Array Object....................... 23 3.1.10 Linked Array Object................ 23 3.1.11 Character String Object............ 25 3.1.12 Hex String Object.................. 25 3.1.13 Character Object................... 25 3.1.14 Unit Object........................ 26 3.1.15 Code Object........................ 26 3.1.16 Primitive Code Object.............. 27 3.1.17 Program Object..................... 27 3.1.18 List Object........................ 28 3.1.19 Symbolic Object.................... 28 3.1.20 Directory Object................... 29 3.1.21 Graphics Object.................... 29 3.2 Terminology and Abbreviations............... 30 4.
    [Show full text]
  • Solaris Internationalization Guide for Developers
    Solaris Internationalization Guide For Developers Sun Microsystems, Inc. 901 San Antonio Road Palo Alto, CA 94303-4900 U.S.A. Part No: 805-4123–10 October 1998 Copyright 1998 Sun Microsystems, Inc. 901 San Antonio Road, Palo Alto, California 94303-4900 U.S.A. All rights reserved. This product or document is protected by copyright and distributed under licenses restricting its use, copying, distribution, and decompilation. No part of this product or document may be reproduced in any form by any means without prior written authorization of Sun and its licensors, if any. Third-party software, including font technology, is copyrighted and licensed from Sun suppliers. Parts of the product may be derived from Berkeley BSD systems, licensed from the University of California. UNIX is a registered trademark in the U.S. and other countries, exclusively licensed through X/Open Company, Ltd. Sun, Sun Microsystems, the Sun logo, SunDocs, Java, the Java Coffee Cup logo, and Solaris are trademarks, registered trademarks, or service marks of Sun Microsystems, Inc. in the U.S. and other countries. All SPARC trademarks are used under license and are trademarks or registered trademarks of SPARC International, Inc. in the U.S. and other countries. Products bearing SPARC trademarks are based upon an architecture developed by Sun Microsystems, Inc. SunOS, Solaris, X11, SPARC, UNIX, PostScript, OpenWindows, AnswerBook, SunExpress, SPARCprinter, JumpStart, Xlib The OPEN LOOK and SunTM Graphical User Interface was developed by Sun Microsystems, Inc. for its users and licensees. Sun acknowledges the pioneering efforts of Xerox in researching and developing the concept of visual or graphical user interfaces for the computer industry.
    [Show full text]
  • Unicode Unicode
    You are at: ALA.org » PLA » Professional Tools » PLA Tech Notes » Unicode Unicode By Richard W. Boss Unicode is an international character-encoding standard designed to support the electronic exchange, processing, storage, and display of the written texts of all of the world’s languages and scripts, including scientific, mathematical, and technical symbols. Unicode is an encoding standard, not a software program or a font. The standard is independent of operating systems, programming languages, database management systems, and hardware platforms. Before Unicode, there were hundreds of different encoding schemes; no single one could contain enough characters. The European Union alone required several for its languages. The Importance of Unicode for Libraries While academic libraries have had holdings in a number of languages and scripts for many decades, public libraries—as the result of the increasingly diverse communities that they serve-- have begun to purchase books in a wide variety of languages and scripts. There is hardly a metropolitan area in North America that does not have immigrants from a score of countries. Even public libraries in small communities are serving an increasing number of people for whom English is not their primary language. Unicode is also important for resource sharing. Libraries need to be able to borrow or lend materials in a wide variety of languages and scripts on behalf of patrons who cannot find what they need in their own libraries. The ability to send and receive bibliographic information among Unicode-conforming libraries is a major breakthrough. Even libraries that have no interest in Unicode for themselves should appreciate the fact that vendors of integrated library systems and other information technologies that support Unicode are able to market their products globally, therefore, strengthening them financially and making it possible to maintain aggressive product development programs.
    [Show full text]
  • Nuts 'N' Bolts: Legal-Writing Mechanics—Part I Gerald Lebovits
    University of Ottawa Faculty of Law (Civil Law Section) From the SelectedWorks of Hon. Gerald Lebovits June, 2009 Nuts 'n' Bolts: Legal-Writing Mechanics—Part I Gerald Lebovits Available at: https://works.bepress.com/gerald_lebovits/158/ JUNE 2009 VOL. 81 | NO. 5 JournalNEW YORK STATE BAR ASSOCIATION Attorney Web Sites Ethical Issues Are Only the Beginning Also in this Issue From the “Code” to the by Adam R. Bialek, Paris A. Gunther “Rules” and Scott M. Smedresman “Cohabitation” Clauses After Graev Avoiding IRS Tax Disputes 2008 Insurance Law – Part II THE LEGAL WRITER BY GERALD LEBOVITS GERALD LEBOVITS is a judge at the New York City Civil Court, Housing Part, in Manhattan and an adjunct professor at St. John’s University School of Law. For their research help on Parts I and II of this column, he thanks Alexandra Standish, his court attorney, and St. John’s University law student Jamie Bunyan. Judge Lebovits’s e-mail address is [email protected]. Nuts ’n’ Bolts: Legal-Writing Mechanics — Part I riting mechanics refers to unless all the pieces fit. Making your feet from the car.” Tanbook and Redbook writing details: numbers, nuts ’n’ bolts fit just right will assure an example: “The top law students earned Wnumerals, and figures; typo- accessible, comprehensible, and con- 2, 4, and 11 awards, respectively.” graphic symbols; abbreviations; italics sistent document. Don’t start a sentence with a fig- and underlining; and capitalizations. ure.11 Incorrect: “2009 marks the one- No one right way best tightens 1. Numbers, Numerals, and Figures year anniversary of his admission legal-writing nuts ’n’ bolts, but differ- A number is an abstract mathematical to the New York bar.” Correct: “Two ent authorities try.
    [Show full text]