Emoji Symbols Proposed for New Encoding

Total Page:16

File Type:pdf, Size:1020Kb

Emoji Symbols Proposed for New Encoding Emoji Symbols Proposed for New Encoding For the Proposal for Encoding Emoji Symbols L2/09-026R Date: 2009-Feb-06 Authors: Markus Scherer, Mark Davis, Kat Momoi, Darick Tong (Google Inc.) Yasuo Kida, Peter Edberg (Apple Inc.) In the HTML version of this document, each symbol row has an anchor to allow direct linking by appending #e-4B0 (for example) to this page's URL in the address bar. Miscellaneous technical Code Symbol Name & Annotations Internal ID Point U+23E9 ALARM CLOCK e-02A UI symbols Code Symbol Name & Annotations Internal ID Point U+23EA BLACK RIGHT-POINTING DOUBLE TRIANGLE e-AFE = fast forward U+23EB BLACK LEFT-POINTING DOUBLE TRIANGLE e-AFF = fast rewind U+23EC BLACK UP-POINTING DOUBLE TRIANGLE e-B03 U+23ED BLACK DOWN-POINTING DOUBLE TRIANGLE e-B02 Miscellaneous symbols Code Symbol Name & Annotations Internal ID Point U+2705 WHITE HEAVY CHECK MARK e-B4A x (heavy check mark - 2714) RAISED FIST U+270A = rock in Rock, Paper, Scissors game e-B93 x (raised hand - 270B) x (victory hand - 270C) RAISED HAND U+270B = paper in Rock, Paper, Scissors game e-B95 x (raised fist - 270A) x (victory hand - 270C) Length mark Code Symbol Name & Annotations Internal ID Point U+2E32 LOOPED LENGTH MARK e-B08 Proposed Properties: gc=Pd Negative squared Latin letters Code Symbol Name & Annotations Internal ID 1 of 29 Point U+1F170 NEGATIVE SQUARED LATIN CAPITAL LETTER A e-50B A = blood type A U+1F171 NEGATIVE SQUARED LATIN CAPITAL LETTER B e-50C B = blood type B U+1F17E NEGATIVE SQUARED LATIN CAPITAL LETTER O e-50E O = blood type O U+1F18E NEGATIVE SQUARED LATIN CAPITAL LETTER AB e-50D = blood type AB Square katakanas Code Symbol Name & Annotations Internal ID Point SQUARED KATAKANA KOKO U+1F201 = here sign e-B24 # <square> 30B3 30B3 SQUARED KATAKANA SA U+1F202 = service sign e-B3F x (circled katakana sa - 32DA) # <square> 30B5 Squared ideographs Code Symbol Name & Annotations Internal ID Point SQUARED CJK UNIFIED IDEOGRAPH-7981 U+1F231 = prohibited sign e-B2E # <square> 7981 SQUARED CJK UNIFIED IDEOGRAPH-7A7A U+1F232 = empty sign e-B2F # <square> 7A7A SQUARED CJK UNIFIED IDEOGRAPH-5408 U+1F233 = passed sign e-B30 # <square> 5408 SQUARED CJK UNIFIED IDEOGRAPH-6E80 U+1F234 = full sign e-B31 # <square> 6E80 SQUARED CJK UNIFIED IDEOGRAPH-6709 U+1F235 = existence sign e-B39 x (circled ideograph have - 3292) # <square> 6709 SQUARED CJK UNIFIED IDEOGRAPH-6708 U+1F236 = monthly sign e-B3B x (circled ideograph moon - 328A) # <square> 6708 SQUARED CJK UNIFIED IDEOGRAPH-7533 U+1F237 = application sign e-B3C # <square> 7533 SQUARED CJK UNIFIED IDEOGRAPH-5272 U+1F238 = discount sign e-B3E # <square> 5272 SQUARED CJK UNIFIED IDEOGRAPH-55B6 U+1F239 = in business sign e-B41 # <square> 55B6 Circled ideographs Code Symbol Name & Annotations Internal ID Point CIRCLED IDEOGRAPH ADVANTAGE U+1F250 = advantage sign e-B3D # <circle> 5F97 CIRCLED IDEOGRAPH ACCEPT U+1F251 = accept sign e-B50 # <circle> 53EF 2 of 29 Weather and landscape symbols Code Symbol Name & Annotations Internal ID Point U+1F300 CYCLONE e-005 = typhoon, hurricane U+1F301 FOG e-006 U+1F302 CLOSED UMBRELLA e-007 U+1F303 NIGHT WITH STARS e-008 U+1F304 SUNRISE OVER MOUNTAINS e-009 U+1F305 SUNRISE e-00A U+1F306 CITYSCAPE AT DUSK e-00B U+1F307 SUNSET OVER BUILDINGS e-00C U+1F308 RAINBOW e-00D U+1F309 BRIDGE AT NIGHT e-010 Moon symbols Code Symbol Name & Annotations Internal ID Point U+1F30A NEW MOON e-011 U+1F30B WAXING MOON e-012 U+1F30C HALF MOON e-013 CRESCENT MOON U+1F30D * indicate either first or last quarter moon e-014 x (first quarter moon - 263D) x (last quarter moon - 263E) U+1F30E FULL MOON e-015 U+1F30F HALF MOON WITH FACE e-016 Time symbols Code Symbol Name & Annotations Internal ID Point U+1F310 SOON WITH RIGHT ARROW ABOVE e-018 U+1F311 ON WITH DOUBLE POINTING ARROW ABOVE e-019 U+1F312 END WITH LEFT ARROW ABOVE e-01A U+1F313 HOURGLASS WITH FLOWING SAND e-01B x (hourglass - 231B) U+1F314 CLOCK FACE ONE OCLOCK e-01E U+1F315 CLOCK FACE TWO OCLOCK e-01F 3 of 29 U+1F316 CLOCK FACE THREE OCLOCK e-020 U+1F317 CLOCK FACE FOUR OCLOCK e-021 U+1F318 CLOCK FACE FIVE OCLOCK e-022 U+1F319 CLOCK FACE SIX OCLOCK e-023 U+1F31A CLOCK FACE SEVEN OCLOCK e-024 U+1F31B CLOCK FACE EIGHT OCLOCK e-025 U+1F31C CLOCK FACE NINE OCLOCK e-026 U+1F31D CLOCK FACE TEN OCLOCK e-027 U+1F31E CLOCK FACE ELEVEN OCLOCK e-028 U+1F31F CLOCK FACE TWELVE OCLOCK e-029 Zodiacal symbol Code Symbol Name & Annotations Internal ID Point U+1F320 OPHIUCHUS e-037 Miscellaneous symbols Code Symbol Name & Annotations Internal ID Point U+1F321 WATER WAVE e-038 U+1F322 EARTH GLOBE e-039 U+1F323 VOLCANO e-03A U+1F324 MILKY WAY e-03B Plant symbols Code Symbol Name & Annotations Internal ID Point U+1F325 FOUR LEAF CLOVER e-03C x (shamrock - 2618) U+1F326 TULIP e-03D U+1F327 SEEDLING e-03E U+1F328 MAPLE LEAF e-03F U+1F329 CHERRY BLOSSOM e-040 U+1F32A ROSE e-041 U+1F32B FALLEN LEAF e-042 4 of 29 U+1F32C LEAF FLUTTERING IN WIND e-043 U+1F32D HIBISCUS e-045 U+1F32E SUNFLOWER e-046 U+1F32F PALM TREE e-047 U+1F330 CACTUS e-048 U+1F331 EAR OF RICE e-049 U+1F332 CORN e-04A U+1F333 MUSHROOM e-04B U+1F334 CHESTNUT e-04C U+1F335 BLOSSOM e-04D U+1F336 HERB e-04E Fruit symbols Code Symbol Name & Annotations Internal ID Point U+1F337 CHERRIES e-04F U+1F338 BANANA e-050 U+1F339 APPLE-1 e-051 U+1F33A TANGERINE e-052 U+1F33B STRAWBERRY e-053 U+1F33C WATERMELON e-054 U+1F33D TOMATO e-055 U+1F33E EGGPLANT e-056 U+1F33F MELON e-057 U+1F340 PINEAPPLE e-058 U+1F341 GRAPES e-059 U+1F342 PEACH e-05A U+1F343 APPLE-2 e-05B Facial parts symbols Code Symbol Name & Annotations Internal ID Point U+1F344 EYES e-190 5 of 29 U+1F345 EAR e-191 U+1F346 NOSE e-192 U+1F347 MOUTH e-193 U+1F348 TONGUE e-194 Personal care symbols Code Symbol Name & Annotations Internal ID Point U+1F349 LIPSTICK e-195 U+1F34A NAIL CARE e-196 U+1F34B FACE MASSAGE e-197 U+1F34C HAIRCUT e-198 A * usually indicates a beauty parlor U+1F34D BARBER POLE e-199 Miscellaneous symbol Code Symbol Name & Annotations Internal ID Point U+1F34E SILHOUETTE OF BUST e-19A Portrait and role symbols Code Symbol Name & Annotations Internal ID Point U+1F34F BOYS HEAD e-19B U+1F350 GIRLS HEAD e-19C U+1F351 MANS HEAD e-19D U+1F352 WOMANS HEAD e-19E U+1F353 FAMILY e-19F U+1F354 COUPLE e-1A0 U+1F355 POLICE OFFICER e-1A1 U+1F356 WOMAN WITH BUNNY EARS e-1A2 U+1F357 BRIDE WITH VEIL e-1A3 U+1F358 WESTERN PERSON e-1A4 U+1F359 MAN WITH GUA PI MAO e-1A5 U+1F35A MAN WITH TURBAN e-1A6 6 of 29 U+1F35B OLDER MAN e-1A7 U+1F35C OLDER WOMAN e-1A8 U+1F35D BABY e-1A9 U+1F35E CONSTRUCTION WORKER e-1AA Fairy tale symbols Code Symbol Name & Annotations Internal ID Point U+1F35F PRINCESS e-1AB U+1F360 OGRE e-1AC U+1F361 GOBLIN e-1AD U+1F362 GHOST e-1AE U+1F363 ANGEL e-1AF U+1F364 EXTRATERRESTRIAL ALIEN e-1B0 U+1F365 ALIEN MONSTER e-1B1 U+1F366 IMP e-1B2 U+1F367 SKULL e-1B3 Role symbols Code Symbol Name & Annotations Internal ID Point U+1F368 INFORMATION DESK PERSON e-1B4 U+1F369 GUARDSMAN e-1B5 U+1F36A DANCER e-1B6 Animal symbols Code Symbol Name & Annotations Internal ID Point U+1F36B DOG e-1B7 U+1F36C CAT e-1B8 U+1F36D SNAIL e-1B9 U+1F36E BABY CHICK e-1BA U+1F36F FRONT-FACING BABY CHICK e-1BB U+1F370 HATCHING CHICK e-1DD 7 of 29 U+1F371 PENGUIN e-1BC U+1F372 FISH e-1BD U+1F373 HORSE e-1BE U+1F374 PIG e-1BF U+1F375 TIGER e-1C0 U+1F376 BEAR e-1C1 U+1F377 MOUSE e-1C2 U+1F378 WHALE e-1C3 U+1F379 MONKEY FACE e-1C4 U+1F37A OCTOPUS e-1C5 U+1F37B SPIRAL SHELL e-1C6 U+1F37C DOLPHIN e-1C7 U+1F37D BIRD e-1C8 U+1F37E TROPICAL FISH e-1C9 U+1F37F HAMSTER e-1CA U+1F380 BUG e-1CB U+1F381 ELEPHANT e-1CC U+1F382 KOALA e-1CD U+1F383 MONKEY e-1CE U+1F384 SHEEP e-1CF U+1F385 WOLF e-1D0 U+1F386 COW e-1D1 U+1F387 RABBIT e-1D2 U+1F388 SNAKE e-1D3 U+1F389 CHICKEN e-1D4 U+1F38A BOAR e-1D5 U+1F38B CAMEL e-1D6 U+1F38C FROG e-1D7 U+1F38D POODLE e-1D8 U+1F38E BLOWFISH e-1D9 8 of 29 U+1F38F ANT e-1DA U+1F390 PAW PRINTS e-1DB U+1F391 TURTLE e-1DC U+1F392 DRAGON e-1DE U+1F393 PANDA e-1DF U+1F394 PIG NOSE e-1E0 U+1F395 HONEYBEE e-1E1 U+1F396 LADYBUG e-1E2 Faces used as emoticons Code Symbol Name & Annotations Internal ID Point U+1F397 ANGRY FACE e-320 U+1F398 ANGUISHED FACE e-321 U+1F399 ASTONISHED FACE e-322 U+1F39A DISAPPOINTED FACE e-323 U+1F39B DIZZY FACE e-324 U+1F39C EXASPERATED FACE e-325 U+1F39D EXPRESSIONLESS FACE e-326 U+1F39E FACE WITH HEART SHAPED EYES e-327 U+1F39F FACE WITH LOOK OF TRIUMPH e-328 U+1F3A0 WINKING FACE WITH STUCK OUT TONGUE e-329 U+1F3A1 FACE WITH STUCK OUT TONGUE e-32A U+1F3A2 FACE SAVORING DELICIOUS FOOD e-32B U+1F3A3 A FACE THROWING A KISS e-32C U+1F3A4 FACE KISSING e-32D U+1F3A5 A FACE WITH MASK e-32E U+1F3A6 FLUSHED FACE e-32F U+1F3A7 HAPPY FACE WITH OPEN MOUTH e-330 U+1F3A8 HAPPY FACE WITH OPEN MOUTH AND COLD SWEAT e-331 U+1F3A9 HAPPY FACE WITH OPEN MOUTH AND CLOSED EYES e-332 9 of 29 U+1F3AA HAPPY FACE WITH GRIN e-333 U+1F3AB HAPPY AND CRYING FACE e-334 U+1F3AC HAPPY FACE WITH WIDE MOUTH AND RAISED EYEBROWS e-335 U+1F3AD HAPPY FACE WITH OPEN MOUTH AND RAISED EYEBROWS e-338 U+1F3AE CRYING FACE e-339 U+1F3AF LOUDLY CRYING FACE e-33A U+1F3B0 FEARFUL FACE e-33B U+1F3B1 PERSEVERING FACE e-33C U+1F3B2 POUTING FACE e-33D U+1F3B3 RELIEVED FACE e-33E U+1F3B4 CONFOUNDED FACE e-33F U+1F3B5 PENSIVE FACE e-340 U+1F3B6 FACE SCREAMING IN FEAR e-341 U+1F3B7 SLEEPY FACE e-342 U+1F3B8 SMIRKING FACE e-343 U+1F3B9 FACE WITH COLD SWEAT e-344 U+1F3BA DISAPPOINTED BUT RELIEVED FACE e-345 U+1F3BB TIRED FACE e-346 U+1F3BC WINKING FACE e-347 Cat faces
Recommended publications
  • Transport and Map Symbols Range: 1F680–1F6FF
    Transport and Map Symbols Range: 1F680–1F6FF This file contains an excerpt from the character code tables and list of character names for The Unicode Standard, Version 14.0 This file may be changed at any time without notice to reflect errata or other updates to the Unicode Standard. See https://www.unicode.org/errata/ for an up-to-date list of errata. See https://www.unicode.org/charts/ for access to a complete list of the latest character code charts. See https://www.unicode.org/charts/PDF/Unicode-14.0/ for charts showing only the characters added in Unicode 14.0. See https://www.unicode.org/Public/14.0.0/charts/ for a complete archived file of character code charts for Unicode 14.0. Disclaimer These charts are provided as the online reference to the character contents of the Unicode Standard, Version 14.0 but do not provide all the information needed to fully support individual scripts using the Unicode Standard. For a complete understanding of the use of the characters contained in this file, please consult the appropriate sections of The Unicode Standard, Version 14.0, online at https://www.unicode.org/versions/Unicode14.0.0/, as well as Unicode Standard Annexes #9, #11, #14, #15, #24, #29, #31, #34, #38, #41, #42, #44, #45, and #50, the other Unicode Technical Reports and Standards, and the Unicode Character Database, which are available online. See https://www.unicode.org/ucd/ and https://www.unicode.org/reports/ A thorough understanding of the information contained in these additional sources is required for a successful implementation.
    [Show full text]
  • Assessment of Options for Handling Full Unicode Character Encodings in MARC21 a Study for the Library of Congress
    1 Assessment of Options for Handling Full Unicode Character Encodings in MARC21 A Study for the Library of Congress Part 1: New Scripts Jack Cain Senior Consultant Trylus Computing, Toronto 1 Purpose This assessment intends to study the issues and make recommendations on the possible expansion of the character set repertoire for bibliographic records in MARC21 format. 1.1 “Encoding Scheme” vs. “Repertoire” An encoding scheme contains codes by which characters are represented in computer memory. These codes are organized according to a certain methodology called an encoding scheme. The list of all characters so encoded is referred to as the “repertoire” of characters in the given encoding schemes. For example, ASCII is one encoding scheme, perhaps the one best known to the average non-technical person in North America. “A”, “B”, & “C” are three characters in the repertoire of this encoding scheme. These three characters are assigned encodings 41, 42 & 43 in ASCII (expressed here in hexadecimal). 1.2 MARC8 "MARC8" is the term commonly used to refer both to the encoding scheme and its repertoire as used in MARC records up to 1998. The ‘8’ refers to the fact that, unlike Unicode which is a multi-byte per character code set, the MARC8 encoding scheme is principally made up of multiple one byte tables in which each character is encoded using a single 8 bit byte. (It also includes the EACC set which actually uses fixed length 3 bytes per character.) (For details on MARC8 and its specifications see: http://www.loc.gov/marc/.) MARC8 was introduced around 1968 and was initially limited to essentially Latin script only.
    [Show full text]
  • Character Properties 4
    The Unicode® Standard Version 14.0 – Core Specification To learn about the latest version of the Unicode Standard, see https://www.unicode.org/versions/latest/. Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks. Where those designations appear in this book, and the publisher was aware of a trade- mark claim, the designations have been printed with initial capital letters or in all capitals. Unicode and the Unicode Logo are registered trademarks of Unicode, Inc., in the United States and other countries. The authors and publisher have taken care in the preparation of this specification, but make no expressed or implied warranty of any kind and assume no responsibility for errors or omissions. No liability is assumed for incidental or consequential damages in connection with or arising out of the use of the information or programs contained herein. The Unicode Character Database and other files are provided as-is by Unicode, Inc. No claims are made as to fitness for any particular purpose. No warranties of any kind are expressed or implied. The recipient agrees to determine applicability of information provided. © 2021 Unicode, Inc. All rights reserved. This publication is protected by copyright, and permission must be obtained from the publisher prior to any prohibited reproduction. For information regarding permissions, inquire at https://www.unicode.org/reporting.html. For information about the Unicode terms of use, please see https://www.unicode.org/copyright.html. The Unicode Standard / the Unicode Consortium; edited by the Unicode Consortium. — Version 14.0. Includes index. ISBN 978-1-936213-29-0 (https://www.unicode.org/versions/Unicode14.0.0/) 1.
    [Show full text]
  • The Unicode Standard, Version 4.1 This File Contains an Excerpt from the Character Code Tables and List of Character Names for the Unicode Standard, Version 4.1
    Miscellaneous Symbols Range: 2600–26FF The Unicode Standard, Version 4.1 This file contains an excerpt from the character code tables and list of character names for The Unicode Standard, Version 4.1. Characters in this chart that are new for The Unicode Standard, Version 4.1 are shown in conjunction with any existing characters. For ease of reference, the new characters have been highlighted in the chart grid and in the names list. This file will not be updated with errata, or when additional characters are assigned to the Unicode Standard. See http://www.unicode.org/charts/ for access to a complete list of the latest character code charts. See http://www.unicode.org/Public/4.1.0/charts/ for a complete archived file of character code charts for Unicode 4.1. Disclaimer These charts are provided as the on-line reference to the character contents of the Unicode Standard, Version 4.1 but do not provide all the information needed to fully support individual scripts using the Unicode Standard. For a complete understanding of the use of the characters contained in this excerpt file, please consult the appropriate sections of The Unicode Standard, Version 4.1, at http://www.unicode.org/versions/Unicode4.1.0/, including sections unchanged in The Unicode Standard, Version 4.0 (ISBN 0-321-18578-1), as well as Unicode Standard Annexes #9, #11, #14, #15, #24, #29, #31, and #34, the other Unicode Technical Reports and Standards, and the Unicode Character Database, which are available on-line. See http://www.unicode.org/ucd/ and http://www.unicode.org/reports/ A thorough understanding of the information contained in these additional sources is required for a successful implementation.
    [Show full text]
  • Miscellaneous Mathematical Symbols-A Range: 27C0–27EF
    Miscellaneous Mathematical Symbols-A Range: 27C0–27EF This file contains an excerpt from the character code tables and list of character names for The Unicode Standard, Version 14.0 This file may be changed at any time without notice to reflect errata or other updates to the Unicode Standard. See https://www.unicode.org/errata/ for an up-to-date list of errata. See https://www.unicode.org/charts/ for access to a complete list of the latest character code charts. See https://www.unicode.org/charts/PDF/Unicode-14.0/ for charts showing only the characters added in Unicode 14.0. See https://www.unicode.org/Public/14.0.0/charts/ for a complete archived file of character code charts for Unicode 14.0. Disclaimer These charts are provided as the online reference to the character contents of the Unicode Standard, Version 14.0 but do not provide all the information needed to fully support individual scripts using the Unicode Standard. For a complete understanding of the use of the characters contained in this file, please consult the appropriate sections of The Unicode Standard, Version 14.0, online at https://www.unicode.org/versions/Unicode14.0.0/, as well as Unicode Standard Annexes #9, #11, #14, #15, #24, #29, #31, #34, #38, #41, #42, #44, #45, and #50, the other Unicode Technical Reports and Standards, and the Unicode Character Database, which are available online. See https://www.unicode.org/ucd/ and https://www.unicode.org/reports/ A thorough understanding of the information contained in these additional sources is required for a successful implementation.
    [Show full text]
  • Musical Symbols Range: 1D100–1D1FF
    Musical Symbols Range: 1D100–1D1FF This file contains an excerpt from the character code tables and list of character names for The Unicode Standard, Version 14.0 This file may be changed at any time without notice to reflect errata or other updates to the Unicode Standard. See https://www.unicode.org/errata/ for an up-to-date list of errata. See https://www.unicode.org/charts/ for access to a complete list of the latest character code charts. See https://www.unicode.org/charts/PDF/Unicode-14.0/ for charts showing only the characters added in Unicode 14.0. See https://www.unicode.org/Public/14.0.0/charts/ for a complete archived file of character code charts for Unicode 14.0. Disclaimer These charts are provided as the online reference to the character contents of the Unicode Standard, Version 14.0 but do not provide all the information needed to fully support individual scripts using the Unicode Standard. For a complete understanding of the use of the characters contained in this file, please consult the appropriate sections of The Unicode Standard, Version 14.0, online at https://www.unicode.org/versions/Unicode14.0.0/, as well as Unicode Standard Annexes #9, #11, #14, #15, #24, #29, #31, #34, #38, #41, #42, #44, #45, and #50, the other Unicode Technical Reports and Standards, and the Unicode Character Database, which are available online. See https://www.unicode.org/ucd/ and https://www.unicode.org/reports/ A thorough understanding of the information contained in these additional sources is required for a successful implementation.
    [Show full text]
  • Unicode Characters in Proofpower Through Lualatex
    Unicode Characters in ProofPower through Lualatex Roger Bishop Jones Abstract This document serves to establish what characters render like in utf8 ProofPower documents prepared using lualatex. Created 2019 http://www.rbjones.com/rbjpub/pp/doc/t055.pdf © Roger Bishop Jones; Licenced under Gnu LGPL Contents 1 Prelude 2 2 Changes 2 2.1 Recent Changes .......................................... 2 2.2 Changes Under Consideration ................................... 2 2.3 Issues ............................................... 2 3 Introduction 3 4 Mathematical operators and symbols in Unicode 3 5 Dedicated blocks 3 5.1 Mathematical Operators block .................................. 3 5.2 Supplemental Mathematical Operators block ........................... 4 5.3 Mathematical Alphanumeric Symbols block ........................... 4 5.4 Letterlike Symbols block ..................................... 6 5.5 Miscellaneous Mathematical Symbols-A block .......................... 7 5.6 Miscellaneous Mathematical Symbols-B block .......................... 7 5.7 Miscellaneous Technical block .................................. 7 5.8 Geometric Shapes block ...................................... 8 5.9 Miscellaneous Symbols and Arrows block ............................. 9 5.10 Arrows block ........................................... 9 5.11 Supplemental Arrows-A block .................................. 10 5.12 Supplemental Arrows-B block ................................... 10 5.13 Combining Diacritical Marks for Symbols block ......................... 11 5.14
    [Show full text]
  • The Unicode Standard 5.2 Code Charts
    Miscellaneous Symbols Range: 2600–26FF The Unicode Standard, Version 5.2 This file contains an excerpt from the character code tables and list of character names for The Unicode Standard, Version 5.2. Characters in this chart that are new for The Unicode Standard, Version 5.2 are shown in conjunction with any existing characters. For ease of reference, the new characters have been highlighted in the chart grid and in the names list. This file will not be updated with errata, or when additional characters are assigned to the Unicode Standard. See http://www.unicode.org/errata/ for an up-to-date list of errata. See http://www.unicode.org/charts/ for access to a complete list of the latest character code charts. See http://www.unicode.org/charts/PDF/Unicode-5.2/ for charts showing only the characters added in Unicode 5.2. See http://www.unicode.org/Public/5.2.0/charts/ for a complete archived file of character code charts for Unicode 5.2. Disclaimer These charts are provided as the online reference to the character contents of the Unicode Standard, Version 5.2 but do not provide all the information needed to fully support individual scripts using the Unicode Standard. For a complete understanding of the use of the characters contained in this file, please consult the appropriate sections of The Unicode Standard, Version 5.2, online at http://www.unicode.org/versions/Unicode5.2.0/, as well as Unicode Standard Annexes #9, #11, #14, #15, #24, #29, #31, #34, #38, #41, #42, and #44, the other Unicode Technical Reports and Standards, and the Unicode Character Database, which are available online.
    [Show full text]
  • Emoji Art: the Aesthetics of 
    Emoji art: The aesthetics of P.D. Magnus January 2018 This is an unpublished draft. Comments are welcome. e-mail: pmagnus(at)fecundity.com web: http://www.fecundity.com/job Abstract This paper explores the possibility of art made using emoji (picture characters like the pile-of-poo glyph in the title). That such art is possible is apprarent from some specific examples. Although these cases are of- ten described in terms of “translation” between emoji and English, emoji cannot generally be given a literal natural-language translation. So what kind of thing is an emoji work? A particular emoji work could turn out to be (1) a digital image, like an illustration; (2) a specified string of emoji characters, in the way that a natural-language novel is a specified string of letters; or (3) a single-instance work, a particular display. keywords: emoji, emoji art, Emoji Dick, emoji poems, art ontology Emoji are picture characters familiar from smart phone text messages. Given their ubiquity, it is inevitable that emoji have been used in works of art. What kind of art are they, though? I begin in §1 by discussing the history of emoji. One of the more notable emoji is the pile of poo which figures in the title of this paper. In§2, I consider the meaning of emoji and argue that there is not generally a natural-language translation for emoji. In §§3–4, I discuss some specific works of emoji art: Fred Benenson’s Emoji Dick and Carina Finn and Stephanie Berger’s emoji poems.
    [Show full text]
  • Outline of the Course
    Outline of the course Introduction to Digital Libraries (15%) Description of Information (30%) Access to Information (()30%) User Services (10%) Additional topics (()15%) Buliding of a (small) digital library Reference material: – Ian Witten, David Bainbridge, David Nichols, How to build a Digital Library, Morgan Kaufmann, 2010, ISBN 978-0-12-374857-7 (Second edition) – The Web FUB 2012-2013 Vittore Casarosa – Digital Libraries Part 7 -1 Access to information Representation of characters within a computer Representation of documents within a computer – Text documents – Images – Audio – Video How to store efficiently large amounts of data – Compression How to retrieve efficiently the desired item(s) out of large amounts of data – Indexing – Query execution FUB 2012-2013 Vittore Casarosa – Digital Libraries Part 7 -2 Representation of characters The “natural” wayyp to represent ( (palphanumeric ) characters (and symbols) within a computer is to associate a character with a number,,g defining a “coding table” How many bits are needed to represent the Latin alphabet ? FUB 2012-2013 Vittore Casarosa – Digital Libraries Part 7 -3 The ASCII characters The 95 printable ASCII characters, numbdbered from 32 to 126 (dec ima l) 33 control characters FUB 2012-2013 Vittore Casarosa – Digital Libraries Part 7 -4 ASCII table (7 bits) FUB 2012-2013 Vittore Casarosa – Digital Libraries Part 7 -5 ASCII 7-bits character set FUB 2012-2013 Vittore Casarosa – Digital Libraries Part 7 -6 Representation standards ASCII (late fifties) – AiAmerican
    [Show full text]
  • SSAC Advisory on the Use of Emoji in Domain Names SAC095 SSAC Advisory on the Use of Emoji in Domain Names
    SSAC Advisory on the Use of Emoji in Domain Names SAC095 SSAC Advisory on the Use of Emoji in Domain Names An Advisory from the ICANN Security and Stability Advisory Committee (SSAC) 25 May 2017 SAC095 1 SSAC Advisory on the Use of Emoji in Domain Names Preface This is an advisory to the ICANN Board, the ICANN community, and, more broadly, the Internet community from the ICANN Security and Stability Advisory Committee (SSAC) on the use of emoji in domain names. The SSAC focuses on matters relating to the security and integrity of the Internet’s naming and address allocation systems. This includes operational matters (e.g., pertaining to the correct and reliable operation of the root zone publication system), administrative matters (e.g., pertaining to address allocation and Internet number assignment), and registration matters (e.g., pertaining to registry and registrar services). SSAC engages in ongoing threat assessment and risk analysis of the Internet naming and address allocation services to assess where the principal threats to stability and security lie, and advises the ICANN community accordingly. The SSAC has no authority to regulate, enforce, or adjudicate. Those functions belong to other parties, and the advice offered here should be evaluated on its merits. SAC095 2 SSAC Advisory on the Use of Emoji in Domain Names Table of Contents 1 Introduction ................................................................................................... 4 2 Emoji in Domain Names ..............................................................................
    [Show full text]
  • Supplement Notes on Alphanumeric Representation Device, but Is Often Either 8 Or 10
    Supplement Notes on Alphanumeric Representation device, but is often either 8 or 10. LF (NL line feed, new line) - Moves the cursor (or print head) to a SUB (substitute) new line. On Unix systems, moves to a new line 2.1 Introduction AND all the way to the left. VT (vertical tab) ESC (escape) Inside a computer program or data file, text is stored as a sequence of numbers, just like everything else. These sequences are FF (form feed) - Advances paper to the top of the next page (if the FS (file separator) integers of various sizes, values, and interpretations, and it is the code pages, character sets, and encodings that determine how output device is a printer). integer values are interpreted. CR (carriage return) - Moves the cursor all the way to the left, but GS (group separator) does not advance to the next line. Text consists of characters, mostly. Fancy text or rich text includes display properties like color, italics, and superscript styles, SO (shift out) - Switches output device to alternate character set. RS (record separator) but it is still based on characters forming plain text. Sometimes the distinction between fancy text and plain text is complex, SI (shift in) - Switches output device back to default character set. US (unit separator) and the distinction may depend on the application. Here, we focus on plain text. 2. 001x xxxx Numeric and “specials” or punctuation. So, what is a character? Typically, it is a letter. Also, it is a digit, a period, a hyphen, punctuation, and mathematic symbol. 3. 010x xxxx Upper case letters (and some punctuation) There are also control characters (typically not visible) that define the end of a line or paragraph.
    [Show full text]