Internationalization 360˚ Testing

Total Page:16

File Type:pdf, Size:1020Kb

Internationalization 360˚ Testing Internationalization 360˚ Testing MahipalsinnhMahipalsinh Rana Rana Member of Technical Staff Sun Microsystems 1 Agenda z Introduction of I18n - 40 Minutes z Internationalization(I18n) 360˚ testing - 60 Minutes z Testing Standalone Applications – 15 minutes z Quiz – 10 minutes z Testing Web Applications – 15 minutes z Quiz – 10 minutes z I18n testing Automation – 30 minutes z Advanced I18n testing , References – 15 minutes z Q/A - 15 minutes Introduction z Understanding of Internationalization (I18n) z Why I18n testing z Myths for I18n testing z Scope of I18n testing z Terminologies in i18n technology z Character set/Character repertoire z Character Code/Code Point,Coded Character z Encoding , Unicode , UTF-8 ,UTF-16 ,UTF-32 z Glyph , Fonts , Input Method Engine (IME)˫ z Locale “Everyone has the right... to seek, receive and impart information and ideas through any media regardless of frontiers” -- Universal Declaration of Human Rights Why Globalization Why Globalization Sun Portal server in Chinese Why Globalization Yahoo.com in Kannada Why Globalization z “Visitors linger twice as long as they do at English-only URL's. z Business users are 3 times more likely to buy when addressed in their language. z Customer service costs drop when instructions are displayed in the user's native language." 'Strategies for Global Sites' Donald DePalma Forrester Research Inc. Why Globalization "One large IT company discovered that a significant percentage of inquiries were coming from South Korea - they created a Korean website and revenues rose by 8 percent." 'Global eCommerce' Donald J. Plumley Bowne Global Solutions What's with the acronyms? Internationalization ====> i18n , How? There are 18 characters between i and n With that logic : Localization ====>L10n Globalization ====> G11n Translation ===> T9n and you can call me M5l ==> Mahipal , Don't they all look the same? z Localization z Internationalization z Globalization z Translation How do they differ and relate? z Globalization encompasses i18n and l10n. z InternationalizationAn enables localization. z An expert in i18N may not be an expert in l10N. LISA* Definitions z Globalization-(G11n)˫ z “Globalization addresses the business issues associated with taking a product global. In the globalization of high-tech products this involves integrating localization throughout a company, after proper internationalization and product design, as well as marketing, sales, and support in the world market.” z Internationalization-(I18n)˫ z “Internationalization is the process of generalizing a product so that it can handle multiple languages and cultural conventions without the need for re- design. Internationalization takes place at the level of program design and document development.” z Localization-(L10n)˫ z “Localization involves taking a product and making it linguistically and culturally appropriate to the target locale (country/region and language) where it will be used and sold.” *Localization Industry Standards Association Why I18n testing ?˫ z I18n testing is required for enable product localization in multiple languages. z Removing barriers to localization z Enabling Unicode z Independence from UI strings in code z Handling legacy character encodings. z Separating localizable elements from source. z Enabling code to support local,regional, language, or culturally related preferences. Myths for I18n Testing z Misunderstood as translation testing z Only language expert can perform i18n testing z Done after product released z Misunderstood with product localization z It is only about String messages Terminologies of I18n z What is Character set/Character repertoire? z What is Character Code/Code Point,Coded Character? z What is Unicode? z What is meant by Encoding? z UTF-8,UTF-16,UTF-32 z What is Glyph? z What is Font? z What is Input Method Engine(IME) ? z What is Locale? What is Character, Character Set ? z A character is just an abstract minimal unit of text. It doesn't have a fixed shape (that would be a glyph), and it doesn't have a value. z "A" is a character, and so is "$", the symbol for the currency. z Character set/repertoire is a collection of characters. z Examples ȡ _ ȯ Ȫ Ǖ ȡ Ȣ ȡ ¡ ¡ ! Making the World Wide Web world wide! ࡢ࡯࡞࠼࡮ࡢࠗ࠼࡮࠙ࠚ࠶ࡉࠍ਎⇇ਛߦᐢߍ߹ߒࠂ ߁ ȡ _ ȯ ȡ _ ȡ Ȱ ȡ Ȣ ȡ `ȯ ! 놹ꫭ陹넍 낉麑 꿵넩麑 낮냱ꈑ ꎁ麙韥! What is Character Code/Code Point, Coded Character Set ? z Character Code - A mapping, which defines a one- to-one correspondence between characters in a character repertoire and a set of non-negative integers. Examples of character codes: z ASCII, ISO Latin 1 alias ISO 8859-1, ISO 10646, the Windows character set exists in different variations,or "code pages" (CP)- Windows code page 1252 etc z A Character Code point is unique non-negative integer assigned to character in character code z A coded character set is a character set where each character has been assigned to a unique code point What is Character Code/Code Point, Coded Character Set ? Image Source : z ASCII character set , one of early character set What is Character Code/Code Point, Coded Character Set ? Image Source: Ex. ASCII Character set z 8 bit character set , cover most of character needed by Europeans but What about east part of the world? Unicode z Answer is z It has characters from almost every written script in this world z European alphabetic scripts z Latin,Greek,Cyrillic,Armenian,Georgian,Runic,Ogham,Modifier letters z Middle East Scripts z Hebrew,Arabic,Syriac,Thaana z South & South East Asian scripts z Devanagari,Bengali,Gujurati,Panjabi,Oriya,Tamil,Telugu,Kannada,Mala yalam z East Asian scripts z Han,Hiragana,Katakana,Hangul,Bopomofo,Yi z Symbols Ex. ASCII Character set z Currency symbols,Letter like symbols,Mathematic operators,Numeric forms,Technical symbols,Geometrical symbols z Additional scripts z Ethiopic Cherokee Canadian Aboriginal Syllabics Mongolian What is Character Encoding ? z A mapping from a set of non-negative integers that are elements of a Coded Character Set, to a set of sequences of particular code units of some specified width, such as 8- bit/16- bit/32-bit integers z The most commonly used code units are bytes, but 16-bit or 32- bit integers can also be used for internal processing. z Examples are UTF-8,UTF-16,UTF-32 UTF-8 , UTF-16 , UTF-32 zUTF-32 simply represents each Unicode code point as the 32- bit integer of the same value. zUTF-16 uses sequences of one or two unsigned 16-bit code units to encode Unicode code points. [Values U+0000 to U+FFFF are encoded in one 16-bit unit with the same value. Supplementary characters are encoded in two code units] zUTF-8 uses sequences of one to four bytes to encode Unicode code points. [U+0000 to U+007F are encoded in one byte, U+0080 to U+07FF in two bytes, U+0800 to U+FFFF in three bytes, and U+10000 to U+10FFFF in four bytes.] Relation between Character set and Encoding Characters A ʠ ᅢ Code Point 41 5D0 597D UTF-8 41 D7 90 E5 A5 BD UTF-16 00 41 05 D0 59 7D UTF-32 00 00 00 41 00 00 05 D0 00 00 59 7D z Different encodings yield different byte sequences for same character in Character set Unicode Character set, code set, encodings Universal Unicode UTF Character Code encodings set/repertoire Points All Character UTF-8, set will be Each Unicode UTF-16, a subset of character is UTF-32 this huge assigned a are the character Unicode Code encoding repertoire. point .Range ASCII formats set,French, is U+0000 to for Japanese, U+10FFFF. internal Korean, processi Devanagari ng What is Glyph? z A glyph - a visual appearance z It is important to distinguish the character concept from the glyph concept. A glyph is a presentation of a particular shape which a character may have when rendered or displayed. z Example: a letter and different glyphs for it:latin capital letter z (U+00E9)˫ Z Z Z Z ٥Q + + Ȣ + + Ǔ + ¡ Ǔ¡Ȣ What is Font? z A repertoire of glyphs comprises a font z A font is a numbered set of glyphs. z The numbers correspond to code positions of the characters (presented by the glyphs). z Font including characters for a language should be available for an application to display text for the language What is Input Method Engine(IME)˫ z Input methods capture a sequence of keystrokes and form a character or characters as input for languages z Input Method Engine (IME) is a program or operating system component that allows computer users to enter complex characters and symbols using a standard Western keyboard. It is also referred as Input Method Environment. What is Locale? z Locale is a set of parameters that defines the user's language, country and any special variant references that the user wants to see in their user interface. z The locale naming convention is usually: language[_territory][.encoding][@modifier]. z Example for Hindi with UTF8 encoding : hi_IN.UTF8 z Encoding [ Native encoding (iso8859-*, Shift_JIS,GB18030, BIG5, ISO2022) , Unicode encoding (UTF-8, UTF-16, ) ] What is Locale? z Behavior affected by Locale z Language culture data z Sorting, searching, text boundary, text conversion z Indexing z Country culture data z Calendar, date/time/number/currency format z People name/mailing address layout I18n 360˚ Testing Approach z What is Traditional Approach z What is 360˚ Approach z Case Study z Requirement Phase z Design Phase z Implementation Phase z QA Phase z Documentation Traditional Approach of I18n Testing z Generally start after build released by development team. z In some case starts even after product release as they release separate international release z Major Focus on functionality testing z I18n testing done on
Recommended publications
  • International Language Environments Guide
    International Language Environments Guide Sun Microsystems, Inc. 4150 Network Circle Santa Clara, CA 95054 U.S.A. Part No: 806–6642–10 May, 2002 Copyright 2002 Sun Microsystems, Inc. 4150 Network Circle, Santa Clara, CA 95054 U.S.A. All rights reserved. This product or document is protected by copyright and distributed under licenses restricting its use, copying, distribution, and decompilation. No part of this product or document may be reproduced in any form by any means without prior written authorization of Sun and its licensors, if any. Third-party software, including font technology, is copyrighted and licensed from Sun suppliers. Parts of the product may be derived from Berkeley BSD systems, licensed from the University of California. UNIX is a registered trademark in the U.S. and other countries, exclusively licensed through X/Open Company, Ltd. Sun, Sun Microsystems, the Sun logo, docs.sun.com, AnswerBook, AnswerBook2, Java, XView, ToolTalk, Solstice AdminTools, SunVideo and Solaris are trademarks, registered trademarks, or service marks of Sun Microsystems, Inc. in the U.S. and other countries. All SPARC trademarks are used under license and are trademarks or registered trademarks of SPARC International, Inc. in the U.S. and other countries. Products bearing SPARC trademarks are based upon an architecture developed by Sun Microsystems, Inc. SunOS, Solaris, X11, SPARC, UNIX, PostScript, OpenWindows, AnswerBook, SunExpress, SPARCprinter, JumpStart, Xlib The OPEN LOOK and Sun™ Graphical User Interface was developed by Sun Microsystems, Inc. for its users and licensees. Sun acknowledges the pioneering efforts of Xerox in researching and developing the concept of visual or graphical user interfaces for the computer industry.
    [Show full text]
  • Development Production Line the Short Story
    Development Production Line The Short Story Jene Jasper Copyright © 2007-2018 freedumbytes.dev.net (Free Dumb Bytes) Published 3 July 2018 4.0-beta Edition While every precaution has been taken in the preparation of this installation manual, the publisher and author assume no responsibility for errors or omissions, or for damages resulting from the use of the information contained herein. This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. To get an idea of the Development Production Line take a look at the following Application Integration overview and Maven vs SonarQube Quality Assurance reports comparison. 1. Operating System ......................................................................................................... 1 1.1. Windows ........................................................................................................... 1 1.1.1. Resources ................................................................................................ 1 1.1.2. Desktop .................................................................................................. 1 1.1.3. Explorer .................................................................................................. 1 1.1.4. Windows 7 Start Menu ................................................................................ 2 1.1.5. Task Manager replacement ........................................................................... 3 1.1.6. Resource Monitor .....................................................................................
    [Show full text]
  • PTOCA Reference (Presentation Text Object Content Architecture
    Advanced Function Presentation Consortium Data Stream and Object Architectures Presentation Text Object Content Architecture Reference AFPC-0009-03 Note: Before using this information, read the information in “Notices” on page 171. AFPC-0009-03 Fourth Edition (March 2016) This edition applies to the Presentation Text Object Content Architecture (PTOCA). It is the first edition produced by the AFP Consortium™(AFPC™) and replaces and makes obsolete the previous edition, SC31-6803-02, published by the IBM® Corporation. This edition remains current until a new edition is published. Specific changes are indicated by a vertical bar to the left of the change. For a detailed list of the changes, see “Summary of Changes” on page ix. Internet Visit our home page: www.afpcinc.org Copyright © AFP Consortium 1997, 2016 ii Preface This book describes the functions and services associated with the Presentation Text Object Content Architecture (PTOCA) architecture. This book is a reference, not a tutorial. It complements individual product publications, but does not describe product implementations of the architecture. Who Should Read This Book This book is for systems programmers and other developers who need such information to develop or adapt a product or program to interoperate with other presentation products. Copyright © AFP Consortium 1997, 2016 iii AFP Consortium AFP Consortium (AFPC) The Advanced Function Presentation™(AFP™) architectures began as the strategic, general purpose document and information presentation architecture for the IBM Corporation. The first specifications and products go back to 1984. Although all of the components of the architecture have grown over the years, the major concepts of object-driven structures, print integrity, resource management, and support for high print speeds were built in from the start.
    [Show full text]
  • Higher Quality 2D Text Rendering
    Journal of Computer Graphics Techniques Vol. 2, No. 1, 2013 Higher Quality 2D Text Rendering http://jcgt.org Higher Quality 2D Text Rendering Nicolas P. Rougier INRIA No hinting Native hinting Auto hinting Vertical hinting Figure 1. When displaying text on low-resolution devices (DPI < 150), one typically has to decide if one wants to respect the pixel grid (e.g., Cleartype technology / Microsoft / native hinting) for crisp rendering or, to privilege glyph shapes (Quartz technology / Apple / no hinting) at the cost of blurring. There is, however, a third way that may combine the best of the two technologies (vertical hinting). Abstract Even though text is pervasive in most 3D applications, there is surprisingly no native sup- port for text rendering in OpenGL. To cope with this absence, Mark Kilgard introduced the use of texture fonts [Kilgard 1997]. This technique is well known and widely used and en- sures both good performances and a decent quality in most situations. However, the quality may degrade strongly in orthographic mode (screen space) due to pixelation effects at large sizes and to legibility problems at small sizes due to incorrect hinting and positioning of glyphs. In this paper, we consider font-texture rendering to develop methods to ensure the highest quality in orthographic mode. The method used allows for both the accurate render- ing and positioning of any glyph on the screen. While the method is compatible with complex shaping and/or layout (e.g., the Arabic alphabet), these specific cases are not studied in this article. 50 Journal of Computer Graphics Techniques Vol.
    [Show full text]
  • Font Features for Scheherazade
    Font Features for Scheherazade The Scheherazade font includes a number of optional features that provide alternative rendering that might be preferable for use in some contexts. The chart below enumerates the details of these features. Whether these features are available to users will depend on both the application and the rendering technology (Graphite or OpenType) being used. Most features are available in both Graphite and OpenType, though there may be minor differences in their implementation. In LibreOffice 3.4.2+ (http://www.libreoffice.org/download/) or OpenOffice 3.2+ (http://www.openoffice.org/download/) the features are available only when Graphite rendering is enabled (the default). Features can be turned on by choosing the font (i.e., Scheherazade), followed by a colon, followed by the feature ID, and then followed by the feature setting. So, for example, if the “Kurdish-style Heh” is desired, the font selection would be “Scheherazade:cv48=3”. If you wish to apply two (or more) features, you can separate them with an “&”. Thus, “Scheherazade:cv48=3&cv44=1” would apply the “Kurdish-style Heh” plus the “Sindhi-style Meem” feature. In Mozilla Firefox, with either Graphite or OpenType rendering, features can be accessed using the appropriate CSS markup. A description of how to use the font features in Mozilla Firefox can be found here: http://scripts.sil.org/cms/scripts/page.php?site_id=projects&item_id=graphite_firefox#cf8a0574 (the technique described there works for both Graphite and OpenType). Ideally the selection of these font features is done in application programs, but many applications do not yet support this functionality.
    [Show full text]
  • Javatm Internationalization Roadmap
    JavaTM Internationalization Roadmap by Brian Beck – [email protected] Stuart Gill – [email protected] Norbert Lindenberg – [email protected] John O’Conner – [email protected] Sun Microsystems Inc. Introduction The Java 2 platform as available today provides a strong foundation for the development of internationalized and multilingual applications. The functionality it provides covers all the core areas of internationalization: • It uses Unicode 2.1 as its built-in character data type. • The Locale class lets applications identify locales, allowing for truly multilingual applications. • The ResourceBundle class provides the foundation for localization, including localization for multiple locales in a single application container. • The Date, Calendar, and TimeZone classes provide the basis for time handling around the globe. • The String and Character classes as well as the java.text package contain rich functionality for text processing, formatting, and parsing. • Text stream input and output classes support converting text between Unicode and other character encodings. • Keyboard handling and the input method framework allow for text input in many languages. • The Java 2D™ system supports text rendering for many of the major languages, including bidirectional text handling for Arabic and Hebrew. • The text components in the Swing user interface toolkit provide plug-and-play text editing functionality that takes full advantage of the input method framework and the 2D rendering system. • Component orientation support lets applications switch their user interfaces between left- to-right and right-to-left layouts. To date, most of our internationalization support has been focused on European, Middle Eastern, and Far East Asian locales. Those locales are now reasonably well supported so we have begun to turn our attention to the locales of South and Southeast Asia.
    [Show full text]
  • INDIAN LANGUAGE SUPPORT in GNOME-TERMINAL B.Tech Information Technology
    INDIAN LANGUAGE SUPPORT IN GNOME-TERMINAL A Project Report Submitted by Kulkarni Swapnil 110708035 Kulkarni Mihir 110708033 DigeSourabh 110708020 in partial fulfilment for the award of the degree of B.Tech Information Technology Under the guidance of Prof. Abhijit A.M. College of Engineering, Pune DEPARTMENT OF COMPUTER ENGINEERING AND INFORMATION TECHNOLOGY, COLLEGE OF ENGINEERING, PUNE-5 May, 2011 Acknowledgements We would sincerely like to acknowledge the following people who played a very im- portant role in the success of the project. They guided us from time to time and helped us whenever we were stuck at any point. We deeply appreciate their commitment and dedication towards the spread of Open Source and the Free Software Movement in general. Prof. Abhijit A.M. - Our guide and mentor throughout the project. We thank him for giving us the opportunity to work with him and in particular on this project and for spending invaluable time in critical moments of our project. Praveen Arimbrathodiyil, Pravin Satpute, Santhosh Thottingal - For provid- ing timely help and guidance. We would also like to thank Dr. Jibi Abraham [HOD Computer Engg. and Information Technology, COEP] and all the staff of Computer Engg. and Information Technology Department. Last but not the least, we would like to thank the whole FOSS community and the people involved in Indic Langauge Computing throughtout the world without whom this project would not have been a reality. DEPARTMENT OF COMPUTER ENGINEERING AND INFORMATION TECHNOLOGY, COLLEGE OF ENGINEERING, PUNE CERTIFICATE Certified that this project, titled “INDIAN LANGUAGE SUPPORT IN GNOME- TERMINAL” has been successfully completed by Kulkarni Swapnil 110708035 Kulkarni Mihir 110708033 DigeSourabh 110708020 and is approved for the partial fulfilment of the requirements for the degree of “B.Tech.
    [Show full text]
  • Complex Text Layout Language Support in the Solaris Operating Environment
    Complex Text Layout Language Support in the Solaris Operating Environment Sun Microsystems, Inc. 901 San Antonio Road Palo Alto, CA 94303-4900 U.S.A. Part Number 806-5583 May 2000 Copyright 2000 Sun Microsystems, Inc. 901 San Antonio Road, Palo Alto, California 94303-4900 U.S.A. All rights reserved. This product or document is protected by copyright and distributed under licenses restricting its use, copying, distribution, and decompilation. No part of this product or document may be reproduced in any form by any means without prior written authorization of Sun and its licensors, if any. Third-party software, including font technology, is copyrighted and licensed from Sun suppliers. Parts of the product may be derived from Berkeley BSD systems, licensed from the University of California. UNIX is a registered trademark in the U.S. and other countries, exclusively licensed through X/Open Company, Ltd. Sun, Sun Microsystems, the Sun logo, docs.sun.com, AnswerBook, AnswerBook2, and Solaris are trademarks, registered trademarks, or service marks of Sun Microsystems, Inc. in the U.S. and other countries. All SPARC trademarks are used under license and are trademarks or registered trademarks of SPARC International, Inc. in the U.S. and other countries. Products bearing SPARC trademarks are based upon an architecture developed by Sun Microsystems, Inc. The OPEN LOOK and SunTM Graphical User Interface was developed by Sun Microsystems, Inc. for its users and licensees. Sun acknowledges the pioneering efforts of Xerox in researching and developing the concept of visual or graphical user interfaces for the computer industry. Sun holds a non-exclusive license from Xerox to the Xerox Graphical User Interface, which license also covers Sun’s licensees who implement OPEN LOOK GUIs and otherwise comply with Sun’s written license agreements.
    [Show full text]
  • Harmattan Font Features Guide
    Harmaan Features The Harmaan font includes a number of oponal features that provide alternave rendering that might be preferable for use in some contexts. The chart below enumerates the details of these features. Whether these features are available to users will depend on both the applicaon and the rendering technology (Graphite or OpenType) being used. Most features are available in both Graphite and OpenType, though there may be minor differences in their implementaon. Features When Harmaan is used in applicaons that support Graphite or OpenType, and that provide an appropriate user interface, various user-controllable font features are available allowing access to alternavely-designed glyphs. The table below gives a visual representaon of the featured character glyphs in the font. Note that within each feature the top-most value is the de- fault. The other lines show the first alternate and, if available, the second or third alternates. List of Graphite features and OpenType Character Variants Feature Name Feature ID Feature Setting (top-most Character Shapes Implementation in each section is default) Notes1 Alef diacritic placement cv02 0=Standard G,O,T (U+0623, U+0625, U+0627, U+064E, أإاَ ِاأإل ٔالإ (U+0650, U+0654, U+0655 1=Hamza touching أإاَ ِاأإل ٔالإ 2=Touching أإَاِاأإل ٔالإ Jeem/Hah cv08 0=Standard G,O,T (U+062C, U+062D, U+062E, U+0682, U+0683, U+0684, U+0685, U+0686, ج ججج ح ححح خ خخخ ڂ ڂڂڂ ڃ ڃڃڃ ڄ ڄڄڄ څ څڅڅ چ ,U+06BF, U+0757, U+0758, U+08A2 چچچ ڿ ڿڿڿ U+08C1, U+08C5, U+08C6) ࣆࣆࣆ ࣆ ࣅࣅࣅ ࣅ ࣁࣁࣁ ࣁ ࢢࢢࢢ ࢢ ݘݘݘ ݘ ݗݗݗ ݗ 1=Handwritten ج ججج ح ححح خ خخخ ڂ ڂڂڂ ڃ ڃڃڃ ڄ ڄڄڄ څ څڅڅ چ چچچ ڿ ڿڿڿ ࣆࣆࣆ ࣆ ࣅࣅࣅ ࣅ ࣁࣁࣁ ࣁ ࢢࢢࢢ ࢢ ݘݘݘ ݘ ݗݗݗ ݗ 1 Implementation Notes: G=Implemented in Graphite; O=Implemented in OpenType; T=Implemented in TypeTuner (command line version: http://scripts.sil.org/TypeTuner and web-based version: http://scripts.sil.org/ttw) .
    [Show full text]
  • Integrated Issues Report)
    The IDN Variant Issues Project A Study of Issues Related to the Management of IDN Variant TLDs (Integrated Issues Report) 20 February 2012 The IDN Variant Issues Project: A Study of Issues Related to the Management of IDN Variant TLDs 20 February 2012 Contents Executive Summary…………………………………………………………………………………………………………….. 6 1 Overview of this Report………………………………………………………………………………………………. 9 1.1 Fundamental Assumptions…………………………………………………………………………………. 10 1.2 Variants and the Current Environment………………………………………………………………. 12 2 Project Overview……………………………………………………………………………………………………….. 16 2.1 The Variant Issues Project…………………………………………………………………………………… 16 2.2 Objectives of the Integrated Issues Report………………………………………………………… 17 2.3 Scope of the Integrated Issues Report………………………………………………………………… 18 3 Range of Possible Variant Cases Identified………………………………………………………………… 19 3.1 Classification of Variants as Discovered……………………………………………………………… 19 3.1.1 Code Point Variants……………………………………………………………………………………. 20 3.1.2 Whole-String Variants………………………………………………………………………………… 20 3.2 Taxonomy of Identified Variant Cases……………………………………………………………….. 21 3.3 Discussion of Variant Classes……………………………………………………………………………… 28 3.4 Visual Similarity Cases……………………………………………………………………………………….. 33 3.4.1 Treatment of Visual Similarity Cases………………………………………………………….. 33 3.4.2 Cross-Script Visual Similarity ……………………………………………………………………….34 3.4.3 Terminology concerning Visual Similarity ……………………………………………………35 3.5 Whole-String Issues …………………………………………………………………………………………….36 3.6 Synopsis of Issues ……………………………………………………………………………………………….39
    [Show full text]
  • Proposal for Encoding Book Pahlavi in the Unicode Standard
    Proposal for Encoding Book Pahlavi in the Unicode Standard Version 1.1 Abe Meyers1 May 5, 2014 1abraham DOT meyers AT orientology DOT ca Foreword The purpose of this document is two-fold. First, to propose the inclusion of the Book Pahlavi script characters in the Unicode Standard, and second, to outline and address some of the shortcomings of the previous proposed encoding models [7, 20], in particular the more recent of the two [20]. The previous proposal admits that it “does not attempt to solve the multi-layer and complex problems of properly and completely representing text written in Book Pahlavi” [20, p.3]. The current proposal, on the other hand, proceeds to indeed solve those very problems. My first goal is to be able to uniquely and unambiguously represent Book Pahlavi texts in Unicode, without any loss of vital information in the transcoding process. To elaborate, first, a given shape standing for a Pahlavi word, should be able to be encoded only in one way to Unicode. Second, starting from the encoded word, only one shape must be able to be generated. Third, going through a round trip, i.e. encoding the word and then rendering it, the rendered shape and the original shape should contain the same amount of (relevant) information. In addition, I will encode the text in such a way that in the process of rendering it, no obligate ligature features and no obligate contextual shape changes, and in general no complex text layout (CTL) capabilities, are needed to get “minimum legible” text as defined by the Unicode Standard.
    [Show full text]
  • Migrating Java UI Client Applications to Openjdk 11 25Th October, 2018
    Migrating Java UI Client Applications to OpenJDK 11 25th October, 2018 Phil Race, Sergey Bylokhov, Java Platform Group Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | Safe Harbor Statement The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. The development, release, timing, and pricing of any features or functionality described for Oracle’s products may change and remains at the sole discretion of Oracle Corporation. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | Introduction • Lots of things have changed between 8u to 11 – For lots of different reasons. • Most desktop applications are still using JDK8u • Most applications will want to move to 11u and later soon as JDK 8 End Of Public Updates occurs for commercial and personal use. • This session is to help you understand what has changed that affects UI client applications, and what steps you can (or must) take to move your code to JDK 11 and later. Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | 3 Topics • OpenJDK and Oracle JDK • Desktop application migration issues in JDK11 • JavaFX removal from JDK11 • The deployment technologies removal from JDK11 Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | 4 Oracle JDK and OpenJDK • OpenJDK project launched by Sun Microsystems in 2007 • All OpenJDK source had to be under an Open Source license – GPL v2 (+/- ClassPath Exception) where possible.
    [Show full text]