Data Quality Guide Table of Contents

Total Page:16

File Type:pdf, Size:1020Kb

Data Quality Guide Table of Contents Spectrum™ Technology Platform Version 2019.1.0 Data Quality Guide Table of Contents Using an Express Match Key 107 Analyzing Match Results 112 1 - Getting Started Dataflow Templates for Matching 127 Introduction to Data Quality 5 5 - Deduplication 2 - Parsing Filtering Out Duplicate Records 136 Creating a Best of Breed Record 140 Introduction to Parsing 8 Defining Domain-Independent Parsing Grammars in Dataflows 9 6 - Exception Records Culture-Specific Parsing 10 Analyzing Parsing Results 36 Designing a Dataflow to Handle Exceptions 146 Parsing Personal Names 40 Designing a Dataflow for Real-Time Dataflow Templates for Parsing 41 Revalidation 147 3 - Standardization 7 - Lookup Tables Standardizing Terms 56 Introduction to Lookup Tables 151 Standardizing Personal Names 57 Data Normalization Module Tables 151 Templates for Standardization 59 Universal Name Module Tables 156 Viewing the Contents of a Lookup Table 158 Adding a Term to a Lookup Table 159 4 - Matching Removing a Term from a Lookup Table 159 Modifying the Standardized Form of a Term 159 Matching Terminology 62 Reverting Table Customizations 160 Standard Fields 63 Creating a Lookup Table 160 Techniques for Defining Match Keys 64 Importing Data 161 Match Rules 67 Matching Records from a Single Source 80 Matching Records from One Source to Another 8 - Stages Reference Source 86 Matching Records Between and Within Advanced Matching Module 165 Sources 92 Business Steward Module 226 Matching Records Against a Database 98 Data Normalization Module 271 Matching Records Using Multiple Match Rules 100 Universal Name Module 289 Creating a Universal Matching Service 104 9 - ISO Country Codes and Module Support ISO Country Codes and Module Support 324 Spectrum™ Technology Platform 2019.1.0 Data Quality Guide 3 1 - Getting Started In this section Introduction to Data Quality 5 Getting Started Introduction to Data Quality Data quality involves ensuring the accuracy, timeliness, completeness, and consistency of the data used by an organization so that the data is fit for use. Spectrum™ Technology Platform supports data quality initiatives by providing the following capabilities. Parsing Parsing is the process of analyzing a sequence of input characters in a field and breaking it up into multiple fields. For example, you might have a field called Name which contains the value "John A. Smith" and through parsing, you can break it up so that you have a FirstName field containing "John", a MiddleName field containing "A" and a LastName field containing "Smith." Standardization Standardization takes data of the same type and puts it in the same format. Some types of data that may be standardized include telephone numbers, dates, names, addresses, and identification numbers. For example, telephone numbers can be formatted to eliminate non-numeric characters such as parentheses, periods, or dashes. You should standardize your data before performing matching or deduplication activities since standardized data will be more accurately matched than data that is inconsistently formatted. Matching Matching is the process of identifying records that are related to each other in some way that is significant for your purposes. For example, if you are trying to eliminate redundant information from your customer data, you may want to identify duplicate records for the same customer; or, if you are trying to eliminate duplicate marketing pieces going to the same address, you may want to identify records of customers that live in the same household. Deduplication Deduplication identifies records that represent one entity but for one reason or another were entered into the system multiple times, sometimes with slightly different data. For example, your system may contain vendor information from different departments in your organization, with each department using a different vendor ID for the same vendor. Using Spectrum™ Technology Platform you can consolidate these records into a single record for each vendor. Review of Exception Records In some cases you may have data that cannot be confidently processed automatically and that must be reviewed by a knowledgeable data steward. Some examples of records that may require manual review include: • Address verification failures Spectrum™ Technology Platform 2019.1.0 Data Quality Guide 5 Getting Started • Geocoding failures • Low-confidence matches • Merge/consolidation decisions The Business Steward Module is a set of features that allow you to identify and resolve exception records. Spectrum™ Technology Platform 2019.1.0 Data Quality Guide 6 2 - Parsing In this section Introduction to Parsing 8 Defining Domain-Independent Parsing Grammars in Dataflows 9 Culture-Specific Parsing 10 Analyzing Parsing Results 36 Parsing Personal Names 40 Dataflow Templates for Parsing 41 Parsing Introduction to Parsing Parsing is the process of analyzing a sequence of input characters in a field and breaking it up into multiple fields. For example, you might have a field called Name which contains the value "John A. Smith" and through parsing, you can break it up so that you have a FirstName field containing "John", a MiddleName field containing "A" and a LastName field containing "Smith." To create a dataflow that parses, use the Open Parser stage. Open Parser allows you to write parsing rules called grammars. A grammar is a set of expressions that map a sequence of characters to a set of named entities called domain patterns. A domain pattern is a sequence of one or more tokens in your input data that you want to represent as a data structure, such as name, address, or account numbers. A domain pattern can consist of any number of tokens that can be parsed from your input data. A domain pattern is represented in the parsing grammar as the <root> expression. Input data often contains such tokens in hard-to-use or mixed formats. For example: • Your input data contains names in a single field that you want to separate into given name and family name. • Your input data contains addresses from several cultures and you want to extract address data for a specific culture only. • Your input data includes free-form text that contains embedded email addresses and you want to extract email addresses and match them up with personal data and store them in a database. There are two kinds of grammars: culture specific and domain independent. A culture-specific parsing grammar is associated with a culture and/or language (such as English, Canadian English, Spanish, Mexican Spanish, and so on) and a particular type of data (phone numbers, personal names, and so on). When an Open Parser stage is configured to perform culture-specific parsing, each culture's parsing grammar is applied to each record. The grammar with the best parser score (or the first one to have a score of 100) is the one whose results are returned. Alternatively, culture-specific parsing grammars can use the value in the input record's CultureCode field and process the data according to the culture settings contained in the culture's parsing grammar. Culture-specific parsing grammars can inherit properties from a parent. A domain-independent parsing grammar is not associated with either a language or a particular type of data. Domain-independent parsing grammars do not inherit properties from a parent and ignore any CultureCode information in the input data. Open Parser analyzes a sequence of characters in input fields and categorizes them into a sequence of tokens through a process called tokenization. Tokenization is the process of delimiting and classifying sections of a string of input characters into a set of tokens based on separator characters (also called tokenizing characters), such as space, hyphen, and others. The tokens are then placed into output fields you specify. The following diagram illustrates the process of creating a parsing grammar: Spectrum™ Technology Platform 2019.1.0 Data Quality Guide 8 Parsing Defining Domain-Independent Parsing Grammars in Dataflows To define domain-independent parsing grammars in a dataflow: 1. In Enterprise Designer, add an Open Parser stage to your dataflow. 2. Double-click the Open parser stage on the canvas. 3. Click Define Domain Independent Grammar on the Rules tab. 4. Use the Grammar Editor to create the grammar rules. You can type commands and variables into the text box or use the commands provided in the Commands tab. For more information, see Grammars on page 27. Spectrum™ Technology Platform 2019.1.0 Data Quality Guide 9 Parsing 5. To cut, copy, paste, and find and replace text strings in your parsing grammar, right-click in the Grammar Editor and select the appropriate command. 6. To check the parsing grammar you have created, click Validate. The validate feature lists any errors in your grammar syntax, including the line and column where the error occurs, a description of the error, and the command name or value that is causing the error. 7. Click the Preview tab to test the parsing grammar. 8. When you are finished creating your parsing grammar, click OK. Culture-Specific Parsing Defining a Culture-Specific Parsing Grammar A culture-specific parsing grammar allows you to specify different parsing rules for different languages and cultures. This allows you to parse data from different countries in a single Open Parser stage, for example phone numbers from the United States and phone numbers from the United Kingdom. By default, each input record is parsed using each culture's parsing grammar, in the order specified in the Open Parser stage. You can also add a CultureCode field to the input records if you want a specific culture's parsing grammar to be used for that record. For more information, see Assigning a Parsing Culture to a Record on page 12. Note: If you want to create a domain-independent parsing grammar, see Defining Domain-Independent Parsing Grammars in Dataflows on page 9. 1. In Enterprise Designer, go to Tools > Open Parser Domain Editor.
Recommended publications
  • Technical Reference Manual for the Standardization of Geographical Names United Nations Group of Experts on Geographical Names
    ST/ESA/STAT/SER.M/87 Department of Economic and Social Affairs Statistics Division Technical reference manual for the standardization of geographical names United Nations Group of Experts on Geographical Names United Nations New York, 2007 The Department of Economic and Social Affairs of the United Nations Secretariat is a vital interface between global policies in the economic, social and environmental spheres and national action. The Department works in three main interlinked areas: (i) it compiles, generates and analyses a wide range of economic, social and environmental data and information on which Member States of the United Nations draw to review common problems and to take stock of policy options; (ii) it facilitates the negotiations of Member States in many intergovernmental bodies on joint courses of action to address ongoing or emerging global challenges; and (iii) it advises interested Governments on the ways and means of translating policy frameworks developed in United Nations conferences and summits into programmes at the country level and, through technical assistance, helps build national capacities. NOTE The designations employed and the presentation of material in the present publication do not imply the expression of any opinion whatsoever on the part of the Secretariat of the United Nations concerning the legal status of any country, territory, city or area or of its authorities, or concerning the delimitation of its frontiers or boundaries. The term “country” as used in the text of this publication also refers, as appropriate, to territories or areas. Symbols of United Nations documents are composed of capital letters combined with figures. ST/ESA/STAT/SER.M/87 UNITED NATIONS PUBLICATION Sales No.
    [Show full text]
  • A Könyvtárüggyel Kapcsolatos Nemzetközi Szabványok
    A könyvtárüggyel kapcsolatos nemzetközi szabványok 1. Állomány-nyilvántartás ISO 20775:2009 Information and documentation. Schema for holdings information 2. Bibliográfiai feldolgozás és adatcsere, transzliteráció ISO 10754:1996 Information and documentation. Extension of the Cyrillic alphabet coded character set for non-Slavic languages for bibliographic information interchange ISO 11940:1998 Information and documentation. Transliteration of Thai ISO 11940-2:2007 Information and documentation. Transliteration of Thai characters into Latin characters. Part 2: Simplified transcription of Thai language ISO 15919:2001 Information and documentation. Transliteration of Devanagari and related Indic scripts into Latin characters ISO 15924:2004 Information and documentation. Codes for the representation of names of scripts ISO 21127:2014 Information and documentation. A reference ontology for the interchange of cultural heritage information ISO 233:1984 Documentation. Transliteration of Arabic characters into Latin characters ISO 233-2:1993 Information and documentation. Transliteration of Arabic characters into Latin characters. Part 2: Arabic language. Simplified transliteration ISO 233-3:1999 Information and documentation. Transliteration of Arabic characters into Latin characters. Part 3: Persian language. Simplified transliteration ISO 25577:2013 Information and documentation. MarcXchange ISO 259:1984 Documentation. Transliteration of Hebrew characters into Latin characters ISO 259-2:1994 Information and documentation. Transliteration of Hebrew characters into Latin characters. Part 2. Simplified transliteration ISO 3602:1989 Documentation. Romanization of Japanese (kana script) ISO 5963:1985 Documentation. Methods for examining documents, determining their subjects, and selecting indexing terms ISO 639-2:1998 Codes for the representation of names of languages. Part 2. Alpha-3 code ISO 6630:1986 Documentation. Bibliographic control characters ISO 7098:1991 Information and documentation.
    [Show full text]
  • Version 13.0 Release Notes
    Release Notes The tool of thought for expert programming Version 13.0 Dyalog is a trademark of Dyalog Limited Copyright 1982-2011 by Dyalog Limited. All rights reserved. Version 13.0 First Edition April 2011 No part of this publication may be reproduced in any form by any means without the prior written permission of Dyalog Limited. Dyalog Limited makes no representations or warranties with respect to the contents hereof and specifically disclaims any implied warranties of merchantability or fitness for any particular purpose. Dyalog Limited reserves the right to revise this publication without notification. TRADEMARKS: SQAPL is copyright of Insight Systems ApS. UNIX is a registered trademark of The Open Group. Windows, Windows Vista, Visual Basic and Excel are trademarks of Microsoft Corporation. All other trademarks and copyrights are acknowledged. iii Contents C H A P T E R 1 Introduction .................................................................................... 1 Summary........................................................................................................................... 1 System Requirements ....................................................................................................... 2 Microsoft Windows .................................................................................................... 2 Microsoft .Net Interface .............................................................................................. 2 Unix and Linux ..........................................................................................................
    [Show full text]
  • Analysis of Mutual Influence of Music and Text in Svan Songs
    Analysis of mutual influence of music and text in Svan songs Nana Mzhavanadze Madona Chamgeliani University of Potsdam Lidbashi Foundation [email protected] [email protected] ABSTRACT stand and, secondly, due to the phonetic reservoir (es- pecially vowels in upper Svaneti) of Svans being rich The present paper discusses the influence of musicological, lin- and often having grammatical function. This raises a guistic, and ethnological aspects on the music-text relation in question about phonetic events being of musical or Svan songs. In a lot of cases, there is a deep bond between verbal linguistic origin. texts and their musical counterparts (Mzhavanadze & Chamgeliani, 2015). Despite the fact that the lyrics of songs are • In contradiction to the hypothesis of some authors that of critical importance to the rituals in which they are performed, Svan songs are believed to be “song-poems“ many of them are difficult (at times impossible) to transcribe. The (Shanidze et al., 1939), the texts often do not show present analysis shows that the reservoir of Svan melos is rela- high integrity with the music as they are sung. tively modest in comparison with the verbal texts. In songs in which musical patterns repeat and texts alter, the latter get modi- This suggests that in order to understand Svans’ musical fied and often distorted to the degree to make them incomprehen- grammar, the interrelationship between music and verbal sible. This is also true for texts of pre-Christian origin, which has texts cannot be ignored. The presented paper touches on interesting ethnological consequences. The results of the current the issues what role either play in the forming process of study challenge the isolated linguistic interpretation of verbal texts of Svan songs and emphasize the need for a joint analysis a musical (artistic) image.
    [Show full text]
  • Georgian Romanization
    Georgian Values are shown for the older Khutsuri and the modern Mkhedruli alphabets. There are no upper case letters in Mkhedruli. Upper case letters Lower case letters Khutsuri Romanization Khutsuri Mkhedruli Romanization Ⴀ A ⴀ ა a Ⴁ B ⴁ ბ b Ⴂ G ⴂ ზ g Ⴃ D ⴃ დ d Ⴄ E ⴄ ე e Ⴅ V ⴅ ქ v Ⴆ Z ⴆ ჩ z Ⴇ Tʻ ⴇ უ tʻ Ⴈ I ⴈ ი i Ⴉ K ⴉ ჯ k Ⴊ L ⴊ მ l Ⴋ M ⴋ ნ m Ⴌ N ⴌ ო n Ⴍ O ⴍ პ o Ⴎ P ⴎ ჰ p Ⴏ Ž ⴏ შ ž Ⴐ R ⴐ ს r Ⴑ S ⴑ ტ s Ⴒ T ⴒ ჳ t Ⴓ U ⴓ ფ u Ⴔ Pʻ ⴔ ჟ pʻ Ⴕ Kʻ ⴕ ლ kʻ Ⴖ Ġ ⴖ ჭ ġ Ⴗ Q ⴗ ჱ q Ⴘ Š ⴘ ჲ š Ⴙ Čʻ ⴙ ძ čʻ Ⴚ Cʻ ⴚ გ cʻ Ⴛ Ż ⴛ ჴ ż Ⴜ C ⴜ ც c Ⴝ Č ⴝ წ č Ⴞ X ⴞ ყ x Ⴟ J ⴟ ხ j Ⴠ H ⴠ თ h Ⴡ Ē ⴡ ჵ ē Ⴢ Y ⴢ კ y Ⴤ X ⴤ რ x Ⴥ Ō ⴥ ჶ ō Ⴣ W ⴣ ღ w ვ f ჷ ĕ ჸ ʻ (ayn) , ŭ Transliteration of Georgian 1/3 GEORGIAN Georgian script* ISO 9984 National IKE ALA-LC KNAB BGN/PCGN TITUS 1996(1.0) 2002(2.0) 1974(3.0) 1997(4.0) 1993(5.0) 1981(6.0) 2000(7.0) 01 Ⴀ ა a a a a a a a 02 Ⴁ ბ b b b b b b b 03 Ⴂ გ g g g g g g g 04 Ⴃ დ d d d d d d d 05 Ⴄ ე e e e e e e e 06 Ⴅ ვ v v v v v v v 07 Ⴆ ზ z z z z z z z 08 Ⴡ ჱ a ē — ē / ey ē — (ey) (ē) 09 Ⴇ თ t’ t t tʻ t tʼ t 10 Ⴈ ი i i i i i i i 11 Ⴉ კ k k’ ḳ k ķ k ḳ 12 Ⴊ ლ l l l l l l l 13 Ⴋ მ m m m m m m m 14 Ⴌ ნ n n n n n n n 15 Ⴢ ჲ a y — y y — (j) — 16 Ⴍ ო o o o o o o o 17 Ⴎ პ p p’ ṗ p ṗ p ṗ 18 Ⴏ ჟ ž zh ž z̆ ž zh ž 19 Ⴐ რ r r r r r r r 20 Ⴑ ს s s s s s s s 21 Ⴒ ტ t t’ ṭ t ţ t ṭ 22 Ⴣ ჳ a w — wi / ü w — — — 23 Ⴓ უ u u u u u u u 24 Ⴔ ფ pʼ p p pʻ p pʼ p 25 Ⴕ ქ kʼ k k kʻ k kʼ k 26 Ⴖ ღ ḡ gh ǧ / ɣ ġ ǧ gh ġ 27 Ⴗ ყ q q’ q̇ q q q q̇ 28 Ⴘ შ š sh š š š sh š 29 Ⴙ ჩ čʼ ch č čʻ č chʼ č 30 Ⴚ ც cʼ ts c cʻ c tsʼ c 31 Ⴛ ძ j dz j / ʒ ż dz dz ʒ 32 Ⴜ წ c ts’ c̣ c ç ts c̣ 33 Ⴝ ჭ č ch’ č̣ č ç̌ ch č̣ 34 Ⴞ ხ x kh x x x kh χ 35 Ⴤ ჴ a ẖ — q x̣ — (qʼ) q 36 Ⴟ ჯ ǰ j ǰ / ǯ j dž j ǯ 37 Ⴠ ჰ h h h h h h h 38 Ⴥ ჵ a ō — ō / ow ō — — (ō) Thomas T.
    [Show full text]
  • Svan Funeral Dirges (Zär): Musical Acoustical Analysis of a New Collection of Field Recordings
    Trabzon University State Conservatory © 2017-2020 Volume 4 Issue 2 December 2020 Research Article Musicologist 2020. 4 (2): 138-167 DOI: 10.33906/musicologist.782094 FRANK SCHERBAUM Universität Potsdam, Germany [email protected] orcid.org/0000-0002-5050-7331 NANA MZHAVANADZE Universität Potsdam, Germany [email protected] orcid.org/0000-0001-5726-1656 Svan Funeral Dirges (Zär): Musical Acoustical Analysis of a New Collection of Field Recordings ABSTRACT This paper is a companion paper to Mzhavanadze & Scherbaum (2020). KEYWORDS Jointly, the two papers describe the results of an interdisciplinary study of three-voiced Svan funeral dirges, known as zär in Svan and zari in Traditional Georgian Georgian. In the present paper, to which we refer as paper 1, we analyze Vocal Music the musical acoustical properties of a new set of field recordings Computational collected during an ethnomusicological field expedition to Georgia in Ethnomusicology 2016. The aim of the study is to investigate the tonal organization of eleven different performances of six different variants of zär, performed Ethnomusicological by singers from different villages. For some of the performances, we Field Recordings observe a strong gradual pitch rise of up to 100 cents per minute. The intra-variant differences in the performances of different groups of singers were observed to be remarkably different, including the use of significantly different harmonic tuning systems. In contrast, two subsequent performances of the Mest’ia variant of zär by a group of singers recorded in Zargǟsh were essentially identical. This demonstrates the widespread absence of improvisational elements in these two performances. One of the most interesting results of our analysis is the observation that the musical structure of zär, expressed, for example, in its ambitus, the complexity of its melodic progression, and its harmonic chord inventory, change systematically along the course of the Enguri valley.
    [Show full text]
  • 4 Issue: 2 December 2020 Masthead
    Trabzon University e-ISSN: 2618-5652 State Conservatory Vol: 4 Issue: 2 December 2020 Masthead Musicologist: International Journal of Music Studies Volume 4 Issue 2 December 2020 Musicologist is a biannually, peer-reviewed, open access, online periodical published in English by Trabzon University State Conservatory, in Trabzon, Turkey. e-ISSN: 2618-5652 Owner on behalf of Trabzon University State Conservatory Merve Eken KÜÇÜKAKSOY (Director) Editor-In-Chief Abdullah AKAT (İstanbul University – Turkey) Deputy Editor Merve Eken KÜÇÜKAKSOY (Trabzon University – Turkey) Technical Editor Emrah ERGENE (Trabzon University – Turkey) Language Editor Marina KAGANOVA (Colombia University – USA) Editorial Assistant Uğur ASLAN (Trabzon University – Turkey) Contacts Address: Trabzon Üniversitesi Devlet Konservatuvarı Müdürlüğü, Fatih Kampüsü, Söğütlü 61335 Akçaabat/Trabzon, Turkey Web: www.musicologistjournal.com Email: [email protected] All rights reserved. The authors are responsible for all visual elements, including tables, figures, graphics and pictures. They are also responsible for any scholarly citations. Trabzon University does not assume any legal responsibility for the use of any of these materials. ©2017-2020 Trabzon University State Conservatory Editorial Board Alper Maral Ankara Music and Fine Arts University – Turkey Caroline Bithell The University of Manchester – UK Ekaterine Diasamidze Tbilisi State University – Georgia Elif Damla Yavuz Mimar Sinan Fine Arts University – Turkey Erol Köymen The University of Chicago – USA
    [Show full text]
  • CEN WORKSHOP AGREEMENT CWA 13873:2000 Multilingual
    CEN WORKSHOP AGREEMENT CWA 13873:2000 2000-03-01 English version Information technology – Multilingual European Subsets in ISO/IEC 10646-1 Technologies de l’information – Informationstechnologie – Jeux partiels européens multilingues Mehrsprachige europäische Untermengen dans l’ISO/CEI 10646-1 in ISO/IEC 10646-1 This CEN Workshop Agreement has been drafted and approved by a Workshop of representatives of interested parties, whose names and affiliations can be obtained from the CEN/ISSS Secretariat. The formal process followed by the Workshop in the development of this Workshop Agreement has been endorsed by the National Members of CEN, but neither the National Members of CEN nor the CEN Central Secretariat can be held accountable for the technical content of this CEN Workshop Agreement or for possible conflicts with standards or legislation. This CEN Workshop Agreement can in no way be held as being an official standard developed by CEN and its Members. This CEN Workshop Agreement is publicly available, as a reference document, from the CEN Members National Standard Bodies. CEN Members are the National Standards Bodies of Austria, Belgium, the Czech Republic, Denmark, Finland, France, Germany, Greece, Iceland, Ireland, Italy, Luxembourg, the Netherlands, Norway, Portugal, Spain, Sweden, Switzerland and the United Kingdom. CEN EUROPEAN COMMITTEE FOR STANDARDIZATION COMITÉ EUROPÉEN DE NORMALISATION EUROPÄISCHES KOMITEE FÜR NORMUNG Central Secretariat: rue de Stassart 36, B-1050 Brussels © CEN 2000 All rights of exploitation in any form and by any means reserved worldwide for CEN national Members Ref.No. CWA 13873:2000 E Page 2 CWA 13873:2000 Contents Foreword 3 Introduction 4 1. Scope 5 2.
    [Show full text]
  • The Wili Benchmark Dataset for Written Natural Language Identification
    1 The WiLI benchmark dataset for written II. WRITTEN LANGUAGE BASICS language identification A language is a system of communication. This could be spoken language, written language or sign language. A spoken language Martin Thoma can have several forms of written language. For example, the E-Mail: [email protected] spoken Chinese language has at least three writing systems: Traditional Chinese characters, Simplified Chinese characters and Pinyin — the official romanization system for Standard Chinese. Abstract—This paper describes the WiLI-2018 benchmark dataset for monolingual written natural language identification. Languages evolve. New words like googling, television and WiLI-2018 is a publicly available,1 free of charge dataset of Internet get added, but written languages are also refactored. short text extracts from Wikipedia. It contains 1000 paragraphs For example, the German orthography reform of 1996 aimed at of 235 languages, totaling in 235 000 paragraphs. WiLI is a classification dataset: Given an unknown paragraph written in making the written language simpler. This means any system one dominant language, it has to be decided which language it which recognizes language and any benchmark needs to be is. adapted over time. Hence WiLI is versioned by year. Languages do not necessarily have only one name. According to Wikipedia, the Sranan language is also known as Sranan Tongo, Sranantongo, Surinaams, Surinamese, Surinamese Creole and I. INTRODUCTION Taki Taki. This makes ISO 369-3 valuable, but not all languages are represented in ISO 369-3. As ISO 369-3 uses combinations The identification of written natural language is a task which of 3 Latin letters and has 547 reserved combinations, it can appears often in web applications.
    [Show full text]
  • Proposal for the Georgian Script Root Zone LGR
    Proposal for the Georgian Script Root Zone LGR LGR Version 2 Date: 2016-11-24 Document version: 1.1 Authors: GEORGIAN SCRIPT GENERATION PANEL 1 General Information/ Overview/ Abstract The purpose of this document is to give an overview of the proposed Georgian LGR in the XML format and the rationale behind the design decisions taken. It includes a discussion of relevant features of the script, the communities or languages using it, the process and methodology used and information on the contributors. The formal specification of the LGR can be found in the accompanying XML document: • Proposed-LGR-Georgian-20160915.xml Labels for testing can be found in the accompanying text document: • Labels-GeorgianScript-20160915.txt 2 Script for which the LGR is proposed ISO 15924 Code: Geor ISO 15924 Key N°: 240 ISO 15924 English Name: Georgian Mkhedruli Latin transliteration of native script name: Mkhedruli Native name of the script: მხედრული Maximal Starting Repertoire (MSR) version: MSR-2 Proposal for a Georgian Script Root Zone LGR Georgian Script GP 3 Background on Script and Principal Languages Using It The Georgian scripts are the three writing systems used to write the Georgian language: Asomtavruli, Nuskhuri and Mkhedruli. Mkhedruli (Georgian: მხედრული) is the current Georgian script and is therefore the standard script for modern Georgian and its related Kartvelian languages, whereas Asomtavruli and Nuskhuri are used only in ceremonial religious texts and iconography. In the following, the term Georgian script is used synonymously with Mkhedruli. Like the two other scripts, Mkhedruli is purely unicameral. Mkhedruli first appears in the 10th century - the oldest Mkhedruli inscription found is dated back to 982 AD.
    [Show full text]
  • Ukraine: Meat Sector Review
    Ukraine – Meat sector review – Meat sector Ukraine Ukraine Meat sector review FAO INVESTMENT CENTRE COUNTRY HIGHLIGHTS Please address questions and comments to: Investment Centre Division Food and Agriculture Organization of the United Nations (FAO) Viale delle Terme di Caracalla – 00153 Rome, Italy Report No. 14 Report [email protected] http://www.fao.org/investment/en Ukraine: Meat sector review Report No. 14 - February 2014 I3532E/1/11.13 FAO INVESTMENT CENTRE Ukraine Meat sector review Andriy Yarmak Economist, Investment Centre Division, FAO Elizaveta Svyatkivska Meat Market Analyst Dmitry Prikhodko Economist, Investment Centre Division, FAO COUNTRY HIGHLIGHTS prepared under the FAO/EBRD Cooperation FOOD AND AGRICULTURE ORGANIZATION OF THE UNITED NATIONS Rome, 2014 The designations employed and the presentation of material in this information product do not imply the expression of any opinion whatsoever on the part of the Food and Agriculture Organization of the United Nations (FAO) or the European Bank for Reconstruction and Development (EBRD) concerning the legal or development status of any country, territory, city or area or of its authorities, or concerning the delimitation of its frontiers or boundaries. The mention of specific companies or products of manufacturers, whether or not these have been patented, does not imply that these have been endorsed or recommended by FAO or the EBRD in preference to others of a similar nature that are not mentioned. The views expressed in this information product are those of the author(s) and do not necessarily reflect the views or policies of FAO or the EBRD. © FAO 2014 FAO encourages the use, reproduction and dissemination of material in this information product.
    [Show full text]
  • Fare Un Libro.Pdf
    Regione siciliana Assessorato dei beni culturali e dell’identità siciliana Dipartimento dei beni culturali e dell’identità siciliana CARLO PASTENA Sausage Sausage Sausage Sausage FARE UN LIBRO PALERMO 2013 Si ringraziano per la collaborazione i colleghi del CRICD, e in particolare: Mi- chele Di Dio, Paolo Gambino, Sebastiano Gambino, Angela Genovese, Ro- salia La Mendola, Fabio Militello, Pierantonio Passante e Paolo Tuzzolino. La copertina è di Fabio Militello. I disegni delle pp. 42-44 sono tratti da: P. Prévôt e F. Roher, Cours d’indu- stries graphiques. Techniques d’impression, Paris, Eyrolles, 2008, pp. 66-67. © Regione siciliana. Assessorato dei beni culturali e dell’identità siciliana. Dipartimento dei beni culturali e dell’identità siciliana. CRICD. Pastena, Carlo <1955-> Fare un libro / Carlo Pastena. – Palermo : CRICD, 2012. 1. Editoria. 070.50945 CDD-22 SBN Pal0241192 CIP - Biblioteca centrale della Regione siciliana “Alberto Bombace” Negli ultimi anni la produzione editoriale dell’Assessorato regionale dei beni culturali è considerevolmente aumentata, raggiungendo una media di cinquanta nuovi titoli ogni anno. A fronte di questa considerevole attività, molti Istituti periferici del Dipartimento regionale dei beni culturali hanno più volte manifestato l’esigenza di un’opera che, pur sinteticamente, introducesse non solo al DTP (Deskstop Publishing) cioè alle tecniche d’impaginazione e stampa, ma anche alle problematiche connesse al deposito obbligatorio, al diritto d’autore, alle norme fiscali in materia di editoria, ecc. Rispondendo a questa sempre più frequente richiesta, questo Centro regionale per l’inventario, la catalogazione e la documentazione, nell’adempimento dei suoi compiti istituzionali (l.r. 80/77 art. 9, lett. c.), intende così fornire a chi si accinge a pubblicare un’opera, alcune informazioni su come preparare un testo da dare in tipografia, sui principali tipi di carta utilizzate per la stampa, ecc.
    [Show full text]