How to Use Non-English Texts*

Total Page:16

File Type:pdf, Size:1020Kb

How to Use Non-English Texts* How to Use Non-English Texts* English is the main language being used for the preparation of the papers in the journal. Occasionally, non-English symbols, abbreviations, words, phrases, sayings, sentences, paragraphs, notes or references could be included on a case-by-case basis. The non-English texts should appear in italic style to be easily recognized within the textual content. It is recommended that a proper translation into English is provided for such textual inclusions so that ambiguity is reduced to a minimum. Latin Language Latin language is used frequently in technical writings for historical reasons. Some examples of typical phrases which do not need translation due to their frequent use in English are provided below. Latin phrases: ad hoc, for this [purpose]; a posteriori, from the latter; a priori, from the former; in situ, in the place; in vitro, in glass; in vivo, in a living thing. Abbreviated Latin phrases: idem quod (i.q.) the same as; idem (id.), the same; id est (i.e.), that is; in illo ordine (i.o.), in that order; exempli gratia (e.g.), for example; ibidem (ibid.), in the same place; et cetera (etc.), and so forth. Other Latin phrases which are used less frequently or only in specific fields may need translation for clarity: in esse, in existence; in foro, in forum; in hunc effectum, for this purpose; in posse, in potential; in re, in the matter [of]. Latin sayings must be translated for completeness: Labor omnia vincit. Work conquers all things. Pax melior est quam iustissimum bellum. Peace is better than the most just war. Qui scribit, bis legit. Who writes, reads twice. Greek Language The Greek alphabet: Αα, Ββ, Γγ, Δδ, Εε, Ζζ, Ηη, Θθ, Ιι, Κκ, Λλ, Μμ, Νν, Ξξ, Οο, Ππ, Ρρ, Σσ, Ττ, Υυ, Φφ, Χχ, Ψψ, Ωω (Alpha, Beta, Gamma, Delta, Epsilon, Zeta, Eta, Theta, Iota, Kappa, Lambda, Mu, Nu, Xi, Omicron, Pi, Rho, Sigma, Tau, Upsilon, Phi, Chi, Psi, Omega), is used for technical notations and the English representation of the Greek symbols is not needed but can prove useful sometimes. Some Greek phrases are used occasionally followed by translation into English: Εὕρηκα! Eureka! French Language Many French words and phrases are used in English texts, for example: à la [...], in the manner of; à propos, regarding; au contraire, on the contrary; café, a coffee shop; cliché, a stereotype; déjà vu, already seen; en route, on the way; façade, front view; par avion, by air mail; par excellence, by excellence; pas de problème, no problem; vis-à-vis, face to face. The proper diacritics and ligatures should be included in the French content. * By Dobri Atanassov Batovski, Deputy Editor, AU J.T. (Continued on inside back cover) (Continued from inside front cover) German Language Specific texts from German language containing umlauts and sharp s: Ää, Öö, Üü, ß, should be properly written and subsequently translated into English. Optional Use of Languages Other than English in the References The prospective authors are encouraged to locate the origins of known results used in their scholarly works. Many studies are initially published in non-English media and some of the said studies are still unavailable in English. It is preferable to include references to such studies with a proper English translation in each case of the name(s) of the author(s), the title of the paper, and the information about the publisher. Optional representations of said references in the original language of publication are allowed as illustrated in the sample references below. Liapounoff, A.M. 1901. A new form of the theorem on probability limit. Proc. Imperial Academy of Sciences in St. Petersburg, Physico-mathematical Series VIII 12(5):1- 24. In French: Liapounoff, A.M. 1901. Nouvelle forme du théoréme sur la limite de probabilitès. Mémoires de l'Academie Impériale des Sciences de St. Pétersbourg, Classe Physico-mathématique VIII 12(5):1-24. Pukhov, G.E. 1980. Differential transforms of functions and equations. Naukova Dumka, Kiev, USSR. In Russian: Пухов, Г.Е. 1980. Дифференциальные преобразования функций и уравнений. Наукова думка, Киев, СССР. Use of Texts Written in Complex Writing Systems Partial inclusion of symbols from fonts for complex scripts, right-to-left languages, and East- Asian languages is validated whenever the submitted paper deals with specific technical topics such as optical character recognition (OCR), voice recognition, machine translation, etc. Use of Texts Written in Ancient or Artificial Languages The decipherment of ancient texts or the generation of artificial languages would require the processing of symbols which have no equivalent in the existing standard fonts. Therefore, the writing system under consideration should be described as a set of distinct images. Useful direct or indirect relations between the studied symbols and the English letters/words are needed where possible. Textual segments included as images must be of good quality. The original text could be a photographed part of a deteriorated ancient sample such as papyrus, carving, painting, textile, etc. Then both graphical enhancements and sketches of the textual content should be shown in separate images. Analysis of Statements Written in Languages Other than English In some cases, statements written in different languages by different authors are a subject of comparison and/or criticism. Then the original texts should be included in a comparison table together with the corresponding English translation. .
Recommended publications
  • The Festvox Indic Frontend for Grapheme-To-Phoneme Conversion
    The Festvox Indic Frontend for Grapheme-to-Phoneme Conversion Alok Parlikar, Sunayana Sitaram, Andrew Wilkinson and Alan W Black Carnegie Mellon University Pittsburgh, USA aup, ssitaram, aewilkin, [email protected] Abstract Text-to-Speech (TTS) systems convert text into phonetic pronunciations which are then processed by Acoustic Models. TTS frontends typically include text processing, lexical lookup and Grapheme-to-Phoneme (g2p) conversion stages. This paper describes the design and implementation of the Indic frontend, which provides explicit support for many major Indian languages, along with a unified framework with easy extensibility for other Indian languages. The Indic frontend handles many phenomena common to Indian languages such as schwa deletion, contextual nasalization, and voicing. It also handles multi-script synthesis between various Indian-language scripts and English. We describe experiments comparing the quality of TTS systems built using the Indic frontend to grapheme-based systems. While this frontend was designed keeping TTS in mind, it can also be used as a general g2p system for Automatic Speech Recognition. Keywords: speech synthesis, Indian language resources, pronunciation 1. Introduction in models of the spectrum and the prosody. Another prob- lem with this approach is that since each grapheme maps Intelligible and natural-sounding Text-to-Speech to a single “phoneme” in all contexts, this technique does (TTS) systems exist for a number of languages of the world not work well in the case of languages that have pronun- today. However, for low-resource, high-population lan- ciation ambiguities. We refer to this technique as “Raw guages, such as languages of the Indian subcontinent, there Graphemes.” are very few high-quality TTS systems available.
    [Show full text]
  • Sanskrit Dictionary
    Online Sanskrit Dictionary Introduction The following is a list of Sanskrit words printed in Devanagari with its transliterated form and a short meaning provided as a reference source. This cannot be a substitute for a good printed Sanskrit-English dictionary. However, we anticipate this to aid a student of Sanskrit in the on-line world. The list of words is a compilation from various sources such as messages on sanskrit-digest, translated documents such as Bhagavadgita, atharvashiirshha, raamarakshaa et cetera, and other files accessible on the web. The words are encoded in ITRANS transliteration scheme so as to print them in Devanagari. There is a copyright on this file to the extent of preventing misuse on other internet sites and ego-trips of individuals. We recommend not to copy and post this file on any other site since we periodically update and correct this list and we do not want different versions of file floating around the internet. We have seen people copying this work and calling of their own. We request you to provide corrections, and more importantly many such additions from your own collection. The list has been arranged according to Devanagari sequence.\\ The transliteration according to ITRANS (older 3.2) scheme is given by \medskip\hrule\medskip \underline{vowels(svara):} \hskip .5in a aa(A) i ii(I) u uu(U) R\^{}i R\^{}I e ai o \hskip .5in au aM aH L\^{}i L\^{}I \underline{consonants(vya.njana):} \hskip .5in k kh g gh N\^{}\\ \hskip .5in ch chh j jh JN\\ \hskip .5in T Th D Dh N\\ \hskip .5in t th d dh n\\ \hskip .5in p ph b bh m\\ \hskip .5in y r l v sh shh s h L(maraaThii) kSh(x ksh) GY(hindii)\\ Both .n and M produce anusvaara, .a avagraha, .h haLa.nta \\ H visarga, Only a dot .
    [Show full text]
  • Foreign Languages for the Use of Printers and Translators
    u. Gmm^-mi'mr printing office k. K GIEGJij^^a^GlI, Public Pbinter FOREIGN LANG-UAGI SUPPLEMENT TO STYLE MANUAL JIICVISED EDITION FOREIGN LANGUAGES For the Use of Printers and Translators SUPPLEMENT TO STYLE MANUAL of the UNITED STATES GOVERNMENT PRINTING OFFICE SECOND EDITION, REVISED AND ENLARGED APRIL 1935 By GEORGE F. von OSTERMANN Foreign Reader A. E. GIEGENGACK Public Printer WASHINGTON, D. C. 1935 For sale by the Superintendent of Documents, Washington, D. C. Price $1.00 (Buckram) PREFACE This manual relating to foreign languages is purposely condensed for ready reference and is intended merely as a guide, not a textbook. Only elementary rules and examples are given, and no effort is made to deal exhaustively with any one subject. Minor exceptions exist to some of the rules given, but a close adherence to the usage indicated will be sufficient for most foreign-language work. In the Romance languages, especially, there are other good forms and styles not shovm in the following pages. It is desired to acknowledge the assistance and cooperation of officials and members of the staff of the Library of Congress in the preparation of these pages and, in particular. Dr. Herbert Putnam, Librarian of Congress; Mr. Martin A. Roberts, Superintendent of the Reading Room; Mr. Charles Martel, Consultant in Cataloging, Classification, and Bibliography; Mr. Julian Leavitt, Chief of Catalog Division; Mr. James B. Childs, Chief of Document Division; Dr. Israel Schapiro, Chief of the Semitic Division; Mr. George B. Sanderlin; Mr. S. N. Cerick; Mr. Jens Nyholm; Mr. N. H. Randers-Pehrson; Mr. Oscar E.
    [Show full text]
  • Narrative of a Pedestrian Journey
    This is a reproduction of a library book that was digitized by Google as part of an ongoing effort to preserve the information in books and make it universally accessible. http://books.google.com /4s, 0 HARVARD UNIVERSITY LIBRARY or THE PEABODY MUSEUM OF AMERICAN ARCHAEOLOGY AND ETHNOLOGY GIFT OF R.B. Dixon Received May 7, 1936 NARRATIVE A PEDESTRIAN JOURNEY RUSSIA AND SIBERIAN TARTARY, FRONTIERS OF CHINA FROZEN SEA AND KAMTCHATKA; [•XIIVORMED DURIHG THE TEARS 1820, 1821, 1822, AND 1823. CAPT. JOHN DUNDAS COCHRANE, R.N. PHILADELPHIA .- H. C. CAREY, & I. LEA, AND A. SMALL; AND COLLINS & HANNAY, NEW-YORK. 1824. Jl 371 A S, KOL--XL (-"_>x ••* V-T TO THE RIGHT HONOURABLE VISCOUNT MELVILLE, K.T. FIRST LORD OF THE ADMIRALTY, ETC. MY LORD, WHEN I determined upon committing to the press the following Narrative of my journey through Siberia and Tartary, I felt a desire that I might be permitted to dedicate the same to your Lordship. For the kind and handsome manner in which your Lordship was pleased to comply with my request, per mit me to hope that your Lordship may derive a few hours of amusement from the perusal of the work; and should this wish fortunately be accomplished, I shall not consider my time to have been uselessly employed. I have the honor to be, with all respect, YOUR OBLIGED AND FAITHFUL SERVANT, JOHN DUNDAS COCHRANE. PREFACE. IT has not been, the kind partiality of my friends, the common apology for an uninteresting or ill written book, which has induced me to launch upon the world the following narrative of my journey to the utmost north ern and eastern limits of the Russian empire; nor has it arisen from a vain idea of my being qualified for a task of the kind.
    [Show full text]
  • Development of a Micro Telugu Opinion Wordnet and Aligning with TELOWN Ontology for Automatic Recognition of Opinion Words from Telugu Documents
    INTERNATIONAL JOURNAL OF RESEARCH ISSN NO : 2236-6124 Development of a Micro Telugu Opinion WordNet and Aligning with TELOWN Ontology for Automatic Recognition of Opinion Words from Telugu Documents Benarji Tharini1, Dr.Vishnu Vardhan Bulusu2 1 Research Scholar Rayalaseema University, Kurnool, AP, India. [email protected] 2 Professor in Department of CSE, Manthany JNTUH, TS, India. [email protected] Abstract: The emergencies in Indian language based documents over the web are observed in the recent past with the advent of Unicode standard. The content of Indian language and its accessibility is observed to be minimal in the linguistic process evaluation. Unicode based language tools are created in order to prepare language specific repositories and the dictionaries online. Construction of wordnet, language constructs and thinking about a semantically rich lexical synsets is useful in the linguistic processing of the Indian context. Over the two decades the research world is trending towards construction of the semantic models. It is necessary to start a beginning to create a rich knowledge base in order to attain semantically rich linguistic models. With this phenomenon as an aim a micro opinion Telugu wordnet is created in order to map with Telugu Opinion WordNet Ontology (TELOWN) which consists of semantic knowledge on positive and negative Telugu opinion words. The objective of this process is to create opinion wordnet in Telugu along with their synsets for the automatic recognition of opinion words from Telugu documents. SPARQL is used as a query language for the retrieval at the backend. Keywords: semantic web, ontology, Telugu, WordNet, opinion words, SPARQL I.
    [Show full text]
  • The European Union's Need for an International Auxiliary Language
    Federico Gobbo 1 Journal of Universal Language 6 March 2005, 1-28 The European Union’s Need for an International Auxiliary Language Federico Gobbo Insubria University Abstract In the last few years, debate has centered around European Union language policy. Many people, language specialists and laymen alike, argue that the European Union should adopt a common tongue for practical purposes, at institutional and educational levels, while respecting the cultural richness of other languages. Although English, or sometimes Latin, has been proposed, an International Auxiliary Language (IAL) would seem to accomplish this aim better. In this paper we will compare, structurally and sociolinguistically, three major IALs: Esperanto, Ido, and Interlingua, as candidates to serve as the common language of the EU, on account of their language vigour and vitality. Keywords: International Auxiliary Language, European Union, language politics, English, Esperanto, Ido, Interlingua, Latin 2 The European Union’s Need for an International Auxiliary Language 1. Introduction May 1, 2004 is a crucial date in European history: twenty-five European countries created the most important political reality the Old World ever dreamed of. With its 455 million citizens, the European Union became more populous than the United States of America, as well as topping the USA’s Gross Domestic Product (GDP), making it the largest economy in the world (Rifkin 2004). Conversely, as has been noted elsewhere, after September 11, 2001, the appeal of United States’ world leadership has greatly declined (Jung 2004). Despite these data, the political relevance of the EU in the world is not as important as its economy. It is worth recalling that the creation of a new Europe, after the Second World War, was driven by two agendas, one European, the other American (Phillipson 2004).
    [Show full text]
  • Esperanto 1 Esperanto
    Esperanto 1 Esperanto Esperanto Created by L. L. Zamenhof Date 1887 Setting and usage International auxiliary language Users Native: 200 to 1,000 (1996) L2 users: 10,000 to 2,000,000 Purpose constructed language • International auxiliary language • Esperanto Writing system Latin (Esperanto alphabet) Sources Vocabulary from Romance and Germanic languages; phonology from Slavic languages Official status Regulated by Akademio de Esperanto Language codes ISO 639-1 eo ISO 639-2 epo ISO 639-3 epo Esperanto is the most widely spoken constructed international auxiliary language.[1] Its name derives from Doktoro Esperanto (Esperanto translates as "one who hopes"), the pseudonym under which L. L. Zamenhof published the first book detailing Esperanto, the Unua Libro, in 1887. Zamenhof's goal was to create an easy-to-learn and politically neutral language that transcends nationality and would foster peace and international understanding between people with different regional and/or national languages. Estimates of Esperanto speakers range from 10,000 to 2,000,000 active or fluent speakers, as well as native speakers, that is, people who learned Esperanto from their parents as one of their native languages. Esperanto is spoken in about 115 countries. Usage is particularly high in Europe, East Asia, and South America.[2] The first World Congress of Esperanto was organized in France in 1905. Since then congresses have been held in various countries every year with the exception of years in which there were world wars. Although no country has adopted Esperanto officially, Esperanto was recommended by the French Academy of Sciences in 1921 and recognized in 1954 by UNESCO (which later, in 1985, also recommended it to its member states).
    [Show full text]
  • List of South African Slang Words from Wikipedia, the Free Encyclopedia Contents 1 Afrikanerisms 2 Words from Khoi San Languages
    List of South African slang words From Wikipedia, the free encyclopedia Contents 1 Afrikanerisms 2 Words from Khoi San languages 3 Words from Xhosa, Zulu and the other Nguni Languages 4 Words from Sesotho and Setswana Languages 5 Words from Portuguese 6 Original South African English coinages 7 Slang originating from other countries 8 Slang terms originating from ethnic minorities 8.1 South African slang 8.2 South African Portuguese slang 8.3 South African Indian slang 8.4 South African Jewish slang 8.5 South African Lebanese slang 9 Special-use slang 9.1 Kasi / township slang 9.2 Gay slang also called 'gayle' 10 See also 11 Example 12 References 13 External links Afrikanerisms This list of "Afrikanerisms" comprises slang words and phrases influenced by Afrikaans and other African languages. Typical users include people with Afrikaans as their first language but who speak English as a second language; and people living in areas where the population speaks both English and Afrikaans. Many of these terms also occur widely amongst ethnic/native South Africans. These terms do not occur in formal South African English. Fanakalo also refers to when people of non Zulu origin attempt to speak Zulu without the proper pronunciation and throat sounds. Fanakalo in Zulu literally means "same as this" – (fana – same, ka – as, lo – this). ag man – oh man; ag as the Afrikaans equivalent to "oh" (pronounced \ach\ like German ACHtung), man pronounced as in English. antie – an older female authority figure. [Derived from the English "aunt".] anties – breasts aweh/awe (pronounced \AAAH-WHERE\ --> not rolling the "R") – said in excitement, as in: Aweh; my boss said I can go home early today.
    [Show full text]
  • The Amazing Adventures of Kavalier & Clay
    PART 1. The ESCAPE ARTIST 1 IN later years, holding forth to an interviewer or to an audience of aging fans at a comic book convention, Sam Clay liked to declare, apropos of his and Joe Kavalier's greatest creation, that back when he was a boy, sealed and hog-tied inside the airtight vessel known as Brooklyn, New York, he had been haunted by dreams of Harry Houdini. "To me, Clark Rent in a phone booth and Houdini in a packing crate, they were one and the same thing," he would learnedly expound at WonderCon or Angouleme or to the editor of The Comics Journal. "You weren't THE AMAZING ADVENTURES OF the same person when you came out as when you went in. Houdini's first magic act, you know, back when he was just getting started. It was called 'Metamorphosis.' It was never just a question of escape. It was also a question of KAVALIER & CLAY transformation." The truth was that, as a kid, Sammy had only a casual interest, at best, in Harry Houdini and his legendary feats; his great heroes were Nikola Tesla, Louis Pasteur, and Jack London. Yet his account of his role-of the role of Michael Chabon his own imagination-in the Escapist's birth, like all of his best fabulations, rang true. His dreams had always been Houdiniesque: they were the dreams of a pupa struggling in its blind cocoon, mad for a taste of light and air. Houdini was a hero to little men, city boys, and Jews; Samuel Louis Klayman was all three.
    [Show full text]
  • Different Ways of Displaying Non-Latin Character Sets in Blaise
    Different Ways of Displaying Non-Latin Character Sets in Blaise Leif Bochis, Statistics Denmark Abstract Blaise offers excellent support for multi-language interview instruments allowing the interviewer to select interview language on the fly. While this is easily managed with European languages using characters supported by the standard ANSI character set, there are a number of problems to address when using non- Latin character sets - for example, Arabic script. This paper addresses certain problems concerning preparation of documents in non-Latin scripts and how to incorporate these as field texts in Blaise instruments - including problems of converting texts from word processing format into the simple text file format supported by Blaise. As new versions of Blaise offer still more new facilities in general, also new ways to make non-Latin characters displayable in Blaise instruments is introduced - through separate display software, as field texts using special character sets and fonts or as images. The aim of this paper is to describe a range of possible solutions and to discuss advantages and drawbacks of each solution in existing and (close) future versions of Blaise. This paper elaborates further on a paper presented at the 6th Blaise Users' Conference in Kinsale 2000 and discusses advantages and drawbacks of the different solutions exemplified by Farsi text, based on work with incorporating Farsi field texts for Immigrant Surveys carried out by Statistics Denmark 1998- 2001. What is it all about? Statistics Denmark has carried out a number of Immigrant surveys over the last 3 years concerning living conditions, integration on the labour market, language skills et cetera.
    [Show full text]
  • A Study of Telugu Regional and Social Dialects
    A STUDY OF TELUGU REGIONAL AND SOCIAL DIALECTS: A PROSODIC ANALYSIS J. Venkateswara Sastry Thesis submitted in fulfilment of the requirements of the Degree of Doctor of Philosophy, Department of Phonetics and Linguistics, School of Oriental and African Studies, University of London \ •i September 1987 ■ i ProQuest Number: 10673011 All rights reserved INFORMATION TO ALL USERS The quality of this reproduction is dependent upon the quality of the copy submitted. In the unlikely event that the author did not send a com plete manuscript and there are missing pages, these will be noted. Also, if material had to be removed, a note will indicate the deletion. uest ProQuest 10673011 Published by ProQuest LLC(2017). Copyright of the Dissertation is held by the Author. All rights reserved. This work is protected against unauthorized copying under Title 17, United States C ode Microform Edition © ProQuest LLC. ProQuest LLC. 789 East Eisenhower Parkway P.O. Box 1346 Ann Arbor, Ml 48106- 1346 2 ABSTRACT Chapter 1: INTRODUCTION: A brief introduction of the land, language, people and their occupations and society are presented. The historical, political divisions of the land, influence of other languages on Telugu, and the caste system that exists in Telugu society are explained. The controversy that exists between the traditional school and the modern school in the fields of education and language policy is touched upon. A survey of earlier work on different aspects of Telugu phonology is given at the start of each chapter. This is followed by my own analyses in terms of prosodic phonology. Chapter 2: TELUGU PHONEMIC SYSTEMS: A review of previous accounts of the Telugu phonological system (§ given and it is shown how each system proposed is inadequate in the circumstances that obtain in the Telugu language today.
    [Show full text]
  • For Charles Ess, Ed
    For Charles Ess, ed. Cultural Attitudes Towards Technology and Communication (New York: Suny Press, 1999) Language, Power, and Software Kenneth Keniston Massachusetts Institute of Technology In discussions of the impact of "The Information Age", the role of language in computing is rarely mentioned. Hundreds of books have analyzed the digital age, the networked society, the cyberworld, computer-mediated-communications (CMC), the impact of the new electronic media with hardly a word about the central importance of language in the Information Age. The goal of this paper is to give language - by which I mean the language in which computing is done and in which computer-mediated-communication occurs - a key place in discussions of the impact of computation and computer-mediated-communications. I will argue that the language in which computing takes place is a critical variable in determining who benefits, who loses, who gains, who is excluded, who is included - in short, how the Information Age impacts the peoples and the cultures of the world. In other words, I will stress the relationship of language to power, wealth, privilege, and access to desired resources. Localization and Language. Although the ultimate "language" of the computer consists of digital zeroes and ones, the language of users, including programmers, is and must be one of the thousands of existing languages of the world. In fact, however, virtually all programming languages, all operating systems, and most applications are written originally in English, making language a "non-issue" for the approximately seven percent of the world's population that speaks, reads and writes fluent English.
    [Show full text]