Issues in Text-To-Speech for French

Total Page:16

File Type:pdf, Size:1020Kb

Issues in Text-To-Speech for French ISSUES IN TEXT-TO-SPEECH FOR FRENCH Evelyne Tzoukermann AT&T Bell Laboratories 600 Mountain Avenue, Murray tlill, N.J. 07974 evelyne@rcsearch, art.corn Abstract in the standard International Phonetic Alphabi,t; the second column ASCII shows the ascii correspon- This paper reports the progress of the French dence of these characters for the text-to-speech text-to-speech system being developed at AT&T system, and the third column shows art example Bell Laboratories as part of a larger project for of the phoneme in a French word. multilingual text-to-speech systems, including lan- guages such as Spanish, Italian, German, Rus- Consonant s Vowels sian, and Chinese. These systems, based on di- IPA ASCII WORD IPA ASCII WORD phone and triphone concatenation, follow the gen- p p paix i i vive eral framework of the Bell Laboratories English t t tout e e the TTS system [?], [?]. This paper provides a de- k k eas e g aisc scription of the approach, the current status of the b b bas a a table French text-to-speech project, and some problems iI d dos u a time particular to French. g g gai 3 > homme m m mais o o tgt n n liOn u U boue 1 Introduction .p N gagner y y tour l 1 livre n ellX In this paper, the new French text-to-sIieech sys- f f faux ce @ seul tem being developed at AT&T is presented; sev- s s si o & peser eral steps have been already achieved while others f S chanter I bain are still in progress. First we present a brief' de- v v vive ~t A bane scription of the phonetic inventory of French, with z z zero 5 O bon a discussion of the approach used to select and 3 Z jupe de I brun segment phonetic units for the system. Methods r r rare o A samedi for automatic segmentation, and for the choice of Semi-vowels diphone and triphone units are presented. Some comments on durational and prosodic issues fol- IPA ASCII WORD low. We conclude with some discnssions on direc- j j yeux tions for fllture improw.'ment, including morpho- w w oui logical analysis, part-of-speech tagging, and par- q W huit tial phrasal analysis for the purpose of phrasal Table 1: French Phonetic Phonemes grouping. For the French text-to-speech synthesis system Phonetic Description of we use 35 phonemes, consisting of 17 consonants, 15 vowels (and not 1{3 like in the n,a cohlmn), and French 3 semi-vowels. As shown in Table 1, the fourth nasal /de/ has been removed, /07,/ and /g/ being The French phonetic system consists of 36 represented by the single phoneme /g/. The rea- phonemes, including 17 consonants, 16 vowels, sons for this change are that (1) /de/ tends to be and 3 semi-vowels. Table 1 shows the different assimilated to the phoneme/g/, and (2) this nasal phonemes; the IPA column contains the phonemes vowel occurs in very few words in French. Thus, 976 iC eould be said thai, functionally the disi, inel, ion eouix~xl of ol hor phon<mlc's, SylllrhOSiZiIlg separat.~ I)(;Cwoon i< 'l and is ininiinal. Prcneti also COil-. pIIOliONIOS ea.niioC (:a.pCllro ;trticllla.Cory aSl>ocCs of rains two ])holiOlilOS for 1,he eharaetor "a", /al and the languag(!. Ad(lil, ioually, transitions are harder /q/ , the first ouo hoing a front unrounded vowel Co modo.I I,han steady staCo.s. Thus, diphones are and the second one abael( romidod vowel. A small l&c standard minimal uniCs in segmental synCho iillliiboi: of l,'r<;n{:h spcal<crs lli;[ko this I)roduetion sis. Froln an acoustic stan(IpoinC, a diphollo (;ram ai<l<l i)<;i:C<~l)Cu~-d disl, hietion; in addiiAon, Coday's be seen as a signal passing from/,he co.Cral parC of tendency shows a dis;-q)t>caraii<:o of 1,his I)honeniie ~ !c)holmm('. Co the central pare of the sut>soquelHi disc.l,hieCion. Therefore, ouly /a/, the IliOsL {'orii-. ph<mcmo; iu oth<~r words, it is a unit oonllmSed liiOll t>holiellle of the Iwo> was roCahiod for s/nthc of Cwo half phonemo.s. At a sogmo.nt,al low:l, one sis. NoCiee thaC I,wo dilfcronC "sehwas" (or lilllto I;~)> cau Chink of a diphone as a sCored length el'st>etch ,,la,.kod as It+/and /A/wc,:o retaino{l for synl, hc ChaC goes fi:om nt'm: the target of one phonelne {tilt[ sis; sin<'.<esehwa in spokeil l"rcneh ca, it t)~, iu SOlliO cxCen<ls Co near I.he t;-trg~.'C of Cho followiug one, ia crises, prosollC or not dcpondiug 011 i, hc level of ocher word l.ho CransiCion [?]. fornlality of i&ilgu&ge it is iisot'ul Co ll~-wo Owe 'l'h<~ earliest diphono, systcln was <loscrihed hy dilfo=renC signs Co aeeounl for I.tiis option, l,l addi- I'oCcrson oC al [?]; ocher <liphono apl>roa<:hes have. tion, Cho graphcnio-I;o:pholieirie systelil IlSOd ill the been roi>orC<xl by [?], [?], [?], an<t [?]. AlChough Vronch TTS sysColn and dose.ribod hi SocCion ??, there are only about 40 phonemo.s in/"nglish al)out is o=quipp<;d wiCh the Cal>at)ility of ineh.ling or .ot 1600 diphonos sulfieo= for synthesis. Nev<;rthe I, Ilc schwa <lol>on<ling on the lc'w'.l ot' language. For less) b('.eaaise of lllllNerOLlS allophono.s and the face ex~-~inplo> Clio sonl;onco "jo Ill'Oil V~-tiss;uncdi", I ai'i'l that some dil>hones are not really conCexC floe, re' h:auiny on saturday, (;21.II lie said <tither/3,) lll('l Vg searchers like I'ctcrsou suggesl, that, aboul. 8000 samc)<li/ or, liioro eolloquially, /:,;mh ve samdi/ , <tiphoHes are nce<t<xl for high quality <liphone syn.- dot)ondiug on whether the schwa ix reduced or noC. thesis. Moreover> the vowel diphtongs in gnglistl In olir systoiii, l;ho solil,(;llCO will t>o I,ra, nseribcd t:+/x could be trcato.d as peudo-diphones, l,'arly Iq'cneh HI('I Vg sanlAdi/, A ;-t(;eOlllitiilg; for the Cl';tce of synthesis systems [?] relied also on sym, hesis by the schwa. An ad<liCioual eilaraccer "*", was uso<l diphouos exc<'pt for the. diphone [qi] that is into to r{qpro.sent silences aC the Iwginlling and end of gratc'{l in a Cril)honi<: group. This phonemic pair WOl'(ts, was sCore<l diff<,rontly hoeauso of its high fr<!qu<mcy Ig'onch Idlouenies (:au also he viewed ac<:ord iu lg'onch in oe<:urrcnces such as "hii" him~her. In itig t(> their Sl)OeCfal variabilil.y iu the eont,oxC of lnoro recent work, systelliS (;olltaiil diphonos and oCher i>honoliiOS, li, is knowil thaC l,'ronch vowels larger units, such as Cril>hones , quadriphonos> and show spectral stability ;MIll low c()llt(~XCllltl vari evol, q,,intophonos [?] [?], iu order to capture ahility [?], [?]. '1't1{; voiceless f,.icaCivos show some- eoarticu[a.tory lihononio.na of a longer domain that wiled; less spoeCra.l sCal>iliCy, I;tioai Chc plosives. The would iloli be adequately irio<l<'.lcd in a stric.tly di- nasals and voieod fricatives present ow!n less sCa= ]>honic system. hility. Ifi<luids l/l/ and/r/)and semi vowels l/j/, lu the current sysCem, the dil>hone invcutory for /w/, /q/) arc the i>ho,~omcs showing high vari- lq'ench was built by taking 35 ~ phonernic pairs, ahiliCy a,n(l this poses prot>hmis in diphono hasod Chat is 1225 ilnits. Ad<lod Co that was Clio silence synl;hosis [?]. Liquids ai'o very scrisiCive Co lh<='ir symbol in initial and final position, which adds eolltoxC; forinaAiC strllei.;tlres show subsCanl, ial cf a,lioChor 70 phoneniic [)aii:s, [gl'OIH this iniCial sol;, fects of c.oart,icuhd.ioti. As for the s<;mi-vowols, il. l, he pairs of se.lni-vowels wcrc relnow;d. All the is ditliculC I.o ot~t)Clll'O Che ZOllO of spec.tral stability. ottior <x)mt)inations were kept. Even though all of For those' reasons, some researchers, o.g. [?], th('.Ill do llOt oecllr ill French lexical strueCure, they orgauizc l)iionernie classi(i<:ation using Che crit<;- <:a. still app<!ar in tile intcr-wor<l boundaries. For ria of the stable vs unsi;ablc phone.me raChor than oxaml>lc , the sequence /lr/ is not permiCted word place of arCieulation. Sinii]ar to Clio approach in internally, but imist be handled since it appears Ill<'. l']nglish TTS sysCerii, syi'lthesis for French is in the interwor<l assimilation in /val r.jc/ "valenC (tolie using f>restorc'<] liilil,s. Within this frainc rion" cost 'n,othiny. This is partieularly iinportant work, there are various stralo<~gies for 1,he colh'.o-- in French sin<:e inter-word liaison is comnion as l,ion of uniCs, units i,hat will then eonsl, iCui;e the in /el z 5/ "ell<;s ont" they have vs /el s5/ "olios dicl, ionary of polyphonos. 1)lie to Chc eoil{inuo.[ sont" they are, whero the final consonant/s/eithor a.spe<:l; of the speech signal and tile fact chaC the undergo0s liaison wiC]l the vowo,1 /5/ rosulting in lt&Cllro of l)honenies is greatly modified in the /z/, or undergoes linking with the consonani, ts/ 977 resulting in the devoiced sibilant.
Recommended publications
  • Multi-Disciplinary Lexicography
    Multi-disciplinary Lexicography Multi-disciplinary Lexicography: Traditions and Challenges of the XXIst Century Edited by Olga M. Karpova and Faina I. Kartashkova Multi-disciplinary Lexicography: Traditions and Challenges of the XXIst Century, Edited by Olga M. Karpova and Faina I. Kartashkova This book first published 2013 Cambridge Scholars Publishing 12 Back Chapman Street, Newcastle upon Tyne, NE6 2XX, UK British Library Cataloguing in Publication Data A catalogue record for this book is available from the British Library Copyright © 2013 by Olga M. Karpova and Faina I. Kartashkova and contributors All rights for this book reserved. No part of this book may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying, recording or otherwise, without the prior permission of the copyright owner. ISBN (10): 1-4438-4256-7, ISBN (13): 978-1-4438-4256-3 TABLE OF CONTENTS List of Illustrations ..................................................................................... ix List of Tables............................................................................................... x Editors’ Preface .......................................................................................... xi Olga M. Karpova and Faina I. Kartashkova Ivanovo Lexicographic School................................................................ xvii Ekaterina A. Shilova Part I: Dictionary as a Cross-road of Language and Culture Chapter One................................................................................................
    [Show full text]
  • The Art of Lexicography - Niladri Sekhar Dash
    LINGUISTICS - The Art of Lexicography - Niladri Sekhar Dash THE ART OF LEXICOGRAPHY Niladri Sekhar Dash Linguistic Research Unit, Indian Statistical Institute, Kolkata, India Keywords: Lexicology, linguistics, grammar, encyclopedia, normative, reference, history, etymology, learner’s dictionary, electronic dictionary, planning, data collection, lexical extraction, lexical item, lexical selection, typology, headword, spelling, pronunciation, etymology, morphology, meaning, illustration, example, citation Contents 1. Introduction 2. Definition 3. The History of Lexicography 4. Lexicography and Allied Fields 4.1. Lexicology and Lexicography 4.2. Linguistics and Lexicography 4.3. Grammar and Lexicography 4.4. Encyclopedia and lexicography 5. Typological Classification of Dictionary 5.1. General Dictionary 5.2. Normative Dictionary 5.3. Referential or Descriptive Dictionary 5.4. Historical Dictionary 5.5. Etymological Dictionary 5.6. Dictionary of Loanwords 5.7. Encyclopedic Dictionary 5.8. Learner's Dictionary 5.9. Monolingual Dictionary 5.10. Special Dictionaries 6. Electronic Dictionary 7. Tasks for Dictionary Making 7.1. Panning 7.2. Data Collection 7.3. Extraction of lexical items 7.4. SelectionUNESCO of Lexical Items – EOLSS 7.5. Mode of Lexical Selection 8. Dictionary Making: General Dictionary 8.1. HeadwordsSAMPLE CHAPTERS 8.2. Spelling 8.3. Pronunciation 8.4. Etymology 8.5. Morphology and Grammar 8.6. Meaning 8.7. Illustrative Examples and Citations 9. Conclusion Acknowledgements ©Encyclopedia of Life Support Systems (EOLSS) LINGUISTICS - The Art of Lexicography - Niladri Sekhar Dash Glossary Bibliography Biographical Sketch Summary The art of dictionary making is as old as the field of linguistics. People started to cultivate this field from the very early age of our civilization, probably seven to eight hundred years before the Christian era.
    [Show full text]
  • Towards Building a Multilingual Sememe Knowledge Base
    Towards Building a Multilingual Sememe Knowledge Base: Predicting Sememes for BabelNet Synsets Fanchao Qi1∗, Liang Chang2∗y, Maosong Sun13z, Sicong Ouyang2y, Zhiyuan Liu1 1Department of Computer Science and Technology, Tsinghua University Institute for Artificial Intelligence, Tsinghua University Beijing National Research Center for Information Science and Technology 2Beijing University of Posts and Telecommunications 3Jiangsu Collaborative Innovation Center for Language Ability, Jiangsu Normal University [email protected], [email protected] fsms, [email protected], [email protected] Abstract word husband A sememe is defined as the minimum semantic unit of human languages. Sememe knowledge bases (KBs), which contain sense "married man" "carefully use" words annotated with sememes, have been successfully ap- plied to many NLP tasks. However, existing sememe KBs are built on only a few languages, which hinders their widespread human economize utilization. To address the issue, we propose to build a uni- sememe fied sememe KB for multiple languages based on BabelNet, a family male spouse multilingual encyclopedic dictionary. We first build a dataset serving as the seed of the multilingual sememe KB. It man- ually annotates sememes for over 15 thousand synsets (the entries of BabelNet). Then, we present a novel task of auto- Figure 1: Sememe annotation of the word “husband” in matic sememe prediction for synsets, aiming to expand the HowNet. seed dataset into a usable KB. We also propose two simple and effective models, which exploit different information of synsets. Finally, we conduct quantitative and qualitative anal- sememes to annotate senses of over 100 thousand Chinese yses to explore important factors and difficulties in the task.
    [Show full text]
  • Kernerman Kdictionaries.Com/Kdn DICTIONARY News
    Number 19 y July 2011 Kernerman kdictionaries.com/kdn DICTIONARY News Integrating phonetic transcription in a Brazilian Portuguese dictionary Luiz Carlos Cagliari 1. The dictionary in everyday work. It is always common for a phonetician to K Dictionaries has developed a series of dictionaries for learners transcribe his or her own language. When doing that, all kinds of various languages, including Portuguese/French. This of sound variation are registered, according to the speakers’ dictionary was based on European Portuguese, whose words pronunciation. On the other hand, when doing phonology, the had a phonetic transcription with European pronunciation. A sound patterns are interpreted following theories in order to group of lexicographers from the University of São Paulo, under get an abstract sound system of the language. In this case, the the supervision of Ieda Maria Alves, adapted the Portuguese phonetic variation is accommodated into their phonemes. The entries to Brazilian vocabulary, and another team from UNESP phonological transcription does not represent a pronunciation (São Paulo State University) at São José do Rio Preto, under of the language, in the same sense as phonetic transcription the supervision of Claudia Xatara, adapted to Brazilian the does. Moreover, phonological transcription does not relate to Poruguese translations of the French entries. language in the same way as orthography does. According In line with the change from European to Brazilian to the rules of writing systems, orthography has the function Portuguese, I gathered a group of students to be involved in of neutralizing the phonetic variation of a language on the modifying the phonetic transcription. The collaborators were word pronunciation level, allowing all speakers to read.
    [Show full text]
  • Producing an Encyclopedic Dictionary Using Patent Documents
    Producing an Encyclopedic Dictionary Using Patent Documents Atsushi Fujii Graduate School of Library, Information and Media Studies University of Tsukuba 1-2 Kasuga, Tsukuba, 305-8550, Japan [email protected] Abstract Although the World Wide Web has of late become an important source to consult for the meaning of words, a number of technical terms related to high technology are not found on the Web. This paper describes a method to produce an encyclopedic dictionary for high-tech terms from patent information. We used a collection of unexamined patent applications published by the Japanese Patent Office as a source corpus. Given this collection, we extracted terms as headword candidates and retrieved applications including those headwords. Then, we extracted paragraph-style descriptions and categorized them into technical domains. We also extracted related terms for each headword. We have produced a dictionary including approximately 400 000 Japanese terms as headwords. We have also implemented an interface with which users can explore our dictionary by reading text descriptions and viewing a related-term graph. 1. Introduction CLONE has been used for various research purposes. Term descriptions that have been carefully organized At the same time, we have identified that descriptions of in hand-compiled dictionaries and encyclopedias pro- technical terms associated with high technology are not vide valuable linguistic knowledge for human use and necessarily found on the Web. Example terms are “pho- knowledge-intensive computer systems, developed in the tosensitive lithographic printing plate”, “tracking error sig- human language technology community. However, as with nal”, and “magenta coupler”. Even in Wikipedia, which other types of linguistic knowledge relying on human intro- is a large encyclopedia on the Web, a number of high-tech spection and supervision, compiling encyclopedias is ex- terms are not explained.
    [Show full text]
  • A Method of Automatic Hypertext Construction from an Encyclopedic Dictionary of a Specific Field
    A Method of Automatic Hypertext Construction from an Encyclopedic Dictionary of a Specific Field Sadao Kurohashi, Makoto Nagao, Satoshi Sato and Masahiko Murakami Dept. of Electrical Engineering, Kyoto University Yoshida-honmachi, Sakyo, Kyoto, 606, Japan 1 Introduction (ii) s-link (by synonym) is set up from a defined word to defining words by synonym relation. Nowadays, very large volume of texts are created and stored in computer, and as a result the retrieval of texts Typical sentential styles of intensional definition are: which fits to a user's demand has become a difficult prob- (i) A is defined as B. A is regarded as B. lem. Hypertext is a typical system to answer this prob- (ii) A means B. A connotes B. A is B. lem, whose primary objective is to establish flexible as- sociative links between relevant text parts and to allow (iii) A is a {kind, form, way, branch, method, ...) of B. users to select and trace links to see relevant text con- (iv) A is regarded as B, so C as D. tents which are connected by links. A difficult problem here is how to construct automatically a network struc- By identifying these patterns in a term description part, ture in a given set of text data. This paper is concerned the relation between the defined word (A) and the defi- with (1) automatic conversion of a plain text set into nition sentences is established as: a hypertext structure, and (2) construction of flexible (i) p-link is set up from the defined word to the human interface for the hypertext system.
    [Show full text]
  • A Lexicographic Approach to Language Policy and Recommen
    A Lexicographic Approach to Language Policy and Recommen- dations for Future Dictionaries Sven Tarp, Centre for Lexicography, Aarhus School of Business, University of Aarhus, Aarhus, Denmark ([email protected]) and Rufus H. Gouws, Department of Afrikaans and Dutch, University of Stellenbosch, Stellenbosch, Republic of South Africa ([email protected]) Abstract: Language policy prevails at different levels and its formulation typically results in a prescriptive presentation of data. In their dictionaries, lexicographers have to respond to the deci- sions of language policy makers. In this regard dictionaries can adhere to a strict prescriptive policy by including only the prescribed forms. Dictionaries can also give a descriptive account of lan- guage use without making any recommendations or claims of correctness. Thirdly, dictionaries can be proscriptive by recommending certain forms, even if such a recommendation goes against the prescribed forms. This article offers an overview of different levels of language policy and the prin- ciples of prescription, description and proscription. Examples are given to illustrate certain lexico- graphic applications of prescription. It is emphasised that access to relevant data is important to dictionary users. Consequently the lexicographic application of proscription is discussed as a viable alternative to prescription. It is suggested that proscription, in its different possible applications, can lead to a lexicographic presentation that benefits the user and that contributes to the satisfac- tion of the functions
    [Show full text]
  • Doctor-Patient Communication As a Linguistic Model
    Advances in Social Science, Education and Humanities Research, volume 331 1st International Scientific Practical Conference "The Individual and Society in the Modern Geopolitical Environment" (ISMGE 2019) Doctor-patient communication as a linguistic model Lyubov Kasimtseva Lilya Kiseleva Sara Dzhabrailova Astrakhan State Medical University, Astrakhan State Medical University, Astrakhan State Medical University Chair of Latin and Foreign Languages Chair of Latin and Foreign Languages Chair of Latin and Foregn Languages Astrakhan, Russia Astrakhan, Russia Astrakhan,Russia [email protected] [email protected] [email protected] https://orcid.org/0000-0002-2987-4217 https://orcid.org/0000-0003-0251-8324 https://orcid.org/0000-0001-9050- 6635 Abstract — This article is devoted to the study of oral broad sense of the word, a complex unity of a language medical discourse in doctor-patient communication, which practice and extralinguistic factors, necessary for many scientists consider as a linguistic model. The relationship understanding the text, i.e. giving an idea about the between the doctor and the patient is one of the actual participants of communication, their settings and goals, problems in modern society. The relationship between the conditions of production and perception of the message [2]. doctor and the patient has evolved that is why it became necessary to inform and obtain the patient's consent to this or II. MATERIALS AND METHODS (MODEL) that medical intervention. The doctor has the task to decide which information is the medical secrecy and which is open for Medical discourse has attracted the attention of many the patient. One of the main tasks of the medical discourse is to scientists, such as V.I.Karasik, V.V.
    [Show full text]
  • Dictionary of Lexicography
    Dictionary of Lexicography Anyone who has ever handled a dictionary will have wondered how it was put together, where the information has come from, and how and why it can benefit so many of its users. The Dictionary of Lexicography addresses all these issues. The Dictionary of Lexicography examines both the theoretical and practical aspects of its subject, and how they are related. In the realm of dictionary research the authors highlight the history, criticism, typology, structures and use of dictionaries. They consider the subjects of data-collection and corpus technology, definition-writing and editing, presentation and publishing in relation to dictionary-making. English lexicography is the main focus of the work, but the wide range of lexicographical compilations in other cultures also features. The Dictionary gives a comprehensive overview of the current state of lexicography and all its possibilities in an interdisciplinary context. The representative literature has been included and an alphabetically arranged appendix lists all bibliographical references given in the more than 2,000 entries, which also provide examples of relevant dictionaries and other reference works. The authors have specialised in various aspects of the field and have contributed significantly to its astonishing development in recent years. Dr R.R.K.Hartmann is Director of the Dictionary Research Centre at the University of Exeter, and has founded the European Association for Lexicography and pioneered postgraduate training in the field. Dr Gregory James is Director of the Language Centre at the Hong Kong University of Science and Technology, where he has done research into what separates and unites European and Asian lexicography.
    [Show full text]
  • Normalizing Headwords of Cologne Digital Dictionaries
    Normalizing headwords of Cologne digital dictionaries Dr. Dhaval Patel A/8, Gokul Flats, Nava Vadaj, Ahmedabad, Gujarat, India, Pin - 380013 [email protected] Abstract 1 Cologne Digital Sanskrit Dictionaries site maintains 36 Sanskrit related dictionaries as on th 2 31 October 2016. In Sanskrit NLP, these dictionaries are the main source for lexical 3 data. Discounting 3 English – Sanskrit dictionaries, there are 33 dictionaries in total which 4 have headwords in Sanskrit. As these dictionaries are compiled over a vast period of time, 5 different conventions were followed by their authors. When we want to align these digital 6 lexica, we need to understand these conventions and arrive at a standard convention, so that 7 these different databases can communicate to one another. 8 Examples would make it more evident. Some dictionaries tend to use first inflected form of 9 the headword e.g. धमः , whereas some tend to use the uninflected form of the headword e.g. 10 धम. Similarly some dictionaries have tendency to use ‘अर ’् for words ending with ऋ e.g. िपतर ,् 11 whereas some have tendency to use ‘ऋ’ e.g. िपतृ and some others have tendency to use आ e.g. 12 िपता. The data referred to by headwords धमः / धम or िपतर /् िपतृ / िपता are the same. When we 13 want these dictionaries to communicate seamlessly with one another, such differences need to 14 be ironed out. Greater consonance can be brought between these dictionaries by some form 15 of standardization in this regards. Present paper tries to analyse the different conventions 16 followed by these authors / editors and come out with a standardized convention, so that 17 the headwords list is in standard format.
    [Show full text]
  • 8 Word Meaning
    8 Word Meaning KEY CONCEPTS Dictionary entries Sense relations Models of word meaning Mental dictionaries INTRODUCTION In this chapter we discuss word meaning. While it’s uncontroversial that words mean, it is far from clear how they mean, or indeed what meaning is. Because dictionaries are so familiar, we begin our discussion from the point of view of dictionary entries, which are designed primarily to describe the meanings of words, though they do much else besides. We discuss two ap- proaches to modeling word meaning, and then move to a discussion of the meanings of words as they might be stored in human minds and of the ways in which book and mental dictionaries are alike and different. We would be surprised if anyone reading this book had never consulted a dictionary; however, our experience over several decades of teaching about language is that very few people read the introductions (front matter) of dic- tionaries they may have had for many years. Indeed, our experience strongly suggests that most people believe in the myth of “The Dictionary,” a unique, authoritative, and accurate source of information on words, their spellings, meanings, and histories, of which actual dictionaries are merely longer or shorter versions. Everyone, especially teachers, should be aware that dictionaries are not all cut from the same cloth. Rather, they differ in substantial ways, which their users ignore at the cost of misinterpreting what they read. The goals of the exercise just below are to raise your awareness of the differences among dictionaries, to show you that it is essential to adopt as critical a stance toward dictionaries as you would toward any other commercial product, and to encourage you to examine dictionaries carefully as you buy them for yourselves, have them bought for your schools, or recommend them to your students.
    [Show full text]
  • Headwords in Macmillan English Dictionary for Advanced Learners
    Headwords in Macmillan English Dictionary 269 number of sample pages surveyed) in each dictionary. The result is shown in Table 1. The items counted are basically those presented in boldface in Headwords in Macmillan English Dictionary for the sample pages of every 100 page (page 1, 101, 201, . ) in each Advanced Learners and Macmillan English dictionary. Then the number of items presented in each dictionary is estimated by multiplying the collected item count by the number of A-Z Dictionary for Advanced Learners of American English pages in the dictionary and then dividing the result by the number of sample pages surveyed (in the case of MED-A main headwords, 478 x 2110 ÷ 21), which may give us some rough idea of the entry count in each TAKAHIRO KOKAWA dictionary. Here we should bear in mind that it is not always arguable that the more items a dictionary presents, the better, since as the number of In this paper headwords or entered lemmata and their presentation in items treated increases, the information per item typically decreases, if Macmillan English Dictionary for Advanced Learners (henceforth to be they are to be presented in the same limited space available in the printed abbreviated as MED-B) and Macmillan English Dictionary for Advanced dictionary. Learners of American English (henceforce MED-A), both published in What this count may tell us is that its claim of being 'the new learners' 2002, are discussed. MED-A and MED-B are what may be called 'twin dictionary with more entries than any other,' (blurb on the back cover of dictionaries,' that is, the two different versions of an EFL dictionary MED) may be true.
    [Show full text]