English Linguistic Guide

Total Page:16

File Type:pdf, Size:1020Kb

English Linguistic Guide ENGLISH LINGUISTIC GUIDE Despite the remarkable and irreversible changes that have come up on the English language since the AngloSaxon p erio d it has not yet reached a p oint of p erfection and stability suchaswe sometimes asso ciate with Latin of the Golden Age the language of Virgil Horace and others DR ROBERT BURCHFIELD The English Language in The Oxford Guide to the English Language What a patchwork has b een our old saxon by the bitter frost that nipp ed its early budding and the constant habit of b orrowing thence resulting the learned among usas well as the unlearned though in very dierentwaysare constantly made to feel The English language as wehaveitnow is not so much a coherentgrowth as a disturb ed organism PROF JOHN STUART BLACKIE Gaelic Self Taught CONTENTS ENGLISH ORTHOGRAPHY Numb er of sp ellings Sp elling number Language Co de Frequency of Sp elling Sp elling Diacritics Reverse transcriptions Sp elling columns Transcriptions for corpus typ es Transcriptions for English lemmas Sp ellings for English headwords Sp ellings for syllabied headwords Transcriptions for wordforms Sp ellings for plain wordforms Sp ellings for syllabied wordforms ENGLISH PHONOLOGY Numb er of pronunciations Pronunciation number Status of pronunciation Phonetic transcriptions Computer phonetic character sets Plain transcriptions Syllabied transcriptions Stressed and syllabied transcriptions Example transcriptions Phonetic patterns ENGLISH MORPHOLOGY Morphology of English lemmas How to segment a stem Typ es of analyses The Derivation The Comp ound The Derivational Comp ound The NeoClassical Comp ound The NounVerbAx Comp ound How to assign an analysis The NounVerbAx Comp ound Status and language co des Derivationalcomp ositional information Analysis typ e co des Immediate segmentation Complete segmentation at Complete segmentation hierarchical Other co des Morphology of English wordforms Inectional features Typ e of ection Inectional Transformation ENGLISH SYNTAX Word class co des letters or numbers Sub classication Y or N Sub classication nouns Sub classication verbs Sub classication adjectives Sub classication adverbs Sub classication numerals Sub classication pronouns Sub classication conjunctions ENGLISH FREQUENCY Frequency information for lemmas and wordforms Frequency information from written and sp oken sources Written corpus information Sp oken corpus information Frequency information for COBUILD corpus typ es Frequency information for COBUILD written corpus typ es Frequency information for COBUILD sp oken corpus typ es ENGLISH ORTHOGRAPHY Detailed and varied information is available on the ortho graphic forms of headwords and wordforms You can cho ose from a range of transcriptions they can b e syllabied or un syllabied they can include or omit diacritics as explained below or in some cases they come with the order of the letters reversed or with the letters sorted alphab eticallyIn addition there are columns whichtellyou the number of letters or syllables a particular transcription contains This flex window is the menuyou see for a lemma or a wordform lexicon when you cho ose the Orthography option of the rst ADD COLUMNS menu ADD COLUMNS Number of spellings Spelling number N Language code Frequency of spelling Spelling TOP MENU PREVIOUS MENU NUMBER OF SPELLINGS This option in the ADD COLUMNS menu is a column which tells you how manyways eachlemmawordform or abbrevi ation according to the typ e of lexicon you are using can b e sp elt For the verb lemma generalize this column has the value which means there are twopossibleways of sp elling it Unless you construct a restriction on your lexicon gener alize will o ccur twice one row using the form generalizethe other using the form generalise This column is particularly useful when you wanttoidentify words whichhave sp elling variants To exclude from your english linguistic guide lexicon all items whichonlyhave one p ossible sp elling con taining instead those whichcanbespeltina number of ways you can construct an expression restriction which simply states that the numb er of sp ellings must b e greater than OrthoCnt The flex name and description of this column are as follows OrthoCnt Number of spellings OrthoCntLemma SPELLING NUMBER Just as the very rst available ADD COLUMNS option is a number which uniquely identies each lemma or wordform according to the typ e of lexicon you are using so this column uniquely identies every sp elling to b e found for each lemma or wordform If you are using a lemma lexicon the sp elling variants are given in the form of headwords with or without syllable markers For example the verb generalize has two sp ellings one is the form generalize and another is the form generalise These have the sp elling numb ers and resp ectivelyIfyou use a syllabied stem representation in place of the plain headword representation sp elling takes the form general ize and sp elling the form generalise This means you can use the universal sequence number to identify a particular lemma or wordform dep ending on the typ e of lexicon you are using and then the sp elling number to identify the dierent individual sp ellings used for each lemma Usually no preference is indicated by the sp elling numb ers eachvariant has generalize is as valid as gener aliseHowever the numb er sp elling is always an acceptable British from and any American variants are always given a higher numb er Thus the lemma monologue has the British form monologue as its numb er sp elling and the American form monolog as its numb er sp elling You can nd out whether a sp elling is British or American by using the Or thoStatus column describ ed b elow One imp ortant p oint to rememb er is that the sp elling number can b e used to eliminate unwanted rows from your lexicon If you only want to see one sp elling for each lemma or Sp elling number wordform you should construct a restriction which states that only rows with a sp elling numb er equal to are to b e included in the form OrthoNum If you dont do this you usually end up with lexicons that are to o long b ecause they needlessly rep eat certain pieces of information Takethe example generalize again only this time imagine you wantto know its pronunciation rather than the various ways it can b e sp elt You create a lexicon with three columns one giving the sp elling numb er one giving the orthography of the stem and one giving the pronunciation of the headword Without the restriction flex returns tworows for generalize Sp elling Number Headword Pronunciation generalize dZEnrl aIz generalise dZEnrl aIz generalize dZEnrla Iz generalise dZEnrla Iz This is unnecessary since you are interested only in the p ossible pronunciations The extra row merely gives you a sp elling variant while the pronunciations remain the same When you include the restriction OrthoNum however only the rows with the numb er sp elling are included Sp elling Number Headword Pronunciation generalize dZEnrl aIz generalize dZEnrla Iz And of course the more lemmas your lexicon contains the greater the numberofeliminated lines b ecomes simply as a consequence of adding this imp ortant restriction If you are particularly interested in sp elling variation then do not add this OrthoNum restriction that wayyou get to see all the variant orthographic forms of each lemma Otherwise whenever you just want to use a simple ortho graphic transcription as a means of representing the lemma in your lexicon always rememb er to insert it The flex name and description of this column are as follows OrthoNum Spelling number OrthoNumLemma english linguistic guide LANGUAGE CODE For every dierent sp elling there is a co de which tells you whether it is an acceptable British form B or whether it is only ever an American form A This applies to lemmas and wordforms A British sp elling is always the rst one given that is its sp elling numb er is always while one which only o ccurs in American English is never the rst form and always has a sp elling numb er greater than Sp elling Status Example typ e co de British B adviser American A advisor Table Orthographic status co des for English sp ellings The flex name and description of this column are as follows OrthoStatus Status of spelling OrthoStatusLemma FREQUENCY OF SPELLING There are gures available which tell you how frequently each sp elling of each lemma or wordform o ccurs in the cobuild corpus along with deviation gures whichgive a range of error for each frequency They dier from the main frequency gures in that they are sp ecic to one sp elling whereas the frequency columns prop er refer to the more general frequency counts for the whole lemma or for eachwordform To arrive at these gures a count has to b e made of the numb er of times each string o ccurs in the current million word version of the cobuild corpus This lets you see that the string b eauty for example o ccurs times and that truth o ccurs times and these gures are the sp elling frequencies for the wordforms which are sp elt that way Usually but not always this string frequency is the same as the wordform frequency and its then p ossible to formulate sp elling frequencies for each lemma This simply means adding together the frequencies for eachwordform in each inectional paradigm Thus the sp elling frequency for the Frequency of Sp elling lemma b eauty is the sp elling frequency for the wordform b eauties plus
Recommended publications
  • Department of English and American Studies English Language And
    Masaryk University Faculty of Arts Department of English and American Studies English Language and Literature Jan Štěrba Word-formation Processes in TV Series Bachelor’s Diploma Thesis Supervisor: doc. PhDr. Naděžda Kudrnáčová, Csc. 2015 I declare that I have worked on this thesis independently, using only the primary and secondary sources listed in the bibliography. …………………………………………….. Jan Štěrba I would like to thank my supervisor doc, PhDr. Naděžda Kudrnáčová, Csc., for her time, kindness and willingness to help. Table of Contents Introduction ....................................................................................................................... 6 1. Word-formation ............................................................................................................ 9 1.1 Definitions of Basic Terms ..................................................................................... 9 1.1.1 Lexeme, word-form, derivation and inflection ................................................. 9 1.1.2 Root, stem and base ........................................................................................ 10 1.2 Definition of Word-formation ............................................................................... 11 1.3 Productivity and Lexicalization ............................................................................ 12 2. Word-formation Processes in English ........................................................................ 13 2.1 Compounding .......................................................................................................
    [Show full text]
  • Problem of Creating a Professional Dictionary of Uncodified Vocabulary
    Utopía y Praxis Latinoamericana ISSN: 1315-5216 ISSN: 2477-9555 [email protected] Universidad del Zulia Venezuela Problem of Creating a Professional Dictionary of Uncodified Vocabulary MOROZOVA, O.A; YAKHINA, A.M; PESTOVA, M.S Problem of Creating a Professional Dictionary of Uncodified Vocabulary Utopía y Praxis Latinoamericana, vol. 25, no. Esp.7, 2020 Universidad del Zulia, Venezuela Available in: https://www.redalyc.org/articulo.oa?id=27964362018 DOI: https://doi.org/10.5281/zenodo.4009661 This work is licensed under Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International. PDF generated from XML JATS4R by Redalyc Project academic non-profit, developed under the open access initiative O.A MOROZOVA, et al. Problem of Creating a Professional Dictionary of Uncodified Vocabulary Artículos Problem of Creating a Professional Dictionary of Uncodified Vocabulary Problema de crear un diccionario profesional de vocabulario no codificado O.A MOROZOVA DOI: https://doi.org/10.5281/zenodo.4009661 Kazan Federal University, Rusia Redalyc: https://www.redalyc.org/articulo.oa? [email protected] id=27964362018 http://orcid.org/0000-0002-4573-5858 A.M YAKHINA Kazan Federal University, Rusia [email protected] http://orcid.org/0000-0002-8914-0995 M.S PESTOVA Ural State Law University, Rusia [email protected] http://orcid.org/0000-0002-7996-9636 Received: 03 August 2020 Accepted: 10 September 2020 Abstract: e article considers the problem of lexicographical fixation of uncodified vocabulary in a professional dictionary. e oil sublanguage, being one of the variants of common language realization used by a limited group of its medium in conditions of official and also non-official communication, provides interaction of people employed in the oil industry.
    [Show full text]
  • Towards a Conceptual Representation of Lexical Meaning in Wordnet
    Towards a Conceptual Representation of Lexical Meaning in WordNet Jen-nan Chen Sue J. Ker Department of Information Management, Department of Computer Science, Ming Chuan University Soochow University Taipei, Taiwan Taipei, Taiwan [email protected] [email protected] Abstract Knowledge acquisition is an essential and intractable task for almost every natural language processing study. To date, corpus approach is a primary means for acquiring lexical-level semantic knowledge, but it has the problem of knowledge insufficiency when training on various kinds of corpora. This paper is concerned with the issue of how to acquire and represent conceptual knowledge explicitly from lexical definitions and their semantic network within a machine-readable dictionary. Some information retrieval techniques are applied to link between lexical senses in WordNet and conceptual ones in Roget's categories. Our experimental results report an overall accuracy of 85.25% (87, 78, 89, and 87% for nouns, verbs, adjectives and adverbs, respectively) when evaluating on polysemous words discussed in previous literature. 1 Introduction Knowledge representation has traditionally been thought of as the heart of various applications of natural language processing. Anyone who has built a natural language processing system has had to tackle the problem of representing its knowledge of the world. One of the applications for knowledge representation is word sense disambiguation (WSD) for unrestricted text, which is one of the major problems in analyzing sentences. In the past, the substantial literature on WSD has concentrated on statistical approaches to analyzing and extracting information from corpora (Gale et al. 1992; Yarowsky 1992, 1995; Dagan and Itai 1994; Luk 1995; Ng and Lee 1996).
    [Show full text]
  • Inferring Parts of Speech for Lexical Mappings Via the Cyc KB
    to appear in Proceeding 20th International Conference on Computational Linguistics (COLING-04), August 23-27, 2004 Geneva Inferring parts of speech for lexical mappings via the Cyc KB Tom O’Hara‡, Stefano Bertolo, Michael Witbrock, Bjørn Aldag, Jon Curtis,withKathy Panton, Dave Schneider,andNancy Salay ‡Computer Science Department Cycorp, Inc. New Mexico State University Austin, TX 78731 Las Cruces, NM 88001 {bertolo,witbrock,aldag}@cyc.com [email protected] {jonc,panton,daves,nancy}@cyc.com Abstract in the phrase “painting for sale.” In cases like this, semantic or pragmatic criteria, as opposed We present an automatic approach to learn- to syntactic criteria only, are necessary for deter- ing criteria for classifying the parts-of-speech mining the proper part of speech. The headword used in lexical mappings. This will fur- part of speech is important for correctly identifying ther automate our knowledge acquisition sys- phrasal variations. For instance, the first term can tem for non-technical users. The criteria for also occur in the same sense in “paint for money.” the speech parts are based on the types of However, this does not hold for the second case, the denoted terms along with morphological since “paint for sale” has an entirely different sense and corpus-based clues. Associations among (i.e., a substance rather than an artifact). these and the parts-of-speech are learned us- When lexical mappings are produced by naive ing the lexical mappings contained in the Cyc users, such as in DARPA’s Rapid Knowledge For- knowledge base as training data. With over mation (RKF) project, it is desirable that tech- 30 speech parts to choose from, the classifier nical details such as the headword part of speech achieves good results (77.8% correct).
    [Show full text]
  • Symbols Used in the Dictionary
    Introduction 1. This dictionary will help you to be able to use Einglish - English Dictionary well & use it in your studying. 2. The exercies will provide you with what information your dictionary contains & how to find as well use them quickly & correctly. So you MUST: 1. make sure you understand the instructuion of the exercise. 2. check on yourself by working out examples, using your dictionay. 3. Do it carefully but quickly as you can even if you think you know the answer. 4. always scan the whole entry first to get the general idea, then find the part that you’re interested in. 5. Read slowly & carefully. 6. Help to make the learning process interesting. 7. If you make a mistake, find out why. Remember, the more you use the dictionary, the more uses you will find for it. 1 Chapter one Dictionary Entry • Terms used in the Dictionary. •Symbols used in the Dictionary. •Different typfaces used in the Dictionary. •Abbreviation used in the Dictionary. 2 1- Terms used in the Dictionay: -What is Headword? -What is an Entry? -What is a Definition? -What is a Compound? -What is a Derivative? -What information does an entry contain? 3 1- What is Headword? It’s the word you look up in a dictionary. Also, it’s the name given to each of the words listed alphabetically. 4 2- What is an Entry? It consists of a headword & all the information about the headword. 5 3- What is a Definition? It’s a part of an entry which describes a particular meaning & usage of the headword.
    [Show full text]
  • DICTIONARY News
    Number 17 y July 2009 Kernerman kdictionaries.com/kdn DICTIONARY News KD’s BLDS: a brief introduction In 2005 K Dictionaries (KD) entered a project of developing We started by establishing an extensive infrastructure both dictionaries for learners of different languages. KD had already contentwise and technologically. The lexicographic compilation created several non-English titles a few years earlier, but those was divided into 25 projects: 1 for the French core, 8 for the were basic word-to-word dictionaries. The current task marked dictionary cores of the eight languages, another 8 for the a major policy shift for our company since for the first time we translation from French to eight languages, and 8 more for the were becoming heavily involved in learner dictionaries with translation of each language to French. target languages other than English. That was the beginning of An editorial team was set up for developing each of the nine our Bilingual Learners Dictionaries Series (BLDS), which so (French + 8) dictionary cores. The lexicographers worked from far covers 20 different language cores and keeps growing. a distance, usually at home, all over the world. The chief editor The BLDS was launched with a program for eight French for each language was responsible for preparing the editorial bilingual dictionaries together with Assimil, a leading publisher styleguide and the list of headwords. Since no corpora were for foreign language learning in France which made a strategic publicly available for any of these languages, each editor used decision to expand to dictionaries in cooperation with KD. different means to retrieve information in order to compile the The main target users were identified as speakers of French headword list.
    [Show full text]
  • Japanese-English Cross-Language Headword Search of Wikipedia
    Japanese-English Cross-Language Headword Search of Wikipedia Satoshi Sato and Masaya Okada Graduate School of Engineering Nagoya University Chikusa-ku, Nagoya, Japan [email protected] masaya [email protected] Abstract glish article. To realize CLHS, term translation is required. This paper describes a Japanese-English cross-language headword search system of Term translation is not a mainstream of machine Wikipedia, which enables to find an appro- translation research, because a simple dictionary- priate English article from a given Japanese based method is widely used and enough for query term. The key component of the sys- general terms. For technical terms and proper tem is a term translator, which selects an ap- nouns, automatic extraction of translation pairs propriate English headword among the set from bilingual corpora and the Web has been in- of headwords in English Wikipedia, based on the framework of non-productive ma- tensively studied (e.g., (Daille et al., 1994) and chine translation. An experimental result (Lin et al., 2008)) in order to fill the lack of en- shows that the translation performance of tries in a bilingual dictionary. However, the qual- our system is equal or slightly better than ity of automatic term translation is not well exam- commercial machine translation systems, ined with a few exception (Baldwin and Tanaka, Google translate and Mac-Transer. 2004). Query translation in cross-language infor- mation retrieval is related to term translation, but it concentrates on translation of content words in 1 Introduction a query (Fujii and Ishikawa, 2001). Many people use Wikipedia, an encyclopedia on In the CLHS situation, the ultimate goal of the Web, to find meanings of unknown terms.
    [Show full text]
  • Download in the Conll-Format and Comprise Over ±175,000 Tokens
    Integrated Sequence Tagging for Medieval Latin Using Deep Representation Learning Mike Kestemont1, Jeroen De Gussem2 1 University of Antwerp, Belgium 2 Ghent University, Belgium *Corresponding author: [email protected] Abstract In this paper we consider two sequence tagging tasks for medieval Latin: part-of-speech tagging and lemmatization. These are both basic, yet foundational preprocessing steps in applications such as text re-use detection. Nevertheless, they are generally complicated by the considerable orthographic variation which is typical of medieval Latin. In Digital Classics, these tasks are traditionally solved in a (i) cascaded and (ii) lexicon-dependent fashion. For example, a lexicon is used to generate all the potential lemma-tag pairs for a token, and next, a context-aware PoS-tagger is used to select the most appropriate tag-lemma pair. Apart from the problems with out-of-lexicon items, error percolation is a major downside of such approaches. In this paper we explore the possibility to elegantly solve these tasks using a single, integrated approach. For this, we make use of a layered neural network architecture from the field of deep representation learning. keywords tagging, lemmatization, medieval Latin, LSTM, deep learning INTRODUCTION Latin —and its historic variants in particular— have long been a topic of major interest in Natural Language Processing [Piotrowksi 2012]. Especially in the community of Digital Humanities, the automated processing of Latin texts has always been a popular research topic. In a variety of computational applications, such as text re-use detection [Franzini et al, 2015], it is desirable to annotate and augment Latin texts with useful morpho-syntactical or lexical information, such as lemmas.
    [Show full text]
  • Phonemic Similarity Metrics to Compare Pronunciation Methods
    Phonemic Similarity Metrics to Compare Pronunciation Methods Ben Hixon1, Eric Schneider1, Susan L. Epstein1,2 1 Department of Computer Science, Hunter College of The City University of New York 2 Department of Computer Science, The Graduate Center of The City University of New York [email protected], [email protected], [email protected] error rate. The minimum number of insertions, deletions and Abstract substitutions required for transformation of one sequence into As grapheme­to­phoneme methods proliferate, their careful another is the Levenshtein distance [3]. Phoneme error rate evaluation becomes increasingly important. This paper ex­ (PER) is the Levenshtein distance between a predicted pro­ plores a variety of metrics to compare the automatic pronunci­ nunciation and the reference pronunciation, divided by the ation methods of three freely­available grapheme­to­phoneme number of phonemes in the reference pronunciation. Word er­ packages on a large dictionary. Two metrics, presented here ror rate (WER) is the proportion of predicted pronunciations for the first time, rely upon a novel weighted phonemic substi­ with at least one phoneme error to the total number of pronun­ tution matrix constructed from substitution frequencies in a ciations. Neither WER nor PER, however, is a sufficiently collection of trusted alternate pronunciations. These new met­ sensitive measurement of the distance between pronunciations. rics are sensitive to the degree of mutability among phonemes. Consider, for example, two pronunciation pairs that use the An alignment tool uses this matrix to compare phoneme sub­ ARPAbet phoneme set [4]: stitutions between pairs of pronunciations. S OW D AH S OW D AH Index Terms: grapheme­to­phoneme, edit distance, substitu­ S OW D AA T AY B L tion matrix, phonetic distance measures On the left are two reasonable pronunciations for the English word “soda,” while the pair on the right compares a pronuncia­ 1.
    [Show full text]
  • Towards Structuring an Arabic-English Machine-Readable Dictionary Using Parsing Expression Grammars
    Towards Structuring an Arabic-English Machine-Readable Dictionary Using Parsing Expression Grammars Diaa Mohamed Fayed1 Aly Aly Fahmy2 Mohsen Abdelrazek Rashwan3 Wafaa Kamel Fayed4 1 Department of Computer Science, Faculty of Computers and Information, 2 Department of Computer Science, Faculty of Computers and Information, 3 Department of EECE, Faculty of Engineering, 4 Department of Arabic Language and Literatures, Faculty of Arts, Cairo University, [email protected], [email protected], [email protected], [email protected] ABSTRACT: Dictionaries are rich sources of lexical information about words that is required for many applications of human language technology. However, publishers prepare printed dictionaries for human usage not for machine processing. This paper presents a method to structure partly a machine-readable version of the Arabic-English Al-Mawrid dictionary. The method converted the entries of Al-Mawrid from a stream of words and punctuation marks into hierarchical structures. The hierarchical structure or tree structure expresses the components of each dictionary entry in explicit format. The components of an entry are subentries and each subentry consists of defining phrases, domain labels, cross-references, and translation equivalences. We designed the proposed method as cascaded steps where parsing is the main step. We implemented the parser through the parsing expression grammar formalism. In conclusion, although Arabic dictionaries do not have microstructure standardization, this study demonstrated that it is possible to structure them automatically or semi- automatically with plausible accuracy after inducing their microstructure. Keywords: Al-Mawrid, Arabic-English, Machine-Readable Dictionary, Parsing Expression Grammars Received: 16 November 2013, Revised 19 December 2013, Accepted 24 December 2013 © 2014 DLINE.
    [Show full text]
  • Prefixes and Suffixes
    им от medical roots, suffixes and prefi... ■ 2.5 Roots of quantity j _ • - • ' I * 3 See also j Prefixes and suffixes The following is an alphabetical list o f medical prefixes and suffixes, along with their meanings, origin, and an English example. A P re fix o r 1 M e a n in g | Origin language and etymology Example (s) su ffix j i1 Ancient Greek a-/ccv- (a-/an-), | a-, an - Denotes an absence of ! Apathy, Analgia without, not ... ........... ' i iab - aw ay from Latin A bduction i Latin (abdomen), abdomen, fat abdomin(o)- O f or relating to the abdomen A bdom en ! around the belly cardiac, hydrophobiac, -ac, -a c a l pertaining to Greek -ако<; (-akos) pharmacomaniacal . Ancient Greek racavGa (akantha), acanthioa acanthocyte, aca n th (o )- thorn or spine thorn acanthoma, acanthulus Greek акоосгакос; (acoustikos), " i aco u s(io )- O f or relating to hearing acoumeter, acoustician o f or for hearing ■ Greek axpov (akron), highest or Acrocrany, acromegaly, acr(o )- extremity, topmost larthest point acroosteolysis, acroposthia Greek акоостко*; (acoustikos), -acu sis hearing paracusis o f or for hearing ' -a d tow ard, in the direction o f dorsad increase, adherence, motion ad - Latin A dduction toward, very ad en (o )-, Ancient Greek d8f|v, a8ev- Adenocarcinoma, adenology, i O f or relating to a gland ad en (i)- (aden, aden-), an acorn; a gland jadenotome, adenotyphus | O f or relating to lat or fatty adip(o)- Latin (adeps, adip-), lat A dipocyte tissue j | i O f or relating to adrenal i adren(o)- , Latin Iadrenal artery glands j -a p .
    [Show full text]
  • Also Called Back-Derivation, Retrograde Derivation Or Deaffixation) Is Often Described As One of the Minor Word-Formation Processes
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by DSpace at University of West Bohemia 2013 ACTA UNIVERSITATIS CAROLINAE PAG. 31–60 PHILOLOGICA 3 / PRAGUE STUDIES IN ENGLISH XXVI BACK-FORMATION IN THE NEWEST LAYER OF ENGLISH VOCABULARY NADĚŽDA STAŠKOVÁ 1. Introduction Back-formation (also called back-derivation, retrograde derivation or deaffixation) is often described as one of the minor word-formation processes. It has a special position among the other types of word-formation for several reasons. Firstly, it is a process direc- tionally opposite in its character to the most frequent, and possibly most natural way of forming new words in English – affixation. In back-formation, instead of being added, an affix (or that part of the source word which is supposed to be an affix) is subtracted, which thus results in a shorter or morphologically less complex item of vocabulary. Comparison of the two processes is given in Table 1. The verb ovate (1988) represents the most typical and modern result of back-formation – it is a verb formed from an action noun and its meaning is “to give someone an ovation; applaud and cheer enthusiastically”. Table 1: Comparison of affixation and back-formation SOURCE WORD > RESULTING WORD AFFIXATION: translate > translation BACK-FORMATION: ovation > ovate Especially because of the opposite direction of creation, back-formation is often regard- ed as an unusual (or even unnatural) process, and new back-formed words are at their first occurrence frequently felt as weird, humorous or inacceptable. The truth is that some back-formations appear only as nonce words, sometimes deliberately coined by a speak- er for humorous, ironic, or otherwise idiosyncratic, purposes and do not have a long life.
    [Show full text]