Representation Learning for Natural Language Processing Representation Learning for Natural Language Processing Zhiyuan Liu • Yankai Lin • Maosong Sun

Total Page:16

File Type:pdf, Size:1020Kb

Representation Learning for Natural Language Processing Representation Learning for Natural Language Processing Zhiyuan Liu • Yankai Lin • Maosong Sun Zhiyuan Liu · Yankai Lin · Maosong Sun Representation Learning for Natural Language Processing Representation Learning for Natural Language Processing Zhiyuan Liu • Yankai Lin • Maosong Sun Representation Learning for Natural Language Processing 123 Zhiyuan Liu Yankai Lin Tsinghua University Pattern Recognition Center Beijing, China Tencent Wechat Beijing, China Maosong Sun Department of Computer Science and Technology Tsinghua University Beijing, China ISBN 978-981-15-5572-5 ISBN 978-981-15-5573-2 (eBook) https://doi.org/10.1007/978-981-15-5573-2 © The Editor(s) (if applicable) and The Author(s) 2020. This book is an open access publication. Open Access This book is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adap- tation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made. The images or other third party material in this book are included in the book’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the book’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. The use of general descriptive names, registered names, trademarks, service marks, etc. in this publi- cation does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use. The publisher, the authors and the editors are safe to assume that the advice and information in this book are believed to be true and accurate at the date of publication. Neither the publisher nor the authors or the editors give a warranty, express or implied, with respect to the material contained herein or for any errors or omissions that may have been made. The publisher remains neutral with regard to jurisdictional claims in published maps and institutional affiliations. This Springer imprint is published by the registered company Springer Nature Singapore Pte Ltd. The registered company address is: 152 Beach Road, #21-01/04 Gateway East, Singapore 189721, Singapore Preface In traditional Natural Language Processing (NLP) systems, language entries such as words and phrases are taken as distinct symbols. Various classic ideas and methods, such as n-gram and bag-of-words models, were proposed and have been widely used until now in many industrial applications. All these methods take words as the minimum units for semantic representation, which are either used to further estimate the conditional probabilities of next words given previous words (e.g., n-gram) or used to represent semantic meanings of text (e.g., bag-of-words models). Even when people find it is necessary to model word meanings, they either manually build some linguistic knowledge bases such as WordNet or use context words to represent the meaning of a target word (i.e., distributional representation). All these semantic representation methods are still based on symbols! With the development of NLP techniques for years, it is realized that many issues in NLP are caused by the symbol-based semantic representation. First, the symbol-based representation always suffers from the data sparsity problem. Take statistical NLP methods such as n-gram with large-scale corpora, for example, due to the intrinsic power-law distribution of words, the performance will decay dra- matically for those few-shot words, even many smoothing methods have been developed to calibrate the estimated probabilities about them. Moreover, there are multiple-grained entries in natural languages from words, phrases, sentences to documents, it is difficult to find a unified symbol set to represent the semantic meanings for all of them simultaneously. Meanwhile, in many NLP tasks, it is required to measure semantic relatedness between these language entries at different levels. For example, we have to measure semantic relatedness between words/phrases and documents in Information Retrieval. Due to the absence of a unified scheme for semantic representation, there used to be distinct approaches proposed and explored for different tasks in NLP, and it sometimes makes NLP does not look like a compatible community. As an alternative approach to symbol-based representation, distributed repre- sentation was originally proposed by Geoffrey E. Hinton in a technique report in 1984. The report was then included in the well-known two-volume book Parallel Distributed Processing (PDP) that introduced neural networks to model human v vi Preface cognition and intelligence. According to this report, distributed representation is inspired by the neural computation scheme of humans and other animals, and the essential idea is as follows: Each entity is represented by a pattern of activity distributed over many computing ele- ments, and each computing element is involved in representing many different entities. It means that each entity is represented by multiple neurons, and each neuron involves in the representation of many concepts. This also indicates the meaning of distributed in distributed representation. As opposed to distributed representation, people used to assume one neuron only represents a specific concept or object, e.g., there exists a single neuron that will only be activated when recognizing a person or object, such as his/her grandmother, well known as the grandmother-cell hypothesis or local representation. We can see the straightforward connection between the grandmother-cell hypothesis and symbol-based representation. It was about 20 years after distributed representation was proposed, neural probabilistic language model was proposed to model natural languages by Yoshua Bengio in 2003, in which words are represented as low-dimensional and real-valued vectors based on the idea of distributed representation. However, it was until 2013 that a simpler and more efficient framework word2vec was proposed to learn word distributed representations from large-scale corpora, we come to the popularity of distributed representation and neural network techniques in NLP. The performance of almost all NLP tasks has been significantly improved with the support of the distributed representation scheme and the deep learning methods. This book aims to review and present the recent advances of distributed repre- sentation learning for NLP, including why representation learning can improve NLP, how representation learning takes part in various important topics of NLP, and what challenges are still not well addressed by distributed representation. Book Organization This book is organized into 11 chapters with 3 parts. The first part of the book depicts key components in NLP and how representation learning works for them. In this part, Chap. 1 first introduces the basics of representation learning and why it is important for NLP. Then we give a comprehensive review of representation learning techniques on multiple-grained entries in NLP, including word representation (Chap. 2), phrase representation as known as compositional semantics (Chap. 3), sentence representation (Chap. 4), and document representa- tion (Chap. 5). The second part presents representation learning for those components closely related to NLP. These components include sememe knowledge that describes the commonsense knowledge of words as human concepts, world knowledge (also known as knowledge graphs) that organizes relational facts between entities in the real world, various network data such as social networks, document networks, and Preface vii cross-modal data that connects natural languages to other modalities such as visual data. A deep understanding of natural languages requires these complex compo- nents as a rich context. Therefore, we provide an extensive introduction to these components, i.e., sememe knowledge representation (Chap. 6), world knowledge representation (Chap. 7), network representation (Chap. 8), and cross-modal rep- resentation (Chap. 9). In the third part, we will further provide some widely used open resource tools on representation learning techniques (Chap. 10)andfinally outlook the remaining challenges and future research directions of representation learning for NLP (Chap. 11). Although the book is about representation learning for NLP, those theories and algorithms can be also applied in other related domains, such as machine learning, social network analysis, semantic web, information retrieval, data mining, and computational biology. Note that, some parts of this book are based on our previous published or pre-printed papers, including [1, 11] in Chap. 2, [32] in Chap. 3, [10, 5, 29] in Chap. 4, [12, 7] in Chap. 5, [17, 14, 24, 30, 6, 16, 2, 15] in Chap. 6, [9, 8, 13, 21, 22, 23, 3, 4, 31] in Chap. 7, and [25, 19, 18, 20, 26, 27, 33, 28] in Chap. 8. Book Cover The book cover shows an oracle bone divided into three parts, corresponding to three revolutionized stages of cognition and representation in human history. The left part shows oracle scripts, the earliest known form of Chinese writing characters used on oracle bones in the late 1200 BC. It is used to represent the emergence of human languages, especially writing systems. We consider this as the first representation revolution for human beings
Recommended publications
  • A Sentiment-Based Chat Bot
    A sentiment-based chat bot Automatic Twitter replies with Python ALEXANDER BLOM SOFIE THORSEN Bachelor’s essay at CSC Supervisor: Anna Hjalmarsson Examiner: Mårten Björkman E-mail: [email protected], [email protected] Abstract Natural language processing is a field in computer science which involves making computers derive meaning from human language and input as a way of interacting with the real world. Broadly speaking, sentiment analysis is the act of determining the attitude of an author or speaker, with respect to a certain topic or the overall context and is an application of the natural language processing field. This essay discusses the implementation of a Twitter chat bot that uses natural language processing and sentiment analysis to construct a be- lievable reply. This is done in the Python programming language, using a statistical method called Naive Bayes classifying supplied by the NLTK Python package. The essay concludes that applying natural language processing and sentiment analysis in this isolated fashion was simple, but achieving more complex tasks greatly increases the difficulty. Referat Natural language processing är ett fält inom datavetenskap som innefat- tar att få datorer att förstå mänskligt språk och indata för att på så sätt kunna interagera med den riktiga världen. Sentiment analysis är, generellt sagt, akten av att försöka bestämma känslan hos en författare eller talare med avseende på ett specifikt ämne eller sammanhang och är en applicering av fältet natural language processing. Den här rapporten diskuterar implementeringen av en Twitter-chatbot som använder just natural language processing och sentiment analysis för att kunna svara på tweets genom att använda känslan i tweetet.
    [Show full text]
  • Myrtle2018.Pdf
    Ron Martin THE STAFF “Thunder” Welcome to the Hurricane Alley Rally! I hope [email protected] [email protected] you all survived and are doing well. God bless the people Phone: (386) 323-9955 Illustrator / Maps in the Carolinas that had major damage. Please donate to your local charities and make sure they are the real Publisher Paul Fabiano deal before you give the money. Many thank to all the first Mike Barnard Phone: (386) 717-1110 responders, fireman, police, linemen, and ALL for their [email protected] service. [email protected] Okay. I want to thank Myrtle Beach Harley-Davidson for Art Director Sales the cover – awesome as always; Russ Brown for the back cover, and Spokes The Biker’s Pocket Guide, and Bones for the Centerfold. Without you guys, we don't exist. Linda Montgomery Inc. So much going on, so let's begin. [email protected] 1000 Walker St. Unit 379 Tattoos – Pit Bull Tattoo voted the best tattoo shop 11 years in a row; and Tiki Daytona Beach, FL 32117 Tattoo specializing in Polynesian art. Editor / Event Listings Phone: (386) 323-9955 Come take a demo ride at Coastal Indian Motorcycle of Myrtle Beach. Demo BikersPocketGuide.com trucks will be there for the Rally on Thursday, Friday and Saturday. Live entertainment at Suck Bang Blow! Rebel Son, Bobby Friss, Sonny Led- Made & Printed in the USA. furd, Jamine Cain to mention a few. Fall Rally parties – check out the complete list for Myrtle Beach Harley-David- SOUVENIR BIKER'S POCKET GUIDE FOR MYRTLE BEACH FALL RALLY 2018 son's Rally Events and parties.
    [Show full text]
  • Powering Inclusion: Artificial Intelligence and Assistive Technology
    POLICY BRIEF MARCH 2021 Powering Inclusion: Artificial Intelligence and Assistive Technology Assistive technology (AT) is broadly defined as any equipment, product or service that can make society more inclusive. Eye- glasses, hearing aids, wheelchairs or even some mobile applica- tions are all examples of AT. This briefing explores the power of Key facts using Artificial Intelligence (AI) to enhance the design of current and future AT, as well as to increase its reach. The UN Convention on the Context Rights of Persons with Dis- Globally, over 1 billion people are currently in need of AT. Lack of access to basic AT excludes individuals, reduces their ability to live independent abilities (UNCRPD) estab- lives [3], and is a barrier to the realisation of the Sustainable Development lished AT provision as a Goals (SDGs) [2]. human right [1]. Advances in AI offer the potential to develop and enhance AT and gain new insights into the scale and nature of AT needs (although the risks of potential bias and discrimination in certain AI-based tools must also be Over 1 billion people are acknowledged). In June 2019, the Conference of State Parties to the UN currently in need of AT. Convention on the Rights of Persons with Disabilities (CRPD) recognized that AI has the potential to enhance inclu- By 2050, this number is sion, participation, and independence for people with disabilities [5]. AI- predicted to double [2]. enabled tools – such as predictive text, visual recognition, automatic speech- to-text transcription, speech recognition, and virtual assistants - have experi- enced great advances in the last few years and are already being Only 10% of those who used by people with vision, hearing, mobility and learning impairments.
    [Show full text]
  • Applying Ensembling Methods to BERT to Boost Model Performance
    Applying Ensembling Methods to BERT to Boost Model Performance Charlie Xu, Solomon Barth, Zoe Solis Department of Computer Science Stanford University cxu2, sbarth, zoesolis @stanford.edu Abstract Due to its wide range of applications and difficulty to solve, Machine Comprehen- sion has become a significant point of focus in AI research. Using the SQuAD 2.0 dataset as a metric, BERT models have been able to achieve superior performance to most other models. Ensembling BERT with other models has yielded even greater results, since these ensemble models are able to utilize the relative strengths and adjust for the relative weaknesses of each of the individual submodels. In this project, we present various ensemble models that leveraged BiDAF and multiple different BERT models to achieve superior performance. We also studied the effects how dataset manipulations and different ensembling methods can affect the performance of our models. Lastly, we attempted to develop the Synthetic Self- Training Question Answering model. In the end, our best performing multi-BERT ensemble obtained an EM score of 73.576 and an F1 score of 76.721. 1 Introduction Machine Comprehension (MC) is a complex task in NLP that aims to understand written language. Question Answering (QA) is one of the major tasks within MC, requiring a model to provide an answer, given a contextual text passage and question. It has a wide variety of applications, including search engines and voice assistants, making it a popular problem for NLP researchers. The Stanford Question Answering Dataset (SQuAD) is a very popular means for training, tuning, testing, and comparing question answering systems.
    [Show full text]
  • A Way with Words: Recent Advances in Lexical Theory and Analysis: a Festschrift for Patrick Hanks
    A Way with Words: Recent Advances in Lexical Theory and Analysis: A Festschrift for Patrick Hanks Gilles-Maurice de Schryver (editor) (Ghent University and University of the Western Cape) Kampala: Menha Publishers, 2010, vii+375 pp; ISBN 978-9970-10-101-6, e59.95 Reviewed by Paul Cook University of Toronto In his introduction to this collection of articles dedicated to Patrick Hanks, de Schryver presents a quote from Atkins referring to Hanks as “the ideal lexicographer’s lexicogra- pher.” Indeed, Hanks has had a formidable career in lexicography, including playing an editorial role in the production of four major English dictionaries. But Hanks’s achievements reach far beyond lexicography; in particular, Hanks has made numerous contributions to computational linguistics. Hanks is co-author of the tremendously influential paper “Word association norms, mutual information, and lexicography” (Church and Hanks 1989) and maintains close ties to our field. Furthermore, Hanks has advanced the understanding of the relationship between words and their meanings in text through his theory of norms and exploitations (Hanks forthcoming). The range of Hanks’s interests is reflected in the authors and topics of the articles in this Festschrift; this review concentrates more on those articles that are likely to be of most interest to computational linguists, and does not discuss some articles that appeal primarily to a lexicographical audience. Following the introduction to Hanks and his career by de Schryver, the collection is split into three parts: Theoretical Aspects and Background, Computing Lexical Relations, and Lexical Analysis and Dictionary Writing. 1. Part I: Theoretical Aspects and Background Part I begins with an unfinished article by the late John Sinclair, in which he begins to put forward an argument that multi-word units of meaning should be given the same status in dictionaries as ordinary headwords.
    [Show full text]
  • University of Oklahoma Graduate College
    UNIVERSITY OF OKLAHOMA GRADUATE COLLEGE TRANSVERSING OUR BORDERS: DECOLONIZING COMMUNICATION THEORY A DISSERTATION SUBMITED TO THE GRADUATE FACULTY in partial fulfillment of the requirements for the Degree of DOCTOR OF PHILOSOPHY By STERLIN LEDREW MOSLEY Norman, Oklahoma 2015 TRANSVERSING OUR BORDERS: DECOLONIZING COMMUNICATION THEORY A DISSERTATION APPROVED FOR THE DEPARTMENT OF COMMUNICATION BY ___________________________ Dr. Clemencia Rodriguez, Chair ___________________________ Dr. Eric Kramer ___________________________ Dr. Lupe Davidson ___________________________ Dr. James Olufowote ___________________________ Dr. Catherine John ©Copyright STERLIN LEDREW MOSLEY 2015 All Rights Reserved. Table of Contents Introduction……………………………………………………………………1 Shifts Exploring Another Way Research Motivation Gaps in Intercultural Scholarship Chapter I: Literature Review ………………………………………………..13 Fragmented Meaning Romanticism and Idealism Existentialism and Post-Modernism Marxism, Critical Theory and The Frankfurt School Feminist Theory Postcolonial Theory Edward Said and Orientalism Homi Bhaba and Colonial Discourse Analysis Gayarti Spivak and Postcolonial Reflexivity African and Afro-Caribbean Postcolonialism Sylvia Wynter Edouard Glissant Paul Gilroy Lewis Gordon Patricia Hill Collins, Afrocentrism and Afro-Idealism Chapter II: Research and Methodology……………………………………66 Research Questions The Paradox of Postcolonial Theory Building Methodology Reflexive Scholarship and Spiritual Activism Traditional Rhetorical Analysis in the Postcolonial Tradition
    [Show full text]
  • Finetuning Pre-Trained Language Models for Sentiment Classification of COVID19 Tweets
    Technological University Dublin ARROW@TU Dublin Dissertations School of Computer Sciences 2020 Finetuning Pre-Trained Language Models for Sentiment Classification of COVID19 Tweets Arjun Dussa Technological University Dublin Follow this and additional works at: https://arrow.tudublin.ie/scschcomdis Part of the Computer Engineering Commons Recommended Citation Dussa, A. (2020) Finetuning Pre-trained language models for sentiment classification of COVID19 tweets,Dissertation, Technological University Dublin. doi:10.21427/fhx8-vk25 This Dissertation is brought to you for free and open access by the School of Computer Sciences at ARROW@TU Dublin. It has been accepted for inclusion in Dissertations by an authorized administrator of ARROW@TU Dublin. For more information, please contact [email protected], [email protected]. This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 4.0 License Finetuning Pre-trained language models for sentiment classification of COVID19 tweets Arjun Dussa A dissertation submitted in partial fulfilment of the requirements of Technological University Dublin for the degree of M.Sc. in Computer Science (Data Analytics) September 2020 Declaration I certify that this dissertation which I now submit for examination for the award of MSc in Computing (Data Analytics), is entirely my own work and has not been taken from the work of others save and to the extent that such work has been cited and acknowledged within the test of my work. This dissertation was prepared according to the regulations for postgraduate study of the Technological University Dublin and has not been submitted in whole or part for an award in any other Institute or University.
    [Show full text]
  • An Improved Text Sentiment Classification Model Using TF-IDF and Next Word Negation
    An Improved Text Sentiment Classification Model Using TF-IDF and Next Word Negation Bijoyan Das Sarit Chakraborty Student Member, IEEE Member, IEEE, Kolkata, India Abstract – With the rapid growth of Text sentiment tickets. Although this is a very trivial problem, text analysis, the demand for automatic classification of classification can be used in many different areas as electronic documents has increased by leaps and bound. follows: The paradigm of text classification or text mining has been the subject of many research works in recent time. Most of the consumer based companies use In this paper we propose a technique for text sentiment sentiment classification to automatically classification using term frequency- inverse document generate reports on customer feedback. It is an frequency (TF-IDF) along with Next Word Negation integral part of CRM. [2] (NWN). We have also compared the performances of In medical science, text classification is used to binary bag of words model, TF-IDF model and TF-IDF analyze and categorize reports, clinical trials, with ‘next word negation’ (TF-IDF-NWN) model for hospital records etc. [2] text classification. Our proposed model is then applied Text classification models are also used in law on three different text mining algorithms and we found on the various trial data, verdicts, court the Linear Support vector machine (LSVM) is the most transcripts etc. [2] appropriate to work with our proposed model. The Text classification models can also be used for achieved results show significant increase in accuracy Spam email classification. [7] compared to earlier methods. In this paper we have demonstrated a study on the three different techniques to build models for text I.
    [Show full text]
  • Weekend Glance
    Friday, Dec. 1, 2017 Vol. 11 No. 43 12040 Foster Road, Norwalk, CA 90650 Norwalk Norwalk City Council votes restaurant to give themselves grades Donut King 12000 Rosecrans Ave. NOVEMBER 30 12 more months in office Date Inspected: 11/17/17 FridayWeekend74˚ Neighborhood Watch meeting Grade: B CITY COUNCIL: Council members at a DATE: Thursday, Nov. 30 Glance choose to extend elections one year Buy Low Market TIME: 6:30 pm under legislation signed by Gov. Jerry Saturday 75˚⁰ LOCATION: Paddison Elementary 10951 Rosecrans Ave. 68 Brown. Date Inspected: 11/13/17 Friday DECEMBER 2 Grade: B By Raul Samaniego SnowFest and Christmas Tree Contributor Wienerschnitzel Sunday 71˚70⁰ Lighting 11660 Imperial Hwy. Saturday DATE: Saturday, Dec. 2 NORWALK – The Norwalk City Date Inspected: 11/15/17 TIME: 12-8 pm Council voted 5-0 on November Grade: A 21, to approve Ordinance 17-1698 LOCATION: City Hall lawn which called for the changing of Waba Grill Holiday Sonata Election Dates to comply with 11005 Firestone Blvd. California Senate Bill 415. DATE: Saturday, Dec. 2 Date Inspected: 11/16/17 TIME: 4-9 pm For Norwalk residents, it means Grade: A that with the approval of the Norwalk’s next election will be in 2020. Photo courtesy city of LOCATION: Uptown Whittier ordinance, “all five council members Norwalk Ana’s Bionicos have added another 12 months to 11005 Firestone Blvd. DECEMBER 5 their terms,” according to City Clerk numbered years to even, it could California’s history. Date Inspected: 11/16/17 Theresa Devoy increase the voter turnout for those City Council meeting This transition was mandated Grade: A elections.
    [Show full text]
  • Double-Edged Sword’
    2 | Wednesday, August 25, 2021 HONG KONG EDITION | CHINA DAILY PAGE TWO Music: Internet a ‘double-edged sword’ Supporters attend a pop concert in Jianghan Road in downtown Wuhan, Hubei province, on Oct 31. Online reality shows play a key role in advancing a singer’s career. ZHAO JUN / FOR CHINA DAILY From page 1 theme song for the popular drama A Beijing Native in New York, in 1993, and Heroes’ Although the internet helps showcase Song for the 1997 TV series Shui Hu Zhuan new talent, which connects with fans (The Water Margin). through social media platforms, such plat- He also wrote and performed Asian forms can be a double-edged sword. Mighty Winds, the official theme song for Liu said figures suggest the music scene in the 1990 Asian Games in Beijing, and You China is booming. The 2021 International and Me, the theme song for the 2008 Bei- Federation of the Phonographic Industry jing Olympics, which he performed with annual Global Music Report ranked the British singer Sarah Brightman. nation as the seventh-largest music market It is not the first time that Liu has voiced last year. his concerns about the music scene. In 2019, According to the year-end report on Chi- when he appeared on Hunan Satellite TV’s na’s music market released by domestic reality show Singer, he said the pop music online music entertainment platform Ten- industry in China was in crisis and he called cent Music Entertainment Group, or TME, for domestic singer-songwriters to produce more than 748,000 new songs were written “good original music”.
    [Show full text]
  • Noel Paul Stookey Bio
    Noel Paul Stookey Bio Singer/songwriter Noel Paul Stookey has been altering both the musical and ethical landscape of this country and the world for decades—both as the “Paul” of the legendary Peter, Paul and Mary and as an independent musician who passionately belie es in bringing the spiritual into the practice of daily life! "unny, irre erently re erent, thoughtful, compassionately passionate, Stookey’s oice is known all across this land: from the “%edding Song” to “&n 'hese 'imes.” Noel and Betty, his bride of over )* years +but who’s counting,, moved with their three daughters to the coast of maine over forty years ago. since that time he’s done the occasional home town benefit and the here-and-there in-state show but ne er had undertaken a concentrated tour in one season. “& recorded all . of my maine concerts last summer - from ogun/uit to eastport - check '01' out on a map” he says, “and, though not all of the ideo or audio made 2prime time2, i was able to collect *3 music ideos for the 4VD and +amazingly, fit all *3 songs on the 7D to create this 1' 08ME: the maine tour package.” 'he songs in this newest release represent a broad range: '0E 71(&N "959: %1;'< +homage to the rigors of enduring the lengthened winter in maine,, a new ersion of %01'S09:NAM9 +a bittersweet =az6 shaded reminiscence of a middle-aged man in denial, originally recorded on the PP&M ?@AA album, "1M&;&1 49; 78:1<8N +a new song that speaks to the immigration issue in compassionate rather than political terms,, %944&NB S8NG +with the 2original2 lyric and a spoken introduction - DVD only, and '0E ;1DY S1YS S09 48N2' ;&KE E1<< +a commentary on a common misperception of the creati e process,.
    [Show full text]
  • The General Stud Book : Containing Pedigrees of Race Horses, &C
    ^--v ''*4# ^^^j^ r- "^. Digitized by tine Internet Arciiive in 2009 witii funding from Lyrasis IVIembers and Sloan Foundation http://www.archive.org/details/generalstudbookc02fair THE GENERAL STUD BOOK VOL. II. : THE deiterol STUD BOOK, CONTAINING PEDIGREES OF RACE HORSES, &C. &-C. From the earliest Accounts to the Year 1831. inclusice. ITS FOUR VOLUMES. VOL. II. Brussels PRINTED FOR MELINE, CANS A.ND C"., EOILEVARD DE WATERLOO, Zi. M DCCC XXXIX. MR V. un:ve PREFACE TO THE FIRST EDITION. To assist in the detection of spurious and the correction of inaccu- rate pedigrees, is one of the purposes of the present publication, in which respect the first Volume has been of acknowledged utility. The two together, it is hoped, will form a comprehensive and tole- rably correct Register of Pedigrees. It will be observed that some of the Mares which appeared in the last Supplement (whereof this is a republication and continua- tion) stand as they did there, i. e. without any additions to their produce since 1813 or 1814. — It has been ascertained that several of them were about that time sold by public auction, and as all attempts to trace them have failed, the probability is that they have either been converted to some other use, or been sent abroad. If any proof were wanting of the superiority of the English breed of horses over that of every other country, it might be found in the avidity with which they are sought by Foreigners. The exportation of them to Russia, France, Germany, etc. for the last five years has been so considerable, as to render it an object of some importance in a commercial point of view.
    [Show full text]