The Google Similarity Distance

Total Page:16

File Type:pdf, Size:1020Kb

The Google Similarity Distance IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 19, NO 3, MARCH 2007, 370–383 1 The Google Similarity Distance Rudi L. Cilibrasi and Paul M.B. Vit´anyi Abstract— Words and phrases acquire meaning from the way is represented by the literal object itself. Objects can also be they are used in society, from their relative semantics to other given by name, like “the four-letter genome of a mouse,” words and phrases. For computers the equivalent of ‘society’ or “the text of War and Peace by Tolstoy.” There are also is ‘database,’ and the equivalent of ‘use’ is ‘way to search the database.’ We present a new theory of similarity between words objects that cannot be given literally, but only by name, and and phrases based on information distance and Kolmogorov com- that acquire their meaning from their contexts in background plexity. To fix thoughts we use the world-wide-web as database, common knowledge in humankind, like “home” or “red.” To and Google as search engine. The method is also applicable make computers more intelligent one would like to represent to other search engines and databases. This theory is then meaning in computer-digestable form. Long-term and labor- applied to construct a method to automatically extract similarity, the Google similarity distance, of words and phrases from the intensive efforts like the Cyc project [22] and the WordNet world-wide-web using Google page counts. The world-wide-web project [33] try to establish semantic relations between com- is the largest database on earth, and the context information mon objects, or, more precisely, names for those objects. The entered by millions of independent users averages out to provide idea is to create a semantic web of such vast proportions automatic semantics of useful quality. We give applications in that rudimentary intelligence, and knowledge about the real hierarchical clustering, classification, and language translation. We give examples to distinguish between colors and numbers, world, spontaneously emerge. This comes at the great cost cluster names of paintings by 17th century Dutch masters and of designing structures capable of manipulating knowledge, names of books by English novelists, the ability to understand and entering high quality contents in these structures by emergencies, and primes, and we demonstrate the ability to do knowledgeable human experts. While the efforts are long- a simple automatic English-Spanish translation. Finally, we use running and large scale, the overall information entered is the WordNet database as an objective baseline against which to judge the performance of our method. We conduct a massive minute compared to what is available on the world-wide-web. randomized trial in binary classification using support vector The rise of the world-wide-web has enticed millions of machines to learn categories based on our Google distance, users to type in trillions of characters to create billions of web resulting in an a mean agreement of 87% with the expert crafted pages of on average low quality contents. The sheer mass of WordNet categories. the information about almost every conceivable topic makes Index Terms— accuracy comparison with WordNet categories, automatic it likely that extremes will cancel and the majority or average classification and clustering, automatic meaning discovery using is meaningful in a low-quality approximate sense. We devise Google, automatic relative semantics, automatic translation, dis- a general method to tap the amorphous low-grade knowledge similarity semantic distance, Google search, Google distribution available for free on the world-wide-web, typed in by local via page hit counts, Google code, Kolmogorov complexity, nor- users aiming at personal gratification of diverse objectives, and malized compression distance ( NCD ), normalized information distance ( NID ), normalized Google distance ( NGD ), meaning yet globally achieving what is effectively the largest semantic of words and phrases extracted from the web, parameter-free electronic database in the world. Moreover, this database is data-mining, universal similarity metric available for all by using any search engine that can return aggregate page-count estimates for a large range of search- queries, like Google. arXiv:cs/0412098v3 [cs.CL] 30 May 2007 I. INTRODUCTION Previously, we and others developed a compression-based Objects can be given literally, like the literal four-letter method to establish a universal similarity metric among objects genome of a mouse, or the literal text of War and Peace by given as finite binary strings [2], [25], [26], [7], [8], [39], Tolstoy. For simplicity we take it that all meaning of the object [40], which was widely reported [20], [21], [13]. Such objects can be genomes, music pieces in MIDI format, computer The material of this paper was presented in part at the IEEE ITSOC programs in Ruby or C, pictures in simple bitmap formats, Information Theory Workshop 2005 on Coding and Complexity, 29th Aug. - or time sequences such as heart rhythm data. This method 1st Sept., 2005, Rotorua, New Zealand, and the IEEE Intn’l Symp. Information Theory, Seattle, Wash. USA, August 2006. Manuscript received April 12, is feature-free in the sense that it doesn’t analyze the files 2005; final revision June 18, 2006. Rudi Cilibrasi was supported in part by the looking for particular features; rather it analyzes all features Netherlands BSIK/BRICKS project, and by NWO project 612.55.002. He is at simultaneously and determines the similarity between every the Centre for Mathematics and Computer Science (Centrum voor Wiskunde en Informatica), Amsterdam, the Netherlands. Address: CWI, Kruislaan 413, pair of objects according to the most dominant shared feature. 1098 SJ Amsterdam, The Netherlands. Email: [email protected]. The crucial point is that the method analyzes the objects Paul Vitanyi’s work was done in part while the author was on sabbatical themselves. This precludes comparison of abstract notions or leave at National ICT of Australia, Sydney Laboratory at UNSW. He is affiliated with the Centre for Mathematics and Computer Science (Centrum other objects that don’t lend themselves to direct analysis, voor Wiskunde en Informatica) and the University of Amsterdam, both in like emotions, colors, Socrates, Plato, Mike Bonanno and Amsterdam, the Netherlands. Supported in part by the EU EU Project RESQ Albert Einstein. While the previous method that compares the IST-2001-37559, the ESF QiT Programmme, the EU NoE PASCAL, and the Netherlands BSIK/BRICKS project. Address: CWI, Kruislaan 413, 1098 SJ objects themselves is particularly suited to obtain knowledge Amsterdam, The Netherlands. Email: [email protected]. about the similarity of objects themselves, irrespective of 2 IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 19, NO 3, MARCH 2007, 370–383 common beliefs about such similarities, here we develop a storage, and on assumptions that are more refined, than what method that uses only the name of an object and obtains we propose. In contrast, [11], [1] and the many references knowledge about the similarity of objects, a quantified relative cited there, use the web and Google counts to identify lexico- Google semantics, by tapping available information generated syntactic patterns or other data. Again, the theory, aim, feature by multitudes of web users. Here we are reminded of the words analysis, and execution are different from ours, and cannot of D.H. Rumsfeld [31] “A trained ape can know an awful lot meaningfully be compared. Essentially, our method below / Of what is going on in this world / Just by punching on automatically extracts semantic relations between arbitrary his mouse / For a relatively modest cost!” In this paper, the objects from the web in a manner that is feature-free, up to the Google semantics of a word or phrase consists of the set of search-engine used, and computationally feasible. This seems web pages returned by the query concerned. to be a new direction altogether. A. An Example: C. Outline: While the theory we propose is rather intricate, the resulting The main thrust is to develop a new theory of semantic method is simple enough. We give an example: At the time of distance between a pair of objects, based on (and unavoidably doing the experiment, a Google search for “horse”, returned biased by) a background contents consisting of a database 46,700,000 hits. The number of hits for the search term “rider” of documents. An example of the latter is the set of pages was 12,200,000. Searching for the pages where both “horse” constituting the world-wide-web. Similarity relations between and “rider” occur gave 2,630,000 hits, and Google indexed pairs of objects is distilled from the documents by just using 8,058,044,651 web pages. Using these numbers in the main the number of documents in which the objects occur, singly formula (III.3) we derive below, with N = 8, 058, 044, 651, and jointly (irrespective of location or multiplicity). For us, this yields a Normalized Google Distance between the terms the Google semantics of a word or phrase consists of the set “horse” and “rider” as follows: of web pages returned by the query concerned. Note that this NGD(horse, rider) 0.443. can mean that terms with different meaning have the same ≈ semantics, and that opposites like ”true” and ”false” often In the sequel of the paper we argue that the NGD is a have a similar semantics. Thus, we just discover associations normed semantic distance between the terms in question, between terms, suggesting a likely relationship. As the web usually (but not always, see below) in between 0 (identical) grows, the Google semantics may become less primitive. The and 1 (unrelated), in the cognitive space invoked by the usage theoretical underpinning is based on the theory of Kolmogorov of the terms on the world-wide-web as filtered by Google.
Recommended publications
  • Evolution and Ambition in the Career of Jan Lievens (1607-1674)
    ABSTRACT Title: EVOLUTION AND AMBITION IN THE CAREER OF JAN LIEVENS (1607-1674) Lloyd DeWitt, Ph.D., 2006 Directed By: Prof. Arthur K. Wheelock, Jr. Department of Art History and Archaeology The Dutch artist Jan Lievens (1607-1674) was viewed by his contemporaries as one of the most important artists of his age. Ambitious and self-confident, Lievens assimilated leading trends from Haarlem, Utrecht and Antwerp into a bold and monumental style that he refined during the late 1620s through close artistic interaction with Rembrandt van Rijn in Leiden, climaxing in a competition for a court commission. Lievens’s early Job on the Dung Heap and Raising of Lazarus demonstrate his careful adaptation of style and iconography to both theological and political conditions of his time. This much-discussed phase of Lievens’s life came to an end in 1631when Rembrandt left Leiden. Around 1631-1632 Lievens was transformed by his encounter with Anthony van Dyck, and his ambition to be a court artist led him to follow Van Dyck to London in the spring of 1632. His output of independent works in London was modest and entirely connected to Van Dyck and the English court, thus Lievens almost certainly worked in Van Dyck’s studio. In 1635, Lievens moved to Antwerp and returned to history painting, executing commissions for the Jesuits, and he also broadened his artistic vocabulary by mastering woodcut prints and landscape paintings. After a short and successful stay in Leiden in 1639, Lievens moved to Amsterdam permanently in 1644, and from 1648 until the end of his career was engaged in a string of important and prestigious civic and princely commissions in which he continued to demonstrate his aptitude for adapting to and assimilating the most current style of his day to his own somber monumentality.
    [Show full text]
  • Rembrandt Remembers – 80 Years of Small Town Life
    Rembrandt School Song Purple and white, we’re fighting for you, We’ll fight for all things that you can do, Basketball, baseball, any old game, We’ll stand beside you just the same, And when our colors go by We’ll shout for you, Rembrandt High And we'll stand and cheer and shout We’re loyal to Rembrandt High, Rah! Rah! Rah! School colors: Purple and White Nickname: Raiders and Raiderettes Rembrandt Remembers: 80 Years of Small-Town Life Compiled and Edited by Helene Ducas Viall and Betty Foval Hoskins Des Moines, Iowa and Harrisonburg, Virginia Copyright © 2002 by Helene Ducas Viall and Betty Foval Hoskins All rights reserved. iii Table of Contents I. Introduction . v Notes on Editing . vi Acknowledgements . vi II. Graduates 1920s: Clifford Green (p. 1), Hilda Hegna Odor (p. 2), Catherine Grigsby Kestel (p. 4), Genevieve Rystad Boese (p. 5), Waldo Pingel (p. 6) 1930s: Orva Kaasa Goodman (p. 8), Alvin Mosbo (p. 9), Marjorie Whitaker Pritchard (p. 11), Nancy Bork Lind (p. 12), Rosella Kidman Avansino (p. 13), Clayton Olson (p. 14), Agnes Rystad Enderson (p. 16), Alice Haroldson Halverson (p. 16), Evelyn Junkermeier Benna (p. 18), Edith Grodahl Bates (p. 24), Agnes Lerud Peteler (p. 26), Arlene Burwell Cannoy (p. 28 ), Catherine Pingel Sokol (p. 29), Loren Green (p. 30), Phyllis Johnson Gring (p. 34), Ken Hadenfeldt (p. 35), Lloyd Pressel (p. 38), Harry Edwall (p. 40), Lois Ann Johnson Mathison (p. 42), Marv Erichsen (p. 43), Ruth Hill Shankel (p. 45), Wes Wallace (p. 46) 1940s: Clement Kevane (p. 48), Delores Lady Risvold (p.
    [Show full text]
  • Ramia S. Badri
    1 THE DIVISION A CREATIVE PROJECT SUBMITTED TO THE GRADUATE SCHOOL IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE MASTER OF ARTS BY RAMIA S. BADRI (CHAIRPERSON, DAVID HANNON) BALL STATE UNIVERSITY MUNCIE, INDIANA DECEMBER 2011 2 Abstract This is about my artistic activity involving my six paintings that were finished in 2008 on which I used oil medium on canvas. Carrying the theme “The Division,” the paintings were reflective of my experiences of struggle. I have had my ample share of life experiences involving dissection. My country was and is tribally divided. I moved to a country, which I expected to be the beacon of equality in the world. I found myself to be proven wrong, as I still stumble across instances of social partitions. It has appeared to me as though everywhere I go division is going to be a fact of life. Division simply alters its form; its pangs are nevertheless as lethal. I could have lost my life in Iraq; I stand to have my spirit crushed in America. I approach my artworks from a feministic perspective that highlights more how my fate is shaped in the patriarchal society. I followed the lead of the symbolist painters, I used symbols to represent my thoughts and colors and lines to make known to the public my emotions that accompany my intellectual concerns. Painting is the political tool that suits my individuality and more effectively brings to the fore my personal struggles which I believe other people, despite our cultural differences, may relate to. 3 Statement of the problem This artistic activity intends to accomplish my self-disclosure of my self- identity (Franchi & Duncan, 2007, pp.
    [Show full text]
  • Sebasco Harbor Resort, Rte
    04_595881 ch01.qxd 6/28/06 9:57 PM Page 1 Chapter 1 Taking in the Scenery Awesome Vistas . 2 Seven Beautiful Bridges . 12 Drives . 20 Train Rides . 32 Boat Rides . 41 COPYRIGHTED MATERIAL Sequoia National Park. 04_595881 ch01.qxd 6/28/06 9:57 PM Page 2 TAKING IN THE SCENERY Awesome Vistas 1 Monument Valley The Iconic Wild West Landscape Ages 6 & up • Kayenta, Arizona, USA WHEN MOST OF US THINK of the American visits backcountry areas that are other- West, this is what clicks into our mental wise off-limits to visitors, including close- Viewmasters: A vast, flat sagebrush plain ups of several natural arches and with huge sandstone spires thrusting to Ancient Puebloan petroglyphs.) the sky like the crabbed fingers of a Sticking to the Valley Drive takes primeval Mother Earth clutching for the you to 11 scenic overlooks, once-in-a- heavens. Ever since movie director John lifetime photo ops with those incredible Ford first started shooting westerns here sandstone buttes for backdrop. Often in the 1930s, this landscape has felt famil- Navajos sell jewelry and other crafts at iar to millions who have never set foot the viewing areas, or even pose on horse- here. We’ve all seen it on the big screen, back to add local color to your snapshots but oh, what a difference to see it in real (a tip will be expected). life. John Wayne—John Ford’s favorite lead- If you possibly can, time your visit to ing cowboy—roamed these scrublands include sunset—as the sheer walls of on horseback, and seeing it from a West- these monoliths capture the light of the ern saddle does seem like the thing to do.
    [Show full text]
  • Insights from Stourhead Gardens
    Open Research Online The Open University’s repository of research publications and other research outputs Myth In Reception: Insights From Stourhead Gardens Thesis How to cite: Harrison, John Edward (2018). Myth In Reception: Insights From Stourhead Gardens. PhD thesis The Open University. For guidance on citations see FAQs. c 2017 The Author https://creativecommons.org/licenses/by-nc-nd/4.0/ Version: Version of Record Link(s) to article on publisher’s website: http://dx.doi.org/doi:10.21954/ou.ro.0000d97e Copyright and Moral Rights for the articles on this site are retained by the individual authors and/or other copyright owners. For more information on Open Research Online’s data policy on reuse of materials please consult the policies page. oro.open.ac.uk Myth in reception: Insights from Stourhead gardens John Edward Harrison BSc (Hons) Psychology, University of Hertfordshire, UK Dip CS, Open University, UK PhD Neuroscience, University of London, UK Thesis submitted to The Open University in partial fulfilment of the requirement for the degree of Doctor of Philosophy Faculty of Arts and Social Sciences (FASS) The Open University December 2017 1 Declaration I declare that this thesis represents my own work, except where due acknowledgement is made, and that is has not been previously submitted to the Open University or to any other institution for a degree, diploma or other qualification. 2 Abstract The focus of my thesis is the reception of classical myth in Georgian Britain as exemplified by responses to the garden imagery at Stourhead, Wiltshire. Previous explanations have tended to the view that the gardens were designed to recapitulate Virgil’s Aeneid.
    [Show full text]
  • Automatic Meaning Discovery Using Google
    Automatic Meaning Discovery Using Google Paul Vitanyi CWI, University of Amsterdam, National ICT Australia Joint work with Rudi Cilibrasi New Scientist, Jan. 29, 2005 Slashdot: News for Nerds; Stuff that Matters, Jan. 28, 2005 Dutch Radio: TROS Radio Online, March 8, 2005 The Problem: Given: Literal objects (binary files) 1 2 3 45 Determine: “Similarity” Distance Matrix (distances between every pair) Applications: Clustering, Classification, Evolutionary trees of Internet documents, computer programs, chain letters, genomes, languages, texts, music pieces, ocr, …… TOOL: z Information Distance (Li, Vitanyi, 96; Bennett,Gacs,Li,Vitanyi,Zurek, 98) D(x,y) = min { |p|: p(x)=y & p(y)=x} Binary program for a Universal Computer (Lisp, Java, C, Universal Turing Machine) Theorem (i) D(x,y) = max {K(x|y),K(y|x)} Kolmogorov complexity of x given y, defined as length of shortest binary ptogram that outputs x on input y. (ii) D(x,y) ≤D’(x,y) Any computable distance satisfying ∑2 --D’(x,y) ≤ 1 for every x. y (iii) D(x,y) is a metric. However: z x X’ Y Y’ D(x,y)=D(x’,y’) = But x and y are much more similar than x’ and y’ z So, we Normalize: Li Badger Chen Kwong Kearney Zhang 01 Li Vitanyi 01/02 Li Chen Li Ma Vitanyi 04 z d(x,y) = D(x,y) Max {K(x),K(y)} Normalized Information Distance (NID) The “Similarity metric” Properties NID: z Theorem: (i) 0 ≤ d(x,y) ≤ 1 (ii) d(x,y) is a metric symmetric,triangle inequality,: d(x,x)=0 (iii) d(x,y) is universal d(x,y) ≤ d’(x,y) for every computable, normalized (0≤d’(x,y)≤1) distance satisfying standard “density” condition.
    [Show full text]
  • Illustrated Catalogue of Magic Lanterns
    OUR SPECIALTIES. 2. 3. 4. and Stereopticon Com- I —Dr. McIntosh Solar Microscope 5. bination. —McIntosh Combination Stereopticon. — McIntosh Professional Microscope. —Mclntosh-lves Saturator. —McIntosh Sciopticon. 6—Everything in Projection Apparatus. will be supplied Specialties manufactured or sold by other houses furnished to illustrate almost any at their advertised prices. Slides colored slides painted to order by the best artists of •ubject ; also the day. We have a commodious room fitted up to exhibit the practical working of our apparatus to prospective purchasers. TERMS. Registered Let- i. —Cash in current funds, which may be sent by sent C. O. D., ter, Draft, Postal Money Order or Express. Goods balance provided twenty-five per cent of bill is sent with order, the to be collected by the Express Company. greatest care to avoid 2 —All goods will be packed with the foi them breakage in transportation, but we cannot be responsible after leaving our premises, except under special contract. reported within ten days from 3. —Any errors in invoice must be receipt of goods. all no old stock. Our Goods are new ; we have and Nos. 141 AND 143 Wabash Ave„ CHICAGO, ILLS., U. S. A. ILLUSTRATED CATALOGUE Stereopticons, Sciopticons, DISSOLVING VIEW APPARATUS, MICROSCOPES, SOLAR MICROSCOPE STEREOPTICON COMBINATION, OBJECTIVES, PHOTOGRAPHIC TRANSPARENCIES, Artistically Colored Views and Microscopical Preparations. MANUFACTURED AND IMPORTED BY THE OPTICAL DEPARTMENT OP THE McIntosh Battery and Optical Co., Nos. 141 and 143 Wabash Ave, CHICAGO, ILLS,, U. S. A. THE WORLD’S INDUSTRIAL -A-isrio C&tlott Centennial ^Exposition* GEI^FIBIGAJFE OB AWAI^D dr. ^zccinBrTOSE:, UNITED STATES, FOE SOLAS MICROSCOPES AND OPTICAL INSTBUIEMTS, Sc.
    [Show full text]
  • Observations Made During a Tour Through Parts of England, Scotland
    This is a reproduction of a library book that was digitized by Google as part of an ongoing effort to preserve the information in books and make it universally accessible. https://books.google.com OBSERVATIO NS MADE DURING A T O U R THROUGH PARTS OP ENGLAND, SCOTLAND, and WALES. CONTENTS. Letter I. — Page i. TOPISTOLART introduction — The cause of travelling traced to its source — Of man in an uncultivated Jlate — In the first stages of society — In a more civilized condition — 'The advantages arising from travelling — "The different closes of travellers described — Observations on the extent of the metropolis of England — Pleasures attainable in London — Reflections on the wretched sttuation of women of the town, and on their seducers — A story of uncommon resolution — Of the Opera, Pantheon, Play-houses, &c. Letter II. — Page i3. Observations on sundry places in a journey from London to Bath, Richmond, Windsor — Meditations on human nature, excited by a walk on the terrace of Windsor castle. Letter III. — Page i9. The journey continued — Eaton college — The advantages and disad vantages of a public and of a private education pointed out, and the . preference given to the former — Account of an abrupt secession upon some disgust of the scholars belonging to Eaton school — Maidenhead bridge — Cliefden house — The city of Bath — Its antiquity — Baths — Quality of the waters — Buildings — Amusements — Prior park, with a poetical description of it by Mrs. Chandler. Letter IV. — Page 30. A tour from Bath through some of the southern parts of England . — Mendip-hills — The city of Wells — Its cathedral, and public build ings — Instance of filial ajfeclion — Ancient tombs — The library — A literary imposition detebled — Description of Okey-Hole, a famous cavern a near CONTENTS.
    [Show full text]
  • Uva-DARE (Digital Academic Repository)
    UvA-DARE (Digital Academic Repository) Normalized information distance Vitányi, P.M.B.; Balbach, F.J.; Cilibrasi, R.L.; Li, M. Publication date 2008 Link to publication Citation for published version (APA): Vitányi, P. M. B., Balbach, F. J., Cilibrasi, R. L., & Li, M. (2008). Normalized information distance. Institute for Logic, Language and Computation. http://arxiv.org/abs/0809.2553 General rights It is not permitted to download or to forward/distribute the text or part of it without the consent of the author(s) and/or copyright holder(s), other than for strictly personal, individual use, unless the work is under an open content license (like Creative Commons). Disclaimer/Complaints regulations If you believe that digital publication of certain material infringes any of your rights or (privacy) interests, please let the Library know, stating your reasons. In case of a legitimate complaint, the Library will make the material inaccessible and/or remove it from the website. Please Ask the Library: https://uba.uva.nl/en/contact, or a letter to: Library of the University of Amsterdam, Secretariat, Singel 425, 1012 WP Amsterdam, The Netherlands. You will be contacted as soon as possible. UvA-DARE is a service provided by the library of the University of Amsterdam (https://dare.uva.nl) Download date:26 Sep 2021 Chapter 3 Normalized Information Distance Paul M. B. Vita´nyi, Frank J. Balbach, Rudi L. Cilibrasi, and Ming Li Abstract The normali ed in!ormation distance is a uni"ersal distance measure !or ob#ects o! all kinds. $t is based on %olmogoro" com&le'ity and thus uncom&utable, but there are (ays to utili e it.
    [Show full text]
  • Heres 29.05.11 Aarestrup, Emil: in Clear October Moonlight 02.10.13/25.08.11 Aarestrup, Emil: What Do They Mean
    ALPHABETICAL LIST OF POEMS Aakjær, Jeppe: Evening 08.01.15/27.10.10 Aakjær, Jeppe: I bear with a smile my burden 25.04.16 Aakjær, Jeppe: Jens Roadman 03.07.20/15.11.09 Aakjær, Jeppe: May Night 30.04.18/09.08.10 Aakjær, Jeppe: Now the day is full of song 27.05.17/13.04.10 Aakjær, Jeppe: The Hawk 26.08.19/22.01.16/29.03.10/14.09.12 Aamodt, Bjørn: Anchorage 13.05.12 Aamodt, Bjørn: Howl 05.12.20/18.05.12 Aamodt, Bjørn: Nothing 14.05.12 Aarestrup, Emil: Accept this kiss 13.10.13/08.01.10 Aarestrup, Emil: Are you a Christian? 18.01.11 Aarestrup, Emil: Admission 14.01.11 Aarestrup, Emil: Fear 17.01.13 Aarestrup, Emil: Hemispheres 29.05.11 Aarestrup, Emil: In clear October moonlight 02.10.13/25.08.11 Aarestrup, Emil: What do they mean... 20.11.09 Aarestrup, Emil: You were the rose in flower 27.08.12 Achterberg, Gerrit: Code 01.12.09 Albinus, Johann G.: All on earth must end by dying (Chorale) 20.10.14 Andersen, Astrid Hjertenæs: Horses in rain 07.08.20/08.12.15/07.05.12 Andersen, Astrid Hjertenæs: I would fly home 01.07.19 Andersen, Astrid Hjertenæs: The horse’s eye 02.07.19 Andersen, Benny: Adultery and love 25.12.12 Andersen, Benny: Closet Swedes 13.06.20/15.06.15 Andersen, Benny: Diet 09.11.15/28.12.12 Andersen, Benny: Fornication and love 05.04.18 Andersen, Benny: Healthy advice 12.11.15 Andersen, Benny: High time 18.02.10 Andersen, Benny: Kierkegaard on a bicycle 17.06.21/04.04.18 Andersen, Benny: Letter from you 12.10.16 Andersen, Benny: Little Song for Nina 13.02.18/29.09.16/06.12.09 Andersen, Benny: Longing for Sweden 02.10.16 Andersen,
    [Show full text]
  • Statistical Inference Through Data Compression
    Statistical Inference Through Data Compression Rudi Cilibrasi Statistical Inference Through Data Compression ILLC Dissertation Series DS-2007-01 For further information about ILLC-publications, please contact Institute for Logic, Language and Computation Universiteit van Amsterdam Plantage Muidergracht 24 1018 TV Amsterdam phone: +31-20-525 6051 fax: +31-20-525 5206 e-mail: [email protected] homepage: http://www.illc.uva.nl/ Statistical Inference Through Data Compression ACADEMISCH PROEFSCHRIFT ter verkrijging van de graad van doctor aan de Universiteit van Amsterdam op gezag van de Rector Magnificus prof.mr. P.F. van der Heijden ten overstaan van een door het college voor promoties ingestelde commissie, in het openbaar te verdedigen in de Aula der Universiteit op vrijdag 23 februari 2007, te 10.00 uur door Rudi Langston Cilibrasi geboren te Brooklyn, New York, Verenigde Staten Promotiecommissie: Promotor: Prof.dr.ir. P.M.B. Vitányi Co-promotor: Dr. P.D. Grünwald Overige leden: Prof.dr. P. Adriaans Prof.dr. R. Dijkgraaf Prof.dr. M. Li Prof.dr. B. Ryabko Prof.dr. A. Siebes Dr. L. Torenvliet Faculteit der Natuurwetenschappen, Wiskunde en Informatica Copyright © 2007 by Rudi Cilibrasi Printed and bound by PRINTPARTNERS IPSKAMP. ISBN: 90–6196–540–3 My arguments will be open to all, and may be judged of by all. – Publius v Contents 1 Introduction 1 1.1 Overview of this thesis ............................... 1 1.1.1 Data Compression as Learning ....................... 1 1.1.2 Visualization ................................ 3 1.1.3 Learning From the Web .......................... 5 1.1.4 Clustering and Classification ........................ 5 1.2 Gestalt Historical Context .............................. 5 1.3 Contents of this Thesis ..............................
    [Show full text]
  • Vestures of the Past: the Other Historicisms of Victorian Aesthetics
    University of Pennsylvania ScholarlyCommons Publicly Accessible Penn Dissertations 2019 Vestures Of The Past: The Other Historicisms Of Victorian Aesthetics Timothy Chandler University of Pennsylvania, [email protected] Follow this and additional works at: https://repository.upenn.edu/edissertations Part of the Comparative Literature Commons Recommended Citation Chandler, Timothy, "Vestures Of The Past: The Other Historicisms Of Victorian Aesthetics" (2019). Publicly Accessible Penn Dissertations. 3434. https://repository.upenn.edu/edissertations/3434 This paper is posted at ScholarlyCommons. https://repository.upenn.edu/edissertations/3434 For more information, please contact [email protected]. Vestures Of The Past: The Other Historicisms Of Victorian Aesthetics Abstract The importance of history to Victorian culture, and to nineteenth-century Europe more generally, is readily apprehended not only from its historiography, but also from its philosophy, art, literature, science, politics, and public institutions. This dissertation argues that the discourse of aesthetics in Victorian Britain constitutes a major area of historical thinking that, in contrast to the scientific and philosophical historicisms that dominated nineteenth-century European intellectual culture, focuses on individual experience. Its starting point is Walter Pater’s claim that we are born “clothed in a vesture of the past”—that is, that our relation to ourselves is historical and that our relation to history is aesthetic. Through readings of aesthetic theory and art criticism, along with works of historiography, fiction, poetry, and visual art, this dissertation explores some of the ways in which Victorian aesthetics addresses the problem of the relationship between the sensuous representation and experience of the historical, on the one hand, and the subjects of such representation and experience, on the other.
    [Show full text]