Uva-DARE (Digital Academic Repository)

Total Page:16

File Type:pdf, Size:1020Kb

Uva-DARE (Digital Academic Repository) UvA-DARE (Digital Academic Repository) Normalized information distance Vitányi, P.M.B.; Balbach, F.J.; Cilibrasi, R.L.; Li, M. Publication date 2008 Link to publication Citation for published version (APA): Vitányi, P. M. B., Balbach, F. J., Cilibrasi, R. L., & Li, M. (2008). Normalized information distance. Institute for Logic, Language and Computation. http://arxiv.org/abs/0809.2553 General rights It is not permitted to download or to forward/distribute the text or part of it without the consent of the author(s) and/or copyright holder(s), other than for strictly personal, individual use, unless the work is under an open content license (like Creative Commons). Disclaimer/Complaints regulations If you believe that digital publication of certain material infringes any of your rights or (privacy) interests, please let the Library know, stating your reasons. In case of a legitimate complaint, the Library will make the material inaccessible and/or remove it from the website. Please Ask the Library: https://uba.uva.nl/en/contact, or a letter to: Library of the University of Amsterdam, Secretariat, Singel 425, 1012 WP Amsterdam, The Netherlands. You will be contacted as soon as possible. UvA-DARE is a service provided by the library of the University of Amsterdam (https://dare.uva.nl) Download date:26 Sep 2021 Chapter 3 Normalized Information Distance Paul M. B. Vita´nyi, Frank J. Balbach, Rudi L. Cilibrasi, and Ming Li Abstract The normali ed in!ormation distance is a uni"ersal distance measure !or ob#ects o! all kinds. $t is based on %olmogoro" com&le'ity and thus uncom&utable, but there are (ays to utili e it. First, com&ression algorithms can be used to a&&ro'imate the %olmogoro" com&le'ity i! the ob#ects ha"e a string re&resentation. )econd, !or names and abstract conce&ts, &age count statistics !rom the *orld *ide *eb can be used. These &ractical reali ations o! the normali ed in!ormation distance can then be a&&lied to machine learning tasks, e'&ecially clustering, to &er!orm !eature+!ree and &arameter+!ree data mining. This cha&ter discusses the theoretical !oundations o! the normali ed in!ormation distance and both &ractical reali ations. $t &resents numerous e'am&les o! success!ul real+(orld a&&lications based on these distance measures, ranging !rom bioin!ormatics to music clustering to machine translation. 3.1 Introduction The ty&ical data mining algorithm uses e'&licitly gi"en !eatures o! the data to assess their similarity and disco"er &atterns among them. $t also comes (ith many &arameters !or the user to tune to s&eci,c needs according to the domain at hand. $n this cha&ter, by contrast, (e are discussing algorithms that neither use !eatures o! the data nor &ro"ide any &arameters to be tuned, but that ne"ertheless o!ten out&er!orm algorithms o! the a!orementioned kind. $n addition, the methods &resented here are not #ust heuristics that ha&&en to (ork, but they are !ounded in the mathematical theory o! %olmogoro" com&le'ity. The &roblems discussed in this cha&ter (ill mostly, yet not e'clusi"ely, be clustering tasks, in (hich naturally the notion o! distance bet(een ob#ects &lays a dominant role. Paul M. B. Vita´nyi C*$, %ruislaan -./, .012 )J 3msterdam, The 4etherlands; e+mail6 paul"7c(i.nl Frank J. Balbach 8ni"ersity o! *aterloo, *aterloo, 94, Canada5 e+mail6 fbalbach7u(aterloo.ca :su&&orted by a &ostdoctoral !ello(shi& o! the ;erman 3cademic <'change )er"ice :=33=>> Rudi L. Cilibrasi C*$, %ruislaan -./, .012 )J 3msterdam, The 4etherlands; e+mail6 cilibrar7cilibrar.com Ming Li 8ni"ersity o! *aterloo, *aterloo, 94, Canada5 e+mail6 mli7u(aterloo.ca /1 -0 Paul M. B. Vita´nyi, Frank J. Balbach, Rudi L. Cilibrasi, and Ming Li There are good reasons to a"oid &arameter laden methods. )etting the &arameters re?uires an intimate understanding o! the underlying algorithm. )etting them incorrectly can result in missing the right &atterns or, &erha&s (orse, in detecting !alse ones. Moreo"er, com&aring t(o &arametri ed algorithms is di!,cult because di!!erent &arameter settings can gi"e a (rong im&ression that one algorithm is better than another, (hen in !act one is sim&ly ad#usted &oorly. Com&arisons using the o&timal &arameter settings !or each algorithm are o! little hel& because these settings are hardly e"er kno(n in real situations. Lastly, t(eaking &arameters might tem&t users to im&ose their assum&tions and e'&ectations on the algorithm. There are also good reasons to a"oid !eature based methods. =etermining the rele"ant !eatures re?uires domain kno(ledge, and determining ho( rele"ant they are o!ten re?uires guessing. $m&lementing the !ea+ ture e'traction in an algorithm can be di!,cult, error+&rone, and is o!ten time consuming. $t also limits the a&&licability o! an algorithm to a s&eci,c ,eld. @o( can an algorithm &er!orm (ell i! it does not e'tract the im&ortant !eatures o! the data and does not allo( us to t(eak its &arameters to hel& it do the right thingA 9! course, &arameter and !eature !ree algorithms cannot mind read, so i! (e a &riori kno( the !eatures, ho( to e'tract them, and ho( to combine them into e'actly the distance measure (e (ant, (e should do #ust that. For e'am&le, i! (e ha"e a list o! cars (ith their color, motor rating, etc. and (ant to cluster them by color, (e can easily do that in a straight!or(ard (ay. Parameter and !eature !ree algorithms are made (ith a di!!erent scenario in mind. $n this exploratory data mining scenario (e are con!ronted (ith data (hose im&ortant !eatures and ho( to e'tract them are unkno(n to us :&erha&s there are not e"en !eatures>. *e are then stri"ing not !or a certain similarity measure, but !or the similarity measure bet(een the ob#ects. =oes such an absolute measure o! similarity e'ist at allA Bes, it does, in theory. $t is called the in!ormation distance, and the idea behind it is that t(o ob#ects are similar i! there is a sim&le descri&tion o! ho( to trans!orm each one o! them into the other one. $!, ho(e"er, all such descri&tions are com&le', the ob#ects are deemed dissimilar. For e'am&le, an image and its negati"e are "ery similar because the trans!ormation can be described as Cin"ert e"ery &i'el.D By contrast, a descri&tion o! ho( to trans!orm a blank can"as into da VinciEs Mona Lisa (ould in"ol"e the com&lete, and com&arably large, descri&tion o! that &ainting. The latter e'am&le already &oints to some issues one has to take care o!, like asymmetry and normali a+ tion. 3symmetry re!ers to the !act that, a!ter all, the in"erse trans!ormation o! the Mona Lisa into a blank can"as can be described rather sim&ly. 4ormali ation re!ers to the !act that the trans!ormation descri&tion si e must be seen in relation to the si e o! the &artici&ating ob#ects. )ection /.F details ho( these and other issues are dealt (ith and e'&lains in (hich sense the resulting information distance measure is uni"ersal. The !ormulation o! this distance measure (ill in"ol"e the mathematical theory o! %olmogoro" com&le'ity, (hich is generally concerned (ith shortest e!!ecti"e descri&tions. While the de,nition o! the in!ormation distance is rather theoretical and cannot be reali ed in &ractice, one can still use its theoretical idea and a&&ro'imate it (ith &ractical methods. T(o such a&&roaches are discussed in subse?uent sections. They di!!er in (hich &ro&erty o! the %olmogoro" com&le'ity they use and to (hat kind o! ob#ects they a&&ly. The ,rst a&&roach, &resented in )ect. /./, e'&loits the relation bet(een %olmogoro" com&le'ity and data com&ression and conse?uently em&loys common com&ression algorithms to measure distances bet(een ob#ects. This method is a&&licable (hene"er the data to be clustered are gi"en in a com&ressible !orm, !or instance, as a te't or other literal descri&tion. The second a&&roach, &resented in )ect. /.-, e'&loits the relation bet(een %olmogoro" com&le'ity and &robability. $t uses statistics generated by common *eb search engines to measure distances bet(een ob+ #ects. This method is a&&licable to non+literal ob#ects, names and conce&ts, (hose &ro&erties and interrela+ tions are gi"en by common sense and human kno(ledge. / 4ormali ed $n!ormation =istance -. 3.2 Normalized Information Distance %olmogoro" com&le'ity measures the absolute in!ormation content o! indi"idual ob#ects. For the &ur&ose o! data mining, es&ecially clustering, (e (ould also like to be able to measure the absolute in!ormation distance bet(een indi"idual ob#ects. )uch a notion should be uni"ersal in the sense that it contains all other alternati"e or intuiti"e notions o! com&utable distances as s&ecial cases. )uch a notion should also ser"e as an absolute measure o! the in!ormational, or cogniti"e, distance bet(een discrete ob#ects x and y.
Recommended publications
  • Evolution and Ambition in the Career of Jan Lievens (1607-1674)
    ABSTRACT Title: EVOLUTION AND AMBITION IN THE CAREER OF JAN LIEVENS (1607-1674) Lloyd DeWitt, Ph.D., 2006 Directed By: Prof. Arthur K. Wheelock, Jr. Department of Art History and Archaeology The Dutch artist Jan Lievens (1607-1674) was viewed by his contemporaries as one of the most important artists of his age. Ambitious and self-confident, Lievens assimilated leading trends from Haarlem, Utrecht and Antwerp into a bold and monumental style that he refined during the late 1620s through close artistic interaction with Rembrandt van Rijn in Leiden, climaxing in a competition for a court commission. Lievens’s early Job on the Dung Heap and Raising of Lazarus demonstrate his careful adaptation of style and iconography to both theological and political conditions of his time. This much-discussed phase of Lievens’s life came to an end in 1631when Rembrandt left Leiden. Around 1631-1632 Lievens was transformed by his encounter with Anthony van Dyck, and his ambition to be a court artist led him to follow Van Dyck to London in the spring of 1632. His output of independent works in London was modest and entirely connected to Van Dyck and the English court, thus Lievens almost certainly worked in Van Dyck’s studio. In 1635, Lievens moved to Antwerp and returned to history painting, executing commissions for the Jesuits, and he also broadened his artistic vocabulary by mastering woodcut prints and landscape paintings. After a short and successful stay in Leiden in 1639, Lievens moved to Amsterdam permanently in 1644, and from 1648 until the end of his career was engaged in a string of important and prestigious civic and princely commissions in which he continued to demonstrate his aptitude for adapting to and assimilating the most current style of his day to his own somber monumentality.
    [Show full text]
  • Rembrandt Remembers – 80 Years of Small Town Life
    Rembrandt School Song Purple and white, we’re fighting for you, We’ll fight for all things that you can do, Basketball, baseball, any old game, We’ll stand beside you just the same, And when our colors go by We’ll shout for you, Rembrandt High And we'll stand and cheer and shout We’re loyal to Rembrandt High, Rah! Rah! Rah! School colors: Purple and White Nickname: Raiders and Raiderettes Rembrandt Remembers: 80 Years of Small-Town Life Compiled and Edited by Helene Ducas Viall and Betty Foval Hoskins Des Moines, Iowa and Harrisonburg, Virginia Copyright © 2002 by Helene Ducas Viall and Betty Foval Hoskins All rights reserved. iii Table of Contents I. Introduction . v Notes on Editing . vi Acknowledgements . vi II. Graduates 1920s: Clifford Green (p. 1), Hilda Hegna Odor (p. 2), Catherine Grigsby Kestel (p. 4), Genevieve Rystad Boese (p. 5), Waldo Pingel (p. 6) 1930s: Orva Kaasa Goodman (p. 8), Alvin Mosbo (p. 9), Marjorie Whitaker Pritchard (p. 11), Nancy Bork Lind (p. 12), Rosella Kidman Avansino (p. 13), Clayton Olson (p. 14), Agnes Rystad Enderson (p. 16), Alice Haroldson Halverson (p. 16), Evelyn Junkermeier Benna (p. 18), Edith Grodahl Bates (p. 24), Agnes Lerud Peteler (p. 26), Arlene Burwell Cannoy (p. 28 ), Catherine Pingel Sokol (p. 29), Loren Green (p. 30), Phyllis Johnson Gring (p. 34), Ken Hadenfeldt (p. 35), Lloyd Pressel (p. 38), Harry Edwall (p. 40), Lois Ann Johnson Mathison (p. 42), Marv Erichsen (p. 43), Ruth Hill Shankel (p. 45), Wes Wallace (p. 46) 1940s: Clement Kevane (p. 48), Delores Lady Risvold (p.
    [Show full text]
  • Ramia S. Badri
    1 THE DIVISION A CREATIVE PROJECT SUBMITTED TO THE GRADUATE SCHOOL IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE MASTER OF ARTS BY RAMIA S. BADRI (CHAIRPERSON, DAVID HANNON) BALL STATE UNIVERSITY MUNCIE, INDIANA DECEMBER 2011 2 Abstract This is about my artistic activity involving my six paintings that were finished in 2008 on which I used oil medium on canvas. Carrying the theme “The Division,” the paintings were reflective of my experiences of struggle. I have had my ample share of life experiences involving dissection. My country was and is tribally divided. I moved to a country, which I expected to be the beacon of equality in the world. I found myself to be proven wrong, as I still stumble across instances of social partitions. It has appeared to me as though everywhere I go division is going to be a fact of life. Division simply alters its form; its pangs are nevertheless as lethal. I could have lost my life in Iraq; I stand to have my spirit crushed in America. I approach my artworks from a feministic perspective that highlights more how my fate is shaped in the patriarchal society. I followed the lead of the symbolist painters, I used symbols to represent my thoughts and colors and lines to make known to the public my emotions that accompany my intellectual concerns. Painting is the political tool that suits my individuality and more effectively brings to the fore my personal struggles which I believe other people, despite our cultural differences, may relate to. 3 Statement of the problem This artistic activity intends to accomplish my self-disclosure of my self- identity (Franchi & Duncan, 2007, pp.
    [Show full text]
  • Sebasco Harbor Resort, Rte
    04_595881 ch01.qxd 6/28/06 9:57 PM Page 1 Chapter 1 Taking in the Scenery Awesome Vistas . 2 Seven Beautiful Bridges . 12 Drives . 20 Train Rides . 32 Boat Rides . 41 COPYRIGHTED MATERIAL Sequoia National Park. 04_595881 ch01.qxd 6/28/06 9:57 PM Page 2 TAKING IN THE SCENERY Awesome Vistas 1 Monument Valley The Iconic Wild West Landscape Ages 6 & up • Kayenta, Arizona, USA WHEN MOST OF US THINK of the American visits backcountry areas that are other- West, this is what clicks into our mental wise off-limits to visitors, including close- Viewmasters: A vast, flat sagebrush plain ups of several natural arches and with huge sandstone spires thrusting to Ancient Puebloan petroglyphs.) the sky like the crabbed fingers of a Sticking to the Valley Drive takes primeval Mother Earth clutching for the you to 11 scenic overlooks, once-in-a- heavens. Ever since movie director John lifetime photo ops with those incredible Ford first started shooting westerns here sandstone buttes for backdrop. Often in the 1930s, this landscape has felt famil- Navajos sell jewelry and other crafts at iar to millions who have never set foot the viewing areas, or even pose on horse- here. We’ve all seen it on the big screen, back to add local color to your snapshots but oh, what a difference to see it in real (a tip will be expected). life. John Wayne—John Ford’s favorite lead- If you possibly can, time your visit to ing cowboy—roamed these scrublands include sunset—as the sheer walls of on horseback, and seeing it from a West- these monoliths capture the light of the ern saddle does seem like the thing to do.
    [Show full text]
  • Insights from Stourhead Gardens
    Open Research Online The Open University’s repository of research publications and other research outputs Myth In Reception: Insights From Stourhead Gardens Thesis How to cite: Harrison, John Edward (2018). Myth In Reception: Insights From Stourhead Gardens. PhD thesis The Open University. For guidance on citations see FAQs. c 2017 The Author https://creativecommons.org/licenses/by-nc-nd/4.0/ Version: Version of Record Link(s) to article on publisher’s website: http://dx.doi.org/doi:10.21954/ou.ro.0000d97e Copyright and Moral Rights for the articles on this site are retained by the individual authors and/or other copyright owners. For more information on Open Research Online’s data policy on reuse of materials please consult the policies page. oro.open.ac.uk Myth in reception: Insights from Stourhead gardens John Edward Harrison BSc (Hons) Psychology, University of Hertfordshire, UK Dip CS, Open University, UK PhD Neuroscience, University of London, UK Thesis submitted to The Open University in partial fulfilment of the requirement for the degree of Doctor of Philosophy Faculty of Arts and Social Sciences (FASS) The Open University December 2017 1 Declaration I declare that this thesis represents my own work, except where due acknowledgement is made, and that is has not been previously submitted to the Open University or to any other institution for a degree, diploma or other qualification. 2 Abstract The focus of my thesis is the reception of classical myth in Georgian Britain as exemplified by responses to the garden imagery at Stourhead, Wiltshire. Previous explanations have tended to the view that the gardens were designed to recapitulate Virgil’s Aeneid.
    [Show full text]
  • Automatic Meaning Discovery Using Google
    Automatic Meaning Discovery Using Google Paul Vitanyi CWI, University of Amsterdam, National ICT Australia Joint work with Rudi Cilibrasi New Scientist, Jan. 29, 2005 Slashdot: News for Nerds; Stuff that Matters, Jan. 28, 2005 Dutch Radio: TROS Radio Online, March 8, 2005 The Problem: Given: Literal objects (binary files) 1 2 3 45 Determine: “Similarity” Distance Matrix (distances between every pair) Applications: Clustering, Classification, Evolutionary trees of Internet documents, computer programs, chain letters, genomes, languages, texts, music pieces, ocr, …… TOOL: z Information Distance (Li, Vitanyi, 96; Bennett,Gacs,Li,Vitanyi,Zurek, 98) D(x,y) = min { |p|: p(x)=y & p(y)=x} Binary program for a Universal Computer (Lisp, Java, C, Universal Turing Machine) Theorem (i) D(x,y) = max {K(x|y),K(y|x)} Kolmogorov complexity of x given y, defined as length of shortest binary ptogram that outputs x on input y. (ii) D(x,y) ≤D’(x,y) Any computable distance satisfying ∑2 --D’(x,y) ≤ 1 for every x. y (iii) D(x,y) is a metric. However: z x X’ Y Y’ D(x,y)=D(x’,y’) = But x and y are much more similar than x’ and y’ z So, we Normalize: Li Badger Chen Kwong Kearney Zhang 01 Li Vitanyi 01/02 Li Chen Li Ma Vitanyi 04 z d(x,y) = D(x,y) Max {K(x),K(y)} Normalized Information Distance (NID) The “Similarity metric” Properties NID: z Theorem: (i) 0 ≤ d(x,y) ≤ 1 (ii) d(x,y) is a metric symmetric,triangle inequality,: d(x,x)=0 (iii) d(x,y) is universal d(x,y) ≤ d’(x,y) for every computable, normalized (0≤d’(x,y)≤1) distance satisfying standard “density” condition.
    [Show full text]
  • Illustrated Catalogue of Magic Lanterns
    OUR SPECIALTIES. 2. 3. 4. and Stereopticon Com- I —Dr. McIntosh Solar Microscope 5. bination. —McIntosh Combination Stereopticon. — McIntosh Professional Microscope. —Mclntosh-lves Saturator. —McIntosh Sciopticon. 6—Everything in Projection Apparatus. will be supplied Specialties manufactured or sold by other houses furnished to illustrate almost any at their advertised prices. Slides colored slides painted to order by the best artists of •ubject ; also the day. We have a commodious room fitted up to exhibit the practical working of our apparatus to prospective purchasers. TERMS. Registered Let- i. —Cash in current funds, which may be sent by sent C. O. D., ter, Draft, Postal Money Order or Express. Goods balance provided twenty-five per cent of bill is sent with order, the to be collected by the Express Company. greatest care to avoid 2 —All goods will be packed with the foi them breakage in transportation, but we cannot be responsible after leaving our premises, except under special contract. reported within ten days from 3. —Any errors in invoice must be receipt of goods. all no old stock. Our Goods are new ; we have and Nos. 141 AND 143 Wabash Ave„ CHICAGO, ILLS., U. S. A. ILLUSTRATED CATALOGUE Stereopticons, Sciopticons, DISSOLVING VIEW APPARATUS, MICROSCOPES, SOLAR MICROSCOPE STEREOPTICON COMBINATION, OBJECTIVES, PHOTOGRAPHIC TRANSPARENCIES, Artistically Colored Views and Microscopical Preparations. MANUFACTURED AND IMPORTED BY THE OPTICAL DEPARTMENT OP THE McIntosh Battery and Optical Co., Nos. 141 and 143 Wabash Ave, CHICAGO, ILLS,, U. S. A. THE WORLD’S INDUSTRIAL -A-isrio C&tlott Centennial ^Exposition* GEI^FIBIGAJFE OB AWAI^D dr. ^zccinBrTOSE:, UNITED STATES, FOE SOLAS MICROSCOPES AND OPTICAL INSTBUIEMTS, Sc.
    [Show full text]
  • Observations Made During a Tour Through Parts of England, Scotland
    This is a reproduction of a library book that was digitized by Google as part of an ongoing effort to preserve the information in books and make it universally accessible. https://books.google.com OBSERVATIO NS MADE DURING A T O U R THROUGH PARTS OP ENGLAND, SCOTLAND, and WALES. CONTENTS. Letter I. — Page i. TOPISTOLART introduction — The cause of travelling traced to its source — Of man in an uncultivated Jlate — In the first stages of society — In a more civilized condition — 'The advantages arising from travelling — "The different closes of travellers described — Observations on the extent of the metropolis of England — Pleasures attainable in London — Reflections on the wretched sttuation of women of the town, and on their seducers — A story of uncommon resolution — Of the Opera, Pantheon, Play-houses, &c. Letter II. — Page i3. Observations on sundry places in a journey from London to Bath, Richmond, Windsor — Meditations on human nature, excited by a walk on the terrace of Windsor castle. Letter III. — Page i9. The journey continued — Eaton college — The advantages and disad vantages of a public and of a private education pointed out, and the . preference given to the former — Account of an abrupt secession upon some disgust of the scholars belonging to Eaton school — Maidenhead bridge — Cliefden house — The city of Bath — Its antiquity — Baths — Quality of the waters — Buildings — Amusements — Prior park, with a poetical description of it by Mrs. Chandler. Letter IV. — Page 30. A tour from Bath through some of the southern parts of England . — Mendip-hills — The city of Wells — Its cathedral, and public build ings — Instance of filial ajfeclion — Ancient tombs — The library — A literary imposition detebled — Description of Okey-Hole, a famous cavern a near CONTENTS.
    [Show full text]
  • Heres 29.05.11 Aarestrup, Emil: in Clear October Moonlight 02.10.13/25.08.11 Aarestrup, Emil: What Do They Mean
    ALPHABETICAL LIST OF POEMS Aakjær, Jeppe: Evening 08.01.15/27.10.10 Aakjær, Jeppe: I bear with a smile my burden 25.04.16 Aakjær, Jeppe: Jens Roadman 03.07.20/15.11.09 Aakjær, Jeppe: May Night 30.04.18/09.08.10 Aakjær, Jeppe: Now the day is full of song 27.05.17/13.04.10 Aakjær, Jeppe: The Hawk 26.08.19/22.01.16/29.03.10/14.09.12 Aamodt, Bjørn: Anchorage 13.05.12 Aamodt, Bjørn: Howl 05.12.20/18.05.12 Aamodt, Bjørn: Nothing 14.05.12 Aarestrup, Emil: Accept this kiss 13.10.13/08.01.10 Aarestrup, Emil: Are you a Christian? 18.01.11 Aarestrup, Emil: Admission 14.01.11 Aarestrup, Emil: Fear 17.01.13 Aarestrup, Emil: Hemispheres 29.05.11 Aarestrup, Emil: In clear October moonlight 02.10.13/25.08.11 Aarestrup, Emil: What do they mean... 20.11.09 Aarestrup, Emil: You were the rose in flower 27.08.12 Achterberg, Gerrit: Code 01.12.09 Albinus, Johann G.: All on earth must end by dying (Chorale) 20.10.14 Andersen, Astrid Hjertenæs: Horses in rain 07.08.20/08.12.15/07.05.12 Andersen, Astrid Hjertenæs: I would fly home 01.07.19 Andersen, Astrid Hjertenæs: The horse’s eye 02.07.19 Andersen, Benny: Adultery and love 25.12.12 Andersen, Benny: Closet Swedes 13.06.20/15.06.15 Andersen, Benny: Diet 09.11.15/28.12.12 Andersen, Benny: Fornication and love 05.04.18 Andersen, Benny: Healthy advice 12.11.15 Andersen, Benny: High time 18.02.10 Andersen, Benny: Kierkegaard on a bicycle 17.06.21/04.04.18 Andersen, Benny: Letter from you 12.10.16 Andersen, Benny: Little Song for Nina 13.02.18/29.09.16/06.12.09 Andersen, Benny: Longing for Sweden 02.10.16 Andersen,
    [Show full text]
  • Statistical Inference Through Data Compression
    Statistical Inference Through Data Compression Rudi Cilibrasi Statistical Inference Through Data Compression ILLC Dissertation Series DS-2007-01 For further information about ILLC-publications, please contact Institute for Logic, Language and Computation Universiteit van Amsterdam Plantage Muidergracht 24 1018 TV Amsterdam phone: +31-20-525 6051 fax: +31-20-525 5206 e-mail: [email protected] homepage: http://www.illc.uva.nl/ Statistical Inference Through Data Compression ACADEMISCH PROEFSCHRIFT ter verkrijging van de graad van doctor aan de Universiteit van Amsterdam op gezag van de Rector Magnificus prof.mr. P.F. van der Heijden ten overstaan van een door het college voor promoties ingestelde commissie, in het openbaar te verdedigen in de Aula der Universiteit op vrijdag 23 februari 2007, te 10.00 uur door Rudi Langston Cilibrasi geboren te Brooklyn, New York, Verenigde Staten Promotiecommissie: Promotor: Prof.dr.ir. P.M.B. Vitányi Co-promotor: Dr. P.D. Grünwald Overige leden: Prof.dr. P. Adriaans Prof.dr. R. Dijkgraaf Prof.dr. M. Li Prof.dr. B. Ryabko Prof.dr. A. Siebes Dr. L. Torenvliet Faculteit der Natuurwetenschappen, Wiskunde en Informatica Copyright © 2007 by Rudi Cilibrasi Printed and bound by PRINTPARTNERS IPSKAMP. ISBN: 90–6196–540–3 My arguments will be open to all, and may be judged of by all. – Publius v Contents 1 Introduction 1 1.1 Overview of this thesis ............................... 1 1.1.1 Data Compression as Learning ....................... 1 1.1.2 Visualization ................................ 3 1.1.3 Learning From the Web .......................... 5 1.1.4 Clustering and Classification ........................ 5 1.2 Gestalt Historical Context .............................. 5 1.3 Contents of this Thesis ..............................
    [Show full text]
  • Vestures of the Past: the Other Historicisms of Victorian Aesthetics
    University of Pennsylvania ScholarlyCommons Publicly Accessible Penn Dissertations 2019 Vestures Of The Past: The Other Historicisms Of Victorian Aesthetics Timothy Chandler University of Pennsylvania, [email protected] Follow this and additional works at: https://repository.upenn.edu/edissertations Part of the Comparative Literature Commons Recommended Citation Chandler, Timothy, "Vestures Of The Past: The Other Historicisms Of Victorian Aesthetics" (2019). Publicly Accessible Penn Dissertations. 3434. https://repository.upenn.edu/edissertations/3434 This paper is posted at ScholarlyCommons. https://repository.upenn.edu/edissertations/3434 For more information, please contact [email protected]. Vestures Of The Past: The Other Historicisms Of Victorian Aesthetics Abstract The importance of history to Victorian culture, and to nineteenth-century Europe more generally, is readily apprehended not only from its historiography, but also from its philosophy, art, literature, science, politics, and public institutions. This dissertation argues that the discourse of aesthetics in Victorian Britain constitutes a major area of historical thinking that, in contrast to the scientific and philosophical historicisms that dominated nineteenth-century European intellectual culture, focuses on individual experience. Its starting point is Walter Pater’s claim that we are born “clothed in a vesture of the past”—that is, that our relation to ourselves is historical and that our relation to history is aesthetic. Through readings of aesthetic theory and art criticism, along with works of historiography, fiction, poetry, and visual art, this dissertation explores some of the ways in which Victorian aesthetics addresses the problem of the relationship between the sensuous representation and experience of the historical, on the one hand, and the subjects of such representation and experience, on the other.
    [Show full text]
  • Automatic Semantics Using Google
    Automatic Semantics Using Google Rudi Cilibrasi¤ Paul Vitanyi† CWI CWI, University of Amsterdam, National ICT of Australia Abstract We have found a method to automatically extract the meaning of words and phrases from the world-wide-web using Google page counts. The approach is novel in its unrestricted problem domain, simplicity of implementation, and manifestly ontological underpinnings. The world-wide-web is the largest database on earth, and the latent se- mantic context information entered by millions of independent users averages out to provide automatic meaning of useful quality. We demonstrate positive correlations, evidencing an underlying semantic structure, in both numeri- cal symbol notations and number-name words in a variety of natural languages and contexts. Next, we demonstrate the ability to distinguish between colors and numbers, and to distinguish between 17th century Dutch painters; the ability to understand electrical terms, religious terms, emergency incidents, and we conduct a massive experiment in understanding WordNet categories; the ability to do a simple automatic English-Spanish translation. 1 Introduction Objects can be given literally, like the literal four-letter genome of a mouse, or the literal text of War and Peace by Tolstoy. For simplicity we take it that all meaning of the object is represented by the literal object itself. Objects can also be given by name, like “the four-letter genome of a mouse,” or “the text of War and Peace by Tolstoy.” There are also objects that cannot be given literally, but only by name and acquire their meaning from their contexts in background common knowledge in humankind, like “home” or “red.” To make computers more intelligent one would like to represent meaning in computer-digestable form.
    [Show full text]