Google Origins.Pdf

Total Page:16

File Type:pdf, Size:1020Kb

Google Origins.Pdf Source: NSF: www.nsf.gov/discoveries/disc_summ.jsp?cntn_id=100660 Abridged version of original article by David Hart On the Origins of Google: Even in the early days of the Internet, people saw the need for better interfaces to growing data collections. A graduate student supported by an NSF digital library project at Stanford University uncovered the missing links in Web page ranking. In the primordial ooze of Internet content (1993), fewer than 100 Web sites inhabited the planet. Early clans of information seekers hunted for data among the far larger populations of text-only Gopher sites and FTP file-sharing servers. This was the world in the years before Google. Even in this primitive Internet world, the need for more accessible interfaces to growing data collections had already been recognized. The National Science Foundation led the multi-agency Digital Library Initiative (DLI) that, in 1994, made its first six awards. One of those awards supported a Stanford University project led by professors Hector Garcia-Molina and Terry Winograd. None of the early DLI proposals -- submitted before the World Wide Web experienced its Cambrian explosion -- explicitly included research into the Web. However, by the time DLI funding began, the information landscape had changed. In 1994, some of the first Web search tools crawled out of the Internet sea. Two Stanford students started Yahoo!, a manually constructed "table of contents" for Web sites. Other early search engines emerged, such as Lycos and WebCrawler, and began automatically indexing Web pages, focusing on keyword-based techniques to rank search results. Around the same time, one of the graduate students funded under the NSF-supported DLI project at Stanford took an interest in the Web as a "collection." The student was Larry Page. Page uncovered the missing links, so to speak, in Web page ranking. His evolutionary leap was to recognize that the act of linking one page to another required conscious effort, which in turn was evidence of human judgment about the link's destination. Individually, each link was a simple but effective tool. But collectively, millions of these links provided a key adaptation for the natural selection of search results. Page was soon joined by Sergey Brin, another Stanford graduate student working on the DLI project. (Brin was supported by an NSF Graduate Student Fellowship.) Together, Page and Brin constructed an ambitious prototype in their Stanford student offices. The equipment for the prototype, called BackRub, was funded by the DLI project and other industrial contributions. The prototype used well-established technology to crawl from page to page by following links. However, in addition to compiling a standard text index, the prototype also mapped out a vast family tree that reflected the Web links among pages. The pair then developed the PageRank method that, in short, ranks a particular Web page highly if many other highly ranked Web pages link to it. Page and Brin tested the fitness of the approach on live Web data -- initially a test set of 24 million pages. By late 1997, the BackRub approach proved to be sound, expandable and popular. By the end of the Early DLI Age in 1998, Page and Brin obtained funding and moved their growing facility from the Stanford campus to a friend’s garage and incorporated Google, Inc. The rest, as they say, is history. .
Recommended publications
  • Intro to Google for the Hill
    Introduction to A company built on search Our mission Google’s mission is to organize the world’s information and make it universally accessible and useful. As a first step to fulfilling this mission, Google’s founders Larry Page and Sergey Brin developed a new approach to online search that took root in a Stanford University dorm room and quickly spread to information seekers around the globe. The Google search engine is an easy-to-use, free service that consistently returns relevant results in a fraction of a second. What we do Google is more than a search engine. We also offer Gmail, maps, personal blogging, and web-based word processing products to name just a few. YouTube, the popular online video service, is part of Google as well. Most of Google’s services are free, so how do we make money? Much of Google’s revenue comes through our AdWords advertising program, which allows businesses to place small “sponsored links” alongside our search results. Prices for these ads are set by competitive auctions for every search term where advertisers want their ads to appear. We don’t sell placement in the search results themselves, or allow people to pay for a higher ranking there. In addition, website managers and publishers take advantage of our AdSense advertising program to deliver ads on their sites. This program generates billions of dollars in revenue each year for hundreds of thousands of websites, and is a major source of funding for the free content available across the web. Google also offers enterprise versions of our consumer products for businesses, organizations, and government entities.
    [Show full text]
  • Oral History Interview with Severo Ornstein and Laura Gould
    An Interview with SEVERO ORNSTEIN AND LAURA GOULD OH 258 Conducted by Bruce H. Bruemmer on 17 November 1994 Woodside, CA Charles Babbage Institute Center for the History of Information Processing University of Minnesota, Minneapolis Severo Ornstein and Laura Gould Interview 17 November 1994 Abstract Ornstein and Gould discuss the creation and expansion of Computer Professionals for Social Responsibility (CPSR). Ornstein recalls beginning a listserve at Xerox PARC for those concerned with the threat of nuclear war. Ornstein and Gould describe the movement of listserve participants from e-mail discussions to meetings and forming a group concerned with computer use in military systems. The bulk of the interview traces CPSR's organizational growth, fundraising, and activities educating the public about computer-dependent weapons systems such as proposed in the Strategic Defense Initiative (SDI). SEVERO ORNSTEIN AND LAURA GOULD INTERVIEW DATE: November 17, 1994 INTERVIEWER: Bruce H. Bruemmer LOCATION: Woodside, CA BRUEMMER: As I was going through your oral history interview I began to wonder about your political background and political background of the people who had started the organization. Can you characterize that? You had mentioned in your interview that you had done some antiwar demonstrations and that type of thing which is all very interesting given also your connection with DARPA. ORNSTEIN: Yes, most of the people at DARPA knew this. DARPA was not actually making bombs, you understand, and I think in the interview I talked about my feelings about DARPA at the time. I was strongly against the Vietnam War and never hid it from anyone. I wore my "Resist" button right into the Pentagon and if they didn't like it they could throw me out.
    [Show full text]
  • Backmatter In
    Appendices Bibliography This book is a little like the previews of coming attractions at the movies; it's meant to whet your appetite in several directions, without giving you the complete story about anything. To ®nd out more, you'll have to consult more specialized books on each topic. There are a lot of books on computer programming and computer science, and whichever I chose to list here would be out of date by the time you read this. Instead of trying to give current references in every area, in this edition I'm listing only the few most important and timeless books, plus an indication of the sources I used for each chapter. Computer science is a fast-changing ®eld; if you want to know what the current hot issues are, you have to read the journals. The way to start is to join the Association for Computing Machinery, 1515 Broadway, New York, NY 10036. If you are a full-time student you are eligible for a special rate for dues, which as I write this is $25 per year. (But you should write for a membership application with the current rates.) The Association publishes about 20 monthly or quarterly periodicals, plus the newsletters of about 40 Special Interest Groups in particular ®elds. 341 Read These! If you read no other books about computer science, you must read these two. One is an introductory text for college computer science students; the other is intended for a nonspecialist audience. Abelson, Harold, and Gerald Jay Sussman with Julie Sussman, Structure and Interpretation of Computer Programs, MIT Press, Second Edition, 1996.
    [Show full text]
  • Ali Aydar Anita Borg Alfred Aho Bjarne Stroustrup Bill Gates
    Ali Aydar Ali Aydar is a computer scientist and Internet entrepreneur. He is the chief executive officer at Sporcle. He is best known as an early employee and key technical contributor at the original Napster. Aydar bought Fanning his first book on programming in C++, the language he would use two years later to build the Napster file-sharing software. Anita Borg Anita Borg (January 17, 1949 – April 6, 2003) was an American computer scientist. She founded the Institute for Women and Technology (now the Anita Borg Institute for Women and Technology). While at Digital Equipment, she developed and patented a method for generating complete address traces for analyzing and designing high-speed memory systems. Alfred Aho Alfred Aho (born August 9, 1941) is a Canadian computer scientist best known for his work on programming languages, compilers, and related algorithms, and his textbooks on the art and science of computer programming. Aho received a B.A.Sc. in Engineering Physics from the University of Toronto. Bjarne Stroustrup Bjarne Stroustrup (born 30 December 1950) is a Danish computer scientist, most notable for the creation and development of the widely used C++ programming language. He is a Distinguished Research Professor and holds the College of Engineering Chair in Computer Science. Bill Gates 2 of 10 Bill Gates (born October 28, 1955) is an American business magnate, philanthropist, investor, computer programmer, and inventor. Gates is the former chief executive and chairman of Microsoft, the world’s largest personal-computer software company, which he co-founded with Paul Allen. Bruce Arden Bruce Arden (born in 1927 in Minneapolis, Minnesota) is an American computer scientist.
    [Show full text]
  • The Evolution of Lisp
    1 The Evolution of Lisp Guy L. Steele Jr. Richard P. Gabriel Thinking Machines Corporation Lucid, Inc. 245 First Street 707 Laurel Street Cambridge, Massachusetts 02142 Menlo Park, California 94025 Phone: (617) 234-2860 Phone: (415) 329-8400 FAX: (617) 243-4444 FAX: (415) 329-8480 E-mail: [email protected] E-mail: [email protected] Abstract Lisp is the world’s greatest programming language—or so its proponents think. The structure of Lisp makes it easy to extend the language or even to implement entirely new dialects without starting from scratch. Overall, the evolution of Lisp has been guided more by institutional rivalry, one-upsmanship, and the glee born of technical cleverness that is characteristic of the “hacker culture” than by sober assessments of technical requirements. Nevertheless this process has eventually produced both an industrial- strength programming language, messy but powerful, and a technically pure dialect, small but powerful, that is suitable for use by programming-language theoreticians. We pick up where McCarthy’s paper in the first HOPL conference left off. We trace the development chronologically from the era of the PDP-6, through the heyday of Interlisp and MacLisp, past the ascension and decline of special purpose Lisp machines, to the present era of standardization activities. We then examine the technical evolution of a few representative language features, including both some notable successes and some notable failures, that illuminate design issues that distinguish Lisp from other programming languages. We also discuss the use of Lisp as a laboratory for designing other programming languages. We conclude with some reflections on the forces that have driven the evolution of Lisp.
    [Show full text]
  • Larry Page Developing the Largest Corporate Foundation in Every Successful Company Must Face: As Google Word.” the United States
    LOWE —continued from front flap— Praise for $19.95 USA/$23.95 CAN In addition to examining Google’s breakthrough business strategies and new business models— In many ways, Google is the prototype of a which have transformed online advertising G and changed the way we look at corporate successful twenty-fi rst-century company. It uses responsibility and employee relations——Lowe Google technology in new ways to make information universally accessible; promotes a corporate explains why Google may be a harbinger of o 5]]UZS SPEAKS culture that encourages creativity among its where corporate America is headed. She also A>3/9A addresses controversies surrounding Google, such o employees; and takes its role as a corporate citizen as copyright infringement, antitrust concerns, and “It’s not hard to see that Google is a phenomenal company....At Secrets of the World’s Greatest Billionaire Entrepreneurs, very seriously, investing in green initiatives and personal privacy and poses the question almost Geico, we pay these guys a whole lot of money for this and that key g Sergey Brin and Larry Page developing the largest corporate foundation in every successful company must face: as Google word.” the United States. grows, can it hold on to its entrepreneurial spirit as —Warren Buffett l well as its informal motto, “Don’t do evil”? e Following in the footsteps of Warren Buffett “Google rocks. It raised my perceived IQ by about 20 points.” Speaks and Jack Welch Speaks——which contain a SPEAKS What started out as a university research project —Wes Boyd conversational style that successfully captures the conducted by Sergey Brin and Larry Page has President of Moveon.Org essence of these business leaders—Google Speaks ended up revolutionizing the world we live in.
    [Show full text]
  • Should Google Be Taken at Its Word?
    CAN GOOGLE BE TRUSTED? SHOULD GOOGLE BE TAKEN AT ITS WORD? IF SO, WHICH ONE? GOOGLE RECENTLY POSTED ABOUT “THE PRINCIPLES THAT HAVE GUIDED US FROM THE BEGINNING.” THE FIVE PRINCIPLES ARE: DO WHAT’S BEST FOR THE USER. PROVIDE THE MOST RELEVANT ANSWERS AS QUICKLY AS POSSIBLE. LABEL ADVERTISEMENTS CLEARLY. BE TRANSPARENT. LOYALTY, NOT LOCK-IN. BUT, CAN GOOGLE BE TAKEN AT ITS WORD? AND IF SO, WHICH ONE? HERE’S A LOOK AT WHAT GOOGLE EXECUTIVES HAVE SAID ABOUT THESE PRINCIPLES IN THE PAST. DECIDE FOR YOURSELF WHO TO TRUST. “DO WHAT’S BEST FOR THE USER” “DO WHAT’S BEST FOR THE USER” “I actually think most people don't want Google to answer their questions. They want Google to tell them what they should be doing next.” Eric Schmidt The Wall Street Journal 8/14/10 EXEC. CHAIRMAN ERIC SCHMIDT “DO WHAT’S BEST FOR THE USER” “We expect that advertising funded search engines will be inherently biased towards the advertisers and away from the needs of consumers.” Larry Page & Sergey Brin Stanford Thesis 1998 FOUNDERS BRIN & PAGE “DO WHAT’S BEST FOR THE USER” “The Google policy on a lot of things is to get right up to the creepy line.” Eric Schmidt at the Washington Ideas Forum 10/1/10 EXEC. CHAIRMAN ERIC SCHMIDT “DO WHAT’S BEST FOR THE USER” “We don’t monetize the thing we create…We monetize the people that use it. The more people use our products,0 the more opportunity we have to advertise to them.” Andy Rubin In the Plex SVP OF MOBILE ANDY RUBIN “PROVIDE THE MOST RELEVANT ANSWERS AS QUICKLY AS POSSIBLE” “PROVIDE THE MOST RELEVANT ANSWERS AS QUICKLY
    [Show full text]
  • Eric Schmidt
    Eric Schmidt Chairman Dr. Eric Schmidt Schmidt Futures Nominated by then-Chairman and Current Ranking Member Mac Thornberry (R-TX), House Armed Services Committee Dr. Eric Schmidt is the technical advisor to the board of Alphabet where he was formerly the executive chairman. As executive chairman, he was responsible for the external matters of all of the holding company's businesses, including Google Inc., advising their CEOs and leadership on business and policy issues. Prior to the establishment of Alphabet, Eric was the chairman of Google Inc. for four years. From 2001-2011, Eric served as Google’s chief executive officer, overseeing the company’s technical and business strategy alongside founders Sergey Brin and Larry Page. Under his leadership, Google dramatically scaled its infrastructure and diversified its product offerings while maintaining a strong culture of innovation, growing from a Silicon Valley startup to a global leader in technology. Prior to joining Google, Eric was the chairman and CEO of Novell and chief technology officer at Sun Microsystems, Inc. Previously, he served on the research staff at Xerox Palo Alto Research Center (PARC), Bell Laboratories and Zilog. He holds a bachelor’s degree in electrical engineering from Princeton University as well as a master’s degree and Ph.D. in computer science from the University of California, Berkeley. Eric was elected to the National Academy of Engineering in 2006 and inducted into the American Academy of Arts and Sciences as a fellow in 2007. Since 2008, he has been a trustee of the Institute for Advanced Study in Princeton, New Jersey.
    [Show full text]
  • Page Ndcal Complaint
    1 JOHN JASNOCH SCOTT+SCOTT, ATTORNEYS AT LAW, LLP 2 707 Broadway, Suite 1000 San Diego, California 92101 3 Telephone: (619) 233-4565 Facsimile: (619) 233-0508 4 Email: [email protected] 5 THOMAS L. LAUGHLIN, IV SCOTT+SCOTT, ATTORNEYS AT LAW, LLP 6 The Chrysler Building 405 Lexington Avenue, 40th Floor 7 New York, New York 10174 Telephone: (212) 223-6444 8 Facsimile: (212) 223-6334 9 Attorneys for Plaintiff 10 [Additional counsel on signature page.] 11 12 UNITED STATES DISTRICT COURT 13 NORTHERN DISTRICT OF CALIFORNIA 14 15 WEST PALM BEACH FIRE PENSION FUND, Case No. 16 Plaintiff, 17 v. VERIFIED SHAREHOLDER 18 LAWRENCE “LARRY” PAGE, SERGEY M. DERIVATIVE COMPLAINT BRIN, ERIC E. SCHMIDT, L. JOHN DOERR, 19 DIANE B. GREENE, JOHN L. HENNESSY, ANN MATHER, PAUL S. OTELLINI, K. RAM 20 SHRIRAM, SHIRLEY M. TILGHMAN, MICHAEL J. MORITZ, ARTHUR D. LEVINSON, 21 ROBERT ALAN EUSTACE, OMID R. KORDESTANI, JONATHAN J. ROSENBERG, 22 SHONA L. BROWN, and ARNNON GESHURI, 23 Defendants, 24 and 25 GOOGLE, INC, 26 Nominal Defendant. 27 28 VERIFIED SHAREHOLDER DERIVATIVE COMPLAINT 1 PROLOGUE 2 “[T]here is ample evidence of an overarching conspiracy between” Google and the other defendants, and of “evidence of Defendants’ rigid wage structures and 3 internal equity concerns, along with statements from Defendants’ own executives, are likely to prove compelling in establishing the impact of the anti-solicitation 4 agreements . .” 5 In re High-Tech Employee Antitrust Litig., No. 11-cv-2509, 2014 WL 3917126, at *16 (N.D. 6 Cal. Aug. 8, 2014). 7 Plaintiff West Palm Beach Fire Pension Fund (“West Palm” or “Plaintiff”), on 8 behalf of Google, Inc.
    [Show full text]
  • A Debate on Teaching Computing Science
    Teaching Computing Science t the ACM Computer Science Conference last Strategic Defense Initiative. William Scherlis is February, Edsger Dijkstra gave an invited talk known for his articulate advocacy of formal methods called “On the Cruelty of Really Teaching in computer science. M. H. van Emden is known for Computing Science.” He challenged some of his contributions in programming languages and the basic assumptions on which our curricula philosophical insights into science. Jacques Cohen Aare based and provoked a lot of discussion. The edi- is known for his work with programming languages tors of Comwunications received several recommenda- and logic programming and is a member of the Edi- tions to publish his talk in these pages. His comments torial Panel of this magazine. Richard Hamming brought into the foreground some of the background received the Turing Award in 1968 and is well known of controversy that surrounds the issue of what be- for his work in communications and coding theory. longs in the core of a computer science curriculum. Richard M. Karp received the Turing Award in 1985 To give full airing to the controversy, we invited and is known for his contributions in the design of Dijkstra to engage in a debate with selected col- algorithms. Terry Winograd is well known for his leagues, each of whom would contribute a short early work in artificial intelligence and recent work critique of his position, with Dijkstra himself making in the principles of design. a closing statement. He graciously accepted this offer. I am grateful to these people for participating in We invited people from a variety of specialties, this debate and to Professor Dijkstra for creating the backgrounds, and interpretations to provide their opening.
    [Show full text]
  • Microsoft Outlook
    From: Scott Wilson <[email protected]> Sent: Wednesday, November 13, 2019 5:18 PM To: aipartnership Subject: Re: Request for Comments on IP protection for AI innovation Docket No. PTO- C-2019-0038 It appears that companies such as Google and others are using copyrighted material for machine learning training datasets and creating for-profit products and patents that are partially dependent on copyrighted works. For instance, Google AI is using YouTube videos for machine learning products, and they have posted this information on their GoogleAI blog on multiple occasions. There are Google Books ML training datasets available in multiple locations on the internet, including at Amazon.com. There are two types of machine learning processes that I am aware of - expressive and non-expressive. A non- expressive ML training dataset would catalogue how many times a keyword appears in a copyrighted text, and appears to be permitted by fair use. However, expressive ML training analyzing artistic characteristics of copyrighted works to create patented products and/or processes does not appear to be exempted by fair use copyright laws. The whole issue revolves around the differences between research and product/profit creation in a corporate context. I am including an article below that examines these legal issues in more detail. https://lawandarts.org/wp-content/uploads/sites/14/2017/12/41.1_Sobel-FINAL.pdf or www.bensobel.org - first article on page links to the link above. As a copyright holder, I am concerned that this is a widespread industry practice of using copyrighted material without the knowledge or consent of copyright holders to create for-profit AI products that not only exploits copyright creators, but increasingly separates them from the ability to profit from their own work.
    [Show full text]
  • The Pagerank Citation Ranking: Bringing Order to The
    The PageRank Citation Ranking: Bringing Order to the Web January 29, 1998 Abstract The imp ortance of a Web page is an inherently sub jective matter, which dep ends on the readers interests, knowledge and attitudes. But there is still much that can b e said ob jectively ab out the relative imp ortance of Web pages. This pap er describ es PageRank, a metho d for rating Web pages ob jectively and mechanically, e ectively measuring the human interest and attention devoted to them. We compare PageRank to an idealized random Web surfer. We show how to eciently compute PageRank for large numb ers of pages. And, we showhow to apply PageRank to search and to user navigation. 1 Intro duction and Motivation The World Wide Web creates many new challenges for information retrieval. It is very large and heterogeneous. Current estimates are that there are over 150 million web pages with a doubling life of less than one year. More imp ortantly, the web pages are extremely diverse, ranging from "What is Jo e having for lunchtoday?" to journals ab out information retrieval. In addition to these ma jor challenges, search engines on the Web must also contend with inexp erienced users and pages engineered to manipulate search engine ranking functions. However, unlike " at" do cument collections, the World Wide Web is hyp ertext and provides considerable auxiliary information on top of the text of the web pages, such as link structure and link text. In this pap er, we take advantage of the link structure of the Web to pro duce a global \imp ortance" ranking of every web page.
    [Show full text]