05_597582_ch01.qxd 8/5/05 10:23 PM Page 3
Getting Ready chapter to Hack
hat are RSS and Atom feeds? If you’re reading this, it’s pretty likely you’ve already seen links to feeds (things such as W“Syndicate this Site” or the ubiquitous orange-and-white “RSS” buttons) starting to pop up on all of your favorite sites. In fact, you might already have secured a feed reader or aggregator and stopped visiting most of your favorite sites in person. The bookmarks in your browser have started in this chapter gathering dust since you stopped clicking through them every day. And, if you’re like some feed addicts, you’re keeping track of what’s new from ˛ Taking a Crash more Web sites and news sources than you ever have before, or even thought Course in RSS and possible. Atom Feeds If you’re a voracious infovore like me and this story doesn’t sound familiar, you’re in for a treat. RSS and Atom feeds—collectively known as syndication ˛ Gathering Tools feeds—are behind one of the biggest changes to sweep across the Web since the invention of the personal home page. These syndication feeds make it easy for machines to surf the Web, so you don’t have to. So far, syndication feed readers won’t actually read or intelligently digest content on the Web for you, but they will let you know when there’s some- thing new to peruse and can collect it in an inbox, like email. In fact, these feeds and their readers layer the Web with features not alto- gether different than email newsletters and Usenet newsgroups, but with much more control over what you receive and none of the spam. With the time you used to spend browsing through bookmarked sites checking for updates, you can now just get straight to reading new stuff presented directly. It’s almost as though someone is publishing a newspaper tailored just for you. From the publishing side ofCOPYRIGHTED things, when you serve up your messages MATERIAL and content using syndication feeds, you make it so much easier for someone to keep track of your updates—and so much more likely that they will stay in touch because, once someone has subscribed to your feed, it’s practically effortless to stay tuned in. As long as you keep pushing out things worthy of an audience’s attention, syndication feeds make it easier to slip into their busy schedules and stay there. 05_597582_ch01.qxd 8/5/05 10:23 PM Page 4
4 Part I — Consuming Feeds
Furthermore, the way syndication feeds slice up the Web into timely capsules of microcontent allows you to manipulate, filter, and remix streams of fluid online content in a way never seen before. With the right tools, you can work toward applications that help more cleverly digest content and sift through the firehose of information available. You can gather resources and collectively republish, acting as the editorial newsmaster of your own personal news wire. You can train learning machines to filter for items that match your interests. And the possibilities offered by syndication will only expand as new kinds of information and new types of media are carried and referenced by feed items. But that’s enough gushing about syndication feeds. Let’s get to work figuring out what these things are, under the hood, and how you can actually do some of the things promised earlier.
Taking a Crash Course in RSS and Atom Feeds If you’re already familiar with all the basics of RSS and Atom feeds, you can skip ahead to the section “Gathering Tools” later in this chapter. But, just in case you need to be brought up to speed, this section takes a quick tour of feed consumers, feed producers, and the basics of feed anatomy. Catching Up with Feed Readers and Aggregators One of the easiest places to start with an introduction to syndication feeds is with feed aggre- gators and readers, because the most visible results of feeds start there. Though you will be building your own aggregator soon enough, having some notion of what sorts of things other working aggregators do can certainly give you some ideas. It also helps to have other aggrega- tors around as a source of comparison once you start creating some feeds. For the most part, you’ll find feed readers fall into categories such as the following: