Guest-Blogging.Pdf

Total Page:16

File Type:pdf, Size:1020Kb

Guest-Blogging.Pdf The decay and fall of guest blogging for SEO 2016-04-19, 1:36 AM HOME ABOUT ME GOOGLE/SEO DISCLAIMER DISCLOSURE Matt Cutts: Gadgets, Google, and SEO The decay and fall of guest blogging for SEO FOLLOW ME - Twitter Posted January 20, 2014 in Google/SEO - Google+ Okay, I’m calling it: if you’re using guest blogging as a way to gain links in 2014, you should probably stop. Why? Because over time it’s become a more and more spammy practice, and if you’re doing a Search this website … lot of guest blogging then you’re hanging out with really bad company. Search Back in the day, guest blogging used to be a respectable thing, much like getting a coveted, CATEGORIES respected author to write the introduction of your book. It’s not that way any more. Here’s an 30 days (40) example unsolicited, spam email that I recently received: Android (7) Books/Magazines (29) My name is XXXXXXX XXXXXXXX and I work as a content marketer for a high end digital Chrome (25) marketing agency in [a city halfway around the world]. I have been promoting high quality content Food (6) in select niches for our clients. Fun (47) Gadgets/Hack (96) We are always on the lookout for professional, high class sites to further promote our clients and Games (5) when I came across your blog I was very impressed with the fan following that you have Glossary/Definitions (2) established.I [sic] would love to speak to you regarding the possibility of posting some guest Google/SEO (554) articles on your blog. Should you be open to the idea, we can consider making suitable Halloween (18) contribution, befitting to high standard of services that your blog offers to larger audience. How to (60) iPhone (17) On my part, I assure you a high quality article that is- Leftovers (5) – 100% original Linux/Ubuntu (28) – Well written Movies/Videos (27) – Relevant to your audience and Music (6) – Exclusive to you Personal (139) Productivity (30) We can also explore including internal links to related articles across your site to help keep your Television (3) readers engaged with other content on your blog. Travel (30) All I ask in return is a dofollow link or two in the article body that will be relevant to your Uncategorized (4) audience and the article. We understand that you will want to approve the article, and I can assure Web/Net (74) you that we work with a team of highly talented writers, so we can guarantee that the article Weblog/blog (109) would be insightful and professionally written. We aim to write content that will benefit your loyal META readers. We are also happy to write on any topic, you suggest for us. Log in Entries RSS If you ignore the bad spacing and read the parts that I bolded, someone sent me a spam email Comments RSS offering money to get links that pass PageRank. That’s a clear violation of Google’s quality guidelines. WordPress.org Moreover, we’ve been seeing more and more reports of “guest blogging” that are really “paying for PageRank” or worse, “we’ll insert some spammy links on your blog without you realizing it.” ARCHIVES Archives Ultimately, this is why we can’t have nice things in the SEO space: a trend starts out as authentic. Select Month Then more and more people pile on until only the barest trace of legitimate behavior remains. We’ve reached the point in the downward spiral where people are hawking “guest post outsourcing” and writing articles about “how to automate guest blogging.” So stick a fork in it: guest blogging is done; it’s just gotten too spammy. In general I wouldn’t recommend accepting a guest blog post unless you are willing to vouch for someone personally or know them well. Likewise, I wouldn’t recommend relying on guest posting, guest blogging sites, or guest blogging SEO as a linkbuilding strategy. For historical reference, I’ll list a few videos and links to trace the decline of guest articles. Even back in 2012, I tried to draw a distinction between high-quality guest posts vs. spammier guest blogs: https://www.mattcutts.com/blog/guest-blogging/ Page 1 of 139 The decay and fall of guest blogging for SEO 2016-04-19, 1:36 AM What is Google's view on guest blogging for links? Unfortunately, a lot of people didn’t seem to hear me say to steer away from low-quality guest blog posting, so I did a follow-up video to warn folks away from spammy guest articles: Does Google take action on spammy guest blogging activities? In mid-2013, John Mueller gave spot on advice about nofollowing links in guest blog posts. I think by mid-2013, a ton of people saw the clear trend towards guest blogging being overused by a bunch of low-quality, spammy sites. Then a few months ago, I took a question about how to be a guest blogger without it looking like paying for links (even the question is a clue that guest blog posting has been getting spammier and spammier). I tried to find a sliver of daylight to talk about high-quality guest blog posts, but if you read the transcript you’ll notice that I ended up spending most of the time talking about low- quality/spam guest posting and guest articles. https://www.mattcutts.com/blog/guest-blogging/ Page 2 of 139 The decay and fall of guest blogging for SEO 2016-04-19, 1:36 AM How can I guest blog without it appearing as if I paid for links? And then in this video that we posted last month, even the question itself predicted that Google would take stronger action and asked about “guest blogging as spam”: What should I be aware of if I'm considering guest blogging? So there you have it: the decay of a once-authentic way to reach people. Given how spammy it’s become, I’d expect Google’s webspam team to take a pretty dim view of guest blogging going forward. Added: It seems like most people are getting the spirit of what I was trying to say, but I’ll add a bit more context. I’m not trying to throw the baby out with the bath water. There are still many good reasons to do some guest blogging (exposure, branding, increased reach, community, etc.). Those reasons existed way before Google and they’ll continue into the future. And there are absolutely some fantastic, high-quality guest bloggers out there. I changed the title of this post to make it more clear that I’m talking about guest blogging for search engine optimization (SEO) purposes. I’m also not talking about multi-author blogs. High-quality multi-author blogs like Boing Boing have been around since the beginning of the web, and they can be compelling, wonderful, and useful. I just want to highlight that a bunch of low-quality or spam sites have latched on to “guest blogging” as their link-building strategy, and we see a lot more spammy attempts to do guest blogging. Because of that, I’d recommend skepticism (or at least caution) when someone reaches out and offers you a guest blog article. 657 Responses to The decay and fall of guest blogging for SEO (Leave a comment) Conrad O'Connell https://www.mattcutts.com/blog/guest-blogging/ Page 3 of 139 The decay and fall of guest blogging for SEO 2016-04-19, 1:36 AM January 20, 2014 at 1:01 pm I think this paints too broad of a stroke. Sure, you shouldn’t be accepting random posts from people only looking for followed links: but that was never a legit tactic. Perhaps the act of guest blogging doesn’t need to go away: only the phrasing. For example, maybe we should think of guest blogging as a method for ONLY public relations, not for links. Matt Cutts January 20, 2014 at 3:34 pm Conrad, thanks for the feedback. I wanted to give people a heads up mainly because the practice has gotten so spammy. There are certainly reasonable and valid reasons to consider guest posts, but regular site owners should be aware of the risks as well. I was talking to a small business owner recently and she was delighted that someone wanted to write a guest blog post. She thought that her small blog was being validated, when in fact it was being targeted. Several years ago, I would have said that the default answer when someone proposed doing a guest blog post would “yes.” However, with the rapid rise of low-quality or spammy sites trying to build tons of links via guest blogging, so I’d say that the default answer now should be “no.” Of course, if you know the person writing the blog post well, or want to vouch for them, or if the author is happy to nofollow their links, then that changes the calculation–it’s much more likely that someone is looking for a new audience instead of a way to get keyword-rich links. Michael Aulia January 20, 2014 at 4:27 pm I opened up my blog for guest blogging months ago, as I was excited to have people helping out and writing good news articles. I wasn’t aware what I did back then… After about 5-10 articles, I started to realise that this is just spam and spam.
Recommended publications
  • CHAPTER 12 Making Your Web Site Mashable
    CHAPTER 12 Making Your Web Site Mashable This chapter is a guide to content producers who want to make their web sites friendly to mashups. That is, this chapter answers the question, how would you as a content producer make your digital content most effectively remixable and mashable to users and developers? Most of this book is addressed to creators of mashups who are therefore consumers of data and services. Why then should I shift in this chapter to addressing producers of data and services? Well, you have already seen aspects of APIs and web content that make it either easier or harder to remix, and you’ve seen what makes APIs easy and enjoyable to use. Showing content and data producers what would make life easier for consumers of their content provides useful guidance to service providers who might not be fully aware of what it’s like for consumers. The main audience for the book—as consumers (as opposed to producers) of services—should still find this chapter a helpful distillation of best practices for creating mashups. In some ways, this chapter is a review of Chapters 1–11 and a preview of Chapters 13–19. Chapters 1–11 prepared you for how to create mashups in general. I presented a lot of the technologies and showed how to build a reasonably sophisticated mashup with PHP and JavaScript as well as using mashup tools. Some of the discussion in this chapter will be amplified by in-depth discussions in Chapters 13–19. For example, I’ll refer to topics such as geoRSS, iCalendar, and microformats that I discuss in greater detail in those later chapters.
    [Show full text]
  • Vol-104-No-3-2012-29
    LAW LIBRARY JOURNAL Vol. 104:3 [2012-29] The Long Tail of Legal Information: Legal Reference Service in the Age of the Content Farm* R. Lee Sims** and Roberta Munoz*** The authors discuss the implications for legal reference service of a new feature of the legal information universe: the content farm. This article describes the content farm, its workings, what makes it profitable, and the market and informational forces that drive content farm creation. It also discusses how reference interactions may be altered if a patron has consulted content farm information before coming to the reference desk. A New Challenge for Law Librarians at the Reference Desk ¶1 Nonexpert legal research patrons come to the reference desk wanting answers to their questions, but they often don’t know what question to ask or how to ask it. The first encounter between patron and reference librarian frequently begins with an analysis of the question itself. It is important for the librarian to consider whether the patron has asked a question that can be answered, and whether the patron has asked the question using language that will lead him down the right path. ¶2 Now, a new type of legal information is becoming widely available—even ubiquitous—that makes this process even more challenging. Vast quantities of legal information are currently being generated by “content farms.”1 Content farm infor- mation, retrieved through search engines like Google, answers legal questions with a high degree of specificity, even when the questions posed are incomplete, poorly formed, or nonsensical. Patrons with an article from a content farm in hand may come to the reference desk assuming that someone else has posed the correct ques- tion, which they then have adopted as their own.
    [Show full text]
  • Life Sciences and the Web: a New Era for Collaboration
    Life Sciences and the web: a new era for collaboration The Harvard community has made this article openly available. Please share how this access benefits you. Your story matters Citation Sagotsky, Jonathan A., Le Zhang, Zhihui Wang, Sean Martin, and Thomas S. Deisboeck. 2008. Life Sciences and the web: a new era for collaboration. Molecular Systems Biology 4: 201. Published Version doi:10.1038/msb.2008.39 Citable link http://nrs.harvard.edu/urn-3:HUL.InstRepos:4874800 Terms of Use This article was downloaded from Harvard University’s DASH repository, and is made available under the terms and conditions applicable to Other Posted Material, as set forth at http:// nrs.harvard.edu/urn-3:HUL.InstRepos:dash.current.terms-of- use#LAA Molecular Systems Biology 4; Article number 201; doi:10.1038/msb.2008.39 Citation: Molecular Systems Biology 4:201 & 2008 EMBO and Nature Publishing Group All rights reserved 1744-4292/08 www.molecularsystemsbiology.com PERSPECTIVE Life Sciences and the web: a new era for collaboration Jonathan A Sagotsky1, Le Zhang1, Zhihui Wang1, Sean Martin2 inaccuracies per article compared to Encyclopedia Britannica’s and Thomas S Deisboeck1,* 2.92 errors. Although measures have been taken to improve the editorial process, accuracy and completeness remain valid 1 Complex Biosystems Modeling Laboratory, Harvard-MIT (HST) Athinoula concerns. Perhaps the issue is one in which it has become A Martinos Center for Biomedical Imaging, Massachusetts General Hospital, difficult to establish exactly what has actually been peer Charlestown, MA, USA and reviewed and what has not, given that the low cost of digital 2 Cambridge Semantics Inc., Cambridge, MA, USA publishing on the web has led to an explosive amount * Corresponding author.
    [Show full text]
  • Quality Spine Care
    Quality Spine Care Healthcare Systems, Quality Reporting, and Risk Adjustment John Ratliff Todd J. Albert Joseph Cheng Jack Knightly Editors 123 Quality Spine Care John Ratliff • Todd J. Albert Joseph Cheng • Jack Knightly Editors Quality Spine Care Healthcare Systems, Quality Reporting, and Risk Adjustment Editors John Ratliff Todd J. Albert Department of Neurosurgery Hospital for Special Surgery Stanford University New York, NY Stanford, CA USA USA Jack Knightly Joseph Cheng Atlantic Neurosurgical Specialists University of Cincinnati Morristown, NJ Cincinnati, OH USA USA ISBN 978-3-319-97989-2 ISBN 978-3-319-97990-8 (eBook) https://doi.org/10.1007/978-3-319-97990-8 Library of Congress Control Number: 2018957706 © Springer Nature Switzerland AG 2019 This work is subject to copyright. All rights are reserved by the Publisher, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission or information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now known or hereafter developed. The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use. The publisher, the authors, and the editors are safe to assume that the advice and information in this book are believed to be true and accurate at the date of publication. Neither the publisher nor the authors or the editors give a warranty, express or implied, with respect to the material contained herein or for any errors or omissions that may have been made.
    [Show full text]
  • Uncovering Social Network Sybils in the Wild
    Uncovering Social Network Sybils in the Wild ZHI YANG, Peking University 2 CHRISTO WILSON, University of California, Santa Barbara XIAO WANG, Peking University TINGTING GAO,RenrenInc. BEN Y. ZHAO, University of California, Santa Barbara YAFEI DAI, Peking University Sybil accounts are fake identities created to unfairly increase the power or resources of a single malicious user. Researchers have long known about the existence of Sybil accounts in online communities such as file-sharing systems, but they have not been able to perform large-scale measurements to detect them or measure their activities. In this article, we describe our efforts to detect, characterize, and understand Sybil account activity in the Renren Online Social Network (OSN). We use ground truth provided by Renren Inc. to build measurement-based Sybil detectors and deploy them on Renren to detect more than 100,000 Sybil accounts. Using our full dataset of 650,000 Sybils, we examine several aspects of Sybil behavior. First, we study their link creation behavior and find that contrary to prior conjecture, Sybils in OSNs do not form tight-knit communities. Next, we examine the fine-grained behaviors of Sybils on Renren using clickstream data. Third, we investigate behind-the-scenes collusion between large groups of Sybils. Our results reveal that Sybils with no explicit social ties still act in concert to launch attacks. Finally, we investigate enhanced techniques to identify stealthy Sybils. In summary, our study advances the understanding of Sybil behavior on OSNs and shows that Sybils can effectively avoid existing community-based Sybil detectors. We hope that our results will foster new research on Sybil detection that is based on novel types of Sybil features.
    [Show full text]
  • The Ultimate to Digital Marketing in a Data-Driven
    THE ULTIMATE GUIDETO DIGITAL MARKETING IN A DATA-DRIVEN WORLD THE RISE AND RISE OF DIGITAL MARKETING IN A DATA DRIVEN WORLD HOW THE BEST OF THE BEST APPROACH DIGITAL MARKETING DEFINING THE BEST PRACTICE FOR YOUR ORGANISATION HOW DIGITAL MARKETING FITS INTO THE MARKETING MIX The Ultimate Guide to Digital in a Data-driven World 1 Achieving your goals needs marketing and business pulling together. In the digital marketplace the customer is in the driving For more than a decade TrinityP3 has been working seat. Organisations are talking and moving to a customer with many major advertisers aligning their agency roster centric strategy delivered through data-driven customer to their business strategy. In the past five years we have digital engagement. increasingly been extending this strategic alignment process within the organisation too. But achieving a successful customer centric strategy and a successful customer experience requires more To find out how you can achieve the results you need, than just intentions. In most organisations there is a be it integrating digital and data-driven strategy into the requirements to make structural and process changes as marketing process, or aligning the customer experience well, from within the organisation, through marketing to brand, or transforming your company to a customer and right through to the external agencies centric organisation, TrinityP3 has the process and and suppliers. expertise to deliver. Go to trinityP3.com for more information From the EDITOR Can anyone remember what they were doing in 1989? Well for me, it was my first year of university studying a Bachelor of Economics Degree at The University of Sydney.
    [Show full text]
  • Svms for the Blogosphere: Blog Identification and Splog Detection
    SVMs for the Blogosphere: Blog Identification and Splog Detection Pranam Kolari∗, Tim Finin and Anupam Joshi University of Maryland, Baltimore County Baltimore MD {kolari1, finin, joshi}@umbc.edu Abstract Most blog search engines identify blogs and index con- tent based on update pings received from ping servers1 or Weblogs, or blogs have become an important new way directly from blogs, or through crawling blog directories to publish information, engage in discussions and form communities. The increasing popularity of blogs has and blog hosting services. To increase their coverage, blog given rise to search and analysis engines focusing on the search engines continue to crawl the Web to discover, iden- “blogosphere”. A key requirement of such systems is to tify and index blogs. This enables staying ahead of compe- identify blogs as they crawl the Web. While this ensures tition in a domain where “size does matter”. Even if a web that only blogs are indexed, blog search engines are also crawl is inessential for blog search engines, it is still pos- often overwhelmed by spam blogs (splogs). Splogs not sible that processed update pings are from non-blogs. This only incur computational overheads but also reduce user requires that the source of the pings need to be verified as a satisfaction. In this paper we first describe experimental blog prior to indexing content.2 results of blog identification using Support Vector Ma- In the first part of this paper we address blog identifica- chines (SVM). We compare results of using different feature sets and introduce new features for blog iden- tion by experimenting with different feature sets.
    [Show full text]
  • Elie Bursztein, Baptiste Gourdin, John Mitchell Stanford University & LSV-ENS Cachan
    Talkback: Reclaiming the Blogsphere Elie Bursztein, Baptiste Gourdin, John Mitchell Stanford University & LSV-ENS Cachan 1 What is a blog ? • A Blog ("Web log") is a site, usually maintained by an individual with • Regular entries • Commentary • LinkBack • Entries displayed in reverse-chronological order. http://elie.im/blog Elie Bursztein, Baptiste Gourdin, John Mitchell TalkBack: reclaiming the blogosphere from spammer http://ly.tl/p21 Key Statistics • 184 Millions blogs • 73% of users read blogs • 50% post comments universalmccann Elie Bursztein, Baptiste Gourdin, John Mitchell TalkBack: reclaiming the blogosphere from spammer http://ly.tl/p21 Anatomy of a blog post Elie Bursztein, Baptiste Gourdin, John Mitchell TalkBack: reclaiming the blogosphere from spammer http://ly.tl/p21 Why blogs are special ? User Elie Bursztein, Baptiste Gourdin, John Mitchell TalkBack: reclaiming the blogosphere from spammer http://ly.tl/p21 Why blogs are special ? User Elie Bursztein, Baptiste Gourdin, John Mitchell TalkBack: reclaiming the blogosphere from spammer http://ly.tl/p21 What is a TrackBack ? Elie Bursztein, Baptiste Gourdin, John Mitchell TalkBack: reclaiming the blogosphere from spammer http://ly.tl/p21 Trackback Illustrated Little Timmy said to me... "What's Trackback, Daddy?" "Wow! Jimmy Lightning has written the best 1. post ever! It's so funny! And it's true! That's "Best Post Ever" why it's so good. I need to tell the world!" "Check it out world! I've "Jimmy written all about Jimmy 2. Lightning is Lightning's post on my Elie Bursztein, Baptiste Gourdin, John Mitchell swell"TalkBack: reclaiming the blogosphere from spammerweblog. My weblog's http://ly.tl/p21 called 'The Unbloggable Blogness of Blogging'.
    [Show full text]
  • Lapland UAS Thesis Template
    DELIVERING USER-FRIENDLY EXPERIENCES BY SEARCH ENGINE OPTIMIZATION Lian Heng Bachelor’s Thesis School of Business and Culture Degree Programme in Business Information Technology Bachelor of Business Administration 2014 Abstract of thesis School of Business and Culture Degree Programme in Business Information Technology Author Heng Lian Year 2014 Supervisor Tuomo Lindholm Commissioned by N/A Title of thesis Delivering User-Friendly Experiences by Search Engine Optimization No. of pages + app. 60 The objective of this research is to study different and recent Google algorithms to encourage the uses of White Hat search engine optimization in order to cre- ate a user-friendly experience on websites. Moreover, reasons for not using Black Hat search engine optimization are justified. This research focuses on Google search engine, specifically on the search en- gine optimization principles and its benefits to deliver user-friendly experiences. The exploratory research approach was adopted in this thesis research due to the formulation of the research questions. Descriptive method was also utilized as a secondary research method to enhance the findings. To ensure the validity of this thesis research, the sources drawn from are more recent than from year 2007. On the basis of research analyses and findings, this thesis research proposes different techniques and principles to be applied to webmaster’s websites in order to create websites acceptable by Google standards, more importantly us- er-friendly experiences. Keywords: user-friendly experience, SEO, Google algorithm, Google Panda, PageRank, Google Penguin, Page Layout, CONTENT ABSTRACT FIGURES 1 INTRODUCTION ............................................................................................ 5 1.1 Motivation and background .................................................................... 5 1.2 Objectives .............................................................................................. 6 1.3 Structure of the thesis ...........................................................................
    [Show full text]
  • By Nilesh Bansal a Thesis Submitted in Conformity with the Requirements
    ONLINE ANALYSIS OF HIGH-VOLUME SOCIAL TEXT STEAMS by Nilesh Bansal A thesis submitted in conformity with the requirements for the degree of Doctor of Philosophy Graduate Department of Computer Science University of Toronto ⃝c Copyright 2013 by Nilesh Bansal Abstract Online Analysis of High-Volume Social Text Steams Nilesh Bansal Doctor of Philosophy Graduate Department of Computer Science University of Toronto 2013 Social media is one of the most disruptive developments of the past decade. The impact of this information revolution has been fundamental on our society. Information dissemination has never been cheaper and users are increasingly connected with each other. The line between content producers and consumers is blurred, leaving us with abundance of data produced in real-time by users around the world on multitude of topics. In this thesis we study techniques to aid an analyst in uncovering insights from this new media form which is modeled as a high volume social text stream. The aim is to develop practical algorithms with focus on the ability to scale, amenability to reliable operation, usability, and ease of implementation. Our work lies at the intersection of building large scale real world systems and developing theoretical foundation to support the same. We identify three key predicates to enable online methods for analysis of social data, namely : • Persistent Chatter Discovery to explore topics discussed over a period of time, • Cross-referencing Media Sources to initiate analysis using a document as the query, and • Contributor Understanding to create aggregate expertise and topic summaries of authors contributing online. The thesis defines each of the predicates in detail and covers proposed techniques, their practical applicability, and detailed experimental results to establish accuracy and scalability for each of the three predicates.
    [Show full text]
  • Blogosphere: Research Issues, Tools, and Applications
    Blogosphere: Research Issues, Tools, and Applications Nitin Agarwal Huan Liu Computer Science and Engineering Department Arizona State University Tempe, AZ 85287 fNitin.Agarwal.2, [email protected] ABSTRACT ging. Acknowledging this fact, Times has named \You" as the person of the year 2006. This has created a consider- Weblogs, or Blogs, have facilitated people to express their able shift in the way information is assimilated by the indi- thoughts, voice their opinions, and share their experiences viduals. This paradigm shift can be attributed to the low and ideas. Individuals experience a sense of community, a barrier to publication and open standards of content genera- feeling of belonging, a bonding that members matter to one tion services like blogs, wikis, collaborative annotation, etc. another and their niche needs will be met through online These services have allowed the mass to contribute and edit interactions. Its open standards and low barrier to publi- articles publicly. Giving access to the mass to contribute cation have transformed information consumers to produc- or edit has also increased collaboration among the people ers. This has created a plethora of open-source intelligence, unlike previously where there was no collaboration as the or \collective wisdom" that acts as the storehouse of over- access to the content was limited to a chosen few. Increased whelming amounts of knowledge about the members, their collaboration has developed collective wisdom on the Inter- environment and the symbiosis between them. Nonetheless, net. \We the media" [21], is a phenomenon named by Dan vast amounts of this knowledge still remain to be discovered Gillmor: a world in which \the former audience", not a few and exploited in its suitable way.
    [Show full text]
  • Web Spam Taxonomy
    Web Spam Taxonomy Zolt´an Gy¨ongyi Hector Garcia-Molina Computer Science Department Computer Science Department Stanford University Stanford University [email protected] [email protected] Abstract techniques, but as far as we know, they still lack a fully effective set of tools for combating it. We believe Web spamming refers to actions intended to mislead that the first step in combating spam is understanding search engines into ranking some pages higher than it, that is, analyzing the techniques the spammers use they deserve. Recently, the amount of web spam has in- to mislead search engines. A proper understanding of creased dramatically, leading to a degradation of search spamming can then guide the development of appro- results. This paper presents a comprehensive taxon- priate countermeasures. omy of current spamming techniques, which we believe To that end, in this paper we organize web spam- can help in developing appropriate countermeasures. ming techniques into a taxonomy that can provide a framework for combating spam. We also provide an overview of published statistics about web spam to un- 1 Introduction derline the magnitude of the problem. There have been brief discussions of spam in the sci- As more and more people rely on the wealth of informa- entific literature [3, 6, 12]. One can also find details for tion available online, increased exposure on the World several specific techniques on the Web itself (e.g., [11]). Wide Web may yield significant financial gains for in- Nevertheless, we believe that this paper offers the first dividuals or organizations. Most frequently, search en- comprehensive taxonomy of all important spamming gines are the entryways to the Web; that is why some techniques known to date.
    [Show full text]