Web Spam, Propaganda and Trust

Panagiotis T. Metaxas, Wellesley College, Wellesley, MA 02481, USA
Joseph DeStefano, College of the Holy Cross, Worcester, MA 01610, USA

ABSTRACT

Web spamming, the practice of introducing artificial text and links into web pages to affect the results of searches, has been recognized as a major problem for search engines. It is also a serious problem for users because they are not aware of it and they tend to confuse trusting the search engine with trusting the results of a search [16]. The parallels between web spamming on the internet and propaganda in the real world suggest that we can use anti-propaganda techniques to educate users and develop tools to help them evaluate the reliability of the information they find online.

In this paper, we first analyze the effects that web spam has on the evolution of the search engines and their relationship to propagandistic techniques in society. Then, we examine the neighborhoods of untrustworthy sites, finding that a dense biconnected component (BCC) containing the site provides a reasonable trust neighborhood that has parallels in social network theory. The fact that spammers employ propagandistic techniques enables us to design a heuristic that follows anti-propagandistic practices in order to recognize a spamming network. In society, recognition of an untrustworthy message (in the opinion of a particular person or other social entity) is a reason for questioning the entities that recommend the message. Entities that are found to strongly support more untrustworthy messages become untrustworthy themselves. So, social distrust is propagated backwards for a number of steps. Our heuristic simulates this behavior on the trust neighborhood of a spammer.

In our experiments, we examined trust neighborhoods of web sites, both trustworthy and not. Our findings suggest that spamming networks can be reliably recognized from their relationship to a single untrustworthy starting point by backward propagation of distrust. Further, nodes involved in a spamming network can be divided into two groups: those that have content similar to the starting site (aka “link farms”), and those that have dissimilar content (aka “mutual admiration societies”). Our tool explores thousands of nodes within minutes and could be deployed at the browser level, making it possible to resolve the moral question of who should be making the decision of weeding out spammers in favor of the end user.

Categories and Subject Descriptors
H.3.3 [Information Storage and Retrieval]: Information Search and Retrieval; H.3.m [Information Storage and Retrieval]: Miscellaneous

General Terms
Algorithms, Experimentation, Social Networks, Propaganda, Trust

Keywords
search, Web graph, link structure, PageRank, HITS, Web spam

Copyright is held by the author/owner(s).
WWW2005, May 10–14, 2005, Chiba, Japan.

1. INTRODUCTION

The web has changed the way we inform and get informed. Every organization has a web site and people are increasingly comfortable accessing it for information for any question they may have. The exploding size of the web necessitated the development of search engines and web directories. Most people with online access use a search engine to get informed and make decisions that may have medical, financial, cultural, political, security or other important implications [10, 37, 23, 29]. Moreover, 85% of the time, people do not look past the first ten results returned by the search engine [35]. Given this, it is not surprising that anyone with a web presence struggles for a place in the top ten positions of relevant web search results. The importance of the top-10 placement has given birth to a new industry, which claims to sell know-how for prominent placement in search results and includes companies, publications, and even conferences. Some of them are willing to bend the truth in order to fool the search engines and their customers, by creating web pages containing web spam.

Web spamming is the practice of manipulating web pages in order to cause search engines to rank some web pages higher than they would without any manipulation.¹ The motive is usually commercial, but can also be political or religious.

¹ We should mention here that there is not a complete agreement on the definition of web spam among authors, which leads to some confusion. Moreover, to people unfamiliar with web spam, the term is mistaken for email spam. A more descriptive name for it would be “search engine ranking manipulation” or “adversarial information retrieval”.

The creators of web spam are often specialized companies selling their expertise as a service, but can also be the web masters of the companies and organizations that would be their customers. Spammers attack search engines through text and link manipulations [22, 18]:

• Text spam: This includes excessively repeating text and/or adding irrelevant text on the page that will cause incorrect calculation of page relevance; adding misleading meta-keywords or irrelevant “anchor text” that will cause incorrect application of rank heuristics.

• Link spam: This technique aims to change the perceived structure of the webgraph in order to cause incorrect calculation of page reputation. Such examples are the so-called “link-farms”, “mutual admiration societies”, page “awards”, domain flooding (plethora of domains that re-direct to a target site), etc.

Both kinds of spam aim to boost the ranking of spammed web pages. Sometimes cloaking is included as a third spamming technique [22, 19]. Cloaking aims to serve different pages to search engine robots and to web browsers (users). These pages could be created statically or dynamically. Static pages, for example, may employ hidden links and/or hidden text with colors or small font sizes noticeable by a crawler but not by a human. Dynamic pages might change content on the fly depending on the visitor, fake the clickstream or query stream, submit millions of pages to “add-URL” forms of search engines, etc. We consider the false links and text themselves to be the spam, while, strictly speaking, cloaking is a tool that helps spammers hide their attacks.

Since anyone can be an author on the web, these practices have naturally created a question of information reliability. An audience used to trusting the written word of newspapers and books is unable, unprepared or unwilling to think critically about the information obtained from the web. In a recent study [16] we found that while college students regard the web as a primary source of information, many do not check more than a single source, and have trouble recognizing trustworthy sources online. In particular, two out of three students are consistently unable to differentiate between facts and advertising claims, even “infomercials.” At the same time, they have considerable confidence in their abilities to distinguish trustworthy sites from non-trustworthy ones, especially when they feel technically competent. We have no reason to believe that the general public will perform any better than well-educated students. In fact, a recent analysis of internet related fraud by a major Wall Street law firm [10] put the blame squarely on the investors for the success of stock fraud cases.

One of the reasons behind the users’ difficulty in distinguishing trustworthy from untrustworthy information comes from the success that both search engines and spammers have enjoyed in the last decade. Users have come to trust search engines as a means of finding information, and spammers have successfully managed to get them to transfer that trust to the results of the search. There is clearly a need for education of the users, so that people develop a healthy suspicion of unverified search results. Beyond that, though, there is a need for browser-level tools that will help the user move from suspicion to decision in determining which sites to trust.

From their side, the search engines have struggled to deliver spam-free results and have developed sophisticated search result ranking strategies. Two such ranking strategies that have received major attention are the PageRank [6, 2] and HITS [26] algorithms. Achieving high PageRank has become a sort of obsession for many companies’ IT departments, and the raison d’être of spamming companies. Some estimates indicate that at least 8% of all pages indexed are spam [12], while experts consider web spamming the single most difficult challenge web searching is facing today [22].

In this paper we first examine the reasons web spamming has been so successful and its relationship to social propaganda. Then, we develop heuristics that are able to recognize web neighborhoods, especially untrustworthy ones. We present experimental results that show considerable success in recognizing spamming neighborhoods. Finally, we discuss what we believe should be a frame for the long-term approach to web spam.

2. THE WEBGRAPH AS A SOCIAL NETWORK

The web is typically represented by a directed graph [8]. The nodes in the webgraph are the pages (or sites) that reside on servers on the internet. Arcs correspond to hyperlinks that appear on web pages (or sites). The theory of social networks [38] also uses directed graphs to represent relationships between social entities. The nodes (called “actors”) correspond to social entities (e.g., people, institutions, ideas). Arcs (called “ties”) correspond to social relations between the entities they connect (e.g., has influence on, knows, trusts).

This connection is more than a similarity in descriptions. The web itself is a social creation, and both PageRank and HITS are socially inspired algorithms [6, 26]. Socially inspired systems are subject to socially inspired attacks, however. Not surprisingly then, the theory of propaganda [28] can provide intuition into the dynamics of the web.

For example, PageRank is based on the assumption that the reputation of an entity (a web site in this case) can be measured as a function of both the number and reputation of other entities recommending it. A link to a web page is counted as a “vote of confidence” to this web site, and in turn, the reputation of a page is divided among those it is recommending [6]. Since HTML does not provide for “positive” and “negative” links, all links are taken as positive. This is not always true, but is considered a reasonable assumption.

More importantly, there is also the implicit assumption that hyperlink “voting” is taking place independently, without prior agreement or central control. Spammers, like social propagandists, form groups of sites that are able to gather a large number of such “votes of confidence” by design, thus breaking the assumption of independence in a hyperlink. Search engines consider such moves spam, and would like to restrict them, but there can be no algorithm that can recognize spamming sites automatically based on graph isomorphism [5].

3. EVOLUTION OF THE SEARCH ENGINES

In the early 90’s, when the web numbered just a few million servers, the first generation search engines were ranking search results using classic information retrieval techniques: the more rare words two documents share, the more similar they are considered to be [34, 21]. A search query Q is simply a short document and the results of a search for Q are ranked according to their (normalized) similarity to the query, which was treated as the value of the page.

The first attack on this “tf.idf ranking,” as it is known, came from within the search engines. Around 1995, search engines started selling search keywords to advertisers as a way of generating revenue: if a search query contained a “sold” keyword, the results would include targeted advertisement and a higher ranking for the link to the sponsor’s web site. This is the first time we have a socially inspired ranking, which follows marketing practices of the real world.

Mixing search results with paid advertisement raised serious ethical questions, but also showed the way to financial profits to spammers, who started their own attacks by creating pages containing many rare keywords to obtain a higher ranking score. In terms of propaganda theory, the spammers employed a variation of the technique of glittering generalities to confuse the first generation search engines [28, 47]. The propagandist associates one or more suggestive words without evidence to alter the conceived value of a person or idea. To avoid spammers (and public embarrassment from the keyword selling practice), search engines would keep secret their exact ranking algorithm. Secrecy is no defense, however, since secret rules can be figured out by experimentation and reverse engineering (e.g., [33, 30]).

Second generation search engines started employing more sophisticated ranking techniques in an effort to nullify the effects of glittering generalities. One of the more successful ones was based on the “link voting principle”: each web site s has value equal to its “popularity”, which is influenced by the set B_s of sites pointing to site s. Lycos became the champion of this ranking technique and had its own popularity skyrocket around 1996 [31]. Doing so, it was also distancing itself from the ethical questions introduced by combining advertising with ranking.

Unfortunately, this ranking method did not succeed in stopping spammers either. Spammers started creating clusters of interconnected web sites that had identical or similar contents with the site they were promoting, which subsequently became known as “link farms” (LF). The link voting principle was socially inspired, so spammers used the well known propagandistic method of bandwagon to circumvent it [28, 105]. Using this technique, the propagandist is promoting the impression of a high degree of recommendation by inter-linking many internally controlled sites that will eventually all share high ranking.

The introduction of PageRank in 1998 was a major development for search engines, because it seemed to provide a more sophisticated anti-spamming solution to the bandwagon technique. Under PageRank, not every link contributes equally to the reputation of a page. Instead, links from highly reputable pages contribute much more than links from other sites. That way, the site networks developed by spammers would not much influence their PageRank, and Google became the search engine of choice. A page p has value equal to its reputation R(p), which is calculated as the sum of fractions of the reputations of the set B_p of pages pointing to p. HITS is another socially-inspired ranking which has also received a lot of attention [26]. The HITS algorithm divides the sites related to a query between “hubs” and “authorities”. Hubs are sites that contain many links to authorities, while authorities are sites pointed to by the hubs. (This circular definition can be resolved.)

PageRank and HITS marked the development of the third generation.² Unfortunately, spammers have again found ways of circumventing PageRank. In PageRank, a page enjoys some “absolute reputation”, that is, its reputation is not restricted to some particular issue. So, spammers develop sites with expertise on irrelevant subjects, and they justifiably acquire high ranking on their expert sites. Then they interlink their networked sites with the expert sites, creating what is called a “mutual admiration society” (MAS), causing all sites to share a higher PageRank, and the search engine is fooled. This is the well-known propagandistic technique known as testimonials, where well known people (entertainers, public figures, etc.) offer their opinion on issues about which they are not experts [28, 74]. HITS has also been shown to be highly spammable by this technique [25] due to the fact that its effectiveness depends on the accuracy of the initial neighborhood calculation.

² [7] considers the search engines in our 2nd and 3rd generation to be in the same group. We believe that both the ranking and attack methods put them in different categories.

The table below summarizes our findings for the first three generations of search engines and the correspondence between web spam and social propaganda.

SE        Ranking             Spamming                   Propaganda
1st Gen   Doc similarity      keyword stuffing           glittering generalities
2nd Gen   + Site popularity   + link farms               + bandwagon
3rd Gen   + Page reputation   + mutual admiration soc.   + testimonials

Web search corporations are reportedly busy developing the engines of the next generation [7]. The new search engines hope to be able to recognize the need behind the query of the user. Given the success the spammers have enjoyed so far, one wonders how they will spam the fourth generation engines. Is it possible to create a ranking that is not spammable? Put another way, can the web as a social space be free of propaganda? Seen in this light, it appears that we are trying to create in cyberspace what human societies have not succeeded in creating in their social space. However, as in society, we can learn to live successfully with propaganda, given appropriate education and technology.

4. EXPLORING WEB NEIGHBORHOODS

Since spammers employ propagandistic techniques, as we have argued above, it makes sense to design anti-propagandistic methods for defending against them. These methods need to be user-guided. Propaganda, after all, does not always have a negative connotation [11]. Advertisement is a form of propaganda that we have all learned to live with. The “art of persuasion” is objectionable mainly when it is used to promote an untrustworthy message according to the receiver’s opinion. When such an untrustworthy message is detected, it becomes a reason for us to reconsider the messenger. Messengers who strongly support an untrustworthy message become untrustworthy themselves. This process is selectively repeated for a few steps, propagating the distrust of the original back to those who show support for it. The results of this process become part of the user’s belief system and are used to filter future information.

Propagation of distrust contrasts with the propagation of trust in that it progresses backwards through the graph, i.e., a page’s reliability decreases if it links to untrustworthy sites. Current algorithms for ranking pages propagate trust forwards, i.e., a page’s rank is increased if a trustworthy site links to it. The heuristic in this paper focuses solely

0.22_http://www.freebiedot.com/ 0.334_http://www.metaxa.net/

0.277_http://www.pregnancy-symptoms.org/

0.184_http://www.sportsco.com/

0.213_http://www.mycreativekeepsakes.com/ 0.278_0h.0tt9p_:/0h/w.t2twp5:w8//_w.bhwettwstp:.n/o/0ewh.w1iwo5cawre7.rp_acahlrietsh0tcestp.1a:-a/.5/cotwed_hwvahm/0otwt.menp2.ce:ca2//4s.w-psh_cowhaawtydm/tdp.eva:a/-a/y-lwnuwceoto0wa1-i.2n.2pn3s.3hsu8t.ev/co_ranhtm/enttprmice://.wconewm/-stw.o4rehg.coh.wm/s/ 0.1650_.h1t8t0p7.:_1//wh7t8wtp_w:h//.twcltpwe:/aw/wn.mow-cre0.w2d.1tdeh6ilet_s-ce.0uihd.s/1totp9ll.:e2co//b_wrihm/wty.twpp:./lo/owstawnewswrs-o.liafeny.blcoironem/ke.br.iz/com.au/ 0.303_0h.t3tp2:4//_whwt0twp.1:./17/w23w_4hwyot0t.pl.i2uq:/0rh/uw2idew_-viahwlt0.thnpa..3a:mico/1/twu_nm/rawhs-4twltsup.ua:/n./tcowkiflownm/ws-dwe.pr.iecotru-nm/vioawn.co-mam/ca.com/ 0.191_http://www0.p.2e2rso5_nhattlp-l0:o/./a1wn2ws-w6w_.hkiithttpch-g://eownows-udw-cre.k.teorm4drgit..coucak/m/.com/ 0.205_0h.2tt0p4:/_/w0h.wt2tpw5:/6./dw_lhiwntwtks.p.:j/in/m-dwewt/wid.-iatve.conm/ue-22.com/ 0.193_0h.t1tp7:2//w_hw0t.tw1p9.:d/9/we_nwhtatwtpl.-io:/n/nwsuliwnraew-cren.bcerigd-thiott-red-tarapy.iancoiir-onm/gf-saficefe.coty-wm/ear.com/ 0.222_http://0w.1w6w01.._1h-h5t9tap_i:r-l/h/wtotpwss.:/w/wco.iwm/wr.co.hgm/h-doctor.org/ 0.162_h0t.t2p4:/8/w_whtwtp.:h//ewawltwh-i.pnhsuenratenrmice-pneo-drtaiscol.coum/nted.com/ 0.178_http:/0/w.2w4w1_.fhetmatp:/l/ew-owrgw.amy-dsm-heenltpa.lco-pm/lan.net/ 0.312_http://ww0w.2.h5a5ssl_hettfpre://ewwwewig.9hmatloss.lls.cocom/m/ 0.175_h0t.t1p7:/4/w_whtwtp.p://rowpwewci.oanulk.incoe-ci.uak/lis.co.uk/ 0.068_http://vruchtbaarheid.start.nu/ 0.216_http://www.baby-place.com/ 0.236_http://www.thetoque.net/ 0.0_http://www.nvpc.nl/ 0.002.41_3h_thtptt:p//w://ww.wxt.z.bocookmyrom/ om.com/ 0.3340_.h3t4tp3:_//hwtwtpw://.waaw1win.efoli.xicorim/ndustry.com/ 0.203_http://www.pregnancy-place.com/ 0.08.222_9h_tthpt:t/p/w://ww.ws8.co10lo.nsterut/ m-online.biz/ 0.14_h0tt.p1:2//1w_whwttp.u:/kb/wowtwox..xecon.iucak/luk.co.uk/ 0.144_h0t.t2p2://2w_whwttp.f:i/n/wdw-siwte.o.compm/lace.com/ 0.142_h0t.t1p7:/2/w_whtwtp.:u//k-rewwwdu.victailg.corag.uuk/ide.co.uk/ 0.0_http://www.pigrecoemme.com/ 
0.571_http0:/./1w6w_wht.taprt:/h/wriwtis-msm-suw.fitnesscoppnlneemectinotns.oconlm/ine.com/ 0.0530_.1h7tt_ph:/t/tdpi:re//lctinok.ry.wocynrldc.dijrep/ctory.com/ 0.15_http://www.idietnow.com/ 0.146_http://www.lilacdigital.com/ 0.227_http://www.onlydirectory.com/ 0.112_http://www.kvitamins.com/ 0.0_http://search.catcha.com.my/ 0.23_http://www.infusionbath.com/ 0.161_http://www.gaza.net/ 0.088_http://morphollica.com/ 0.25_http://hotelph.com/ 0.193_http://www.getdrugsonline.com/ 0.071_http://www.downloadbyte.com/ 0.173_http://www.idnetuk.co.uk/ 0.043_http://www.giosearch.it/ 0.156_http://www.levitrauk.co.uk/ 0.196_http://www.ctsplace.com/ 0.054_http://www.woffzig.net/ 0.143_http://www.viagrasale.co.uk/ 0.072_http://www.conk.com/ 0.274_http://www.stopbitingnails.com/ 0.248_http://www.sinotechnologies.com/ 0.236_http://www.gatewayworkshops.com/ 0.067_http0:/./0w9w1w_h.ttetple:/me/wwdwici.hnea.colthm/.xq23.com/ 0.06_http://www.lavacubed.com/ 0.211_http://actionsaunas.com/ 0.192_http://www.assistantsinfrance.com/ 0.238_http://www.surebuy.co.nz/ 0.137_http://w0w.0w51.la_bhettpxp://lworewwr.co.nem/wcreationds.com/ 0.165_http://www.it-b.hr/ 0.21_http://www.mhrw.com/ 0.171_http://www.medofficeone.com/ 0.148_http://www.farfarfar.com/ 0.319_http://www.naturalcity.com.au/ 0.057_http://www.s3arch.co.uk/ 0.151_http://www.momtomomchat.com/ 0.145_http://www.mortgageblvd.com/ 0.145_http://pacosdrivers.com/ 0.17_http://www.howcool.com/ 0.166_http://ww0.w19.fl6o_rihdtatpin://fwusiwown.l.ucov4m/adoption.com/ 0.262_http://www.appraisercentral.com/ 0.248_http://www.viapharma.net/ 0.181_http://gown0p.1ri8ce1_s.hcottpm/://www.gownprices.com/ 0.055_http://odp.adbd.com/ 0.101_http://www.brightlifetime.com/ 0.125_http://www.prevent.org/ 0.0_http://www.ay-up.com/ 0.227_http://www.permanentcosmeticsbytheresa.com/ 0.177_http://www.babygiftbox.co.uk/ 0.04_http://vittle.net/ 0.124_http://www.ebotek.com/ 0.101_http://www.hanktheknife.com/ 0.156_http://www.hi-schoolpharmacy.com/ 
0.192_http://bestpricechaser.com/ 0.18_http://www.vending4u.com/ 0.105_http://www.cholestech.com/ 0.254_http://www.sam-e-direct.com/ 0.196_http://securityselfstorage.us/ 0.276_http://www.sfgate.com/ 0.214_http://www.energycenter.com/ 0.206_http://www.spongebobparty.com/ 0.244_http://www.myportwinestain.com/ 0.175_http://www.express-auto-loan.com/ 0.226_http://www.fortune.com/ 0.155_http://www.healingnetwork.co.uk/ 0.35_http://www.health-happiness.com/ 0.29_http://www.theworldoftantra.com/ 0.151_http://www.dirtch0.e1a7p4st_hritntpg:s.//wcowm/w.15yearloans.com/ 0.249_http://www.sciencekomm.at/ 0.297_http://www.accelerate-weight-loss.com/ 0.042_http://www.searchsmarter.co.uk/ 0.24_http://www.fredysnet.com/ 0.257_http://www.10dollarleads.com/ 0.038_http://www.laxos.de/ 0.259_http://www.investbio.com/ 0.255_http://www.egs.com.au/ 0.0_http://www.executiveselect.com/ 0.176_http://nodiet4me.net/ 0.215_http://www.southtylerrotary.org/ 0.227_http://www.diabetesincontrol.com/ 0.22_http://www.wefixitnow.com/ 0.207_http://www.edrugstore-guide.com/ 0.033_http://www.breast-enhancement-by-city.com/ 0.13_http://mamapriestess.com/ 0.211_http://e-drugstore-guide.com/ 0.304_http://www.safemedco.com/ 0.246_http://www.hypnosistapes-and-cds.com/ 0.155_http://www.gregsushinsky.com/ 0.232_http://www.supersalve.com/ 0.301_http://www.ecarquotes.com/ 0.149_http://www.officeresort.com/ 0.299_http://revbilly.com/ 0.213_http://www.xlpharmacy.com/ 0.0_http://www.onefinedaybridal.com/ 0.129_http://www.makkelijkafvallen.nl/ 0.275_http://www.benefit-physical-fitness.com/ 0.137_http://www.30yearloans.com/ 0.385_http://vitaminmen.com/ 0.257_http://www.dressprices.com/ 0.166_h0t.t1p6:/6/d_rehtssptp://riwcews.wco.onm/efineday.cc/ 0.295_http://nyw-diet-supplements.com/ 0.22_http://www.vaniqa-cheap.com/ 0.35_http://www.iweightloss.com/ 0.146_http://international-pharmacies.com/ 0.319_http://www.lfataichi.com/ 0.082_http://webportal.smsfactory.no/ 0.3_http://www.ginkgo-biloba-online.com/ 
0.273_http://www.foret-privee.qc.ca/ 0.231_http://www.viagra-kwik.com/ 0.259_http://www.metabolismbalance2.com/ 0.331_http://www.nutritional-supplements-liquid-vitamins.com/ 0.198_http://www.ecari.net.ms/ 0.193_http://www.vitamin-supplements-store.net/ 0.416_http://www.health1nutrition.com/ 0.253_http://www.webportal.com.my/ 0.226_http://www.7th-street-pharmacy.com/ 0.239_http://www.specialednews.com/ 0.194_http://www.baremakeup.com/ 0.05_http://www.komitel.org/ 0.057_http://www.mancaversa.it/ 0.213_http://www.uufys.org/ 0.172_http://www.troononline.net/ 0.092_http://mrealm.com/ 0.202_http://www.foot-care.info/ 0.0_http://tjburke.com/ 0.223_http://www.thephysioshop.com/ 0.076_http://www.stopjohnwalters.org/ 0.124_http://www.scottishreflexology.org/ 0.313_http://tripleheart.org/ 0.227_http://www.all-debt-programs.biz/ 0.135_http://www.fordhamsportslaw.org/ 0.151_http://www.diabetes-guide.org/ 0.05_http://www.outlawracing.org/ 0.11_http://www.pastconnect.com/ 0.075_http://www.uispp2001.org/ 0.097_http://www.avreskincare.com/ 0.135_http://www.ufable.org/ 0.207_http://www.dfo-professional.com/ 0.0_http://www.yestermorrow.com/ 0.221_http://www.phaze2.iwarp.com/ 0.157_http://www.sirlunchalot.com/ on distrust, but in future work we plan to investigate the 0.263_http://www.goesping.org/ 0.09_http://sexshop-romantic.ro/ 0.161_http://www.cigarettesandwater.com/ 0.0_http://www.contentfilm.com/ 0.279_http://www.edmurray.org/ 0.0_http://huminf.uib.no/ 0.034_http://www.p2pscience.org/ 0.089_http://www.philipjreed.com/ 0.079_http://www.imeplace.com/ 0.235_http://www.byo-pool.com/ 0.154_http://www.otlhelp.com/ 0.0_http://len.charest.org/ 0.171_http://members.impulse.net/ 0.084_http://www.nosightatnight.co.uk/ 0.203_http://www.magick.net/ 0.027_http://www.rexdale.net/ 0.228_http://www.smithappens.com/ 0.043_http://desertviking.nvrland.org/ 0.0_http://www.db.dk/ 0.0_http://www.andyharrismusic.com/ 0.0_http://www.thosebears.com/ 0.101_http://www.swssoftware.com/ 
0.242_http://www.dbkaraoke.com/ 0.213_http://www.nhwatergarden.com/ 0.172_http://www.lilikoicandles.com/ 0.161_http://www.tablet-pc-system.com/ 0.162_http://www.shoes-123.com/ 0.234_http://www.valtrex-2-buy.com/ 0.144_http://www.dressmenow.com/ 0.132_http://www.hitsplanet.net/ 0.207_http://www.preferred-online-casinos.com/ 0.239_http://www.vioxx-on-line.com/ 0.223_http://www.phone-cards-4-you.com/ 0.249_http://www.patanol.i-drugs.org/ 0.141_http://americanprofessional.com/ 0.224_http://www.celebrex-cheap.com/ 0.168_http://www.bippy.com/ 0.177_http://www.bestpokerhand.com/ 0.158_http://www.joeant.com/ 0.226_http://www.zyban.i-drugs.org/ 0.15_http://www.rifles.biz/ 0.258_http://www.viagra-cialis.net/ 0.204_http://www.designer-fashion-online.com/ 0.221_http://www.meridia-pills-online.com/ 0.136_http://www.millionairebombay.com/ 0.232_http://www.vigorelle.7p.com/ 0.153_http://www.searchspell.com/ 0.217_http://www.vaniqa-cheap-online.com/

0.191_http://thegreatamericanmall.com/ 0.275_http://www.propecia-cheap-on-line.com/

0.198_http://www.couponfinders.com/ 0.205_http://www.phentermine-prescription-now.com/

0.034_http://www.womensdressesbymati.com/ 0.133_http://www.dailywebdeals.com/

0.255_http://www.berda.com/ 0.246_http://www.udep.com/

0.133_http://www.1st-4-bras.com/ 0.127_http://www.best-bonus-in-casino.com/

0.033_http://myfastsearch.com/ 0.253_http://www.ortho-tricyclen-cheap.com/

0.129_http://www.allkarategear.com/ 0.233_http://www.retina-cheap-online.com/

0.118_http://www.memphismanifesto.com/ 0.083_http://gambling-online-bonus.biz/

0.083_http://www.ranas-world.com/ 0.216_http://www.adipex-on-line.com/

0.184_http://www.robynsnest.com/ 0.229_http://www.renova-on-line.com/

0.142_http://www.bostonnightguide.com/ 0.203_http://www.ionamin.i-drugs.org/

0.153_http://www.companyplans.com/ 0.239_http://www.ajtek.com/

0.109_http://www.blackbearscabin.com/ 0.238_http://www.prescription-diet-pills-online-comparison.com/

0.245_http://www.newlinepubs.com/ 0.064_http://www.wrestlegirl.com/

0.133_http://www.dsperado.com/ 0.263_http://www.exercisebands.com/

0.219_http://www.meetmart.com/ 0.203_http://www.caroltuttle.com/

0.078_http://wildnetafrica.co.za/ 0.187_http://www.crystalmountain-aromatics.com/

0.059_http://www.vincent.com.cn/ 0.362_http://www.beautyworlds.com/

0.1_http://hypertext.rmit.edu.au/ 0.314_http://www.aloe-vera-juice.org/

0.236_http://www.prepaidairtime.com/ 0.247_http://www.gothambaskets.com/

1381 0.083_http://www.the-worldwide-affiliate-network.com/ 0.152_http://envogueshoes.com/

0.119_http://www.s8a.com/ 0.291_http://www.buy-generic-celebrex-online.com/

0.217_http://jollyroger.com/ 0.249_http://www.pinatapartysupply.com/

0.063_http://www.com-web.com/ 0.05_http://www.unified-view.com/

0.152_http://www.litsoft.com/ 0.112_http://www.dbfineretail.com/

0.178_http://killdevilhill.com/ 0.228_http://www.paisleycreek.com/

0.162_http://www.nutzso.com/ 0.202_http://www.giftcds.com/

0.127_http://www.pwc.k12.nf.ca/ 0.227_http://www.look-at-this.co.uk/

0.094_http://www.bodypiercing2u.com/ 0.21_http://www.onlinevacshop.com/

0.113_http://www.waao.com/ 0.126_http://excellentdoll.com/ combination of the two. 0.027_http://www.search-them-all.com/ 0.18_http://www.purepearls.com/ 0.0_http://server.scripthost.com/ 0.291_http://www.maximumwhite.com/

[Figure: graph of web sites; each node is labeled with a numeric value and the site URL, e.g. 0.21_http://www.finditanytime.com/]

Following the social process above, we design an algorithm that follows anti-propagandistic practices in order to recognize a spamming network. Our algorithm takes as input a page that the user determined to be untrustworthy. This page could have come to the user through web search results or via the recommendation of some trusted associate (e.g., a society that the user belongs to).

• In the first stage, our algorithm follows for a few steps the back links of the authority site that contains this

0.034_http://rochemiette.ca/ 0.233_http://www.bluecellworld.com/ 0.256_http://www.whitecranejournal.com/ 0.061_http://www.eu-verschreibung.com/ 0.15_http://www.1st-in-hemorrhoid-relief.com/ 0.376_http://www.ordernatural.com/ 0.33_http://www.goodbyeherpes.com/ 0.0_http://www.allw.com/ 0.0_http://acne.enikprod.com/ 0.176_http://www.apexrx.com/ 0.291_http://cosmicmysteries.com/ 0.234_http://www.pennydial.com/ 0.198_http://www.smithindustries.com/ 0.337_http://www.neweramarketing.com/ 0.077_http://www.webspawner.com/ 0.174_http://www.online-poker.fm/ 0.289_http://www.add-adhd-help-center.com/ 0.295_http://www.americas-drugstore.com/ 0.206_http://www.astrostarcharts.com/ 0.216_http://www.voicenews.us/ 0.26_http://www.1stpromotion.com/ 0.147_http://www.drugcraze.com/ 0.172_http://www.lushnutrition.com/ 0.076_http://acne-treatment.viraltactix.com/ 0.254_http://www.magneto-therapy.com/ 0.209_http://www.medbooks.com/ 0.145_http://pages.ivillage.com/ 0.157_http://www.efutonshop.com/ 0.279_http://www.radar3.com/ 0.275_http://www.breast-enlargement-pills-4u.net/ 0.243_http://www.everythingchristmas.com.au/ 0.274_http://www.showstopperexhibits.com/ 0.222_http://123greetings.com/ 0.165_http://www.beautysurg.com/ 0.063_http://www.fitnessconnection.net/

0.265_http://www.weightloss-drugstore.com/ 0.221_http://www.mytranquil.com/ 0.055_http://www.free-games-online.co.uk/ 0.199_http://www.pain-drugstore.com/ 0.3_http://www.nonijuice.us/ 0.194_http://www.levitrahome.com/ 0.249_http://www.ilora.ru/ 0.234_http://www.weightlifting-supplements.com/ 0.219_http://www.2-buy-cheap-viagra.com/ 0.201_http://www.fit-at-home.com/ 0.217_http://www.vsssleep.com/ 0.217_http://www.musclegaintips.com/

0.203_http://www.phendimetrazine-4sale.com/ 0.459_http://www.cholesterol-metabolizer.com/ 0.219_http://www.perfectweightclass.com/ 0.199_http://www.pillsguide.net/ 0.227_http://www.viagraclinic.us/ 0.191_http://www.directory.edietstar.com/ 0.241_http://www.seatcushioncomfort.com/ 0.341_http://www.herbalbodywrap.com/ 0.203_http://www.makemoneyprinting.com/ 0.232_http://www.free-mp3-music-download-sites.com/ 0.239_http://ilora.ru/ 0.4_http://www.vitaminstohealth.com/ 0.198_http://buydietpillonline.com/ 0.309_http://www.adjustable-beds-adjustable-bed.com/ 0.458_http://www.dressingyou.com/ 0.161_http://www.stretch-mark-repair.com/ 0.154_http://www.printinglogos.com/ 0.299_http://www.cod-liver-oil.org/ 0.36_http://www.createonlinebusiness.com/ 0.371_http://www.cardtemplates.com/ 0.381_http://www.designwisconsin.com/ 0.181_http://www.allusads.com/ 0.235_http://www.prescription-medsites.com/ 0.121_http://www.links4women.com/ 0.096_http://www.findwebs.com/ 0.234_http://www.1stfamily.com/ 0.136_0h.2tt6p3:/_/whwttpw:/./jwamewwsi.condmpusturitearcal.core.uok/nline.com/ 0.115_http://www.searchpixie.com/ 0.168_ht0tp.1:/5/w1w_hwtt.pmyn://wewtmew.tedes.ncodim/gest.com/ 0.34_0h.t1tp7:4//w_hwttwp.:t/h/wewcown.wfideennzetplemenis.tacolspm/inning.com/ 0.115_http://www.dubaihotelsonline.com/ 0.207_htt0p.:1//5w7w_wht.tbpyt://afirgndeintk.innae.cotfirms.m/ com/ 0.256_h0tt.p2:5//7w_whwttp.ca://wrewwwe.bbheoststdiunbga.coihom/tels.com/ 0.2090_.2h1tt8p_:/h/wttwp:w//w.dwiscfw.lay.yucorbm/alance.com/ 0.301_http://w0.w17w1.kra_htbtpi-h://wotwelws-re.absocartn.dcolem/.com/ 0.1806.1_0h7tt_ph:/t/twp:w//ww.bwake.buraendobsaaikerer.srecan/tals.com.ar/ 0.204.128_4h_ttph:t/t/pw:/w/www.knw.id-coati.ncog-stm/artpage.com/ 0.389_http://www.smartwomensupplements.com/ 0.01.2260_3h_thtptt:p//w://ww.wci.voduc.bacoiom/rchidhotel.com/ 0.149_http://w0.w0w_h.itntpd:i/a/ka.strfairtlebmaewkeijs.r.ncol/ m/ 0.174_0h.0tt4p2:/_/whwttpw:/./ewnwewtlo.bvelo.ncodim/ebook.com/ 
page. Thus, we create a portion of the web graph that supports the starting site (its “trust graph”). In the process we also sample the contents of the sites in this subgraph to determine their similarity to the starting site’s contents.

• In the second stage the trust graph’s structure and contents are analyzed. The subgraph that heavily supports the starting site becomes suspect of spamming. In our experiments we found that the biconnected component (BCC) of the graph that includes s is such an appropriate subgraph.

We divide the sites in the BCC into two groups: those that have contents similar to s, and those that do not. The former are considered members of a link farm (LF), while the latter are considered members of a mutual admiration society (MAS). The members of the link farm are discredited, and the members of the MAS are downgraded (although this result could perhaps be user-selectable, for a less conservative filtering).

5. EXPERIMENTAL RESULTS

We define the h-trust neighborhood of some site S as the largest biconnected component containing S of the graph composed of the sites that are no more than h links away from S. In our experiments, we examined 3-trust neighborhoods of web sites, both trustworthy and not. We already know that the neighborhoods of spammers and non-spammers cannot be distinguished simply on graph-theoretic terms [14, 5]. They can, however, be recognized from an untrustworthy starting point by backward propagation of distrust.

To evaluate the trustworthiness of each site we had an evaluator look at the sites of the BCC. A site was then determined to be either Trustworthy, Untrustworthy, or Non-determined. The last category includes a variety of sites for which the evaluator could not make a clear determination due to the language used in the site, the subject matter, or the fact that a Blog or Directory cannot simply fall into one of the U/T categories.

It is perhaps valuable to reiterate here the fact that we are considering trustworthiness to be a personal decision, not an absolute quality of a site. One person’s gospel is another’s political propaganda, and our goal is to find tools that help individuals make more informed decisions about the quality of the information they find on the web.

In selecting the theme to evaluate, we considered a number of commercial, political, medical and financial issues. For the purpose of this paper we focused on a subject that, according to the evaluator’s opinion, was clearly untrustworthy. In particular, the evaluator decided that he does not believe that a product for $39.95 would significantly increase muscle mass without increased exercise, decrease fat without change in diet or habits, enhance sexual performance, increase the good cholesterol while decreasing the bad, regrow hair, decrease blood pressure, remove wrinkles, and increase memory retention. In our experiments, we examined the neighborhoods of six such sites, as well as two sites judged to be trustworthy, labeled below as U-1 to U-6 and T-1 to T-2, respectively.

Figure 1 is a picture of a successful spammer U-1. The experiment revealed a neighborhood of 1380 sites, 266 of which were connected with 593 edges into a BCC forming the 3-trust neighborhood of the target site. The density and the size of the BCC are comparable to those of some of the larger BCCs of sites that openly promote “reciprocal linking”.

[Figure 1: The BCC of target site U-1.]

The algorithm we outlined in the previous section would be very difficult to implement completely and efficiently at the browser side. We convert it to a heuristic that can be implemented on an average workstation and produce results within minutes. The goal of implementing a quick algorithm introduces a number of simplifications explained below.

To gather the back links of a site we use the Google API [15]. Still, some sites can have thousands of back links while others have only a few. For this experiment, we limited the number of backlinks for each site to 30. We call this parameter the backlink fan.

Determining the similarity of a site to the starting site was done by sampling a few pages of each site. We have noticed from other experiments that when one samples 5–10 pages per site one can get an excellent similarity measure. To speed things up in this experiment, however, we only sampled two pages per site. Even such a small sample, though, produced more than acceptable results.

Similarity was determined using the tf.idf ranking on the universe of the sites explored. To decide on the cutoff point between LFs and MASs, we created a site that contained a few random news pages from Reuters. We call this site the divider site. Sites more similar to the target site than the divider site is are categorized as LF sites; the others are grouped into MAS sites.

Finally, to further reduce the effect of the explosive nature of the web, we introduced the concept of stop sites. A stop site is one that the user believes should not be included in the trust graph, either because the trustworthiness of such a site is known or because it cannot be defined. In the first group we placed educational institutions as determined by their URL. In the latter we placed a few well-known Directories and Blog sites.

We recognize that each of the decisions above can be strengthened, thus strengthening the results of our approach. In particular, increasing the fan parameter above 30 will recognize more sites in the neighborhood. Using a larger universe in calculating similarity and increasing the sampling of pages per site will give a better approximation of the LF and MAS groups. Our rather conservative approach provided a solid proof of concept for our hypothesis, while remaining fast enough for browser-side implementation.

We ran a breadth-first search on the backlinks for each target site, looking at the 3-trust neighborhood with a fan of 30 sites. We categorized the sites in the neighborhood into members of a link farm (LF) or members of a mutual admiration society (MAS) based on their similarity to the target site. The sites were then evaluated for trustworthiness. Due to the effort involved, only a randomly chosen 10% of the MAS sites were evaluated. All of the LF sites were evaluated, however.

As the results below show, there were almost no trustworthy sites in the 3-trust neighborhoods of the untrustworthy ones. As one might expect, a trustworthy site is unlikely to deliberately link to an untrustworthy one, or even to one that “associates” with one. Not surprisingly, the statement is not as strong for the trustworthy sites, since untrustworthy sites are free to link to whomever they choose (although thanks to PageRank, spammers are unlikely to want to link to too many sites outside their spamming network in order to avoid “leaking” rank [5]).

Target   T(LF)   T(MAS)   U(LF)   U(MAS)
U-1        0%      0%      96%     88%
U-2        0%      0%     100%     65%
U-3        0%      0%      95%    100%
U-4        0%      0%      88%     83%
U-5        0%      0%     100%     73%
U-6       11%      9%      89%     57%
T-1       86%     70%      14%      0%
T-2       64%     64%      33%     13%

Our experiments showed that the quality of the starting site was a very good predictor for the quality of the BCC sites. While most of the results above show no accidental or erroneous linking of a trustworthy site to an untrustworthy one, we found such evidence in one experiment.

6. RELATED WORK

Web spamming has received a lot of attention lately [1, 3, 4, 5, 12, 13, 19, 21, 22, 24, 27, 29, 30, 33]. The first papers to raise the issue were [30, 22]. The spammers’ success was noted in [13, 12, 10, 2, 4, 16, 23].

Characteristics of spamming sites based on diversion from power laws are presented in [12]. Current tricks employed by spammers are detailed in [18]. An analysis of the popular PageRank method employed by many search engines today and ways to maximize it in a spamming network is described in [5]. A modification to PageRank that takes into account the opinions of human editors, employees of a search engine, is presented in [19]. A comprehensive treatment of social networks is presented in [38]. The connection between web spammers and social propagandists, and how the evolution of search engines can be understood as a response to spammers, is presented in [32]. Propagation methods for trust and distrust are discussed in [17]. Some work on personalized web search is presented in [20, 25]. The effect that search engines have on page popularity was discussed in [9].

7. CONCLUSIONS AND FUTURE DIRECTIONS

In this paper we have argued that web spam is to cyberworld what propaganda is to society. As far as we know, this is the first time this relationship has been noted. As evidence of this analogy and its importance, we have shown that the evolution of search engines can be simply understood as the search engines’ response to defend against spam. New search engines are not invented every few years, as is sometimes reported; they are developed when researchers have a good answer to spam. More importantly, we have shown that this relationship can guide us towards developing heuristics that recognize spammers. In particular, we have presented automatic ways of recognizing trust neighborhoods on the web based on the biconnected component around some starting site. Experimental results from a number of such instances show our algorithm’s ability to recognize parts of a spamming network.

With such results, the question arises as to what one should do once one recognizes a spamming network. This is a question that has not attracted much attention in the past. The “obvious” approach is that a search engine would delete such networks from its indices [12] or might downgrade them by some prespecified amount [19], as has been reported in the past [36].

Both of these approaches, however, require a universal agreement on what constitutes spam. Such an agreement cannot exist; one person’s spam may be another person’s treasure. Should the search engines determine what is trustworthy and what is not? Willing or not, they are the de facto arbiters of what information users see. As in a well-known cartoon, the kid responds to the old man who has been looking all his life for the meaning of life: “If it is not on Google or eBay, it does not exist.”

We believe that it is the users’ right and responsibility to decide what is acceptable for them. Their browser, their window to cyberworld, should enhance their ability to make this decision. User education is fundamental: people should know how search engines work and why, and how information appears on the web. But they should also have a browser that can help them determine the validity and trustworthiness of information.

The tool we described in an earlier section is a first step in this direction. Ultimately, it would be used along with a set of trust certificates that contains the portable trust preferences of the user, a set of preferences that the user can accumulate over time. A combination of search engines capable of providing indexed content and structure, including identified neighborhoods, with a browser capable of filtering those neighborhoods through the user’s trust preferences, would provide a new level of reliability to the user’s information gathering.

8. ACKNOWLEDGEMENTS

The authors would like to thank Mirena Chausheva, Meredith Beaton-Lacoste and Scott Dynes for their valuable contributions.

9. REFERENCES

[1] B. Amento, L. Terveen, and W. Hill. Does authority mean quality? Predicting expert quality ratings of web documents. In Proceedings of the Twenty-Third Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 2000.
[2] A. Arasu, J. Cho, H. Garcia-Molina, A. Paepcke, and S. Raghavan. Searching the web. ACM Transactions on Internet Technology, 1(1):2–43, June 2001.
[3] K. Bharat, A. Z. Broder, J. Dean, and M. R. Henzinger. A comparison of techniques to find mirrored hosts on the WWW. Journal of the American Society of Information Science, 51(12):1114–1122, 2000.
[4] K. Bharat, B.-W. Chang, M. R. Henzinger, and M. Ruhl. Who links to whom: Mining linkage between web sites. In Proceedings of the 2001 IEEE International Conference on Data Mining, pages 51–58. IEEE Computer Society, 2001.
[5] M. Bianchini, M. Gori, and F. Scarselli. PageRank and web communities. In Web Intelligence Conference 2003, Oct. 2003.
[6] S. Brin and L. Page. The anatomy of a large-scale hypertextual Web search engine. Computer Networks and ISDN Systems, 30(1–7):107–117, 1998.
[7] A. Broder. A taxonomy of web search. SIGIR Forum, 36(2):3–10, 2002.
[8] A. Broder, R. Kumar, F. Maghoul, P. Raghavan, S. Rajagopalan, R. Stata, A. Tomkins, and J. Wiener. Graph structure in the web. In Proceedings of the 9th international World Wide Web conference on Computer networks: the international journal of computer and telecommunications networking, pages 309–320. North-Holland Publishing Co., 2000.
[9] J. Cho and S. Roy. Impact of search engines on page popularity. In WWW 2004, May 2004.
[10] T. S. Corey. Catching on-line traders in a web of lies: The perils of internet stock fraud. Ford Marrin Esposito, Witmeyer & Glesser, LLP, May 2001. http://www.fmew.com/archive/lies/.
[11] G. Cybenko, A. Giani, and P. Thompson. Cognitive hacking: A battle for the mind. Computer, 35(8):50–56, 2002.
[12] D. Fetterly, M. Manasse, and M. Najork. Spam, damn spam, and statistics. In WebDB2004, June 2004.
[13] D. Fetterly, M. Manasse, M. Najork, and J. Wiener. A large-scale study of the evolution of web pages. In Proceedings of the twelfth international conference on World Wide Web, pages 669–678. ACM Press, 2003.
[14] G. W. Flake, S. Lawrence, C. L. Giles, and F. Coetzee. Self-organization of the web and identification of communities. IEEE Computer, 35(3):66–71, 2002.
[15] Google, Inc. The Google API. http://www.google.com/apis/.
[16] L. Graham and P. T. Metaxas. “Of course it’s true; I saw it on the internet!”: Critical thinking in the internet era. Commun. ACM, 46(5):70–75, 2003.
[17] R. Guha, R. Kumar, P. Raghavan, and A. Tomkins. Propagation of trust and distrust. In WWW 2004, May 2004.
[18] Z. Gyöngyi and H. Garcia-Molina. Web spam taxonomy. Technical Report TR 2004-25, Stanford University, 2004.
[19] Z. Gyöngyi, H. Garcia-Molina, and J. Pedersen. Combating web spam with TrustRank. In VLDB 2004, Aug. 2004.
[20] T. H. Haveliwala. Topic-sensitive PageRank. In Proceedings of the eleventh international conference on World Wide Web, pages 517–526. ACM Press, 2002.
[21] M. R. Henzinger. Hyperlink analysis for the web. IEEE Internet Computing, 5(1):45–50, 2001.
[22] M. R. Henzinger, R. Motwani, and C. Silverstein. Challenges in web search engines. SIGIR Forum, 36(2):11–22, 2002.
[23] M. Hindman, K. Tsioutsiouliklis, and J. Johnson. Googlearchy: How a few heavily-linked sites dominate politics on the web. In Annual Meeting of the Midwest Political Science Association, April 3-6 2003.
[24] L. Introna and H. Nissenbaum. Defining the web: The politics of search engines. Computer, 33(1):54–62, 2000.
[25] G. Jeh and J. Widom. Scaling personalized web search. In Proceedings of the twelfth international conference on World Wide Web, pages 271–279. ACM Press, 2003.
[26] J. M. Kleinberg. Authoritative sources in a hyperlinked environment. Journal of the ACM, 46(5):604–632, 1999.
[27] R. Kumar, P. Raghavan, S. Rajagopalan, and A. Tomkins. Trawling the Web for emerging cyber-communities. Computer Networks (Amsterdam, Netherlands: 1999), 31(11–16):1481–1493, 1999.
[28] A. Lee and E. L. (eds.). The Fine Art of Propaganda. The Institute for Propaganda Analysis. Harcourt, Brace and Co., 1939.
[29] C. A. Lynch. When documents deceive: trust and provenance as new factors for information retrieval in a tangled web. J. Am. Soc. Inf. Sci. Technol., 52(1):12–17, 2001.
[30] M. Marchiori. The quest for correct information on the web: hyper search engines. Comput. Netw. ISDN Syst., 29(8-13):1225–1235, 1997.
[31] M. L. Mauldin. Lycos: Design choices in an internet search service. IEEE Expert, January-February:8–11, 1997.
[32] P. T. Metaxas. Web spam: An application of Propaganda theory. Technical Report CSD-TR27-2004, Wellesley College, 2004.
[33] G. Pringle, L. Allison, and D. L. Dowe. What is a tall poppy among web pages? In Proceedings of the seventh international conference on World Wide Web 7, pages 369–377. Elsevier Science Publishers B. V., 1998.
[34] G. Salton. Dynamic document processing. Commun. ACM, 15(7):658–668, 1972.
[35] C. Silverstein, H. Marais, M. Henzinger, and M. Moricz. Analysis of a very large web search engine query log. SIGIR Forum, 33(1):6–12, 1999.
[36] M. Totty and M. Mangalindan. As Google becomes Web’s gatekeeper, sites fight to get in. In Wall Street Journal CCXLI(39), February 26 2003.
[37] A. Vedder. Medical data, new information technologies and the need for normative principles other than privacy rules. In Law and Medicine. M. Freeman and A. Lewis (Eds.), (Series Current Legal Issues), pages 441–459. Oxford University Press, 2000.
[38] S. Wasserman and K. Faust. Social Network Analysis: Methods and Applications. Cambridge University Press, 1994.
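
As an illustration of the heuristic of Section 5, the following Python sketch shows one way the h-trust neighborhood and the LF/MAS split could be computed. It is a minimal sketch under stated assumptions, not the authors' implementation: the `backlinks` mapping stands in for whatever backlink source is available (the experiments used the Google API [15]), the function names are hypothetical, text is tokenized on whitespace, and cosine similarity over tf.idf weights is assumed as the similarity measure, with the divider site's similarity used as a strict cutoff.

```python
import math
from collections import Counter, deque

def backlink_bfs(target, backlinks, depth=3, fan=30, stop_sites=frozenset()):
    """Breadth-first search over backlinks; returns undirected edges as 2-sets.

    `backlinks` (hypothetical input) maps each site to the sites linking to it.
    """
    seen, edges, queue = {target}, set(), deque([(target, 0)])
    while queue:
        site, dist = queue.popleft()
        if dist == depth:
            continue  # sites at the depth limit are not expanded further
        for src in backlinks.get(site, [])[:fan]:  # cap at the backlink fan
            if src == site or src in stop_sites:
                continue
            edges.add(frozenset((src, site)))
            if src not in seen:
                seen.add(src)
                queue.append((src, dist + 1))
    return edges

def biconnected_components(adj):
    """Vertex sets of the biconnected components (recursive Hopcroft-Tarjan).

    Recursive for brevity; a very deep graph would need an iterative version.
    """
    index, low, stack, comps = {}, {}, [], []

    def dfs(u, parent):
        index[u] = low[u] = len(index)
        for v in adj[u]:
            if v == parent:
                continue
            if v not in index:  # tree edge
                stack.append((u, v))
                dfs(v, u)
                low[u] = min(low[u], low[v])
                if low[v] >= index[u]:  # u separates v's subtree: pop a component
                    comp = set()
                    while True:
                        a, b = stack.pop()
                        comp.update((a, b))
                        if (a, b) == (u, v):
                            break
                    comps.append(comp)
            elif index[v] < index[u]:  # back edge to an ancestor
                stack.append((u, v))
                low[u] = min(low[u], index[v])

    for u in adj:
        if u not in index:
            dfs(u, None)
    return comps

def h_trust_neighborhood(target, backlinks, h=3, fan=30, stop_sites=frozenset()):
    """Largest biconnected component containing `target` in its h-hop backlink graph."""
    adj = {}
    for a, b in backlink_bfs(target, backlinks, h, fan, stop_sites):
        adj.setdefault(a, set()).add(b)
        adj.setdefault(b, set()).add(a)
    comps = [c for c in biconnected_components(adj) if target in c]
    return max(comps, key=len, default={target})

def tfidf_vectors(docs):
    """docs maps site -> sampled text; returns site -> {term: tf * idf}."""
    counts = {site: Counter(text.lower().split()) for site, text in docs.items()}
    df = Counter(term for c in counts.values() for term in c)
    n = len(docs)
    return {s: {t: tf * math.log(n / df[t]) for t, tf in c.items()}
            for s, c in counts.items()}

def cosine(u, v):
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm = math.sqrt(sum(w * w for w in u.values())) * math.sqrt(sum(w * w for w in v.values()))
    return dot / norm if norm else 0.0

def split_lf_mas(target, divider, docs):
    """Sites more similar to the target than the divider site is go to LF, others to MAS."""
    vecs = tfidf_vectors(docs)
    cutoff = cosine(vecs[target], vecs[divider])
    lf = {s for s in docs if s not in (target, divider)
          and cosine(vecs[target], vecs[s]) > cutoff}
    return lf, set(docs) - lf - {target, divider}
```

On a toy input such as `{"s": ["a", "b"], "a": ["b", "c"], "b": ["s"], "c": ["d"]}`, the 3-trust neighborhood of `"s"` is the triangle `{"s", "a", "b"}`; the chain through `"c"` and `"d"` is excluded because it is attached only through the cut vertex `"a"`. Passing educational or directory sites in `stop_sites` keeps them out of the graph, mirroring the stop-site simplification.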