Page hijacking
From Wikipedia, the free encyclopedia
Page hijacking is a form of spamming the index of a search engine (spamdexing). It is achieved by creating a rogue copy of a popular website that shows content similar to the original to a web crawler, but redirects web surfers to unrelated or malicious websites. Spammers can use this technique to achieve high rankings in result pages for certain keywords.
Page hijacking is a form of cloaking, made possible because some web crawlers detect duplicates while indexing web pages. If two pages have the same content, only one of the URLs will be kept. A spammer will try to ensure that the rogue website is the one shown on the result pages.
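The duplicate handling described above can be sketched as follows. This is a minimal illustration under stated assumptions, not how any particular search engine works: the `fingerprint` and `keep_one_url` names, the exact-match hashing, and the per-URL `score` used to pick a winner are all hypothetical.

```python
import hashlib
from typing import Dict


def fingerprint(html: str) -> str:
    """Hash the page after collapsing whitespace and lower-casing it."""
    normalised = " ".join(html.split()).lower()
    return hashlib.sha256(normalised.encode("utf-8")).hexdigest()


def keep_one_url(pages: Dict[str, str], score: Dict[str, float]) -> Dict[str, str]:
    """Map each content fingerprint to the single URL that is kept.

    `score` is a hypothetical per-URL ranking signal; the URL with the
    higher score wins when two pages share a fingerprint.
    """
    kept: Dict[str, str] = {}
    for url, html in pages.items():
        fp = fingerprint(html)
        if fp not in kept or score.get(url, 0.0) > score.get(kept[fp], 0.0):
            kept[fp] = url
    return kept


pages = {
    "http://www.example.com/": "<h1>SpecialClothes</h1>",
    "http://www.example.net/": "<h1>SpecialClothes</h1>  ",  # rogue copy
}
# If the rogue copy ends up with the higher score, it is the URL that survives.
kept = keep_one_url(
    pages,
    {"http://www.example.com/": 1.0, "http://www.example.net/": 2.0},
)
print(list(kept.values()))
```

In this sketch the original URL disappears from the result set as soon as the copy outscores it, which is the outcome the spammer is trying to bring about.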
Case Study: Google Jacking
One form of this activity involves 302 server-side redirects on Google. Hundreds of 302 Google Jacking pages were said to have been reported to Google.[citation needed] While Google has not officially acknowledged that page hijacking is a real problem, several site owners have reported being victims of this phenomenon after checking the search engine rankings for their websites. Because it is difficult to quantify how many pages have been hijacked, GoogleJacking.org was founded in May 2006 to help make Google aware of the significance of Google Jacking. Visitors can add themselves to a map, providing a visual indicator of how widespread the problem is.
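HTTP status 302 marks a redirect as temporary. A webmaster who wants to see which referring URLs answer with a 302 that points at another host could use a check along these lines. The function name `offsite_temporary_redirect` is hypothetical, and a positive result is only a candidate for closer inspection, since most such redirects are legitimate.

```python
from typing import Optional
from urllib.parse import urljoin, urlsplit


def offsite_temporary_redirect(
    request_url: str, status: int, location: Optional[str]
) -> Optional[str]:
    """Return the redirect target if the response is a 302 to another host."""
    if status != 302 or not location:
        return None
    # The Location header may be relative, so resolve it against the request URL.
    target = urljoin(request_url, location)
    if urlsplit(target).hostname == urlsplit(request_url).hostname:
        return None  # same-host redirect, not of interest here
    return target


# Example values only; both hosts are placeholders.
print(offsite_temporary_redirect(
    "http://www.example.net/go?id=1", 302, "http://www.example.com/"
))
```

The status code and `Location` header can come from any HTTP client that is configured not to follow redirects automatically.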
Example of Page Hijacking
Suppose that a website offers difficult-to-find sizes of clothes. A common search entered to reach this website is really big t-shirts, which, when entered on popular search engines, makes the website show up as the first result:
- SpecialClothes
- Offering clothes in sizes you cannot find elsewhere.
- www.example.com/
A spammer working for a competing company then creates a website that looks extremely similar to the one listed and includes a special redirection script that redirects web surfers to the competitor's site, but shows the copied page to web crawlers. After several weeks, a web search for really big t-shirts shows the following result:
- SpecialClothes
- Offering clothes in sizes you cannot find elsewhere... at better prices!
- www.example.net/
- —Show Similar Pages—
When web surfers click on this result, they are redirected to the competing website. The original result is hidden in the "Show Similar Pages" section.
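One way to look for the behaviour in this example is to request the same URL once as a crawler and once as a browser and compare the two views. The sketch below assumes the rogue site decides what to serve from the `User-Agent` header, which is only one possible mechanism. The `View`, `fetch_as`, and `cloaking_signals` names, the browser string, and the 0.9 similarity threshold are illustrative choices.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher
from typing import List
from urllib.parse import urlsplit
from urllib.request import Request, urlopen

CRAWLER_UA = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"
BROWSER_UA = "Mozilla/5.0 (X11; Linux x86_64)"  # illustrative browser string


@dataclass
class View:
    final_url: str  # URL after any HTTP redirects were followed
    body: str


def fetch_as(url: str, user_agent: str) -> View:
    """Fetch `url` with the given User-Agent, following HTTP redirects."""
    request = Request(url, headers={"User-Agent": user_agent})
    with urlopen(request, timeout=10) as response:
        body = response.read().decode("utf-8", errors="replace")
        return View(response.geturl(), body)


def cloaking_signals(
    as_crawler: View, as_browser: View, threshold: float = 0.9
) -> List[str]:
    """List the ways in which the crawler view and the browser view differ."""
    signals = []
    crawler_host = urlsplit(as_crawler.final_url).hostname
    browser_host = urlsplit(as_browser.final_url).hostname
    if crawler_host != browser_host:
        signals.append("final host differs")
    similarity = SequenceMatcher(None, as_crawler.body, as_browser.body).ratio()
    if similarity < threshold:  # arbitrary cut-off, tune for real pages
        signals.append("content differs")
    return signals
```

A call such as `cloaking_signals(fetch_as(url, CRAWLER_UA), fetch_as(url, BROWSER_UA))` returns an empty list when both views match. The check has limits: `urlopen` does not execute scripts, so a redirect performed in JavaScript would go unnoticed, and pages with dynamic content can differ between two requests for legitimate reasons.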
References
- "Google Regains Its Hijacked Listing; This Was A Big Deal, Folks!", SearchEngineWatch, May 26, 2005.
- "I heard Google needs more examples of 302 hijacking (entry #5)", SearchEngineWatch, 02-08-2005, 11:45 AM.
Tools and Information for Webmasters
- Webmaster Forums at the Open Directory Project (suggest site)
- Online tool that detects spam techniques on web pages
- A paper explaining various methods to determine webpage/blog spam
- A public, searchable database of blog spam pages or spam blogs
- AIRWeb '05: First Workshop on Adversarial Information Retrieval on the Web - Research on search engine spamming
External links
- http://www.googlejacking.org - A website founded to make Google aware of how widespread the 302 Google Jacking problem is.
- http://clsc.net/research/google-302-page-hijack.htm - An in-depth look at page hijacking.