… a decade ago, the spam situation was so bad that search engines would regularly return off-topic webspam. For the most part, Google has successfully beaten that—even while some spammers resort to sneakier or even illegal tactics such as hacking websites. Today, English-language spam in Google’s results is less than half what it was five years ago, and even lower in other languages.
However, we have seen a slight uptick of spam in recent months
We recently launched a document-level classifier that makes it harder for spammy on-page content to rank highly. The new classifier is better at detecting spam on individual web pages, e.g., repeated spammy words—the sort of phrases you tend to see in junky, automated, self-promoting blog comments. We’ve also radically improved our ability to detect hacked sites.
We’ll explore … new ways for users to give more explicit feedback about spammy and low-quality sites.
As “pure webspam” has decreased over time, attention has shifted instead to “content farms,” which are sites with shallow or low-quality content. In 2010, we launched two major algorithmic changes focused on low-quality sites. Nonetheless, people are asking for even stronger action [on such sites]. We can and should do better.
January 21, 2011