Feeds

Google buys search engine – PageRank™ RIP?

Got bots?

  • alert
  • submit to reddit

Intelligent flash storage arrays

Google has bought Kaltix, a three-month-old, three-man Stanford startup that's working on personalized and context-sensitive search. Despite its battalion of PhDs, Google isn't too proud to acquire external search technologies, and earlier this year bought Applied Semantics for its CIRCA ontology, which "understands, organizes, and extracts knowledge from websites and information repositories in a way that mimics human thought".

Google has made no secret of its goal to "understand" the web, an acknowledgement that its current brute-force text index produces search results with little or no context. The popularity of Teoma demonstrates that even a small index can produce superior results for certain kind of searches. Teoma leans on existing classification systems.

While Google relied on PageRank™ to provide context, all was well. But PageRank is now widely acknowledged to be broken, so new, smarter tricks are required.

Regarded as heresy when we raised the issue last spring, now some of Google's warmest admirers, MetaFilter's Matt Haughey and web designer Jason Kottke have acknowledged the problem.

As Gary Stock noted here last May, Google "didn't foresee a tightly-bound body of wirers. They presumed that technicians at USC would link to the best papers from MIT, to the best local sites from a land trust or a river study - rather than a clique, a small group of people writing about each other constantly. They obviously bump the rankings system in a way for which it wasn't prepared."

Although it's tempting to suggest that bloggers broke PageRank™ it might equally be the case that the Blog Noise issue is emblematic rather than causal. Blog Noise - in the form of 'trackbacks', content-free pages and other chaff - is the most visible manifestation, but mindless list-generators are also to blame for Google's poor performance. And the truth is every successful search engine will find itself engaged in an arms race with gamers. (Deliciously, in the case of email spammer Elwyn Jenkins, a former e-currency salesman who proselytizes weblogs by day, and by night offers advice on how to improve your PageRank, the bloggers and the Google gamers are one and the same [includes screenshots]).

Daniel Brandt, who runs the 100,000-document NameBase archive, has been PageRank&trade's most severe critic, and acknowledges that it lives on in name only. Google no longer performs a monthly recalculation of PageRank values and anchor text is the most highly valued criteria for a search, he says. Which makes Google hardly distinguishable from AltaVista five years ago.

"Quantity not quality is the word on the street," he told us. "The old method of doing PageRank™ which had more integrity or consistency, to it is no longer being done. PageRank™ was a bad idea philosophically to begin with, but now some spammer can set up hundreds or thousands of sites automatically with anchor texts pointing to one page. Before, each would have such a tiny PageRank™ that it wouldn't amount to a hill of beans. Now you can do that and if the anchor text is carefully chosen it will make a difference," he reckons. "The cure is worse than the disease".

As an example he cites the results for the search "discount brokers" - of all the discount brokers, Google's top results is a page which has been empty for a year.

So perhaps Google needs to give a formal burial to PageRank™ rather than maintaing
ing a goulish afterlife as a marketing gimmick. The future promises to be much more interesting. ®

Related Links

"Make Money Online" - How bloggers game Google
PageRank is Dead [Zawodny]
"Google is busted" [Kottke]
Maybe I should write for the Register UK too? [Haughey] - follow-up

Related Stories

Google to fix blog noise problem
Blog noise is 'life or death' for Google
Google - the only archive we'll ever need?
Google heals the sick

Internet Security Threat Report 2014

More from The Register

next story
Facebook, Apple: LADIES! Why not FREEZE your EGGS? It's on the company!
No biological clockwatching when you work in Silicon Valley
Lords take revenge on REVENGE PORN publishers
Jilted Johns and Jennies with busy fingers face two years inside
Yes, yes, Steve Jobs. Look what I'VE done for you lately – Tim Cook
New iPhone biz baron points to Apple's (his) greatest successes
Happiness economics is bollocks. Oh, UK.gov just adopted it? Er ...
Opportunity doesn't knock; it costs us instead
Ex-US Navy fighter pilot MIT prof: Drones beat humans - I should know
'Missy' Cummings on UAVs, smartcars and dying from boredom
Sysadmin with EBOLA? Gartner's issued advice to debug your biz
Start hoarding cleaning supplies, analyst firm says, and assume your team will scatter
Facebook pays INFINITELY MORE UK corp tax than in 2012
Thanks for the £3k, Zuck. Doh! you're IN CREDIT. Guess not
Edward who? GCHQ boss dodges Snowden topic during last speech
UK spies would rather 'walk' than do 'mass surveillance'
prev story

Whitepapers

Forging a new future with identity relationship management
Learn about ForgeRock's next generation IRM platform and how it is designed to empower CEOS's and enterprises to engage with consumers.
Why and how to choose the right cloud vendor
The benefits of cloud-based storage in your processes. Eliminate onsite, disk-based backup and archiving in favor of cloud-based data protection.
Three 1TB solid state scorchers up for grabs
Big SSDs can be expensive but think big and think free because you could be the lucky winner of one of three 1TB Samsung SSD 840 EVO drives that we’re giving away worth over £300 apiece.
Reg Reader Research: SaaS based Email and Office Productivity Tools
Read this Reg reader report which provides advice and guidance for SMBs towards the use of SaaS based email and Office productivity tools.
Security for virtualized datacentres
Legacy security solutions are inefficient due to the architectural differences between physical and virtual environments.