
Google releases open source web data sifting tool

Stop freebasing and Refine, Refine, Refine


Google has slapped some Mountain View Chocolate Factory lipstick onto the Freebase Gridworks software the company scooped up when it bought Metaweb Technologies in July.

The result is a project Google has dubbed Refine 2.0 that builds on Metaweb’s open source tech to clean up unwieldy data sets.

“Version 2.0 introduces a new extensions architecture, a reconciliation framework for linking records to other databases (like Freebase), and a ton of new transformation commands and expressions,” said Google.

The first iteration of Freebase Gridworks was used as a research tool by a number of organisations, including government data site data.gov.uk and news outfit ProPublica, said the firm.

In July Google said that it hoped to "improve search and make the web richer and more meaningful for everyone" following the acquisition of Metaweb.

Metaweb's Freebase database stored information on over 11 million "things in the world", including movies, books, TV shows, celebrities, locations, and companies. Metaweb was set up to help customers use this data to enhance the design of their websites.

Google has more about the project here. ®
