Google releases open source web data sifting tool

Stop freebasing and Refine, Refine, Refine

Google has slapped some Mountain View Chocolate Factory lipstick onto the Freebase Gridworks software the company scooped up when it bought Metaweb Technologies in July.

The result is a project Google has dubbed Refine 2.0 that builds on Metaweb’s open source tech to clean up unwieldy data sets.

“Version 2.0 introduces a new extensions architecture, a reconciliation framework for linking records to other databases (like Freebase), and a ton of new transformation commands and expressions,” said Google.

The first iteration of Freebase Gridworks was used as a research tool by a number of organisations, including data.gov.uk and ProPublica, said the firm.

In July Google said that it hoped to "improve search and make the web richer and more meaningful for everyone" following the acquisition of Metaweb.

The original tool stored information on over 11 million "things in the world", including movies, books, TV shows, celebrities, locations, and companies. Metaweb was set up to help customers use this data to enhance the design of their websites.

Google has more about the project here. ®
