Yahoo! to open source floating Google-Amazon crossbreed

Our cloud is your cloud

Early next year, Yahoo! intends to open source its internal "cloud serving" platform, described as something halfway between Amazon's Elastic Compute Cloud and Google's App Engine.

Known simply as "Cloud" within the company, the platform is that piece of Yahoo! infrastructure that serves up its online applications. In short, it provides the company's internal developers with on-demand access to computing resources. But rather than offering raw virtual machines as Amazon EC2 does, it spins up "containers" of server power that are pre-configured for things like load-balancing and security. That way, developers needn't handle the load-balancing on their own.

Google App Engine also handles this sort of nitty-gritty on behalf of the developer, but it goes much further. It hides even more of the underlying infrastructure, and it puts tight restrictions on the design of applications so they'll conform with this infrastructure. It restricts what languages you use. It limits the libraries you can choose from. It even prevents you from making system requests that take more than 30 seconds or return more than 10MB of data.

With its "Cloud," Yahoo! abstracts some of the infrastructure, but it also lets you develop with all those standard LAMP stack tools you're used to. "We don't bless the language," Yahoo! chief architect Raymie Stata tells The Reg. "We bless the container."

The company says its current plan is to open source the platform early in 2011. And eventually, it will open source all its back-end platforms.

The company already uses the open source Hadoop for distributed number crunching - it builds the search webmap, among many other tasks - and last June, it released its very own Hadoop distro. Then, in November, it released its Traffic Server, which handles edge caching, edge processing, and load balancing, while also managing traffic on the company's storage and server-virtualization services.

At some point, it will also open source its storage platform and its data pipeline.

All of which makes Yahoo! quite different from Google, which likes to keep its custom-built back-end platforms to itself. That said, Google has published papers describing its GFS distributed file system and MapReduce distributed number cruncher, and these became the basis for Hadoop. Since then, however, the company has developed a new file system known at least informally as GFS2, and this will eventually be rolled out as part of the company's "Caffeine" search infrastructure.

Amazon's EC2 is also closed, but the open source Eucalyptus project has mimicked its setup, using the same APIs, for those looking to operate their own internal clouds. Eucalyptus is bundled with Ubuntu server, and it's the basis for the new federal government Nebula cloud that's under construction at NASA. ®
