Feeds

Yahoo! to open source floating Google-Amazon crossbreed

Our cloud is your cloud

Intelligent flash storage arrays

Early next year, Yahoo! intends to open source its internal "cloud serving" platform, described as something halfway between Amazon's Elastic Compute Cloud and Google's App Engine.

Known simply as "Cloud" within the company, the platform is that piece of Yahoo! infrastructure that serves up its online applications. In short, it provides the company's internal developers with on-demand access to computing resources. But rather than offering raw virtual machines as Amazon EC2 does, it spins up "containers" of server power that are pre-configured for things like load-balancing and security. That way, developers needn't handle the load-balancing on their own.

Google App Engine also handles this sort of nitty-gritty on behalf of the developer, but it goes much further. It hides even more of the underlying infrastructure, and it puts tight restrictions on the design of applications so they'll conform with this infrastructure. It restricts what languages you use. It limits the libraries you can choose from. It even prevents you from making system requests that take more than 30 seconds or return more than 10MB of data.

With its "Cloud," Yahoo! abstracts some of the infrastructure, but it also lets you develop with all those standard LAMP stack tools you're used to. "We don't bless the language," Yahoo! chief architect Raymie Stata tells The Reg. "We bless the container."

The company says its current plan is to open source the platform early in 2011. And eventually, it will open source all its back-end platforms.

The company already uses the open source Hadoop for distributed number crunching - this is used to build its search webmap, among so many other tasks - and last June, it released its very own Hadoop distro. Then, in November, it released its Traffic Server, which handles edge caching, edge processing, and load balancing, while also managing traffic on the company's storage and server-virtualization services.

At some point, it will also open source its storage platform and its data pipeline.

All of which makes Yahoo! quite different from Google, which likes to keep its custom-built back-end platforms to itself. That said, Google has published papers describing its GFS distributed file system and MapReduce distributed number cruncher, and these became the basis for Hadoop. Since then, however, the company has developed a new file system known at least informally as GFS2, and this will eventually be rolled out as part of the company's "Caffeine" search infrastructure.

Amazon's EC2 is also closed, but using its APIs the open source Eucalyptus project has mimicked its setup for those looking to operate their own internal clouds. It's bundled with Ubuntu server, and it's the basis for the new federal government Nebula cloud that's under construction at NASA. ®

Intelligent flash storage arrays

More from The Register

next story
NSA SOURCE CODE LEAK: Information slurp tools to appear online
Now you can run your own intelligence agency
Azure TITSUP caused by INFINITE LOOP
Fat fingered geo-block kept Aussies in the dark
NASA launches new climate model at SC14
75 days of supercomputing later ...
Yahoo! blames! MONSTER! email! OUTAGE! on! CUT! CABLE! bungle!
Weekend woe for BT as telco struggles to restore service
Cloud unicorns are extinct so DiData cloud mess was YOUR fault
Applications need to be built to handle TITSUP incidents
BOFH: WHERE did this 'fax-enabled' printer UPGRADE come from?
Don't worry about that cable, it's part of the config
Stop the IoT revolution! We need to figure out packet sizes first
Researchers test 802.15.4 and find we know nuh-think! about large scale sensor network ops
SanDisk vows: We'll have a 16TB SSD WHOPPER by 2016
Flash WORM has a serious use for archived photos and videos
Astro-boffins start opening universe simulation data
Got a supercomputer? Want to simulate a universe? Here you go
prev story

Whitepapers

Go beyond APM with real-time IT operations analytics
How IT operations teams can harness the wealth of wire data already flowing through their environment for real-time operational intelligence.
Why CIOs should rethink endpoint data protection in the age of mobility
Assessing trends in data protection, specifically with respect to mobile devices, BYOD, and remote employees.
A strategic approach to identity relationship management
ForgeRock commissioned Forrester to evaluate companies’ IAM practices and requirements when it comes to customer-facing scenarios versus employee-facing ones.
Reg Reader Research: SaaS based Email and Office Productivity Tools
Read this Reg reader report which provides advice and guidance for SMBs towards the use of SaaS based email and Office productivity tools.
Protecting against web application threats using SSL
SSL encryption can protect server‐to‐server communications, client devices, cloud resources, and other endpoints in order to help prevent the risk of data loss and losing customer trust.