ClearStory uncloaks with big data visualization vision
Teaching elephants to draw pictures for CEOs
Yet another big-data company has come out of stealth mode, and this one, called ClearStory Data, has its sights set on making it easier for companies to mash up and visualize data sets rather than just focusing on data-munching itself.
The founders of ClearStory – Sharmila Mulligan, John Cieslewicz, and Vaibhav Nivargi – all worked together at Aster Data Systems, the columnar clustered database maker that was acquired by data warehousing pioneer Teradata for $263m a little more than a year ago. They are all well aware of the big data challenges facing companies in terms of the volume and types of data that they want to chew on and mix up to gain insight into their businesses and customers.
But rather than come up with a cleverly named Hadoop distribution or yet another kind of data warehouse, ClearStory is taking on the data integration and visualization jobs with its forthcoming ClearStory Data Service. As the name suggests, it will be cloudy and offered as a service.
[Photo: ClearStory founders John Cieslewicz, Sharmila Mulligan, and Vaibhav Nivargi]
The idea behind the ClearStory Data Service, CEO Mulligan explained to El Reg, is to create what she calls a "self-driven data exploration tool," something that knows how to integrate with various public data sets – such as DataSift, a partner of Twitter's that gives data analysts access to the full Tweet feed, Microsoft's Dallas data service for its Azure public clouds, or various government data repositories – and that can also be used to merge information with multiple data sets generated by transaction processing and web-log systems.
The problem that ClearStory will attack first is the integration of these disparate data sources. "These services all expose their own data APIs, but they are not for your average user," says Mulligan.
Each data set has its own eccentricities and APIs, and that can make mashing them up difficult. So the first thing that the ClearStory Data Service will do is mask the differences from those who want to mix data sets, and try to find correlations across many layers of data. You can think of it as a universal API-translator of sorts.
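ClearStory has not said how its translation layer works, so the following is only a minimal Python sketch of the general idea: each source gets its own small translator that turns its native payload into one common record shape, after which the blending step no longer cares where the data came from. Every name here (`Record`, `translate_social`, `translate_weblog`, `blend`) and both sample payloads are hypothetical and are not part of any ClearStory, DataSift, or Azure API.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, List


@dataclass(frozen=True)
class Record:
    """Common shape that every source is translated into (hypothetical)."""
    source: str
    key: str
    value: float


def translate_social(rows: Iterable[dict]) -> List[Record]:
    # Assumed payload: a list of dicts with "topic" and "mentions" fields.
    return [Record("social", r["topic"], float(r["mentions"])) for r in rows]


def translate_weblog(rows: Iterable[tuple]) -> List[Record]:
    # Assumed payload: a list of (page, hits) tuples from a web-log system.
    return [Record("weblog", page, float(hits)) for page, hits in rows]


def blend(records: Iterable[Record]) -> Dict[str, Dict[str, float]]:
    """Group records by key so values from different sources sit side by side."""
    blended: Dict[str, Dict[str, float]] = {}
    for rec in records:
        blended.setdefault(rec.key, {})[rec.source] = rec.value
    return blended


# Two sources with different native formats, merged through the common shape.
social = translate_social([{"topic": "shoes", "mentions": 120}])
weblog = translate_weblog([("shoes", 45), ("hats", 7)])
combined = blend(social + weblog)
```

In this sketch, adding a new data source means writing one more translator; the blending code stays untouched, which is roughly what masking the differences between APIs amounts to.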
The second task that the ClearStory service will tackle is creating the data layers and presenting them graphically to data analysts. This visualization part of the service uses various – and unspecified – tools to display overlaid data sets inside of a normal web browser. (Well, if there were such a thing as a "normal" web browser.)
"Our big focus is on how you do the blending of the data," says Mulligan.
The idea, says Nivargi, the third cofounder, is to allow the ClearStory Data Service to run on public clouds and to interface with public data sets. Nivargi says that Amazon EC2, Eucalyptus, OpenStack, Microsoft Azure, and VMware vCloud public clouds are all moving towards compatibility. (El Reg is ever supportive of standards, but disappointed with the level of standardization for Unix, Linux, and blade servers, just to name three.) Hope for future compatibility springs eternal, however, and ClearStory intends to get its service running on multiple clouds.
And for those companies that are anxious about letting their data outside of the firewall, ClearStory will also peddle a private cloud version of the data-exploration stack that can be run internally.
ClearStory was founded last September after a few months of kicking around some ideas among the three cofounders. The company is located in Palo Alto, California, and has just signed up Google Ventures, Andreessen Horowitz, and Khosla Ventures for its first round of venture capital funding, an amount it did not disclose.
Some of that first-round dough came from private investors, including Andy Rachleff, founder of Benchmark Capital; Anand Rajaraman and Venky Harinarayan, who are SVPs at Wal-mart Global e-Commerce and cofounders of Junglee and Kosmix, respectively; Tim Howes, cofounder of Rockmelt and ex-CTO at Netscape Communications; and Nitin Donde, a former executive at EMC, 3PAR, and Aster Data.
Mulligan says that ClearStory is signing up customers for early access to the product in late summer, and expects it to be generally available by the end of the year. The company currently has ten employees, but now that it has cash, it is hiring. ®