Feeds

Big, distributed, and fast: Ehcache sucks up search

Java for the NoSQL generation

5 things you didn’t know about cloud backup

Open sourcers running the Ehcache distributed Java cache can now search their data in near real-time by harnessing lashed-together servers.

Terracotta, which bought the Ehcache project in 2009, has released Ehcache 2.4. It features an API extension that lets you perform object-level queries of data held in memory. The API is backwards compatible, so should work on older versions of Ehcache.

According to Terracotta, this lets you avoid performance bottlenecks encountered when crunching large amounts of data on a single server. Also, in using general purpose caching with your existing servers, you can sidestep expensive hardware appliances that funnel data through hundreds of cores, terabytes of memory, and Infiniband.

Terracotta said its architecture targets people crunching terabytes of data, rather than petabytes, and it claims searches of 48 seconds can now be executed in just half a second. Because Ehcache is built Java, you can also build search queries using Java rather than build search queries using a different query language.

Terracotta said it's in talks with business-intelligence tools vendors to plug their tools into the API to enable more sophisticated slicing and dicing of data.

The Ehcache project is used in about 70 per cent of Java caching. The ability to query data held in memory using the system comes as big-data providers look for ways help customers make sense of the information quickly amassing in their big-data silos.

Customers have been deploying a host of NoSQL architectures to catch data because NoSQL is seen as faster and more scalable than SQL databases in large server farms and on large web sites.

But inevitably, people now want to query the information gathered - data like searches, personal updates, and Tweets - but the search tools have been lacking.

Last month, open-source BI vendor Jaspersoft announced its Native Reporting Big Data project to build connectors that can natively query data in NoSQL databases and other stores.

Jaspersoft's project currently offers connectors for NoSQL databases Cassandra, CouchDB, MongoDB, Riak, and Neo4j; the Hadoop and Infinispan data crunching frameworks; key-value store Redis; and massively parallel processing (MMP) analytic database Vertica bought by Hewlett Packard this week.

Terracotta says Ehcache 2.4 is different from the NoSQL stable becuase it offers "tried and tested" enterprise Java that fits into existing Java architectures with "strongly consistent" data across different nodes. "We see ourselves as a bridge between the traditional and the new," Terracotta chief executive Amit Pandey said. ®

Build a business case: developing custom apps

More from The Register

next story
The Return of BSOD: Does ANYONE trust Microsoft patches?
Sysadmins, you're either fighting fires or seen as incompetents now
Linux turns 23 and Linus Torvalds celebrates as only he can
No, not with swearing, but by controlling the release cycle
China hopes home-grown OS will oust Microsoft
Doesn't much like Apple or Google, either
Sin COS to tan Windows? Chinese operating system to debut in autumn – report
Development alliance working on desktop, mobe software
Apple promises to lift Curse of the Drained iPhone 5 Battery
Have you tried turning it off and...? Never mind, here's a replacement
Eat up Martha! Microsoft slings handwriting recog into OneNote on Android
Freehand input on non-Windows kit for the first time
Linux kernel devs made to finger their dongles before contributing code
Two-factor auth enabled for Kernel.org repositories
This is how I set about making a fortune with my own startup
Would you leave your well-paid job to chase your dream?
prev story

Whitepapers

Implementing global e-invoicing with guaranteed legal certainty
Explaining the role local tax compliance plays in successful supply chain management and e-business and how leading global brands are addressing this.
Endpoint data privacy in the cloud is easier than you think
Innovations in encryption and storage resolve issues of data privacy and key requirements for companies to look for in a solution.
Scale data protection with your virtual environment
To scale at the rate of virtualization growth, data protection solutions need to adopt new capabilities and simplify current features.
Boost IT visibility and business value
How building a great service catalog relieves pressure points and demonstrates the value of IT service management.
High Performance for All
While HPC is not new, it has traditionally been seen as a specialist area – is it now geared up to meet more mainstream requirements?