IBM intros Elastic Storage ... as used by human-crushing HPC brain Watson
10 billion files, 43 mins ... Where've we heard that before?
IBM has announced Elastic Storage as a component of its software-defined storage portfolio, and it's actually a repackaging and renaming of the venerable GPFS product.
GPFS is IBM’s General Parallel File System for parallel access to massive numbers of files.
The IBM announcement says Elastic Storage “offers unprecedented performance, infinite scale, and is capable of reducing storage costs up to 90 per cent by automatically moving data onto the most economical storage device.”
That means tape, we understand.
Big Blue defines software-defined storage thus:
Software-defined storage is a set of software capabilities that automatically manage data locally and globally, providing breakthrough speed in data access, easier administration and the ability to scale technology infrastructures quickly and more cost-effectively as data volumes expand. In addition, these advances can work with any company’s storage systems to provide automated and virtualised storage.
The announcement brings in some glamour from Watson, IBM’s Jeopardy computer system, and we’re told that: “By using Elastic Storage capabilities, around five terabytes of Watson’s 'knowledge' (or 200 million pages of data) were loaded in only minutes into the computer’s memory,” and “IBM Research has demonstrated that Elastic Storage can successfully scan 10 billion files on a single cluster in just 43 minutes – a technology demonstration that translates into unequalled performance for clients analysing massive data repositories to extract business insights.”
At this point in the reading of IBM's announcement a bell went off in some brain cells: 10 billion files in 43 minutes? Yes, we remember that. It was July 2011 and IBM used a Violin flash array to have GPFS scan 10 billion files in 43 minutes.
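For a sense of scale, the figures IBM quotes can be turned into rough rates. The Python below is a back-of-the-envelope sketch only: the function names are ours, and we assume decimal terabytes (10^12 bytes), which the announcement does not specify.

```python
# Back-of-the-envelope arithmetic on the figures quoted in IBM's announcement.
# Assumption: "terabyte" means 10**12 bytes; IBM does not say which unit it used.

FILES_SCANNED = 10_000_000_000      # 10 billion files
SCAN_MINUTES = 43                   # scan time claimed by IBM
WATSON_BYTES = 5 * 10**12           # "around five terabytes"
WATSON_PAGES = 200_000_000          # "200 million pages"


def files_per_second(files: int, minutes: float) -> float:
    """Average scan rate implied by a file count and a duration in minutes."""
    return files / (minutes * 60)


def bytes_per_page(total_bytes: int, pages: int) -> float:
    """Average page size implied by a total size and a page count."""
    return total_bytes / pages


scan_rate = files_per_second(FILES_SCANNED, SCAN_MINUTES)
page_size = bytes_per_page(WATSON_BYTES, WATSON_PAGES)

print(f"Implied scan rate: about {scan_rate / 1e6:.1f} million files per second")
print(f"Implied Watson page size: about {page_size / 1000:.0f} KB")
```

That works out to a little under four million files per second for the scan, and about 25 KB per page of Watson's 'knowledge'.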
Elastic Storage also exploits server-side Flash for up to six times increase in performance than with standard SAS disks … This feature recognises when a server has Flash storage and automatically uses that Flash as cache memory to improve performance.
Six times faster than standard SAS disk drives? I should hope so.
Does Elastic Storage now involve using IBM’s own FlashSystem flash array, the acquired TMS RamSan technology? Its Elastic Storage announcement mentions only server-side flash. An IBM spokesperson said: "The software can support any vendor's storage system, including IBM FlashSystem."
We’re told “Elastic Storage virtualises the storage allowing multiple systems and applications to share common pools of storage. This enables transparent global access to data without the need to modify applications and without the need for additional and often disruptive storage management applications.”
IBM says the National Center for Atmospheric Research’s Computational and Information Services Laboratory (CISL) stores and manages more than 50 petabytes of information between its Wyoming and Colorado centres and relies on Elastic Storage to give researchers fast access to vast amounts of diverse data. Last year CISL was talking about GPFS.
IBM’s announcement quotes Pamela Gillman, manager, Data Analysis Services Group, at CISL: “The IBM global file system software has enabled scalable, reliable and fast access to this information.”
Yep, GPFS it is.
We’re told: “A key component of Elastic Storage is its ability to automatically and intelligently move data to the most strategic and economic storage system available. … Elastic Storage can automatically move infrequently-used data to less expensive low cost tape drives, while storing more frequently-accessed data on high-speed Flash systems for quicker access.” That appears to be from Tivoli Storage Manager integration, with a dash of IBM Linear Tape File System (LTFS) added to the mix.
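The general idea of policy-driven tiering can be sketched in a few lines of Python. This is purely illustrative and is not IBM's policy engine or GPFS policy syntax: the tier names, the idle-time thresholds and the `FileInfo` and `choose_tier` names are all hypothetical, picked by us to show how frequently accessed data might land on flash and stale data on tape.

```python
from dataclasses import dataclass

# Hypothetical thresholds, for illustration only; IBM publishes no such numbers here.
FLASH_MAX_IDLE_DAYS = 7     # touched within a week: keep on flash
DISK_MAX_IDLE_DAYS = 90     # touched within three months: keep on disk


@dataclass
class FileInfo:
    path: str
    days_since_access: int


def choose_tier(info: FileInfo) -> str:
    """Pick a storage tier from how long ago the file was last accessed."""
    if info.days_since_access <= FLASH_MAX_IDLE_DAYS:
        return "flash"
    if info.days_since_access <= DISK_MAX_IDLE_DAYS:
        return "disk"
    return "tape"


# Made-up example files.
files = [
    FileInfo("/data/today.log", 0),
    FileInfo("/data/last-quarter.csv", 60),
    FileInfo("/data/archive-2009.tar", 1500),
]

placement = {f.path: choose_tier(f) for f in files}
for path, tier in placement.items():
    print(f"{path} -> {tier}")
```

A real system would weigh more than last-access time (pool capacity, file size, recall cost from tape), but the shape of the decision is the same: a rule evaluated per file, with the cheapest tier as the default.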
Elastic Storage supports OpenStack Cinder and Swift, and POSIX and Hadoop APIs.
To find out more about IBM's software-defined storage, go to this part of IBM’s web presence where the GPFS connection is made clear and Elastic Storage is described as a new code-name.
Elastic Storage v4.1 features:
- Enhanced security - native encryption and secure erase, NIST SP 800-131A encryption compliance
- Increased performance - Server-side IBM Elastic Storage Flash caches increase IO performance up to 6X
- Improved usability - data migration; AFM, FPO, and backup/restore enhancements; reliability, availability and serviceability enhancements
This webpage notes: “When integrated with IBM Tivoli Storage Manager (TSM) or IBM Linear Tape File System (LTFS), IBM Elastic Storage can uniquely manage the full data life cycle, delivering geometrically lower cost savings through policy driven automation and tiered storage management.”
IBM says: “Elastic Storage software will also be available as an IBM SoftLayer cloud service later this year,” implying you can buy the software directly from IBM or qualified partners.
There’s no pricing information provided. ®