Original URL: http://www.theregister.co.uk/2009/05/15/ibm_system_s_watson/

IBM supers shun nukes for biz analytics

BAO. It's the next ERP

By Timothy Prickett Morgan

Posted in The Channel, 15th May 2009 21:09 GMT

As part of its briefings to Wall Street and investors this week, IBM trotted out its recently announced System S streaming system and the "Watson" QA super. Both are variants of its BlueGene/P Power-Linux supercomputer lineup, but both are aimed at solving business problems instead of simulating weather or nuclear explosions.

Both machines are examples the underlying hardware that Big Blue expects to be the foundation of a huge wave of spending on business analytics and optimization, which the company's top brass pitched as the next big opportunity for hardware and software sales and which they said would grow to rival sales driven by the enterprise resource planning (ERP) software boom that started in the mid-1990s.

This BAO opportunity is quite large. (Heaven help us. We don't need another abbreviation but we also can't say business analytics and optimization every bloody time either). And just as ERP did, it cuts across all of IBM's divisions and groups. Speaking earlier this week at IBM's investor day, Frank Kern, general manager of IBM's Global Business Services group, said that the BAO market was just an evolution that has been taking place for decades as companies automated bookkeeping, moved on to MRP-II software to manage their manufacturing operations, and then extended out into automating the whole enterprise, from shop floor to front door, including supply chains and customer relationships, with the ERP phase.

(Kern's Global Business Services group doesn't do systems integration, outsourcing, maintenance, or other kinds of tech services. It raked in $19.6bn in 2008 selling more slippery kinds of services such as business process re-engineering and optimization).

Alongside of ERP, companies have deployed data warehouses and data marts to mine their ERP data to trends to help them better sell products and run their businesses in a more lean fashion. Business analytics and optimization wants to take this one step closer and actually create systems that can predict outcomes for business decisions before those decisions are made.

Such capability, which IBM's nerds at its Research division as well as in its Software Group and its Systems and Technology Group, is most likely one of the reasons why Big Blue can be so confident that it can hit its profit numbers in 2009 and 2010, which it was bragging about at the investor day conference. IBM must be running scenarios constantly to help it shift people around the globe to chase opportunities.

Kern said that the market opportunity for BAO products and services was about $105bn in 2008, which is 18.6 percent of the much wider "business automation" opportunity, which includes all manner of software, hardware, and services sold to corporations and governments to run their operations, which Big Blue pegs at around $566bn in 2008. Now, here are the bits that have IBM salivating about this BAO segment. Between 2007 and 2012, the BAO segment of the IT industry will have a compound annual growth rate of 8 per cent, which is twice the growth rate over that term for the industry as a whole.

And within Big Blue itself, which is obviously the heavy hitter in IT services, BAO is expected to grow at around 10 per cent this year and accelerate to between 15 and 20 per cent from 2010 onwards, accounting for more than $2bn in sales in 2010. Within five years, IBM expects BAO to drive as much sales for it as core ERP systems do.

IBM has caught the bug for predictive business decision-making as much as it ever caught the buzz for relational databases back in the late 1980s, data warehousing back in the 1990s, Linux in the early 2000s, and cloud computing today. You'll have to judge for yourself how much of this BAO stuff is real and how much is vaporware or putting a new paint job on data warehousing.

BAO times two

The two systems that IBM trotted out this week to make its point about BAO were not data warehouses or search engines in the traditional sense of those products, but they are indicative of the cross-group development effort that IBM is talking about when it discusses BAO.

The first is the Systems S streaming server, which we first told you about here when it went into prototype at Toronto Dominion Bank in early April as the back-end for an options trading system. System S takes a BlueGene/P massively parallel PowerPC-based supercomputer and slaps on some data streaming and processing middleware called InfoSphere Streams continuously searches through data feeds of all kinds (news feeds, stock tickers, video off TV and the Internet, it doesn't matter what) and continuously updates a predefined set of queries setup in the InfoSphere database to give you up-to-the-second information to make decisions.

TD Securities, the options trading arm of Toronto Dominion Bank, took the first prototype of the System S machine and used it as a front-end to its options trading system and showed that it could process 21 times more information than a prior system used by the bank and that the features in the System S software allowed the options trading system to crunch 5 million options valuations per second, which IBM said was 20 times faster than the world record up to that point.

At the investor day briefing, IBM said that it would be opening a European stream computing center in Dublin to help customers figure out how to use the InfoSphere Streams software running on BlueGene/P iron and said further that it was making the InfoSphere Streams code - which manages data feeds and chews on them - available for free to customers on a trial basis (and for an unspecified length of time) so they can play with it.

Presumably, you have to buy a BlueGene/P super to play. The BlueGene/P crams 1,024 of IBM's 850 MHz single-core PowerPC 450 chips into a rack, with a board comprised of four chips and 2 GB of DDR2 main memory shared by an SMP bus. A BlueGene/P rack has about 13.9 teraflops of number-crunching performance and costs about $1.3m. But IBM says that InfoSphere Streams can run on other boxes. (The prototype at TD Securities runs on a BlueGene/P with Fedora 8 Linux from Red Hat).

Governments - and particularly their law enforcement and public safety groups - are very keen on the capabilities that IBM says the System S setup has, as well as retailers, healthcare providers, financial institutions, and transportation companies.

The other machine that is going to weigh in heavily on this BAO expansion is the "Watson" Question and Answer engine, another offshoot of the BlueGene/P that was announced several weeks ago and that IBM is programming to take on the best human players of the game show Jeopardy next year.

John Kelly, who heads up IBM Research, described the Watson machine to the Wall Streeters and investors. He knew that there has been plenty of skepticism about the project, despite the fact that IBM did create Deep Blue and it did beat champion Gary Kasparov in chess matchups when IBM first tried to demonstrate its prowess with massively parallel computing back in the late 1990s. Making a machine called Watson to play Jeopardy against real people is a PR stunt, but IBM wouldn't do it unless it thought it could win and - more importantly - unless it thought the PR stunt was going to drive business in some way in the future.

The Watson has to hear a Jeopardy statement, sift through databases for what it might refer to, and rank possible questions that match that statement, pick an question or decide not to try to say a question, and do it all in under three seconds.

Kelly blew off the skeptics about Watson. "It is not a pipe dream, it is up and running," he said. "It beats me, and it will probably beat most of you in this room."

He went on to say that IBM had every intention of turning around the QA system at the heart of Watson and making it into a business decision engine, helping call centers cope with customers, doctors diagnose diseases, and businesses and governments of all kinds and sizes make decisions. And that is why IBM Research has over 150 mathematicians hammering away on BAO algorithms that can be spliced onto machines like System S and Watson for real-world customers.

It all sounds pretty far fetched, right? But whole layers of middle management are going to be made redundant if what IBM says is possible turns out to be doable. Perhaps IBM should have called it business optimization and analytics to get the correct boa constrictor image for the people currently in charge of making calls and sifting through data in the middles of organizations the world over. ®