Feeds

Taking a first bite out of Wolfram Alpha

The knowledge engine that knows the answer is 42

Security for virtualized datacentres

Is it the newest rival to Google, likely to knock Google off the top spot? No it isn’t. Does it provides a single answer to complex questions – unlike traditional search engines? Nope. Could it possibly be "a natural search engine"? Not quite.

The newest game in town, if Twitter is anything to go by, is the Wolfram Alpha computational knowledge engine. There is a good chance that in the long term it will revolutionise how we interact with the internet. But not yet. Right now, it is a very good start, and belongs firmly in the area loosely described as semantic web development.

The greatest threat to its success is likely to be the enormous hype that has broken out across mainstream press and media this last weekend. If Wolfram Alpha engineered this bubble, then there is at least a minor question mark to be raised over the strategic nous of their PR department. For if any one thing is certain, the non-specialist press just love to tear apart new technologies the moment they fail to live up to claims they never made in the first place.

The Wolfram Alpha engine is the brainchild of Stephen Wolfram, renowned physicist and CEO of Wolfram Research. In addition to his work on complex systems – a field in which he founded a research centre and journal in 1986 - he was also responsible for the development of a technical computing system known as Mathematica.

Until recently, this was the mainstay of the company, counting millions of users worldwide, and applied to a range of technical problems, such as trading systems, pricing and optimisation models and Monte Carlo simulations.

This system now underpins the Wolfram Alpha engine, ensuring that at a technical level, it is reasonably stable. The engine contains a number of key components. At the front end is a natural language interpreter, allowing users to feed the system questions in an approximation of English.

Behind this reside a number of key data sources which have been captured and standardised by Wolfram staff over many years. The number of data sets included are, at present, limited to a few hundred, although, as a spokesman for Wolfram pointed out, as one data set includes "all current and historical weather", whilst another includes "the English language", this measure may be deceptive.

Feed the Wolfram Alpha engine a question, either in natural English or using a simpler set of search terms, and it will attempt to come up with information that provides an answer – or answers – appropriate to the query. A major part of Wolfram Alpha’s function relates to its ability both to disambiguate unclear terms and to make a stab at providing an answer relevant to the likely cultural stance of the individual asking the question.

Feed it "gold" and "lead" and it is likely to come up with answers that relate to the physical properties of metals: give it "gold" and "red", and material relevant to colours will appear. Feed it "Cambridge" from a UK-based IP address, and data will be returned relevant to the Fenland City of that name: do so from a US-based address, and the Massachussetts Town of that name will appear.

"Rabbit", in the UK, should return data on the animal both as biological entity and source of food: in the US, according to Wolfram Alpha, the second level of meaning will be overlooked, as Americans simply do not eat rabbits.

So much for the product. Wolfram Alpha is very clearly a knowledge engine that is heavily biased towards drawing down data around topics it has already included in its underlying database. Unlike Google, it is not setting out to rule the web. Its commercial model envisages four strands to future revenues: partner licensing, major brands sponsorship, plus professional and corporate versions. Cost structures are not yet determined, but one suggestion is that companies could end up paying $10 per employee for a license.

It is also envisaged that companies might decide to incorporate into their private version of Wolfram Alpha data sets that require separate licensing (such as selected geodemographic directories) or their own private data sets. An insurance company might wish to load dates and locations of claim by type of event, a credit card company might want to incorporate purchase data, and so on.

In a seriously original debunking of the product, the Guardian complains that "Wolfram Alpha has trouble answering that enduring philosophical question: 'Why am I here?'", responding merely "Wolfram Alpha isn't sure what to do with your input."

It does not know the airspeed velocity of an swallow – laden or otherwise. However, Wolfram Research have bowed to the inevitable by allowing their engine to answer questions about the meaning of life (the universe and everything) with the answer, "42". Although the engine itself is data-focused, it will also return links to the most relevant web sites in respect of questions submitted.

Stephen Wolfram says that he is "not keen on the hype". Wolfram Alpha sits smack in the middle of fast-expanding research into the field of semantic search, whose goal is to turn interaction with the internet into something much closer to the intelligent man-machine dialogue exemplified by HAL 9000 in Stanley Kubrick’s 2001. That study, at present, is focussed on issues of disambiguation and the creation of "ontologies" – the re-structuring of data sets to align them more closely with the information requirements likely to be placed upon them.

In those aims, Wolfram Alpha looks like being a resounding success. ®

Secure remote control for conventional and virtual desktops

More from The Register

next story
Not appy with your Chromebook? Well now it can run Android apps
Google offers beta of tricky OS-inside-OS tech
Keep that consumer browser tat away from our software says Oracle
Big Red decides it will only support Firefox's Extended Support Releases
Greater dev access to iOS 8 will put us AT RISK from HACKERS
Knocking holes in Apple's walled garden could backfire, says securo-chap
NHS grows a NoSQL backbone and rips out its Oracle Spine
Open source? In the government? Ha ha! What, wait ...?
Google extends app refund window to two hours
You now have 120 minutes to finish that game instead of 15
Intel: Hey, enterprises, drop everything and DO HADOOP
Big Data analytics projected to run on more servers than any other app
prev story

Whitepapers

Secure remote control for conventional and virtual desktops
Balancing user privacy and privileged access, in accordance with compliance frameworks and legislation. Evaluating any potential remote control choice.
Intelligent flash storage arrays
Tegile Intelligent Storage Arrays with IntelliFlash helps IT boost storage utilization and effciency while delivering unmatched storage savings and performance.
Reg Reader Research: SaaS based Email and Office Productivity Tools
Read this Reg reader report which provides advice and guidance for SMBs towards the use of SaaS based email and Office productivity tools.
Security for virtualized datacentres
Legacy security solutions are inefficient due to the architectural differences between physical and virtual environments.
Providing a secure and efficient Helpdesk
A single remote control platform for user support is be key to providing an efficient helpdesk. Retain full control over the way in which screen and keystroke data is transmitted.