
Mathematica man brews 'AI' Google Killer™

A New Kind of Pseudo-Science


Stephen Wolfram - the lovable George Costanza of the mathematics community, who developed the invaluable Mathematica suite and wrote the much-talked-about but quickly forgotten "A New Kind of Science" - is trying his hand at artificial intelligence.

His new project, Wolfram Alpha, set to go live in May, combines natural language processing with machine understanding. You'll be able to get succinct answers to questions like "When was Google's stock at $300 per share?" or "How much did it snow in New England last year?" Allegedly.

It's a noble goal, to aggregate human knowledge in a large machine brain that's able to answer questions. The problem is: We already have Wikipedia and Google that - together - get the job done well enough. So, Wolfram Alpha seems more like a science fair project than a serious stab at machine intelligence. In a company blog post announcing it, Stephen Wolfram himself says that his new project started as a wouldn't-it-be-neat-if brainstorm.

"Fifty years ago, when computers were young, people assumed that they’d quickly be able to handle [systematic knowledge]. And that one would be able to ask a computer any factual question, and have it compute the answer. But it didn’t work out that way. Computers have been able to do many remarkable and unexpected things. But not that.”

Yes, true, computers have not yet been able to process knowledge on the level promised by science fiction novels of the 1950s, but why not? It's not for lack of trying. AI researchers decades ago spent a lot of time and money and programmed a lot of Lisp, only to come up with a therapist named Eliza with whom you can hold a hollow conversation without the $150-per-hour fee. It's not as if this year computers finally became powerful enough for proper machine understanding.

No, computers haven't solved this problem because there are no people who actually need it solved.

Stephen Wolfram has a history of being the answer to a question that nobody asked. His 2002 self-published manifesto, “A New Kind of Science” (abbreviated NKS for those of you who prefer the Church of Scientology method of using acronyms to make yourself sound more serious), ruffled some feathers in the scientific community. Wolfram argued that studying simple cellular automata, similar to Conway's Game of Life, will lead to greater discoveries in science. NKS was criticized for being an answer without a question, and it's possible that Wolfram is using this new project as a justification for his book:

"I’d always thought, though, that eventually [machine understanding] should be possible. And a few years ago, I realized that I was finally in a position to try to do it. I had two crucial ingredients: Mathematica and NKS. With Mathematica, I had a symbolic language to represent anything—as well as the algorithmic power to do any kind of computation. And with NKS, I had a paradigm for understanding how all sorts of complexity could arise from simple rules."

If you focus your attention and listen closely, you can hear his ego approaching critical mass, preparing to implode on itself.

Business school lecture aside, Wolfram does deserve some credit where it is due. The Mathematica software package has been - and will continue to be - a critical resource for many people who work in quantitative science. Wolfram Alpha is built on top of Mathematica, which shows how wide a range of problems this software can solve. So, is machine understanding a new and upcoming feature in the next version of the world's most expensive scientific calculator? Unlikely. From the announcement of Wolfram Alpha:

"Some people have thought the way forward must be to somehow automatically understand the natural language that exists on the web. Perhaps getting the web semantically tagged to make that easier. But armed with Mathematica and NKS I realized there’s another way: explicitly implement methods and models, as algorithms, and explicitly curate all data so that it is immediately computable."

If you've ever dealt with real world machine learning, your snake oil detector should be deafening you right now. Explicitly curate all the data? Surely, Stephen, you have come up with an elegant mathematical solution to do this? After all, anybody who has done real world machine learning will tell you that the vast majority of your time is spent cleaning the data. It's a cruel twist that academics who teach machine learning gloss over the most important part and simply focus on the clean mathematical models. So, if this were a real breakthrough in machine intelligence, the input would be essentially arbitrary, but how does Wolfram Alpha do it?

"Every different kind of method and model—and data—has its own special features and character. With a mixture of Mathematica and NKS automation, and a lot of human experts, I’m happy to say that we’ve gotten a very long way."

That sounds an awful lot like the marriage of some Python scripts with a few hundred bucks spent hiring third world workers through Amazon Mechanical Turk.

Given that, Wolfram Alpha doesn't seem terribly innovative. Correct me if I am wrong, and I know you will, but there was a programming language called Prolog, created in 1972, that could take carefully curated declarative statements and allow you to run logical queries over them. Something like "If the standard rate of chucking is 10 logs per minute, and all woodchucks can chuck, how much wood could a woodchuck chuck if a woodchuck could chuck wood?"
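For anyone who hasn't met the idea, here is a rough sketch of "curated facts plus a query" in a few lines of Python. It is not Prolog and certainly not whatever Wolfram Alpha does internally; the fact names, the `lookup` helper and the `logs_chucked` rule are all made up for illustration.

```python
# Hypothetical, hand-curated facts stored as (subject, relation, value).
FACTS = {
    ("woodchuck", "can", "chuck"),
    ("woodchuck", "chuck_rate_logs_per_minute", 10),
}


def lookup(subject, relation):
    """Return every value stored for the given subject and relation."""
    return [v for (s, r, v) in FACTS if s == subject and r == relation]


def logs_chucked(animal, minutes):
    """Rule: an animal that can chuck moves rate * minutes logs."""
    if "chuck" not in lookup(animal, "can"):
        return None  # no curated fact says this animal can chuck
    rates = lookup(animal, "chuck_rate_logs_per_minute")
    if not rates:
        return None  # we know it can chuck, but nobody curated a rate
    return rates[0] * minutes


if __name__ == "__main__":
    print(logs_chucked("woodchuck", 6))
    print(logs_chucked("beaver", 6))
```

The query only works for things somebody bothered to type in: ask about a beaver and you get nothing back, which is the whole catch with "explicitly curate all data".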

Fortunately, the full-on boot to the face of media hype hasn't started yet. This is a bit of a curiosity, because Wolfram Alpha makes for a good "underfunded, smart guy taking on Google" story. We heard that story with Powerset and Cuil, both of which amounted to nothing more than a comedy act. Who knows, maybe Stephen Wolfram really is cooking up something. Maybe he's just feeding his ego. In May, we'll get to see how useful the system really is. Whichever the case may be, answering the question "When was Google's stock at $300?" is a parlor trick. Answering the question "When will Google's stock be worth $300?" - that might be worth something. ®

Ted Dziuba is a co-founder at Milo.com. You can read his regular Reg column, Fail and You, every other Monday.
