The Register® — Biting the hand that feeds IT

Feeds

Google widens search net and takes on Siri with iOS app

iOS speech search and adding Gmail results

Customer Success Testimonial: Recovery is Everything

Google is moving closer to a planned search singularity with the extension of the Knowledge Graph system, a trial to allow personal Gmail search results to be included in generic web searches, and an iOS app that takes voice requests and tries to answer.

Knowledge Graph, launched in May for English-speaking users of Google, uses the Chocolate Factory's massive database of searches to find semantic links between search terms. It has linked 500 million people and places with 3.5 billion attributes or connections, which are used to derive possible search results.

Where the user is looking for a list or group of subjects, Google has also added a ribbon of pictures across the top of the browser window that could be of interest. These can be clicked on to generate a new set of search results.

Google has also opened up a trial where signed-in Google users can search their Gmail accounts in the general search page. In tests it is pretty accurate and not too scary, and doesn't display matches automatically, or expose every single email on a topic.

Gmail search

A bit personal, but only if you ask (click to enlarge)

Finally, Google is taking on Siri with a Google Search iOS application, which uses its speech recognition engine to process verbal searches and, if possible, speaks the results back to you. We haven't seen it in action but this sounds like an attempt at Siri, Apple's search agent that is both loved and hated.

"It’s very much like the computer I dreamt about as a child growing up in India, glued to our black-and-white TV for every episode of Star Trek," said Google senior vice president, Amit Singhal, as Google launched the upgrades in an event at San Francisco.

"I imagined a future where a starship computer would be able to answer any question I might ask, instantly. Today, we’re closer to that dream than I ever thought possible during my working life." ®

Magic Quadrant for Enterprise Backup/Recovery

Re: How things change

Modern speech synthesis uses a matrix of phonemes at its core. The "Siri" voice in the UK is the same as the "Daniel" voice available for OS X Lion / Mountain Lion users and is based on the voice of Jon Briggs.

Jon went into a studio some years ago and recorded 5000 phrases in a monotone voice. From those phrases, a complete set of phonemes was extracted and these are what the synthesiser plays, adding pitch and volume changes to simulate stress and intonation.

This is why, although Siri sounds a lot better than the older fully synthesised systems of old (think Prof. Stephen Hawking), we can still tell that it's a synthetic voice. When humans modulate their voices, they do so by changing a lot more than mere pitch and volume: they constrict their throat, move their tongue, lips and change mouth-shape, change how they breathe, and so on. Our ears are trained to notice these changes, so we're still in the voice synthesis equivalent of the Uncanny Valley. But we're getting pretty damned close.

Crossing the valley completely is technically feasible, but will require paying the voice artist to record those 5000 phrases multiple times, in multiple intonations and projection levels. That alone will add days to the recording process, and it'll be very, very boring. Once processed, the resulting voice phoneme set will also be much bigger—the UK "Siri" voice is a 500MB-ish download already—and that's the biggest problem. It'd take much, much longer to process the raw sample data and produce the final phoneme matrices, and it'll take up a whopping great chunk of storage space too. Realistically, computers will need to come with either much faster storage, or a lot more RAM, as standard.

4
0
Anonymous Coward

How things change

I don’t believe that your phone should be an assistant. Your phone is a tool for communicating. You shouldn’t be communicating with the phone; you should be communicating with somebody on the other side of the phone.

– Andy Rubin, SVP of Mobile at Google and founder of Android,October 19, 2011

Rubin->reality translation:

"Shit, we don't have that feature yet"

4
0

Re: Be careful what you wish for

This is bloody true actually. Imagine a computer that simply answered your kids' questions about Santa with a flat "No", or labelled your favourite religion a "simplistic control scam". Or reminded you that political corruption and conspiracy is very, very far from "haha impossible" and that your generation will be regarded as morons for believing this for the next 1000 years.

Yeah, truth has limited use to humans. We tend to favour comfort.

2
0

More from The Register

Bjarne Again: Hallelujah for C++
Plus: Now officially OK to admit you never used STL algorithms
Interwebs taunt Sir Jony over Apple eye candy makeover
Hey Ive, Ive... add more unicorns, willya?
Nuke plants to rely on PDP-11 code UNTIL 2050!
Programmers and their walking sticks converge in Canada
SCO vs. IBM battle resumes over ownership of Unix
Zombie lawsuit back and wants to suck the brains out of Linux
Red Hat to ditch MySQL for MariaDB in RHEL 7
So long, Oracle! Don't let the door hit you on the way out
Shy? Socially inadequate? Fiddling with your phone could help
App 'tells the brutal truth' about social inadequates' chatup lines
Java EE 7 melds HTML5 with enterprise apps
New release arrives with GlassFish, NetBeans support
 breaking news
'Office Facebook' firm Tibbr wants you to PAY for mobe-meetings app
Great idea. Punters won't cough for it though
 breaking news
The only Waze is Google: Ad giant tipped to gobble map app 'for $1.3bn'
Pac-Man-satnav-ish upstart in bidding war with Apple, Facebook
 breaking news
PM Cameron calls for modern, programmable computers! (We think)
IT education musings to G8 chiefs to mystify IT industry