Feeds

PCs learn to lip read

Software boosts accuracy of speech recognition

  • alert
  • submit to reddit

Internet Security Threat Report 2014

Scientists are teaching computers to lip read as part of research into improving speech recognition software at Carnegie Mellon University in Pittsburgh. The problem is that just like us, computers have trouble following speech in a noisy room. Whether we realise it or not, we compensate for reduced hearing by lip reading, and the idea is that computers can do this too. Alex Waibel, a computer scientist at the US university, has developed software that can do just that. Called NLips, the software improves the accuracy of speech recognition software to about 93 per cent. And it boosts the accuracy when there is a lot of background noise. Correct recognition falls to about 60 per cent if there is background noise. Waibel's software boosts this to 85 per cent. The software breaks sounds down into chunks called phonemes, like most speech recognition programs. Computer mounted cameras record lip movements and adjust for slight head movements. The footage is monitored by a neural network for 50 visual phoneme equivalents. The two streams of information are combined to produce the final text. Waibel told the New Scientist that the visual technology was hopeless on its own. It works so well because it is "looking at all these signals and capturing the perceptual world in its entirety, just as humans do." So far the research has demonstrated correct spelling of words, letter by letter, but the team hopes to move onto continuous speech soon and says that the transition should be uncomplicated. ®

Security for virtualized datacentres

More from The Register

next story
Facebook pays INFINITELY MORE UK corp tax than in 2012
Thanks for the £3k, Zuck. Doh! you're IN CREDIT. Guess not
Big Content outs piracy hotbeds: São Paulo, Beijing ... TORONTO?
MPAA calls Canadians a bunch of bootlegging movie thieves
Google Glassholes are UNDATEABLE – HP exec
You need an emotional connection, says touchy-feely MD... We can do that
Lawyers mobilise angry mob against Apple over alleged 2011 Macbook Pro crapness
We suffered 'random bouts of graphical distortion' - fanbois
Just don't blame Bono! Apple iTunes music sales PLUMMET
Cupertino revenue hit by cheapo downloads, says report
US court SHUTS DOWN 'scammers posing as Microsoft, Facebook support staff'
Netizens allegedly duped into paying for bogus tech advice
Feds seek potential 'second Snowden' gov doc leaker – report
Hang on, Ed wasn't here when we compiled THIS document
prev story

Whitepapers

Why cloud backup?
Combining the latest advancements in disk-based backup with secure, integrated, cloud technologies offer organizations fast and assured recovery of their critical enterprise data.
Forging a new future with identity relationship management
Learn about ForgeRock's next generation IRM platform and how it is designed to empower CEOS's and enterprises to engage with consumers.
High Performance for All
While HPC is not new, it has traditionally been seen as a specialist area – is it now geared up to meet more mainstream requirements?
New hybrid storage solutions
Tackling data challenges through emerging hybrid storage solutions that enable optimum database performance whilst managing costs and increasingly large data stores.
Reducing the cost and complexity of web vulnerability management
How using vulnerability assessments to identify exploitable weaknesses and take corrective action can reduce the risk of hackers finding your site and attacking it.