PCs learn to lip read

Software boosts accuracy of speech recognition

Scientists at Carnegie Mellon University in Pittsburgh are teaching computers to lip read as part of research into improving speech recognition software.

The problem is that, just like us, computers have trouble following speech in a noisy room. Whether we realise it or not, we compensate for reduced hearing by lip reading, and the idea is that computers can do the same.

Alex Waibel, a computer scientist at the US university, has developed software that does just that. Called NLips, it improves the accuracy of speech recognition software to about 93 per cent. It also helps when there is a lot of background noise: correct recognition falls to about 60 per cent in noisy conditions, and Waibel's software boosts this to 85 per cent.

Like most speech recognition programs, the software breaks sounds down into chunks called phonemes. Computer-mounted cameras record lip movements and adjust for slight head movements. A neural network monitors the footage for 50 visual phoneme equivalents. The two streams of information are then combined to produce the final text.

Waibel told New Scientist that the visual technology was hopeless on its own. The combined system works so well because it is "looking at all these signals and capturing the perceptual world in its entirety, just as humans do."

So far the research has demonstrated correct spelling of words, letter by letter, but the team hopes to move on to continuous speech soon and says the transition should be uncomplicated. ®
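
For readers curious how two such streams might be combined, here is a minimal sketch in Python. It is not Waibel's method, which the report does not describe in detail. It assumes each stream produces a probability for every candidate phoneme and blends them with a weighted log-linear rule, a common way of fusing audio and visual evidence. The function names, the weight and the example figures are all hypothetical, and using one shared label set for both streams is a simplification.

```python
import math


def fuse_streams(audio_probs, visual_probs, audio_weight=0.7):
    """Blend per-phoneme probabilities from an audio and a visual stream.

    Hypothetical helper: uses weighted log-linear fusion, which is an
    assumption for illustration and not the method used in NLips.
    """
    if not 0.0 <= audio_weight <= 1.0:
        raise ValueError("audio_weight must be between 0 and 1")
    shared = audio_probs.keys() & visual_probs.keys()
    if not shared:
        raise ValueError("the two streams share no phoneme labels")

    visual_weight = 1.0 - audio_weight
    eps = 1e-12  # guards against log(0)

    # Weighted sum of log-probabilities for each shared phoneme.
    scores = {
        p: audio_weight * math.log(audio_probs[p] + eps)
        + visual_weight * math.log(visual_probs[p] + eps)
        for p in shared
    }

    # Convert scores back into a normalised probability distribution.
    peak = max(scores.values())
    unnormalised = {p: math.exp(s - peak) for p, s in scores.items()}
    total = sum(unnormalised.values())
    return {p: v / total for p, v in unnormalised.items()}


def best_phoneme(fused):
    """Return the phoneme with the highest fused probability."""
    return max(fused, key=fused.get)


# Made-up figures: in a noisy room the audio stream slightly prefers "d",
# but the camera sees the lips close, which favours "b" or "m".
audio = {"b": 0.45, "d": 0.50, "m": 0.05}
visual = {"b": 0.60, "d": 0.05, "m": 0.35}

fused = fuse_streams(audio, visual, audio_weight=0.7)
print(best_phoneme(fused))
```

With these invented numbers the audio stream alone would pick "d", while the fused result settles on "b". In a real system the weight would presumably shift towards the visual stream as background noise rises.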
