Feeds

PCs learn to lip read

Software boosts accuracy of speech recognition

  • alert
  • submit to reddit

Secure remote control for conventional and virtual desktops

Scientists are teaching computers to lip read as part of research into improving speech recognition software at Carnegie Mellon University in Pittsburgh. The problem is that just like us, computers have trouble following speech in a noisy room. Whether we realise it or not, we compensate for reduced hearing by lip reading, and the idea is that computers can do this too. Alex Waibel, a computer scientist at the US university, has developed software that can do just that. Called NLips, the software improves the accuracy of speech recognition software to about 93 per cent. And it boosts the accuracy when there is a lot of background noise. Correct recognition falls to about 60 per cent if there is background noise. Waibel's software boosts this to 85 per cent. The software breaks sounds down into chunks called phonemes, like most speech recognition programs. Computer mounted cameras record lip movements and adjust for slight head movements. The footage is monitored by a neural network for 50 visual phoneme equivalents. The two streams of information are combined to produce the final text. Waibel told the New Scientist that the visual technology was hopeless on its own. It works so well because it is "looking at all these signals and capturing the perceptual world in its entirety, just as humans do." So far the research has demonstrated correct spelling of words, letter by letter, but the team hopes to move onto continuous speech soon and says that the transition should be uncomplicated. ®

Providing a secure and efficient Helpdesk

More from The Register

next story
Facebook, Apple: LADIES! Why not FREEZE your EGGS? It's on the company!
No biological clockwatching when you work in Silicon Valley
Lords take revenge on REVENGE PORN publishers
Jilted Johns and Jennies with busy fingers face two years inside
Yes, yes, Steve Jobs. Look what I'VE done for you lately – Tim Cook
New iPhone biz baron points to Apple's (his) greatest successes
Happiness economics is bollocks. Oh, UK.gov just adopted it? Er ...
Opportunity doesn't knock; it costs us instead
Ex-US Navy fighter pilot MIT prof: Drones beat humans - I should know
'Missy' Cummings on UAVs, smartcars and dying from boredom
Sysadmin with EBOLA? Gartner's issued advice to debug your biz
Start hoarding cleaning supplies, analyst firm says, and assume your team will scatter
Edward who? GCHQ boss dodges Snowden topic during last speech
UK spies would rather 'walk' than do 'mass surveillance'
YARR! Pirates walk the plank: DMCA magnets sink in Google results
Spaffing copyrighted stuff over the web? No search ranking for you
prev story

Whitepapers

Forging a new future with identity relationship management
Learn about ForgeRock's next generation IRM platform and how it is designed to empower CEOS's and enterprises to engage with consumers.
Why and how to choose the right cloud vendor
The benefits of cloud-based storage in your processes. Eliminate onsite, disk-based backup and archiving in favor of cloud-based data protection.
Three 1TB solid state scorchers up for grabs
Big SSDs can be expensive but think big and think free because you could be the lucky winner of one of three 1TB Samsung SSD 840 EVO drives that we’re giving away worth over £300 apiece.
Reg Reader Research: SaaS based Email and Office Productivity Tools
Read this Reg reader report which provides advice and guidance for SMBs towards the use of SaaS based email and Office productivity tools.
Security for virtualized datacentres
Legacy security solutions are inefficient due to the architectural differences between physical and virtual environments.