Feeds

Intel gives away lip-reading speech recognition code

Lip service

  • alert
  • submit to reddit

Secure remote control for conventional and virtual desktops

Intel has released lip-reading visual speech recognition software under an open source licence.

Called Audio Visual Speech Recognition (AVSR), the software is part of Intel's OpenCV computer vision and facial recognition code library. Essentially, it tracks the speaker's mouth movements as individual character and syllable sounds are formed. Intel reckons the technique to be far more accurate than traditional speech recognition algorithms, which analyse sounds rather than images.

That's not to say the results are perfect, and Intel's announcement implies that the system works better when coupled with facial recognition to identify 'known' speakers. Indeed, Intel's web site shows that the best results can be achieved with a mix of video and audio recognition algorithms, the one giving weight to the choices made by the other, particularly as the levels of background noise increase.

The code was developed by Intel's Research subsidiary, part of whose remit is to develop applications that make the most of mainstream PCs' processing power. In other words, Intel is developing code that helps encourage users to upgrade to more powerful chips, ideally - and given chip makers' relative market shares, almost certainly - those made by Intel.

It's motives may not be entirely philanthropic, but at least Intel is giving the code away with a minimum of restrictions. ®

Related Links

Intel's AVSR page
Intel's OpenCV page

Secure remote control for conventional and virtual desktops

More from The Register

next story
The Return of BSOD: Does ANYONE trust Microsoft patches?
Sysadmins, you're either fighting fires or seen as incompetents now
Munich considers dumping Linux for ... GULP ... Windows!
Give a penguinista a hug, the Outlook's not good for open source's poster child
Intel's Raspberry Pi rival Galileo can now run Windows
Behold the Internet of Things. Wintel Things
Linux Foundation says many Linux admins and engineers are certifiable
Floats exam program to help IT employers lock up talent
Microsoft cries UNINSTALL in the wake of Blue Screens of Death™
Cache crash causes contained choloric calamity
Eat up Martha! Microsoft slings handwriting recog into OneNote on Android
Freehand input on non-Windows kit for the first time
prev story

Whitepapers

Implementing global e-invoicing with guaranteed legal certainty
Explaining the role local tax compliance plays in successful supply chain management and e-business and how leading global brands are addressing this.
Top 10 endpoint backup mistakes
Avoid the ten endpoint backup mistakes to ensure that your critical corporate data is protected and end user productivity is improved.
Top 8 considerations to enable and simplify mobility
In this whitepaper learn how to successfully add mobile capabilities simply and cost effectively.
Rethinking backup and recovery in the modern data center
Combining intelligence, operational analytics, and automation to enable efficient, data-driven IT organizations using the HP ABR approach.
Reg Reader Research: SaaS based Email and Office Productivity Tools
Read this Reg reader report which provides advice and guidance for SMBs towards the use of SaaS based email and Office productivity tools.