Mozilla hoping to open source voice samples for future AI devs

Prying open speech recognition

By Richard Chirgwin

Posted in Artificial Intelligence, 20th July 2017 02:56 GMT

Mozilla has decided speech recognition should be open source, and has launched a project to achieve just that, Project Common Voice.

What the browser builder wants, it says, is an open source data set for voice recognition apps.

The open source community, Mozilla's Daniel Kessler writes, is the “next wave of innovators” – but with speech datasets locked up behind proprietary walls, they're left out.

That also skews speech recognition to the most lucrative markets (English, Chinese and “a select group of languages”), whereas Mozilla hopes enough participants will let speakers of less-common languages talk to their browsers.

And that's where the open data-gathering comes in: if you're interested, the Project Common Voice site lets users record their own voice (reading sentences to the system, starting for now with English), or review how accurately the software recognises other speakers.

(Vulture South's observation is that the page works better in Firefox than in Chrome – surprise! – and that naturally enough, you have to give the page permission to use your microphone.)

Ultimately the company wants to gather 10,000 hours of recordings for release in Q4 of this year. Presumably, once developers and researchers have their hands on the initial sample, the project will move on to other languages. ®

Sign up to our NewsletterGet IT in your inbox daily


More from The Register

Mozilla edict: 'Web-accessible' features need 'secure contexts'

If an API or feature needs the 'net, it needs HTTPS under Mozilla's new plan

Mozilla releases voice dataset and transcription engine

Baidu's Deep Speech with TensorFlow under the covers

Mozilla devs discuss ditching Dutch CA, because cryptowars

We don' want no STEENKIN' proxies, as will be possible under new local laws

Mozilla and Yahoo! trade sueballs over Firefox-Google search deal

'Your search is trash and you stopped paying ' vs. 'we had a deal you can't walk away from'

Mozilla whips out Rusty new Firefox Quantum (and that's a good thing)

Landmark build promises to be faster, slimmer, better at multi-threading

Mozilla extends, and ends, Firefox support for Windows XP and Vista

Even Extended Support Releases will be naked and alone as of June 2018

Meet VRfox: Mozilla's latest attempt at regaining browser share

v55 first desktop browser to support WebVR standard

Mozilla ponders making telemetry opt-out, 'cos hardly anyone opted in

Browser-maker wants to compile global top 100 sites list, promises to protect privacy

Mozilla makes first-ever acquisition: Web-clipping app 'Pocket'

App scrapes content into devices for later viewing, even offline, advances Moz mission to make web accessible

Mozilla takes a turn slapping Symantec's certification SNAFU

Take Google's advice and get out of CA infrastructure'