Mozilla hoping to open source voice samples for future AI devs

Prying open speech recognition

By Richard Chirgwin


Mozilla has decided speech recognition should be open source, and has launched Project Common Voice to achieve just that.

What the browser builder wants, it says, is an open source data set for voice recognition apps.

The open source community, Mozilla's Daniel Kessler writes, is the “next wave of innovators” – but with speech datasets locked up behind proprietary walls, its members are left out.

That also skews speech recognition to the most lucrative markets (English, Chinese and “a select group of languages”), whereas Mozilla hopes enough participants will let speakers of less-common languages talk to their browsers.

And that's where the open data-gathering comes in: if you're interested, the Project Common Voice site lets users record their own voice (reading sentences to the system, starting for now with English), or review how accurately the software recognises other speakers.

(Vulture South's observation is that the page works better in Firefox than in Chrome – surprise! – and that naturally enough, you have to give the page permission to use your microphone.)

Ultimately the company wants to gather 10,000 hours of recordings for release in Q4 of this year. Presumably, once developers and researchers have their hands on the initial sample, the project will move on to other languages. ®
