Common Voice

Common Voice is Mozilla's initiative to help teach machines how real people speak. Its recordings can also serve as a source of audio documents for users to listen to.

Overview

Mozilla released the largest public domain transcribed voice dataset to date on February 28, 2019. The crowdsourced dataset of human voices covers 18 different languages and adds up to almost 1,400 hours of recorded voice data from more than 42,000 contributors. The files are in MP3 format, with a corresponding text data file.
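As a rough illustration of how the MP3 clips and the text data file might be used together, the sketch below pairs each clip with its transcript. It assumes the text file is tab-separated and has a header row; the column names `path` and `sentence`, the `clips` directory, and the helper name `load_transcripts` are assumptions made for this example, so check them against the files in the actual download.

```python
import csv
import io
from pathlib import Path


def load_transcripts(tsv_text, clips_dir="clips", path_col="path", text_col="sentence"):
    """Map each clip's file path to its transcript.

    The column names and the clips directory are assumptions for
    illustration; adjust them to match the downloaded data file.
    """
    reader = csv.DictReader(
        io.StringIO(tsv_text), delimiter="\t", quoting=csv.QUOTE_NONE
    )
    pairs = {}
    for row in reader:
        clip = Path(clips_dir) / row[path_col]
        pairs[clip.as_posix()] = row[text_col]
    return pairs


# Hypothetical two-row sample standing in for the real text data file.
sample = (
    "path\tsentence\n"
    "clip_0001.mp3\tHello world.\n"
    "clip_0002.mp3\tCommon Voice is open.\n"
)

pairs = load_transcripts(sample)
for clip, sentence in pairs.items():
    print(clip, "->", sentence)
```

With a real download, the sample string would be replaced by the contents of the text data file, for example `Path("some_file.tsv").read_text(encoding="utf-8")`, where the file name is again a placeholder.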

Datasets

https://voice.mozilla.org/en/datasets

DeepSpeech - The Common Voice dataset complements Mozilla’s open source voice recognition engine DeepSpeech, which you can use to build speech recognition applications. Read the GitHub overview or join the DeepSpeech Discourse to learn how to get started.
Discourse - Have questions about Common Voice? Join the Common Voice Discourse forum.
LibriSpeech - LibriSpeech is a corpus of approximately 1,000 hours of 16 kHz read English speech derived from audiobooks from the LibriVox project.
TED-LIUM Corpus - The TED-LIUM corpus was made from audio talks and their transcriptions available on the TED website.
VoxForge - VoxForge was set up to collect transcribed speech for use with Free and Open Source Speech Recognition Engines.
Tatoeba - Tatoeba is a large database of sentences, translations, and spoken audio for use in language learning. This download contains spoken English recorded by their community.

For more information

Blog announcing Common Voice
https://voice.mozilla.org/en Common Voice web site available in multiple languages.
https://github.com/mozilla/DeepSpeech/wiki wiki page for DeepSpeech, the recognition engine.
See Dictation for voice recognition topics.
