Audio transcription of TED talks. 1495 TED talks audio recordings along with full text transcriptions of those recordings.
English-only speech data used most recently in the Deep Speech paper from Baidu.
speechClean speech dataset of accented english. Useful for instances in which you expect to need robustness to different accents or intonations.
speechNoisy speech recognition challenge dataset. Dataset contains real simulated and clean voice recordings. Real being actual recordings of 4 speakers in ne...
speechAudioSet consists of an expanding ontology of 632 audio event classes and a collection of 2,084,320 human-labeled 10-second sound clips drawn from YouTu...
google, vehicle, music, speechAudio books data set of text and speech. Nearly 500 hours of clean speech of various audio books read by multiple speakers, organized by chapters of the...
speech