CHIME

Audio Dataset

Homepage

http://spandh.dcs.shef.ac.uk/chime_challenge/data.html

Description

Noisy speech recognition challenge dataset. Dataset contains real simulated and clean voice recordings. Real being actual recordings of 4 speakers in nearly 9000 recordings over 4 noisy locations, simulated is generated by combining multiple environments over speech utterances and clean being non-noisy recordings.

Discussion

Related datasets

VoxForge

Clean speech dataset of accented english. Useful for instances in which you expect to need robustness to different accents or intonations.

speech

Audio

2000 HUB5 English

English-only speech data used most recently in the Deep Speech paper from Baidu.

speech

Audio

TED-LIUM

Audio transcription of TED talks. 1495 TED talks audio recordings along with full text transcriptions of those recordings.

speech

Audio

Google Audioset

AudioSet consists of an expanding ontology of 632 audio event classes and a collection of 2,084,320 human-labeled 10-second sound clips drawn from YouTube…

google, music, speech, vehicle

Others

LibriSpeech

Audio books data set of text and speech. Nearly 500 hours of clean speech of various audio books read by multiple speakers, organized by chapters of the b…

speech

Audio

TIMIT

English-only speech recognition dataset.

speech

Audio

CHIME

Homepage

Description

Tags

Discussion

Related datasets

VoxForge

2000 HUB5 English

TED-LIUM

Google Audioset

LibriSpeech

TIMIT