BUT Speech@FIT Reverb Database

This is the first release of the BUT Speech@FIT Reverb Database. The database is being built to collect a large number of diverse room impulse responses (RIRs), room environmental noises (or "silences"), retransmitted speech (for testing automatic speech recognition, ASR, and speaker identification, SID), and metadata (positions of microphones, speakers, etc.).

The goal is to provide the speech community with a dataset for data enhancement and for distant-microphone or microphone-array experiments in ASR and SID.
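A typical way to use measured RIRs and recorded room noise for such experiments is to convolve clean speech with an RIR and add the noise. The following is only a minimal sketch under stated assumptions: the function name reverberate, the snr_db parameter, and the synthetic stand-in signals are illustrative and are not part of the database or its tools. In practice the arrays would be loaded from the database audio files.

```python
import numpy as np

def reverberate(speech, rir, noise=None, snr_db=20.0):
    """Convolve clean speech with an RIR and optionally add noise at a target SNR.

    Hypothetical helper for illustration; not part of the database tools.
    """
    speech = np.asarray(speech, dtype=np.float64)
    rir = np.asarray(rir, dtype=np.float64)
    # Full convolution, truncated to the input length so that the output
    # stays aligned with any labels attached to the clean signal.
    reverberant = np.convolve(speech, rir)[: len(speech)]
    if noise is None:
        return reverberant
    noise = np.asarray(noise, dtype=np.float64)
    # Repeat the noise until it covers the whole signal, then crop it.
    reps = int(np.ceil(len(reverberant) / len(noise)))
    noise = np.tile(noise, reps)[: len(reverberant)]
    sig_pow = np.mean(reverberant ** 2)
    noise_pow = np.mean(noise ** 2)
    if sig_pow == 0.0 or noise_pow == 0.0:
        return reverberant
    # Scale the noise so that the speech-to-noise power ratio equals snr_db.
    gain = np.sqrt(sig_pow / (noise_pow * 10.0 ** (snr_db / 10.0)))
    return reverberant + gain * noise

# Synthetic stand-ins (assumptions) for a speech file, an RIR and a noise file.
rng = np.random.default_rng(0)
speech = rng.standard_normal(4000)
rir = rng.standard_normal(800) * np.exp(-np.arange(800) / 100.0)
noise = rng.standard_normal(1500)

clean_reverb = reverberate(speech, rir)
noisy_reverb = reverberate(speech, rir, noise, snr_db=10.0)
```

Scaling the noise to a chosen SNR is one design choice; adding the recorded noise at its original level would instead preserve the conditions measured in the room.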

The database is released under the Apache 2.0 license and can be downloaded here: http://www.fit.vutbr.cz/~szoke/speech/ReverbDB/BUT_ReverbDB_rel_18_11.tgz [126 GB]

The BUT Speech@FIT Reverb Dataset consists of 8 rooms:

Room  Size [m]         Volume [m^3]  # RIRs  Ret.  Type
Q301  10.7x6.9x2.6     192           31 x 3  1     Office
L207  4.6x6.9x3.1      98            31 x 6  3     Office
L212  7.5x4.6x3.1      107           31 x 5  2     Office
L227  6.2x2.6x14.2     229           31 x 5  3     Stairs
R112  4.4x2.8x2.6*     ~40           31 x 5  0     Hotel room
CR2   28.2x11.1x3.3    1033          31 x 4  0     Conf. room
E112  11.5x20.1x4.8*   ~900          31 x 2  0     Lect. room
D105  17.2x22.8x6.9*   ~2000         31 x 6  1     Lect. room
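The table can be cross-checked with a few lines of arithmetic. The sketch below assumes that the sizes are in metres and that the "# RIRs" column reads as microphones x speaker positions; both are interpretations of the table, and the dictionary layout is illustrative only. The asterisk is not explained on this page, so the sketch merely restricts the volume comparison to the unstarred rooms.

```python
# Room name -> (dimensions, listed volume or None for starred rooms,
#               number of speaker positions taken from the "# RIRs" column)
rooms = {
    "Q301": ((10.7, 6.9, 2.6), 192, 3),
    "L207": ((4.6, 6.9, 3.1), 98, 6),
    "L212": ((7.5, 4.6, 3.1), 107, 5),
    "L227": ((6.2, 2.6, 14.2), 229, 5),
    "R112": ((4.4, 2.8, 2.6), None, 5),
    "CR2": ((28.2, 11.1, 3.3), 1033, 4),
    "E112": ((11.5, 20.1, 4.8), None, 2),
    "D105": ((17.2, 22.8, 6.9), None, 6),
}
MICS_PER_ROOM = 31

def box_volume(dims):
    """Volume of a rectangular box with the given three side lengths."""
    return dims[0] * dims[1] * dims[2]

for name, (dims, listed, n_spk) in rooms.items():
    computed = round(box_volume(dims))
    note = "" if listed is None else f" (listed: {listed})"
    print(f"{name}: box volume {computed}{note}, RIRs: {MICS_PER_ROOM * n_spk}")

speaker_positions = sum(n_spk for _, _, n_spk in rooms.values())
total_rirs = MICS_PER_ROOM * speaker_positions      # 31 * 36 = 1116
mean_positions = speaker_positions / len(rooms)     # 36 / 8 = 4.5
```

For the unstarred rooms the product of the three dimensions rounds to the listed volume.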

We placed 31 microphones in each room. The source (a hi-fi loudspeaker) was placed at 5 positions on average. For each speaker position, we measured RIRs using the exponential sine sweep method and then recorded the environmental noise (silence). For one speaker position in the office, a radio was playing in the background.
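For readers unfamiliar with the exponential sine sweep method, the sketch below shows the commonly used formulation: play a sweep whose frequency rises exponentially, record it in the room, and convolve the recording with an amplitude-compensated, time-reversed copy of the sweep. The sampling rate, sweep duration, frequency range, and the toy two-tap "room" are assumptions chosen for illustration; they are not the settings used to record this database.

```python
import numpy as np

def exp_sine_sweep(f1, f2, duration, fs):
    """Return an exponential sine sweep (f1 to f2 Hz) and its inverse filter."""
    n = int(round(duration * fs))
    t = np.arange(n) / fs
    rate = np.log(f2 / f1)
    sweep = np.sin(2.0 * np.pi * f1 * duration / rate
                   * (np.exp(t * rate / duration) - 1.0))
    # Time-reversed sweep with a decaying envelope that compensates for the
    # sweep spending more time (and energy) at low frequencies.
    inverse = sweep[::-1] * np.exp(-t * rate / duration)
    return sweep, inverse

def fft_convolve(a, b):
    """Linear convolution computed with zero-padded FFTs."""
    n = len(a) + len(b) - 1
    nfft = 1 << (n - 1).bit_length()
    spectrum = np.fft.rfft(a, nfft) * np.fft.rfft(b, nfft)
    return np.fft.irfft(spectrum, nfft)[:n]

def estimate_rir(recording, inverse, rir_len):
    """Deconvolve a recorded sweep; zero delay sits at lag len(inverse) - 1."""
    full = fft_convolve(recording, inverse)
    start = len(inverse) - 1
    rir = full[start:start + rir_len]
    return rir / np.max(np.abs(rir))

# Toy check (assumed values): a direct path at sample 50, one echo at sample 300.
fs = 16000
sweep, inverse = exp_sine_sweep(100.0, 7000.0, 1.0, fs)
true_rir = np.zeros(400)
true_rir[50] = 1.0
true_rir[300] = 0.5
recording = fft_convolve(sweep, true_rir)   # stands in for the microphone signal
estimated = estimate_rir(recording, inverse, 400)
peak = int(np.argmax(np.abs(estimated)))
print("strongest tap at sample", peak)
```

The strongest tap of the estimate should fall at or next to the direct-path delay of the toy room. The estimate is band-limited to the sweep range, so it approximates rather than reproduces the true response.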

We also retransmitted the LibriSpeech test-clean dataset for some of the speaker positions (column Ret. in the table above). This data is freely available from our web pages along with the RIRs. We also retransmitted a portion of the NIST Speaker Recognition Evaluation 2010 dataset and the HUB5 2000 RT eval set. The availability of this data is limited to sites that hold a valid LDC license for the original data.

All microphone positions are measured and stored in meta-files. We pre-calculated the positions of microphones and speakers in Cartesian and polar coordinates, both absolute and relative to the speaker.
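The relation between the two coordinate systems can be sketched as follows. The axis orientation, the angle conventions (azimuth in the horizontal plane, elevation above it, both in degrees), and the function name are assumptions made for illustration; the conventions actually used in the meta-files are described in README.txt.

```python
import math

def relative_polar(mic_xyz, spk_xyz):
    """Distance, azimuth and elevation of a microphone as seen from the speaker.

    Hypothetical helper: angles are in degrees, azimuth is measured from the
    x axis in the x-y plane, elevation from the x-y plane towards the z axis.
    """
    dx, dy, dz = (m - s for m, s in zip(mic_xyz, spk_xyz))
    distance = math.sqrt(dx * dx + dy * dy + dz * dz)
    azimuth = math.degrees(math.atan2(dy, dx))
    elevation = math.degrees(math.atan2(dz, math.hypot(dx, dy)))
    return distance, azimuth, elevation

# Example with made-up positions in metres (not taken from the database).
speaker = (1.0, 2.0, 1.5)
microphone = (2.0, 3.0, 1.5)
distance, azimuth, elevation = relative_polar(microphone, speaker)
print(f"{distance:.3f} m, azimuth {azimuth:.1f} deg, elevation {elevation:.1f} deg")
```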

Please see the attached README.txt for a more detailed description of the data.

More rooms and environments will come soon. If you publish a paper using this dataset, please cite http://arxiv.org/abs/1811.06795 and refer to this page. The recipe for the experiments reported in the paper is here: AMI_Kaldi_recipe.tar.gz

Feel free to send your feedback to szoke@fit.vutbr.cz with a subject mentioning BUT-ReverbDB.