Resources

Software

Speech transcription ANTS Multipass system for transcribing audio data, and in particular radio or TV shows. The audio stream is first split into homogeneous segments of a manageable size, and then each segment is decoded using the most adequate acoustic model with a large vocabulary continuous speech recognition engine (Julius or Sphinx). CoALT Software for …

Datasets

LibriMix LibriMix is an open source dataset for speech source separation in noisy environments. It is derived from LibriSpeech speech signals (clean subset) and WHAM noises, both of which are free to use. It hence offers a free alternative to the WHAM dataset and complements it. Github Page: https://github.com/JorisCos/LibriMix Reference: LibriMix: An Open-Source Dataset for …