Author's posts
Nov 30
Arabic speech synthesis
Speaker: Amal Houidhek Date: November 30, 2017 Abstract: The first part of the presentation investigates statistical parametric speech synthesis (SPSS) of Modern Standard Arabic (MSA): Hidden Markov Models (HMM)-based speech synthesis system relies on a description of speech segments corresponding to phonemes, with a large set of features that represent phonetic, phonologic, linguistic and contextual aspects. …
Oct 05
An annihilation filter approach for the blind identification of speech excited SIMO acoustic systems
Speaker: Mathieu Hu Date: October 5, 2017 Abstract: The characterization of the room impulse responses via the cross-relation is reinterpreted for noisy conditions and exploited in this work to propose an approach for the blind identification of acoustic channels from reverberant noisy speech signals. In this novel approach, which aims to annihilate the speech content from …
Sep 14
Dynamic out-of-vocabulary retrieval for automatic speech recognition
Speaker: Amélie Greiner Date: September 14, 2017 Abstract: To perform a transcription, a speech recognition system relies on a vocabulary that contains all the words that can be transcribed. In practice, it is impossible to include all the existing words in this vocabulary, which therefore contains only the most common words of the language. Out-of-vocabulary words …
Sep 07
Virtual Acoustic Space Learning for Auditory Scene Geometry Estimation
Speaker: Antoine Deleforge (Researcher, INRIA Rennes) Date: September 7, 2017 Abstract: Most auditory scene analysis methods (source separation, denoising, dereverberation, etc.) rely on some geometrical information about the system: Where are the sources? Where are the microphones? What is around or between them? Since the geometrical configuration of real-world systems is often very complex, classical approaches …
Aug 31
Anti-Spoofing Methods for Speaker Verification: Recent Advancements and Future Directions
Speaker: Md Sahidullah (Visiting Researcher) Date: August 31, 2017 Abstract: Automatic speaker verification (ASV) technology is recently finding its way to end-user applications. This voice-based authentication technology shows promising recognition performance in the controlled conditions. However, ASV technology is highly vulnerable to spoofing attacks where an intruder uses a synthetic or recorded voice to get illegitimate …
Jun 29
Black-box Optimization of Deep Neural Networks for Acoustic Modeling
Speaker: Aman Zaid Berhe Date: June 29, 2017 Abstract: Deep neural networks are now the state-of-the-art in acoustic modeling for automatic speech recognition. The allow obtaining robust and high accuracy acoustic models. However, these models have a lot of hyper-parameters. Hyper-parameters optimization is very tedious yet essential tasks to successfully train very deep neural networks. We …
Jun 22
HRTF range extrapolation by spherical harmonics decomposition
Speaker: Lauréline Perotin Date: June 22, 2017 Abstract: In order to locate sound in space, to know from which direction it comes from but also from how far away, our brain analyses all the distortions applied the soundwave from its (estimated) origin to our eardrums. Those reflections, diffractions and other propagation-related transformations are contained in the …
Jun 15
An Extended Experimental Investigation of DNN Uncertainty Propagation with Uncertainty Related Features for Noise Robust ASR
Speaker: Karan Nathwani (post-doctoral fellow) Date: June 15, 2017 Abstract: Recently, the idea of estimating the uncertainty about the features obtained after speech enhancement and propagating it to dynamically adapt deep neural network (DNN) based acoustic models has raised some interest. However, the results in the literature were reported on simulated noisy datasets for a limited variety …
May 18
Robust Online Direction of Arrival Estimation using Spherical Arrays
Speaker: V. Vishnu Vardan Varanasi Date: May 18, 2017 Abstract: DOA Estimation is a challenging task especially in presence of noise and reverberation. Various applications of acoustic source localization include Distant Automatic Speech Recognition, Music Information Retrieval etc. Spherical Microphone Array(SMA) captures spherical variation of acoustic field with spherical harmonics. A wide range of DOA estimation algorithms …
May 11
Explaining the parameterized Wiener filter with alpha-stable processes
Speaker: Mathieu Fontaine Date: May 11, 2017 Abstract: We introduce a new method for single-channel denoising that sheds new light on classical early developments on this topic that occurred in the 70’s and 80’s with Wiener filtering and spectral subtraction. Operating both in the short-time Fourier transform domain, these methods consist in estimating the power spectral …