Aditya Arie Nugraha

Author's posts

Nov 30

Arabic speech synthesis

Speaker: Amal Houidhek Date: November 30, 2017 Abstract: The first part of the presentation investigates statistical parametric speech synthesis (SPSS) of Modern Standard Arabic (MSA). A hidden Markov model (HMM)-based speech synthesis system relies on a description of speech segments corresponding to phonemes, with a large set of features that represent phonetic, phonological, linguistic and contextual aspects. …


Oct 05

An annihilation filter approach for the blind identification of speech excited SIMO acoustic systems

Speaker: Mathieu Hu Date: October 5, 2017 Abstract: The characterization of the room impulse responses via the cross-relation is reinterpreted for noisy conditions and exploited in this work to propose an approach for the blind identification of acoustic channels from reverberant noisy speech signals. In this novel approach, which aims to annihilate the speech content from …


Sep 14

Dynamic out-of-vocabulary retrieval for automatic speech recognition

Speaker: Amélie Greiner Date: September 14, 2017 Abstract: To perform a transcription, a speech recognition system relies on a vocabulary that contains all the words that can be transcribed. In practice, it is impossible to include all the existing words in this vocabulary, which therefore contains only the most common words of the language. Out-of-vocabulary words …


Sep 07

Virtual Acoustic Space Learning for Auditory Scene Geometry Estimation

Speaker: Antoine Deleforge (Researcher, INRIA Rennes) Date: September 7, 2017 Abstract: Most auditory scene analysis methods (source separation, denoising, dereverberation, etc.) rely on some geometrical information about the system: Where are the sources? Where are the microphones? What is around or between them? Since the geometrical configuration of real-world systems is often very complex, classical approaches …


Aug 31

Anti-Spoofing Methods for Speaker Verification: Recent Advancements and Future Directions

Speaker: Md Sahidullah (Visiting Researcher) Date: August 31, 2017 Abstract: Automatic speaker verification (ASV) technology has recently been finding its way into end-user applications. This voice-based authentication technology shows promising recognition performance in controlled conditions. However, ASV technology is highly vulnerable to spoofing attacks, where an intruder uses a synthetic or recorded voice to get illegitimate …


Jun 29

Black-box Optimization of Deep Neural Networks for Acoustic Modeling

Speaker: Aman Zaid Berhe Date: June 29, 2017 Abstract: Deep neural networks are now the state of the art in acoustic modeling for automatic speech recognition. They make it possible to obtain robust, high-accuracy acoustic models. However, these models have many hyper-parameters. Hyper-parameter optimization is a very tedious yet essential task for successfully training very deep neural networks. We …


Jun 22

HRTF range extrapolation by spherical harmonics decomposition

Speaker: Lauréline Perotin Date: June 22, 2017 Abstract: In order to locate a sound in space, that is, to know which direction it comes from and also how far away it is, our brain analyses all the distortions applied to the sound wave from its (estimated) origin to our eardrums. Those reflections, diffractions and other propagation-related transformations are contained in the …


Jun 15

An Extended Experimental Investigation of DNN Uncertainty Propagation with Uncertainty Related Features for Noise Robust ASR

Speaker: Karan Nathwani (post-doctoral fellow) Date: June 15, 2017 Abstract: Recently, the idea of estimating the uncertainty about the features obtained after speech enhancement and propagating it to dynamically adapt deep neural network (DNN)-based acoustic models has attracted some interest. However, the results in the literature were reported on simulated noisy datasets for a limited variety …


May 18

Robust Online Direction of Arrival Estimation using Spherical Arrays

Speaker: V. Vishnu Vardan Varanasi Date: May 18, 2017 Abstract: Direction of arrival (DOA) estimation is a challenging task, especially in the presence of noise and reverberation. Applications of acoustic source localization include distant automatic speech recognition and music information retrieval, among others. A spherical microphone array (SMA) captures the spherical variation of the acoustic field with spherical harmonics. A wide range of DOA estimation algorithms …


May 11

Explaining the parameterized Wiener filter with alpha-stable processes

Speaker: Mathieu Fontaine Date: May 11, 2017 Abstract: We introduce a new method for single-channel denoising that sheds new light on the classical early developments on this topic that occurred in the 1970s and 1980s with Wiener filtering and spectral subtraction. Both operating in the short-time Fourier transform domain, these methods consist in estimating the power spectral …
