Radu HORAUD

Author's posts

Modeling Reverberant Mixtures for Multichannel Audio-Source Separation

Wednesday, 22 March 2017, 10:30 – 11:30 am, room F107, INRIA Montbonnot. Seminar by Simon Leglaive, Telecom ParisTech, Paris. Abstract: We tackle the problem of multichannel audio-source separation in under-determined reverberant mixtures. The aim of this talk is to present source separation approaches that can take advantage of prior knowledge on the mixing filters, …

Audio-visual diarization dataset now available for download

We have just made our new AVDIAR dataset public. AVDIAR stands for “audio-visual diarization”. The dataset contains recordings of social gatherings made with two cameras and six microphones. Both the audio and visual data were carefully annotated, so that it is possible to evaluate the performance of various algorithms, such as person tracking, speech-source localization, speaker …

IEEE ICPR’16: Best Scientific Paper Award!

Xavier Alameda-Pineda and his co-authors from the University of Trento received the Intel Best Scientific Paper Award (Intel BSPA) in the “Image, Speech, Signal, and Video Processing” track at the 23rd IEEE International Conference on Pattern Recognition (ICPR’16), Cancun, Mexico, 4-8 December 2016, for their paper “Multi-Paced Dictionary Learning for Cross-Domain Retrieval and Recognition”. IEEE ICPR is …

(Closed) M.Sc. Project: Deep Learning for Voice Activity Detection

MSc project on “Deep Learning for Voice Activity Detection”. Duration: 6 months. Short description: Voice Activity Detection (VAD) is a technique that classifies a (possibly noisy) audio signal into speech and non-speech segments. It is an essential building block for many speech-based systems, such as speech recognition and spoken dialog for human-computer and human-robot interaction, …

Augmented parametric shapes for real-time dense 3D modeling using an RGB-D camera

Friday, December 9, 2016, 11:00 am to 12:00 pm, room F108, INRIA Montbonnot. Seminar by Diego Thomas, Kyushu University, Fukuoka, Japan. Abstract: Consumer-grade RGB-D cameras such as the Kinect camera or the Asus Xtion Pro camera have become the commodity tool to build dense 3D models of indoor scenes. The volumetric Truncated Signed …

A Hybrid Approach for Speech Enhancement Using GMM and Deep Neural Network Phoneme Classifier

Tuesday, October 18, 2016, 4:00 pm to 5:00 pm, room F108, INRIA Montbonnot. Seminar by Sharon Gannot, Bar-Ilan University. Abstract: In this work, we propose a hybrid approach for single-microphone speech enhancement, merging the generative Mixture of Gaussians (MoG) model and the discriminative deep neural network (DNN). The proposed algorithm is executed in …

(Closed) M.Sc. Project: Reinforcement and Deep Learning applied to Human-Robot Dialog

M.Sc. project on “Reinforcement and Deep Learning applied to Human-Robot Dialog”. Duration: 6 months (with a possible continuation as a PhD). Short description: The main goal of this project is to design and develop an automatic system to be used by a humanoid robot in multiparty human-robot interaction (i.e. involving several participants). The system has to …

(Closed) Job offer: signal and image processing engineer

Fixed-term job offer (CDD): expert development engineer in signal and image processing for robotics. Start date and duration: October/November 2016, 12 months, renewable (up to 36 months). Mission: As part of the ERC VHIA project, the PERCEPTION team (https://team.inria.fr/perception) at the INRIA Grenoble Rhône-Alpes research center, located in Montbonnot Saint-Martin, …

Binaural sound reproduction for the hearing impaired

Thursday, October 6, 2016, 10:00 am to 11:00 am, room F107, INRIA Montbonnot. Seminar by Noam Shabtai, Ben-Gurion University. Abstract: In most hearing aid systems, microphone-array signal processing algorithms may be employed to reduce noise and enhance signals arriving from specific directions. However, most noise reduction algorithms …

Modelling face-to-face conversational interaction with robots

Thursday, October 6, 2016, 11:00 am to 12:00 pm, room F107, INRIA Montbonnot. Seminar by Gabriel Skantze, KTH, Stockholm, Sweden. Abstract: When humans interact and collaborate with each other, they coordinate their turn-taking behaviours using verbal and non-verbal signals, expressed in the face and voice. If robots of the future are supposed to engage in social interaction with humans, it …
