Uncategorized – RobotLearn

Diffusion-based Unsupervised Audio-visual Speech Enhancement

Xavier ALAMEDA-PINEDA 2025/01/11 2025/04/11Research, Sound, Uncategorized

by Jean-Eudes Ayilo, Mostafa Sadeghi, Romain Serizel, Xavier Alameda-Pineda IEEE International Conference on Audio, Speech, and Signal Processing [ paper ] [ code ] Abstract: —This paper proposes a new unsupervised audiovisual speech enhancement (AVSE) approach that combines a diffusion-based audio-visual speech generative model with a non-negative matrix factorization (NMF)…

AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder

Xavier ALAMEDA-PINEDA 2025/01/11 2025/04/11Research, Sound, Uncategorized

by Samir Sadok, Simon Leglaive, Laurent Girin, Gaël Richard, Xavier Alameda-Pineda IEEE International Conference on Audio, Speech, and Signal Processing [ paper ] [ code ] Abstract: This article introduces AnCoGen, a novel method that leverages a masked autoencoder to unify the analysis, control, and generation of speech signals within…

Lost and found: Overcoming detector failures in online multi-object tracking

Xavier ALAMEDA-PINEDA 2024/09/02 2025/04/11Research, Uncategorized, Vision

by Lorenzo Vaquero, Yihong Xu, Xavier Alameda-Pineda, Víctor M Brea, Manuel Mucientes European Conference on Computer Vision [ paper ] [ code ] Abstract: Multi-object tracking (MOT) endeavors to precisely estimate the positions and identities of multiple objects over time. The prevailing approach, tracking-by-detection (TbD), first detects objects and then…

Vq-hps: Human pose and shape estimation in a vector-quantized latent space

Xavier ALAMEDA-PINEDA 2024/09/02 2025/04/11Research, Uncategorized, Vision

by Guénolé Fiche, Simon Leglaive, Xavier Alameda-Pineda, Antonio Agudo, Francesc Moreno-Noguer European Conference on Computer Vision [ paper ] [ code ] Abstract: Previous works on Human Pose and Shape Estimation (HPSE) from RGB images can be broadly categorized into two main groups: parametric and non-parametric approaches. Parametric techniques leverage…

Navigating the Practical Pitfalls of Reinforcement Learning for Social Robot Navigation

Xavier ALAMEDA-PINEDA 2024/08/03 2025/04/11Reinforcement Learning, Research, Uncategorized

by Dhimiter Pikuli, Jordan Cosio, Xavier Alameda-Pineda, Pierre-Brice Wieber, Thierry Fraichard Robotics: Science and Systems (RSS) Workshop on Unsolved Problems in Social Robot Navigation [ paper ] Navigation is one of the essential tasks in order for robots to be deployed in environments shared with humans. The problem becomes increasingly…

Learning for Companion Robots: Preparation and Adaptation

Xavier ALAMEDA-PINEDA 2024/07/11 2025/04/11Reinforcement Learning, Research, Sound, Uncategorized, Vision

Xavier Alameda-Pineda was a keynote speaker at RFIAP/cAP 2024, on the topic of Learning for Companion Robots: Preparation and Adaptation.

Deep Regression Models and Computer Vision Applications for Multiperson Human-Robot Interaction

Stephane LATHUILIERE 2018/05/17 2021/08/04News, Seminars, Uncategorized

PhD defense by Stéphane Lathuilière Tuesday 22nd May 2018, 11:00, Grand Amphithéatre INRIA Grenoble Rhône-Alpes, Montbonnot Saint-Martin Abstract: In order to interact with humans, robots need to perform basic perception tasks such as face detection, human pose estimation or speech recognition. However, in order have a natural interaction with humans,…

Audio-Visual Analysis in the Framework of Humans Interacting with Robots

Radu HORAUD 2018/04/09 2018/04/09News, Seminars, Uncategorized

PhD defense by Israel D. Gebru Friday 13 April 2018, 9:30 – 10:30, Grand Amphithéatre INRIA Grenoble Rhône-Alpes, Montbonnot Saint-Martin In recent years, there has been a growing interest in human-robot interaction (HRI), with the aim to enable robots to naturally interact and communicate with humans. Natural interaction implies that…

Binaural sound reproduction for the hearing impaired

Radu HORAUD 2016/09/30 2016/09/30Seminars, Uncategorized

Thursday, October 6, 2016, 10:00 am to 11:00 am, room F107, INRIA Montbonnot Seminar by Noam Shabtai, Ben Gourion University Abstract: In most hearing aids systems, microphone array signal processing algorithms may be employed in order to reduce the noise and enhance the signals that are arriving from specific directions….

January 2015: two accepted papers

Radu HORAUD 2015/02/02 2015/03/01News, Uncategorized

Two papers just accepted for publications in IEEE TPAMI and IEEE TASLP: Fusion of Range and Stereo Data for High-Resolution Scene-Modeling Georgios Evangelidis, Miles Hansard, Radu Horaud IEEE Transactions on Pattern Analysis and Machine Intelligence, Institute of Electrical and Electronics Engineers (IEEE), 2015, pp.14. <http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=7031946>. <10.1109/TPAMI.2015.2400465> Co-Localization of Audio Sources in…