Xavier ALAMEDA-PINEDA

Author's posts

(Closed) MSc. Project on Speaker identity modeling with deep learning for re-identification

Short description: Speaker identification aims to determine which speaker produced a given utterance [1]. Speaker verification, or re-identification, instead aims to determine whether there is a match between a given speech utterance and a target speaker …

(Closed) MSc. Project on Coupled Audio-visual Multi-speaker Tracking

Short description: Multi-speaker tracking has been widely investigated, and the Perception team has contributed a consistent methodological framework based on variational Bayes techniques [1-4]. Audio-visual tracking methods often first map all auditory and visual information into the same space and then run a tracking algorithm. However, in …

(Closed) MSc. Project on Gazeable Objects

Duration: about 6 months
Short description: Gaze is the direction in which a person is looking. The automatic estimation of gaze from single images and from videos has been a hot research topic in recent years [1-4]. Researchers have often studied gaze from a human-centered perspective, trying to answer the …

Multi-Microphone Speaker Localization on Manifolds: Achievements and Challenges

Wednesday, September 27th 2017, 10:30 – 12:00, room F107, INRIA Montbonnot
Seminar by Prof. Sharon Gannot, Bar-Ilan University, Israel
Joint work with Bracha Laufer-Goldshtein, Bar-Ilan University, Israel, and Prof. Ronen Talmon, The Technion-IIT, Israel
Abstract: Speech enhancement is a core problem in audio signal processing, with commercial applications …
