Category: Job Offers

(Closed) MSc. Project on Speaker identity modeling with deep learning for re-identification

Short description: Speaker identification is the task of determining which speaker produced a given utterance [1]. Speaker verification, or re-identification, on the other hand, aims at determining whether there is a match between a given speech utterance and a target speaker …


(Closed) MSc. Project on Coupled Audio-visual Multi-speaker Tracking

Short description: Multi-speaker tracking has been widely investigated, and the Perception team has contributed a consistent methodological framework based on variational Bayes techniques [1-4]. Often, audio-visual tracking methods first map all auditory and visual information into the same space and then run a tracking algorithm. However, in …


(Closed) MSc. Project on Gazeable Objects

Duration: about 6 months. Short description: Gaze is the direction in which a person is looking. The automatic estimation of gaze from a single image and from videos has been an active research topic in recent years [1-4]. Often, researchers have studied gaze from a human-centered perspective, trying to answer the …


(Closed) MSc. Project: Speech enhancement with deep neural networks

Duration: about 6 months. Short description: Speech enhancement [1] is an important preprocessing step for various speech information retrieval tasks such as automatic speech recognition. The goal of a speech enhancement method is to provide a clean speech signal from a noisy recording that contains interfering audio …


(Closed) MSc. Project: Robust voice activity detection with deep neural networks

Duration: about 6 months. Short description: Voice activity detection (VAD) is the problem of segmenting a given audio signal into speech and non-speech sections. It constitutes an essential part of many modern speech-based systems such as those for speech and speaker recognition, speech enhancement, emotion …


Software engineer / Audio-visual perception for robotics

Context: The Perception team (https://team.inria.fr/perception), at INRIA Grenoble Rhône-Alpes and the Jean Kuntzmann Laboratory at Grenoble Alpes University, works on computational models for mapping images and sounds onto meaning and actions. The team members address the following challenging topics: computer vision, auditory signal processing and scene analysis, machine learning, and robotics. In particular, we develop methods for the …


(Closed) MSc project: Deep Reinforcement Learning for Human-Robot Interaction

Duration: 6 months. Short description: This internship proposal is part of a broader research project whose goal is to provide robots with basic social skills, so that they can interact with human beings in a fluid and socially acceptable manner. With this long-term goal in mind, we designed and developed a reinforcement …


(Closed) MSc. Project: Deep Learning for Voice Activity Detection

Duration: 6 months. Short description: Voice Activity Detection (VAD) is a technique that classifies a (possibly noisy) audio signal into speech and non-speech segments. It is an essential building block for many speech-based systems, such as speech recognition and spoken dialog for human-computer and human-robot interaction, …


(Closed) M.Sc. Project: Reinforcement and Deep Learning applied to Human-Robot Dialog

Duration: 6 months (and it may continue with a PhD). Short description: The main goal of this project is to design and develop an automatic system to be used by a humanoid robot in multiparty human-robot interaction (i.e. involving several participants). The system has to …


(Closed) Job offer: signal and image processing engineer

Fixed-term job offer: expert development engineer in signal and image processing for robotics. Start date and duration: October/November 2016, 12 months, renewable (up to 36 months). Mission: As part of the ERC VHIA project, the PERCEPTION team (https://team.inria.fr/perception), at the INRIA Grenoble Rhône-Alpes research center located in Montbonnot Saint-Martin, …
