EARS

Embodied Audition for Robots (EARS)

EARS explores new algorithms for enhancing the auditive capabilities of humanoid robots. A main focus is to develop the fundamentals for a natural spoken dialogue between humans and robots in adverse acoustical environments

EARS publications by the PERCEPTION team

The European FP7 STREP project EARS started on 1 January 2014 for a period of three years (until 31 December 2016). The PERCEPTION team is associated to five other partners: University Erlangen-Nuremberg (prof. Walter Kellermann, project coordinator), Imperial College London (prof. Patrick Naylor), Ben-Gourion University of the Negev (prof. Boaz Rafaely), Humboldt University Berlin (Prof. Verena Hafner), and Aldebaran Robotics (Dr. Rodolphe Gelin).

Project website: http://robot-ears.eu/

Summary (as submitted at the start of the project)

The success of future natural intuitive human-robot interaction (HRI) will critically depend on how responsive the robot will be to all forms of human expressions and how well it will be aware of its environment. With acoustic signals distinctively characterizing physical environments and speech being the most effective means of communication among humans, truly humanoid robots must be able to fully extract the rich auditory information from their environment and to use voice communication as much as humans do. While vision-based HRI is well developed, current limitations in robot audition do not allow for such an effective, natural acoustic human-robot communication in real-world environments, mainly because of the severe degradation of the desired acoustic signals due to noise, interference and reverberation when captured by the robot’s microphones. To overcome these limitations, EARS will provide intelligent “ears” with close-to-human auditory capabilities and use it for HRI in complex real-world environments. Novel microphone arrays and powerful signal processing algorithms shall be able to localize and track multiple sound sources of interest and to extract and recognize the desired signals. After fusion with robot vision, embodied robot cognition will then derive HRI actions and knowledge on the entire scenario, and feed this back to the acoustic interface for further auditory scene analysis. As a prototypical application, EARS will consider a welcoming robot in a hotel lobby offering all the above challenges. Representing a large class of generic applications, this scenario is of key interest to industry and, thus, a leading European robot manufacturer will integrate EARS’s results into a robot platform for the consumer market and validate it. In addition, the provision of open-source software and an advisory board with key players from the relevant robot industry should help to make EARS a turnkey project for promoting audition in the robotics world.

Perception team members involved in the EARS project

Xiaofei Li, Dionyssos Kounades-Bastian, Israel Dejene-Gebru, Soraya Arias, Fabien Badeig, Laurent Girin and Radu Horaud.