A Proposal-based Paradigm for Self-supervised Sound Source Localization in Videos
![](https://team.inria.fr/robotlearn/files/2022/04/xuan-cvpr-2022-teaser-350x280.png)
Hanyu Xuan, Zhiliang Wu, Jian Yang, Yan Yan, Xavier Alameda-Pineda

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022, New Orleans, US [HAL]

Abstract. Humans can easily recognize where and how a sound is produced by watching a scene and listening to the corresponding audio cues. To achieve such cross-modal perception on machines, existing methods…