Return to Research

Video Grounding

From Video Matching to Video Grounding
G. Evangelidis, F. Diego, R. Horaud


This paper addresses the background estimation problem for videos captured by moving cameras, referred to as video grounding. It essentially aims at reconstructing a video, as if it would be without foreground objects, e.g. cars or people. What differentiates video grounding from known background estimation methods is that the camera follows unconstrained motion so that background undergoes ongoing changes. We build on video matching aspects since more videos contribute to the reconstruction. Without loss of generality, we investigate a challenging case where videos are recorded by in-vehicle cameras that follow the same road. Other than video synchronization and spatiotemporal alignment, we focus on the background reconstruction by exploiting inter- and intra-sequence similarities. In this context, we propose a Markov random field formulation that integrates the temporal coherence of videos while it exploits the decisions of a support vector machine classifier about the backgroundness of regions in video frames. Experiments with real sequences recorded by moving vehicles verify the potential of the video grounding algorithm against state-of-art baselines.


Download PDF

From Video Matching to Video Grounding
G. Evangelidis, F. Diego, R. Horaud,
ICCV Workshop on Computer Vision in Vehicle Technology: From Earth to Mars, Sidney, 2013


author = {Evangelidis, G. and Diego, F. and Horaud, R.},
title = {From Video Matching to Video Grounding},
booktitle = {ICCV Workshop on Computer Vision in Vehicle Technology},
year = {2013},

Video results

Highway Rural

Chen et al. [1]

Chen et al. [1]
Whyte et al. [2]
Whyte et al. [2]

Proposed algorithm

Proposed algorithm


[1] X. Chen, Y. Shen, and Y. H. Yang. Background estimation using graph cuts and inpainting. In Proceedings of Graphics Interface (GI) 2010
[2] O. Whyte, J. Sivic, and A. Zisserman. Get out of my picture! internet-based inpainting. In Proceedings of the 20th British Machine Vision Conference, London, 2009