
Presentation (2012): CMU Robotics Institute Seminar

Video Analysis and Enhancement: Video Stabilization and Rolling Shutter Removal on YouTube

Irfan Essa
Georgia Tech
School of Interactive Computing
GVU and RIM @ GT Centers

October 19, 2012, 3:30 PM, NSH 1305


In this talk, I will discuss a variety of approaches my group is working on for video analysis and enhancement. In particular, I will describe our approach for a video stabilizer, currently implemented and running on YouTube, and its extensions.

This method generates stabilized videos by employing L1-optimal camera paths to remove undesirable motions [1]. We compute camera paths that are optimally partitioned into constant, linear, and parabolic segments, mimicking the camera motions employed by professional cinematographers. To this end, we propose a linear programming framework that minimizes the first, second, and third derivatives of the resulting camera path. Our method allows for video stabilization beyond conventional filtering, which only suppresses high-frequency jitter.

An additional challenge in videos shot on mobile phones is rolling shutter distortion. Modern CMOS cameras capture each frame one scan-line at a time, which results in non-rigid image distortions such as shear and wobble. I will demonstrate a solution based on a novel mixture model of homographies, parametrized by scan-line blocks, that corrects these rolling shutter distortions [2]. Our method neither relies on a priori knowledge of the readout time nor requires prior camera calibration. A thorough evaluation, based on a user study and direct comparisons to other approaches, demonstrates a general preference for our algorithm.
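As a rough illustration of the linear-programming idea behind the L1-optimal camera paths (a toy sketch, not the implementation running on YouTube), the code below smooths a one-dimensional camera trajectory. It minimizes a weighted sum of the absolute first, second, and third differences of the new path by introducing one slack variable per difference. The function names `diff_matrix` and `l1_smooth_path`, the parameter `max_dev`, and the weights are all assumptions made for this example. The simple bound `|p - c| <= max_dev` is a stand-in for whatever constraint keeps the stabilized view inside the original frame.

```python
import numpy as np
from scipy.optimize import linprog


def diff_matrix(n, order):
    """Return the (n - order) x n finite-difference matrix of the given order."""
    D = np.eye(n)
    for _ in range(order):
        D = np.diff(D, axis=0)
    return D


def l1_smooth_path(c, max_dev, weights=(10.0, 1.0, 100.0)):
    """Smooth a 1-D path c with an L1 penalty on its first three differences.

    Hypothetical toy version: weights and max_dev are illustrative values.
    Variables are x = [p, e1, e2, e3], where each e_k bounds |D_k p|.
    """
    c = np.asarray(c, dtype=float)
    n = len(c)
    Ds = [diff_matrix(n, k) for k in (1, 2, 3)]
    sizes = [D.shape[0] for D in Ds]
    n_slack = sum(sizes)

    # Objective: zero cost on the path itself, weighted cost on the slacks.
    cost = np.concatenate(
        [np.zeros(n)] + [w * np.ones(m) for w, m in zip(weights, sizes)]
    )

    # Inequalities  D_k p - e_k <= 0  and  -D_k p - e_k <= 0,
    # which together encode |D_k p| <= e_k.
    rows = []
    offset = 0
    for D, m in zip(Ds, sizes):
        S = np.zeros((m, n_slack))
        S[:, offset:offset + m] = np.eye(m)
        rows.append(np.hstack([D, -S]))
        rows.append(np.hstack([-D, -S]))
        offset += m
    A_ub = np.vstack(rows)
    b_ub = np.zeros(A_ub.shape[0])

    # The new path may deviate from the original by at most max_dev;
    # slack variables are non-negative.
    bounds = [(ci - max_dev, ci + max_dev) for ci in c] + [(0, None)] * n_slack

    res = linprog(cost, A_ub=A_ub, b_ub=b_ub, bounds=bounds, method="highs")
    if not res.success:
        raise RuntimeError(res.message)
    return res.x[:n]
```

Because the penalty is an L1 norm rather than a squared error, the solution tends to set many differences exactly to zero. This is what yields segments of constant position, constant velocity, or constant acceleration instead of a uniformly low-pass-filtered path. For example, calling `l1_smooth_path(c, max_dev=1.0)` on a path that jitters by less than one unit around a fixed value should return an essentially constant path.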

I will conclude the talk by showcasing a live demo of the stabilizer. This work is in collaboration with Matthias Grundmann and Vivek Kwatra at Google, and appears in the following two papers.

Time permitting, I will discuss some other projects we are working on, including video segmentation and retargeting.

[1] Matthias Grundmann, Vivek Kwatra, Irfan Essa, CVPR 2011

[2] Matthias Grundmann, Vivek Kwatra, Daniel Castro, Irfan Essa, ICCP 2012, Best paper

Host: Takeo Kanade

via Robotics Institute: Talks and Seminars.

Categories: Computational Photography and Video, Matthias Grundmann, Presentations, Vivek Kwatra | Date: October 19th, 2012 | By: Irfan Essa
