<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>prof.irfanessa.com &#187; CVPR</title>
	<atom:link href="http://prof.irfanessa.com/tag/cvpr/feed/" rel="self" type="application/rss+xml" />
	<link>http://prof.irfanessa.com</link>
	<description>Irfan Essa&#039;s Academic Activities</description>
	<lastBuildDate>Wed, 25 Jan 2012 23:42:09 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	
		<item>
		<title>DEMO (2011): Auto-Directed Video Stabilization with Robust L1 Optimal Camera Paths &#8211; from Google Research Blog</title>
		<link>http://prof.irfanessa.com/2011/06/20/videostabilization-youtube/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=videostabilization-youtube</link>
		<comments>http://prof.irfanessa.com/2011/06/20/videostabilization-youtube/#comments</comments>
		<pubDate>Mon, 20 Jun 2011 22:17:47 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Computational Photography and Video]]></category>
		<category><![CDATA[In The News]]></category>
		<category><![CDATA[Matthias Grundmann]]></category>
		<category><![CDATA[Mobile Computing]]></category>
		<category><![CDATA[PAMI/ICCV/CVPR/ECCV]]></category>
		<category><![CDATA[Vivek Kwatra]]></category>
		<category><![CDATA[2011]]></category>
		<category><![CDATA[Computational Photography]]></category>
		<category><![CDATA[Computational Video]]></category>
		<category><![CDATA[Computer Vision]]></category>
		<category><![CDATA[CVPR]]></category>
		<category><![CDATA[Video Stabilization]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=738</guid>
		<description><![CDATA[via Auto-Directed Video Stabilization with Robust L1 Optimal Camera Paths &#8211; Google Research Blog. Auto-Directed Video Stabilization with Robust L1 Optimal Camera Paths Posted by Matthias Grundmann, Vivek Kwatra, and Irfan Essa, Earlier this year, we announced the launch of new features on the YouTube Video Editor, including stabilization for shaky videos, with the ability to preview them in [...]]]></description>
			<content:encoded><![CDATA[<p>via <a href="http://googleresearch.blogspot.com/2011/06/auto-directed-video-stabilization-with.html">Auto-Directed Video Stabilization with Robust L1 Optimal Camera Paths &#8211; Google Research Blog</a>.</p>
<blockquote>
<h4>Auto-Directed Video Stabilization with Robust L1 Optimal Camera Paths<br />
Posted by <a href="http://research.google.com/pubs/author38919.html">Matthias Grundmann</a>, <a href="http://research.google.com/pubs/author38000.html">Vivek Kwatra</a>, and <a href="http://www.irfanessa.com/Work/Welcome.html">Irfan Essa</a>,</h4>
<p>Earlier this year, we announced the launch of new features on the YouTube Video Editor, including stabilization for shaky videos, with the ability to preview them in real-time. The core technology behind this feature is detailed in this paper, which will be presented at the IEEE International Conference on Computer Vision and Pattern Recognition (CVPR 2011).</p>
<p>Casually shot videos captured by handheld or mobile cameras suffer from significant amount of shake. Existing in-camera stabilization methods dampen high-frequency jitter but do not suppress low-frequency movements and bounces, such as those observed in videos captured by a walking person. On the other hand, most professionally shot videos usually consist of carefully designed camera configurations, using specialized equipment such as tripods or camera dollies, and employ ease-in and ease-out for transitions. Our goal was to devise a completely automatic method for converting casual shaky footage into more pleasant and professional looking videos.</p>
<p style="text-align: center;">
<p><a href="http://www.youtube.com/watch?v=0MiY-PNy-GU&#038;fmt=18">http://www.youtube.com/watch?v=0MiY-PNy-GU</a></p>
</p>
<p>Our technique mimics the cinematographic principles outlined above by automatically determining the best camera path using a robust optimization technique. The original, shaky camera path is divided into a set of segments, each approximated by either a constant, linear or parabolic motion. Our optimization finds the best of all possible partitions using a computationally efficient and stable algorithm.</p>
<p>To achieve real-time performance on the web, we distribute the computation across multiple machines in the cloud. This enables us to provide users with a real-time preview and interactive control of the stabilized result. Above we provide a video demonstration of how to use this feature on the YouTube Editor. We will also demo this live at Google’s exhibition booth in CVPR 2011.</p></blockquote>
<p>For more details see the <a href="http://www.cc.gatech.edu/cpl/projects/videostabilization/">Project Site</a>. See the <a href="http://www.youtube.com/watch?v=0MiY-PNy-GU&amp;feature=player_embedded">youtube video of the system on youtube</a>. See the <a href="http://www.cc.gatech.edu/cpl/projects/videostabilization/stabilization.pdf">paper in PDF</a>, and a <a href="http://www.youtube.com/watch?v=i5keG1Y810U&amp;feature=player_embedded">technical video of the work</a>.</p>
<p>Full paper is</p>
<ul>
<li><a href="http://research.google.com/pubs/author38919.html">Matthias Grundmann</a>, <a href="http://research.google.com/pubs/author38000.html">Vivek Kwatra</a>, and <a href="http://www.irfanessa.com/Work/Welcome.html">Irfan Essa</a> (2011), &#8220;Auto-Directed Video Stabilization with Robust L1 Optimal Camera Paths,&#8221; In Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition (CVPR 2011), Colorado Springs, CO, USA. [<a href="http://www.cc.gatech.edu/cpl/projects/videostabilization/stabilization.pdf">PDF</a>][<a href="http://www.youtube.com/watch?v=i5keG1Y810U&amp;feature=player_embedded">Video</a>][<a href="http://googleresearch.blogspot.com/2011/06/auto-directed-video-stabilization-with.html">Blog</a>][<a href="http://www.youtube.com/watch?v=0MiY-PNy-GU&amp;feature=player_embedded">Demo</a>][<a href="http://www.cc.gatech.edu/cpl/projects/videostabilization/">Project Site</a>]</li>
</ul>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2011/06/20/videostabilization-youtube/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper (2011) in IEEE CVPR: &#8220;Auto-Directed Video Stabilization with Robust L1 Optimal Camera Paths&#8221;</title>
		<link>http://prof.irfanessa.com/2011/06/19/videostabilization/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=videostabilization</link>
		<comments>http://prof.irfanessa.com/2011/06/19/videostabilization/#comments</comments>
		<pubDate>Sun, 19 Jun 2011 22:36:41 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Computational Photography and Video]]></category>
		<category><![CDATA[Matthias Grundmann]]></category>
		<category><![CDATA[PAMI/ICCV/CVPR/ECCV]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[Vivek Kwatra]]></category>
		<category><![CDATA[2011]]></category>
		<category><![CDATA[Computational Photography]]></category>
		<category><![CDATA[Computational Video]]></category>
		<category><![CDATA[Computer Vision]]></category>
		<category><![CDATA[CVPR]]></category>
		<category><![CDATA[Video Stabilization]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=747</guid>
		<description><![CDATA[Auto-Directed Video Stabilization with Robust L1 Optimal Camera Paths Grundmann, Kwatra, and Essa (2011), “Auto-Directed Video Stabilization with Robust L1 Optimal Camera Paths,” in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2011.  [PDF] [WEBSITE][VIDEO] [DEMO][Google Research Blog] [BIBTEX] @inproceedings{2011-Grundmann-AVSWROCP, Author = {M. Grundmann and V. Kwatra and I. Essa}, Booktitle = {Proceedings of IEEE Conference [...]]]></description>
			<content:encoded><![CDATA[<h3>Auto-Directed Video Stabilization with Robust L1 Optimal Camera Paths</h3>
<ul>
<li>Grundmann, Kwatra, and Essa (2011), “Auto-Directed Video Stabilization with Robust L1 Optimal Camera Paths,” in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2011.  <a title="PDF" href="http://www.cc.gatech.edu/~irfan/p/2011-Grundmann-AVSWROCP">[PDF]</a> <a title="Project Website" href="http://www.cc.gatech.edu/cpl/projects/videostabilization/">[WEBSITE]</a><a title="VIDEO" href="http://www.youtube.com/watch?v=i5keG1Y810U">[VIDEO]</a> <a title="DEMO" href="http://www.youtube.com/watch?v=0MiY-PNy-GU">[DEMO][</a>Google Research <a href="http://googleresearch.blogspot.com/2011/06/auto-directed-video-stabilization-with.html">Blog</a><a title="DEMO" href="http://www.youtube.com/watch?v=0MiY-PNy-GU">]</a> <a id="papercite_3" class="papercite_toggle" href="javascript:void(0)">[BIBTEX]</a>
<pre id="papercite_3_block" class="papercite_bibtex"><code> @inproceedings{2011-Grundmann-AVSWROCP, Author = {M. Grundmann and V. Kwatra and I. Essa}, Booktitle = {Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR)}, Month = {June}, Pdf = {http://www.cc.gatech.edu/~irfan/p/2011-Grundmann-AVSWROCP}, Publisher = {IEEE Computer Society}, Title = {Auto-Directed Video Stabilization with Robust L1 Optimal Camera Paths}, Url = {http://www.cc.gatech.edu/cpl/projects/videostabilization/}, Video = {http://www.youtube.com/watch?v=i5keG1Y810U}, Year = {2011}}</code></pre>
</li>
</ul>
<h4 style="text-align: left;"><strong>Abstract</strong></h4>
<p style="text-align: left;">We present a novel algorithm for automatically applying constrainable, L1-optimal camera paths to generate stabilized videos by removing undesired motions. Our goal is to compute camera paths that are composed of constant, linear and parabolic segments mimicking the camera motions employed by professional cinematographers. To this end, our algorithm is based on a linear programming framework to minimize the first, second, and third derivatives of the resulting camera path. Our method allows for video stabilization beyond the conventional filtering of camera paths that only suppresses high frequency jitter. We incorporate additional constraints on the path of the camera directly in our algorithm, allowing for stabilized and retargeted videos. Our approach accomplishes this without the need of user interaction or costly 3D reconstruction of the scene, and works as a post-process for videos from any camera or from an online source.</p>
<p><img class="aligncenter" src="http://www.cc.gatech.edu/cpl/projects/videostabilization/teaser.png" alt="" width="500" /></p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2011/06/19/videostabilization/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper in CVPR (2010): &#8220;Motion Field to Predict Play Evolution in Dynamic Sport Scenes</title>
		<link>http://prof.irfanessa.com/2010/06/13/playevolution-cvpr2010/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=playevolution-cvpr2010</link>
		<comments>http://prof.irfanessa.com/2010/06/13/playevolution-cvpr2010/#comments</comments>
		<pubDate>Sun, 13 Jun 2010 15:25:10 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Activity Recognition]]></category>
		<category><![CDATA[Jessica Hodgins]]></category>
		<category><![CDATA[Kihwan Kim]]></category>
		<category><![CDATA[Matthias Grundmann]]></category>
		<category><![CDATA[PAMI/ICCV/CVPR/ECCV]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[Sports Visualization]]></category>
		<category><![CDATA[2010]]></category>
		<category><![CDATA[Computational Video]]></category>
		<category><![CDATA[Computer Vision]]></category>
		<category><![CDATA[CVPR]]></category>
		<category><![CDATA[Publications]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=659</guid>
		<description><![CDATA[Kihwan Kim, Matthias Grundmann, Ariel Shamir, Iain Matthews, Jessica Hodgins, Irfan Essa (2010) &#8220;Motion Field to Predict Play Evolution in Dynamic Sport Scenes&#8221; in Proceedings of IEEE Computer Vision and Pattern Recognition Conference (CVPR), San Francisco, CA, USA, June 2010 [PDF][Website][DOI][Video (Youtube)]. Abstract Videos of multi-player team sports provide a challenging domain for dynamic scene analysis. Player actions [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.cc.gatech.edu/~kihwan23">Kihwan Kim</a>, <a href="http://www.cc.gatech.edu/~grundman">Matthias Grundmann</a>, <a href="http://www.faculty.idc.ac.il/arik/">Ariel Shamir</a>, Iain Matthews, Jessica Hodgins, <a href="http://www.irfanessa.com">Irfan Essa</a> (2010) &#8220;<a href="http://www.cc.gatech.edu/cpl/projects/playevolution/">Motion Field to Predict Play Evolution in Dynamic Sport Scenes</a>&#8221; in Proceedings of <a href="http://cvl.umiacs.umd.edu/conferences/cvpr2010/" target="_blank">IEEE Computer Vision and Pattern Recognition Conference (CVPR)</a>, San Francisco, CA, USA, June 2010 [<a href="http://www.cc.gatech.edu/cpl/projects/playevolution/cvpr2010-pe.pdf">PDF</a>][<a href="http://www.cc.gatech.edu/cpl/projects/playevolution/">Website</a>][DOI][<a href="http://www.youtube.com/watch?v=jrksnCR1S0s&amp;feature=player_embedded">Video (Youtube)</a>].</p>
<p style="text-align: center;"><strong>Abstract</strong></p>
<p style="text-align: justify;">Videos of multi-player team sports provide a challenging domain for dynamic scene analysis. Player actions and interactions are complex as they are driven by many factors, such as the short-term goals of the individual player, the overall team strategy, the rules of the sport, and the current context of the game. We show that constrained multi-agent events can be analyzed and even predicted from video. Such analysis requires estimating the global movements of all players in the scene at any time, and is needed for modeling and predicting how the multi-agent play evolves over time on the field. To this end, we propose a novel approach to detect the locations of where the play evolution will proceed, e.g. where interesting events will occur, by tracking player positions and movements over time. We start by extracting the ground level sparse movement of players in each time-step, and then generate a dense motion field. Using this field we detect locations where the motion converges, implying positions towards which the play is evolving. We evaluate our approach by analyzing videos of a variety of complex soccer plays.</p>
<p style="text-align: center;"><a href="http://www.cc.gatech.edu/cpl/projects/playevolution/"><img class="aligncenter" title="CVPR 2010 Paper on Play Evolution" src="http://www.cc.gatech.edu/cpl/projects/playevolution/images/teaser.jpg" alt="CVPR 2010 Paper on Play Evolution" width="500" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2010/06/13/playevolution-cvpr2010/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper in CVPR (2010): &#8220;Discontinuous Seam-Carving for Video Retargeting&#8221;</title>
		<link>http://prof.irfanessa.com/2010/06/13/videoretargeting-cvp2010/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=videoretargeting-cvp2010</link>
		<comments>http://prof.irfanessa.com/2010/06/13/videoretargeting-cvp2010/#comments</comments>
		<pubDate>Sun, 13 Jun 2010 15:04:48 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Computational Photography and Video]]></category>
		<category><![CDATA[Matthias Grundmann]]></category>
		<category><![CDATA[PAMI/ICCV/CVPR/ECCV]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[Vivek Kwatra]]></category>
		<category><![CDATA[2010]]></category>
		<category><![CDATA[Computational Photography]]></category>
		<category><![CDATA[Computational Video]]></category>
		<category><![CDATA[CVPR]]></category>
		<category><![CDATA[Publications]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=654</guid>
		<description><![CDATA[Discontinuous Seam-Carving for Video Retargeting Matthias Grundmann, Vivek Kwatra, Mei Han, Irfan Essa (2010) &#8220;Discontinuous Seam-Carving for Video Retargeting&#8221; in Proceedings of IEEE Computer Vision and Pattern Recognition Conference (CVPR), San Francisco, CA, USA, June 2010 [PDF][Website][DOI][Video (Youtube)]. Grundmann, Kwatra, Han, and Essa (2010), &#8220;Discontinuous Seam-Carving for Video Retargeting,&#8221; in Proceedings of IEEE Conference on Computer Vision and [...]]]></description>
			<content:encoded><![CDATA[<h3><a href="http://www.cc.gatech.edu/cpl/projects/videoretargeting/" target="_blank">Discontinuous Seam-Carving for Video Retargeting</a></h3>
<ul>
<li><a href="http://www.mgrundmann.com">Matthias Grundmann</a>, <a href="http://www.google.com/research/pubs/author38000.html">Vivek Kwatra</a>, <a href="http://research.google.com/pubs/author13553.html">Mei Han</a>, <a href="http://www.irfanessa.com">Irfan Essa</a> (2010) &#8220;<a href="http://www.cc.gatech.edu/cpl/projects/videoretargeting/" target="_blank">Discontinuous Seam-Carving for Video Retargeting</a>&#8221; in <em>Proceedings of </em><a href="http://cvl.umiacs.umd.edu/conferences/cvpr2010/" target="_blank"><em>IEEE Computer Vision and Pattern Recognition Conference (CVPR)</em></a>, San Francisco, CA, USA, June 2010 [<a href="http://www.cc.gatech.edu/cpl/projects/videoretargeting/cvpr2010_videoretargeting.pdf">PDF</a>][<a href="http://www.cc.gatech.edu/cpl/projects/videoretargeting/" target="_blank">Website</a>][DOI][<a href="http://www.youtube.com/watch?v=8qMovlLlr_0&amp;feature=player_embedded">Video (Youtube)</a>].</li>
</ul>
<div>
<ul class="papercite_bibliography">
<li>        Grundmann, Kwatra, Han, and Essa (2010), &#8220;Discontinuous Seam-Carving for Video Retargeting,&#8221; in <span style="font-style: italic">Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR)</span>,  2010.                             <a href="javascript:void(0)" id="papercite_1" class="papercite_toggle">[BIBTEX]</a>
<pre class="papercite_bibtex" id="papercite_1_block"><code>@inproceedings{2010-Grundmann-DSVR,
  Author = {M. Grundmann and V. Kwatra and M. Han and I. Essa},
  Booktitle = {Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  Date-Modified = {2011-12-08 21:27:48 +0000},
  Month = {June},
  Publisher = {IEEE Computer Society},
  Title = {Discontinuous Seam-Carving for Video Retargeting},
  Year = {2010}}</code></pre>
</li>
</ul>
</div>
<h4><strong style="text-align: center;">Abstract</strong></h4>
<p style="text-align: left;">We introduce a new algorithm for video retargeting that uses discontinuous seam-carving in both space and time for resizing videos. Our algorithm relies on a novel appearance-based temporal coherence formulation that allows for frame-by-frame processing and results in temporally discontinuous seams, as opposed to geometrically smooth and continuous seams. This formulation optimizes the difference in appearance of the resultant retargeted frame to the optimal temporally coherent one, and allows for carving around fast moving salient regions.</p>
<p style="text-align: left;">Additionally, we generalize the idea of appearance-based coherence to the spatial domain by introducing piece-wise spatial seams. Our spatial coherence measure minimizes the change in gradients during retargeting, which preserves spatial detail better than minimization of color difference alone. We also show that per-frame saliency (gradient- based or feature-based) does not always produce desirable retargeting results and propose a novel automatically computed measure of spatio-temporal saliency. As needed, a user may also augment the saliency by interactive region-brushing. Our retargeting algorithm processes the video sequentially, making it conducive for streaming applications.</p>
<p style="text-align: center;"><a href="http://www.cc.gatech.edu/cpl/projects/videoretargeting/"><img class="aligncenter" title="CVPR 2010 Video Retargeting Teaser" src="http://www.cc.gatech.edu/cpl/projects/videoretargeting/teaser.png" alt="Examples from our CVPR 2010 Paper" width="500" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2010/06/13/videoretargeting-cvp2010/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper in CVPR (2010): &#8220;Efficient Hierarchical Graph-Based Video Segmentation</title>
		<link>http://prof.irfanessa.com/2010/06/13/videosegmentation-cvpr2010/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=videosegmentation-cvpr2010</link>
		<comments>http://prof.irfanessa.com/2010/06/13/videosegmentation-cvpr2010/#comments</comments>
		<pubDate>Sun, 13 Jun 2010 14:59:24 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Computational Photography and Video]]></category>
		<category><![CDATA[Matthias Grundmann]]></category>
		<category><![CDATA[PAMI/ICCV/CVPR/ECCV]]></category>
		<category><![CDATA[Vivek Kwatra]]></category>
		<category><![CDATA[2010]]></category>
		<category><![CDATA[Computational Photography]]></category>
		<category><![CDATA[Computational Video]]></category>
		<category><![CDATA[Computer Vision]]></category>
		<category><![CDATA[CVPR]]></category>
		<category><![CDATA[Publications]]></category>
		<category><![CDATA[Video Segmentation]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=647</guid>
		<description><![CDATA[Matthias Grundmann, Vivek Kwatra, Mei Han, Irfan Essa (2010) &#8220;Efficient Hierarchical Graph-Based Video Segmentation&#8221; in Proceedings of IEEE Computer Vision and Pattern Recognition Conference (CVPR), San Francisco, CA, USA, June 2010 [PDF][Website][DOI][Video (Youtube)]. Abstract We present an efficient and scalable technique for spatio- temporal segmentation of long video sequences using a hierarchical graph-based algorithm. We begin by over- [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.mgrundmann.com">Matthias Grundmann</a>, <a href="http://www.google.com/research/pubs/author38000.html">Vivek Kwatra</a>, <a href="http://research.google.com/pubs/author13553.html">Mei Han</a>, <a href="http://www.irfanessa.com">Irfan Essa</a> (2010) <a href="http://www.cc.gatech.edu/cpl/projects/videosegmentation/">&#8220;Efficient Hierarchical Graph-Based Video Segmentation</a>&#8221; in <em>Proceedings of </em><a href="http://cvl.umiacs.umd.edu/conferences/cvpr2010/" target="_blank"><em>IEEE Computer Vision and Pattern Recognition Conference (CVPR)</em></a>, San Francisco, CA, USA, June 2010 [<a href="http://www.cc.gatech.edu/cpl/projects/videosegmentation/cvpr2010_videosegmentation.pdf" target="_blank">PDF</a>][<a href="http://www.cc.gatech.edu/cpl/projects/videosegmentation/" target="_blank">Website</a>][DOI][<a href="http://www.youtube.com/watch?v=juDvLrFQF0U" target="_blank">Video</a> (Youtube)].</p>
<p style="text-align: center;"><strong>Abstract</strong></p>
<p style="text-align: justify;">We present an efficient and scalable technique for spatio- temporal segmentation of long video sequences using a hierarchical graph-based algorithm. We begin by over- segmenting a volumetric video graph into space-time regions grouped by appearance. We then construct a &#8220;region graph” over the obtained segmentation and iteratively repeat this process over multiple levels to create a tree of spatio-temporal segmentations. This hierarchical approach generates high quality segmentations, which are temporally coherent with stable region boundaries, and allows subse- quent applications to choose from varying levels of granularity. We further improve segmentation quality by using dense optical flow to guide temporal connections in the initial graph.</p>
<p style="text-align: justify;">We also propose two novel approaches to improve the scalability of our technique: (a) a parallel out- of-core algorithm that can process volumes much larger than an in-core algorithm, and (b) a clip-based process- ing algorithm that divides the video into overlapping clips in time, and segments them successively while enforcing consistency.</p>
<p style="text-align: justify;">We demonstrate hierarchical segmentations on video shots as long as 40 seconds, and even support a streaming mode for arbitrarily long videos, albeit without the ability to process them hierarchically.</p>
<p style="text-align: center;"><a href="http://www.cc.gatech.edu/cpl/projects/videosegmentation/"><img class="aligncenter" src="http://prof.irfanessa.com/wp-content/uploads/2010/08/teaser.png" alt="VideoSegmentation Teaser" width="500" height="156" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2010/06/13/videosegmentation-cvpr2010/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper in CVPR (2010): &#8220;Player Localization Using Multiple Static Cameras for Sports Visualization&#8221;</title>
		<link>http://prof.irfanessa.com/2010/06/13/playerlocalization-cvpr2010/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=playerlocalization-cvpr2010</link>
		<comments>http://prof.irfanessa.com/2010/06/13/playerlocalization-cvpr2010/#comments</comments>
		<pubDate>Sun, 13 Jun 2010 14:22:44 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Activity Recognition]]></category>
		<category><![CDATA[Jessica Hodgins]]></category>
		<category><![CDATA[Kihwan Kim]]></category>
		<category><![CDATA[Matthias Grundmann]]></category>
		<category><![CDATA[Numerical Machine Learning]]></category>
		<category><![CDATA[PAMI/ICCV/CVPR/ECCV]]></category>
		<category><![CDATA[Raffay Hamid]]></category>
		<category><![CDATA[Sports Visualization]]></category>
		<category><![CDATA[2010]]></category>
		<category><![CDATA[Computational Video]]></category>
		<category><![CDATA[Computer Vision]]></category>
		<category><![CDATA[CVPR]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=675</guid>
		<description><![CDATA[Raffay Hamid, Ram Krishan Kumar, Matthias Grundmann, Kihwan Kim, Irfan Essa, Jessica Hodgins (2010), &#8220;Player Localization Using Multiple Static Cameras for Sports Visualization&#8221; In Proceedings of IEEE Computer Vision and Pattern Recognition Conference (CVPR), San Francisco, CA, USA, June 2010 [PDF][Website][DOI][Video (Youtube)]. Abstract We present a novel approach for robust localization of multiple people observed using multiple [...]]]></description>
			<content:encoded><![CDATA[<p>Raffay Hamid, Ram Krishan Kumar, Matthias Grundmann, Kihwan Kim, Irfan Essa, Jessica Hodgins (2010), &#8220;Player Localization Using Multiple Static Cameras for Sports Visualization&#8221; In <em>Proceedings of </em><a href="http://cvl.umiacs.umd.edu/conferences/cvpr2010/" target="_blank"><em>IEEE Computer Vision and Pattern Recognition Conference (CVPR)</em></a>, San Francisco, CA, USA, June 2010 [<a href="http://www.raffayhamid.com/hamid_cvpr2010.pdf" target="_blank">PDF</a>][<a href="http://www.raffayhamid.com/sports_viz.shtml" target="_blank">Website</a>][DOI][<a href="http://www.youtube.com/watch?v=VwzAMi9pUDQ&amp;feature=player_embedded" target="_blank">Video (Youtube)</a>].</p>
<p style="text-align: center;"><strong>Abstract</strong></p>
<p style="text-align: justify;">We present a novel approach for robust localization of multiple people observed using multiple cameras. We usethis location information to generate sports visualizations,which include displaying a virtual offside line in soccer games, and showing players&#8217; positions and motion patterns.Our main contribution is the modeling and analysis for the problem of fusing corresponding players&#8217; positional informationas finding minimum weight K-length cycles in complete K-partite graphs. To this end, we use a dynamic programmingbased approach that varies over a continuum of being maximally to minimally greedy in terms of the numberof paths explored at each iteration. We present an end-to-end sports visualization framework that employs our proposed algorithm-class. We demonstrate the robustness of our framework by testing it on 60; 000 frames of soccerfootage captured over 5 different illumination conditions, play types, and team attire.</p>
<p style="text-align: center;"><a href="http://prof.irfanessa.com/wp-content/uploads/2010/08/2010-Hamid-CVPR2010.png"><img class="aligncenter size-full wp-image-676" title="2010-Hamid-CVPR2010" src="http://prof.irfanessa.com/wp-content/uploads/2010/08/2010-Hamid-CVPR2010.png" alt="Teaser Image from CVPR 2010 paper" width="500" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2010/06/13/playerlocalization-cvpr2010/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>CVPR 2010: Accepted Papers</title>
		<link>http://prof.irfanessa.com/2010/04/01/cvpr-2010-accepted-papers/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=cvpr-2010-accepted-papers</link>
		<comments>http://prof.irfanessa.com/2010/04/01/cvpr-2010-accepted-papers/#comments</comments>
		<pubDate>Thu, 01 Apr 2010 15:31:12 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Activity Recognition]]></category>
		<category><![CDATA[Computational Photography and Video]]></category>
		<category><![CDATA[Jessica Hodgins]]></category>
		<category><![CDATA[Kihwan Kim]]></category>
		<category><![CDATA[Matthias Grundmann]]></category>
		<category><![CDATA[PAMI/ICCV/CVPR/ECCV]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[Vivek Kwatra]]></category>
		<category><![CDATA[2010]]></category>
		<category><![CDATA[CVPR]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=645</guid>
		<description><![CDATA[We have the following 4 papers that have been accepted for publications in IEEE CVPR 2010. More details forthcoming, with links to more details. Matthias Grundmann, Vivek Kwatra, Mei Han, and Irfan Essa (2010) &#8220;Discontinuous Seam-Carving for Video Retargeting&#8221; (a GA Tech, Google Collaboration) Matthias Grundmann, Vivek Kwatra, Mei Han, and Irfan Essa (2010) &#8220;Efficient Hierarchical Graph-Based Video [...]]]></description>
			<content:encoded><![CDATA[<div>We have the following 4 papers that have been accepted for publications in IEEE <a href="http://www.cvpr2010.org" target="_blank">CVPR 2010</a>. More details forthcoming, with links to more details.</div>
<ul>
<li>Matthias Grundmann, Vivek Kwatra, Mei Han, and Irfan Essa (2010) &#8220;Discontinuous Seam-Carving for Video Retargeting&#8221; (a GA Tech, Google Collaboration)</li>
<li>Matthias Grundmann, Vivek Kwatra, Mei Han, and Irfan Essa (2010) &#8220;Efficient Hierarchical Graph-Based Video Segmentation&#8221; (a GA Tech, Google Collaboration)</li>
<li>Kihwan Kim, Matthias Grundmann, Ariel Shamir, Iain Matthews, Jessica Hodgins, and Irfan Essa (2010) &#8220;Motion Fields to Predict Play Evolution in Dynamic Sport Scenes&#8221; (a GA Tech, Disney Collaboration)</li>
<li>Raffay Hamid, Ramkrishan Kumar, Matthias Grundmann, Kihwan Kim, Irfan Essa, and Jessica Hodgins (2010) &#8220;Player Localization Using Multiple Static Cameras for Sports Visualization&#8221; (a GA Tech, Disney Collaboration)</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2010/04/01/cvpr-2010-accepted-papers/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>EVENT: CVPR 2009 Decisions are Announced.</title>
		<link>http://prof.irfanessa.com/2009/02/25/event-cvpr-2009-decisions/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=event-cvpr-2009-decisions</link>
		<comments>http://prof.irfanessa.com/2009/02/25/event-cvpr-2009-decisions/#comments</comments>
		<pubDate>Wed, 25 Feb 2009 13:27:59 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Events]]></category>
		<category><![CDATA[PAMI/ICCV/CVPR/ECCV]]></category>
		<category><![CDATA[2009]]></category>
		<category><![CDATA[CVPR]]></category>

		<guid isPermaLink="false">http://academics.irfanessa.com/?p=433</guid>
		<description><![CDATA[CVPR 2009 site is http://www.cvpr2009.org. (see ORALs and  POSTERs) Decisions were announced after the AC Meeting. See the stats Registration is OPEN (http://www.cvpr2009.org/registration) Hotel Registration to OPEN soon (http://www.cvpr2009.org/travel-and-hotel)]]></description>
			<content:encoded><![CDATA[<ul>
<li>CVPR 2009 site is <a href="http://www.cvpr2009.org/paper-reviewing" target="_blank">http://www.cvpr2009.org</a>. (see <a href="http://www.cvpr2009.org/orals" target="_blank">ORALs</a> and  <a href="http://www.cvpr2009.org/posters" target="_blank">POSTERs</a>)</li>
<li>Decisions were announced after the AC Meeting. See the <a href="http://www.cvpr2009.org/stats">stats</a></li>
<li>Registration is OPEN (<a href="http://www.cvpr2009.org/registration">http://www.cvpr2009.org/registration</a>)</li>
<li>Hotel Registration to OPEN soon (<a href="http://www.cvpr2009.org/travel-and-hotel">http://www.cvpr2009.org/travel-and-hotel</a>)</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2009/02/25/event-cvpr-2009-decisions/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper: IEEE CVPR (2007) &#8220;Tree-based Classifiers for Bilayer Video Segmentation&#8221;</title>
		<link>http://prof.irfanessa.com/2007/06/17/paper-ieee-cvpr-2007-tree-based-classifiers-for-bilayer-video-segmentation/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=paper-ieee-cvpr-2007-tree-based-classifiers-for-bilayer-video-segmentation</link>
		<comments>http://prof.irfanessa.com/2007/06/17/paper-ieee-cvpr-2007-tree-based-classifiers-for-bilayer-video-segmentation/#comments</comments>
		<pubDate>Sun, 17 Jun 2007 15:18:24 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[0205507]]></category>
		<category><![CDATA[Antonio Crimisini]]></category>
		<category><![CDATA[Computational Photography and Video]]></category>
		<category><![CDATA[Funding]]></category>
		<category><![CDATA[John Winn]]></category>
		<category><![CDATA[Numerical Machine Learning]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[Pei Yin]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[2007]]></category>
		<category><![CDATA[Computational Photography]]></category>
		<category><![CDATA[Computer Vision]]></category>
		<category><![CDATA[CVPR]]></category>

		<guid isPermaLink="false">http://academics.irfanessa.com/2007/06/17/paper-ieee-cvpr-2007-tree-based-classifiers-for-bilayer-video-segmentation/</guid>
		<description><![CDATA[Yin, Pei Criminisi, Antonio Winn, John Essa, Irfan (2007), Tree-based Classifiers for Bilayer Video Segmentation In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2007. CVPR &#8217;07, 17-22 June 2007, page(s): 1 &#8211; 8, Location: Minneapolis, MN, USA, ISBN: 1-4244-1180-7, Digital Object Identifier: 10.1109/CVPR.2007.383008 Abstract This paper presents an algorithm for the automatic segmentation of monocular videos [...]]]></description>
			<content:encoded><![CDATA[<p>Yin, Pei   Criminisi, Antonio   Winn, John   Essa, Irfan (2007), <a href="http://ieeexplore.ieee.org/search/srchabstract.jsp?arnumber=4270033&amp;isnumber=4269956&amp;punumber=4269955&amp;k2dockey=4270033@ieeecnfs&amp;query=%28%28essa%29%3Cin%3Eau+%29&amp;pos=6">Tree-based Classifiers for Bilayer Video Segmentation</a> In Proceedings of <em>IEEE Conference on Computer Vision and Pattern Recognition, 2007. CVPR &#8217;07</em>, 17-22 June 2007, page(s): 1 &#8211; 8, Location: Minneapolis, MN, USA, ISBN: 1-4244-1180-7, Digital Object Identifier: 10.1109/CVPR.2007.383008</p>
<p align="center"><strong>Abstract</strong></p>
<p style="text-align: justify;">This paper presents an algorithm for the automatic segmentation of monocular videos into foreground and background layers. Correct segmentations are produced even in the presence of large background motion with nearly stationary foreground. There are three key contributions. The first is the introduction of a novel motion representation, &#8220;motons&#8221;, inspired by research in object recognition. Second, we propose learning the segmentation likelihood from the spatial context of motion. The learning is efficiently performed by Random Forests. The third contribution is a general taxonomy of tree-based classifiers, which facilitates theoretical and experimental comparisons of several known classification algorithms, as well as spawning new ones. Diverse visual cues such as motion, motion context, colour, contrast and spatial priors are fused together by means of a Conditional Random Field (CRF) model. Segmentation is then achieved by binary min-cut. Our algorithm requires no initialization. Experiments on many video-chat type sequences demonstrate the effectiveness of our algorithm in a variety of scenes. The segmentation results are comparable to those obtained by stereo systems.</p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2007/06/17/paper-ieee-cvpr-2007-tree-based-classifiers-for-bilayer-video-segmentation/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper: IEEE CVPR (2004) &#8220;Asymmetrically boosted HMM for speech reading&#8221;</title>
		<link>http://prof.irfanessa.com/2004/06/02/ieeexplore-asymmetrically-boosted-hmm-for-speech-reading/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=ieeexplore-asymmetrically-boosted-hmm-for-speech-reading</link>
		<comments>http://prof.irfanessa.com/2004/06/02/ieeexplore-asymmetrically-boosted-hmm-for-speech-reading/#comments</comments>
		<pubDate>Wed, 02 Jun 2004 22:46:44 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[0205507]]></category>
		<category><![CDATA[Funding]]></category>
		<category><![CDATA[James Rehg]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[Pei Yin]]></category>
		<category><![CDATA[2004]]></category>
		<category><![CDATA[Computer Vision]]></category>
		<category><![CDATA[CVPR]]></category>
		<category><![CDATA[Faces]]></category>
		<category><![CDATA[NSF]]></category>

		<guid isPermaLink="false">http://academics.irfanessa.com/2004/06/02/ieeexplore-asymmetrically-boosted-hmm-for-speech-reading/</guid>
		<description><![CDATA[Pei Yin Essa, I. Rehg, J.M. (2004) &#8220;Asymmetrically boosted HMM for speech reading,&#8221;, In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004 (CVPR 2004). Publication Date: 27 June-2 July 2004, Volume: 2, On page(s): II-755 &#8211; II-761 Vol.2 ISSN: 1063-6919, ISBN: 0-7695-2158-, INSPEC Accession Number:8161546, Digital Object Identifier: 10.1109/CVPR.2004.1315240 [...]]]></description>
			<content:encoded><![CDATA[<p>Pei Yin   Essa, I.   Rehg, J.M. (2004) &#8220;<a href="http://ieeexplore.ieee.org/search/srchabstract.jsp?arnumber=1315240&amp;isnumber=29134&amp;punumber=9183&amp;k2dockey=1315240@ieeecnfs&amp;query=%28%28essa%29%3Cin%3Eau+%29&amp;pos=22">Asymmetrically boosted HMM for speech reading</a>,&#8221;, In <em>Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004 (CVPR 2004)</em>. Publication Date: 27 June-2 July 2004, Volume: 2, On page(s): II-755 &#8211; II-761 Vol.2 ISSN: 1063-6919, ISBN: 0-7695-2158-, INSPEC Accession Number:8161546, Digital Object Identifier: 10.1109/CVPR.2004.1315240</p>
<p align="center"><strong>Abstract</strong></p>
<p style="text-align: justify;">Speech reading, also known as lip reading, is aimed at extracting visual cues of lip and facial movements to aid in recognition of speech. The main hurdle for speech reading is that visual measurements of lip and facial motion lack information-rich features like the Mel frequency cepstral coefficients (MFCC), widely used in acoustic speech recognition. These MFCC are used with hidden Markov models (HMM) in most speech recognition systems at present. Speech reading could greatly benefit from automatic selection and formation of informative features from measurements in the visual domain. These new features can then be used with HMM to capture the dynamics of lip movement and eventual recognition of lip shapes. Towards this end, we use AdaBoost methods for automatic visual feature formation. Specifically, we design an asymmetric variant of AdaBoost M2 algorithm to deal with the ill-posed multi-class sample distribution inherent in our problem. Our experiments show that the boosted HMM approach outperforms conventional AdaBoost and HMM classifiers. Our primary contributions are in the design of (a) boosted HMM and (b) asymmetric multi-class boosting.</p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2004/06/02/ieeexplore-asymmetrically-boosted-hmm-for-speech-reading/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Served from: prof.irfanessa.com @ 2012-02-05 14:20:15 -->
