<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>prof.irfanessa.com &#187; Computer Vision</title>
	<atom:link href="http://prof.irfanessa.com/tag/computer-vision/feed/" rel="self" type="application/rss+xml" />
	<link>http://prof.irfanessa.com</link>
	<description>Irfan Essa&#039;s Academic Activities</description>
	<lastBuildDate>Wed, 25 Jan 2012 23:42:09 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	
		<item>
		<title>Kihwan Kim&#8217;s Thesis Defense (2011): &#8220;Spatio-temporal Data Interpolation for Dynamic Scene Analysis&#8221;</title>
		<link>http://prof.irfanessa.com/2011/12/06/kihwan-kims-phd2011/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=kihwan-kims-phd2011</link>
		<comments>http://prof.irfanessa.com/2011/12/06/kihwan-kims-phd2011/#comments</comments>
		<pubDate>Tue, 06 Dec 2011 19:05:33 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Computational Photography and Video]]></category>
		<category><![CDATA[Kihwan Kim]]></category>
		<category><![CDATA[Modeling and Animation]]></category>
		<category><![CDATA[Multimedia]]></category>
		<category><![CDATA[PhD]]></category>
		<category><![CDATA[Security]]></category>
		<category><![CDATA[Visual Surviellance]]></category>
		<category><![CDATA[WWW]]></category>
		<category><![CDATA[2011]]></category>
		<category><![CDATA[Computational Photography]]></category>
		<category><![CDATA[Computational Video]]></category>
		<category><![CDATA[Computer Vision]]></category>
		<category><![CDATA[Thesis]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=1127</guid>
		<description><![CDATA[Spatio-temporal Data Interpolation for Dynamic Scene Analysis Kihwan Kim, PhD Candidate School of Interactive Computing, College of Computing, Georgia Institute of Technology Date: Tuesday, December 6, 2011 Time: 1:00 pm – 3:00 pm EST Location: Technology Square Research Building (TSRB) Room 223 Abstract Analysis and visualization of dynamic scenes is often constrained by the amount of spatio-temporal [...]]]></description>
			<content:encoded><![CDATA[<h4><a href="http://prof.irfanessa.com/2011/12/06/kihwan-kims-phd2011/image/" rel="attachment wp-att-1130"><img class=" wp-image-1130 alignright" title="Kihwan Kim PhD" src="http://prof.irfanessa.com/wp-content/uploads/2011/12/image-259x300.jpg" alt="" width="181" height="210" /></a>Spatio-temporal Data Interpolation for Dynamic Scene Analysis</h4>
<p>Kihwan Kim, PhD Candidate</p>
<p>School of Interactive Computing, College of Computing, Georgia Institute of Technology</p>
<p>Date: Tuesday, December 6, 2011</p>
<p>Time: 1:00 pm – 3:00 pm EST</p>
<p>Location: Technology Square Research Building (TSRB) Room 223</p>
<p><strong>Abstract</strong></p>
<p>Analysis and visualization of dynamic scenes is often constrained by the amount of spatio-temporal information available from the environment. In most scenarios, we have to account for incomplete information and sparse motion data, requiring us to employ interpolation and approximation methods to fill for the missing information. Scattered data interpolation and approximation techniques have been widely used for solving the problem of completing surfaces and images with incomplete input data. We introduce approaches for such data interpolation and approximation from limited sensors, into the domain of analyzing and visualizing dynamic scenes. Data from dynamic scenes is subject to constraints due to the spatial layout of the scene and/or the configurations of video cameras in use. Such constraints include: (1) sparsely available cameras observing the scene, (2) limited field of view provided by the cameras in use, (3) incomplete motion at a specific moment, and (4) varying frame rates due to different exposures and resolutions.</p>
<p>In this thesis, we establish these forms of incompleteness in the scene, as spatio- temporal uncertainties, and propose solutions for resolving the uncertainties by applying scattered data approximation into a spatio-temporal domain.</p>
<p>The main contributions of this research are as follows: First, we provide an effi- cient framework to visualize large-scale dynamic scenes from distributed static videos. Second, we adopt Radial Basis Function (RBF) interpolation to the spatio-temporal domain to generate global motion tendency. The tendency, represented by a dense flow field, is used to optimally pan and tilt a video camera. Third, we propose a method to represent motion trajectories using stochastic vector fields. Gaussian Pro- cess Regression (GPR) is used to generate a dense vector field and the certainty of each vector in the field. The generated stochastic fields are used for recognizing motion patterns under varying frame-rate and incompleteness of the input videos. Fourth, we also show that the stochastic representation of vector field can also be used for modeling global tendency to detect the region of interests in dynamic scenes with camera motion. We evaluate and demonstrate our approaches in several applications for visualizing virtual cities, automating sports broadcasting, and recognizing traffic patterns in surveillance videos.</p>
<p>Committee:</p>
<ul>
<li>Prof. Irfan Essa (Advisor, School of Interactive Computing, Georgia Institute of Technology)</li>
<li>Prof. James M. Rehg (School of Interactive Computing, Georgia Institute of Technology)</li>
<li>Prof. Thad Starner (School of Interactive Computing, Georgia Institute of Technology)</li>
<li>Prof. Greg Turk (School of Interactive Computing, Georgia Institute of Technology)</li>
<li>Prof. Jessica K. Hodgins (Robotics Institute, Carnegie Mellon University, and Disney Research Pittsburgh)</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2011/12/06/kihwan-kims-phd2011/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper in ICCV 2011: &#8220;Gaussian Process Regression Flow for Analysis of Motion Trajectories&#8221;</title>
		<link>http://prof.irfanessa.com/2011/10/28/gprf-iccv2011/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=gprf-iccv2011</link>
		<comments>http://prof.irfanessa.com/2011/10/28/gprf-iccv2011/#comments</comments>
		<pubDate>Fri, 28 Oct 2011 13:21:25 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Activity Recognition]]></category>
		<category><![CDATA[DARPA]]></category>
		<category><![CDATA[Kihwan Kim]]></category>
		<category><![CDATA[PAMI/ICCV/CVPR/ECCV]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[2011]]></category>
		<category><![CDATA[Computational Video]]></category>
		<category><![CDATA[Computer Vision]]></category>
		<category><![CDATA[ICCV]]></category>
		<category><![CDATA[Publications]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=786</guid>
		<description><![CDATA[Gaussian Process Regression Flow for Analysis of Motion Trajectories Kim, Lee, and Essa (2011), “Gaussian Process Regression Flow for Analysis of Motion Trajectories,” in Proceedings of IEEE International Conference on Computer Vision (ICCV), 2011. [PDF] [WEBSITE] [VIDEO] [BIBTEX] @inproceedings{Kim2011-GPRF, Author = {K. Kim and D. Lee and I. Essa}, Booktitle = {Proceedings of IEEE International Conference on Computer Vision [...]]]></description>
			<content:encoded><![CDATA[<h3 class="p1" style="text-align: justify;">Gaussian Process Regression Flow for Analysis of Motion Trajectories</h3>
<ul>
<li>Kim, Lee, and Essa (2011), “Gaussian Process Regression Flow for Analysis of Motion Trajectories,” in Proceedings of IEEE International Conference on Computer Vision (ICCV), 2011. <a title="PDF" href="http://www.kihwan23.com/papers/ICCV2011/gprf_iccv2011.pdf">[PDF]</a> <a title="Project Website" href="http://www.cc.gatech.edu/cpl/projects/gprf/">[WEBSITE]</a> <a title="VIDEO" href="http://www.youtube.com/watch?v=UtLr37hDQz0">[VIDEO]</a> <a id="papercite_2" class="papercite_toggle" href="javascript:void(0)">[BIBTEX]</a>
<pre id="papercite_2_block" class="papercite_bibtex"><code> @inproceedings{Kim2011-GPRF, Author = {K. Kim and D. Lee and I. Essa}, Booktitle = {Proceedings of IEEE International Conference on Computer Vision (ICCV)}, Month = {November}, Pdf = {http://www.cc.gatech.edu/~irfan/p/2011-Kim-GPRFAMT.pdf}, Publisher = {IEEE Computer Society}, Title = {Gaussian Process Regression Flow for Analysis of Motion Trajectories}, Url = {http://www.cc.gatech.edu/cpl/projects/gprf/}, Video = {http://www.youtube.com/watch?v=UtLr37hDQz0}, Year = {2011}}</code></pre>
</li>
</ul>
<p><strong style="text-align: justify;">Abstract</strong></p>
<p style="text-align: left;">Analysis and Recognition of motions and activities of objects in videos requires effective representations for analysis and matching of motion trajectories. In this paper, we introduce a new representation speciﬁcally aimed at matching motion trajectories. We model a trajectory as a continuous dense ﬂow ﬁeld from a sparse set of vector sequences using Gaussian Process Regression. Furthermore, we introduce a random sampling strategy for learning stable classes of motions from limited data.</p>
<p style="text-align: left;">Our representation allows for incrementally predicting possible paths and detecting anomalous events from online trajectories. This representation also supports matching of complex motions with acceleration changes and pauses or stops within a trajectory. We use the proposed approach for classifying and predicting motion trajectories in trafﬁc monitoring domains and test on several data sets. We show that our approach works well on various types of complete and incomplete trajectories from a variety of video data sets with different frame rates</p>
<p style="text-align: center;">
<p><a href="http://www.youtube.com/watch?v=UtLr37hDQz0&#038;fmt=18">http://www.youtube.com/watch?v=UtLr37hDQz0</a></p></p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2011/10/28/gprf-iccv2011/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>In the News (2011): &#8220;Shake it like an Instagram picture — Online Video News&#8221;</title>
		<link>http://prof.irfanessa.com/2011/09/15/live-on-youtube-2/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=live-on-youtube-2</link>
		<comments>http://prof.irfanessa.com/2011/09/15/live-on-youtube-2/#comments</comments>
		<pubDate>Thu, 15 Sep 2011 22:03:05 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Collaborators]]></category>
		<category><![CDATA[Computational Photography and Video]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[In The News]]></category>
		<category><![CDATA[Matthias Grundmann]]></category>
		<category><![CDATA[Vivek Kwatra]]></category>
		<category><![CDATA[WWW]]></category>
		<category><![CDATA[2011]]></category>
		<category><![CDATA[Computational Photography]]></category>
		<category><![CDATA[Computational Video]]></category>
		<category><![CDATA[Computer Vision]]></category>
		<category><![CDATA[Video Stabilization]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=807</guid>
		<description><![CDATA[YouTube effects: Shake it like an Instagram picture via YouTube effects: Shake it like an Instagram picture — Online Video News. YouTube users can now apply a number of Instagram-like effects to their videos, giving them a cartoonish or Lomo-like look with the click of a button. The effects are part of a new editing feature that [...]]]></description>
			<content:encoded><![CDATA[<h3>YouTube effects: Shake it like an Instagram picture</h3>
<p>via <a href="http://gigaom.com/video/youtube-image-stabilization/">YouTube effects: Shake it like an Instagram picture — Online Video News</a>.</p>
<blockquote><p>YouTube users can now apply a number of Instagram-like effects to their videos, giving them a cartoonish or Lomo-like look with the click of a button. The effects are part of a new editing feature that also includes cropping and advanced image stabilization.</p>
<p>Taking the shaking out of video uploads should go a long way towards making some of the amateur footage captured on mobile phones more watchable, but it can also be resource-intensive — which is why Google’s engineers invented an entirely new approach toward image stabilization.</p>
<p>The new editing functionality will be part of YouTube’s video page, where a new “Edit video” button will offer access to filters and other editing functionality. This type of post-processing is separate from YouTube’s video editor, which allows to produce new videos based on existing clips.</p>
<p style="text-align: center;">
<p><a href="http://www.youtube.com/watch?v=G-n9p28Yh8w&#038;fmt=18">http://www.youtube.com/watch?v=G-n9p28Yh8w</a></p>
</p>
<p style="text-align: center;">
<p><a href="http://www.youtube.com/watch?v=i5keG1Y810U&#038;fmt=18">http://www.youtube.com/watch?v=i5keG1Y810U</a></p>
</p>
</blockquote>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2011/09/15/live-on-youtube-2/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Funding (2011) NSF (1146352) &#8220;EAGER: Linguistic Task Transfer for Humans and Cyber Systems&#8221;</title>
		<link>http://prof.irfanessa.com/2011/09/01/nsf-2011-eager/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=nsf-2011-eager</link>
		<comments>http://prof.irfanessa.com/2011/09/01/nsf-2011-eager/#comments</comments>
		<pubDate>Thu, 01 Sep 2011 21:04:10 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Activity Recognition]]></category>
		<category><![CDATA[Mike Stilman]]></category>
		<category><![CDATA[NSF]]></category>
		<category><![CDATA[Robotics]]></category>
		<category><![CDATA[2011]]></category>
		<category><![CDATA[Computer Vision]]></category>
		<category><![CDATA[Funding]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=814</guid>
		<description><![CDATA[EAGER: Linguistic Task Transfer for Humans and Cyber Systems (Mike Stillman, Irfan Essa) NSF/RI This project, investigating formal languages as a general methodology for task transfer between distinct cyber-physical systems such as humans and robots, aims to expand the science of cyber physical systems by developing Motion Grammars that will enable task transfer between distinct [...]]]></description>
			<content:encoded><![CDATA[<h3>EAGER: Linguistic Task Transfer for Humans and Cyber Systems (Mike Stillman, Irfan Essa) NSF/RI</h3>
<blockquote><p>This project, investigating formal languages as a general methodology for task transfer between distinct cyber-physical systems such as humans and robots, aims to expand the science of cyber physical systems by developing Motion Grammars that will enable task transfer between distinct systems.</p>
<p>Formal languages are tools for encoding, describing and transferring structured knowledge. In natural language, the latter process is called communication. Similarly, we will develop a formal language through which arbitrary cyber-physical systems communicate tasks via structured actions. This investigation of Motion Grammars will contribute to the science of human cognition and the engineering of cyber-physical algorithms. By observing human activities during manipulation we will develop a novel class of hybrid control algorithms based on linguistic representations of task execution. These algorithms will broaden the capabilities of man-made systems and provide the infrastructure for motion transfer between humans, robots and broader systems in a generic context. Furthermore, the representation in a rigorous grammatical context will enable formal verification and validation in future work.<br />
<strong>Broader Impacts</strong>: The proposed research has direct applications to new solutions for manufacturing, medical treatments such as surgery, logistics and food processing. In turn, each of these areas has a significant impact on the efficiency and convenience of our daily lives. The PIs serve as coordinators of graduate/undergraduate programs and mentors to community schools. In order to guarantee that women and minorities have a significant role in the research, the PIs will annually invite K-12 students from Atlanta schools with primarily African American populations to the laboratories. One-day robot classes will be conducted that engage students in the excitement of hands-on science by interactively using lab equipment to transfer their manipulation skills to a robot arm.</p></blockquote>
<p>Via <a href="http://www.nsf.gov/awardsearch/showAward.do?AwardNumber=1146352">Award#1146352 &#8211; EAGER: Linguistic Task Transfer for Humans and Cyber Systems</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2011/09/01/nsf-2011-eager/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>DEMO (2011): Auto-Directed Video Stabilization with Robust L1 Optimal Camera Paths &#8211; from Google Research Blog</title>
		<link>http://prof.irfanessa.com/2011/06/20/videostabilization-youtube/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=videostabilization-youtube</link>
		<comments>http://prof.irfanessa.com/2011/06/20/videostabilization-youtube/#comments</comments>
		<pubDate>Mon, 20 Jun 2011 22:17:47 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Computational Photography and Video]]></category>
		<category><![CDATA[In The News]]></category>
		<category><![CDATA[Matthias Grundmann]]></category>
		<category><![CDATA[Mobile Computing]]></category>
		<category><![CDATA[PAMI/ICCV/CVPR/ECCV]]></category>
		<category><![CDATA[Vivek Kwatra]]></category>
		<category><![CDATA[2011]]></category>
		<category><![CDATA[Computational Photography]]></category>
		<category><![CDATA[Computational Video]]></category>
		<category><![CDATA[Computer Vision]]></category>
		<category><![CDATA[CVPR]]></category>
		<category><![CDATA[Video Stabilization]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=738</guid>
		<description><![CDATA[via Auto-Directed Video Stabilization with Robust L1 Optimal Camera Paths &#8211; Google Research Blog. Auto-Directed Video Stabilization with Robust L1 Optimal Camera Paths Posted by Matthias Grundmann, Vivek Kwatra, and Irfan Essa, Earlier this year, we announced the launch of new features on the YouTube Video Editor, including stabilization for shaky videos, with the ability to preview them in [...]]]></description>
			<content:encoded><![CDATA[<p>via <a href="http://googleresearch.blogspot.com/2011/06/auto-directed-video-stabilization-with.html">Auto-Directed Video Stabilization with Robust L1 Optimal Camera Paths &#8211; Google Research Blog</a>.</p>
<blockquote>
<h4>Auto-Directed Video Stabilization with Robust L1 Optimal Camera Paths<br />
Posted by <a href="http://research.google.com/pubs/author38919.html">Matthias Grundmann</a>, <a href="http://research.google.com/pubs/author38000.html">Vivek Kwatra</a>, and <a href="http://www.irfanessa.com/Work/Welcome.html">Irfan Essa</a>,</h4>
<p>Earlier this year, we announced the launch of new features on the YouTube Video Editor, including stabilization for shaky videos, with the ability to preview them in real-time. The core technology behind this feature is detailed in this paper, which will be presented at the IEEE International Conference on Computer Vision and Pattern Recognition (CVPR 2011).</p>
<p>Casually shot videos captured by handheld or mobile cameras suffer from significant amount of shake. Existing in-camera stabilization methods dampen high-frequency jitter but do not suppress low-frequency movements and bounces, such as those observed in videos captured by a walking person. On the other hand, most professionally shot videos usually consist of carefully designed camera configurations, using specialized equipment such as tripods or camera dollies, and employ ease-in and ease-out for transitions. Our goal was to devise a completely automatic method for converting casual shaky footage into more pleasant and professional looking videos.</p>
<p style="text-align: center;">
<p><a href="http://www.youtube.com/watch?v=0MiY-PNy-GU&#038;fmt=18">http://www.youtube.com/watch?v=0MiY-PNy-GU</a></p>
</p>
<p>Our technique mimics the cinematographic principles outlined above by automatically determining the best camera path using a robust optimization technique. The original, shaky camera path is divided into a set of segments, each approximated by either a constant, linear or parabolic motion. Our optimization finds the best of all possible partitions using a computationally efficient and stable algorithm.</p>
<p>To achieve real-time performance on the web, we distribute the computation across multiple machines in the cloud. This enables us to provide users with a real-time preview and interactive control of the stabilized result. Above we provide a video demonstration of how to use this feature on the YouTube Editor. We will also demo this live at Google’s exhibition booth in CVPR 2011.</p></blockquote>
<p>For more details see the <a href="http://www.cc.gatech.edu/cpl/projects/videostabilization/">Project Site</a>. See the <a href="http://www.youtube.com/watch?v=0MiY-PNy-GU&amp;feature=player_embedded">youtube video of the system on youtube</a>. See the <a href="http://www.cc.gatech.edu/cpl/projects/videostabilization/stabilization.pdf">paper in PDF</a>, and a <a href="http://www.youtube.com/watch?v=i5keG1Y810U&amp;feature=player_embedded">technical video of the work</a>.</p>
<p>Full paper is</p>
<ul>
<li><a href="http://research.google.com/pubs/author38919.html">Matthias Grundmann</a>, <a href="http://research.google.com/pubs/author38000.html">Vivek Kwatra</a>, and <a href="http://www.irfanessa.com/Work/Welcome.html">Irfan Essa</a> (2011), &#8220;Auto-Directed Video Stabilization with Robust L1 Optimal Camera Paths,&#8221; In Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition (CVPR 2011), Colorado Springs, CO, USA. [<a href="http://www.cc.gatech.edu/cpl/projects/videostabilization/stabilization.pdf">PDF</a>][<a href="http://www.youtube.com/watch?v=i5keG1Y810U&amp;feature=player_embedded">Video</a>][<a href="http://googleresearch.blogspot.com/2011/06/auto-directed-video-stabilization-with.html">Blog</a>][<a href="http://www.youtube.com/watch?v=0MiY-PNy-GU&amp;feature=player_embedded">Demo</a>][<a href="http://www.cc.gatech.edu/cpl/projects/videostabilization/">Project Site</a>]</li>
</ul>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2011/06/20/videostabilization-youtube/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper (2011) in IEEE CVPR: &#8220;Auto-Directed Video Stabilization with Robust L1 Optimal Camera Paths&#8221;</title>
		<link>http://prof.irfanessa.com/2011/06/19/videostabilization/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=videostabilization</link>
		<comments>http://prof.irfanessa.com/2011/06/19/videostabilization/#comments</comments>
		<pubDate>Sun, 19 Jun 2011 22:36:41 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Computational Photography and Video]]></category>
		<category><![CDATA[Matthias Grundmann]]></category>
		<category><![CDATA[PAMI/ICCV/CVPR/ECCV]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[Vivek Kwatra]]></category>
		<category><![CDATA[2011]]></category>
		<category><![CDATA[Computational Photography]]></category>
		<category><![CDATA[Computational Video]]></category>
		<category><![CDATA[Computer Vision]]></category>
		<category><![CDATA[CVPR]]></category>
		<category><![CDATA[Video Stabilization]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=747</guid>
		<description><![CDATA[Auto-Directed Video Stabilization with Robust L1 Optimal Camera Paths Grundmann, Kwatra, and Essa (2011), “Auto-Directed Video Stabilization with Robust L1 Optimal Camera Paths,” in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2011.  [PDF] [WEBSITE][VIDEO] [DEMO][Google Research Blog] [BIBTEX] @inproceedings{2011-Grundmann-AVSWROCP, Author = {M. Grundmann and V. Kwatra and I. Essa}, Booktitle = {Proceedings of IEEE Conference [...]]]></description>
			<content:encoded><![CDATA[<h3>Auto-Directed Video Stabilization with Robust L1 Optimal Camera Paths</h3>
<ul>
<li>Grundmann, Kwatra, and Essa (2011), “Auto-Directed Video Stabilization with Robust L1 Optimal Camera Paths,” in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2011.  <a title="PDF" href="http://www.cc.gatech.edu/~irfan/p/2011-Grundmann-AVSWROCP">[PDF]</a> <a title="Project Website" href="http://www.cc.gatech.edu/cpl/projects/videostabilization/">[WEBSITE]</a><a title="VIDEO" href="http://www.youtube.com/watch?v=i5keG1Y810U">[VIDEO]</a> <a title="DEMO" href="http://www.youtube.com/watch?v=0MiY-PNy-GU">[DEMO][</a>Google Research <a href="http://googleresearch.blogspot.com/2011/06/auto-directed-video-stabilization-with.html">Blog</a><a title="DEMO" href="http://www.youtube.com/watch?v=0MiY-PNy-GU">]</a> <a id="papercite_3" class="papercite_toggle" href="javascript:void(0)">[BIBTEX]</a>
<pre id="papercite_3_block" class="papercite_bibtex"><code> @inproceedings{2011-Grundmann-AVSWROCP, Author = {M. Grundmann and V. Kwatra and I. Essa}, Booktitle = {Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR)}, Month = {June}, Pdf = {http://www.cc.gatech.edu/~irfan/p/2011-Grundmann-AVSWROCP}, Publisher = {IEEE Computer Society}, Title = {Auto-Directed Video Stabilization with Robust L1 Optimal Camera Paths}, Url = {http://www.cc.gatech.edu/cpl/projects/videostabilization/}, Video = {http://www.youtube.com/watch?v=i5keG1Y810U}, Year = {2011}}</code></pre>
</li>
</ul>
<h4 style="text-align: left;"><strong>Abstract</strong></h4>
<p style="text-align: left;">We present a novel algorithm for automatically applying constrainable, L1-optimal camera paths to generate stabilized videos by removing undesired motions. Our goal is to compute camera paths that are composed of constant, linear and parabolic segments mimicking the camera motions employed by professional cinematographers. To this end, our algorithm is based on a linear programming framework to minimize the first, second, and third derivatives of the resulting camera path. Our method allows for video stabilization beyond the conventional filtering of camera paths that only suppresses high frequency jitter. We incorporate additional constraints on the path of the camera directly in our algorithm, allowing for stabilized and retargeted videos. Our approach accomplishes this without the need of user interaction or costly 3D reconstruction of the scene, and works as a post-process for videos from any camera or from an online source.</p>
<p><img class="aligncenter" src="http://www.cc.gatech.edu/cpl/projects/videostabilization/teaser.png" alt="" width="500" /></p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2011/06/19/videostabilization/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Presentation (2011) at IBPRIA 2011: &#8220;Spatio-Temporal Video Analysis and Visual Activity Recognition&#8221;</title>
		<link>http://prof.irfanessa.com/2011/06/08/ibpria2011/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=ibpria2011</link>
		<comments>http://prof.irfanessa.com/2011/06/08/ibpria2011/#comments</comments>
		<pubDate>Wed, 08 Jun 2011 22:41:57 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Activity Recognition]]></category>
		<category><![CDATA[Computational Photography and Video]]></category>
		<category><![CDATA[Kihwan Kim]]></category>
		<category><![CDATA[Matthias Grundmann]]></category>
		<category><![CDATA[Multimedia]]></category>
		<category><![CDATA[PAMI/ICCV/CVPR/ECCV]]></category>
		<category><![CDATA[Presentations]]></category>
		<category><![CDATA[2011]]></category>
		<category><![CDATA[Computational Photography]]></category>
		<category><![CDATA[Computational Video]]></category>
		<category><![CDATA[Computer Vision]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=750</guid>
		<description><![CDATA[&#8220;Spatio-Temporal Video Analysis and Visual Activity Recognition&#8221; at the Iberian Conference on Pattern Recognition and Image Analysis  (IbPRIA) 2011 Conference in Las Palmas de Gran Canaria. Spain. June 8-10. Abstract My research group is focused on a variety of approaches for (a) low-level video analysis and synthesis and (b) recognizing activities in videos. In this talk, I [...]]]></description>
			<content:encoded><![CDATA[<p>&#8220;Spatio-Temporal Video Analysis and Visual Activity Recognition&#8221; at the <a href="http://www.ibpria2011.ulpgc.es/">Iberian Conference on Pattern Recognition and Image Analysis  (IbPRIA) 2011 Conference</a> in Las Palmas de Gran Canaria. Spain. June 8-10.</p>
<h4 style="text-align: left;"><strong>Abstract</strong></h4>
<p style="text-align: left;">My research group is focused on a variety of approaches for (a) low-level video analysis and synthesis and (b) recognizing activities in videos. In this talk, I will concentrate on two of our recent efforts. One effort aimed at robust spatio-temporal segmentation of video and another on using motion and flow to recognize and predict actions from video.</p>
<p style="text-align: left;">In the first part of the talk, I will present an efficient and scalable technique for spatio-temporal segmentation of long video sequences using a hierarchical graph-based algorithm. In this work, we begin by over segmenting a volumetric video graph into space-time regions grouped by appearance. We then construct a “region graph” over the obtained segmentation and iteratively repeat this process over multiple levels to create a tree of spatio-temporal segmentations. This hierarchical approach generates high quality segmentations, which are temporally coherent with stable region boundaries, and allows subsequent applications to choose from varying levels of granularity. We further improve segmentation quality by using dense optical flow to guide temporal connections in the initial graph. I will demonstrate a variety of examples of how this robust segmentation works, and will show additional examples of video-retargeting that use spatio-temporal saliency derived from this segmentation approach. (Matthias Grundmann, Vivek Kwatra, Mei Han, Irfan Essa, CVPR 2010, in collaboration with Google Research).</p>
<p style="text-align: left;">In the second part of this talk, I will show that constrained multi-agent events can be analyzed and even predicted from video. Such analysis requires estimating the global movements of all players in the scene at any time, and is needed for modeling and predicting how the multi-agent play evolves over time on the playing field. To this end, we propose a novel approach to detect the locations of where the play evolution will proceed, e.g. where interesting events will occur, by tracking player positions and movements over time. To achieve this, we extract the ground level sparse movement of players in each time-step, and then generate a dense motion field. Using this field we detect locations where the motion converges, implying positions towards which the play is evolving. I will show examples of how we have tested this approach for soccer, basketball and hockey. (Kihwan Kim, Matthias Grundmann, Ariel Shamir, Iain Matthews, Jessica Hodgins, Irfan Essa, CVPR 2010, in collaboration with Disney Research).</p>
<p style="text-align: left;">Time permitting, I will show some more videos of our recent work on video analysis and synthesis. For more information, papers, and videos, see <a href="http://prof.irfanessa.com/">my website</a>.</p>
<p style="text-align: center;"><img class="aligncenter" src="http://ibpria2011.ulpgc.es/files/images/ibpria2011_b02.jpg" alt="" width="500" /></p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2011/06/08/ibpria2011/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Going Live on YouTube (2011): Lights, Camera&#8230; EDIT! New Features for the YouTube Video Editor</title>
		<link>http://prof.irfanessa.com/2011/03/21/live-on-youtube/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=live-on-youtube</link>
		<comments>http://prof.irfanessa.com/2011/03/21/live-on-youtube/#comments</comments>
		<pubDate>Mon, 21 Mar 2011 21:37:18 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Computational Photography and Video]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[In The News]]></category>
		<category><![CDATA[Matthias Grundmann]]></category>
		<category><![CDATA[Multimedia]]></category>
		<category><![CDATA[Vivek Kwatra]]></category>
		<category><![CDATA[WWW]]></category>
		<category><![CDATA[2011]]></category>
		<category><![CDATA[Computational Photography]]></category>
		<category><![CDATA[Computational Video]]></category>
		<category><![CDATA[Computer Vision]]></category>
		<category><![CDATA[Video Stabilization]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=803</guid>
		<description><![CDATA[via YouTube Blog: Lights, Camera&#8230; EDIT! New Features for the YouTube Video Editor. Lights, Camera&#8230; EDIT! New Features for the YouTube Video Editor Nine months ago we launched our cloud-based video editor. It was a simple product built to provide our users with simple editing tools. Although it didn’t have all the features available on paid [...]]]></description>
			<content:encoded><![CDATA[<p>via <a href="http://youtube-global.blogspot.com/2011/03/lights-camera-edit-new-features-for.html">YouTube Blog: Lights, Camera&#8230; EDIT! New Features for the YouTube Video Editor</a>.</p>
<blockquote>
<h3>Lights, Camera&#8230; EDIT! New Features for the YouTube Video Editor</h3>
<p>Nine months ago we launched our cloud-based video editor. It was a simple product built to provide our users with simple editing tools. Although it didn’t have all the features available on paid desktop editing software, the idea was that the vast majority of people’s video editing needs are pretty basic and straight-forward and we could provide these features with a free editor available on the Web. Since launch, hundreds of thousands of videos have been published using the YouTube Video Editor and we’ve regularly pushed out new feature enhancements to the product, including:</p>
<ul>
<li>Video transitions (crossfade, wipe, slide)</li>
<li>The ability to save projects across sessions</li>
<li>Increased clips allowed in the editor from 6 to 17</li>
<li>Video rotation (from portrait to landscape and vice versa &#8211; great for videos shot on mobile)</li>
<li>Shape transitions (heart, star, diamond, and Jack-O-Lantern for Halloween)</li>
<li>Audio mixing (AudioSwap track mixed with original audio)</li>
<li>Effects (brightness/contrast, black &amp; white)</li>
</ul>
<p>A new user interface and project menu for multiple saved projects</p>
<p>While many of these are familiar features also available on desktop software, today, we’re excited to unveil two new features that the team has been working on over the last couple of months that take unique advantage of the cloud:</p>
<h4>Stabilizer</h4>
<p>Ever shoot a shaky video that’s so jittery, it’s actually hard to watch? Professional cinematographers use stabilization equipment such as tripods or camera dollies to keep their shots smooth and steady. Our team mimicked these cinematographic principles by automatically determining the best camera path for you through a unified optimization technique. In plain English, you can smooth some of those unsteady videos with the click of a button. We also wanted you to be able to preview these results in real-time, before publishing the finished product to the Web. We can do this by harnessing the power of the cloud by splitting the computation required for stabilizing the video into chunks and distributed them across different servers. This allows us to use the power of many machines in parallel, computing and streaming the stabilized results quickly into the preview. You can check out the paper we’re publishing entitled “Auto-Directed Video Stabilization with Robust L1 Optimal Camera Paths.” Want to see stabilizer in action? You can test it out for yourself, or check out these two videos. The first is without stabilizer.</p>
<p style="text-align: center;">
<p><a href="http://www.youtube.com/watch?v=NnyYXbk8MXU&#038;fmt=18">http://www.youtube.com/watch?v=NnyYXbk8MXU</a></p>
</p>
<p style="text-align: center;">And now, with the stabilizer:</p>
<p style="text-align: center;">
<p><a href="http://www.youtube.com/watch?v=sLpkdrjsRxY&#038;fmt=18">http://www.youtube.com/watch?v=sLpkdrjsRxY</a></p>
</p>
</blockquote>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2011/03/21/live-on-youtube/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Funding (2011): NSF (1059362): &#8220;II-New: Motion Grammar Laboratory&#8221;</title>
		<link>http://prof.irfanessa.com/2011/03/01/nsf-2011-mri/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=nsf-2011-mri</link>
		<comments>http://prof.irfanessa.com/2011/03/01/nsf-2011-mri/#comments</comments>
		<pubDate>Tue, 01 Mar 2011 21:10:18 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Henrik Christensen]]></category>
		<category><![CDATA[Mike Stilman]]></category>
		<category><![CDATA[NSF]]></category>
		<category><![CDATA[2011]]></category>
		<category><![CDATA[Computer Vision]]></category>
		<category><![CDATA[Funding]]></category>
		<category><![CDATA[Robotics]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=819</guid>
		<description><![CDATA[II-New: Motion Grammar Laboratory (Stillman, Essa, Egerstadt, Christensen, Ueda) Division of Computer and Network Systems Instrumentation Grant. An anthropomorphic robot arm and a human capture system enable the autonomous performance of assembly tasks with significant uncertainty in problem specifications and environments. This line of work is investigated through sequences of manipulation actions where the guarantee of the completion [...]]]></description>
			<content:encoded><![CDATA[<h3>II-New: Motion Grammar Laboratory (Stillman, Essa, Egerstadt, Christensen, Ueda) <a href="http://www.nsf.gov/div/index.jsp?div=CNS">Division of Computer and Network Systems</a> Instrumentation Grant.</h3>
<p style="text-align: justify;">An anthropomorphic robot arm and a human capture system enable the autonomous performance of assembly tasks with significant uncertainty in problem specifications and environments. This line of work is investigated through sequences of manipulation actions where the guarantee of the completion of task-level objectives is rooted in the discovery of the semantic structure of human manipulation. New research directions in anthropomorphic robotics are explored including programming by demonstration, activity recognition, control and estimation and planning.</p>
<p style="text-align: justify;">The motion grammar laboratory infrastructure allows a great opportunity for research and education. New classroom experiences for undergraduates and graduates provide practical experience in robot human interaction and activity process sharing. This opens possibilities for human training and rehabilitation, as well as assistive personal robotic, and opens the door to a host of technological innovations.</p>
<p>via <a href="http://www.nsf.gov/awardsearch/showAward.do?AwardNumber=1059362">Award#1059362 &#8211; II-New: Motion Grammar Laboratory</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2011/03/01/nsf-2011-mri/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper (2011) in Virtual Reality: &#8220;Augmenting aerial earth maps with dynamic information from videos&#8221;</title>
		<link>http://prof.irfanessa.com/2011/02/02/vr-2011/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=vr-2011</link>
		<comments>http://prof.irfanessa.com/2011/02/02/vr-2011/#comments</comments>
		<pubDate>Wed, 02 Feb 2011 22:58:09 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Computational Photography and Video]]></category>
		<category><![CDATA[Kihwan Kim]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[Sangmin Oh]]></category>
		<category><![CDATA[2011]]></category>
		<category><![CDATA[Augmented Reality]]></category>
		<category><![CDATA[Computational Photography]]></category>
		<category><![CDATA[Computer Vision]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=713</guid>
		<description><![CDATA[Augmenting aerial earth maps with dynamic information from videos Kim, Oh, Lee, and Essa (2011), “Augmenting aerial earth maps with dynamic information from videos,” Journal of Virtual Reality, Special Issue on Augmented Reality, vol. 15, iss. 2-3, pp. 1359-4338, 2011.  [PDF] [WEBSITE] [VIDEO] [DOI] [SpringerLink][BIBTEX] @article{2011-Kim-AAEMWDIFV, Author = {K. Kim and S. Oh and J. Lee and I. Essa}, Doi [...]]]></description>
			<content:encoded><![CDATA[<p><span class="Apple-style-span" style="font-size: 15px; font-weight: bold;">Augmenting aerial earth maps with dynamic information from videos</span></p>
<ul>
<li>Kim, Oh, Lee, and Essa (2011), “Augmenting aerial earth maps with dynamic information from videos,” <em>Journal of Virtual Reality, Special Issue on Augmented Reality</em>, vol. 15, iss. 2-3, pp. 1359-4338, 2011.  <a title="PDF" href="http://www.cc.gatech.edu/~irfan/p/2011-Kim-AAEMWDIFV.pdf">[PDF]</a> <a title="Project Website" href="http://www.cc.gatech.edu/cpl/projects/augearth">[WEBSITE]</a> <a title="VIDEO" href="http://www.youtube.com/watch?v=TPk88soc2qw">[VIDEO]</a> <a title="View document on publisher site" href="http://dx.doi.org/10.1007/s10055-010-0186-2">[DOI]</a> [<a href="http://www.springerlink.com/content/02q63031373n5445">SpringerLink</a>]<a id="papercite_0" class="papercite_toggle" href="javascript:void(0)">[BIBTEX]</a>
<pre id="papercite_0_block" class="papercite_bibtex"><code>
@article{2011-Kim-AAEMWDIFV,
 Author = {K. Kim and S. Oh and J. Lee and I. Essa},
 Doi = {10.1007/s10055-010-0186-2},
 Journal = {Journal of Virtual Reality, Special Issue on Augmented Reality},
 Number = {2-3},
 Pages = {1359-4338},
 Pdf = {http://www.cc.gatech.edu/~irfan/p/2011-Kim-AAEMWDIFV.pdf},
 Title = {Augmenting aerial earth maps with dynamic information from videos},
 Url = {http://www.cc.gatech.edu/cpl/projects/augearth},
 Video = {http://www.youtube.com/watch?v=TPk88soc2qw},
 Volume = {15},
 Year = {2011}}</code></pre>
</li>
</ul>
<h4>Abstract</h4>
<p>We introduce methods for augmenting aerial visualizations of Earth (from tools such as Google Earth or Microsoft Virtual Earth) with dynamic information obtained from videos. Our goal is to make Augmented Earth Maps that visualize plausible live views of dynamic scenes in a city. We propose different approaches to analyze videos of pedestrians and cars in real situations, under differing conditions to extract dynamic information. Then, we augment an Aerial Earth Maps (AEMs) with the extracted live and dynamic content. We also analyze natural phenomenon (skies, clouds) and project information from these to the AEMs to add to the visual reality. Our primary contributions are: (1) Analyzing videos with different viewpoints, coverage, and overlaps to extract relevant information about view geometry and movements, with limited user input. (2) Projecting this information appropriately to the viewpoint of the AEMs and modeling the dynamics in the scene from observations to allow inference (in case of missing data) and synthesis. We demonstrate this over a variety of camera configurations and conditions. (3) The modeled information from videos is registered to the AEMs to render appropriate movements and related dynamics. We demonstrate this with traffic flow, people movements, and cloud motions. All of these approaches are brought together as a prototype system for a real-time visualization of a city that is alive and engaging.</p>
<p style="text-align: center;"><a href="http://www.cc.gatech.edu/cpl/projects/augearth/images/teaser.jpg"><img class="aligncenter" title="Augmented Earth" src="http://www.cc.gatech.edu/cpl/projects/augearth/images/teaser.jpg" alt="Augmented Earth" width="500" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2011/02/02/vr-2011/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Poster STS 2011: &#8220;3-Dimensional Visualization of the Operating Room Using Advanced Motion Capture: A Novel Paradigm to Expand Simulation-Based Surgical Education&#8221;</title>
		<link>http://prof.irfanessa.com/2011/02/02/sts-2011/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=sts-2011</link>
		<comments>http://prof.irfanessa.com/2011/02/02/sts-2011/#comments</comments>
		<pubDate>Wed, 02 Feb 2011 22:51:05 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Computational Photography and Video]]></category>
		<category><![CDATA[Eric Sarin]]></category>
		<category><![CDATA[Health Systems]]></category>
		<category><![CDATA[Kihwan Kim]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[William Cooper]]></category>
		<category><![CDATA[2011]]></category>
		<category><![CDATA[Computational Photography]]></category>
		<category><![CDATA[Computational Video]]></category>
		<category><![CDATA[Computer Vision]]></category>
		<category><![CDATA[Health]]></category>
		<category><![CDATA[Medical]]></category>
		<category><![CDATA[Surgery]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=704</guid>
		<description><![CDATA[3-Dimensional Visualization of the Operating Room Using Advanced Motion Capture: A Novel Paradigm to Expand Simulation-Based Surgical Education Sarin, Kim, Essa, and Cooper (2011), “3-Dimensional Visualization of the Operating Room Using Advanced Motion Capture: A Novel Paradigm to Expand Simulation-Based Surgical Education,” in Proccedings of Society of Thoracic Surgeons Annual Meeting, Society of Thoracic Surgeons, 2011.  [...]]]></description>
			<content:encoded><![CDATA[<h3>3-Dimensional Visualization of the Operating Room Using Advanced Motion Capture: A Novel Paradigm to Expand Simulation-Based Surgical Education</h3>
<ul>
<li>Sarin, Kim, Essa, and Cooper (2011), “3-Dimensional Visualization of the Operating Room Using Advanced Motion Capture: A Novel Paradigm to Expand Simulation-Based Surgical Education,” in Proccedings of Society of Thoracic Surgeons Annual Meeting, Society of Thoracic Surgeons, 2011.  <a title="BLOG" href="http://prof.irfanessa.com/2011/02/02/sts-2011/">[BLOG]</a><a href="javascript:void(0)" id="papercite_0" class="papercite_toggle">[BIBTEX]</a>
<pre class="papercite_bibtex" id="papercite_0_block"><code>
@incollection{2011-Sarin-3VORUAMCNPESSE,
  Author = {E. L. Sarin and K. Kim and I. Essa and W. A. Cooper},
  Blog = {http://prof.irfanessa.com/2011/02/02/sts-2011/},
  Booktitle = {Proccedings of Society of Thoracic Surgeons Annual Meeting},
  Month = {January},
  Publisher = {Society of Thoracic Surgeons},
  Title = {3-Dimensional Visualization of the Operating Room Using Advanced Motion Capture: A Novel Paradigm to Expand Simulation-Based Surgical Education},
  Type = {Poster and Video Presentation},
  Year = {2011}}</code></pre>
</li>
</ul>
<p>A collaborative project between School of Interactive Computing, Georgia Institute of Technology, Atlanta, Georgia, Division of Cardiothoracic Surgery, Emory University School of Medicine, Atlanta, Georgia, and Inova Heart and Vascular Institute1, Fairfax, Virginia. This was a Video and a Poster presentation at the <a href="http://www.sts.org/">Society of Thoracic Surgeons</a> <a href="http://www.sts.org/education-meetings/sts-annual-meeting">Annual Meeting in San Diego, CA, Jan 2011</a>.</p>
<p style="text-align: center;"><span style="font-weight: normal; font-size: 13px;"><a href="http://prof.irfanessa.com/wp-content/uploads/2011/02/Slide1.jpg"><img class="aligncenter size-large wp-image-705" title="Poster for Society of Thoracic Surgeon's Annual Meeting" src="http://prof.irfanessa.com/wp-content/uploads/2011/02/Slide1-1024x731.jpg" alt="Poster for Society of Thoracic Surgeon's Annual Meeting" width="500" height=" " /></a><br />
</span></p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2011/02/02/sts-2011/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper (2011) in IEEE PAMI: &#8220;Bilayer Segmentation of Webcam Videos Using Tree-Based Classifiers &#8220;</title>
		<link>http://prof.irfanessa.com/2011/01/12/pami-201/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=pami-201</link>
		<comments>http://prof.irfanessa.com/2011/01/12/pami-201/#comments</comments>
		<pubDate>Wed, 12 Jan 2011 23:14:23 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Antonio Crimisini]]></category>
		<category><![CDATA[Computational Photography and Video]]></category>
		<category><![CDATA[John Winn]]></category>
		<category><![CDATA[Numerical Machine Learning]]></category>
		<category><![CDATA[PAMI/ICCV/CVPR/ECCV]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[Pei Yin]]></category>
		<category><![CDATA[Computational Video]]></category>
		<category><![CDATA[Computer Vision]]></category>
		<category><![CDATA[Video Segmentation]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=720</guid>
		<description><![CDATA[Bilayer Segmentation of Webcam Videos Using Tree-Based Classifiers Pei Yin, A. Criminisi, J. Winn, I. Essa (2011), &#8220;Bilayer Segmentation of Webcam Videos Using Tree-Based Classifiers&#8221; in Pattern Analysis and Machine Intelligence, IEEE Transactions on, Jan. 2011, Volume :  33 ,  Issue:1, ISSN :  0162-8828, Digital Object Identifier :  10.1109/TPAMI.2010.65,  IEEE Computer Society [Project Page&#124;DOI] ABSTRACT This paper [...]]]></description>
			<content:encoded><![CDATA[<h3>Bilayer Segmentation of Webcam Videos Using Tree-Based Classifiers</h3>
<p>Pei Yin, A. Criminisi, J. Winn, I. Essa (2011), &#8220;Bilayer Segmentation of Webcam Videos Using Tree-Based Classifiers&#8221; in <em>Pattern Analysis and Machine Intelligence, IEEE Transactions on, </em>Jan. 2011, Volume :  33 ,  Issue:1, ISSN :  0162-8828, Digital Object Identifier :  10.1109/TPAMI.2010.65,  IEEE Computer Society [<a href="http://www.cc.gatech.edu/cpl/projects/bilayer-segmentation/">Project Page</a>|<a href="http://dx.doi.org/10.1109/TPAMI.2010.65">DOI</a>]</p>
<p style="text-align: center;"><strong>ABSTRACT</strong></p>
<p style="text-align: justify;">This paper presents an automatic segmentation algorithm for video frames captured by a (monocular) webcam that closely approximates depth segmentation from a stereo camera. The frames are segmented into foreground and background layers that comprise a subject (participant) and other objects and individuals. The algorithm produces correct segmentations even in the presence of large background motion with a nearly stationary foreground. This research makes three key contributions: First, we introduce a novel motion representation, referred to as “motons,” inspired by research in object recognition. Second, we propose estimating the segmentation likelihood from the spatial context of motion. The estimation is efficiently learned by random forests. Third, we introduce a general taxonomy of tree-based classifiers that facilitates both theoretical and experimental comparisons of several known classification algorithms and generates new ones. In our bilayer segmentation algorithm, diverse visual cues such as motion, motion context, color, contrast, and spatial priors are fused by means of a conditional random field (CRF) model. Segmentation is then achieved by binary min-cut. Experiments on many sequences of our videochat application demonstrate that our algorithm, which requires no initialization, is effective in a variety of scenes, and the segmentation results are comparable to those obtained by stereo systems.<img class="aligncenter" src="http://www.cc.gatech.edu/cpl/projects/bilayer-segmentation/TeaserResult.PNG" alt="" width="500" /></p>
<p>via <a href="http://ieeexplore.ieee.org/search/freesrchabstract.jsp?tp=&amp;arnumber=5432210&amp;queryText%3Dpei+yin%26refinements%3D4290827373%26openedRefinements%3D*%26searchField%3DSearch+All">IEEE Xplore &#8211; Abstract Page</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2011/01/12/pami-201/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Fall 2010 GRASP Seminar &#8211; Irfan Essa, Georgia Institute Of Technology, &#8220;Two Talks On Video Analysis: 1 Segmentation Of Video And 2 Prediction Of Actions In Video&#8221; &#124; GRASP Laboratory &#8211; University Of Pennsylvania</title>
		<link>http://prof.irfanessa.com/2010/09/20/fall-2010-grasp-seminar-irfan-essa-georgia-institute-of-technology-two-talks-on-video-analysis-1-segmentation-of-video-and-2-prediction-of-actions-in-video-grasp-laboratory-university-of/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=fall-2010-grasp-seminar-irfan-essa-georgia-institute-of-technology-two-talks-on-video-analysis-1-segmentation-of-video-and-2-prediction-of-actions-in-video-grasp-laboratory-university-of</link>
		<comments>http://prof.irfanessa.com/2010/09/20/fall-2010-grasp-seminar-irfan-essa-georgia-institute-of-technology-two-talks-on-video-analysis-1-segmentation-of-video-and-2-prediction-of-actions-in-video-grasp-laboratory-university-of/#comments</comments>
		<pubDate>Mon, 20 Sep 2010 19:37:38 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Computational Photography and Video]]></category>
		<category><![CDATA[Presentations]]></category>
		<category><![CDATA[2010]]></category>
		<category><![CDATA[Activity Recognition]]></category>
		<category><![CDATA[Computational Photography]]></category>
		<category><![CDATA[Computational Video]]></category>
		<category><![CDATA[Computer Vision]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=680</guid>
		<description><![CDATA[Fall 2010 GRASP Seminar &#8211; Irfan Essa, Georgia Institute Of Technology, &#8220;Two Talks On Video Analysis: 1 Segmentation Of Video And 2 Prediction Of Actions In Video&#8221; &#124; GRASP Laboratory &#8211; University Of Pennsylvania. Friday September 24, 2010 from 11:00am to 12:00pm My research group is focused on a variety if approaches for video analysis [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://grasp.upenn.edu/seminars/Irfan_Essa">Fall 2010 GRASP Seminar &#8211; Irfan Essa, Georgia Institute Of Technology, &#8220;Two Talks On Video Analysis: 1 Segmentation Of Video And 2 Prediction Of Actions In Video&#8221; | GRASP Laboratory &#8211; University Of Pennsylvania</a>.</p>
<p><span style="font-family: Verdana, Arial; line-height: 16px; font-size: 10px; color: #3c3c3c;"><em>Friday September 24, 2010 from 11:00am to 12:00pm</em></span></p>
<p><span style="font-family: Verdana, Arial; line-height: 16px; font-size: 10px; color: #3c3c3c;"></p>
<div id="main_content">
<div class="left_column no-right" style="float: none; width: 891px;">
<div class="seminar content clear" style="clear: both;">
<p>My research group is focused on a variety if approaches for video analysis and synthesis. In this talk, I will focus on two of our recent efforts.  One effort aimed at robust spatio-temporal segmentation of video and another on using motion and flow to predict actions from video.</p>
<p>In the first part of the talk, I will present an efficient and scalable technique for spatio-temporal segmentation of long video sequences using a hierarchical graph-based algorithm. In this effort, we begin by over segmenting a volumetric video graph into space-time regions grouped by appearance. We then construct a “region graph” over the obtained segmentation and iteratively repeat this process over multiple levels to create a tree of spatio-temporal segmentations. This hierarchical approach generates high quality segmentations, which are temporally coherent with stable region boundaries, and allows subsequent applications to choose from varying levels of granularity. We further improve segmentation quality by using dense optical flow to guide temporal connections in the initial graph. I will demonstrate a variety of examples of how this robust segmentation works, and will show additional examples of video-retargeting that use the saliency from this segmentation approach.  (Matthias Grundmann, Vivek Kwatra, Mei Han, Irfan Essa, CVPR 2010, in collaboration with Google Research).</p>
<p>In the second part of this talk, I will show that constrained multi-agent events can be analyzed and even predicted from video. Such analysis requires estimating the global movements of all players in the scene at any time, and is needed for modeling and predicting how the multi-agent play evolves over time on the field. To this end, we propose a novel approach to detect the locations of where the play evolution will proceed, e.g. where interesting events will occur, by tracking player positions and movements over time. To achieve this, we extract the ground level sparse movement of players in each time-step, and then generate a dense motion field. Using this field we detect locations where the motion converges, implying positions towards which the play is evolving. I will show examples of how we have tested this approach for soccer, basketball and hockey. (Kihwan Kim, Matthias Grundmann, Ariel Shamir, Iain Matthews, Jessica Hodgins, Irfan Essa, CVPR 2010, in collaboration with Disney Research).</p>
<p><span>Time permitting, I will show some more videos of our recent work on video analysis and synthesis. For more information, papers, and videos, see my website at<a class="smarterwiki-linkify" style="outline-style: none; outline-width: initial; outline-color: initial; color: #9a0000; text-decoration: none; border: initial none initial;" href="http://prof.irfanessa.com/">http://prof.irfanessa.com/</a></span><br />
<strong>Presenter&#8217;s Biography:</strong></p>
<p>Irfan Essa is a Professor in the School of Interactive Computing(iC) of the College of Computing (CoC), and Adjunct Professor in the School of Electrical and Computer Engineering, Georgia Institute of Technology (GA Tech), in Atlanta, Georgia, USA.</p>
<p>Irfan Essa works in the areas of Computer Vision, Computer Graphics, Computational Perception, Robotics and Computer Animation, with potential impact on Video Analysis and Production (e.g., Computational Photography &amp; Video, Image-based Modeling and Rendering, etc.) Human Computer Interaction, and Artificial Intelligence research. Specifically, he is interested in the analysis, interpretation, authoring, and synthesis (of video), with the goals of building aware environments, recognizing, modeling human activities, and behaviors, and developing dynamic and generative representations of time-varying streams. He has published over a 150 scholarly articles in leading journals and conference venues on these topics and has awards for his research and teaching.</p>
<p>He joined Georgia Tech Faculty in 1996 after his earning his MS (1990), Ph.D. (1994), and holding research faculty position at the Massachusetts Institute of Technology (Media Lab) [1988-1996]. His Doctoral Research was in the area of Facial Recognition, Analysis, and Synthesis.</p>
<div></div>
</div>
</div>
</div>
<p></span></p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2010/09/20/fall-2010-grasp-seminar-irfan-essa-georgia-institute-of-technology-two-talks-on-video-analysis-1-segmentation-of-video-and-2-prediction-of-actions-in-video-grasp-laboratory-university-of/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper in CVPR (2010): &#8220;Motion Field to Predict Play Evolution in Dynamic Sport Scenes</title>
		<link>http://prof.irfanessa.com/2010/06/13/playevolution-cvpr2010/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=playevolution-cvpr2010</link>
		<comments>http://prof.irfanessa.com/2010/06/13/playevolution-cvpr2010/#comments</comments>
		<pubDate>Sun, 13 Jun 2010 15:25:10 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Activity Recognition]]></category>
		<category><![CDATA[Jessica Hodgins]]></category>
		<category><![CDATA[Kihwan Kim]]></category>
		<category><![CDATA[Matthias Grundmann]]></category>
		<category><![CDATA[PAMI/ICCV/CVPR/ECCV]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[Sports Visualization]]></category>
		<category><![CDATA[2010]]></category>
		<category><![CDATA[Computational Video]]></category>
		<category><![CDATA[Computer Vision]]></category>
		<category><![CDATA[CVPR]]></category>
		<category><![CDATA[Publications]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=659</guid>
		<description><![CDATA[Kihwan Kim, Matthias Grundmann, Ariel Shamir, Iain Matthews, Jessica Hodgins, Irfan Essa (2010) &#8220;Motion Field to Predict Play Evolution in Dynamic Sport Scenes&#8221; in Proceedings of IEEE Computer Vision and Pattern Recognition Conference (CVPR), San Francisco, CA, USA, June 2010 [PDF][Website][DOI][Video (Youtube)]. Abstract Videos of multi-player team sports provide a challenging domain for dynamic scene analysis. Player actions [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.cc.gatech.edu/~kihwan23">Kihwan Kim</a>, <a href="http://www.cc.gatech.edu/~grundman">Matthias Grundmann</a>, <a href="http://www.faculty.idc.ac.il/arik/">Ariel Shamir</a>, Iain Matthews, Jessica Hodgins, <a href="http://www.irfanessa.com">Irfan Essa</a> (2010) &#8220;<a href="http://www.cc.gatech.edu/cpl/projects/playevolution/">Motion Field to Predict Play Evolution in Dynamic Sport Scenes</a>&#8221; in Proceedings of <a href="http://cvl.umiacs.umd.edu/conferences/cvpr2010/" target="_blank">IEEE Computer Vision and Pattern Recognition Conference (CVPR)</a>, San Francisco, CA, USA, June 2010 [<a href="http://www.cc.gatech.edu/cpl/projects/playevolution/cvpr2010-pe.pdf">PDF</a>][<a href="http://www.cc.gatech.edu/cpl/projects/playevolution/">Website</a>][DOI][<a href="http://www.youtube.com/watch?v=jrksnCR1S0s&amp;feature=player_embedded">Video (Youtube)</a>].</p>
<p style="text-align: center;"><strong>Abstract</strong></p>
<p style="text-align: justify;">Videos of multi-player team sports provide a challenging domain for dynamic scene analysis. Player actions and interactions are complex as they are driven by many factors, such as the short-term goals of the individual player, the overall team strategy, the rules of the sport, and the current context of the game. We show that constrained multi-agent events can be analyzed and even predicted from video. Such analysis requires estimating the global movements of all players in the scene at any time, and is needed for modeling and predicting how the multi-agent play evolves over time on the field. To this end, we propose a novel approach to detect the locations of where the play evolution will proceed, e.g. where interesting events will occur, by tracking player positions and movements over time. We start by extracting the ground level sparse movement of players in each time-step, and then generate a dense motion field. Using this field we detect locations where the motion converges, implying positions towards which the play is evolving. We evaluate our approach by analyzing videos of a variety of complex soccer plays.</p>
<p style="text-align: center;"><a href="http://www.cc.gatech.edu/cpl/projects/playevolution/"><img class="aligncenter" title="CVPR 2010 Paper on Play Evolution" src="http://www.cc.gatech.edu/cpl/projects/playevolution/images/teaser.jpg" alt="CVPR 2010 Paper on Play Evolution" width="500" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2010/06/13/playevolution-cvpr2010/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper in CVPR (2010): &#8220;Efficient Hierarchical Graph-Based Video Segmentation</title>
		<link>http://prof.irfanessa.com/2010/06/13/videosegmentation-cvpr2010/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=videosegmentation-cvpr2010</link>
		<comments>http://prof.irfanessa.com/2010/06/13/videosegmentation-cvpr2010/#comments</comments>
		<pubDate>Sun, 13 Jun 2010 14:59:24 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Computational Photography and Video]]></category>
		<category><![CDATA[Matthias Grundmann]]></category>
		<category><![CDATA[PAMI/ICCV/CVPR/ECCV]]></category>
		<category><![CDATA[Vivek Kwatra]]></category>
		<category><![CDATA[2010]]></category>
		<category><![CDATA[Computational Photography]]></category>
		<category><![CDATA[Computational Video]]></category>
		<category><![CDATA[Computer Vision]]></category>
		<category><![CDATA[CVPR]]></category>
		<category><![CDATA[Publications]]></category>
		<category><![CDATA[Video Segmentation]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=647</guid>
		<description><![CDATA[Matthias Grundmann, Vivek Kwatra, Mei Han, Irfan Essa (2010) &#8220;Efficient Hierarchical Graph-Based Video Segmentation&#8221; in Proceedings of IEEE Computer Vision and Pattern Recognition Conference (CVPR), San Francisco, CA, USA, June 2010 [PDF][Website][DOI][Video (Youtube)]. Abstract We present an efficient and scalable technique for spatio- temporal segmentation of long video sequences using a hierarchical graph-based algorithm. We begin by over- [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.mgrundmann.com">Matthias Grundmann</a>, <a href="http://www.google.com/research/pubs/author38000.html">Vivek Kwatra</a>, <a href="http://research.google.com/pubs/author13553.html">Mei Han</a>, <a href="http://www.irfanessa.com">Irfan Essa</a> (2010) <a href="http://www.cc.gatech.edu/cpl/projects/videosegmentation/">&#8220;Efficient Hierarchical Graph-Based Video Segmentation</a>&#8221; in <em>Proceedings of </em><a href="http://cvl.umiacs.umd.edu/conferences/cvpr2010/" target="_blank"><em>IEEE Computer Vision and Pattern Recognition Conference (CVPR)</em></a>, San Francisco, CA, USA, June 2010 [<a href="http://www.cc.gatech.edu/cpl/projects/videosegmentation/cvpr2010_videosegmentation.pdf" target="_blank">PDF</a>][<a href="http://www.cc.gatech.edu/cpl/projects/videosegmentation/" target="_blank">Website</a>][DOI][<a href="http://www.youtube.com/watch?v=juDvLrFQF0U" target="_blank">Video</a> (Youtube)].</p>
<p style="text-align: center;"><strong>Abstract</strong></p>
<p style="text-align: justify;">We present an efficient and scalable technique for spatio- temporal segmentation of long video sequences using a hierarchical graph-based algorithm. We begin by over- segmenting a volumetric video graph into space-time regions grouped by appearance. We then construct a &#8220;region graph” over the obtained segmentation and iteratively repeat this process over multiple levels to create a tree of spatio-temporal segmentations. This hierarchical approach generates high quality segmentations, which are temporally coherent with stable region boundaries, and allows subse- quent applications to choose from varying levels of granularity. We further improve segmentation quality by using dense optical flow to guide temporal connections in the initial graph.</p>
<p style="text-align: justify;">We also propose two novel approaches to improve the scalability of our technique: (a) a parallel out- of-core algorithm that can process volumes much larger than an in-core algorithm, and (b) a clip-based process- ing algorithm that divides the video into overlapping clips in time, and segments them successively while enforcing consistency.</p>
<p style="text-align: justify;">We demonstrate hierarchical segmentations on video shots as long as 40 seconds, and even support a streaming mode for arbitrarily long videos, albeit without the ability to process them hierarchically.</p>
<p style="text-align: center;"><a href="http://www.cc.gatech.edu/cpl/projects/videosegmentation/"><img class="aligncenter" src="http://prof.irfanessa.com/wp-content/uploads/2010/08/teaser.png" alt="VideoSegmentation Teaser" width="500" height="156" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2010/06/13/videosegmentation-cvpr2010/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper in CVPR (2010): &#8220;Player Localization Using Multiple Static Cameras for Sports Visualization&#8221;</title>
		<link>http://prof.irfanessa.com/2010/06/13/playerlocalization-cvpr2010/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=playerlocalization-cvpr2010</link>
		<comments>http://prof.irfanessa.com/2010/06/13/playerlocalization-cvpr2010/#comments</comments>
		<pubDate>Sun, 13 Jun 2010 14:22:44 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Activity Recognition]]></category>
		<category><![CDATA[Jessica Hodgins]]></category>
		<category><![CDATA[Kihwan Kim]]></category>
		<category><![CDATA[Matthias Grundmann]]></category>
		<category><![CDATA[Numerical Machine Learning]]></category>
		<category><![CDATA[PAMI/ICCV/CVPR/ECCV]]></category>
		<category><![CDATA[Raffay Hamid]]></category>
		<category><![CDATA[Sports Visualization]]></category>
		<category><![CDATA[2010]]></category>
		<category><![CDATA[Computational Video]]></category>
		<category><![CDATA[Computer Vision]]></category>
		<category><![CDATA[CVPR]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=675</guid>
		<description><![CDATA[Raffay Hamid, Ram Krishan Kumar, Matthias Grundmann, Kihwan Kim, Irfan Essa, Jessica Hodgins (2010), &#8220;Player Localization Using Multiple Static Cameras for Sports Visualization&#8221; In Proceedings of IEEE Computer Vision and Pattern Recognition Conference (CVPR), San Francisco, CA, USA, June 2010 [PDF][Website][DOI][Video (Youtube)]. Abstract We present a novel approach for robust localization of multiple people observed using multiple [...]]]></description>
			<content:encoded><![CDATA[<p>Raffay Hamid, Ram Krishan Kumar, Matthias Grundmann, Kihwan Kim, Irfan Essa, Jessica Hodgins (2010), &#8220;Player Localization Using Multiple Static Cameras for Sports Visualization&#8221; In <em>Proceedings of </em><a href="http://cvl.umiacs.umd.edu/conferences/cvpr2010/" target="_blank"><em>IEEE Computer Vision and Pattern Recognition Conference (CVPR)</em></a>, San Francisco, CA, USA, June 2010 [<a href="http://www.raffayhamid.com/hamid_cvpr2010.pdf" target="_blank">PDF</a>][<a href="http://www.raffayhamid.com/sports_viz.shtml" target="_blank">Website</a>][DOI][<a href="http://www.youtube.com/watch?v=VwzAMi9pUDQ&amp;feature=player_embedded" target="_blank">Video (Youtube)</a>].</p>
<p style="text-align: center;"><strong>Abstract</strong></p>
<p style="text-align: justify;">We present a novel approach for robust localization of multiple people observed using multiple cameras. We usethis location information to generate sports visualizations,which include displaying a virtual offside line in soccer games, and showing players&#8217; positions and motion patterns.Our main contribution is the modeling and analysis for the problem of fusing corresponding players&#8217; positional informationas finding minimum weight K-length cycles in complete K-partite graphs. To this end, we use a dynamic programmingbased approach that varies over a continuum of being maximally to minimally greedy in terms of the numberof paths explored at each iteration. We present an end-to-end sports visualization framework that employs our proposed algorithm-class. We demonstrate the robustness of our framework by testing it on 60; 000 frames of soccerfootage captured over 5 different illumination conditions, play types, and team attire.</p>
<p style="text-align: center;"><a href="http://prof.irfanessa.com/wp-content/uploads/2010/08/2010-Hamid-CVPR2010.png"><img class="aligncenter size-full wp-image-676" title="2010-Hamid-CVPR2010" src="http://prof.irfanessa.com/wp-content/uploads/2010/08/2010-Hamid-CVPR2010.png" alt="Teaser Image from CVPR 2010 paper" width="500" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2010/06/13/playerlocalization-cvpr2010/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Funding (2010): DARPA Funding on &#8220;PERSISTENT STARE EXPLOITATION AND ANALYSIS SYSTEM (PERSEAS)&#8221;</title>
		<link>http://prof.irfanessa.com/2010/06/01/funding-2010-darpa-funding-on-persistent-stare-exploitation-and-analysis-system-perseas/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=funding-2010-darpa-funding-on-persistent-stare-exploitation-and-analysis-system-perseas</link>
		<comments>http://prof.irfanessa.com/2010/06/01/funding-2010-darpa-funding-on-persistent-stare-exploitation-and-analysis-system-perseas/#comments</comments>
		<pubDate>Tue, 01 Jun 2010 13:09:01 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Grant Schindler]]></category>
		<category><![CDATA[PERSEAS]]></category>
		<category><![CDATA[Visual Surviellance]]></category>
		<category><![CDATA[2010]]></category>
		<category><![CDATA[Activity Recognition]]></category>
		<category><![CDATA[Computer Vision]]></category>
		<category><![CDATA[DARPA]]></category>
		<category><![CDATA[Funding]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=831</guid>
		<description><![CDATA[ Persistent Stare Exploitation and Analysis System (PerSEAS) Part of the team led by Kitware Inc to work on Defense Advanced Research Projects Agency &#8211; Persistent Stare Exploitation and Analysis System (PerSEAS). The Persistent Stare Exploitation and Analysis System (PerSEAS) program is developing the capability to automatically and interactively identify potential threats as they emerge based on the [...]]]></description>
			<content:encoded><![CDATA[<h3> <a href="http://www.darpa.mil/Our_Work/I2O/Programs/Persistent_Stare_Exploitation_and_Analysis_System_(PerSEAS).aspx">Persistent Stare Exploitation and Analysis System (PerSEAS)</a></h3>
<p>Part of the team led by <a href="http://www.kitware.com/">Kitware Inc</a> to work on <a href="http://www.darpa.mil/Our_Work/I2O/Programs/Persistent_Stare_Exploitation_and_Analysis_System_(PerSEAS).aspx">Defense Advanced Research Projects Agency &#8211; Persistent Stare Exploitation and Analysis System (PerSEAS)</a>.</p>
<blockquote><p>The Persistent Stare Exploitation and Analysis System (PerSEAS) program is developing the capability to automatically and interactively identify potential threats as they emerge based on the correlation of multiple disparate activities and events in wide area motion imagery (WAMI) and multi-INT data.  PerSEAS will enable new methods of threat hypothesis adjudication and forensic analysis through activity-based modeling and inferencing capabilities.</p></blockquote>
<p>See</p>
<ul>
<li><a href="https://www.fbo.gov/index?s=opportunity&amp;mode=form&amp;id=f0144cfa4fb1ebbd4ac95f43a3a24540&amp;tab=core&amp;_cview=0">DARPA BAA</a></li>
<li><a href="http://www.wired.com/dangerroom/2009/09/pentagon-spy-cams-to-find-threats-in-weak-evidence/">Wired Article on this project</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2010/06/01/funding-2010-darpa-funding-on-persistent-stare-exploitation-and-analysis-system-perseas/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper in Advanced Robotics (2009): &#8220;Human Action Recognition Using Global Point Feature Histograms and Action Shapes&#8221;</title>
		<link>http://prof.irfanessa.com/2009/10/29/paper-advanced-robotics-2009/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=paper-advanced-robotics-2009</link>
		<comments>http://prof.irfanessa.com/2009/10/29/paper-advanced-robotics-2009/#comments</comments>
		<pubDate>Thu, 29 Oct 2009 14:58:56 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Activity Recognition]]></category>
		<category><![CDATA[Franzi Meier]]></category>
		<category><![CDATA[Intelligent Environments]]></category>
		<category><![CDATA[Michael Beetz]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[2009]]></category>
		<category><![CDATA[Computer Vision]]></category>
		<category><![CDATA[Publications]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=622</guid>
		<description><![CDATA[Radu Bogdan Rusu, Jan Bandouch, Franziska Meier, Irfan Essa and Michael Beetz (2009) &#8220;Human Action Recognition Using Global Point Feature Histograms and Action Shapes&#8221;, in Journal of Advanced Robotics, volume 23, pages 1873–1908, Koninklijke Brill NV, Leiden and The Robotics Society of Japan, 2009. [ DOI &#124; PDF] Abstract This paper investigates the recognition of [...]]]></description>
			<content:encoded><![CDATA[<p>Radu Bogdan Rusu,  Jan Bandouch, Franziska Meier,  Irfan Essa and Michael Beetz (2009) &#8220;Human Action Recognition Using Global Point Feature Histograms and Action Shapes&#8221;, in Journal of Advanced Robotics, volume 23, pages 1873–1908, Koninklijke Brill NV, Leiden and The Robotics Society of Japan, 2009. [ <a href="http://dx.doi.org/DOI:10.1163/016918609X12518783330243">DOI </a> | PDF]</p>
<p style="text-align: center;"><strong>Abstract</strong></p>
<p style="text-align: justify;">This paper investigates the recognition of human actions from three-dimensional (3-D) point clouds that encode the motions of people acting in sensor-distributed indoor environments. Data streams are time sequences of silhouettes extracted from cameras in the environment. From the 2-D silhouette contours we generate space–time streams by continuously aligning and stacking the contours along the time axis as third spatial dimension. The space–time stream of an observation sequence is segmented into parts corresponding to subactions using a pattern matching technique based on suffix trees and interval scheduling. Then, the segmented space–time shapes are processed by treating the shapes as 3-D point clouds and estimating global point feature histograms for them. The resultant models are clustered using statistical analysis and our experimental results indicate that the presented methods robustly derive different action classes. This holds despite large intra-class variance in the recorded datasets due to performances from different persons at different time intervals.</p>
<p style="text-align: justify;">© Koninklijke Brill NV, Leiden and The Robotics Society of Japan, 2009</p>
<p style="text-align: justify;">
<div id="attachment_625" class="wp-caption aligncenter" style="width: 512px"><img class="size-large wp-image-625  " title="2009-Rusu-etal-AR23-B" src="http://prof.irfanessa.com/wp-content/uploads/2009/10/2009-Rusu-etal-AR23-B-1024x193.png" alt="Overview of the approach." width="502" height="95" /><p class="wp-caption-text">Overview of the approach.</p></div>
<p><strong>Keywords: </strong>Action recognition, point cloud, global features, action segmentation</p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2009/10/29/paper-advanced-robotics-2009/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper ISMAR 2009: &#8220;Augmenting Aerial Earth Maps with Dynamic Information&#8221;</title>
		<link>http://prof.irfanessa.com/2009/10/20/augearth-ismar2009/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=augearth-ismar2009</link>
		<comments>http://prof.irfanessa.com/2009/10/20/augearth-ismar2009/#comments</comments>
		<pubDate>Tue, 20 Oct 2009 23:07:42 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Computational Journalism]]></category>
		<category><![CDATA[Computational Photography and Video]]></category>
		<category><![CDATA[Kihwan Kim]]></category>
		<category><![CDATA[Modeling and Animation]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[2009]]></category>
		<category><![CDATA[Animation]]></category>
		<category><![CDATA[Computational Photography]]></category>
		<category><![CDATA[Computational Video]]></category>
		<category><![CDATA[Computer Vision]]></category>
		<category><![CDATA[Crowdsourcing]]></category>
		<category><![CDATA[IEEE]]></category>
		<category><![CDATA[ISMAR]]></category>
		<category><![CDATA[Publications]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=556</guid>
		<description><![CDATA[Augmenting Aerial Earth Maps with Dynamic Information Kim, Oh, Lee, and Essa (2009), &#8220;Augmenting Aerial Earth Maps with Dynamic Information,&#8221; in Proceedings of IEEE International Symposium on Mixed and Augmented Reality (ISMAR), 2009. [PDF] [WEBSITE] [DOI] [BLOG] [BIBTEX] @inproceedings{2009-Kim-AAEMWDI, Author = {K. Kim and S. Oh and J. Lee and I. Essa}, Blog = {http://prof.irfanessa.com/2009/10/20/augearth-ismar2009/}, [...]]]></description>
			<content:encoded><![CDATA[<h3>Augmenting Aerial Earth Maps with Dynamic Information</h3>
<ul class="papercite_bibliography">
<li>        Kim, Oh, Lee, and Essa (2009), &#8220;Augmenting Aerial Earth Maps with Dynamic Information,&#8221; in <span style="font-style: italic">Proceedings of IEEE International Symposium on Mixed and Augmented Reality (ISMAR)</span>,  2009.      <a href="http://www.cc.gatech.edu/~irfan/p/2009-Kim-AAEMWDI.pdf" class='papercite_pdf' title='PDF'>[PDF]</a>     <a href="http://www.cc.gatech.edu/cpl/projects/augearth/" class='papercite_pdf' title='Project Website'>[WEBSITE]</a>             <a href='http://dx.doi.org/10.1109/ISMAR.2009.5336505' class='papercite_doi' title='View document on publisher site'> [DOI]</a>      <a href="http://prof.irfanessa.com/2009/10/20/augearth-ismar2009/" class='papercite_blog' title='BLOG'>[BLOG]</a>    <a href="javascript:void(0)" id="papercite_1" class="papercite_toggle">[BIBTEX]</a>
<pre class="papercite_bibtex" id="papercite_1_block"><code>@inproceedings{2009-Kim-AAEMWDI,
  Author = {K. Kim and S. Oh and J. Lee and I. Essa},
  Blog = {http://prof.irfanessa.com/2009/10/20/augearth-ismar2009/},
  Booktitle = {Proceedings of IEEE International Symposium on Mixed and Augmented Reality (ISMAR)},
  Date-Modified = {2011-12-13 15:15:34 +0000},
  Doi = {10.1109/ISMAR.2009.5336505},
  Month = {October},
  Pdf = {http://www.cc.gatech.edu/~irfan/p/2009-Kim-AAEMWDI.pdf},
  Title = {Augmenting Aerial Earth Maps with Dynamic Information},
  Url = {http://www.cc.gatech.edu/cpl/projects/augearth/},
  Year = {2009},
  Bdsk-Url-1 = {http://www.cc.gatech.edu/cpl/projects/augearth/},
  Bdsk-Url-2 = {http://dx.doi.org/10.1109/ISMAR.2009.5336505}}</code></pre>
</li>
</ul>
<h4 style="text-align: left;"><strong>Abstract</strong></h4>
<p style="text-align: left;">We introduce methods for augmenting aerial visualizations of Earth (from tools such as Google Earth or Microsoft Virtual Earth) with dynamic information obtained from videos. Our goal is to make Augmented Earth Maps that visualize the live broadcast of dynamic sceneries within a city. We propose different approaches to analyze videos of pedestrians and cars, under differing conditions and then augment Aerial Earth Maps (AEMs) with live and dynamic information. We also analyze natural phenomenon (clouds) and project information from these to the AEMs to add the visual reality.</p>
<p style="text-align: left;">For Journal Version of this paper, please see http://prof.irfanessa.com/2011/02/02/vr-2011/</p>
<p style="text-align: center;">
<p><a href="http://www.youtube.com/watch?v=TPk88soc2qw&#038;fmt=18">http://www.youtube.com/watch?v=TPk88soc2qw</a></p></p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2009/10/20/augearth-ismar2009/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Event (2009): IEEE Workshop on Computer Vision for Humanoid Robots in Real Environment</title>
		<link>http://prof.irfanessa.com/2009/09/23/event-2009-ieee-workshop-on-computer-vision-for-humanoid-robots-in-real-environment/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=event-2009-ieee-workshop-on-computer-vision-for-humanoid-robots-in-real-environment</link>
		<comments>http://prof.irfanessa.com/2009/09/23/event-2009-ieee-workshop-on-computer-vision-for-humanoid-robots-in-real-environment/#comments</comments>
		<pubDate>Wed, 23 Sep 2009 15:30:28 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Events]]></category>
		<category><![CDATA[2009]]></category>
		<category><![CDATA[Computer Vision]]></category>
		<category><![CDATA[Humanoids]]></category>
		<category><![CDATA[Robotics]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=560</guid>
		<description><![CDATA[IEEE Workshop on Computer Vision for Humanoid Robots in Real Environments. I am co-organizing the First IEEE Workshop on Computer Vision for Humanoids in conjunction with ICCV Conference in Kyoto, Japan.  This workshop will be held September 27, 2009. (9:30am &#8211; 6:00pm). The goal of this workshop is to bring together experts from the fields [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://humanoidscv.ime.cmc.osaka-u.ac.jp/">IEEE Workshop on Computer Vision for Humanoid Robots in Real Environments</a>.</p>
<p>I am co-organizing the First IEEE Workshop on Computer Vision for Humanoids in conjunction with ICCV Conference in Kyoto, Japan.  This workshop will be held September 27, 2009. (9:30am &#8211; 6:00pm).</p>
<p style="line-height: 20px;">The goal of this workshop is to bring together experts from the fields of computer vision and robotics that are working on humanoid robots with vision as one of the primary modalities. Topics of interest include and are not limited to:</p>
<ul class="list1" style="margin-top: 0.5em; margin-bottom: 0.5em; line-height: 20px; list-style-type: disc; padding-left: 16px; margin-left: 16px;">
<li>Visual Learning in Robots</li>
<li>Human Robot Interaction</li>
<li>Grasping and Manipulation</li>
<li>Learning by Demonstration</li>
<li>Task Learning for Robots</li>
<li>Activity Recognition and Discovery for Robot</li>
<li>Humanoid Navigation in Real Environments</li>
<li>Vision Devices and Systems for Robot Applications</li>
<li>Application of Humanoid Robots (Indoor/Outdoor, Entertainment)</li>
</ul>
<p style="line-height: 20px;">This is the first attempt at a workshop that crosses from Humanoids Research to Computer Vision Research.</p>
<p style="line-height: 20px;">The workshop includes six invited talks as well as an open poster session, where all participants are expected to present a poster describing their recent work.</p>
<p>Location: Kyoto University, Faculty of Engineering Bldg.#3, 2F, Room W201, in conjunction with ICCV. (See http://www.iccv2009.org/workshops/index.html).</p>
<p>For schedule, abstracts and other information, see the workshop <a href="http://humanoidscv.ime.cmc.osaka-u.ac.jp/">website</a>. More information about ICCV at http://www.iccv2009.org/.</p>
<div id="attachment_573" class="wp-caption aligncenter" style="width: 310px"><img class="size-medium wp-image-573" title="20090927_1491" src="http://prof.irfanessa.com/wp-content/uploads/2009/09/20090927_1491-300x168.jpg" alt="Invited Speakers and Organizers after the Workshop" width="300" height="168" /><p class="wp-caption-text">Invited Speakers and Organizers after the Workshop</p></div>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2009/09/23/event-2009-ieee-workshop-on-computer-vision-for-humanoid-robots-in-real-environment/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Served from: prof.irfanessa.com @ 2012-02-05 15:30:31 -->
