<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>prof.irfanessa.com &#187; Activity Recognition</title>
	<atom:link href="http://prof.irfanessa.com/tag/activity-recognition/feed/" rel="self" type="application/rss+xml" />
	<link>http://prof.irfanessa.com</link>
	<description>Irfan Essa&#039;s Academic Activities</description>
	<lastBuildDate>Wed, 25 Jan 2012 23:42:09 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	
		<item>
		<title>Paper in ICCV 2011: &#8220;Gaussian Process Regression Flow for Analysis of Motion Trajectories&#8221;</title>
		<link>http://prof.irfanessa.com/2011/10/28/gprf-iccv2011/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=gprf-iccv2011</link>
		<comments>http://prof.irfanessa.com/2011/10/28/gprf-iccv2011/#comments</comments>
		<pubDate>Fri, 28 Oct 2011 13:21:25 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Activity Recognition]]></category>
		<category><![CDATA[DARPA]]></category>
		<category><![CDATA[Kihwan Kim]]></category>
		<category><![CDATA[PAMI/ICCV/CVPR/ECCV]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[2011]]></category>
		<category><![CDATA[Computational Video]]></category>
		<category><![CDATA[Computer Vision]]></category>
		<category><![CDATA[ICCV]]></category>
		<category><![CDATA[Publications]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=786</guid>
		<description><![CDATA[Gaussian Process Regression Flow for Analysis of Motion Trajectories Kim, Lee, and Essa (2011), “Gaussian Process Regression Flow for Analysis of Motion Trajectories,” in Proceedings of IEEE International Conference on Computer Vision (ICCV), 2011. [PDF] [WEBSITE] [VIDEO] [BIBTEX] @inproceedings{Kim2011-GPRF, Author = {K. Kim and D. Lee and I. Essa}, Booktitle = {Proceedings of IEEE International Conference on Computer Vision [...]]]></description>
			<content:encoded><![CDATA[<h3 class="p1" style="text-align: justify;">Gaussian Process Regression Flow for Analysis of Motion Trajectories</h3>
<ul>
<li>Kim, Lee, and Essa (2011), “Gaussian Process Regression Flow for Analysis of Motion Trajectories,” in Proceedings of IEEE International Conference on Computer Vision (ICCV), 2011. <a title="PDF" href="http://www.kihwan23.com/papers/ICCV2011/gprf_iccv2011.pdf">[PDF]</a> <a title="Project Website" href="http://www.cc.gatech.edu/cpl/projects/gprf/">[WEBSITE]</a> <a title="VIDEO" href="http://www.youtube.com/watch?v=UtLr37hDQz0">[VIDEO]</a> <a id="papercite_2" class="papercite_toggle" href="javascript:void(0)">[BIBTEX]</a>
<pre id="papercite_2_block" class="papercite_bibtex"><code> @inproceedings{Kim2011-GPRF, Author = {K. Kim and D. Lee and I. Essa}, Booktitle = {Proceedings of IEEE International Conference on Computer Vision (ICCV)}, Month = {November}, Pdf = {http://www.cc.gatech.edu/~irfan/p/2011-Kim-GPRFAMT.pdf}, Publisher = {IEEE Computer Society}, Title = {Gaussian Process Regression Flow for Analysis of Motion Trajectories}, Url = {http://www.cc.gatech.edu/cpl/projects/gprf/}, Video = {http://www.youtube.com/watch?v=UtLr37hDQz0}, Year = {2011}}</code></pre>
</li>
</ul>
<p><strong style="text-align: justify;">Abstract</strong></p>
<p style="text-align: left;">Analysis and Recognition of motions and activities of objects in videos requires effective representations for analysis and matching of motion trajectories. In this paper, we introduce a new representation speciﬁcally aimed at matching motion trajectories. We model a trajectory as a continuous dense ﬂow ﬁeld from a sparse set of vector sequences using Gaussian Process Regression. Furthermore, we introduce a random sampling strategy for learning stable classes of motions from limited data.</p>
<p style="text-align: left;">Our representation allows for incrementally predicting possible paths and detecting anomalous events from online trajectories. This representation also supports matching of complex motions with acceleration changes and pauses or stops within a trajectory. We use the proposed approach for classifying and predicting motion trajectories in trafﬁc monitoring domains and test on several data sets. We show that our approach works well on various types of complete and incomplete trajectories from a variety of video data sets with different frame rates</p>
<p style="text-align: center;">
<p><a href="http://www.youtube.com/watch?v=UtLr37hDQz0&#038;fmt=18">http://www.youtube.com/watch?v=UtLr37hDQz0</a></p></p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2011/10/28/gprf-iccv2011/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Presentation (2011) at IBPRIA 2011: &#8220;Spatio-Temporal Video Analysis and Visual Activity Recognition&#8221;</title>
		<link>http://prof.irfanessa.com/2011/06/08/ibpria2011/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=ibpria2011</link>
		<comments>http://prof.irfanessa.com/2011/06/08/ibpria2011/#comments</comments>
		<pubDate>Wed, 08 Jun 2011 22:41:57 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Activity Recognition]]></category>
		<category><![CDATA[Computational Photography and Video]]></category>
		<category><![CDATA[Kihwan Kim]]></category>
		<category><![CDATA[Matthias Grundmann]]></category>
		<category><![CDATA[Multimedia]]></category>
		<category><![CDATA[PAMI/ICCV/CVPR/ECCV]]></category>
		<category><![CDATA[Presentations]]></category>
		<category><![CDATA[2011]]></category>
		<category><![CDATA[Computational Photography]]></category>
		<category><![CDATA[Computational Video]]></category>
		<category><![CDATA[Computer Vision]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=750</guid>
		<description><![CDATA[&#8220;Spatio-Temporal Video Analysis and Visual Activity Recognition&#8221; at the Iberian Conference on Pattern Recognition and Image Analysis  (IbPRIA) 2011 Conference in Las Palmas de Gran Canaria. Spain. June 8-10. Abstract My research group is focused on a variety of approaches for (a) low-level video analysis and synthesis and (b) recognizing activities in videos. In this talk, I [...]]]></description>
			<content:encoded><![CDATA[<p>&#8220;Spatio-Temporal Video Analysis and Visual Activity Recognition&#8221; at the <a href="http://www.ibpria2011.ulpgc.es/">Iberian Conference on Pattern Recognition and Image Analysis  (IbPRIA) 2011 Conference</a> in Las Palmas de Gran Canaria. Spain. June 8-10.</p>
<h4 style="text-align: left;"><strong>Abstract</strong></h4>
<p style="text-align: left;">My research group is focused on a variety of approaches for (a) low-level video analysis and synthesis and (b) recognizing activities in videos. In this talk, I will concentrate on two of our recent efforts. One effort aimed at robust spatio-temporal segmentation of video and another on using motion and flow to recognize and predict actions from video.</p>
<p style="text-align: left;">In the first part of the talk, I will present an efficient and scalable technique for spatio-temporal segmentation of long video sequences using a hierarchical graph-based algorithm. In this work, we begin by over segmenting a volumetric video graph into space-time regions grouped by appearance. We then construct a “region graph” over the obtained segmentation and iteratively repeat this process over multiple levels to create a tree of spatio-temporal segmentations. This hierarchical approach generates high quality segmentations, which are temporally coherent with stable region boundaries, and allows subsequent applications to choose from varying levels of granularity. We further improve segmentation quality by using dense optical flow to guide temporal connections in the initial graph. I will demonstrate a variety of examples of how this robust segmentation works, and will show additional examples of video-retargeting that use spatio-temporal saliency derived from this segmentation approach. (Matthias Grundmann, Vivek Kwatra, Mei Han, Irfan Essa, CVPR 2010, in collaboration with Google Research).</p>
<p style="text-align: left;">In the second part of this talk, I will show that constrained multi-agent events can be analyzed and even predicted from video. Such analysis requires estimating the global movements of all players in the scene at any time, and is needed for modeling and predicting how the multi-agent play evolves over time on the playing field. To this end, we propose a novel approach to detect the locations of where the play evolution will proceed, e.g. where interesting events will occur, by tracking player positions and movements over time. To achieve this, we extract the ground level sparse movement of players in each time-step, and then generate a dense motion field. Using this field we detect locations where the motion converges, implying positions towards which the play is evolving. I will show examples of how we have tested this approach for soccer, basketball and hockey. (Kihwan Kim, Matthias Grundmann, Ariel Shamir, Iain Matthews, Jessica Hodgins, Irfan Essa, CVPR 2010, in collaboration with Disney Research).</p>
<p style="text-align: left;">Time permitting, I will show some more videos of our recent work on video analysis and synthesis. For more information, papers, and videos, see <a href="http://prof.irfanessa.com/">my website</a>.</p>
<p style="text-align: center;"><img class="aligncenter" src="http://ibpria2011.ulpgc.es/files/images/ibpria2011_b02.jpg" alt="" width="500" /></p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2011/06/08/ibpria2011/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Fall 2010 GRASP Seminar &#8211; Irfan Essa, Georgia Institute Of Technology, &#8220;Two Talks On Video Analysis: 1 Segmentation Of Video And 2 Prediction Of Actions In Video&#8221; &#124; GRASP Laboratory &#8211; University Of Pennsylvania</title>
		<link>http://prof.irfanessa.com/2010/09/20/fall-2010-grasp-seminar-irfan-essa-georgia-institute-of-technology-two-talks-on-video-analysis-1-segmentation-of-video-and-2-prediction-of-actions-in-video-grasp-laboratory-university-of/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=fall-2010-grasp-seminar-irfan-essa-georgia-institute-of-technology-two-talks-on-video-analysis-1-segmentation-of-video-and-2-prediction-of-actions-in-video-grasp-laboratory-university-of</link>
		<comments>http://prof.irfanessa.com/2010/09/20/fall-2010-grasp-seminar-irfan-essa-georgia-institute-of-technology-two-talks-on-video-analysis-1-segmentation-of-video-and-2-prediction-of-actions-in-video-grasp-laboratory-university-of/#comments</comments>
		<pubDate>Mon, 20 Sep 2010 19:37:38 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Computational Photography and Video]]></category>
		<category><![CDATA[Presentations]]></category>
		<category><![CDATA[2010]]></category>
		<category><![CDATA[Activity Recognition]]></category>
		<category><![CDATA[Computational Photography]]></category>
		<category><![CDATA[Computational Video]]></category>
		<category><![CDATA[Computer Vision]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=680</guid>
		<description><![CDATA[Fall 2010 GRASP Seminar &#8211; Irfan Essa, Georgia Institute Of Technology, &#8220;Two Talks On Video Analysis: 1 Segmentation Of Video And 2 Prediction Of Actions In Video&#8221; &#124; GRASP Laboratory &#8211; University Of Pennsylvania. Friday September 24, 2010 from 11:00am to 12:00pm My research group is focused on a variety if approaches for video analysis [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://grasp.upenn.edu/seminars/Irfan_Essa">Fall 2010 GRASP Seminar &#8211; Irfan Essa, Georgia Institute Of Technology, &#8220;Two Talks On Video Analysis: 1 Segmentation Of Video And 2 Prediction Of Actions In Video&#8221; | GRASP Laboratory &#8211; University Of Pennsylvania</a>.</p>
<p><span style="font-family: Verdana, Arial; line-height: 16px; font-size: 10px; color: #3c3c3c;"><em>Friday September 24, 2010 from 11:00am to 12:00pm</em></span></p>
<p><span style="font-family: Verdana, Arial; line-height: 16px; font-size: 10px; color: #3c3c3c;"></p>
<div id="main_content">
<div class="left_column no-right" style="float: none; width: 891px;">
<div class="seminar content clear" style="clear: both;">
<p>My research group is focused on a variety if approaches for video analysis and synthesis. In this talk, I will focus on two of our recent efforts.  One effort aimed at robust spatio-temporal segmentation of video and another on using motion and flow to predict actions from video.</p>
<p>In the first part of the talk, I will present an efficient and scalable technique for spatio-temporal segmentation of long video sequences using a hierarchical graph-based algorithm. In this effort, we begin by over segmenting a volumetric video graph into space-time regions grouped by appearance. We then construct a “region graph” over the obtained segmentation and iteratively repeat this process over multiple levels to create a tree of spatio-temporal segmentations. This hierarchical approach generates high quality segmentations, which are temporally coherent with stable region boundaries, and allows subsequent applications to choose from varying levels of granularity. We further improve segmentation quality by using dense optical flow to guide temporal connections in the initial graph. I will demonstrate a variety of examples of how this robust segmentation works, and will show additional examples of video-retargeting that use the saliency from this segmentation approach.  (Matthias Grundmann, Vivek Kwatra, Mei Han, Irfan Essa, CVPR 2010, in collaboration with Google Research).</p>
<p>In the second part of this talk, I will show that constrained multi-agent events can be analyzed and even predicted from video. Such analysis requires estimating the global movements of all players in the scene at any time, and is needed for modeling and predicting how the multi-agent play evolves over time on the field. To this end, we propose a novel approach to detect the locations of where the play evolution will proceed, e.g. where interesting events will occur, by tracking player positions and movements over time. To achieve this, we extract the ground level sparse movement of players in each time-step, and then generate a dense motion field. Using this field we detect locations where the motion converges, implying positions towards which the play is evolving. I will show examples of how we have tested this approach for soccer, basketball and hockey. (Kihwan Kim, Matthias Grundmann, Ariel Shamir, Iain Matthews, Jessica Hodgins, Irfan Essa, CVPR 2010, in collaboration with Disney Research).</p>
<p><span>Time permitting, I will show some more videos of our recent work on video analysis and synthesis. For more information, papers, and videos, see my website at<a class="smarterwiki-linkify" style="outline-style: none; outline-width: initial; outline-color: initial; color: #9a0000; text-decoration: none; border: initial none initial;" href="http://prof.irfanessa.com/">http://prof.irfanessa.com/</a></span><br />
<strong>Presenter&#8217;s Biography:</strong></p>
<p>Irfan Essa is a Professor in the School of Interactive Computing(iC) of the College of Computing (CoC), and Adjunct Professor in the School of Electrical and Computer Engineering, Georgia Institute of Technology (GA Tech), in Atlanta, Georgia, USA.</p>
<p>Irfan Essa works in the areas of Computer Vision, Computer Graphics, Computational Perception, Robotics and Computer Animation, with potential impact on Video Analysis and Production (e.g., Computational Photography &amp; Video, Image-based Modeling and Rendering, etc.) Human Computer Interaction, and Artificial Intelligence research. Specifically, he is interested in the analysis, interpretation, authoring, and synthesis (of video), with the goals of building aware environments, recognizing, modeling human activities, and behaviors, and developing dynamic and generative representations of time-varying streams. He has published over a 150 scholarly articles in leading journals and conference venues on these topics and has awards for his research and teaching.</p>
<p>He joined Georgia Tech Faculty in 1996 after his earning his MS (1990), Ph.D. (1994), and holding research faculty position at the Massachusetts Institute of Technology (Media Lab) [1988-1996]. His Doctoral Research was in the area of Facial Recognition, Analysis, and Synthesis.</p>
<div></div>
</div>
</div>
</div>
<p></span></p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2010/09/20/fall-2010-grasp-seminar-irfan-essa-georgia-institute-of-technology-two-talks-on-video-analysis-1-segmentation-of-video-and-2-prediction-of-actions-in-video-grasp-laboratory-university-of/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>In the News (2010): DARPA Awards Kitware a $13.8 Million Contract for Online Threat Detection and Forensic Analysis in Wide-Area Motion Imagery</title>
		<link>http://prof.irfanessa.com/2010/09/02/kitware-persea/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=kitware-persea</link>
		<comments>http://prof.irfanessa.com/2010/09/02/kitware-persea/#comments</comments>
		<pubDate>Thu, 02 Sep 2010 13:22:24 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Activity Recognition]]></category>
		<category><![CDATA[Grant Schindler]]></category>
		<category><![CDATA[PERSEAS]]></category>
		<category><![CDATA[Visual Surviellance]]></category>
		<category><![CDATA[2010]]></category>
		<category><![CDATA[DARPA]]></category>
		<category><![CDATA[Funding]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=836</guid>
		<description><![CDATA[via Kitware &#8211; News: DARPA Awards Kitware a $13.8 Million Contract for Online Threat Detection and Forensic Analysis in Wide-Area Motion Imagery. Kitware has received a $13,883,314 contract from Defense Advanced Research Projects Agency (DARPA) to develop a software system capable of automatically and interactively discovering actionable intelligence from wide area motion imagery (WAMI) of complex [...]]]></description>
			<content:encoded><![CDATA[<p>via <a href="http://www.kitware.com/news/home/browse/Kitware%3F2010_07_19%26DARPA+Awards+Kitware+a+$13.8+Million+Contract+for+Online+Threat+Detection+and+Forensic+Analysis+in+Wide-Area+Motion+Imagery">Kitware &#8211; News: DARPA Awards Kitware a $13.8 Million Contract for Online Threat Detection and Forensic Analysis in Wide-Area Motion Imagery</a>.</p>
<blockquote><p>Kitware has received a $13,883,314 contract from Defense Advanced Research Projects Agency (DARPA) to develop a software system capable of automatically and interactively discovering actionable intelligence from wide area motion imagery (WAMI) of complex urban, suburban, and rural environments.</p>
<p>The primary information elements in WAMI data are moving entities in the context of roads, buildings, and other scene features. These entities, while exploitable, often yield fragmented tracks in complex urban environments due to occlusions, stops, and other factors. Kitware&#8217;s software system will use algorithmic solutions to associate tracks and then identify and integrate local events to detect potential threats and perform forensic analysis.</p>
<p>The developed algorithms will form the basis of a software prototype called the Persistent Stare Exploitation and Analysis System (PerSEAS) that will significantly augment an end-user&#8217;s ability to discover novel intelligence using models of activities, normalcy, and context. Since the vast majority of events are normal and pose no threat, the models must cross-integrate singular events to discover relationships and anomalies that are indicative of suspicious behavior or match previously learned &#8211; or defined &#8211; threat activity.</p>
<p>The advanced PerSEAS system will markedly improve an analyst&#8217;s ability to handle burgeoning WAMI data and reduce the time required to perform many current exploitation tasks, greatly enhancing the military&#8217;s capability to analyze and utilize the data for forensic analysis and through the issuance of timely threat alerts with a minimal number of false alarms.</p>
<p>Due to the complex, multi-disciplinary nature of the research, Kitware will partner with academic experts in the fields of computer vision, probabilistic reasoning, machine learning and other related domains. Phase I of the research is expected to be completed in two years.</p>
<p>The awarded contract will expand Kitware&#8217;s leadership in the field of computer vision, video analysis and advanced visualization software. The project will build upon our previous DARPA-sponsored research into content-based video retrieval on the VIRAT program; anomaly detection on the PANDA program; and the recognition of complex multi-agent activities in video.</p>
<p>To meet the PerSEAS program&#8217;s needs, Kitware has assembled a world-class team including four leading defense technology companies, Northrop Grumman Corporation, ; Honeywell Automation and Control Solutions Laboratories, Aptima, Inc., and Navia, Inc. As well as multiple internationally-renowned research institutions, including: the University of California, Berkeley; Computer Vision Laboratory, University of Maryland; Rensselaer Polytechnic Institute; the Computer Vision Lab at the University of Central Florida; <strong>the School of Interactive Computing at Georgia Tech and its affiliated Center for Robotics &amp; Intelligent Machines</strong>; and Columbia University.</p></blockquote>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2010/09/02/kitware-persea/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper in CVPR (2010): &#8220;Motion Field to Predict Play Evolution in Dynamic Sport Scenes</title>
		<link>http://prof.irfanessa.com/2010/06/13/playevolution-cvpr2010/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=playevolution-cvpr2010</link>
		<comments>http://prof.irfanessa.com/2010/06/13/playevolution-cvpr2010/#comments</comments>
		<pubDate>Sun, 13 Jun 2010 15:25:10 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Activity Recognition]]></category>
		<category><![CDATA[Jessica Hodgins]]></category>
		<category><![CDATA[Kihwan Kim]]></category>
		<category><![CDATA[Matthias Grundmann]]></category>
		<category><![CDATA[PAMI/ICCV/CVPR/ECCV]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[Sports Visualization]]></category>
		<category><![CDATA[2010]]></category>
		<category><![CDATA[Computational Video]]></category>
		<category><![CDATA[Computer Vision]]></category>
		<category><![CDATA[CVPR]]></category>
		<category><![CDATA[Publications]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=659</guid>
		<description><![CDATA[Kihwan Kim, Matthias Grundmann, Ariel Shamir, Iain Matthews, Jessica Hodgins, Irfan Essa (2010) &#8220;Motion Field to Predict Play Evolution in Dynamic Sport Scenes&#8221; in Proceedings of IEEE Computer Vision and Pattern Recognition Conference (CVPR), San Francisco, CA, USA, June 2010 [PDF][Website][DOI][Video (Youtube)]. Abstract Videos of multi-player team sports provide a challenging domain for dynamic scene analysis. Player actions [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.cc.gatech.edu/~kihwan23">Kihwan Kim</a>, <a href="http://www.cc.gatech.edu/~grundman">Matthias Grundmann</a>, <a href="http://www.faculty.idc.ac.il/arik/">Ariel Shamir</a>, Iain Matthews, Jessica Hodgins, <a href="http://www.irfanessa.com">Irfan Essa</a> (2010) &#8220;<a href="http://www.cc.gatech.edu/cpl/projects/playevolution/">Motion Field to Predict Play Evolution in Dynamic Sport Scenes</a>&#8221; in Proceedings of <a href="http://cvl.umiacs.umd.edu/conferences/cvpr2010/" target="_blank">IEEE Computer Vision and Pattern Recognition Conference (CVPR)</a>, San Francisco, CA, USA, June 2010 [<a href="http://www.cc.gatech.edu/cpl/projects/playevolution/cvpr2010-pe.pdf">PDF</a>][<a href="http://www.cc.gatech.edu/cpl/projects/playevolution/">Website</a>][DOI][<a href="http://www.youtube.com/watch?v=jrksnCR1S0s&amp;feature=player_embedded">Video (Youtube)</a>].</p>
<p style="text-align: center;"><strong>Abstract</strong></p>
<p style="text-align: justify;">Videos of multi-player team sports provide a challenging domain for dynamic scene analysis. Player actions and interactions are complex as they are driven by many factors, such as the short-term goals of the individual player, the overall team strategy, the rules of the sport, and the current context of the game. We show that constrained multi-agent events can be analyzed and even predicted from video. Such analysis requires estimating the global movements of all players in the scene at any time, and is needed for modeling and predicting how the multi-agent play evolves over time on the field. To this end, we propose a novel approach to detect the locations of where the play evolution will proceed, e.g. where interesting events will occur, by tracking player positions and movements over time. We start by extracting the ground level sparse movement of players in each time-step, and then generate a dense motion field. Using this field we detect locations where the motion converges, implying positions towards which the play is evolving. We evaluate our approach by analyzing videos of a variety of complex soccer plays.</p>
<p style="text-align: center;"><a href="http://www.cc.gatech.edu/cpl/projects/playevolution/"><img class="aligncenter" title="CVPR 2010 Paper on Play Evolution" src="http://www.cc.gatech.edu/cpl/projects/playevolution/images/teaser.jpg" alt="CVPR 2010 Paper on Play Evolution" width="500" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2010/06/13/playevolution-cvpr2010/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper in CVPR (2010): &#8220;Player Localization Using Multiple Static Cameras for Sports Visualization&#8221;</title>
		<link>http://prof.irfanessa.com/2010/06/13/playerlocalization-cvpr2010/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=playerlocalization-cvpr2010</link>
		<comments>http://prof.irfanessa.com/2010/06/13/playerlocalization-cvpr2010/#comments</comments>
		<pubDate>Sun, 13 Jun 2010 14:22:44 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Activity Recognition]]></category>
		<category><![CDATA[Jessica Hodgins]]></category>
		<category><![CDATA[Kihwan Kim]]></category>
		<category><![CDATA[Matthias Grundmann]]></category>
		<category><![CDATA[Numerical Machine Learning]]></category>
		<category><![CDATA[PAMI/ICCV/CVPR/ECCV]]></category>
		<category><![CDATA[Raffay Hamid]]></category>
		<category><![CDATA[Sports Visualization]]></category>
		<category><![CDATA[2010]]></category>
		<category><![CDATA[Computational Video]]></category>
		<category><![CDATA[Computer Vision]]></category>
		<category><![CDATA[CVPR]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=675</guid>
		<description><![CDATA[Raffay Hamid, Ram Krishan Kumar, Matthias Grundmann, Kihwan Kim, Irfan Essa, Jessica Hodgins (2010), &#8220;Player Localization Using Multiple Static Cameras for Sports Visualization&#8221; In Proceedings of IEEE Computer Vision and Pattern Recognition Conference (CVPR), San Francisco, CA, USA, June 2010 [PDF][Website][DOI][Video (Youtube)]. Abstract We present a novel approach for robust localization of multiple people observed using multiple [...]]]></description>
			<content:encoded><![CDATA[<p>Raffay Hamid, Ram Krishan Kumar, Matthias Grundmann, Kihwan Kim, Irfan Essa, Jessica Hodgins (2010), &#8220;Player Localization Using Multiple Static Cameras for Sports Visualization&#8221; In <em>Proceedings of </em><a href="http://cvl.umiacs.umd.edu/conferences/cvpr2010/" target="_blank"><em>IEEE Computer Vision and Pattern Recognition Conference (CVPR)</em></a>, San Francisco, CA, USA, June 2010 [<a href="http://www.raffayhamid.com/hamid_cvpr2010.pdf" target="_blank">PDF</a>][<a href="http://www.raffayhamid.com/sports_viz.shtml" target="_blank">Website</a>][DOI][<a href="http://www.youtube.com/watch?v=VwzAMi9pUDQ&amp;feature=player_embedded" target="_blank">Video (Youtube)</a>].</p>
<p style="text-align: center;"><strong>Abstract</strong></p>
<p style="text-align: justify;">We present a novel approach for robust localization of multiple people observed using multiple cameras. We usethis location information to generate sports visualizations,which include displaying a virtual offside line in soccer games, and showing players&#8217; positions and motion patterns.Our main contribution is the modeling and analysis for the problem of fusing corresponding players&#8217; positional informationas finding minimum weight K-length cycles in complete K-partite graphs. To this end, we use a dynamic programmingbased approach that varies over a continuum of being maximally to minimally greedy in terms of the numberof paths explored at each iteration. We present an end-to-end sports visualization framework that employs our proposed algorithm-class. We demonstrate the robustness of our framework by testing it on 60; 000 frames of soccerfootage captured over 5 different illumination conditions, play types, and team attire.</p>
<p style="text-align: center;"><a href="http://prof.irfanessa.com/wp-content/uploads/2010/08/2010-Hamid-CVPR2010.png"><img class="aligncenter size-full wp-image-676" title="2010-Hamid-CVPR2010" src="http://prof.irfanessa.com/wp-content/uploads/2010/08/2010-Hamid-CVPR2010.png" alt="Teaser Image from CVPR 2010 paper" width="500" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2010/06/13/playerlocalization-cvpr2010/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Funding (2010): DARPA Funding on &#8220;PERSISTENT STARE EXPLOITATION AND ANALYSIS SYSTEM (PERSEAS)&#8221;</title>
		<link>http://prof.irfanessa.com/2010/06/01/funding-2010-darpa-funding-on-persistent-stare-exploitation-and-analysis-system-perseas/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=funding-2010-darpa-funding-on-persistent-stare-exploitation-and-analysis-system-perseas</link>
		<comments>http://prof.irfanessa.com/2010/06/01/funding-2010-darpa-funding-on-persistent-stare-exploitation-and-analysis-system-perseas/#comments</comments>
		<pubDate>Tue, 01 Jun 2010 13:09:01 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Grant Schindler]]></category>
		<category><![CDATA[PERSEAS]]></category>
		<category><![CDATA[Visual Surviellance]]></category>
		<category><![CDATA[2010]]></category>
		<category><![CDATA[Activity Recognition]]></category>
		<category><![CDATA[Computer Vision]]></category>
		<category><![CDATA[DARPA]]></category>
		<category><![CDATA[Funding]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=831</guid>
		<description><![CDATA[ Persistent Stare Exploitation and Analysis System (PerSEAS) Part of the team led by Kitware Inc to work on Defense Advanced Research Projects Agency &#8211; Persistent Stare Exploitation and Analysis System (PerSEAS). The Persistent Stare Exploitation and Analysis System (PerSEAS) program is developing the capability to automatically and interactively identify potential threats as they emerge based on the [...]]]></description>
			<content:encoded><![CDATA[<h3> <a href="http://www.darpa.mil/Our_Work/I2O/Programs/Persistent_Stare_Exploitation_and_Analysis_System_(PerSEAS).aspx">Persistent Stare Exploitation and Analysis System (PerSEAS)</a></h3>
<p>Part of the team led by <a href="http://www.kitware.com/">Kitware Inc</a> to work on <a href="http://www.darpa.mil/Our_Work/I2O/Programs/Persistent_Stare_Exploitation_and_Analysis_System_(PerSEAS).aspx">Defense Advanced Research Projects Agency &#8211; Persistent Stare Exploitation and Analysis System (PerSEAS)</a>.</p>
<blockquote><p>The Persistent Stare Exploitation and Analysis System (PerSEAS) program is developing the capability to automatically and interactively identify potential threats as they emerge based on the correlation of multiple disparate activities and events in wide area motion imagery (WAMI) and multi-INT data.  PerSEAS will enable new methods of threat hypothesis adjudication and forensic analysis through activity-based modeling and inferencing capabilities.</p></blockquote>
<p>See</p>
<ul>
<li><a href="https://www.fbo.gov/index?s=opportunity&amp;mode=form&amp;id=f0144cfa4fb1ebbd4ac95f43a3a24540&amp;tab=core&amp;_cview=0">DARPA BAA</a></li>
<li><a href="http://www.wired.com/dangerroom/2009/09/pentagon-spy-cams-to-find-threats-in-weak-evidence/">Wired Article on this project</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2010/06/01/funding-2010-darpa-funding-on-persistent-stare-exploitation-and-analysis-system-perseas/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper in Advanced Robotics (2009): &#8220;Human Action Recognition Using Global Point Feature Histograms and Action Shapes&#8221;</title>
		<link>http://prof.irfanessa.com/2009/10/29/paper-advanced-robotics-2009/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=paper-advanced-robotics-2009</link>
		<comments>http://prof.irfanessa.com/2009/10/29/paper-advanced-robotics-2009/#comments</comments>
		<pubDate>Thu, 29 Oct 2009 14:58:56 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Activity Recognition]]></category>
		<category><![CDATA[Franzi Meier]]></category>
		<category><![CDATA[Intelligent Environments]]></category>
		<category><![CDATA[Michael Beetz]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[2009]]></category>
		<category><![CDATA[Computer Vision]]></category>
		<category><![CDATA[Publications]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=622</guid>
		<description><![CDATA[Radu Bogdan Rusu, Jan Bandouch, Franziska Meier, Irfan Essa and Michael Beetz (2009) &#8220;Human Action Recognition Using Global Point Feature Histograms and Action Shapes&#8221;, in Journal of Advanced Robotics, volume 23, pages 1873–1908, Koninklijke Brill NV, Leiden and The Robotics Society of Japan, 2009. [ DOI &#124; PDF] Abstract This paper investigates the recognition of [...]]]></description>
			<content:encoded><![CDATA[<p>Radu Bogdan Rusu,  Jan Bandouch, Franziska Meier,  Irfan Essa and Michael Beetz (2009) &#8220;Human Action Recognition Using Global Point Feature Histograms and Action Shapes&#8221;, in Journal of Advanced Robotics, volume 23, pages 1873–1908, Koninklijke Brill NV, Leiden and The Robotics Society of Japan, 2009. [ <a href="http://dx.doi.org/DOI:10.1163/016918609X12518783330243">DOI </a> | PDF]</p>
<p style="text-align: center;"><strong>Abstract</strong></p>
<p style="text-align: justify;">This paper investigates the recognition of human actions from three-dimensional (3-D) point clouds that encode the motions of people acting in sensor-distributed indoor environments. Data streams are time sequences of silhouettes extracted from cameras in the environment. From the 2-D silhouette contours we generate space–time streams by continuously aligning and stacking the contours along the time axis as third spatial dimension. The space–time stream of an observation sequence is segmented into parts corresponding to subactions using a pattern matching technique based on suffix trees and interval scheduling. Then, the segmented space–time shapes are processed by treating the shapes as 3-D point clouds and estimating global point feature histograms for them. The resultant models are clustered using statistical analysis and our experimental results indicate that the presented methods robustly derive different action classes. This holds despite large intra-class variance in the recorded datasets due to performances from different persons at different time intervals.</p>
<p style="text-align: justify;">© Koninklijke Brill NV, Leiden and The Robotics Society of Japan, 2009</p>
<p style="text-align: justify;">
<div id="attachment_625" class="wp-caption aligncenter" style="width: 512px"><img class="size-large wp-image-625  " title="2009-Rusu-etal-AR23-B" src="http://prof.irfanessa.com/wp-content/uploads/2009/10/2009-Rusu-etal-AR23-B-1024x193.png" alt="Overview of the approach." width="502" height="95" /><p class="wp-caption-text">Overview of the approach.</p></div>
<p><strong>Keywords: </strong>Action recognition, point cloud, global features, action segmentation</p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2009/10/29/paper-advanced-robotics-2009/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper in Artificial Intelligence (2009): &#8220;A novel sequence representation for unsupervised analysis of human activities&#8221;</title>
		<link>http://prof.irfanessa.com/2009/09/20/hamid-aij2009/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=hamid-aij2009</link>
		<comments>http://prof.irfanessa.com/2009/09/20/hamid-aij2009/#comments</comments>
		<pubDate>Sun, 20 Sep 2009 16:33:44 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Aaron Bobick]]></category>
		<category><![CDATA[Activity Recognition]]></category>
		<category><![CDATA[Charles Isbell]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[Raffay Hamid]]></category>
		<category><![CDATA[Siddhartha Maddi]]></category>
		<category><![CDATA[2009]]></category>
		<category><![CDATA[AI]]></category>
		<category><![CDATA[AIJ]]></category>
		<category><![CDATA[Aware Home]]></category>
		<category><![CDATA[Publications]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=662</guid>
		<description><![CDATA[A novel sequence representation for unsupervised analysis of human activities Raffay Hamid, Siddhartha Maddi, Amos Johnson, Aaron Bobick, Irfan Essaand Charles Isbell (2009) &#8220;A novel sequence representation for unsupervised analysis of human activities&#8221; in Artificial Intelligence, Volume 173, Issue 14, September 2009, Pages 1221-1244. [PDF][DOI][Science Direct] Hamid, Maddi, Johnson, Bobick, Essa, and Isbell (2009), &#8220;A [...]]]></description>
			<content:encoded><![CDATA[<h3>A novel sequence representation for unsupervised analysis of human activities</h3>
<ul>
<li>Raffay Hamid, Siddhartha Maddi, Amos Johnson, Aaron Bobick, Irfan Essaand Charles Isbell (2009) &#8220;A novel sequence representation for unsupervised analysis of human activities&#8221; in <a href="http://www.sciencedirect.com/science/journal/00043702">Artificial Intelligence</a>, <a href="http://prof.irfanessa.com/science?_ob=PublicationURL&amp;_tockey=%23TOC%235617%232009%23998269985%231345081%23FLP%23&amp;_cdi=5617&amp;_pubType=J&amp;view=c&amp;_auth=y&amp;_acct=C000050221&amp;_version=1&amp;_urlVersion=0&amp;_userid=10&amp;md5=3bc8fa81ab117ec54264b40e31280668">Volume 173, Issue 14</a>, September 2009, Pages 1221-1244. [<a href="http://www.raffayhamid.com/hamid_aij_09.pdf">PDF</a>][<a href="http://dx.doi.org/10.1016/j.artint.2009.05.002">DOI</a>][<a href="http://www.sciencedirect.com/science?_ob=ArticleURL&amp;_udi=B6TYF-4WDGCS0-3&amp;_user=10&amp;_coverDate=09/30/2009&amp;_rdoc=1&amp;_fmt=high&amp;_orig=search&amp;_sort=d&amp;_docanchor=&amp;view=c&amp;_searchStrId=1436844856&amp;_rerunOrigin=google&amp;_acct=C000050221&amp;_version=1&amp;_urlVersion=0&amp;_userid=10&amp;md5=b37382908cebaaa08c0fba0438d6eca8">Science Direct</a>]</li>
</ul>
<ul class="papercite_bibliography">
<li>        Hamid, Maddi, Johnson, Bobick, Essa, and Isbell (2009), &#8220;A Novel Sequence Representation for Unsupervised Analysis of Human Activities,&#8221; <span style="font-style: italic">Artificial Intelligence Journal</span>, 2009.                             <a href="javascript:void(0)" id="papercite_1" class="papercite_toggle">[BIBTEX]</a>
<pre class="papercite_bibtex" id="papercite_1_block"><code>@article{2009-Hamid-NSRUAHA,
  Author = {R. Hamid and S. Maddi and A. Johnson and A. Bobick and I. Essa and C. Isbell},
  Date-Modified = {2011-12-08 21:27:48 +0000},
  Journal = {Artificial Intelligence Journal},
  Month = {May},
  Title = {A Novel Sequence Representation for Unsupervised Analysis of Human Activities},
  Year = {2009}}</code></pre>
</li>
</ul>
<h4 style="text-align: left;">Abstract</h4>
<p style="text-align: justify;">Formalizing computational models for everyday human activities remains an open challenge. Many previous approaches towards this end assume prior knowledge about the structure of activities, using which explicitly defined models are learned in a completely supervised manner. For a majority of everyday environments however, the structure of the in situ activities is generally not known a priori. In this paper we investigate knowledge representations and manipulation techniques that facilitate learning of human activities in a minimally supervised manner. The key contribution of this work is the idea that global structural information of human activities can be encoded using a subset of their local event subsequences, and that this encoding is sufficient for activity-class discovery and classification.</p>
<p style="text-align: justify;">In particular, we investigate modeling activity sequences in terms of their constituent subsequences that we call event n-grams. Exploiting this representation, we propose a computational framework to automatically discover the various activity-classes taking place in an environment. We model these activity-classes as maximally similar activity-cliques in a completely connected graph of activities, and describe how to discover them efficiently. Moreover, we propose methods for finding characterizations of these discovered classes from a holistic as well as a by-parts perspective. Using such characterizations, we present a method to classify a new activity to one of the discovered activity-classes, and to automatically detect whether it is anomalous with respect to the general characteristics of its membership class. Our results show the efficacy of our approach in a variety of everyday environments.</p>
<p>Keywords: Temporal reasoning; Scene analysis; Computer vision</p>
<p style="text-align: center;"><a href="http://prof.irfanessa.com/wp-content/uploads/2010/08/2009-Hamid-AIJ.png"><img class="aligncenter size-full wp-image-663" title="2009-Hamid-AIJ" src="http://prof.irfanessa.com/wp-content/uploads/2010/08/2009-Hamid-AIJ.png" alt="Hamid et al AIJ Paper" width="500" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2009/09/20/hamid-aij2009/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper: ICPR (2008) &#8220;3D Shape Context and Distance Transform for Action Recognition&#8221;</title>
		<link>http://prof.irfanessa.com/2008/12/08/paper-icpr-2008-3d-shape-context-and-distance-transform-for-action-recognition/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=paper-icpr-2008-3d-shape-context-and-distance-transform-for-action-recognition</link>
		<comments>http://prof.irfanessa.com/2008/12/08/paper-icpr-2008-3d-shape-context-and-distance-transform-for-action-recognition/#comments</comments>
		<pubDate>Mon, 08 Dec 2008 20:22:26 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Activity Recognition]]></category>
		<category><![CDATA[Aware Home]]></category>
		<category><![CDATA[Face and Gesture]]></category>
		<category><![CDATA[Franzi Meier]]></category>
		<category><![CDATA[Matthias Grundmann]]></category>
		<category><![CDATA[PAMI/ICCV/CVPR/ECCV]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[2008]]></category>
		<category><![CDATA[Computer Vision]]></category>

		<guid isPermaLink="false">http://academics.irfanessa.com/?p=146</guid>
		<description><![CDATA[M. Grundmann, F. Meier, and I. Essa (2008) &#8220;3D Shape Context and Distance Transform for Action Recognition&#8221;, In Proceedings of International Conference on Pattern Recognition (ICPR) 2008, Tampa, FL. [Project Page &#124; DOI &#124; PDF] ABSTRACT We propose the use of 3D (2D+time) Shape Context to recognize the spatial and temporal details inherent in human [...]]]></description>
			<content:encoded><![CDATA[<p style="text-align: left;">M. Grundmann, F. Meier, and I. Essa (2008) &#8220;3D Shape Context and Distance Transform for Action Recognition&#8221;, In <em>Proceedings of <a href="http://www.icpr2008.org/" target="_blank">International Conference on Pattern Recognition</a></em> (ICPR) 2008, Tampa, FL. [<a href="http://www.mgrundmann.com/icpr2008.html" target="_blank">Project Page</a> | <a href="http://dx.doi.org/10.1109/ICPR.2008.4761435" target="_blank">DOI</a> | <a href="http://www.mgrundmann.com/pdfs/icpr2008.pdf" target="_blank">PDF</a>]</p>
<p style="text-align: center;">ABSTRACT</p>
<p style="text-align: justify;"><a href="http://academics.irfanessa.com/wp-content/uploads/2008/08/3dfigure_feat_small.png"><img class="alignleft size-medium wp-image-163" title="3dfigure_feat_small" src="http://academics.irfanessa.com/wp-content/uploads/2008/08/3dfigure_feat_small-300x179.png" alt="" width="300" height="179" /></a>We propose the use of 3D (2D+time) Shape Context to recognize the spatial and temporal details inherent in human actions. We represent an action in a video sequence by a 3D point cloud extracted by sampling 2D silhouettes over time. A non-uniform sampling method is introduced that gives preference to fast moving body parts using a Euclidean 3D Distance Transform. Actions are then classified by matching the extracted point clouds. Our proposed approach is based on a global matching and does not require specific training to learn the model. We test the approach thoroughly on two publicly available datasets and compare to several state-of-the-art methods. The achieved classification accuracy is on par with or superior to the best results reported to date.</p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2008/12/08/paper-icpr-2008-3d-shape-context-and-distance-transform-for-action-recognition/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Thesis Raffay Hamid PhD (2008): &#8220;A Computational Framework For Unsupervised Analysis of Everyday Human Activities&#8221;</title>
		<link>http://prof.irfanessa.com/2008/06/18/thesis-raffay-hamid-phd-2008-a-computational-framework-for-unsupervised-analysis-of-everyday-human-activities/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=thesis-raffay-hamid-phd-2008-a-computational-framework-for-unsupervised-analysis-of-everyday-human-activities</link>
		<comments>http://prof.irfanessa.com/2008/06/18/thesis-raffay-hamid-phd-2008-a-computational-framework-for-unsupervised-analysis-of-everyday-human-activities/#comments</comments>
		<pubDate>Wed, 18 Jun 2008 18:12:20 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Aaron Bobick]]></category>
		<category><![CDATA[Activity Recognition]]></category>
		<category><![CDATA[Numerical Machine Learning]]></category>
		<category><![CDATA[PhD]]></category>
		<category><![CDATA[Raffay Hamid]]></category>
		<category><![CDATA[2008]]></category>
		<category><![CDATA[Thesis]]></category>

		<guid isPermaLink="false">http://academics.irfanessa.com/?p=502</guid>
		<description><![CDATA[M. Raffay Hamid PhD (2008), &#8220;A Computational Framework For Unsupervised Analysis of Everyday Human Activities&#8220;, PhD Thesis, Georgia Institute of Techniology, College of Computing, Atlanta, GA. (Advisor: Aaron Bobick &#38; Irfan Essa) Abstract In order to make computers proactive and assistive, we must enable them to perceive, learn, and predict what is happening in their surroundings. [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://etd.gatech.edu/theses/available/etd-06232008-101404/"></a></p>
<p>M. Raffay Hamid PhD (2008), &#8220;<a href="http://etd.gatech.edu/theses/available/etd-06232008-101404/">A Computational Framework For Unsupervised Analysis of Everyday Human Activities</a>&#8220;, PhD Thesis, Georgia Institute of Techniology, College of Computing, Atlanta, GA. (Advisor: Aaron Bobick &amp; Irfan Essa)</p>
<p style="text-align: center;"><strong>Abstract</strong></p>
<p style="text-align: justify;"><strong></strong>In order to make computers proactive and assistive, we must enable them to perceive, learn, and predict what is happening in their surroundings. This presents us with the challenge of formalizing computational models of everyday human activities. For a majority of environments, the structure of the in situ activities is generally not known a priori. This thesis therefore investigates knowledge representations and manipulation techniques that can facilitate learning of such everyday human activities in a minimally supervised manner. </p>
<p style="text-align: justify;">A key step towards this end is finding appropriate representations for human activities. We posit that if we chose to describe activities as finite sequences of an appropriate set of events, then the global structure of these activities can be uniquely encoded using their local event sub-sequences. With this perspective at hand, we particularly investigate representations that characterize activities in terms of their fixed and variable length event subsequences. We comparatively analyze these representations in terms of their representational scope, feature cardinality and noise sensitivity.</p>
<p style="text-align: justify;">Exploiting such representations, we propose a computational framework to discover the various activity-classes taking place in an environment. We model these activity-classes as maximally similar activity-cliques in a completely connected graph of activities, and describe how to discover them efficiently. Moreover, we propose methods for finding concise characterizations of these discovered activity-classes, both from a holistic as well as a by-parts perspective. Using such characterizations, we present an incremental method to classify</p>
<p style="text-align: justify;">a new activity instance to one of the discovered activity-classes, and to automatically detect if it is anomalous with respect to the general characteristics of its membership class. Our results show the efficacy of our framework in a variety of everyday environments</p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2008/06/18/thesis-raffay-hamid-phd-2008-a-computational-framework-for-unsupervised-analysis-of-everyday-human-activities/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Thesis David Minnen PhD (2008): &#8220;Unsupervised Discovery of Activity Primitives from Multivariate Sensor Data&#8221;</title>
		<link>http://prof.irfanessa.com/2008/06/18/david-minnen-phd-2008/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=david-minnen-phd-2008</link>
		<comments>http://prof.irfanessa.com/2008/06/18/david-minnen-phd-2008/#comments</comments>
		<pubDate>Wed, 18 Jun 2008 18:05:43 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Activity Recognition]]></category>
		<category><![CDATA[David Minnen]]></category>
		<category><![CDATA[PhD]]></category>
		<category><![CDATA[Thad Starner]]></category>
		<category><![CDATA[2008]]></category>
		<category><![CDATA[Data Mining]]></category>
		<category><![CDATA[Thesis]]></category>

		<guid isPermaLink="false">http://academics.irfanessa.com/?p=498</guid>
		<description><![CDATA[Unsupervised Discovery of Activity Primitives from Multivariate Sensor Data David Minnen PhD (2008): &#8220;Unsupervised Discovery of Activity Primitives from Multivariate Sensor Data&#8220; Georgia Institute of Techniology, College of Computing, Atlanta, GA. (Advisors: Thad Starner &#38; Irfan Essa) Abstract &#160; This research addresses the problem of temporal pattern discovery in real-valued, multivariate sensor data. Several algorithms were [...]]]></description>
			<content:encoded><![CDATA[<h3><a href="http://etd.gatech.edu/theses/available/etd-07072008-090103/" target="_blank">Unsupervised Discovery of Activity Primitives from Multivariate Sensor Data</a></h3>
<ul>
<li>David Minnen PhD (2008): &#8220;<a href="http://etd.gatech.edu/theses/available/etd-07072008-090103/" target="_blank">Unsupervised Discovery of Activity Primitives from Multivariate Sensor Data</a>&#8220; Georgia Institute of Techniology, College of Computing, Atlanta, GA. (Advisors: Thad Starner &amp; Irfan Essa)</li>
</ul>
<h4 style="text-align: left;"><strong>Abstract</strong></h4>
<p>&nbsp;</p>
<p><img class="size-medium wp-image-977 alignright" style="border-width: 1px; border-color: black; border-style: solid; margin: 2px;" title="minnen_david_c_200808_phd" src="http://prof.irfanessa.com/wp-content/uploads/2008/06/minnen_david_c_200808_phd-231x300.png" alt="" width="231" height="300" /></p>
<p style="text-align: justify;">This research addresses the problem of temporal pattern discovery in real-valued, multivariate sensor data. Several algorithms were developed, and subsequent evaluation demonstrates that they can efficiently and accurately discover unknown recurring patterns in time series data taken from many different domains. Different data representations and motif models were investigated in order to design an algorithm with an improved balance between run-time and detection accuracy. The different data representations are used to quickly filter large data sets in order to detect potential patterns that form the basis of a more detailed analysis. The representations include global discretization, which can be efficiently analyzed using a suffix tree, local discretization with a corresponding random projection algorithm for locating similar pairs of subsequences, and a density-based detection method that operates on the original, real-valued data. In addition, a new variation of the multivariate motif discovery problem is proposed in which each pattern may span only a subset of the input features. An algorithm that can efficiently discover such &#8220;subdimensional&#8221; patterns was developed and evaluated. The discovery algorithms are evaluated by measuring the detection accuracy of discovered patterns relative to a set of expected patterns for each data set. The data sets used for evaluation are drawn from a variety of domains including speech, on-body inertial sensors, music, American Sign Language video, and GPS tracks.</p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2008/06/18/david-minnen-phd-2008/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper: IEEE Data Mining Conference 2007 &#8220;Detecting Subdimensional Motifs: An Efficient Algorithm for Generalized Multivariate Pattern Discovery&#8221;</title>
		<link>http://prof.irfanessa.com/2007/10/28/minnen-icdm2007/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=minnen-icdm2007</link>
		<comments>http://prof.irfanessa.com/2007/10/28/minnen-icdm2007/#comments</comments>
		<pubDate>Sun, 28 Oct 2007 14:33:29 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Activity Recognition]]></category>
		<category><![CDATA[Charles Isbell]]></category>
		<category><![CDATA[David Minnen]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Thad Starner]]></category>
		<category><![CDATA[Data Mining]]></category>

		<guid isPermaLink="false">http://academics.irfanessa.com/2007/10/28/paper-ieee-data-mining-conference-2007-detecting-subdimensional-motifs-an-efficient-algorithm-for-generalized-multivariate-pattern-discovery/</guid>
		<description><![CDATA[Detecting Subdimensional Motifs: An Efficient Algorithm for Generalized Multivariate Pattern Discovery Minnen, Essa, Isbell, and Starner (2007), &#8220;Detecting Subdimensional Motifs: An Efficient Algorithm for Generalized Multivariate Pattern Discovery,&#8221; in Proceedings of IEEE International Conference on Data Mining (ICDM), 2007. [PDF] [DOI] [BIBTEX] @inproceedings{2007-Minnen-DSMEAGMPD, Author = {D. Minnen and I. Essa and C. Isbell and T. Starner}, Booktitle = [...]]]></description>
			<content:encoded><![CDATA[<h3>Detecting Subdimensional Motifs: An Efficient Algorithm for Generalized Multivariate Pattern Discovery</h3>
<ul>
<li>Minnen, Essa, Isbell, and Starner (2007), &#8220;Detecting Subdimensional Motifs: An Efficient Algorithm for Generalized Multivariate Pattern Discovery,&#8221; in Proceedings of IEEE International Conference on Data Mining (ICDM), 2007. <a title="PDF" href="http://www.cc.gatech.edu/~irfan/p/2007-Minnen-DSMEAGMPD.pdf">[PDF]</a> <a title="View document on publisher site" href="http://dx.doi.org/10.1109/ICDM.2007.52">[DOI]</a><br />
<a href="javascript:void(0)" id="papercite_23" class="papercite_toggle">[BIBTEX]</a></p>
<pre class="papercite_bibtex" id="papercite_23_block"><code>@inproceedings{2007-Minnen-DSMEAGMPD,
  Author = {D. Minnen and I. Essa and C. Isbell and T. Starner},
  Booktitle = {Proceedings of IEEE International Conference on Data Mining (ICDM)},
  Doi = {10.1109/ICDM.2007.52},
  Month = {October},
  Pdf = {http://www.cc.gatech.edu/~irfan/p/2007-Minnen-DSMEAGMPD.pdf},
  Title = {Detecting Subdimensional Motifs: An Efficient Algorithm for Generalized Multivariate Pattern Discovery},
  Year = {2007}}</code></pre>
</li>
</ul>
<p><strong>Abstract</strong></p>
<p style="text-align: justify;"><a title="ICDMPaper" href="http://academics.irfanessa.com/wp-content/uploads/2007/11/minnen-icdm2007s.jpg"><img style="border-width: 1px; border-color: black; border-style: solid; margin: 2px;" src="http://academics.irfanessa.com/wp-content/uploads/2007/11/minnen-icdm2007s.jpg" alt="ICDMPaper" width="153" height="197" align="right" /></a> Discovering recurring patterns in time series data is a fundamental problem for temporal data mining. This paper addresses the problem of locating subdimensional motifs in real-valued, multivariate time series, which requires the simultaneous discovery of sets of recurring patterns along with the corresponding relevant dimensions. While many approaches to motif discovery have been developed, most are restricted to categorical data, univariate time series, or multivariate data in which the temporal patterns span all of the dimensions. In this paper, we present an expected linear-time algorithm that addresses a generalization of multivariate pattern discovery in which each motif may span only a subset of the dimensions. To validate our algorithm, we discuss its theoretical properties and empirically evaluate it using several data sets including synthetic data and motion capture data collected by an on-body inertial sensor.</p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2007/10/28/minnen-icdm2007/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper: ICCV 2007, &#8220;Structure from Statistics &#8211; Unsupervised Activity Analysis using Suffix Trees&#8221;</title>
		<link>http://prof.irfanessa.com/2007/10/15/hamid-iccv2007/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=hamid-iccv2007</link>
		<comments>http://prof.irfanessa.com/2007/10/15/hamid-iccv2007/#comments</comments>
		<pubDate>Mon, 15 Oct 2007 19:56:31 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Aaron Bobick]]></category>
		<category><![CDATA[Activity Recognition]]></category>
		<category><![CDATA[Aware Home]]></category>
		<category><![CDATA[PAMI/ICCV/CVPR/ECCV]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[Raffay Hamid]]></category>
		<category><![CDATA[2007]]></category>
		<category><![CDATA[Computer Vision]]></category>

		<guid isPermaLink="false">http://essa.org/irfan/wp/?p=31</guid>
		<description><![CDATA[R. Hamid, S. Maddi, A. Bobick, I. Essa (2007). Structure from Statistics &#8211; Unsupervised Activity Analysis using Suffix Trees, At the International Conference on Computer Vision 2007. October 2007, Rio de Janeiro, BRAZIL [PDF][DOI] Abstract Models of activity structure for unconstrained environments are generally not available a priori. Recent representational approaches to this end are limited by [...]]]></description>
			<content:encoded><![CDATA[<ul>
<li> R. Hamid, S. Maddi, A. Bobick, I. Essa (2007). Structure from Statistics &#8211; Unsupervised Activity Analysis using Suffix Trees, At the <a href="http://ieeexplore.ieee.org/xpl/mostRecentIssue.jsp?punumber=4408818">International Conference on Computer Vision 2007</a>. October 2007, Rio de Janeiro, BRAZIL [<a href="http://www.raffayhamid.com/iccv_07.pdf">PDF</a>][<a href="http://dx.doi.org/10.1109/ICCV.2007.4408894">DOI</a>]</li>
</ul>
<p style="text-align: center;"><strong>Abstract</strong></p>
<p><a href="http://academics.irfanessa.com/wp-content/uploads/2008/05/iccv07-fig.jpg"><img class="alignleft size-medium wp-image-132" style="float: left; margin: 5px;" title="ICCV07-SuffixTreeFig" src="http://academics.irfanessa.com/wp-content/uploads/2008/05/iccv07-fig-300x168.jpg" alt="" width="300" height="168" /></a>Models of activity structure for unconstrained environments are generally not available a priori. Recent representational approaches to this end are limited by their computational complexity, and ability to capture activity structure only up to some fixed temporal scale. In this work, we propose Suffix Trees as an activity representation to efficiently extract structure of activities by analyzing their constituent event-subsequences over multiple temporal scales. We empirically compare Suffix Trees with some of the previous approaches in terms of feature cardinality, discriminative prowess, noise sensitivity and activity-class discovery. Finally, exploiting properties of Suffix Trees, we present a novel perspective on anomalous subsequences of activities, and propose an algorithm to detect them in linear-time. We present comparative results over experimental data, collected from a kitchen environment to demonstrate the competence of our proposed framework.</p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2007/10/15/hamid-iccv2007/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper: ACM IWVSSN (2006) &#8220;Unsupervised Analysis of Activity Sequences Using Event Motifs&#8221;</title>
		<link>http://prof.irfanessa.com/2006/10/23/paper-acm-iwvssn-2006-unsupervised-analysis-of-activity-sequences-using-event-motifs/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=paper-acm-iwvssn-2006-unsupervised-analysis-of-activity-sequences-using-event-motifs</link>
		<comments>http://prof.irfanessa.com/2006/10/23/paper-acm-iwvssn-2006-unsupervised-analysis-of-activity-sequences-using-event-motifs/#comments</comments>
		<pubDate>Mon, 23 Oct 2006 22:59:37 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[AAAI/IJCAI/UAI]]></category>
		<category><![CDATA[Aaron Bobick]]></category>
		<category><![CDATA[Activity Recognition]]></category>
		<category><![CDATA[Aware Home]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[Raffay Hamid]]></category>
		<category><![CDATA[Siddhartha Maddi]]></category>
		<category><![CDATA[2007]]></category>
		<category><![CDATA[Computer Vision]]></category>

		<guid isPermaLink="false">http://academics.irfanessa.com/2008/01/23/paper-acm-iwvssn-2006-unsupervised-analysis-of-activity-sequences-using-event-motifs/</guid>
		<description><![CDATA[R. Hamid, S. Maddi, A. Bobick, I. Essa. &#8220;Unsupervised Analysis of Activity Sequences Using Event Motifs&#8221;, In proceedings of 4th ACM International Workshop on Video Surveillance and Sensor Networks (in conjunction with ACM Multimedia 2006). Abstract We present an unsupervised framework to discover characterizations of everyday human activities, and demonstrate how such representations can be [...]]]></description>
			<content:encoded><![CDATA[<ul>
<li>R. Hamid, S. Maddi, A. Bobick, I. Essa.  		&#8220;Unsupervised Analysis of Activity Sequences Using Event Motifs&#8221;, In proceedings of  		4th ACM International Workshop on Video Surveillance and Sensor Networks  		(in conjunction with ACM Multimedia 2006).</li>
</ul>
<p style="text-align: center;"><strong>Abstract</strong></p>
<p style="text-align: justify;">We present an unsupervised framework to discover characterizations of everyday human activities, and demonstrate how such representations can be used to extract points of interest in event-streams. We begin with the usage of Suffix Trees as an efficient activity-representation to analyze the global structural information of activities, using their local event statistics over the entire continuum of their temporal resolution. Exploiting this representation, we discover characterizing event-subsequences and present their usage in an ensemble-based framework for activity classification. Finally, we propose a method to automatically detect subsequences of events that are locally atypical in a structural sense. Results over extensive data-sets, collected from multiple sensor-rich environments are presented, to show the competence and scalability of the proposed framework.</p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2006/10/23/paper-acm-iwvssn-2006-unsupervised-analysis-of-activity-sequences-using-event-motifs/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper: IEEE ISWC (2006) &#8220;Discovering Characteristic Actions from On-Body Sensor Data&#8221;</title>
		<link>http://prof.irfanessa.com/2006/10/14/paper-ieee-iswc-2006-discovering-characteristic-actions-from-on-body-sensor-data/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=paper-ieee-iswc-2006-discovering-characteristic-actions-from-on-body-sensor-data</link>
		<comments>http://prof.irfanessa.com/2006/10/14/paper-ieee-iswc-2006-discovering-characteristic-actions-from-on-body-sensor-data/#comments</comments>
		<pubDate>Sat, 14 Oct 2006 15:28:09 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Activity Recognition]]></category>
		<category><![CDATA[Charles Isbell]]></category>
		<category><![CDATA[David Minnen]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Thad Starner]]></category>
		<category><![CDATA[Wearables]]></category>

		<guid isPermaLink="false">http://academics.irfanessa.com/2006/10/14/paper-ieee-iswc-2006-discovering-characteristic-actions-from-on-body-sensor-data/</guid>
		<description><![CDATA[Discovering Characteristic Actions from On-Body Sensor Data (IEEEXplore) Minnen, D. Starner, T. Essa, I. Isbell, C. College of Computing, Georgia Institute of Technology, Atlanta, GA 30332 USA. This paper appears in: Wearable Computers, 2006 10th IEEE International Symposium on Publication Date: Oct. 2006 On page(s): 11 &#8211; 18 Number of Pages: 11 &#8211; 18 Location: [...]]]></description>
			<content:encoded><![CDATA[<p>Discovering Characteristic Actions from On-Body Sensor Data <a href="http://ieeexplore.ieee.org/search/srchabstract.jsp?arnumber=4067720&amp;isnumber=4067708&amp;punumber=4067707&amp;k2dockey=4067720@ieeecnfs&amp;query=%28%28essa%29%3Cin%3Eau+%29&amp;pos=13">(IEEEXplore)</a></p>
<p>Minnen, D. Starner, T. Essa, I. Isbell, C.<br />
College of Computing, Georgia Institute of Technology, Atlanta, GA 30332 USA.<br />
This paper appears in: Wearable Computers, 2006 10th IEEE International Symposium on<br />
Publication Date: Oct. 2006<br />
On page(s): 11 &#8211; 18<br />
Number of Pages: 11 &#8211; 18<br />
Location: Montreux, Switzerland<br />
ISSN: 1550-4816<br />
ISBN: 1-4244-0598-x<br />
Digital Object Identifier: 10.1109/ISWC.2006.286337<br />
Posted online: 2007-01-22 09:58:15.0</p>
<p align="center"><strong>Abstract</strong></p>
<p>We present an approach to activity discovery, the unsupervised identification and modeling of human actions embedded in a larger sensor stream. Activity discovery can be seen as the inverse of the activity recognition problem. Rather than learn models from hand-labeled sequences, we attempt to discover motifs, sets of similar subsequences within the raw sensor stream, without the benefit of labels or manual segmentation. These motifs are statistically unlikely and thus typically correspond to important or characteristic actions within the activity. The problem of activity discovery differs from typicalmotif discovery, such as locating protein binding sites, because of the nature of time series data representing human activity. For example, in activity data, motifs will tend to be sparsely distributed, vary in length, and may only exhibit intra-motif similarity after appropriate time warping. In this paper, we motivate the activity discovery problem and present our approach for efficient discovery of meaningful actions from sensor data representing human activity. We empirically evaluate the approach on an exercise data set captured by a wrist-mounted, three-axis inertial sensor. Our algorithm successfully discovers motifs that correspond to the real exercises with a recall rate of 96.3% and overall accuracy of 86.7% over six exercises and 864 occurrences.</p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2006/10/14/paper-ieee-iswc-2006-discovering-characteristic-actions-from-on-body-sensor-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper: IEEE CVPR (2006) &#8220;Learning Temporal Sequence Model from Partially Labeled Data&#8221;</title>
		<link>http://prof.irfanessa.com/2006/06/14/ieeexplore-learning-temporal-sequence-model-from-partially-labeled-data/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=ieeexplore-learning-temporal-sequence-model-from-partially-labeled-data</link>
		<comments>http://prof.irfanessa.com/2006/06/14/ieeexplore-learning-temporal-sequence-model-from-partially-labeled-data/#comments</comments>
		<pubDate>Wed, 14 Jun 2006 17:06:45 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Aaron Bobick]]></category>
		<category><![CDATA[Activity Recognition]]></category>
		<category><![CDATA[Aware Home]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Yifan Shi]]></category>
		<category><![CDATA[2006]]></category>
		<category><![CDATA[Computer Vision]]></category>

		<guid isPermaLink="false">http://academics.irfanessa.com/2006/06/14/ieeexplore-learning-temporal-sequence-model-from-partially-labeled-data/</guid>
		<description><![CDATA[Yifan Shi, Bobick, A. Essa, I. (2006), &#8220;Learning Temporal Sequence Model from Partially Labeled Data&#8221; Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2006 Volume: 2, page(s): 1631 &#8211; 1638, ISSN: 1063-6919, ISBN: 0-7695-2597-0, Digital Object Identifier: 10.1109/CVPR.2006.174 [IEEEXplore] Abstract Graphical models are often used to represent and recognize activities. Purely [...]]]></description>
			<content:encoded><![CDATA[<p><strong></strong><a href="http://ieeexplore.ieee.org/search/srchabstract.jsp?arnumber=1640951&amp;isnumber=34374&amp;punumber=10924&amp;k2dockey=1640951@ieeecnfs&amp;query=%28%28essa%29%3Cin%3Eau+%29&amp;pos=15"></a>Yifan Shi, Bobick, A.   Essa, I. (2006), &#8220;<strong>Learning Temporal Sequence Model from Partially Labeled Data&#8221;</strong> Proceedings of <em>IEEE Computer Society Conference on Computer Vision and Pattern Recognition</em>, 2006<br />
Volume: 2, page(s): 1631 &#8211; 1638, ISSN: 1063-6919, ISBN: 0-7695-2597-0, Digital Object Identifier: 10.1109/CVPR.2006.174 <a href="http://ieeexplore.ieee.org/search/srchabstract.jsp?arnumber=1640951&amp;isnumber=34374&amp;punumber=10924&amp;k2dockey=1640951@ieeecnfs&amp;query=%28%28essa%29%3Cin%3Eau+%29&amp;pos=15">[IEEEXplore]</a></p>
<p align="center"><strong>Abstract</strong></p>
<p style="text-align: justify;">Graphical models are often used to represent and recognize activities. Purely unsupervised methods (such as HMMs) can be trained automatically but yield models whose internal structure &#8211; the nodes &#8211; are difficult to interpret semantically. Manually constructed networks typically have nodes corresponding to sub-events, but the programming and training of these networks is tedious and requires extensive domain expertise. In this paper, we propose a semi-supervised approach in which a manually structured, Propagation Network (a form of a DBN) is initialized from a small amount of fully annotated data, and then refined by an EM-based learning method in an unsupervised fashion. During node refinement (the M step) a boosting-based algorithm is employed to train the evidence detectors of individual nodes. Experiments on a variety of data types &#8211; vision and inertial measurements &#8211; in several tasks demonstrate the ability to learn from as little as one fully annotated example accompanied by a small number of positive but non-annotated training examples. The system is applied to both recognition and anomaly detection tasks.</p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2006/06/14/ieeexplore-learning-temporal-sequence-model-from-partially-labeled-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper: IEEE CVPR (2005) &#8220;Tracking multiple objects through occlusions&#8221;</title>
		<link>http://prof.irfanessa.com/2005/06/20/paper-ieee-cvpr-2005-tracking-multiple-objects-through-occlusions/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=paper-ieee-cvpr-2005-tracking-multiple-objects-through-occlusions</link>
		<comments>http://prof.irfanessa.com/2005/06/20/paper-ieee-cvpr-2005-tracking-multiple-objects-through-occlusions/#comments</comments>
		<pubDate>Mon, 20 Jun 2005 17:13:50 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Activity Recognition]]></category>
		<category><![CDATA[Aware Home]]></category>
		<category><![CDATA[PAMI/ICCV/CVPR/ECCV]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[Yan Huang]]></category>
		<category><![CDATA[2005]]></category>
		<category><![CDATA[Computer Vision]]></category>

		<guid isPermaLink="false">http://academics.irfanessa.com/2005/06/20/paper-ieee-cvpr-2005-tracking-multiple-objects-through-occlusions/</guid>
		<description><![CDATA[Huang, Y and Essa, I. (2005) &#8220;Tracking multiple objects through occlusions&#8221;,  In Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2005 (CVPR 2005), Volume: 2 page(s): 1051 &#8211; 1058 vol. 2, ISSN: 1063-6919, ISBN: 0-7695-2372-2, INSPEC Accession Number:8633324 DOI: 10.1109/CVPR.2005.350, [IEEEXplore#] 20-25 June 2005 ABSTRACT We present an approach for tracking [...]]]></description>
			<content:encoded><![CDATA[<p>Huang, Y and Essa, I. (2005) &#8220;Tracking multiple objects through occlusions&#8221;,  In Proceedings of IEEE Computer Society Conference on <em>Computer Vision and Pattern Recognition</em>, 2005 (CVPR 2005), Volume: 2 page(s): 1051 &#8211; 1058 vol. 2, ISSN: 1063-6919, ISBN: 0-7695-2372-2, INSPEC Accession Number:8633324 DOI: 10.1109/CVPR.2005.350<a href="http://ieeexplore.ieee.org/search/srchabstract.jsp?arnumber=1467559&amp;isnumber=31473&amp;punumber=9901&amp;k2dockey=1467559@ieeecnfs&amp;query=%28%28essa%29%3Cin%3Eau+%29&amp;pos=18">, [IEEEXplore#]</a> 20-25 June 2005</p>
<p align="center"><strong>ABSTRACT</strong></p>
<p style="text-align: justify;">We present an approach for tracking varying number of objects through both temporally and spatially significant occlusions. Our method builds on the idea of object permanence to reason about occlusions. To this end, tracking is performed at both the region level and the object level. At the region level, a customized genetic algorithm is used to search for optimal region tracks. This limits the scope of object trajectories. At the object level, each object is located based on adaptive appearance models, spatial distributions and inter-occlusion relationships. The proposed architecture is capable of tracking objects even in the presence of long periods of full occlusions. We demonstrate the viability of this approach by experimenting on several videos of a user interacting with a variety of objects on a desktop.</p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2005/06/20/paper-ieee-cvpr-2005-tracking-multiple-objects-through-occlusions/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Talk at USC&#8217;s IRIS (2004): &#8220;Temporal Reasoning from Video to Temporal Synthesis of Video&#8221;</title>
		<link>http://prof.irfanessa.com/2004/10/30/talk-at-uscs-iris-2004/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=talk-at-uscs-iris-2004</link>
		<comments>http://prof.irfanessa.com/2004/10/30/talk-at-uscs-iris-2004/#comments</comments>
		<pubDate>Sun, 31 Oct 2004 01:09:39 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Activity Recognition]]></category>
		<category><![CDATA[Aware Home]]></category>
		<category><![CDATA[Computational Photography and Video]]></category>
		<category><![CDATA[Presentations]]></category>
		<category><![CDATA[2004]]></category>
		<category><![CDATA[Computer Vision]]></category>

		<guid isPermaLink="false">http://irfan.essa.org/wp/2004/10/30/talk-at-uscs-iris-2004/</guid>
		<description><![CDATA[Irfan Essa (2004), &#8220;Temporal Reasoning from Video to Temporal Synthesis of Video&#8221; Talk at USC&#8217;s IRIS-Vision Seminars (Fall 2004). Temporal Reasoning from Video to Temporal Synthesis of Video Abstract In this talk, I will present some ongoing work on extracting spatio-temporal cues from video for both synthesis of novel video sequences, and recognition of complex [...]]]></description>
			<content:encoded><![CDATA[<ul>
<li>Irfan Essa (2004), &#8220;Temporal Reasoning from Video to Temporal Synthesis of Video&#8221;<a href="http://iris.usc.edu/Information/seminars/essa.html"> Talk at USC&#8217;s IRIS-Vision Seminars (Fall 2004).</a></li>
</ul>
<p align="center"><strong>Temporal Reasoning from Video to Temporal Synthesis of Video</strong></p>
<p align="center">Abstract</p>
<p style="text-align: justify;">In this talk, I will present some ongoing work on extracting spatio-temporal cues from video for both synthesis of novel video sequences, and recognition of complex activities. I will start off with some of our earlier work on Video Textures, where repeating information is extracted to generate extended sequences of videos. I will then describe some of our extensions to this approach that allow for controlled generation of animations of video sprites. We have developed various learning and optimization techniques that allow for video-based animations of photo-realistic characters. Then I will describe our new approach for image and video synthesis that builds on optimal patch-based copying of samples. I will show how our method allows for iterative refinement and extends to synthesis of both images and video from very limited samples. In the next part of my talk, I will describe how a similar analysis of video can be used to recognize what a person is doing in a scene. Such an analysis of video, aimed at recognition, requires more contextual information about the environment. I will show how we leverage contextual information shared between actions and objects to recognize what is happening in complex environments. I will also show that by adding some form of grammar (we use Stochastic Context Free Grammar) we can recognize very complex, multi-tasked activities.</p>
<p style="text-align: justify;">If time permits, I will describe (very briefly) the Aware Home project at Georgia Tech, which is one primary area of ongoing and future research for me and my group. Further information on my work with videos is available from my webpage at <a href="http://www.cc.gatech.edu/%7Eirfan">http://www.cc.gatech.edu/~irfan</a></p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2004/10/30/talk-at-uscs-iris-2004/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper: IEEE CVPR (2004) &#8220;Propagation networks for recognition of partially ordered sequential action&#8221;</title>
		<link>http://prof.irfanessa.com/2004/06/02/ieeexplore-propagation-networks-for-recognition-of-partially-ordered-sequential-action/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=ieeexplore-propagation-networks-for-recognition-of-partially-ordered-sequential-action</link>
		<comments>http://prof.irfanessa.com/2004/06/02/ieeexplore-propagation-networks-for-recognition-of-partially-ordered-sequential-action/#comments</comments>
		<pubDate>Wed, 02 Jun 2004 22:44:31 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Aaron Bobick]]></category>
		<category><![CDATA[Activity Recognition]]></category>
		<category><![CDATA[Aware Home]]></category>
		<category><![CDATA[David Minnen]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[Yan Huang]]></category>
		<category><![CDATA[Yifan Shi]]></category>
		<category><![CDATA[2004]]></category>
		<category><![CDATA[Computer Vision]]></category>
		<category><![CDATA[DVFX]]></category>

		<guid isPermaLink="false">http://academics.irfanessa.com/2004/06/02/ieeexplore-propagation-networks-for-recognition-of-partially-ordered-sequential-action/</guid>
		<description><![CDATA[Yifan Shi, Yan Huang, Minnen, D., Bobick, A., Essa, I. (2004), &#8220;Propagation networks for recognition of partially ordered sequential action&#8221; In Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004 (CVPR 2004). Volume: 2, page(s): II-862 &#8211; II-869 Vol.2, ISSN: 1063-6919, ISBN: 0-7695-2158-4, INSPEC Accession Number:8161557, Digital Object Identifier: [...]]]></description>
			<content:encoded><![CDATA[<p>Yifan Shi, Yan Huang,   Minnen, D.,   Bobick, A.,   Essa, I. (2004), &#8220;Propagation networks for recognition of partially ordered sequential action&#8221; In Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004 (CVPR 2004). Volume: 2, page(s): II-862 &#8211; II-869 Vol.2, ISSN: 1063-6919, ISBN: 0-7695-2158-4, INSPEC Accession Number:8161557, Digital Object Identifier: 10.1109/CVPR.2004.1315255, 27 June-2 July 2004<a href="http://ieeexplore.ieee.org/search/srchabstract.jsp?arnumber=1315255&amp;isnumber=29134&amp;punumber=9183&amp;k2dockey=1315255@ieeecnfs&amp;query=%28%28essa%29%3Cin%3Eau+%29&amp;pos=21"> (IEEEXplore)</a></p>
<p align="center"><strong>Abstract</strong></p>
<p style="text-align: justify;">We present propagation networks (P-nets), a novel approach for representing and recognizing sequential activities that include parallel streams of action. We represent each activity using partially ordered intervals. Each interval is restricted by both temporal and logical constraints, including information about its duration and its temporal relationship with other intervals. P-nets associate one node with each temporal interval. Each node is triggered according to a probability density function that depends on the state of its parent nodes. Each node also has an associated observation function that characterizes supporting perceptual evidence. To facilitate real-time analysis, we introduce a particle filter framework to explore the conditional state space. We modify the original condensation algorithm to more efficiently sample a discrete state space (D-condensation). Experiments in the domain of blood glucose monitor calibration demonstrate both the representational power of P-nets and the effectiveness of the D-condensation algorithm.</p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2004/06/02/ieeexplore-propagation-networks-for-recognition-of-partially-ordered-sequential-action/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Served from: prof.irfanessa.com @ 2012-02-05 15:43:24 -->
