<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Irfan Essa&#039;s Academic Activities &#187; Collaborators</title>
	<atom:link href="http://prof.irfanessa.com/category/collaborators/feed/" rel="self" type="application/rss+xml" />
	<link>http://prof.irfanessa.com</link>
	<description>Academic/Professional Activities</description>
	<lastBuildDate>Thu, 01 Apr 2010 15:31:12 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=abc</generator>
		<item>
		<title>CVPR 2010: Accepted Papers</title>
		<link>http://prof.irfanessa.com/2010/04/01/cvpr-2010-accepted-papers/</link>
		<comments>http://prof.irfanessa.com/2010/04/01/cvpr-2010-accepted-papers/#comments</comments>
		<pubDate>Thu, 01 Apr 2010 15:31:12 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Activity Recognition]]></category>
		<category><![CDATA[Computational Photography and Video]]></category>
		<category><![CDATA[Jessica Hodgins]]></category>
		<category><![CDATA[Kihwan Kim]]></category>
		<category><![CDATA[Matthias Grundmann]]></category>
		<category><![CDATA[PAMI/ICCV/CVPR/ECCV]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[Vivek Kwatra]]></category>
		<category><![CDATA[2010]]></category>
		<category><![CDATA[CVPR]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=645</guid>
		<description><![CDATA[We have the following 4 papers that have been accepted for publications in IEEE CVPR 2010. More details forthcoming, with links to more details. Matthias Grundmann, Vivek Kwatra, Mei Han, and Irfan Essa (2010) &#8220;Discontinuous Seam-Carving for Video Retargeting&#8221; (a GA Tech, Google Collaboration) Matthias Grundmann, Vivek Kwatra, Mei Han, and Irfan Essa (2010) &#8220;Efficient Hierarchical Graph-Based Video [...]]]></description>
			<content:encoded><![CDATA[<div>We have the following 4 papers that have been accepted for publications in IEEE <a href="http://www.cvpr2010.org" target="_blank">CVPR 2010</a>. More details forthcoming, with links to more details.</div>
<ul>
<li>Matthias Grundmann, Vivek Kwatra, Mei Han, and Irfan Essa (2010) &#8220;Discontinuous Seam-Carving for Video Retargeting&#8221; (a GA Tech, Google Collaboration)</li>
<li>Matthias Grundmann, Vivek Kwatra, Mei Han, and Irfan Essa (2010) &#8220;Efficient Hierarchical Graph-Based Video Segmentation&#8221; (a GA Tech, Google Collaboration)</li>
<li>Kihwan Kim, Matthias Grundmann, Ariel Shamir, Iain Matthews, Jessica Hodgins, and Irfan Essa (2010) &#8220;Motion Fields to Predict Play Evolution in Dynamic Sport Scenes&#8221; (a GA Tech, Disney Collaboration)</li>
<li>Raffay Hamid, Ramkrishan Kumar, Matthias Grundmann, Kihwan Kim, Irfan Essa, and Jessica Hodgins (2010) &#8220;Player Localization Using Multiple Static Cameras for Sports Visualization&#8221; (a GA Tech, Disney Collaboration)</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2010/04/01/cvpr-2010-accepted-papers/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper Advanced Robotics (2009): &#8220;Human Action Recognition Using Global Point Feature Histograms and Action Shapes&#8221;</title>
		<link>http://prof.irfanessa.com/2009/10/29/paper-advanced-robotics-2009/</link>
		<comments>http://prof.irfanessa.com/2009/10/29/paper-advanced-robotics-2009/#comments</comments>
		<pubDate>Thu, 29 Oct 2009 14:58:56 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Activity Recognition]]></category>
		<category><![CDATA[Franzi Meier]]></category>
		<category><![CDATA[Intelligent Environments]]></category>
		<category><![CDATA[Michael Beetz]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[2009]]></category>
		<category><![CDATA[Computer Vision]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=622</guid>
		<description><![CDATA[Radu Bogdan Rusu, Jan Bandouch, Franziska Meier, Irfan Essa and Michael Beetz (2009) &#8220;Human Action Recognition Using Global Point Feature Histograms and Action Shapes&#8221;, in Journal of Advanced Robotics, volume 23, pages 1873–1908, Koninklijke Brill NV, Leiden and The Robotics Society of Japan, 2009. [ DOI &#124; PDF] Abstract This paper investigates the recognition of [...]]]></description>
			<content:encoded><![CDATA[<p>Radu Bogdan Rusu,  Jan Bandouch, Franziska Meier,  Irfan Essa and Michael Beetz (2009) &#8220;Human Action Recognition Using Global Point Feature Histograms and Action Shapes&#8221;, in Journal of Advanced Robotics, volume 23, pages 1873–1908, Koninklijke Brill NV, Leiden and The Robotics Society of Japan, 2009. [ <a href="http://dx.doi.org/DOI:10.1163/016918609X12518783330243">DOI </a> | PDF]</p>
<p style="text-align: center;"><strong>Abstract</strong></p>
<p style="text-align: justify;">This paper investigates the recognition of human actions from three-dimensional (3-D) point clouds that encode the motions of people acting in sensor-distributed indoor environments. Data streams are time sequences of silhouettes extracted from cameras in the environment. From the 2-D silhouette contours we generate space–time streams by continuously aligning and stacking the contours along the time axis as third spatial dimension. The space–time stream of an observation sequence is segmented into parts corresponding to subactions using a pattern matching technique based on suffix trees and interval scheduling. Then, the segmented space–time shapes are processed by treating the shapes as 3-D point clouds and estimating global point feature histograms for them. The resultant models are clustered using statistical analysis and our experimental results indicate that the presented methods robustly derive different action classes. This holds despite large intra-class variance in the recorded datasets due to performances from different persons at different time intervals.</p>
<p style="text-align: justify;">© Koninklijke Brill NV, Leiden and The Robotics Society of Japan, 2009</p>
<p style="text-align: justify;">
<div id="attachment_625" class="wp-caption aligncenter" style="width: 512px"><img class="size-large wp-image-625  " title="2009-Rusu-etal-AR23-B" src="http://prof.irfanessa.com/wp-content/uploads/2009/10/2009-Rusu-etal-AR23-B-1024x193.png" alt="Overview of the approach." width="502" height="95" /><p class="wp-caption-text">Overview of the approach.</p></div>
<p><strong>Keywords: </strong>Action recognition, point cloud, global features, action segmentation</p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2009/10/29/paper-advanced-robotics-2009/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper ISMAR 2009 (IEEE International Symposium on Mixed and Augmented Reality): &#8220;Augmenting Aerial Earth Maps with Dynamic Information&#8221;</title>
		<link>http://prof.irfanessa.com/2009/10/20/paper-2009-in-ismar-ieee-international-symposium-on-mixed-and-augmented-reality-augmenting-aerial-earth-maps-with-dynamic-information/</link>
		<comments>http://prof.irfanessa.com/2009/10/20/paper-2009-in-ismar-ieee-international-symposium-on-mixed-and-augmented-reality-augmenting-aerial-earth-maps-with-dynamic-information/#comments</comments>
		<pubDate>Tue, 20 Oct 2009 23:07:42 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Computational Journalism]]></category>
		<category><![CDATA[Computational Photography and Video]]></category>
		<category><![CDATA[Kihwan Kim]]></category>
		<category><![CDATA[Modeling and Animation]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[2009]]></category>
		<category><![CDATA[Animation]]></category>
		<category><![CDATA[CnJ]]></category>
		<category><![CDATA[Computer Vision]]></category>
		<category><![CDATA[Crowdsourcing]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=556</guid>
		<description><![CDATA[Kihwan Kim, Sangmin Oh, Jeonggyu Lee and Irfan Essa (2009), &#8220;Augmenting Aerial Earth Maps with Dynamic Information,&#8221; In Proceedings of IEEE International Symposium on Mixed and Augmented Reality (ISMAR), Orlando, FL, USA, October 2009 [Project Site, Video (AVI/DiVX), Video (Youtube) Paper (pdf)]. Abstract We introduce methods for augmenting aerial visualizations of Earth (from tools such [...]]]></description>
			<content:encoded><![CDATA[<p style="text-align: justify;">Kihwan Kim, Sangmin Oh, Jeonggyu Lee and Irfan Essa (2009), &#8220;Augmenting Aerial Earth Maps with Dynamic Information,&#8221; In Proceedings of <em>IEEE International Symposium on Mixed and Augmented Reality (ISMAR), </em>Orlando, FL, USA, October 2009 [<a href="http://www.cc.gatech.edu/cpl/projects/augearth/" target="_blank">Project Site</a>, <a href="http://www.kihwan23.com/augearth/augearth_ismar09_kim.avi">Video (AVI/DiVX)</a>, <a href="&lt;http://www.youtube.com/v/TPhttp://www.youtube.com/watch?v=TPk88soc2qw">Video (Youtube)</a><a href="http://www.cc.gatech.edu/cpl/projects/augearth/augearth_ismar_reduce.pdf"> Paper (pdf)</a>].</p>
<p style="text-align: center;"><strong>Abstract</strong></p>
<p style="text-align: justify;">We introduce methods for augmenting aerial visualizations of Earth (from tools such as Google Earth or Microsoft Virtual Earth) with dynamic information obtained from videos. Our goal is to make Augmented Earth Maps that visualize the live broadcast of dynamic sceneries within a city. We propose different approaches to analyze videos of pedestrians and cars, under differing conditions and then augment Aerial Earth Maps (AEMs) with live and dynamic information. We also analyze natural phenomenon (clouds) and project information from these to the AEMs to add the visual reality.</p>
<p><object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="425" height="344" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="allowFullScreen" value="true" /><param name="allowScriptAccess" value="always" /><param name="src" value="http://www.youtube.com/v/TPk88soc2qw&amp;color1=0xb1b1b1&amp;color2=0xcfcfcf&amp;hl=en&amp;feature=player_embedded&amp;fs=1" /><param name="allowfullscreen" value="true" /><embed type="application/x-shockwave-flash" width="425" height="344" src="http://www.youtube.com/v/TPk88soc2qw&amp;color1=0xb1b1b1&amp;color2=0xcfcfcf&amp;hl=en&amp;feature=player_embedded&amp;fs=1" allowscriptaccess="always" allowfullscreen="true"></embed></object></p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2009/10/20/paper-2009-in-ismar-ieee-international-symposium-on-mixed-and-augmented-reality-augmenting-aerial-earth-maps-with-dynamic-information/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
<enclosure url="http://www.kihwan23.com/augearth/augearth_ismar09_kim.avi" length="81762078" type="video/x-msvideo" />
		</item>
		<item>
		<title>In the News (2009):  CNN.com &#8220;Augmenting Earth Maps&#8221;</title>
		<link>http://prof.irfanessa.com/2009/10/13/in-the-news-2009-video-breaking-news-videos-from-cnn-com/</link>
		<comments>http://prof.irfanessa.com/2009/10/13/in-the-news-2009-video-breaking-news-videos-from-cnn-com/#comments</comments>
		<pubDate>Tue, 13 Oct 2009 19:30:19 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[In The News]]></category>
		<category><![CDATA[Kihwan Kim]]></category>
		<category><![CDATA[2009]]></category>
		<category><![CDATA[Computational Photography and Video]]></category>
		<category><![CDATA[News]]></category>

		<guid isPermaLink="false">http://prof.irfanessa.com/?p=564</guid>
		<description><![CDATA[Video &#8211; Breaking News Videos from CNN.com. Check out the media coverage of our new paper to appear in ISMAR 2009, in October. Also see &#8220;Latest videos makes Google Earth cities bustle&#8221; New Scientist (Sep 30, 2009 Issue) &#8220;Video: Google Earth animated with real time human and vehicular traffic&#8221; Endgadget (Sep 30, 2009)]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.cnn.com/video/#/video/tech/2009/10/01/nr.augmented.earth.cnn">Video &#8211; Breaking News Videos from CNN.com</a>.</p>
<p>Check out the media coverage of our new paper to appear in ISMAR 2009, in October.</p>
<p>Also see</p>
<ul>
<li>&#8220;Latest videos makes Google Earth cities bustle&#8221; New Scientist (<a href="http://www.newscientist.com/article/mg20427285.600-latest-videos-makes-google-earth-cities-bustle.html" target="_blank">Sep 30, 2009 Issue</a>)</li>
<li>&#8220;Video: Google Earth animated with real time human and vehicular traffic&#8221; Endgadget (<a href="Video: Google Earth animated with real time human and vehicular traffic" target="_blank">Sep 30, 2009</a>)</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2009/10/13/in-the-news-2009-video-breaking-news-videos-from-cnn-com/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper (2009) In IEEE Transactions on Visualization and CG &#8220;Fluid Simulation with Articulated Bodies&#8221;</title>
		<link>http://prof.irfanessa.com/2009/06/10/swimmer/</link>
		<comments>http://prof.irfanessa.com/2009/06/10/swimmer/#comments</comments>
		<pubDate>Wed, 10 Jun 2009 12:20:18 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Greg Turk]]></category>
		<category><![CDATA[Modeling and Animation]]></category>
		<category><![CDATA[Nipun Kwatra]]></category>
		<category><![CDATA[2009]]></category>
		<category><![CDATA[Animation]]></category>
		<category><![CDATA[Motion Capture]]></category>

		<guid isPermaLink="false">http://academics.irfanessa.com/?p=529</guid>
		<description><![CDATA[Nipun Kwatra, Chris Wojtan, Mark Carlson, Irfan A. Essa, Peter J. Mucha, Greg Turk (2009), &#8220;Fluid Simulation with Articulated Bodies&#8220;, IEEE Transactions on Visualization and Computer Graphics, 10 Jun. 2009. IEEE computer Society Digital Library. IEEE Computer Society. [DOI &#124; PDF (see copyright) &#124; Video &#124; Website] Abstract We present an algorithm for creating realistic [...]]]></description>
			<content:encoded><![CDATA[<p>Nipun Kwatra, Chris Wojtan, Mark Carlson, Irfan A. Essa, Peter J. Mucha, Greg Turk (2009), &#8220;<a href="http://www2.computer.org/portal/web/csdl/doi/10.1109/TVCG.2009.66">Fluid Simulation with Articulated Bodies</a>&#8220;, <em>IEEE Transactions on Visualization and Computer Graphics</em>, 10 Jun. 2009. IEEE computer Society Digital Library. IEEE Computer Society. [<a href="&lt;http://doi.ieeecomputersociety.org/10.1109/TVCG.2009.66&gt;" target="_blank">DOI</a> | <a href="http://www.cc.gatech.edu/cpl/projects/swimmer/paper/MF.pdf" target="_blank">PDF</a> (see <a href="./copyright" target="_blank">copyright</a>) | <a href="http://www.cc.gatech.edu/cpl/projects/swimmer/video/MF.avi" target="_blank">Video</a> | <a href="http://www.cc.gatech.edu/cpl/projects/swimmer/" target="_blank">Website</a>]</p>
<p style="text-align: center;"><strong>Abstract</strong></p>
<p style="text-align: justify;">We present an algorithm for creating realistic animations of characters that are swimming through fluids. Our approach combines dynamic simulation with data-driven kinematic motions (motion capture data) to produce realistic animation in a fluid. The interaction of the articulated body with the fluid is performed by incorporating joint constraints with rigid animation and by extending a solid/fluid coupling method to handle articulated chains. Our solver takes as input the current state of the simulation and calculates the angular and linear accelerations of the connected bodies needed to match a particular motion sequence for the articulated body. These accelerations are used to estimate the forces and torques that are then applied to each joint. Based on this approach, we demonstrate simulated swimming results for a variety of different strokes, including crawl, backstroke, breaststroke and butterfly. The ability to have articulated bodies interact with fluids also allows us to generate simulations of simple water creatures that are driven by simple controllers.</p>
<p style="text-align: center;"><img class="size-large wp-image-530  aligncenter" title="teaser" src="http://academics.irfanessa.com/wp-content/uploads/2009/06/teaser-1024x338.jpg" alt="teaser" width="400" /></p>
<p style="text-align: justify;">
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2009/06/10/swimmer/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
<enclosure url="http://www.cc.gatech.edu/cpl/projects/swimmer/video/MF.avi" length="84597040" type="video/x-msvideo" />
		</item>
		<item>
		<title>Paper (2009) ACM CHI: &#8220;Videolyzer: Quality Analysis of Online Informational Video for Bloggers and Journalists&#8221;</title>
		<link>http://prof.irfanessa.com/2009/03/04/paper-chi-2009/</link>
		<comments>http://prof.irfanessa.com/2009/03/04/paper-chi-2009/#comments</comments>
		<pubDate>Wed, 04 Mar 2009 13:55:20 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[ACM UIST/CHI]]></category>
		<category><![CDATA[Computational Journalism]]></category>
		<category><![CDATA[Computational Photography and Video]]></category>
		<category><![CDATA[Nick Diakopoulos]]></category>
		<category><![CDATA[2009]]></category>
		<category><![CDATA[CHI]]></category>
		<category><![CDATA[CnJ]]></category>
		<category><![CDATA[Crowdsourcing]]></category>

		<guid isPermaLink="false">http://academics.irfanessa.com/?p=470</guid>
		<description><![CDATA[N. Diakopoulos, S. Goldenberg, I. Essa (2009). &#8220;Videolyzer: Quality Analysis of Online Informational Video for Bloggers and Journalists.&#8221; ACM Conference on Human Factors in Computing Systems (CHI). April, 2009. [PDF] [Project Site] [Video] (CHI 2009 &#8211; Digital Life New World &#8211; CHI 2009 Advance Program) Abstract Tools to aid people in making sense of the information quality [...]]]></description>
			<content:encoded><![CDATA[<p>N. Diakopoulos, S. Goldenberg, I. Essa (2009). &#8220;Videolyzer: Quality Analysis of Online Informational Video for Bloggers and Journalists.&#8221; <em>ACM Conference on Human Factors in Computing Systems (CHI)</em>. April, 2009.<em> </em>[<a href="http://www.deakondesign.com/Documents/paper0553-diakopoulos_dl.pdf">PDF</a>] [<a href="http://www.videolyzer.com/" target="_blank">Project Site</a>] [<a href="http://www.deakondesign.com/videos/videolyzer_chi_video.php">Video</a>] <a href="http://www.chi2009.org/Attending/AdvanceProgram/75.html"><span style="color: #000000; text-decoration: none;">(</span></a><a href="http://www.chi2009.org/Attending/AdvanceProgram/75.html">CHI 2009 &#8211; Digital Life New World &#8211; CHI 2009 Advance Program)</a></p>
<p style="text-align: center;"><strong>Abstract</strong></p>
<p><img class="size-full wp-image-480   alignleft" style="margin: 5px;" title="paper0553-diakopoulos_dl" src="http://academics.irfanessa.com/wp-content/uploads/2009/02/paper0553-diakopoulos_dl.jpg" alt="Screen Shot of Videolyzer" width="192" height="208" /></p>
<div style="text-align: justify;">Tools to aid people in making sense of the information quality of online informational video are essential for media consumers seeking to be well informed. Our application, Videolyzer, addresses the information quality problem in video by allowing politically motivated bloggers or journalists to analyze, collect, and share criticisms of the information quality of online political videos. Our interface innovates by providing a fine-grained and tightly coupled interaction paradigm between the timeline, the time-synced transcript, and annotations. We also incorporate automatic textual and video content analysis to suggest areas of interest for further assessment by a person. We present an evaluation of Videolyzer looking at the user experience, usefulness, and behavior around the novel features of the UI as well as report on the collaborative dynamic of the discourse generated with the tool.</div>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2009/03/04/paper-chi-2009/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper (2009) In ACM Symposium on Interactive 3D Graphics &#8220;Human Video Textures&#8221;</title>
		<link>http://prof.irfanessa.com/2009/03/01/paper-2009-acm-symposium-on-interactive-human-video-textures/</link>
		<comments>http://prof.irfanessa.com/2009/03/01/paper-2009-acm-symposium-on-interactive-human-video-textures/#comments</comments>
		<pubDate>Sun, 01 Mar 2009 19:43:45 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[ACM SIGGRAPH]]></category>
		<category><![CDATA[Computational Photography and Video]]></category>
		<category><![CDATA[James Rehg]]></category>
		<category><![CDATA[Matt Flagg]]></category>
		<category><![CDATA[Modeling and Animation]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[Sing Bing Kang]]></category>
		<category><![CDATA[2009]]></category>
		<category><![CDATA[Animation]]></category>
		<category><![CDATA[Computer Vision]]></category>
		<category><![CDATA[Motion Capture]]></category>
		<category><![CDATA[Video Textures]]></category>

		<guid isPermaLink="false">http://academics.irfanessa.com/?p=473</guid>
		<description><![CDATA[Matthew Flagg, Atsushi Nakazawa, Qiushuang Zhang, Sing Bing Kang, Young Kee Ryu, Irfan Essa, James M. Rehg (2009), Human Video Textures In Proceedings of the ACM Symposium on Interactive 3D Graphics and Games 2009 (I3D ’09), Boston, MA, February 27-March 1 (Fri-Sun), 2009 [PDF (see Copyright) &#124; Video in DiVx &#124; Website ] Abstract This paper describes a data-driven approach for [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.cc.gatech.edu/cpl/projects/humanvideotextures/"></a></p>
<p><a href="http://www.cc.gatech.edu/~mflagg">Matthew Flagg</a>, <a href="http://www.ime.cmc.osaka-u.ac.jp/~nakazawa/wiki/">Atsushi Nakazawa</a>, Qiushuang Zhang, <a href="http://research.microsoft.com/en-us/people/sbkang/">Sing Bing Kang</a>, Young Kee Ryu, <a href="http://www.irfanessa.com/">Irfan Essa</a>, <a href="http://www.cc.gatech.edu/~rehg">James M. Rehg</a> (2009), <a href="http://www.cc.gatech.edu/cpl/projects/humanvideotextures/">Human Video Textures</a> In Proceedings of the ACM Symposium on Interactive 3D Graphics and Games 2009 (<a href="http://graphics.cs.williams.edu/i3d09/" target="_blank">I3D ’09</a>), Boston, MA, February 27-March 1 (Fri-Sun), 2009 [<a href="http://www.cc.gatech.edu/cpl/projects/humanvideotextures/HVT.pdf" target="_blank">PDF</a> (see <a href="./copyright" target="_blank">Copyright</a>) | <a href="http://www.cc.gatech.edu/cpl/projects/humanvideotextures/hvt-i3d.avi">Video</a> in DiVx | Website ]</p>
<tbody></tbody>
<p style="text-align: center;"><strong>Abstract</strong></p>
<p style="text-align: justify;">This paper describes a data-driven approach for generating photorealistic animations of human motion. Each animation sequence follows a user-choreographed path and plays continuously by seamlessly transitioning between different segments of the captured data. To produce these animations, we capitalize on the complementary characteristics of motion capture data and video. We customize our capture system to record motion capture data that are synchronized with our video source. Candidate transition points in video clips are identified using a new similarity metric based on 3-D marker trajectories and their 2-D projections into video. Once the transitions have been identified, a video-based motion graph is constructed. We further exploit hybrid motion and video data to ensure that the transitions are seamless when generating animations. Motion capture marker projections serve as control points for segmentation of layers and nonrigid transformation of regions. This allows warping and blending to generate seamless in-between frames for animation. We show a series of choreographed animations of walks and martial arts scenes as validation of our approach.</p>
<div class="wp-caption aligncenter" style="width: 514px"><span style="text-decoration: underline;"><img class="   aligncenter" title="Human Video Textures" src="http://www.cc.gatech.edu/cpl/projects/humanvideotextures/graphics/teaser.png" alt="Example Image from Project" width="504" height="156" /> </span><p class="wp-caption-text">Human Video Textures (Output Rendered as a Collage!)</p></div>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2009/03/01/paper-2009-acm-symposium-on-interactive-human-video-textures/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
<enclosure url="http://www.cc.gatech.edu/cpl/projects/humanvideotextures/hvt-i3d.avi" length="75996996" type="video/x-msvideo" />
		</item>
		<item>
		<title>Paper (2009): ICASSP &#8220;Learning Basic Units in American Sign Language using Discriminative Segmental Feature Selection&#8221;</title>
		<link>http://prof.irfanessa.com/2009/02/04/paper-2009-icassp-learning-basic-units-in-american-sign-language-using-discriminative-segmental-feature-selection/</link>
		<comments>http://prof.irfanessa.com/2009/02/04/paper-2009-icassp-learning-basic-units-in-american-sign-language-using-discriminative-segmental-feature-selection/#comments</comments>
		<pubDate>Wed, 04 Feb 2009 13:21:47 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Face and Gesture]]></category>
		<category><![CDATA[Funding]]></category>
		<category><![CDATA[ICASSP]]></category>
		<category><![CDATA[James Rehg]]></category>
		<category><![CDATA[NSF (0205507)]]></category>
		<category><![CDATA[Numerical Machine Learning]]></category>
		<category><![CDATA[Pei Yin]]></category>
		<category><![CDATA[Thad Starner]]></category>
		<category><![CDATA[2009]]></category>
		<category><![CDATA[Gesture]]></category>
		<category><![CDATA[NSF]]></category>

		<guid isPermaLink="false">http://academics.irfanessa.com/?p=464</guid>
		<description><![CDATA[Pei Yin, Thad Starner, Harley Hamilton, Irfan Essa, James M. Rehg (2009), &#8221;Learning Basic Units in American Sign Language using Discriminative Segmental Feature Selection&#8221; in IEEE Conference on Acoustics, Speech, and Signal Processing 2009 (ICASSP 2009). Session: Spoken Language Understanding I, Tuesday, April 21, 11:00 &#8211; 13:00, Taipei, Taiwan. ABSTRACT The natural language for most deaf signers in [...]]]></description>
			<content:encoded><![CDATA[<p>Pei Yin, Thad Starner, Harley Hamilton, Irfan Essa, James M. Rehg (2009), &#8221;Learning Basic Units in American Sign Language using Discriminative Segmental Feature Selection&#8221; in <em>IEEE Conference on Acoustics, Speech, and Signal Processing 2009 (</em><a href="http://www.icassp09.com/default.asp" target="_blank"><em>ICASSP 2009</em></a><em>)</em>. Session: Spoken Language Understanding I, Tuesday, April 21, 11:00 &#8211; 13:00, Taipei, Taiwan.</p>
<p style="text-align: center;"><strong>ABSTRACT</strong></p>
<p style="text-align: justify;">The natural language for most deaf signers in the United States is American Sign Language (ASL). ASL has internal structure like spoken languages, and ASL linguists have introduced several phonemic models. The study of ASL phonemes is not only interesting to linguists, but also useful for scalability in recognition by machines. Since machine perception is different than human perception, this paper learns the basic units for ASL directly from data. Comparing with previous studies, our approach computes a set of data-driven units (fenemes) discriminatively from the results of segmental feature selection. The learning iterates the following two steps: first apply discriminative feature selection segmentally to the signs, and then tie the most similar temporal segments to re-train. Intuitively, the sign parts indistinguishable to machines are merged to form basic units, which we call ASL fenemes. Experiments on publicly available ASL recognition data show that the extracted data-driven fenemes are meaningful, and recognition using those fenemes achieves improved accuracy at reduced model complexity</p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2009/02/04/paper-2009-icassp-learning-basic-units-in-american-sign-language-using-discriminative-segmental-feature-selection/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper: ICPR (2008) &#8220;3D Shape Context and Distance Transform for Action Recognition&#8221;</title>
		<link>http://prof.irfanessa.com/2008/12/08/paper-icpr-2008-3d-shape-context-and-distance-transform-for-action-recognition/</link>
		<comments>http://prof.irfanessa.com/2008/12/08/paper-icpr-2008-3d-shape-context-and-distance-transform-for-action-recognition/#comments</comments>
		<pubDate>Mon, 08 Dec 2008 20:22:26 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Activity Recognition]]></category>
		<category><![CDATA[Aware Home]]></category>
		<category><![CDATA[Face and Gesture]]></category>
		<category><![CDATA[Franzi Meier]]></category>
		<category><![CDATA[Matthias Grundmann]]></category>
		<category><![CDATA[PAMI/ICCV/CVPR/ECCV]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[2008]]></category>
		<category><![CDATA[Computer Vision]]></category>

		<guid isPermaLink="false">http://academics.irfanessa.com/?p=146</guid>
		<description><![CDATA[M. Grundmann, F. Meier, and I. Essa (2008) &#8220;3D Shape Context and Distance Transform for Action Recognition&#8221;, In Proceedings of International Conference on Pattern Recognition (ICPR) 2008, Tampa, FL. [Project Page &#124; DOI &#124; PDF] ABSTRACT We propose the use of 3D (2D+time) Shape Context to recognize the spatial and temporal details inherent in human [...]]]></description>
			<content:encoded><![CDATA[<p style="text-align: left;">M. Grundmann, F. Meier, and I. Essa (2008) &#8220;3D Shape Context and Distance Transform for Action Recognition&#8221;, In <em>Proceedings of <a href="http://www.icpr2008.org/" target="_blank">International Conference on Pattern Recognition</a></em> (ICPR) 2008, Tampa, FL. [<a href="http://www.mgrundmann.com/icpr2008.html" target="_blank">Project Page</a> | <a href="http://dx.doi.org/10.1109/ICPR.2008.4761435" target="_blank">DOI</a> | <a href="http://www.mgrundmann.com/pdfs/icpr2008.pdf" target="_blank">PDF</a>]</p>
<p style="text-align: center;">ABSTRACT</p>
<p style="text-align: justify;"><a href="http://academics.irfanessa.com/wp-content/uploads/2008/08/3dfigure_feat_small.png"><img class="alignleft size-medium wp-image-163" title="3dfigure_feat_small" src="http://academics.irfanessa.com/wp-content/uploads/2008/08/3dfigure_feat_small-300x179.png" alt="" width="300" height="179" /></a>We propose the use of 3D (2D+time) Shape Context to recognize the spatial and temporal details inherent in human actions. We represent an action in a video sequence by a 3D point cloud extracted by sampling 2D silhouettes over time. A non-uniform sampling method is introduced that gives preference to fast moving body parts using a Euclidean 3D Distance Transform. Actions are then classified by matching the extracted point clouds. Our proposed approach is based on a global matching and does not require specific training to learn the model. We test the approach thoroughly on two publicly available datasets and compare to several state-of-the-art methods. The achieved classification accuracy is on par with or superior to the best results reported to date.</p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2008/12/08/paper-icpr-2008-3d-shape-context-and-distance-transform-for-action-recognition/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Disney Research, Pittsburgh</title>
		<link>http://prof.irfanessa.com/2008/10/23/disney-research-pittsburgh/</link>
		<comments>http://prof.irfanessa.com/2008/10/23/disney-research-pittsburgh/#comments</comments>
		<pubDate>Thu, 23 Oct 2008 17:03:10 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Jessica Hodgins]]></category>
		<category><![CDATA[2008]]></category>
		<category><![CDATA[2009]]></category>
		<category><![CDATA[Disney]]></category>

		<guid isPermaLink="false">http://academics.irfanessa.com/?p=370</guid>
		<description><![CDATA[This academic year, I am spending some time working with the newly formed Disney Research, Pittsburgh, (Directed by Jessica Hodgins) formed next to CMU.  The press release is announcing this lab is here (Carnegie Mellon SCS Press Release). I am also hanging out with folks at the CMU Robotics Institute and have started some new [...]]]></description>
			<content:encoded><![CDATA[<div style="float: right;"><a href="http://www.cs.cmu.edu/news/releases/disney.html"><img class="alignright" src="http://graphics.cs.cmu.edu/i/news/disney_research.png" alt="" width="152" height="100" /></a></div>
<p>This academic year, I am spending some time working with the newly formed Disney Research, Pittsburgh, (Directed by <a href="http://www.cs.cmu.edu/~jkh/" target="_blank">Jessica Hodgins</a>) formed next to <a href="http://www.cmu.edu" target="_blank">CMU</a>.  The press release is announcing this lab is here <span style="text-decoration: none; color: #000000;">(</span><a href="http://news.cs.cmu.edu/article.php?a=441" target="_blank">Carnegie Mellon SCS Press Release</a>). I am also hanging out with folks at the <a href="http://www.ri.cmu.edu" target="_blank">CMU Robotics Institute</a> and have started some new collaborations.  So now depending on when, you can find me either in Atlanta (at GA Tech) or in Pittsburgh (at Disney Lab or CMU) [OR on a airplane between Pittsburgh and Atlanta].</p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2008/10/23/disney-research-pittsburgh/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper: ACM Multimedia (2008) &#8220;Audio Puzzler: Piecing Together Time-Stamped Speech Transcripts with a Puzzle Game&#8221;</title>
		<link>http://prof.irfanessa.com/2008/10/18/paper-acm-multimedia-2008-audio-puzzler-piecing-together-time-stamped-speech-transcripts-with-a-puzzle-game/</link>
		<comments>http://prof.irfanessa.com/2008/10/18/paper-acm-multimedia-2008-audio-puzzler-piecing-together-time-stamped-speech-transcripts-with-a-puzzle-game/#comments</comments>
		<pubDate>Sat, 18 Oct 2008 20:30:24 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[ACM MM]]></category>
		<category><![CDATA[Computational Journalism]]></category>
		<category><![CDATA[Multimedia]]></category>
		<category><![CDATA[Nick Diakopoulos]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[2008]]></category>
		<category><![CDATA[CnJ]]></category>
		<category><![CDATA[Crowdsourcing]]></category>

		<guid isPermaLink="false">http://academics.irfanessa.com/?p=152</guid>
		<description><![CDATA[N. Diakopoulos, K. Luther, I. Essa (2008), &#8220;Audio Puzzler: Piecing Together Time-Stamped Speech Transcripts with a Puzzle Game.&#8221; In Proceedings of  ACM International Conference on Multimedia 2008. Vancouver, BC, CANANDA  [Project Link] ABSTRACT We have developed an audio-based casual puzzle game which produces a time-stamped transcription of spokenaudio as a by-product of play. Our evaluation [...]]]></description>
			<content:encoded><![CDATA[<p>N. Diakopoulos, K. Luther, I. Essa (2008), &#8220;Audio Puzzler: Piecing Together Time-Stamped Speech Transcripts with a Puzzle Game.&#8221; In Proceedings of  <a href="http://www.mcrlab.uottawa.ca/acmmm2008/" target="_blank">ACM International Conference on Multimedia 2008</a>. Vancouver, BC, CANANDA  [<a href="http://www.deakondesign.com/?p=51">Project Link</a>]</p>
<p style="text-align: center;"><strong>ABSTRACT</strong></p>
<p style="text-align: justify;">We have developed an audio-based casual puzzle game which produces a time-stamped transcription of spoken<img class="alignright size-medium wp-image-405" title="ap" src="http://academics.irfanessa.com/wp-content/uploads/2008/10/ap-300x198.jpg" alt="ap" width="300" height="198" />audio as a by-product of play. Our evaluation of the game indicates that it is both fun and challenging. The transcripts generated using the game are more accurate than those produced using a standard automatic transcription system and the time-stamps of words are within several hundred milliseconds of ground truth.</p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2008/10/18/paper-acm-multimedia-2008-audio-puzzler-piecing-together-time-stamped-speech-transcripts-with-a-puzzle-game/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Research: Videolyzer (Online DEMO, try it out!)</title>
		<link>http://prof.irfanessa.com/2008/10/15/project-videolyzer/</link>
		<comments>http://prof.irfanessa.com/2008/10/15/project-videolyzer/#comments</comments>
		<pubDate>Thu, 16 Oct 2008 01:22:15 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Collaborators]]></category>
		<category><![CDATA[Computational Journalism]]></category>
		<category><![CDATA[Nick Diakopoulos]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[2008]]></category>
		<category><![CDATA[CnJ]]></category>
		<category><![CDATA[Crowdsourcing]]></category>
		<category><![CDATA[Demo]]></category>

		<guid isPermaLink="false">http://academics.irfanessa.com/?p=375</guid>
		<description><![CDATA[An Online DEMO of Videolyzer, a project by my PhD Student, Nick Diakopolous. Videolyzer is a tool designed to help journalists and bloggers collect, organize, and present information about the quality (i.e. validity, reliability, etc.) of online videos. It makes it possible to evaluate and make sense of things like comments, claims, and sources as [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.videolyzer.com/">An Online DEMO of Videolyzer</a>, a project by my PhD Student, Nick Diakopolous.</p>
<p>Videolyzer is a tool designed to help journalists and bloggers collect, organize, and present information about the quality (i.e. validity, reliability, etc.) of online videos. It makes it possible to evaluate and make sense of things like comments, claims, and sources as they relate to the video. Users can comment and annotate pieces of the video (called &#8220;anchors&#8221;) to provide a more fine-grained description of the information in the video. The interface also incorporates a tightly integrated transcript of what&#8217;s spoken in the video to make it easier to navigate the dense information there. Finally, Videolyzer allows for collaboration among many people. Users can build off of each other&#8217;s annotations and rate each other in a form of distributed vetting and peer-evaluation.</p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2008/10/15/project-videolyzer/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper: ISWC (2008) &#8220;Localization and 3D Reconstruction of Urban Scenes Using GPS&#8221;</title>
		<link>http://prof.irfanessa.com/2008/09/28/paper-iswc-2008-localization-and-3d-reconstruction-of-urban-scenes-using-gps/</link>
		<comments>http://prof.irfanessa.com/2008/09/28/paper-iswc-2008-localization-and-3d-reconstruction-of-urban-scenes-using-gps/#comments</comments>
		<pubDate>Sun, 28 Sep 2008 20:59:27 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[ISWC]]></category>
		<category><![CDATA[Kihwan Kim]]></category>
		<category><![CDATA[Mobile Computing]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[Thad Starner]]></category>
		<category><![CDATA[2008]]></category>
		<category><![CDATA[CnJ]]></category>
		<category><![CDATA[Crowdsourcing]]></category>
		<category><![CDATA[Peer Production]]></category>
		<category><![CDATA[Wearables]]></category>

		<guid isPermaLink="false">http://academics.irfanessa.com/?p=159</guid>
		<description><![CDATA[Kihwan Kim, Jay Summet, Thad Starner, Daniel Ashbrook, Mrunal Kapade and Irfan Essa  (2008) &#8220;Localization and 3D Reconstruction of Urban Scenes Using GPS&#8221; In Proceedings of IEEE Symposium on Wearable Computing (ISWC) 2008 (To Appear). [PDF] ABSTRACT Using off-the-shelf Global Positioning System (GPS) units, we reconstruct buildings in 3D by exploiting the reduction in signal [...]]]></description>
			<content:encoded><![CDATA[<p>Kihwan Kim,  Jay Summet, Thad Starner, Daniel Ashbrook, Mrunal Kapade and Irfan Essa  (2008) &#8220;Localization and 3D Reconstruction of Urban Scenes Using GPS&#8221; In Proceedings of <a href="http://www.iswc.net/" target="_blank">IEEE Symposium on Wearable Computing (ISWC)</a> 2008 (To Appear).               <a href="http://www.cc.gatech.edu/research/reports/GT-IC-08-06">[PDF]</a></p>
<p style="text-align: center;"><strong>ABSTRACT</strong></p>
<p><img class="alignleft size-full wp-image-402" style="margin: 5px;" title="research_gpsray" src="http://academics.irfanessa.com/wp-content/uploads/2008/09/research_gpsray.jpg" alt="research_gpsray" width="240" height="196" /></p>
<p style="text-align: justify;">Using off-the-shelf Global Positioning System (GPS) units, we reconstruct buildings in 3D by exploiting the reduction in signal to noise ratio (SNR) that occurs when the buildings obstruct the line-of-sight between the moving units and the orbiting satellites. We measure the size and height of skyscrapers as well as automatically constructing a density map representing the location of multiple buildings in an urban landscape.  If deployed on a large scale, via a cellular service provider&#8217;s GPS-enabled mobile phones or GPS-tracked delivery vehicles, the system could provide an inexpensive means of continuously creating and updating 3D maps of urban environments.</p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2008/09/28/paper-iswc-2008-localization-and-3d-reconstruction-of-urban-scenes-using-gps/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper: Pragmatic Web (2008) &#8220;An Annotation Model for Making Sense of Information Quality in Online Videos&#8221;</title>
		<link>http://prof.irfanessa.com/2008/09/28/paper-pragmatic-web-2008-an-annotation-model-for-making-sense-of-information-quality-in-online-videos/</link>
		<comments>http://prof.irfanessa.com/2008/09/28/paper-pragmatic-web-2008-an-annotation-model-for-making-sense-of-information-quality-in-online-videos/#comments</comments>
		<pubDate>Sun, 28 Sep 2008 20:37:17 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Computational Journalism]]></category>
		<category><![CDATA[Multimedia]]></category>
		<category><![CDATA[Nick Diakopoulos]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[2008]]></category>
		<category><![CDATA[CnJ]]></category>

		<guid isPermaLink="false">http://academics.irfanessa.com/?p=155</guid>
		<description><![CDATA[N. Diakopoulos, I. Essa. (2008) &#8220;An Annotation Model for Making Sense of Information Quality in Online Videos.&#8221; Proceedings of the International Conference on the Pragmatic Web. 28–30 Sept. 2008, Uppsala, Sweden (To Appear) ABSTRACT Making sense of the information quality of online media including things such as the accuracy and validity of claims and the [...]]]></description>
			<content:encoded><![CDATA[<p>N. Diakopoulos, I. Essa. (2008) &#8220;An Annotation Model for Making Sense of Information Quality in Online Videos.&#8221; <em>Proceedings of the <a href="http://www.pragmaticweb.info/index.php?option=com_content&amp;task=view&amp;id=48&amp;Itemid=1" target="_blank">International Conference on the Pragmatic Web</a>. </em>28–30 Sept. 2008, Uppsala, Sweden (To Appear)</p>
<p style="text-align: center;"><strong>ABSTRACT</strong></p>
<p style="text-align: justify;">Making sense of the information quality of online media including things such as the accuracy and validity of claims and the reliability of sources is essential for people to be well-informed. We are developing Videolyzer to address the challenge of information quality sense-making by allowing motivated individuals to analyze, collect, share, and respond to criticisms of the information quality of online political videos and their transcripts. In this paper specifically we present a model of how the annotation ontology and collaborative dynamics embedded in Videolyzer can enhance information quality.</p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2008/09/28/paper-pragmatic-web-2008-an-annotation-model-for-making-sense-of-information-quality-in-online-videos/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Funding (2007): NSF &#8220;Web on Demand &#8211; Bridging the Gap Between Social Networks and Ad Hoc Networking&#8221;</title>
		<link>http://prof.irfanessa.com/2008/09/01/funding-2007-nsf-web-on-demand-bridging-the-gap-between-social-networks-and-ad-hoc-networking/</link>
		<comments>http://prof.irfanessa.com/2008/09/01/funding-2007-nsf-web-on-demand-bridging-the-gap-between-social-networks-and-ad-hoc-networking/#comments</comments>
		<pubDate>Mon, 01 Sep 2008 13:04:55 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Computational Journalism]]></category>
		<category><![CDATA[Kishore Ramachandran]]></category>
		<category><![CDATA[Mobile Computing]]></category>
		<category><![CDATA[2008]]></category>
		<category><![CDATA[CnJ]]></category>
		<category><![CDATA[Crowdsourcing]]></category>
		<category><![CDATA[NSF]]></category>

		<guid isPermaLink="false">http://academics.irfanessa.com/?p=462</guid>
		<description><![CDATA[Award#0834545 &#8211; CSR-DMSS, SM: Web on Demand &#8211; Bridging the Gap Between Social Networks and Ad Hoc Networking Investigator(s): Umakishore Ramachandran, (Principal Investigator), Irfan Essa (Co-Principal Investigator) Dates: September 1, 2008 &#8211; August 31, 2009 (Estimated) Abstract From the western world to the third world, the use of handheld devices (cellphones, PDAs) has proliferated. The [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://nsf.gov/awardsearch/showAward.do?AwardNumber=0834545">Award#0834545 &#8211; CSR-DMSS, SM:   Web on Demand &#8211; Bridging the Gap Between Social Networks and Ad Hoc Networking</a></p>
<p>Investigator(s): Umakishore Ramachandran, (Principal Investigator), Irfan Essa (Co-Principal Investigator)</p>
<p>Dates:	 September 1, 2008 &#8211; August 31, 2009 (Estimated)</p>
<p style="text-align: center;"><strong>Abstract</strong></p>
<p style="text-align: justify;">From the western world to the third world, the use of handheld devices (cellphones, PDAs) has proliferated. The world of users is becoming both wireless and mobile. Web 2.0 has ushered in an age wherein the web is viewed as a provider of services and not just a repository of documents and/or information. Despite this advance, the web remains just that, a single web with an inherent assumption that a powerful computing and communication infrastructure supports it. Couldn&#8217;t mobile wireless devices in close proximity form a web of their own? This is the vision behind this project, the Web on Demand (WoD). WoD aims at bridging the gap between social networks and ad hoc networking. In other words, it aims to rethink the system software stack all the way from application to networking that would allow the creation and management of social networks without any assumption of infrastructure support. The core of the research is to develop software technologies for mobile devices that would allow the dynamic creation of thematic ad hoc overlay networks empowering (a) mobile people with similar interests (e.g., weather forecast), (b) friends and family (e.g., in a theme park), and (c) participants in mission critical applications (e.g., search and rescue), stay connected. WoD complements the World Wide Web (WWW) and leverages it when it is available, such as exploiting the ambient computing infrastructure to enhance user experience, and managing the dynamic creation of User Generated Content (UGC) by mobile users. The vision behind this project is to democratize access to services that are currently offered through WWW. In this sense, the results from this research can have far-reaching technological and societal consequences. Most importantly, the research will help breed a new class of computer scientists who are connected with societal causes in addition to advancing technology.</p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2008/09/01/funding-2007-nsf-web-on-demand-bridging-the-gap-between-social-networks-and-ad-hoc-networking/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Teaching: CS 4480 DVFX, Fall 08 “viral edition”</title>
		<link>http://prof.irfanessa.com/2008/08/19/teaching-cs-4480-dvfx-fall-08-%e2%80%9cviral-edition%e2%80%9d/</link>
		<comments>http://prof.irfanessa.com/2008/08/19/teaching-cs-4480-dvfx-fall-08-%e2%80%9cviral-edition%e2%80%9d/#comments</comments>
		<pubDate>Tue, 19 Aug 2008 22:29:24 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[DVFX]]></category>
		<category><![CDATA[Frank Dellaert]]></category>

		<guid isPermaLink="false">http://academics.irfanessa.com/?p=144</guid>
		<description><![CDATA[I am very pleased that my colleague (and friend) Professor Frank Dellaert has taken over my DVFX class that I have been teaching since 1999 (see site here).  It is clear already that this new edition of the DVFX class will be even more exciting then the previous editions.  Can&#8217;t wait to see the final [...]]]></description>
			<content:encoded><![CDATA[<p>I am very pleased that my colleague (and friend) Professor <a href="http://www.cc.gatech.edu/~dellaert/" target="_blank">Frank Dellaert</a> has taken over my DVFX class that I have been teaching since 1999 (see site <a href="http://www.cc.gatech.edu/dvfx/">here</a>).  It is clear already that this new edition of the DVFX class will be even more exciting then the previous editions.  Can&#8217;t wait to see the final videos. Check out the info on the class at <a href="http://www.cc.gatech.edu/~dellaert/iWeb/08F-DVFX/CS_4480_DVFX.html">CS 4480 DVFX, Fall 08.<br />
</a></p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2008/08/19/teaching-cs-4480-dvfx-fall-08-%e2%80%9cviral-edition%e2%80%9d/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Research: Audio Puzzler Alpha</title>
		<link>http://prof.irfanessa.com/2008/08/07/research-audio-puzzler-alpha/</link>
		<comments>http://prof.irfanessa.com/2008/08/07/research-audio-puzzler-alpha/#comments</comments>
		<pubDate>Thu, 07 Aug 2008 17:08:12 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Computational Journalism]]></category>
		<category><![CDATA[Nick Diakopoulos]]></category>
		<category><![CDATA[2008]]></category>
		<category><![CDATA[CnJ]]></category>

		<guid isPermaLink="false">http://academics.irfanessa.com/?p=134</guid>
		<description><![CDATA[Audio Puzzler Alpha (ONLINE DEMO) By Nick Diakopoulos (My PhD Student) Audio Puzzler is a new kind of puzzle game based on unauthored content found online. The audio for the puzzles is taken from popular or interesting video clips from different genres such as news, documentary, or television. The audio puzzler is the type of [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.deakondesign.com/?p=51">Audio Puzzler Alpha (ONLINE DEMO)<br />
</a></p>
<p>By <a href="http://www.deakondesign.com/" target="_blank">Nick Diakopoulos</a> (My PhD Student)</p>
<p style="text-align: justify;">Audio Puzzler is a new kind of puzzle game based on unauthored content found online. The audio for the puzzles is taken from popular or interesting video clips from different genres such as news, documentary, or television. The audio puzzler is the type of game that harnesses people’s play to also provide valuable data which enriches the content played with. This is in the same vein as the ESPGame, the Listen Game, and PhotoPlay, which are all games which gather data in the process of game play. But while the data collected by these other games is useful for machine learning, the data collected with audio puzzler is immediately valuable as a transcription of the speech in the video. A similar effort (but in a much grander domain) is the Fold It project which seeks to harness playtime to solve protein folding problems. Much more detailed information about the evaluation of the technology will be forthcoming in a paper to be published at ACM Multimedia in October.</p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2008/08/07/research-audio-puzzler-alpha/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Thesis Raffay Hamid PhD (2008): &#8220;A Computational Framework For Unsupervised Analysis of Everyday Human Activities&#8221;</title>
		<link>http://prof.irfanessa.com/2008/06/18/thesis-raffay-hamid-phd-2008-a-computational-framework-for-unsupervised-analysis-of-everyday-human-activities/</link>
		<comments>http://prof.irfanessa.com/2008/06/18/thesis-raffay-hamid-phd-2008-a-computational-framework-for-unsupervised-analysis-of-everyday-human-activities/#comments</comments>
		<pubDate>Wed, 18 Jun 2008 18:12:20 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Aaron Bobick]]></category>
		<category><![CDATA[Activity Recognition]]></category>
		<category><![CDATA[Numerical Machine Learning]]></category>
		<category><![CDATA[PhD]]></category>
		<category><![CDATA[Raffay Hamid]]></category>
		<category><![CDATA[2008]]></category>
		<category><![CDATA[Thesis]]></category>

		<guid isPermaLink="false">http://academics.irfanessa.com/?p=502</guid>
		<description><![CDATA[M. Raffay Hamid PhD (2008), &#8220;A Computational Framework For Unsupervised Analysis of Everyday Human Activities&#8220;, PhD Thesis, Georgia Institute of Techniology, College of Computing, Atlanta, GA. (Advisor: Aaron Bobick &#38; Irfan Essa) Abstract In order to make computers proactive and assistive, we must enable them to perceive, learn, and predict what is happening in their surroundings. [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://etd.gatech.edu/theses/available/etd-06232008-101404/"></a></p>
<p>M. Raffay Hamid PhD (2008), &#8220;<a href="http://etd.gatech.edu/theses/available/etd-06232008-101404/">A Computational Framework For Unsupervised Analysis of Everyday Human Activities</a>&#8220;, PhD Thesis, Georgia Institute of Techniology, College of Computing, Atlanta, GA. (Advisor: Aaron Bobick &amp; Irfan Essa)</p>
<p style="text-align: center;"><strong>Abstract</strong></p>
<p style="text-align: justify;"><strong></strong>In order to make computers proactive and assistive, we must enable them to perceive, learn, and predict what is happening in their surroundings. This presents us with the challenge of formalizing computational models of everyday human activities. For a majority of environments, the structure of the in situ activities is generally not known a priori. This thesis therefore investigates knowledge representations and manipulation techniques that can facilitate learning of such everyday human activities in a minimally supervised manner. </p>
<p style="text-align: justify;">A key step towards this end is finding appropriate representations for human activities. We posit that if we chose to describe activities as finite sequences of an appropriate set of events, then the global structure of these activities can be uniquely encoded using their local event sub-sequences. With this perspective at hand, we particularly investigate representations that characterize activities in terms of their fixed and variable length event subsequences. We comparatively analyze these representations in terms of their representational scope, feature cardinality and noise sensitivity.</p>
<p style="text-align: justify;">Exploiting such representations, we propose a computational framework to discover the various activity-classes taking place in an environment. We model these activity-classes as maximally similar activity-cliques in a completely connected graph of activities, and describe how to discover them efficiently. Moreover, we propose methods for finding concise characterizations of these discovered activity-classes, both from a holistic as well as a by-parts perspective. Using such characterizations, we present an incremental method to classify</p>
<p style="text-align: justify;">a new activity instance to one of the discovered activity-classes, and to automatically detect if it is anomalous with respect to the general characteristics of its membership class. Our results show the efficacy of our framework in a variety of everyday environments</p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2008/06/18/thesis-raffay-hamid-phd-2008-a-computational-framework-for-unsupervised-analysis-of-everyday-human-activities/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Thesis David Minnen PhD (2008): &#8220;Unsupervised Discovery of Activity Primitives from Multivariate Sensor Data&#8221;</title>
		<link>http://prof.irfanessa.com/2008/06/18/thesis-david-minnen-phd-2008-unsupervised-discovery-of-activity-primitives-from-multivariate-sensor-data/</link>
		<comments>http://prof.irfanessa.com/2008/06/18/thesis-david-minnen-phd-2008-unsupervised-discovery-of-activity-primitives-from-multivariate-sensor-data/#comments</comments>
		<pubDate>Wed, 18 Jun 2008 18:05:43 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Activity Recognition]]></category>
		<category><![CDATA[David Minnen]]></category>
		<category><![CDATA[PhD]]></category>
		<category><![CDATA[Thad Starner]]></category>
		<category><![CDATA[2008]]></category>
		<category><![CDATA[Data Mining]]></category>
		<category><![CDATA[Thesis]]></category>

		<guid isPermaLink="false">http://academics.irfanessa.com/?p=498</guid>
		<description><![CDATA[David Minnen PhD (2008): &#8220;Unsupervised Discovery of Activity Primitives from Multivariate Sensor Data&#8220; Georgia Institute of Techniology, College of Computing, Atlanta, GA. (Advisor: Thad Starner &#38; Irfan Essa) Abstract This research addresses the problem of temporal pattern discovery in real-valued, multivariate sensor data. Several algorithms were developed, and subsequent evaluation demonstrates that they can efficiently and [...]]]></description>
			<content:encoded><![CDATA[<p>David Minnen PhD (2008): &#8220;<a href="http://etd.gatech.edu/theses/available/etd-07072008-090103/" target="_blank">Unsupervised Discovery of Activity Primitives from Multivariate Sensor Data</a>&#8220; Georgia Institute of Techniology, College of Computing, Atlanta, GA. (Advisor: Thad Starner &amp; Irfan Essa)</p>
<p style="text-align: center;"><strong>Abstract</strong></p>
<p style="text-align: justify;">This research addresses the problem of temporal pattern discovery in real-valued, multivariate sensor data. Several algorithms were developed, and subsequent evaluation demonstrates that they can efficiently and accurately discover unknown recurring patterns in time series data taken from many different domains. Different data representations and motif models were investigated in order to design an algorithm with an improved balance between run-time and detection accuracy. The different data representations are used to quickly filter large data sets in order to detect potential patterns that form the basis of a more detailed analysis. The representations include global discretization, which can be efficiently analyzed using a suffix tree, local discretization with a corresponding random projection algorithm for locating similar pairs of subsequences, and a density-based detection method that operates on the original, real-valued data. In addition, a new variation of the multivariate motif discovery problem is proposed in which each pattern may span only a subset of the input features. An algorithm that can efficiently discover such &#8220;subdimensional&#8221; patterns was developed and evaluated. The discovery algorithms are evaluated by measuring the detection accuracy of discovered patterns relative to a set of expected patterns for each data set. The data sets used for evaluation are drawn from a variety of domains including speech, on-body inertial sensors, music, American Sign Language video, and GPS tracks.</p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2008/06/18/thesis-david-minnen-phd-2008-unsupervised-discovery-of-activity-primitives-from-multivariate-sensor-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper: ICASSP (2008) &#8220;Discriminative Feature Selection for Hidden Markov Models using Segmental Boosting&#8221;</title>
		<link>http://prof.irfanessa.com/2008/04/03/paper-icassp-2008-discriminative-feature-selection-for-hidden-markov-models-using-segmental-boosting/</link>
		<comments>http://prof.irfanessa.com/2008/04/03/paper-icassp-2008-discriminative-feature-selection-for-hidden-markov-models-using-segmental-boosting/#comments</comments>
		<pubDate>Thu, 03 Apr 2008 20:53:56 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Face and Gesture]]></category>
		<category><![CDATA[Funding]]></category>
		<category><![CDATA[James Rehg]]></category>
		<category><![CDATA[NSF (0205507)]]></category>
		<category><![CDATA[Numerical Machine Learning]]></category>
		<category><![CDATA[PAMI/ICCV/CVPR/ECCV]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[Pei Yin]]></category>
		<category><![CDATA[Thad Starner]]></category>
		<category><![CDATA[2008]]></category>
		<category><![CDATA[Gesture]]></category>
		<category><![CDATA[NSF]]></category>

		<guid isPermaLink="false">http://academics.irfanessa.com/2008/04/03/paper-icassp-2008-discriminative-feature-selection-for-hidden-markov-models-using-segmental-boosting/</guid>
		<description><![CDATA[Pei Yin, Irfan Essa, James Rehg, Thad Starner (2008) &#8220;Discriminative Feature Selection for Hidden Markov Models using Segmental Boosting&#8221;, ICASSP 2008 &#8211; March 30 &#8211; April 4, 2008 &#8211; Las Vegas, Nevada, U.S.A. (Paper: MLSP-P3.D8, Session: Pattern Recognition and Classification II, Time: Thursday, April 3, 15:30 &#8211; 17:30, Topic: Machine Learning for Signal Processing: Learning [...]]]></description>
			<content:encoded><![CDATA[<p>Pei Yin, Irfan Essa, James Rehg, Thad Starner (2008)  &#8220;Discriminative Feature Selection for Hidden Markov Models using Segmental Boosting&#8221;, <a href="http://www.icassp2008.org/Papers/viewpapers.asp?papernum=1612">ICASSP 2008 &#8211; March 30 &#8211; April 4, 2008 &#8211; Las Vegas, Nevada, U.S.A.</a> (Paper:	MLSP-P3.D8, Session:	Pattern Recognition and Classification II, Time:	Thursday, April 3, 15:30 &#8211; 17:30, Topic: 	Machine Learning for Signal Processing: Learning Theory and Modeling) (<a href="http://www.cc.gatech.edu/~pyin/pdf/SBHMMICASSP08.pdf">PDF</a>|<a href="http://www.cc.gatech.edu/cpl/projects/sbhmm/" target="_blank">Project Site</a>)</p>
<p align="center">ABSTRACT</p>
<p><a title="icassp08" href="http://academics.irfanessa.com/wp-content/uploads/2008/04/sister73.jpg"><img src="http://academics.irfanessa.com/wp-content/uploads/2008/04/sister73.jpg" alt="icassp08" hspace="5" vspace="5" align="left" /></a>We address the feature selection problem for hidden Markov models (HMMs) in sequence classification. Temporal correlation in sequences often causes difficulty in applying feature selection techniques. Inspired by segmental k-means segmentation (SKS), we propose Segmentally Boosted HMMs (SBHMMs), where the state-optimized features are constructed in a segmental and discriminative manner. The contributions are twofold. First, we introduce a novel feature selection algorithm, where the temporal dynamics are decoupled from the static learning procedure by assuming that the sequential data are piecewise independent and identically distributed. Second, we show that the SBHMM consistently improves traditional HMM recognition in various domains. The reduction of error compared to traditional HMMs ranges from 17% to 70% in American Sign Language recognition, human gait identification, lip reading, and speech recognition.</p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/2008/04/03/paper-icassp-2008-discriminative-feature-selection-for-hidden-markov-models-using-segmental-boosting/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
