<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Irfan Essa&#039;s Academic Activities &#187; 1997</title>
	<atom:link href="http://prof.irfanessa.com/tag/1997/feed/" rel="self" type="application/rss+xml" />
	<link>http://prof.irfanessa.com</link>
	<description>Academic/Professional Activities</description>
	<lastBuildDate>Thu, 01 Apr 2010 15:31:12 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=abc</generator>
		<item>
		<title>Paper: PUI (1997) &#8220;Prosody Analysis for Speaker Affect Determination&#8221;</title>
		<link>http://prof.irfanessa.com/1997/10/12/paper-pui-1997-prosody-analysis-for-speaker-affect-determination/</link>
		<comments>http://prof.irfanessa.com/1997/10/12/paper-pui-1997-prosody-analysis-for-speaker-affect-determination/#comments</comments>
		<pubDate>Sun, 12 Oct 1997 19:18:58 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Affective Computing]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[1997]]></category>
		<category><![CDATA[Audio Analysis]]></category>
		<category><![CDATA[HCI]]></category>

		<guid isPermaLink="false">http://academics.irfanessa.com/?p=259</guid>
		<description><![CDATA[Andrew Gardner and Irfan Essa (1997) &#8220;Prosody Analysis for Speaker Affect Determination&#8221; In Proceedings of Perceptual User Interfaces Workshop (PUI 1997), Banff, Alberta, CANADA, Oct 1997 [PDF][Project Site] Abstract Speech is a complex waveform containing verbal (e.g. phoneme, syllable, and word) and nonverbal (e.g. speaker identity, emotional state, and tone) information. Both the verbal and [...]]]></description>
			<content:encoded><![CDATA[<p>Andrew Gardner and Irfan Essa (1997) &#8220;<a href="http://www-static.cc.gatech.edu/cpl/pubs/pui.97/">Prosody Analysis for Speaker Affect Determination</a>&#8221; In Proceedings of Perceptual User Interfaces Workshop (PUI 1997), Banff, Alberta, CANADA, Oct 1997 [<a href="http://www-static.cc.gatech.edu/cpl/pubs/pui.97/pui97.pdf" target="_self">PDF</a>][<a href="http://www-static.cc.gatech.edu/cpl/pubs/pui.97/" target="_self">Project Site</a>]</p>
<p style="text-align: center;"><strong>Abstract</strong></p>
<p style="text-align: justify;">Speech is a complex waveform containing verbal (e.g. phoneme, syllable, and word) and nonverbal (e.g. speaker identity, emotional state, and tone) information. Both the verbal and nonverbal aspects of speech are extremely important in interpersonal communication and human-machine interaction. However, work in machine perception of speech has focused primarily on the verbal, or content-oriented, goals of speech recognition, speech compression, and speech labeling. Usage of nonverbal information has been limited to speaker identification applications. While the success of research in these areas is well documented, this success is fundamentally limited by the effect of nonverbal information on the speech waveform. The extra-linguistic aspect of speech is considered a source of variability that theoretically can be minimized with an appropriate preprocessing technique; determination of such robust techniques is however, far from trivial.</p>
<p style="text-align: justify;">It is widely believed in the speech processing community that the nonverbal component of speech contains higher-level information that provides cues for auditory scene analysis, speech understanding, and the determination of a speaker&#8217;s psychological state or conversational tone. We believe that the identification of such nonverbal cues can improve the performance of classic speech processing tasks and will be necessary for the realization of natural, robust human-computer speech interfaces. In this paper we seek to address the problem of how to systematically analyze the nonverbal aspect of the speech waveform to determine speaker affect, specifically by analyzing the pitch contour.</p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/1997/10/12/paper-pui-1997-prosody-analysis-for-speaker-affect-determination/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper: IEEE PAMI (1997) &#8220;Coding, analysis, interpretation, and recognition of facial expressions&#8221;</title>
		<link>http://prof.irfanessa.com/1997/07/14/paper-ieee-pami-1997-coding-analysis-interpretation-and-recognition-of-facial-expressions/</link>
		<comments>http://prof.irfanessa.com/1997/07/14/paper-ieee-pami-1997-coding-analysis-interpretation-and-recognition-of-facial-expressions/#comments</comments>
		<pubDate>Mon, 14 Jul 1997 15:10:39 +0000</pubDate>
		<dc:creator>Irfan Essa</dc:creator>
				<category><![CDATA[Affective Computing]]></category>
		<category><![CDATA[Face and Gesture]]></category>
		<category><![CDATA[PAMI/ICCV/CVPR/ECCV]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Sandy Pentland]]></category>
		<category><![CDATA[1997]]></category>
		<category><![CDATA[Computer Vision]]></category>
		<category><![CDATA[Faces]]></category>
		<category><![CDATA[PAMI]]></category>

		<guid isPermaLink="false">http://academics.irfanessa.com/1997/07/14/paper-ieee-pami-1997-coding-analysis-interpretation-and-recognition-of-facial-expressions/</guid>
		<description><![CDATA[Coding, analysis, interpretation, and recognition of facial expressions Essa, I.A. Pentland, A.P. In IEEE Transactions on Pattern Analysis and Machine Intelligence, July 1997, Volume: 19 , Issue: 7, pp 757 &#8211; 763, ISSN: 0162-8828, CODEN: ITPIDJ. INSPEC Accession Number:5661539 Digital Object Identifier: 10.1109/34.598232 Abstract We describe a computer vision system for observing facial motion by [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://ieeexplore.ieee.org/search/srchabstract.jsp?arnumber=598232&amp;isnumber=13123&amp;punumber=34&amp;k2dockey=598232@ieeejrns&amp;query=%28%28essa%29%3Cin%3Eau+%29&amp;pos=1">Coding, analysis, interpretation, and recognition of facial expressions</a></p>
<p>Essa, I.A.   Pentland, A.P. In <em>IEEE Transactions on Pattern Analysis and Machine Intelligence</em>, July 1997, Volume: 19 , Issue: 7, pp 757 &#8211; 763, ISSN: 0162-8828, CODEN: ITPIDJ. INSPEC Accession Number:5661539<br />
Digital Object Identifier: <a href="http://doi.ieeecomputersociety.org/10.1109/34.598232">10.1109/34.598232</a></p>
<p align="center"><strong>Abstract</strong></p>
<p>We describe a computer vision system for observing facial motion by using an optimal estimation optical flow method coupled with geometric, physical and motion-based dynamic models describing the facial structure. Our method produces a reliable parametric representation of the face&#8217;s independent muscle action groups, as well as an accurate estimate of facial motion. Previous efforts at analysis of facial expression have been based on the facial action coding system (FACS), a representation developed in order to allow human psychologists to code expression from static pictures. To avoid use of this heuristic coding scheme, we have used our computer vision system to probabilistically characterize facial motion and muscle activation in an experimental population, thus deriving a new, more accurate, representation of human facial expressions that we call FACS . Finally, we show how this method can be used for coding, analysis, interpretation, and recognition of facial expressions</p>
]]></content:encoded>
			<wfw:commentRss>http://prof.irfanessa.com/1997/07/14/paper-ieee-pami-1997-coding-analysis-interpretation-and-recognition-of-facial-expressions/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
