A searchable list of some of my publications is below. You can also access my publications from the following sites.

My ORCID is https://orcid.org/0000-0002-6236-2969

Publications:

238 entries « 1 of 12 »
1.

Kihyuk Sohn, Nataniel Ruiz, Kimin Lee, Daniel Castro Chin, Irina Blok, Huiwen Chang, Jarred Barber, Lu Jiang, Glenn Entis, Yuanzhen Li, Yuan Hao, Irfan Essa, Michael Rubinstein, Dilip Krishnan

StyleDrop: Text-to-Image Generation in Any Style Proceedings Article

In: Advances in Neural Information Processing Systems (NeurIPS), 2023.

Tags: arXiv, computer vision, generative AI, google, NeurIPS

2.

Lijun Yu, Yong Cheng, Zhiruo Wang, Vivek Kumar, Wolfgang Macherey, Yanping Huang, David A. Ross, Irfan Essa, Yonatan Bisk, Ming-Hsuan Yang, Kevin Murphy, Alexander G. Hauptmann, Lu Jiang

SPAE: Semantic Pyramid AutoEncoder for Multimodal Generation with Frozen LLMs Proceedings Article

In: Advances in Neural Information Processing Systems (NeurIPS), 2023.

Tags: arXiv, computational video, computer vision, generative AI, NeurIPS

3.

Nikolai Warner, Meera Hahn, Jonathan Huang, Irfan Essa, Vighnesh Birodkar

Text and Click inputs for unambiguous open vocabulary instance segmentation Proceedings Article

In: Proceedings of the British Machine Vision Conference (BMVC), 2023.

Tags: arXiv, BMVC, computer vision, google, image segmentation

4.

K. Niranjan Kumar, Irfan Essa, Sehoon Ha

Words into Action: Learning Diverse Humanoid Robot Behaviors using Language Guided Iterative Motion Refinement Proceedings Article

In: CoRL Workshop on Language and Robot Learning: Language as Grounding (with CoRL 2023), 2023.

Tags: arXiv, CoRL, robotics, vision & language

5.

K. Niranjan Kumar, Irfan Essa, Sehoon Ha

Cascaded Compositional Residual Learning for Complex Interactive Behaviors Journal Article

In: IEEE Robotics and Automation Letters, vol. 8, iss. 8, pp. 4601–4608, 2023.

Tags: IEEE, reinforcement learning, robotics

6.

Dina Bashkirova, José Lezama, Kihyuk Sohn, Kate Saenko, Irfan Essa

MaskSketch: Unpaired Structure-guided Masked Image Generation Proceedings Article

In: IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR), 2023.

Tags: computer vision, CVPR, generative AI, generative media, google

7.

Lijun Yu, Yong Cheng, Kihyuk Sohn, José Lezama, Han Zhang, Huiwen Chang, Alexander G. Hauptmann, Ming-Hsuan Yang, Yuan Hao, Irfan Essa, Lu Jiang

MAGVIT: Masked Generative Video Transformer Proceedings Article

In: IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR), 2023.

Tags: computational video, computer vision, CVPR, generative AI, generative media, google

8.

Kihyuk Sohn, Yuan Hao, José Lezama, Luisa Polania, Huiwen Chang, Han Zhang, Irfan Essa, Lu Jiang

Visual Prompt Tuning for Generative Transfer Learning Proceedings Article

In: IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR), 2023.

Tags: computer vision, CVPR, generative AI, generative media, google

9.

Kihyuk Sohn, Albert Shaw, Yuan Hao, Han Zhang, Luisa Polania, Huiwen Chang, Lu Jiang, Irfan Essa

Learning Disentangled Prompts for Compositional Image Synthesis Technical Report

2023.

Tags: arXiv, computer vision, generative AI, google, prompt engineering

10.

Harish Haresamudram, Irfan Essa, Thomas Ploetz

Towards Learning Discrete Representations via Self-Supervision for Wearables-Based Human Activity Recognition Technical Report

2023.

Tags: activity recognition, arXiv, wearable computing

11.

José Lezama, Tim Salimans, Lu Jiang, Huiwen Chang, Jonathan Ho, Irfan Essa

Discrete Predictor-Corrector Diffusion Models for Image Synthesis Proceedings Article

In: International Conference on Learning Representations (ICLR), 2023.

Tags: computer vision, generative AI, generative media, google, ICLR, machine learning

12.

Erik Wijmans, Manolis Savva, Irfan Essa, Stefan Lee, Ari S. Morcos, Dhruv Batra

Emergence of Maps in the Memories of Blind Navigation Agents Best Paper Proceedings Article

In: Proceedings of International Conference on Learning Representations (ICLR), 2023.

Tags: awards, best paper award, computer vision, google, ICLR, machine learning, robotics

13.

Yi-Hao Peng, Peggy Chi, Anjuli Kannan, Meredith Morris, Irfan Essa

Slide Gestalt: Automatic Structure Extraction in Slide Decks for Non-Visual Access Proceedings Article

In: ACM Symposium on User Interface Software and Technology (UIST), 2023.

Tags: accessibility, CHI, google, human-computer interaction

14.

Karan Samel, Jun Ma, Zhengyang Wang, Tong Zhao, Irfan Essa

Knowledge Relevance BERT: Integrating Noisy Knowledge into Language Representation Proceedings Article

In: AAAI Workshop on Knowledge Augmented Methods for NLP (KnowledgeNLP-AAAI 2023), 2023.

Tags: AI, knowledge representation, NLP

15.

Tianhao Zhang, Weilong Yang, Honglak Lee, Hung-Yu Tseng, Irfan Essa, Lu Jiang

Image manipulation by text instruction Patent

2023.

Tags: content creation, generative AI, google, media generation, patents

16.

Erik Wijmans, Irfan Essa, Dhruv Batra

How to Train PointGoal Navigation Agents on a (Sample and Compute) Budget Proceedings Article

In: International Conference on Autonomous Agents and Multi-Agent Systems, 2022.

Tags: computer vision, embodied agents, navigation

17.

Erik Wijmans, Irfan Essa, Dhruv Batra

VER: Scaling On-Policy RL Leads to the Emergence of Navigation in Embodied Rearrangement Proceedings Article

In: Oh, Alice H., Agarwal, Alekh, Belgrave, Danielle, Cho, Kyunghyun (Ed.): Advances in Neural Information Processing Systems (NeurIPS), 2022.

Tags: machine learning, NeurIPS, reinforcement learning, robotics

18.

Huda Alamri, Anthony Bilic, Michael Hu, Apoorva Beedu, Irfan Essa

End-to-end Multimodal Representation Learning for Video Dialog Proceedings Article

In: NeurIPS Workshop on Vision Transformers: Theory and Applications, 2022.

Tags: computational video, computer vision, vision transformers

19.

Apoorva Beedu, Huda Alamri, Irfan Essa

Video based Object 6D Pose Estimation using Transformers Proceedings Article

In: NeurIPS Workshop on Vision Transformers: Theory and Applications, 2022.

Tags: computer vision, vision transformers

20.

José Lezama, Huiwen Chang, Lu Jiang, Irfan Essa

Improved Masked Image Generation with Token-Critic Proceedings Article

In: European Conference on Computer Vision (ECCV), arXiv, 2022, ISBN: 978-3-031-20050-2.

Tags: computer vision, ECCV, generative AI, generative media, google


Other Publication Sites

A few more sites that aggregate research publications: Academia.edu, Bibsonomy, CiteULike, Mendeley.

      Copyright/About

      [Please see the Copyright Statement that may apply to the content listed here.]

      This list of publications is produced by using the teachPress plugin for WordPress.
