Publications
Socratic Planner: Inquiry-Based Zero-Shot Planning for Embodied Instruction Following
arXiv preprint
Paper
Continual Vision-and-Language Navigation
arXiv preprint
Paper
CLIP-RT: Learning Language-Conditioned Robotic Policies from Natural Language Supervision
CoRL 2024 Workshop on Language and Robot Learning
Project Page
Paper
PGA: Personalizing Grasping Agents with Single Human-Robot Interaction
IROS 2024
Paper
PROGrasp: Pragmatic Human-Robot Communication for Object Grasping
The Dialog Must Go On: Improving Visual Dialog via Generative Self-Training
CVPR 2023
ICML 2022 Workshop on Pre-training: Perspectives, Pitfalls, and Paths Forward
Project Page
Paper
Code
Slides
Video
GVCCI: Lifelong Learning of Visual Grounding for Language-Guided Robotic Manipulation
Improving Robustness to Texture Bias via Shape-focused Augmentation
CVPR 2022 Workshop on Human-centered Intelligent Services: Safety and Trustworthy
Paper
Reasoning Visual Dialog with Sparse Graph Learning and Knowledge Transfer
EMNLP 2021 Findings
Paper
Code
Slides
Attend What You Need: Motion-Appearance Synergistic Networks for Video Question Answering
C3: Contrastive Learning for Cross-domain Correspondence in Few-shot Image Generation
NeurIPS 2021 Workshop on Controllable Generative Modeling in Language and Vision
Paper
Label Propagation Adaptive Resonance Theory for Semi-Supervised Continuous Learning
ICASSP 2020
Paper
Dual Attention Networks for Visual Reference Resolution in Visual Dialog
EMNLP 2019
ICCV 2019 Workshop on Video Turing Test (Spotlight Talk)
Paper
Code
Slides