Publications
Conference Publications
ShapeGrasp: Zero-Shot Object Grasping with LLMs via Geometric Decomposition
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2024
🏆 Oral
Website
Paper
Thread
Let Me Help You! Neuro-Symbolic Short-Context Action Anticipation
IEEE Robotics and Automation Letters (RA-L), 2024
International Conference on Robotics and Automation (ICRA), 2025
Website
Paper
Code
Thread
Video
Demo
Sample-Efficient Learning of Novel Visual Concepts
Conference on Lifelong Learning Agents (CoLLAs), 2023
🏆 Oral; Top 20% of Accepted Papers
Website
Paper
Code
Thread
Video
Suspect Identification Framework using Contrastive Relevance Feedback
Winter Conference on Applications of Computer Vision (WACV), 2023
Paper
Video
Emotional Talking Faces: Making Videos More Expressive and Realistic
ACM Multimedia Asia (MMAsia), 2022
🏆 Best Demo Paper Award; 300+ stars on GitHub
Paper
Code
Video
Project Page
Disentangling Multiple Features in Video Sequences using Gaussian Processes in Variational Autoencoders
European Conference on Computer Vision (ECCV), 2020
ICVGIP 2020 Vision India
Paper
Code
Video
UAV Target Tracking in Urban Environments Using Deep Reinforcement Learning
International Conference on Unmanned Aircraft Systems (ICUAS), 2020
🏆 Oral; 50+ citations
Paper
Video
C3VQG: Category Consistent Cyclic Visual Question Generation
ACM Multimedia Asia (MMAsia), 2020
VQA and Dialogue Workshop, Computer Vision and Pattern Recognition (CVPR), 2020 (Spotlight)
Paper
Code
Video
PrOSe: Product of Orthogonal Spheres Parameterization for Disentangled Representation Learning
British Machine Vision Conference (BMVC), 2019
Paper
Geometry of Deep Generative Models for Disentangled Representations
Indian Conference on Computer Vision, Graphics and Image Processing (ICVGIP), 2018
Paper
Journal Publications
Multimodal Research in Vision and Language: A Review of Current and Emerging Trends
Information Fusion Journal, 2021 (Impact Factor: 15.7)
🏆 100+ citations
Paper
Deep Reinforcement Learning for Soft Robotic Applications: Brief Overview with Impending Challenges
Robotics 2019, 8(1), 4
🏆 100+ citations
Paper
Theses
Enhancing Robot Perception and Interaction Through Structured Domain Knowledge
Master's Thesis
Thesis
Geometry of Neural Network based Disentangled Latent Space Models
Bachelor's Thesis
Thesis
Workshop Publications
WROOM: An Autonomous Driving Approach for Off-Road Navigation
Off-road Autonomy Workshop, ICRA 2024
Website
Paper
Simulator
Media
Geometric Shape Reasoning for Zero-Shot Task-Oriented Grasping
Workshop on 3D Visual Representations for Robot Manipulation, ICRA 2024
Website
Symbolic Graph Inference for Compound Scene Understanding
Workshop on Ontologies and Standards for Robotics and Automation, ICRA 2024
Paper
Knowledge-Guided Short-Context Action Anticipation in Human-Centric Videos
Workshop on AI for Creative Video Editing and Understanding, ICCV 2023
Paper
Video
Thread
Emotionally Enhanced Talking Face Generation
Workshop on AI for Creative Video Editing and Understanding, ICCV 2023
Workshop on Multimedia Content Generation and Evaluation: New Methods and Practice (McGE), MM 2023
Paper
Code
Video
Contrastive Personalization Approach to Suspect Identification
AAAI Student Abstract 2021
Paper
DisCont: Self-Supervised Visual Attribute Disentanglement using Context Vectors
Perception Through Structured Generative Models, ECCV 2020
ML Interpretability for Scientific Discovery, ICML 2020
Paper
Code
Video