I am a Ph.D. student in Computer Science at the Robotics Perception and Learning Lab (RIPL), Georgia Institute of Technology, advised by Prof. Zsolt Kira. My research focuses on theoretical machine learning and computer vision, with an emphasis on enhancing the adaptability and generalization capabilities of foundation models.
Prior to this, I completed a Master’s by Research in Computer Science at Georgia Tech, where my thesis explored efficient and robust fine-tuning of Vision-Language Models. During my Master’s program, I also gained industry experience as a Computer Vision Research Intern at the Sony R&D Center, Switzerland (Fall 2023 - Spring 2024) and as an Applied Scientist Intern at Amazon Science, Seattle (Summer 2023).
Before my time at Georgia Tech, I worked at Microsoft Research India (MSRI) under the supervision of Dr. Venkat Padmanabhan (Managing Director, MSRI), where I contributed to Project HyWay.
Ph.D. in Computer Science, 2028
Georgia Institute of Technology
Master's in Computer Science, 2024
Georgia Institute of Technology
B.Tech. in Computer Science, 2020
Delhi Technological University
| Date | News |
| --- | --- |
| Feb 2026 | Our full paper titled "The Geometry of Robustness: Optimizing Loss Landscape Curvature and Feature Manifold Alignment for Robust Finetuning of Vision-Language Models" was accepted at CVPR 2026. |
| May 2025 | Our short paper titled "MedMoE: Modality-Specialized Mixture of Experts for Medical Vision-Language Understanding" was accepted at the MMFM-BIOMED Workshop at CVPR 2025. |
| Feb 2025 | Our full paper titled "FRAMES-VQA: Benchmarking Fine-Tuning Robustness across Multi-Modal Shifts in Visual Question Answering" was accepted at CVPR 2025. |
| Jan 2025 | Our full paper titled "Directional Gradient Projection for Robust Fine-tuning of Foundation Models" was accepted at ICLR 2025. |
| Jan 2025 | Started my Ph.D. in Computer Science at Georgia Institute of Technology. |
| Dec 2024 | Successfully defended my Master’s thesis. |
| Oct 2024 | Our full paper titled "Refining Text-to-Image Generation: Towards Accurate Training-Free Glyph-Enhanced Image Generation" was accepted at WACV 2025. |
| Sep 2023 | Started working as a Computer Vision Research Intern at Sony R&D Center, Switzerland. |
| Aug 2023 | Our full paper titled "Learning to Discern: Imitating Heterogeneous Human Demonstrations with Preference and Representation Learning" was accepted at CoRL 2023. |
| May 2023 | Started working as an Applied Scientist Intern at Amazon, Seattle. |
| Apr 2023 | Our full paper titled "HyWay: Enabling Unstructured Conversations in the New Hybrid World" was accepted at UbiComp 2023 (IMWUT). |
| Apr 2023 | Our workshop paper titled "Symbiotic Artificial Intelligence: Order Picking and Ambient Sensing" was accepted at the Ambient AI Workshop at ICASSP 2023. |
| Aug 2022 | Started my Master’s in Computer Science at Georgia Institute of Technology. |
| Jul 2022 | Our workshop paper titled "Active Data Discovery: Mining Unknown Data using Submodular Information Measures" was accepted at the Real World ML Workshop at ICML 2022. |
| May 2022 | Started my research internship in Team HyWay at Microsoft Research Lab, India. |
| Jul 2021 | Started my research internship at the Faryabi Lab at the University of Pennsylvania. |
| Jun 2020 | Started my research internship at IBM Research Lab, India. |
| May 2020 | Successfully defended my Bachelor’s thesis. |
| Jan 2020 | Our highlight paper titled "Attention-based Sketch Recognition using Transformers" was accepted at ECAI 2020. |
| Dec 2019 | Our short paper titled "Utilizing Temporal Psycholinguistic Cues for Suicidal Intent Estimation" was accepted at ECIR 2020. |
| Nov 2019 | Our full paper titled "Hindi-English Hate Speech Detection: Author Profiling, Debiasing, and Practical Perspectives" was accepted at AAAI 2020. |
| Aug 2019 | Started my research internship at MIDAS Lab, IIIT Delhi. |