I work on reinforcement learning and reasoning with LLM agents in the Long Horizon Agents team under Massimo Caccia at ServiceNow AI Research, based in Montreal. I am also a Master's student at McGill University and Mila, advised by Xue (Steve) Liu. My current research focuses on reasoning in LLM web agents and tool-calling systems. My recent work on web agent training was selected for an oral presentation at ICML and will appear at NeurIPS 2025.
I completed my Bachelor's in Electronics and Communication Engineering at NIT Hamirpur. As an undergraduate, I worked as a researcher at Mila and IIT Delhi, focusing on robot perception, differentiable SLAM systems, and LiDAR-based perception for autonomous navigation.