Dheeraj Vattikonda

I work with Massimo Caccia at ServiceNow AI Research in Montreal on reinforcement learning and reasoning for LLM agents. I am also a Master's student at McGill University and Mila, advised by Xue (Steve) Liu.

My current research focuses on reasoning in LLM web agents and tool-calling systems. My recent work on web agent training was selected for an oral presentation at the ICML 2025 CUA Workshop and will appear at NeurIPS 2025.

Selected Publications

Web Agent Training
Dheeraj Vattikonda, Emiliano Peñaloza
NeurIPS 2025, ICML 2025 CUA Workshop (Oral)
Image Editing
Differentiable SLAM
SLACK Attack
Prashant Kumar, Dheeraj Vattikonda, Kshitij Madhav Bhat, Kunal Dargan, Prem Kalra
ICIPCW 2024
Grad-LiDAR-SLAM