Dheeraj Vattikonda

Dheeraj Vattikonda

Visiting Researcher, ServiceNow AI Research
M.Sc. Student, McGill University & Mila

About Me

I work on reinforcement learning and reasoning with LLM agents in the Long Horizon Agents team under Massimo Caccia at ServiceNow AI Research, based out of Montreal. I am also a Master's student at McGill University and Mila under the guidance of Xue (Steve) Liu.

My current research revolves around reasoning in LLM web agents and tool-calling systems. My recent work on web agent training received an oral presentation at ICML and will appear at NeurIPS 2025.

I completed my Bachelor's in Electronics and Communication Engineering at NIT Hamirpur, where I worked on robot perception, focusing on differentiable SLAM systems and LiDAR-based perception tasks for autonomous navigation.

News

Selected Publications

Privileged Information Distillation
Emiliano Peñaloza, Dheeraj Vattikonda, Nicolas Gontier, Alexandre Lacoste, Laurent Charlin, Massimo Caccia
arXiv 2026
Distill frontier models even when they hide their reasoning, using training-time privileged information.
Web Agent Training
Dheeraj Vattikonda, Santhoshi Ravichandran, Emiliano Penaloza, Hadi Nekoei, Megh Thakkar, Thibault Le Sellier de Chezelles, Nicolas Gontier, Miguel Muñoz-Mármol, Sahar Omidi Shayegan, Stefania Raimondo, Xue Liu, Alexandre Drouin, Laurent Charlin, Alexandre Piché, Alexandre Lacoste, Massimo Caccia
ICML 2025 CUA Workshop (Oral) · NeurIPS 2025
A comprehensive statistical analysis of training recipes for LLM web agents.
Aurora
Benno Krojer, Dheeraj Vattikonda, Luis Lara, Varun Jampani, Eva Portelance, Christopher Pal, Siva Reddy
NeurIPS 2024 (Spotlight)
Action and reasoning-centric image editing learned from video and simulation data.
Differentiable SLAM
Prashant Kumar, Dheeraj Vattikonda, Vedang Bhupesh Shenvi Nadkarni, Erqun Dong, Sabyasachi Sahoo
BMVC 2023
End-to-end differentiable SLAM improves downstream LiDAR perception.
SLACK
Prashant Kumar, Dheeraj Vattikonda, Kshitij Madhav Bhat, Kunal Dargan, Prem Kalra
ICIPCW 2024
Adversarial attacks on LiDAR SLAM through targeted point cloud injections.
Grad-LiDAR-SLAM
Aryan FNU, Dheeraj Vattikonda, Erqun Dong, Sabyasachi Sahoo
IROS 2022
A fully differentiable LiDAR SLAM pipeline with pose-graph optimization.