Dheeraj Vattikonda

I work on reinforcement learning and reasoning with LLM agents with Massimo Caccia at ServiceNow AI Research, based out of Montreal. I am also a Master's student at McGill University and Mila under the guidance of Xue (Steve) Liu.

My current research revolves around reasoning in LLM web agents and tool-calling systems, and my recent work on web agent training received an oral presentation at ICML and will appear at NeurIPS 2025.

Selected Publications

Web Agent Training

How to Train Your LLM Web Agent: A Statistical Diagnosis

Dheeraj Vattikonda, Emiliano Peñaloza

NeurIPS 2025 ICML 2025 CUA Workshop Oral

paper slides

Image Editing

Learning Action and Reasoning-Centric Image Editing from Videos and Simulations

Benno Krojer, Dheeraj Vattikonda

NeurIPS 2024 Spotlight

paper website

Differentiable SLAM

Differentiable SLAM Helps Deep Learning-based LiDAR Perception Tasks

Prashant Kumar, Dheeraj Vattikonda

BMVC 2023

paper

SLACK Attack

SLACK: Attacking LiDAR-based SLAM with Adversarial Point Injections

Prashant Kumar, Dheeraj Vattikonda, Kshitij Madhav Bhat, Kunal Dargan, Prem Kalra

ICIPCW 2024

paper

Grad-LiDAR-SLAM

Grad-LiDAR-SLAM: Fully Differentiable Global SLAM for LiDAR with Pose-Graph Optimization

Dheeraj Vattikonda, Aryan

IROS 2022

paper code