Applied Scientist · Amazon

Anurag Kashyap

I work on language model post-training, evaluation, and the infrastructure around training agents. My recent research focuses on benchmarking and improving agent behavior in realistic environments — terminals, containers, and long-context interaction.

Selected Work

All publications →

Projects

All projects →

Recent Writing

All posts →
  • Hello, world

    Welcome to the new site. I’ll use this space to write about machine learning, post-training, agents, and whatever else seems worth thinking through in public.