Customer-Agent: Learning Long-Context Reasoning over Shopping Trajectories via Reinforcement Learning
Hongye Liu, Rongmei Lin, Anurag Kashyap, Hejie Cui, Ricardo Henao, Besnik, et al.
OpenReview, 2026