
SalesSim: Benchmarking and Aligning Multimodal Language Models as Retail User Simulators

arXiv CS.CL · May 12, 2026

SalesSim is a framework and testbed designed to evaluate Multimodal Large Language Models (MLLMs) as realistic, persona-driven customer simulators in online retail conversations. It models retail interaction as an agentic process, benchmarking state-of-the-art models and identifying behavioral gaps in decision alignment and conversational quality.
