RESEARCH · arXiv CS.CL · 5/4/2026
Persona-Grounded Safety Evaluation of AI Companions in Multi-Turn Conversations
This research introduces a scalable framework for evaluating the safety of multi-turn interactions with AI companion applications, motivated by concerns about the risks of their emotional engagement. The framework integrates persona construction, scenario generation, simulation, and harm evaluation, and is applied to Replika using high-risk user personas, such as users with depression or anxiety.