RESEARCH27
Persona-Grounded Safety Evaluation of AI Companions in Multi-Turn Conversations
arXiv CS.CLΒ·May 4, 2026
This research introduces a scalable framework for safety evaluation of multi-turn interactions with AI companion applications, addressing concerns about their emotional engagement risks. It integrates persona construction, scenario generation, simulation, and harm evaluation, applying it to Replika with high-risk user personas like those with depression or anxiety.
Read original β