RESEARCH31
Human-Guided Harm Recovery for Computer Use Agents
arXiv CS.AIΒ·April 22, 2026
This research formalizes "harm recovery" for AI agents executing actions on computer systems, addressing the challenge of optimally steering an agent from a harmful to a safe state. It grounds preference-aligned recovery through a user study and operationalizes insights into a reward model for evaluating recovery plans.
Read original β