← heapsort
RESEARCH31

Human-Guided Harm Recovery for Computer Use Agents

arXiv CS.AIΒ·April 22, 2026

This research formalizes "harm recovery" for AI agents executing actions on computer systems, addressing the challenge of optimally steering an agent from a harmful to a safe state. It grounds preference-aligned recovery through a user study and operationalizes insights into a reward model for evaluating recovery plans.

Read original β†—