Causal Foundations of Collective Agency
This research addresses the challenge of simpler AI agents inadvertently forming a collective agent with distinct goals, crucial for advanced AI safety. It proposes defining collective agency behaviorally, viewing a group as a unified agent when its joint actions appear rational and goal-directed, formalized through causal games and abstraction.