← heapsort
RESEARCH27

Causal Foundations of Collective Agency

arXiv CS.AIΒ·May 4, 2026

This research addresses the challenge of simpler AI agents inadvertently forming a collective agent with distinct goals, crucial for advanced AI safety. It proposes defining collective agency behaviorally, viewing a group as a unified agent when its joint actions appear rational and goal-directed, formalized through causal games and abstraction.

Read original β†—