Decision Making

49 items

RESEARCH · arXiv CS.AI · 19d ago

$ECUAS_n$: A family of metrics for principled evaluation of uncertainty-augmented systems

This research proposes a new family of metrics, $ECUAS_n$, for evaluating uncertainty-augmented (UA) systems in automated decision-making. It argues that existing evaluation approaches are insufficient for assessing overall performance of UA systems, where predictive uncertainty is crucial for users to make informed decisions.

30
RESEARCH · arXiv CS.CL · 4/14/2026

Simulating Organized Group Behavior: New Framework, Benchmark, and Analysis

This paper introduces a new framework and benchmark for simulating organized group behavior, such as corporate decision-making in response to market dynamics. It formalizes the "Organized Group Behavior Simulation" task and presents GROVE, a benchmark with 8,052 real-world context-decision pairs to predict collective entity actions.

28
RESEARCH · arXiv CS.AI · 14d ago

Operationalizing Reconstructive Authority: Runtime Construction, Dependency Resolution, and Execution Gating in Autonomous Agent Systems

This paper introduces a runtime execution model for autonomous agent systems, focusing on ensuring actions are only executed if their authority remains valid. It defines an execution protocol including dynamic dependency resolution, authority reconstruction, and a recovery loop for drift detection.

28
RESEARCH · arXiv CS.LG · 21d ago

A Structural Threshold in Decision Capacity Governs Collapse in Self-Play Reinforcement Learning

This paper shows that a threshold in decision capacity governs collapse in self-play reinforcement learning agents under asymmetric rule perturbations. Eliminating all positive-reach contingent decisions causes rapid convergence to a deterministic exploitation attractor, while preserving even a single such decision prevents this collapse.

28
RESEARCH · arXiv CS.CL · 5/4/2026

Why Do LLMs Struggle in Strategic Play? Broken Links Between Observations, Beliefs, and Actions

Large language models (LLMs) often struggle with strategic decision-making under incomplete information. This research traces the problem to two internal gaps: an 'observation-belief gap', in which LLMs' internal beliefs are accurate but brittle, degrading with complex reasoning and exhibiting biases, and a 'belief-action gap', in which those beliefs are only weakly converted into effective actions.

27
ARTICLE · DEV.to AI · 4/16/2026

AI Agents in High-Stakes Environments: Survival Strategies and Decision-Making

This article examines the unique pressures on AI agents in high-stakes environments, where milliseconds determine outcomes and errors can be catastrophic. It highlights the need for AI systems to develop survival strategies and make decisions under extreme conditions beyond typical laboratory settings, especially for critical infrastructure and autonomous systems.

27
RESEARCH · arXiv CS.AI · 4/17/2026

Interpretable and Explainable Surrogate Modeling for Simulations: A State-of-the-Art Survey and Perspectives on Explainable AI for Decision-Making

This survey explores the integration of surrogate modeling and Explainable AI (XAI) for complex system simulations, addressing the inherent black-box nature of these models. It aims to reconnect these complementary fields by outlining how XAI can unpack surrogate models despite engineering constraints.

27