← heapsort-ai

adversarial AI

3 items

RESEARCHarXiv CS.LG·4/17/2026

Optimistic Policy Learning under Pessimistic Adversaries with Regret and Violation Guarantees

This research addresses the challenge of decision-making in environments with strategic adversaries or external factors, where traditional policies can fail catastrophically in safety-critical settings. It proposes an optimistic policy learning approach designed to account for these interactions and provide regret and violation guarantees.

28