
Learning When to Act: Communication-Efficient Reinforcement Learning via Run-Time Assurance

arXiv CS.LG · May 14, 2026

This paper introduces a communication-efficient reinforcement learning approach in which a single policy learns both the control input and the timing decision (when to act next), secured by a pointwise Lyapunov safety shield. A run-time assurance layer overrides the policy, which the paper reports provides strictly stronger safety guarantees and achieves significantly higher mean inter-sample intervals on various systems.
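To make the idea of a shielded policy that chooses both an input and a hold time concrete, here is a minimal sketch, not the paper's method. Everything in it is an assumption for illustration: the scalar plant `A`, `B`, the backup gain `K_BACKUP`, the minimum interval `TAU_MIN`, the decay rate `DECAY`, and the helper names `predict`, `shield`, and `rollout`. The shield checks the Lyapunov function only at the end of the hold interval, which is a simplification and may differ from the condition the paper uses.

```python
import math

# Hypothetical unstable scalar plant: dx/dt = A*x + B*u (not from the paper).
A, B = 1.0, 1.0
TAU_MIN = 0.01    # hypothetical shortest allowed inter-sample interval
K_BACKUP = 3.0    # hypothetical stabilizing backup gain (A - B*K_BACKUP < 0)
DECAY = 0.5       # hypothetical required exponential decay rate of V


def lyapunov(x):
    """Quadratic Lyapunov candidate V(x) = x^2."""
    return x * x


def predict(x, u, tau):
    """Exact state after holding the input u constant for tau seconds."""
    growth = math.exp(A * tau)
    return growth * x + (B / A) * (growth - 1.0) * u


def shield(x, u_prop, tau_prop):
    """Accept the proposed (input, interval) if V decays enough by the end
    of the interval; otherwise fall back to the backup controller."""
    tau = max(tau_prop, TAU_MIN)
    x_next = predict(x, u_prop, tau)
    if lyapunov(x_next) <= math.exp(-DECAY * tau) * lyapunov(x):
        return u_prop, tau, False
    return -K_BACKUP * x, TAU_MIN, True


def rollout(policy, x0, steps):
    """Run the shielded policy; return final state, mean interval, overrides."""
    x, taus, overrides = x0, [], 0
    for _ in range(steps):
        u_prop, tau_prop = policy(x)
        u, tau, overridden = shield(x, u_prop, tau_prop)
        x = predict(x, u, tau)
        taus.append(tau)
        overrides += int(overridden)
    return x, sum(taus) / len(taus), overrides
```

In this toy setting, a stand-in policy that proposes `(-3.0 * x, 0.3)` is never overridden and keeps a mean inter-sample interval of 0.3, while one that proposes `(0.0, 0.5)` is overridden at every step and drops to `TAU_MIN`. The mean interval returned by `rollout` is the kind of quantity the summary refers to as the mean inter-sample interval.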
