RESEARCH27
Belief-State RWKV for Reinforcement Learning under Partial Observability
arXiv CS.LGΒ·April 14, 2026
This paper proposes Belief-State RWKV, a stronger RL formulation where the recurrent state is explicitly interpreted as a belief state. The method maintains a compact uncertainty-aware state, allowing policies to depend on both memory and confidence in partially observed settings.
Read original β