RESEARCH27

Belief-State RWKV for Reinforcement Learning under Partial Observability

arXiv CS.LG·April 14, 2026

This paper proposes Belief-State RWKV, a stronger RL formulation where the recurrent state is explicitly interpreted as a belief state. The method maintains a compact uncertainty-aware state, allowing policies to depend on both memory and confidence in partially observed settings.

Belief State RWKV Partial Observability reinforcement learning Recurrent Neural Networks

Read original ↗