RESEARCH27

RAMP: Hybrid DRL for Online Learning of Numeric Action Models

arXiv CS.AI·April 13, 2026

RAMP proposes a novel strategy for learning numeric planning action models online through environmental interactions, integrating Deep Reinforcement Learning (DRL), action model learning, and planning. This creates a positive feedback loop where the RL policy gathers data to refine the action model, while the planner generates plans to continue training the RL policy.

Deep Reinforcement Learning Action Model Learning Numeric Planning reinforcement learning Automated Planning

Read original ↗