RESEARCH27
Plan Before You Trade: Inference-Time Optimization for RL Trading Agents
arXiv CS.LGΒ·May 14, 2026
This paper introduces FPILOT, a plugin inference-time optimization framework for reinforcement learning trading agents. It uses predicted price trajectories to optimize the policy at inference-time before executing a trade, being compatible with any pre-trained agent.
Read original β