RESEARCH

Plan Before You Trade: Inference-Time Optimization for RL Trading Agents

arXiv cs.LG · May 14, 2026

This paper introduces FPILOT, a plug-in inference-time optimization framework for reinforcement learning trading agents. Before a trade is executed, it uses predicted price trajectories to optimize the agent's policy at inference time. The framework is described as compatible with any pre-trained agent.
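
The summary does not spell out FPILOT's objective or update rule, so the following is only a minimal sketch of the general idea, not the paper's method. It assumes a toy linear policy with a tanh-squashed position, a hypothetical price forecast, and a few gradient-ascent steps on predicted profit with an L2 penalty that keeps the adapted weights close to the pre-trained ones. All function names, parameters, and numbers here are illustrative assumptions.

```python
import math

def position(weights, features):
    """Toy policy (assumed): map features to a position in [-1, 1]."""
    score = sum(w * x for w, x in zip(weights, features))
    return math.tanh(score)

def predicted_pnl(weights, features, predicted_prices):
    """Profit if the position were held along the predicted trajectory."""
    move = predicted_prices[-1] - predicted_prices[0]
    return position(weights, features) * move

def adapt_at_inference(pretrained, features, predicted_prices,
                       steps=5, lr=0.1, anchor=0.1):
    """Gradient ascent on predicted PnL minus an L2 pull toward the
    pre-trained weights. The pre-trained weights are left unmodified."""
    weights = list(pretrained)
    move = predicted_prices[-1] - predicted_prices[0]
    for _ in range(steps):
        pos = position(weights, features)
        # d tanh(s)/ds = 1 - tanh(s)^2, so d pnl / d w_i = grad_scale * x_i
        grad_scale = (1.0 - pos * pos) * move
        weights = [
            w + lr * (grad_scale * x - 2.0 * anchor * (w - w0))
            for w, x, w0 in zip(weights, features, pretrained)
        ]
    return weights

pretrained = [0.2, -0.1]                  # hypothetical pre-trained weights
features = [1.0, 0.5]                     # hypothetical market features
predicted_prices = [100.0, 100.4, 101.0]  # hypothetical forecast

adapted = adapt_at_inference(pretrained, features, predicted_prices)
before = position(pretrained, features)
after = position(adapted, features)
print(f"position before: {before:.3f}, after: {after:.3f}")
```

Because the adaptation works on a copy of the weights, the pre-trained agent is untouched and the same routine could in principle wrap any differentiable policy. That is one way to read the "plug-in" claim, under the assumptions stated above.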

Optimization · financial trading · reinforcement learning · AI in finance · portfolio management
