RESEARCH · arXiv cs.LG · 26d ago
Plan Before You Trade: Inference-Time Optimization for RL Trading Agents
This paper introduces FPILOT, a plug-in inference-time optimization framework for reinforcement learning (RL) trading agents. Before executing a trade, it uses predicted price trajectories to optimize the policy at inference time, and it is compatible with any pre-trained agent.
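To make the idea of "optimizing the policy at inference time" concrete, here is a minimal toy sketch in Python. It is not FPILOT's algorithm, which the summary does not spell out. It assumes a hypothetical two-parameter policy, a hand-supplied predicted price path, and finite-difference gradient ascent as a stand-in optimizer. All names (`position`, `planned_return`, `adapt`) are invented for illustration.

```python
import math
from typing import List, Tuple

# ASSUMPTION: a toy policy with two parameters (weight, bias). A real
# pre-trained RL agent would be a neural network; this is only a stand-in.
Params = Tuple[float, float]


def position(params: Params, last_return: float) -> float:
    """Map the most recent return to a position in [-1, 1] (short to long)."""
    w, b = params
    return math.tanh(w * last_return + b)


def planned_return(params: Params, last_return: float,
                   predicted: List[float]) -> float:
    """Cumulative return of the policy rolled out along a predicted price path."""
    total, prev_ret = 0.0, last_return
    for p_now, p_next in zip(predicted, predicted[1:]):
        step_ret = (p_next - p_now) / p_now
        total += position(params, prev_ret) * step_ret
        prev_ret = step_ret
    return total


def adapt(params: Params, last_return: float, predicted: List[float],
          steps: int = 20, lr: float = 50.0, eps: float = 1e-5) -> Params:
    """Run a few finite-difference gradient-ascent steps on a copy of the
    parameters. The pre-trained parameters passed in are left unchanged."""
    w, b = params
    for _ in range(steps):
        base = planned_return((w, b), last_return, predicted)
        grad_w = (planned_return((w + eps, b), last_return, predicted) - base) / eps
        grad_b = (planned_return((w, b + eps), last_return, predicted) - base) / eps
        w, b = w + lr * grad_w, b + lr * grad_b
    return (w, b)


# Usage: plan on a (hypothetical) predicted path, then act with adapted parameters.
pretrained: Params = (0.0, 0.0)               # stands in for any pre-trained agent
forecast = [100.0, 101.0, 102.0, 103.0]       # hypothetical predicted prices
adapted = adapt(pretrained, last_return=0.0, predicted=forecast)
trade = position(adapted, last_return=0.0)    # position actually executed
```

In this sketch the adaptation is temporary and per-decision: the pre-trained parameters are kept as they are, and only the adapted copy is used for the trade. That is one way to read "plug-in", but it is an assumption of the example, as is the choice of optimizer and objective.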