RESEARCHarXiv CS.LG·6d ago
Human-in-the-Loop Contextual Bandits for Short-Term Rental Dynamic Pricing: Structural Equivalence of Historical Warm-Up and Approval-Gated Live Learning
This paper introduces the Human-in-the-Loop Gated Bandit (HITL-GB) framework for dynamic pricing in short-term rental markets. It demonstrates that historical pricing data can be structurally equivalent to on-policy warm-up data, significantly reducing the cold-start period for online bandit learning.
27