← heapsort-ai

online learning

5 items

RESEARCHarXiv CS.LG·6d ago

Human-in-the-Loop Contextual Bandits for Short-Term Rental Dynamic Pricing: Structural Equivalence of Historical Warm-Up and Approval-Gated Live Learning

This paper introduces the Human-in-the-Loop Gated Bandit (HITL-GB) framework for dynamic pricing in short-term rental markets. It demonstrates that historical pricing data can be structurally equivalent to on-policy warm-up data, significantly reducing the cold-start period for online bandit learning.

27