game theory

7 items

RESEARCH · arXiv cs.LG · 21d ago

A Structural Threshold in Decision Capacity Governs Collapse in Self-Play Reinforcement Learning

This paper shows that a threshold in decision capacity governs collapse in self-play reinforcement learning agents under asymmetric rule perturbations. Eliminating all positive-reach contingent decisions causes rapid convergence to a deterministic exploitation attractor, while preserving even a single such decision prevents this collapse.

RESEARCH · arXiv cs.LG · 14d ago

Truthful Online Preference Aggregation for LLM Fine-Tuning in Mobile Crowdsourcing

This paper investigates truthful online preference aggregation for fine-tuning Large Language Models (LLMs) in mobile crowdsourcing. It models the process as a dynamic Bayesian game and proposes a novel online weighted aggregation mechanism to address strategic misreporting by workers. The goal is to overcome the limitations of existing approaches, which fail to identify the most accurate worker and incur linear regret.
