
Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Tasks

arXiv CS.AI · April 25, 2026

This paper introduces COSPLAY, a co-evolution framework designed to enhance LLM decision-making in long-horizon interactive environments. It enables an LLM agent to retrieve skills from a learnable skill bank while an agent pipeline discovers and retains reusable skills from its own unlabeled rollouts.
