Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Tasks

arXiv CS.AI·April 25, 2026

This paper introduces COSPLAY, a co-evolution framework for improving LLM decision-making in long-horizon interactive environments. An LLM decision agent retrieves skills from a learnable skill bank, while an agent pipeline discovers reusable skills from the decision agent's own unlabeled rollouts and retains them in that bank.
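To make the retrieve-and-retain loop concrete, here is a minimal Python sketch of a skill bank. It is an illustration under stated assumptions, not COSPLAY's implementation: the names `Skill`, `SkillBank`, `retain`, and `retrieve` are hypothetical, and word-overlap (Jaccard) scoring stands in for whatever learned retrieval the paper actually uses.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Skill:
    """A reusable skill; all fields are hypothetical, for illustration only."""
    name: str
    description: str
    uses: int = 0  # how often the decision agent has retrieved this skill


def _tokens(text: str) -> set[str]:
    # Crude whitespace tokenizer; a stand-in for a learned representation.
    return set(text.lower().split())


@dataclass
class SkillBank:
    skills: dict[str, Skill] = field(default_factory=dict)

    def retain(self, name: str, description: str) -> None:
        """Keep a skill discovered from a rollout (first definition wins)."""
        if name not in self.skills:
            self.skills[name] = Skill(name, description)

    def retrieve(self, observation: str, k: int = 1) -> list[Skill]:
        """Return up to k skills ranked by Jaccard overlap with the observation."""
        obs = _tokens(observation)
        scored = []
        for skill in self.skills.values():
            desc = _tokens(skill.description)
            union = obs | desc
            score = len(obs & desc) / len(union) if union else 0.0
            scored.append((score, skill))
        scored.sort(key=lambda pair: pair[0], reverse=True)
        top = [skill for score, skill in scored[:k] if score > 0]
        for skill in top:
            skill.uses += 1  # usage counts could later inform what to keep
        return top


# Assumed example data: two skills "discovered" from earlier rollouts.
bank = SkillBank()
bank.retain("open_door", "find key and open locked door")
bank.retain("craft_tool", "collect wood and craft tool")

hits = bank.retrieve("the door is locked and a key lies nearby")
print([skill.name for skill in hits])
```

In a co-evolving setup, the decision agent would call something like `retrieve` at each step, and a separate pipeline would call something like `retain` after analysing finished rollouts; how COSPLAY scores, updates, or prunes skills is not specified in this summary.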

LLMs · Reinforcement Learning · Skill Discovery · AI Agents