Terminus-4B: Can a Smaller Model Replace Frontier LLMs at Agentic Execution Tasks?
arXiv CS.AI · May 6, 2026
This paper introduces Terminus-4B, a fine-tuned small language model, and examines whether it can replace frontier LLMs on agentic terminal execution tasks. The model is post-trained with supervised fine-tuning and reinforcement learning, using rubric-based LLM-as-judge rewards.