← heapsort
RESEARCH27

Terminus-4B: Can a Smaller Model Replace Frontier LLMs at Agentic Execution Tasks?

arXiv CS.AIΒ·May 6, 2026

This research introduces Terminus-4B, a finetuned small language model, to explore its capability in replacing frontier LLMs for agentic terminal execution tasks. The model is post-trained using Supervised Finetuning and Reinforcement Learning with rubric-based LLM-as-judge rewards.

Read original β†—