tool use

21 items

RESEARCH · arXiv CS.AI · 5/4/2026

Are Tools All We Need? Unveiling the Tool-Use Tax in LLM Agents

This research challenges the assumption that tool-augmented reasoning always improves LLM performance. It shows that tool use can underperform native chain-of-thought (CoT) reasoning because the tool-calling protocol itself imposes a "tool-use tax", especially under semantic noise. The authors propose a Factorized Intervention Framework to analyze this effect and introduce G-STEP as a partial mitigation for protocol-induced errors.

ARTICLE · DEV.to AI · 4/18/2026

I thought I had a bug

An AI developer found their model generating action buttons with custom labels such as "Fight Goatman", attached to unrelated existing action types. It turned out not to be a bug: the model had improvised a "quick reply" feature by repurposing the available UI elements.

RESEARCH · arXiv CS.CL · 25d ago

VectraYX-Nano: A 42M-Parameter Spanish Cybersecurity Language Model with Curriculum Learning and Native Tool Use

VectraYX-Nano is a 42M-parameter Spanish language model built for cybersecurity, with a Latin American focus and native tool invocation. The paper details how it was trained from scratch, covering a custom 170M-token Spanish corpus, the Transformer architecture used, and a curriculum learning approach with replay.

RESEARCH · arXiv CS.CL · 27d ago

The Bicameral Model: Bidirectional Hidden-State Coupling Between Parallel Language Models

The Bicameral Model couples two frozen, pretrained language models through a trainable neural interface on their intermediate hidden states, so the two operate in lockstep. A primary model drives the task while an auxiliary model uses tools or solves constraints, which the authors report significantly improves accuracy on tasks such as arithmetic and logic puzzles.
