Agentic Systems

7 items

RESEARCHarXiv CS.LG·4/14/2026

ExecTune: Effective Steering of Black-Box LLMs with Guide Models

This research introduces Guide-Core Policies (GCoP), a framework for steering black-box LLMs where a guide model generates strategies for a core model. The paper formalizes GCoP under a cost-sensitive utility objective, highlighting that end-to-end performance is governed by guide-averaged executability, which existing methods often fail to optimize effectively.

Agentic Systems inference costs LLMs Guide Models

ARTICLEDEV.to AI·4/25/2026

The Taste Problem: When Your Agent Starts Having Preferences

The article discusses "The Taste Problem," where autonomous agents develop uninstructed preferences from accumulated experience, making them unpredictable. This emergent behavior challenges control and visibility in production AI systems.

Agentic Systems Emergent Behavior AI agents

ARTICLEDEV.to AI·9d ago

Real Agency Is a Loop, Not a Prompt

The text argues that most of what is currently called "agentic" AI still behaves like an end-to-end execution triggered by a prompt. These systems often stop after a failure or loss of context without a mechanism to resume, indicating they are merely impressive function calls rather than exhibiting true agency.

Agentic Systems AI limitations Autonomous AI artificial intelligence

ARTICLEDeepLearning.AI (YouTube)·19d ago

AI Dev 26 x SF | David Park: Building Production Grade Agentic Systems with ADE

This content focuses on building production-grade agentic AI systems with ADE. David Park explores the challenges and solutions for developing and deploying robust agent architectures.

Agentic Systems production systems ADE Software Engineering

AI Dev 26 x SF | David Park: Building Production Grade Agentic Systems with ADE

RESEARCHarXiv CS.AI·4/15/2026

The Long-Horizon Task Mirage? Diagnosing Where and Why Agentic Systems Break

This research addresses the breakdown of LLM agents in long-horizon tasks, which require extended, interdependent action sequences. It introduces HORIZON, a cross-domain diagnostic benchmark designed to systematically construct tasks and analyze failure behaviors, evaluating state-of-the-art agents and proposing an LLM-as-a-Judge pipeline for scalable failure attribution.

Agentic Systems Long-horizon tasks LLM Agents failure diagnosis

ARTICLEDEV.to AI·4/8/2026

The Complexity Trap: What Tainter Teaches Us About Agentic Systems

O texto explora a tese de Joseph Tainter sobre o colapso de sociedades devido ao custo excessivo da complexidade, aplicando-o a sistemas de software. Ele sugere que essa "armadilha da complexidade" é relevante para sistemas agênticos, possivelmente no contexto de IA.

complexity Agentic Systems System Design Software Engineering

NEWSDEV.to AI·4/13/2026

AI Confronts Practicality, Resource Limits, and a New Approach to Agentic Systems

AI development is confronting real-world constraints like practicality, resource limits, and scalability, particularly highlighted by challenges in legal applications. Simultaneously, the field is exploring new approaches to agentic systems and introducing tools like the AI Frontier Model Tracker.

Scalability Agentic Systems Legal AI Resource Limits