RESEARCH · arXiv CS.AI · 15d ago
Toward Reliable Design of LLM-Enabled Agentic Workflows: Optimizing Latency-Reliability-Cost Tradeoffs
This paper analyzes the fundamental tradeoffs between latency, reliability, and cost in LLM-enabled agentic workflows. It introduces performance models for both LLM and non-LLM agents and studies the design of sequential workflows, presenting results on token allocation and optimal reliability.