
Evaluating Deep Agents using LangSmith on AWS

AWS Machine Learning Blog · May 28, 2026

This post is a practical guide to evaluating deep AI agents, combining lessons from LangChain and Anthropic. It shows how to apply evaluation patterns, build offline evaluations with pytest and LangSmith, and configure online monitoring, using a text-to-SQL agent with Amazon Bedrock as the example.
