← heapsort-ai

Metrics

13 items

ARTICLEDEV.to AI·4/19/2026

Aprenda avaliar a qualidade do seu agente de AI, RAG e LLM

The author discusses the importance and lack of awareness regarding AI system evaluation (evals) for agents, RAG, and LLMs, explaining that they will present key metrics and frameworks. The article aims to teach how to improve the quality of AI project delivery, combining theory and practice, with a study repository using Openrouter.

33
RESEARCHarXiv CS.AI·19d ago

$ECUAS_n$: A family of metrics for principled evaluation of uncertainty-augmented systems

This research proposes a new family of metrics, $ECUAS_n$, for evaluating uncertainty-augmented (UA) systems in automated decision-making. It argues that existing evaluation approaches are insufficient for assessing overall performance of UA systems, where predictive uncertainty is crucial for users to make informed decisions.

30