← heapsort-ai

AI reliability

41 items

ARTICLEDEV.to AI·4/27/2026

Testing AI Systems in Production: From LLM Evals to Agent Reliability

The article criticizes current LLM testing in production, noting that 'smooth' deployments often mask subtle hallucinations leading to financial or data loss due to inadequate truth-based evaluations. It stresses the need for robust retrieval evaluation pipelines, better data, and specific strategies to test AI agents for reliability and prevent destructive failures.

27
ARTICLEDEV.to AI·5/1/2026

LLMs are Listening to How We Ask, Not What We Ask

This article discusses a 2026 paper by Kumaran et al. identifying two critical, asymmetric biases in LLMs: a choice-supportive bias where models gain confidence in their prior answers, and a hypersensitivity to contradiction causing them to over-adjust when challenged. These findings have significant implications for developers building on top of LLMs, influencing how we interact with AI.

27
RESEARCHarXiv CS.CL·29d ago

Can LLMs Take Retrieved Information with a Grain of Salt?

This paper evaluates the ability of large language models (LLMs) to adapt their responses to the certainty of retrieved information, revealing systematic limitations. It proposes an interaction strategy combining prior reminders, certainty recalibration, and context simplification to enhance LLM reliability. This approach reduces obedience errors by 25% without modifying model weights.

27
RESEARCHarXiv CS.AI·28d ago

Where Reliability Lives in Vision-Language Models: A Mechanistic Study of Attention, Hidden States, and Causal Circuits

This research tests the "Attention-Confidence Assumption" in Vision-Language Models (VLMs), finding that attention structure is a near-zero predictor of correctness. The study uses a unified mechanistic pipeline (VLM Reliability Probe) to analyze attention, generation dynamics, and hidden-state geometry in three VLM families.

27
RESEARCHarXiv CS.AI·4/9/2026

SymptomWise: A Deterministic Reasoning Layer for Reliable and Efficient AI Systems

SymptomWise é um framework que aprimora a análise de sintomas por IA, separando a compreensão da linguagem do raciocínio diagnóstico para aumentar a confiabilidade e rastreabilidade. Ele utiliza conhecimento médico especializado e inferência determinística, empregando LLMs apenas para extração de sintomas e explicações, não para o diagnóstico em si.

27