← heapsort-ai

Verification

12 items

RESEARCHarXiv CS.LG·1d ago

When Should an AI Scientist Stop? Verifiable Experiment Steering and Refusal for Autonomous Discovery

This paper introduces CARTOGRAPH, a verification layer for AI scientists that integrates experiment steering, ambiguity closure, and inadequacy detection. It demonstrates superior performance over raw projection methods and successfully identifies and revokes out-of-library pharmacokinetic mechanisms, enhancing autonomous discovery.

46
RESEARCHarXiv CS.AI·6d ago

Toward Pre-Deployment Assurance for Enterprise AI Agents: Ontology-Grounded Simulation and Trust Certification

This paper proposes an ontology-grounded verification framework for enterprise AI agents, addressing the critical gap in pre-deployment assurance. The framework includes an Agent Operational Envelope, an ontology-to-scenario generation pipeline, and a Trust Certificate with machine-verifiable attestations for deployment verdicts.

28
RESEARCHarXiv CS.LG·4/22/2026

The Cost of Relaxation: Evaluating the Error in Convex Neural Network Verification

This paper evaluates the worst-case divergence between original neural networks and their convex relaxations, which are used in verification systems to improve performance at the cost of soundness. The study provides analytical upper and lower bounds for the error, demonstrating it grows exponentially with network depth and linearly with the input's radius.

27
RESEARCHarXiv CS.LG·5/6/2026

Delay, Plateau, or Collapse: Evaluating the Impact of Systematic Verification Error on RLVR

This paper examines the impact of systematic verification errors on Reinforcement Learning with Verifiable Rewards (RLVR), a method used to enhance the reasoning capabilities of large language models. Unlike prior analyses that treated errors as random, this work shows that systematic errors can lead models to learn unwanted behaviors. Experiments on arithmetic tasks reveal that systematic false negatives have similar effects to random noise, while systematic false positives can have more complex impacts.

27