← heapsort-ai

robustness

14 items

RESEARCHarXiv CS.LG·4/13/2026

Robust Reasoning Benchmark

This study proposes a new perturbation pipeline to evaluate the robustness of LLM reasoning, applying it to the AIME 2024 dataset. While frontier models show resilience, open-weight models suffer catastrophic accuracy drops, exposing structural fragility and potential issues with working memory or mechanical parsing.

30
ARTICLEDEV.to AI·4/8/2026

Announcing the OpenAI Safety Fellowship

O OpenAI Safety Fellowship é um programa de pesquisa focado na segurança da IA, abordando aspectos críticos como robustez, interpretabilidade e alinhamento de valores humanos. O texto detalha seus objetivos e componentes técnicos, como treinamento adversarial e técnicas de explicabilidade.

28
RESEARCHarXiv CS.LG·4/22/2026

The Cost of Relaxation: Evaluating the Error in Convex Neural Network Verification

This paper evaluates the worst-case divergence between original neural networks and their convex relaxations, which are used in verification systems to improve performance at the cost of soundness. The study provides analytical upper and lower bounds for the error, demonstrating it grows exponentially with network depth and linearly with the input's radius.

27
RESEARCHarXiv CS.LG·17d ago

Double descent for least-squares interpolation on contaminated data: A simulation study

This research investigates the "double descent" phenomenon in overparametrized models, which allows for improved generalization despite classical overfitting concerns. The study specifically explores this effect in linear regression with contaminated training data, finding that significant overparametrization enables double descent even in such robust settings.

27
RESEARCHarXiv CS.CL·7d ago

A Multi-Domain Red Teaming Framework for Safety, Robustness, and Fairness Evaluation of Medical Large Language Models

A new multi-domain red teaming framework was developed to evaluate the safety, robustness, and fairness of medical Large Language Models (LLMs) across 690 clinically grounded scenarios. The research revealed substantial performance variance and critical failures in safety-critical scenarios, even in high-performing systems.

27
RESEARCHarXiv CS.AI·7d ago

Position Paper: Post-Solve Robustness in Decision Engines: Feasible Regions and Smoothness Under Perturbations

This paper introduces a missing layer in optimization pipelines to address the post-solve robustness gap in Mixed-Integer Linear Programming (MILP) decision engines. It formalizes an epsilon-near-optimal feasible neighborhood and solution smoothness to assess how far a solved incumbent can be trusted under parameter perturbations.

27
RESEARCHarXiv CS.LG·4/8/2026

Learning Stable Predictors from Weak Supervision under Distribution Shift

Este artigo de pesquisa formaliza o 'supervision drift' em experimentos CRISPR-Cas13d, analisando a robustez de modelos sob shift de distribuição, inclusive quando o mecanismo de supervisão muda. Utilizando um benchmark não-IID, demonstra bom desempenho in-domain, mas falha na transferência temporal e apenas sucesso parcial na transferência entre linhagens celulares.

27
RESEARCHarXiv CS.AI·5/6/2026

Stable Agentic Control: Tool-Mediated LLM Architecture for Autonomous Cyber Defense

The paper introduces a tool-mediated LLM architecture for autonomous cyber defense, designed to provide formal guarantees for high-stakes decision-making under adversarial pressure. It certifies controllability, observability, and Input-to-State Stability (ISS) robustness through a machine-checked Lyapunov function, demonstrating its effectiveness on real enterprise attack graphs.

27