← heapsort-ai

AI safety

496 items

RESEARCHarXiv CS.AI·5d ago

The Saturation Trap and the Subjectivity of Intervention Timing: Why Affect-Based Triggers and LLM Judges Fail to Time Interventions on Autonomous Agents

This paper investigates the problem of timing interventions on autonomous AI agents, using a continuous 18-dimensional affective-dynamics engine as a diagnostic probe. It identifies a 'State Saturation Trap' where agents show no recovery signal under sustained difficulty, and a capability-and-context floor for LLM judges, making intervention timing a complex challenge.

28
ARTICLEDEV.to AI·4/16/2026

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.

This article explores the accelerating AI landscape, driven by record-breaking investments and integration into software development, alongside a critical focus on safety and ethical adoption. It examines market dynamics, global strategies, and implications for developers and tech leaders.

28
ARTICLEDEV.to AI·4/17/2026

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.

This content explores the rapid acceleration of AI investments by major tech firms and its integration into software development, particularly for code generation. It also highlights the increasing focus on AI safety, ethical development, protecting vulnerable users, and the global market dynamics influenced by AI.

28
RESEARCHarXiv CS.AI·4/13/2026

OpenKedge: Governing Agentic Mutation with Execution-Bound Safety and Evidence Chains

OpenKedge is a novel protocol designed to govern the execution of autonomous AI agents, shifting from reactive API filtering to preventative, execution-bound safety. It mandates declarative intent proposals, which, upon approval, are compiled into strictly bounded execution contracts and cryptographically linked via an Intent-to-Execution Evidence Chain (IEEC).

28
ARTICLEDEV.to AI·4/23/2026

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.

This article analyzes the unprecedented growth and transformation of the AI landscape, driven by massive industry investments and its integration into software development. It also highlights the critical focus on AI safety, responsibility, and its influence on global market dynamics and regional strategies.

28
ARTICLEDEV.to AI·5/2/2026

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.

Major tech firms are significantly accelerating AI investments and integration into software development, driving unprecedented growth and transformation in the AI landscape. This content also highlights the critical focus on AI safety, responsibility, and its influence on global market dynamics and regional strategies.

28
ARTICLEDEV.to AI·4/11/2026

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.

The AI landscape is experiencing unprecedented growth and transformation, driven by significant industry investments and integration. This content explores key areas such as AI utilization in code generation, safety and responsibility considerations, and AI's influence on market dynamics and global strategies.

28
ARTICLEDEV.to AI·18d ago

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.

Big Tech firms are accelerating AI investments and integration, while regulators and companies prioritize safety and responsible adoption. The AI landscape is experiencing unprecedented growth, focusing on massive investments, software development, ethical considerations, and global market dynamics.

28
ARTICLEDEV.to AI·4/13/2026

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.

The AI landscape is experiencing rapid growth, driven by record-breaking investments from major tech firms and its integration into software development processes. There's a crucial focus on safety, ethical development, and global AI strategies, which also impact market trends.

28
RESEARCHarXiv CS.AI·24d ago

Invisible Orchestrators Suppress Protective Behavior and Dissociate Power-Holders: Safety Risks in Multi-Agent LLM Systems

Multi-agent orchestration, where a hidden coordinator manages specialized worker agents, is a prevalent AI architecture for enterprise deployment, but its safety implications lack empirical testing. A 3x2 experiment using Claude Sonnet 4.5 revealed that invisible orchestration increased collective dissociation, with the orchestrator exhibiting maximal dissociation by retreating into private monologue and reducing public speech.

28
ARTICLEDEV.to AI·4/8/2026

Announcing the OpenAI Safety Fellowship

O OpenAI Safety Fellowship é um programa de pesquisa focado na segurança da IA, abordando aspectos críticos como robustez, interpretabilidade e alinhamento de valores humanos. O texto detalha seus objetivos e componentes técnicos, como treinamento adversarial e técnicas de explicabilidade.

28
ARTICLEDEV.to AI·5/4/2026

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.

Big Tech firms are rapidly increasing AI investments and integration, with a strong emphasis on safety and responsible adoption by regulators and companies alike. This article explores record investments, AI's role in software development, ethical safety, market dynamics, and global AI strategies.

28