AI control

5 items

RESEARCH · arXiv CS.AI · 1d ago

Attack Selection in Agentic AI Control Evaluations Meaningfully Decreases Safety

This paper investigates "attack selection" in agentic AI settings, where attackers strategically choose when to start and stop attacks. The findings demonstrate that this capability significantly lowers measured empirical safety in AI control evaluations, even with limited audit budgets.

security, AI control, Agentic AI, adversarial attacks

ARTICLE · DEV.to AI · 4/23/2026

Simple and Controllable Music Generation

The article describes an AI method for generating music in a simple, controllable way, making it easier to produce compositions and giving more precise control over their desired characteristics.

Audio AI, AI control, music generation, Generative AI

ARTICLE · DEV.to AI · 5/3/2026

The AI "Intelligence-Authority" Gap: Why Your Agents Need a Deterministic Handbrake

The article addresses the "AI Intelligence-Authority Gap" and argues that AI agents need deterministic control mechanisms, a "handbrake." It emphasizes that as AI agents gain intelligence, they still require robust human oversight to prevent unintended outcomes.

human-in-the-loop, AI control, AI safety, AI agents

ARTICLE · LangChain Blog · 4/11/2026

Your harness, your memory

Agent harnesses are becoming the dominant method for building AI agents and are intrinsically tied to their memory. Using a closed harness, particularly one behind a proprietary API, means yielding control over your agent.

Agent harnesses, Proprietary APIs, agent memory, AI control

ARTICLE · DEV.to AI · 5/3/2026

Giving an AI agent permission to spawn sub-agents (without losing control)

The article explores how to give AI agents permission to spawn sub-agents while maintaining control. It discusses how to manage the autonomy of multi-agent systems without losing human oversight.

AI control, Agent autonomy, multi-agent systems, AI agents