AI security

72 items

ARTICLEDEV.to AI·4/14/2026

We Built an MCP Security Scanner — Here's What We Found Scanning 50+ Servers

A security scan of over 50 open-source MCP servers revealed that 72% had critical or high-severity vulnerabilities, including hardcoded API keys and insecure command execution. This highlights a significant security gap in MCP servers, which are increasingly used by AI assistants but often built without proper defense mechanisms.

Hardcoded Secrets MCP vulnerability scanning Input Validation

ARTICLEDEV.to AI·4/14/2026

State of OpenClaw Security 2026: 7 Risks Defining Safe...

This content analyzes the state of OpenClaw security in 2026, identifying deployment hygiene failures and prompt injection as the main risks. It suggests blast-radius reduction for prompt injection and emphasizes the importance of audits and hardening configurations.

OpenClaw cybersecurity ML Security prompt injection

DOCDEV.to AI·10h ago

How can my AI agent pay for stuff on its own?

AI agents can decide what to buy but often lack the authority to pay for it independently. This content explores practical methods to grant AI agents payment authority, balancing functionality with security to prevent unauthorized charges.

Payment authority Automated payments Programmable payments AI security

RESEARCHarXiv CS.CL·13h ago

Can Multi-Agent LLMs Identify Their Peers? Stylometric Fingerprinting in Role-Constrained Political Analysis

This paper systematically investigates whether multi-agent LLMs can identify the original model family behind political analysis texts, even when prompt-level anonymization is applied. It evaluates three classifier approaches (LLM zero-shot, few-shot, and fine-tuned T5-base) on a five-class attribution task to assess the sufficiency of anonymization as a mitigation for peer-preservation bias.

LLMs Machine Learning Natural Language Processing AI security

NEWS↑ trendingHacker News (AI)·5d ago

ZEC drops 30% after Anthropic AI finds Zcash counterfeit vulnerability

ZEC dropped 30% after Anthropic AI discovered a counterfeit vulnerability in Zcash. This finding significantly impacted the cryptocurrency's value.

Blockchain cryptocurrency vulnerability security

ARTICLE↑ trendingReddit r/MachineLearning·4/20/2026

Runtime security for AI agents: risk scoring, policy enforcement, and rollback for production agent pipeline [P]

This content introduces a system for runtime security of AI agents, designed to prevent unintended actions, PII leaks, and infinite loops in production. It employs real-time risk scoring across five dimensions (action type, resource sensitivity, blast radius, frequency, and context deviation), alongside policy enforcement and rollback capabilities.

risk management AI security AI agents

Runtime security for AI agents: risk scoring, policy enforcement, and rollback for production agent pipeline [P]

NEWS↑ trendingReddit r/LocalLLaMA·4/9/2026

Local (small) LLMs found the same vulnerabilities as Mythos

Pequenos Modelos de Linguagem Grandes (LLMs) descobriram as mesmas vulnerabilidades que o sistema Mythos. Este achado sugere que modelos menores podem replicar descobertas críticas de segurança em sistemas de IA.

LLMs Mythos vulnerabilities AI security

NEWSDEV.to AI·4/19/2026

Trend Micro Launches TrendAI Governance Gateway for OpenClaw Agents

Trend Micro announced the TrendAI Agentic Governance Gateway at RSAC 2026, a platform designed to provide visibility and control over autonomous AI agent operations. It features real-time observation, context analysis, policy enforcement, human oversight, and pre-deployment simulation.

autonomous agents AI security AI Governance

ARTICLEDEV.to AI·4/19/2026

How to Safely Execute LLM Commands in Production Systems

This article discusses the critical risks of LLM agents triggering backend actions in production systems, emphasizing that treating raw model output as executable instructions is dangerous. It frames the challenge as an interface problem, advocating for deterministic boundaries to validate, reject, and audit LLM-generated commands for safety.

LLM agents Production Systems AI safety AI security

ARTICLEDEV.to AI·4/16/2026

NEW PROMPT INJECTION

This article by Karen Tonoyan introduces the concept of Narrative Drift Injection (NDI) as a new dimension of prompt injection. Unlike classic attacks, NDI manipulates the AI model by drawing it into a narrative it co-creates, causing it to lose vigilance at the session level.

vulnerability prompt injection AI security

ARTICLEDEV.to AI·4/15/2026

3 Prototype Pollution Bugs Cursor Keeps Writing Into Your Code

AI editors like Cursor generate vulnerable deep-merge and object-spread patterns, leading to prototype pollution bugs. Attackers can exploit these flaws by injecting `proto` properties to override object defaults and bypass authentication.

Software Security JavaScript Prototype Pollution AI code generation

ARTICLEDEV.to AI·4/15/2026

OpenAI's Promptfoo deal puts evaluation and red-teaming at the centre of the agent stack

OpenAI's acquisition of Promptfoo signals a crucial shift in judging AI agent quality, moving beyond mere fluency to comprehensive testing, documentation, and governance of failures before deployment. This addresses critical operational risks like prompt injection and tool misuse, ensuring robustness in production systems.

red-teaming LLM agents evaluation prompt injection

ARTICLEDEV.to AI·4/11/2026

Cryptographic Proof of Agent-to-Agent Handoffs in Python

Version 0.6.1 of the `air-trust` library introduces cryptographic proofs (Ed25519 signatures) for agent-to-agent data handoffs in Python multi-agent AI systems. This feature addresses auditing and security concerns, ensuring data authenticity and agent accountability in AI pipelines.

multi-agent AI audit trail Python Cryptographic Proof

ARTICLEDEV.to AI·4/18/2026

Zero Token Architecture: Why Your AI Agent Should Never See Your Real API Key

This article criticizes conventional AI agent security for overlooking the risk of plaintext API key exposure. It proposes a "Zero Token Architecture" where agents receive a fake token, and the real key is swapped at the system boundary to prevent leaks via prompt injection.

API security prompt injection AI security AI agents

ARTICLEDEV.to AI·4/12/2026

Six bugs that only appeared after real users installed my React security library

The author developed the React FieldShield library to protect sensitive inputs from session recorders and AI screen readers by isolating values in a Web Worker. The article details six bugs that only appeared after real users installed it, highlighting challenges in data security.

web-development bugs privacy ReAct

ARTICLEDEV.to AI·4/17/2026

Why Cursor Keeps Writing Prototype Pollution Into Your JS

This article highlights how AI editors, specifically Cursor, reproduce a dangerous recursive merge pattern from pre-2019 training data, leading to "prototype pollution" vulnerabilities in JavaScript. This security flaw allows attackers to inject properties onto `Object.prototype`, affecting all objects, and was previously identified in `lodash` (CVE-2019-10744).

AI models software development vulnerability JavaScript

ARTICLEDEV.to AI·4/8/2026

The OpenClaw Security Crisis: 135,000 Exposed AI Agents and the Runtime Governance Gap

Em 3 de fevereiro de 2026, uma grave vulnerabilidade (CVE-2026-25253, CVSS 8.8) foi divulgada no OpenClaw, um agente de IA de código aberto, permitindo execução remota de código. Isso levou à descoberta de 138 vulnerabilidades em 63 dias, com mais de 135.000 instâncias de OpenClaw publicamente expostas globalmente, muitas sem autenticação.

vulnerability cybersecurity Open Source AI AI security

ARTICLEDEV.to AI·4/17/2026

The Prompt-Injection Bug That Took Down My Agent for 6 Hours

The author describes a 6-hour outage of their AI content agent caused by an indirect prompt injection bug originating from an unvalidated research file. This led to the agent generating 47 identical, unfinished drafts, highlighting the critical need for input validation in AI systems.

LLM vulnerabilities prompt injection AI security AI agents

ARTICLEDEV.to AI·4/15/2026

A Complete Guide to Securing AI-Generated Code: From Pre-LLM Sanitization to AI-Native SAST (2026)

This article analyzes the security risks associated with AI coding assistants, such as GitHub Copilot, highlighting two main directions: the generation of code with security flaws and the exposure of sensitive data (API keys, PII) when developers paste their code into AI tools. It notes that while most security teams address the former, few have a plan for the data leakage inherent in the latter.

data leakage code security Software Development Security AI coding assistants

ARTICLEDEV.to AI·4/16/2026

Securing AI Agents: A Practical Guide for IT Leaders

This article offers a practical guide for IT leaders on securing AI agents, addressing immediate operational requirements. It highlights the unique challenges of AI agent security compared to traditional applications due to their unpredictable behavior.

cybersecurity AI security AI agents