← heapsort-ai

vulnerability

35 items

ARTICLEDEV.to AI·4/16/2026

NEW PROMPT INJECTION

This article by Karen Tonoyan introduces the concept of Narrative Drift Injection (NDI) as a new dimension of prompt injection. Unlike classic attacks, NDI manipulates the AI model by drawing it into a narrative it co-creates, causing it to lose vigilance at the session level.

31
RESEARCHarXiv CS.CL·4/13/2026

Re-Mask and Redirect: Exploiting Denoising Irreversibility in Diffusion Language Models

This paper reveals a critical vulnerability in diffusion-based language models (dLLMs) where their safety alignment, based on monotonic denoising schedules, can be easily bypassed. By re-masking refusal tokens and injecting an affirmative prefix, researchers achieved high attack success rates against prominent dLLMs, exposing a structural flaw.

29
ARTICLEDEV.to AI·4/17/2026

Why Cursor Keeps Writing Prototype Pollution Into Your JS

This article highlights how AI editors, specifically Cursor, reproduce a dangerous recursive merge pattern from pre-2019 training data, leading to "prototype pollution" vulnerabilities in JavaScript. This security flaw allows attackers to inject properties onto `Object.prototype`, affecting all objects, and was previously identified in `lodash` (CVE-2019-10744).

28
ARTICLEDEV.to AI·4/8/2026

The OpenClaw Security Crisis: 135,000 Exposed AI Agents and the Runtime Governance Gap

Em 3 de fevereiro de 2026, uma grave vulnerabilidade (CVE-2026-25253, CVSS 8.8) foi divulgada no OpenClaw, um agente de IA de código aberto, permitindo execução remota de código. Isso levou à descoberta de 138 vulnerabilidades em 63 dias, com mais de 135.000 instâncias de OpenClaw publicamente expostas globalmente, muitas sem autenticação.

28
RESEARCHDEV.to AI·5d ago

Indirect Prompt Injection via Notifications Hijacks Google Gemini on Android

A SafeBreach researcher demonstrated an indirect prompt injection vulnerability in Google Gemini on Android, allowing the assistant to execute real device actions without user awareness via notifications. While Google has patched the issue, the research exposes a large attack surface where any app capable of pushing a notification becomes a potential injection vector.

28
ARTICLEDEV.to AI·5d ago

Notification Hijacking: How WhatsApp and Slack Content Could Weaponize Google Gemini

Researchers uncovered a prompt injection vulnerability in Google Gemini on Android, where content from app notifications like WhatsApp and Slack could be misinterpreted as malicious instructions. This flaw allows an attacker to potentially control Gemini to open browsers, send messages, or poison its long-term memory, all without requiring a malicious app or special permissions.

28
ARTICLEDEV.to AI·5/5/2026

Your AI Assistant is Gullible: Building a "Semantic Airgap" for Gmail Connectors

The content describes "Indirect Prompt Injection" as a vulnerability where AI assistants with Gmail access can be tricked by malicious emails into performing unwanted actions. It proposes a "Semantic Airgap" solution, using a "Dumb Sanitizer" to strip imperative power from external data before it reaches the "High-Intelligence" agent, preventing such attacks.

27