← heapsort
RESEARCH29

One Word at a Time: Incremental Completion Decomposition Breaks LLM Safety

arXiv CS.CLΒ·April 30, 2026

This research introduces Incremental Completion Decomposition (ICD), a novel jailbreak strategy that exploits weaknesses in LLM safety mechanisms by eliciting sequences of single-word continuations. ICD demonstrates superior Attack Success Rate (ASR) on various benchmarks compared to existing methods, providing theoretical and mechanistic evidence for its effectiveness.

Read original β†—