RESEARCH27

Chainwash: Multi-Step Rewriting Attacks on Diffusion Language Model Watermarks

arXiv CS.CL·May 8, 2026

This paper investigates multi-step rewriting attacks on diffusion language model watermarks, which are used to verify AI text authorship. The findings show that watermarked texts can have their detection compromised after multiple rewrites by other language models, even those unaware of the watermark key.

Diffusion Models language models AI watermarking security text generation

Read original ↗