RESEARCH27
Chainwash: Multi-Step Rewriting Attacks on Diffusion Language Model Watermarks
arXiv CS.CLΒ·May 8, 2026
This paper investigates multi-step rewriting attacks on diffusion language model watermarks, which are used to verify AI text authorship. The findings show that watermarked texts can have their detection compromised after multiple rewrites by other language models, even those unaware of the watermark key.
Read original β