RESEARCHarXiv CS.CL·5/8/2026
Chainwash: Multi-Step Rewriting Attacks on Diffusion Language Model Watermarks
This paper investigates multi-step rewriting attacks on diffusion language model watermarks, which are used to verify AI text authorship. The findings show that watermarked texts can have their detection compromised after multiple rewrites by other language models, even those unaware of the watermark key.
27