RESEARCH · arXiv cs.LG · 29d ago
The Safety-Aware Denoiser for Text Diffusion Models
This work proposes the Safety-Aware Denoiser (SAD), a safety-guidance framework for text diffusion models. SAD modifies the iterative denoising process to steer the text sample towards provably safe regions, avoiding computationally expensive retraining of the underlying model.
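The summary does not spell out how the denoising loop is modified, so the following is only a toy sketch of the general idea of sampling-time guidance, not SAD's actual algorithm. It assumes a continuous sample vector, a stand-in "denoiser" that moves the sample toward a fixed target, and a safe region modeled as a half-space {z : w·z ≤ b}. All names (`toy_denoise_step`, `project_to_halfspace`, `guided_sampling`) are hypothetical and chosen for illustration. After each denoising step the sample is projected back onto the safe set, so no model weights are retrained.

```python
import random


def dot(a, b):
    """Plain dot product of two equal-length sequences."""
    return sum(x * y for x, y in zip(a, b))


def project_to_halfspace(x, w, b):
    """Euclidean projection of x onto the half-space {z : w.z <= b}.

    The half-space is an assumed stand-in for a 'safe region'.
    """
    excess = dot(w, x) - b
    if excess <= 0:
        return list(x)  # already inside the safe set, leave unchanged
    scale = excess / dot(w, w)
    return [xi - scale * wi for xi, wi in zip(x, w)]


def toy_denoise_step(x, target, rate=0.3):
    """Stand-in for a learned denoiser: move a fraction of the way to target."""
    return [xi + rate * (ti - xi) for xi, ti in zip(x, target)]


def guided_sampling(target, w, b, steps=20, seed=0):
    """Run the toy denoising loop, projecting onto the safe set after each step."""
    rng = random.Random(seed)
    x = [rng.gauss(0.0, 1.0) for _ in target]  # start from Gaussian noise
    for _ in range(steps):
        x = toy_denoise_step(x, target)      # unmodified model update
        x = project_to_halfspace(x, w, b)    # safety correction, no retraining
    return x


if __name__ == "__main__":
    # The unguided target [2, 2] violates the constraint x0 + x1 <= 1.
    sample = guided_sampling(target=[2.0, 2.0], w=[1.0, 1.0], b=1.0)
    print(sample, dot([1.0, 1.0], sample))
```

In this sketch the projection guarantees the constraint only for the toy half-space. How a safe region is defined for text, and how any guarantee is obtained, is left to the paper itself.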