Skip to yearly menu bar Skip to main content


Poster session A
in
Workshop: ICLR 2025 Workshop on GenAI Watermarking (WMARK)

First-Place Solution to NeurIPS 2024 Invisible Watermark Removal Challenge

Fahad Shamshad · Tameem Bakr · Yahia Shaaban · Noor Hussein · Karthik Nandakumar · Nils Lukas


Abstract:

Content watermarking is as an important tool for the authentication and copyright protection of digital media. However, it is unclear whether existing watermarks are robust against adversarial attacks. We present the \textbf{winning solution} to the NeurIPS 2024 \textit{Erasing the Invisible} challenge, which stress-tests watermark robustness under varying degrees of an adversary's knowledge. The challenge consisted of two tracks: a black-box and beige-box track, depending on whether the adversary knows which watermarking method was used by the provider. For the \textbf{beige-box} track, we leverage an \textit{adaptive} VAE-based evasion attack, with a test-time optimization and color-contrast restoration in CIELAB space to preserve the image's quality. For the \textbf{black-box} track, we first cluster images based on their artifacts in the spatial or frequency-domain. Then, we apply image-to-image diffusion models with controlled noise injection and semantic priors from ChatGPT-generated captions to each cluster with optimized parameter settings. Empirical evaluations demonstrate that our method successfully \textbf{achieves near-perfect watermark removal} (95.7\%) with negligible impact on the residual image's quality. We hope that our attacks inspire the development of more robust image watermarking methods.

Chat is not available.