ICLR Towards Understanding Clean Generalization and Robust Overfitting in Adversarial Training

Poster
in
Workshop: Bridging the Gap Between Practice and Theory in Deep Learning

Towards Understanding Clean Generalization and Robust Overfitting in Adversarial Training

Binghui Li · Yuanzhi Li

[ Abstract ] [ Project Page ]

[ OpenReview]

Abstract: Similar to surprising performance in the standard deep learning, deep nets trained by adversarial training also generalize well for

unseen clean data (natural data)

$\textit{unseen clean data (natural data)}$ . However, despite adversarial training can achieve low robust training error, there exists a significant

robust generalization gap

$\textit{robust generalization gap}$ . We call this phenomenon the

Clean Generalization and Robust Overfitting (CGRO)

$\textit{Clean Generalization and Robust Overfitting (CGRO)}$ . In this work, we study the CGRO phenomenon in adversarial training from two views:

representation complexity

$\textit{representation complexity}$ and

training dynamics

$\textit{training dynamics}$ . Specifically, we consider a binary classification setting with

N

$N$ separated training data points.

First

$\textit{First}$ , we prove that, based on the assumption that we assume there is

poly (D)

$\operatorname{poly}(D)$ -size clean classifier (where

D

$D$ is the data dimension), ReLU net with only

O (N D)

$O(N D)$ extra parameters is able to leverages robust memorization to achieve the CGRO, while robust classifier still requires exponential representation complexity in worst case.

Next

$\textit{Next}$ , we focus on a structured-data case to analyze training dynamics, where we train a two-layer convolutional network with

O (N D)

$O(N D)$ width against adversarial perturbation. We then show that a three-stage phase transition occurs during learning process and the network provably converges to robust memorization regime, which thereby results in the CGRO.

Besides

$\textit{Besides}$ , we also empirically verify our theoretical analysis by experiments in real-image recognition datasets.

Chat is not available.

Poster in Workshop: Bridging the Gap Between Practice and Theory in Deep Learning

Towards Understanding Clean Generalization and Robust Overfitting in Adversarial Training

Binghui Li · Yuanzhi Li

Poster
in
Workshop: Bridging the Gap Between Practice and Theory in Deep Learning