Google Scholar

Theoretically principled trade-off between robustness and accuracy

H Zhang, Y Yu, J Jiao, E Xing… - International …, 2019 - proceedings.mlr.press

H Zhang, Y Yu, J Jiao, E Xing, L El Ghaoui, M Jordan

International conference on machine learning, 2019•proceedings.mlr.press

Abstract

We identify a trade-off between robustness and accuracy that serves as a guiding principle in the design of defenses against adversarial examples. Although this problem has been widely studied empirically, much remains unknown concerning the theory underlying this trade-off. In this work, we decompose the prediction error for adversarial examples (robust error) as the sum of the natural (classification) error and boundary error, and provide a differentiable upper bound using the theory of classification-calibrated loss, which is shown to be the tightest possible upper bound uniform over all probability distributions and measurable predictors. Inspired by our theoretical analysis, we also design a new defense method, TRADES, to trade adversarial robustness off against accuracy. Our proposed algorithm performs well experimentally in real-world datasets. The methodology is the foundation of our entry to the NeurIPS 2018 Adversarial Vision Challenge in which we won the 1st place out of 2,000 submissions, surpassing the runner-up approach by 11.41% in terms of mean L_2 perturbation distance.

proceedings.mlr.press

Show moreShow less

Save Cite Cited by 2388 Related articles All 9 versions View as HTML

Showing the best result for this search. See all results

Cite

Advanced search

Saved to My library

Theoretically principled trade-off between robustness and accuracy