KAUSTUBH SRIDHAR

Updated 629 days ago
  • ID: 50431017/7
Adversarial training (AT) and its variants have spearheaded progress in improving neural network robustness to adversarial perturbations and common corruptions in the last few years. Algorithm design for AT and its variants has focused on training models at a specified perturbation strength ϵ and using only the feedback from the performance of that ϵ-robust model to improve the algorithm. In this work, we focus instead on models trained across a spectrum of ϵ values. We analyze three perspectives: model performance, intermediate feature precision, and convolution filter sensitivity. In each, we identify alternative improvements to AT that would not have been apparent at a single ϵ. Specifically, we find that for a PGD attack at some strength δ, there is an AT model at some slightly larger strength ϵ, but no greater, that generalizes best to it. Hence, we propose overdesigning for robustness, where we suggest training models at an ϵ just above δ. Second, we observe (across various ϵ values)…
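The abstract refers to standard PGD-based adversarial training at a fixed perturbation strength ϵ. Below is a minimal PyTorch sketch of that setup for context; it is a generic PGD-AT loop, not the paper's implementation, and the hyperparameter names and step-size choice (eps, alpha, steps) are illustrative assumptions.

```python
# Generic L-infinity PGD adversarial-training sketch (not the authors' code).
import torch
import torch.nn.functional as F

def pgd_attack(model, x, y, eps, alpha, steps):
    """Craft PGD adversarial examples within an L-infinity ball of radius eps."""
    x_adv = (x + torch.empty_like(x).uniform_(-eps, eps)).clamp(0, 1).detach()
    for _ in range(steps):
        x_adv.requires_grad_(True)
        loss = F.cross_entropy(model(x_adv), y)
        grad = torch.autograd.grad(loss, x_adv)[0]
        x_adv = x_adv.detach() + alpha * grad.sign()           # ascend the loss
        x_adv = torch.min(torch.max(x_adv, x - eps), x + eps)  # project into the eps-ball
        x_adv = x_adv.clamp(0, 1)                               # keep valid pixel range
    return x_adv.detach()

def adversarial_training_step(model, optimizer, x, y, eps, steps=10):
    """One AT step: train on PGD examples crafted at strength eps.
    Training at an eps slightly above the attack strength delta is the
    'overdesigning for robustness' idea described in the abstract."""
    alpha = 2.5 * eps / steps  # common heuristic step size (assumption)
    x_adv = pgd_attack(model, x, y, eps, alpha, steps)
    optimizer.zero_grad()
    loss = F.cross_entropy(model(x_adv), y)
    loss.backward()
    optimizer.step()
    return loss.item()
```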
  • Interest Score: 1
  • HIT Score: 0.00
  • Domain: kaustubhsridhar.github.io
  • Actual: kaustubhsridhar.github.io
  • IP: 185.199.108.153, 185.199.109.153, 185.199.110.153, 185.199.111.153
  • Status: OK
  • Category: Company