Skip to yearly menu bar Skip to main content


Poster

Polylogarithmic width suffices for gradient descent to achieve arbitrarily small test error with shallow ReLU networks

Matus Telgarsky · Ziwei Ji

Abstract

Chat is not available.