landscape – Optimization Online

Over-Parameterized Deep Neural Networks Have No Strict Local Minima For Any Continuous Activations

Published: 2018/11/22, Updated: 2018/11/23

Nonlinear Optimization deep neural netwoks, landscape, nonconvex optimization, over-parameterization

In this paper, we study the loss surface of the over-parameterized fully connected deep neural networks. We prove that for any continuous activation functions, the loss function has no bad strict local minimum, both in the regular sense and in the sense of sets. This result holds for any convex and differentiable loss function, and … Read more