Global Optimality in Neural Network Training

Benjamin D. Haeffele, René Vidal
The past few years have seen a dramatic increase in the performance of recognition systems thanks to the introduction of deep networks for representation learning. However, the mathematical reasons for this success remain elusive. A key issue is that the neural network training problem is nonconvex, hence optimization algorithms may not return a global minimum. This paper provides sufficient conditions to guarantee that local minima are globally optimal and that a local descent strategy can reach a global minimum from any initialization. Our conditions require both the network output and the regularization to be positively homogeneous functions of the network parameters, with the regularization being designed to control the network size. Our results apply to networks with one hidden layer, where size is measured by the number of neurons in the hidden layer, and to networks with multiple deep subnetworks connected in parallel, where size is measured by the number of subnetworks.
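To make the key hypothesis concrete, here is a brief sketch of positive homogeneity; the notation below is illustrative and not taken verbatim from the paper:

% A function \Phi of the network parameters W^1, ..., W^K is positively
% homogeneous of degree p if scaling every parameter by \alpha >= 0
% scales the output by \alpha^p:
\Phi(\alpha W^1, \ldots, \alpha W^K) = \alpha^p \, \Phi(W^1, \ldots, W^K), \qquad \forall \alpha \ge 0.

% Illustrative example (assumed notation): a one-hidden-layer ReLU network
%   \Phi(W^1, W^2) = W^2 \max(W^1 X, 0)
% satisfies this with p = 2, since the ReLU nonlinearity is itself
% positively homogeneous of degree 1. A regularizer of matching degree,
% e.g. \Theta(W^1, W^2) = \|W^1\|_F^2 + \|W^2\|_F^2, is likewise
% positively homogeneous of degree 2 in (W^1, W^2).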
