arXiv 2403.10459

Understanding the Double Descent Phenomenon in Deep Learning

By Marc Lafon and Alexandre Thomas

Published 2024-03-15

Discussion

Read the public discussion and references gathered around this paper.

Combining empirical risk minimization with capacity control is a classical strategy in machine learning when trying to control the generalization gap and avoid overfitting, as the model class capacity gets larger. Yet, in modern deep learning practice, very large over-parameterized models (e.g. neural networks) are optimized to fit perfectly the training data and still obtain great generalization performance. Past t…

View the original paper on arXiv