arXiv:2403.10459

Understanding the Double Descent Phenomenon in Deep Learning

By Marc Lafon and Alexandre Thomas

Published 2024-03-15

Wiki summary

Combining empirical risk minimization with capacity control is a classical strategy in machine learning for controlling the generalization gap and avoiding overfitting as the capacity of the model class grows. Yet, in modern deep learning practice, very large over-parameterized models (e.g. neural networks) are optimized to fit the training data perfectly and still achieve strong generalization performance. Past t…
