Do AIs reason or recite?
Explores whether large language models like ChatGPT genuinely reason or merely recite memorized text from their training data, examining the limits of their logical capabilities.
Explores how Stochastic Gradient Descent (SGD) implicitly prefers certain minima, an inductive bias that helps explain why deep learning generalizes better than classical theory predicts.
Explores the paradox of why heavily overparameterized deep neural networks still generalize well, discussing explanations such as Occam's Razor and the Lottery Ticket Hypothesis.
Highlights from a deep learning conference, covering how optimization algorithms affect generalization and how human-in-the-loop methods improve efficiency.