It was never so easy to get YouTube subscribers
Get Free YouTube Subscribers, Views and Likes

Physics of AI

Follow
Sebastien Bubeck

We propose an approach to the science of deep learning that roughly follows what physicists do to understand reality: (1) explore phenomena through controlled experiments, and (2) build theories based on toy mathematical models and nonfully rigorous mathematical reasoning. I illustrate (1) with the LEGO study (LEGO stands for Learning Equality and Group Operations), where we observe how transformers learn to solve simple linear systems of equations. I will also briefly illustrate (2) with an analysis of the emergence of threshold units when training a twolayers neural network to solve a simple sparse coding problem. The latter analysis connects to the recently discovered Edge of Stability phenomenon.

(1) https://arxiv.org/abs/2206.04301
(2) https://arxiv.org/abs/2212.07469

posted by buidaradaey