YouTube doesn't want you know this subscribers secret
Get Free YouTube Subscribers, Views and Likes

Reinforcement Learning from scratch

Follow
Graphics in 5 Minutes

How does Reinforcement Learning work? A short cartoon that intuitively explains this amazing machine learning approach, and how it was used in AlphaGo and ChatGPT.
Part 1 of 3.

0:00 intro
0:13 pong
0:28 the policy
0:51 policy as neural network
1:32 supervised learning
2:51 reinforcement learning using policy gradient
4:24 minimizing error using gradient descent
4:45 probabilistic policy
5:01 pong from pixels
6:58 visualizing learned weights
8:18 pointer to Karpathy "pong from pixels" blogpost

posted by Kramekgg