Reinforcement Learning from scratch

Graphics in 5 Minutes

How does Reinforcement Learning work? A short cartoon that intuitively explains this amazing machine learning approach, and how it was used in AlphaGo and ChatGPT.
Part 1 of 3.

0:00 intro
0:13 pong
0:28 the policy
0:51 policy as neural network
1:32 supervised learning
2:51 reinforcement learning using policy gradient
4:24 minimizing error using gradient descent
4:45 probabilistic policy
5:01 pong from pixels
6:58 visualizing learned weights
8:18 pointer to Karpathy "pong from pixels" blogpost
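
The policy-gradient idea covered in the 2:51 to 4:45 chapters can be sketched in a few lines of Python. This is a minimal illustration, not the code from the video or from Karpathy's blogpost: the one-weight policy, the toy state (+1 if the ball is above the paddle, -1 if below), the reward rule, and the names `train_policy`, `episodes`, and `lr` are all assumptions made for the example.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_policy(episodes=2000, lr=0.1, seed=0):
    """Toy policy gradient (REINFORCE-style) with a single weight.

    Assumed setup: state x is +1 (ball above paddle) or -1 (ball below),
    and the probabilistic policy is P(up | x) = sigmoid(w * x).
    """
    rng = np.random.default_rng(seed)
    w = 0.0  # start with a 50/50 policy
    for _ in range(episodes):
        x = rng.choice([-1.0, 1.0])
        p_up = sigmoid(w * x)
        up = rng.random() < p_up  # sample an action from the policy
        # Assumed reward: +1 for moving toward the ball, -1 otherwise.
        reward = 1.0 if (up == (x > 0)) else -1.0
        # Gradient of log P(chosen action | x) with respect to w.
        grad_logp = (1.0 - p_up) * x if up else -p_up * x
        # Make rewarded actions more likely, punished actions less likely.
        w += lr * reward * grad_logp
    return w

w = train_policy()
```

After training, `sigmoid(w)` is the learned probability of moving up when the ball is above the paddle, and it should end up close to 1. No labeled "correct move" is ever given to the policy; only the reward signal shapes the weight.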

posted by Kramekgg

Reinforcement Learning: AlphaGo

An introduction to Reinforcement Learning

Watching Neural Networks Learn

Q Learning simply explained | SARSA and QLearning Explanation

The BEST QLearning example! | The Mountain Car Problem

A friendly introduction to deep reinforcement learning, Qnetworks and policy gradients

I Built a Neural Network from Scratch

AI Learns to Walk (deep reinforcement learning)

Reinforcement Learning: Machine Learning Meets Control Theory

Neural Networks Explained from Scratch using Python

The Origin of Reinforcement Learning | How AI Learned to Feel

Evolving Genetic Neural Network Optimizes Poly Bridge Problems

Reinforcement Learning, by the Book

Deep Q Learning is Simple with PyTorch | Full Tutorial 2020

AI Learns to Park Deep Reinforcement Learning

MultiAgent Hide and Seek

Why Does Diffusion Work Better than AutoRegression?

The future of AI looks like THIS (& it can learn infinitely)
