Nick AI Research

Reinforcement Learning

Simple overview on RL in general : Pong game example

0. RL prelude

1.RL basics

2. Policy

3. Q-learning

4. DQN

RL FAQ

How to reduce the reward chain? (don't wait until the end to understand what is the first step)

Page updated

Google Sites

Report abuse