Reinforcement Learning

RL FAQ

  1. How can the reward chain be shortened, so that the agent does not have to wait until the end of an episode to learn whether its first step was a good one?
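
One common way to approach this question is temporal-difference (TD) learning, where each step is updated from the estimate of the next state instead of from the final outcome. The sketch below is only an illustration under stated assumptions: the chain environment, the function name `td0_chain`, and all parameter values are hypothetical and do not come from this FAQ.

```python
def td0_chain(n_states=5, episodes=200, alpha=0.5, gamma=0.9):
    """TD(0) value estimation on a toy deterministic chain (hypothetical setup).

    States are 0..n_states-1 and the agent always moves one state to the right.
    A reward of 1.0 is given only on the transition into the terminal state.
    """
    # values[n_states] is the terminal state and stays at 0.0.
    values = [0.0] * (n_states + 1)
    for _ in range(episodes):
        state = 0
        while state < n_states:
            next_state = state + 1
            reward = 1.0 if next_state == n_states else 0.0
            # TD(0) update: bootstrap from the current estimate of the next
            # state, applied right after the step, not at the end of the episode.
            td_target = reward + gamma * values[next_state]
            values[state] += alpha * (td_target - values[state])
            state = next_state
    return values


if __name__ == "__main__":
    for s, v in enumerate(td0_chain()):
        print(f"V({s}) = {v:.4f}")
```

Each update happens immediately after a step, so no full return has to be collected first. In this simple one-step form the reward information still moves backwards only one state per episode; multi-step variants such as n-step returns or eligibility traces are usually discussed as ways to speed that up.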