Temporal-Difference Learning: Combining Dynamic Programming and Monte Carlo Methods for Reinforcement Learning
Milestones of RL: Q-Learning and Double Q-Learning

Source: Towards Data Science