Reinforcement Learning, Part 5: Temporal-Difference Learning
Intelligently synergizing dynamic programming and Monte Carlo algorithms

Source: Towards Data Science