Reinforcement Learning: Temporal Difference - Session 6
Video Link: https://www.youtube.com/watch?v=2TqOmsVH9VQ
Temporal difference: combining Monte Carlo (MC) and Dynamic Programming (DP)
Advantages of TD
No environment model required (vs DP)
Continual updates: values are updated after every step, without waiting for the episode to end (vs MC)
Example: reinforcers
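
To make the outline above concrete, here is a minimal sketch of tabular TD(0) value estimation in Python. It is not taken from the session itself: the seven-state random walk, the function names `td0_update` and `run_random_walk`, and the step size and episode count are all assumptions chosen for illustration. The update bootstraps from the current estimate of the next state (as in DP) but uses only sampled transitions, with no environment model (as in MC), and it is applied after every step rather than at episode end.

```python
import random


def td0_update(V, s, r, s_next, alpha=0.1, gamma=1.0):
    """Apply one TD(0) update to V[s] in place and return the TD error."""
    # Target bootstraps from the current estimate of the next state.
    td_error = r + gamma * V[s_next] - V[s]
    V[s] += alpha * td_error
    return td_error


def run_random_walk(n_episodes=2000, alpha=0.05, gamma=1.0, seed=0):
    """Estimate state values for a hypothetical 7-state random walk.

    States 0 and 6 are terminal. Each step moves left or right with
    equal probability; reaching state 6 gives reward 1, all else 0.
    """
    rng = random.Random(seed)
    # Terminal values are fixed at 0; non-terminal values start at 0.5.
    V = [0.0] + [0.5] * 5 + [0.0]
    for _ in range(n_episodes):
        s = 3  # every episode starts in the middle state
        while s not in (0, 6):
            s_next = s + rng.choice((-1, 1))
            r = 1.0 if s_next == 6 else 0.0
            # Update immediately from this single sampled transition.
            td0_update(V, s, r, s_next, alpha, gamma)
            s = s_next
    return V


if __name__ == "__main__":
    V = run_random_walk()
    print([round(v, 2) for v in V[1:6]])
```

With a constant step size the estimates keep fluctuating around the true values instead of settling exactly, so the printed numbers depend on the seed and on `alpha`.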