Reinforcement Learning: Temporal Difference - Session 6

Published on ● Video Link: https://www.youtube.com/watch?v=2TqOmsVH9VQ



Duration: 0:00
300 views
3


Temporal difference: combining Monte Carlo (MC) and Dynamic Programming (DP)
Advantages of TD
No environment model required (vs DP)
Continual updates (vs MC)
Example: reinforcers