Reinforcement Learning: Fundamentals - Session 2
Subscribers:
22,600
Published on ● Video Link: https://www.youtube.com/watch?v=2pazsuyd0Aw
Agent, environment, action, reward, state
Policy, Reward Signal, Value function
k-armed bandit
Epsilon-greedy method