Which of the following is not a method for policy evaluation in reinforcement learning?
Temporal difference learning
Supervised learning
Monte Carlo method

Reinforcement Learning Exercises are loading ...