What is the objective of a value-based reinforcement learning algorithm?
To find the shortest path between two points.
To learn a policy that maximizes the expected cumulative discounted reward.
Baroque art features strong contrasts, while Rococo art prefers more subtle transitions
Baroque art is generally larger in scale than Rococo art

Reinforcement Learning Exercises are loading ...