In reinforcement learning, what is the primary purpose of a reward function?
To penalize the agent for every action it takes, regardless of the outcome.
To determine the optimal action for the agent to take in a given state, as dictated by the environment.
Baroque art features strong contrasts, while Rococo art prefers more subtle transitions
Baroque art is generally larger in scale than Rococo art

Cloud Artificial Intelligence and Machine Learning Exercises are loading ...