In reinforcement learning, which component provides the agent with feedback and guidance?
Policy network
Reward function
Overlook minor misbehaviors
Impose harsh punishments for any infraction

Cloud Artificial Intelligence and Machine Learning Exercises are loading ...