Mean reward is the average reward an agent receives over a specified window, such as a fixed number of time steps, a single episode, or a batch of episodes, within a reinforcement learning framework. It is a common evaluation metric because it summarizes in a single number how effectively the agent is learning from interactions with its environment. Tracking mean reward over the course of training also lets practitioners assess the stability and reliability of the learning process: a steadily rising curve suggests consistent improvement, while large swings point to instability. Because the average is taken over many steps rather than a single one, it reflects sustained performance instead of isolated immediate rewards.
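As a rough illustration of the definition, the sketch below computes mean reward in three common ways: per step, per episode, and as a moving average across episodes. The function names (`mean_reward`, `mean_episode_return`, `moving_mean`) and the sample numbers are hypothetical, chosen only for this example, and rewards are assumed to be already logged as plain Python lists.

```python
def mean_reward(rewards):
    """Average of a non-empty sequence of rewards."""
    if not rewards:
        raise ValueError("rewards must be non-empty")
    return sum(rewards) / len(rewards)


def mean_episode_return(episodes):
    """Average total reward per episode, given a list of per-step reward lists."""
    returns = [sum(episode) for episode in episodes]
    return mean_reward(returns)


def moving_mean(values, window):
    """Mean over each full sliding window of the given size."""
    if window <= 0:
        raise ValueError("window must be positive")
    return [
        mean_reward(values[i - window + 1 : i + 1])
        for i in range(window - 1, len(values))
    ]


# Hypothetical logged data: per-step rewards for two episodes.
episodes = [[1.0, 0.0, 1.0], [0.0, 0.0, 2.0, 2.0]]

print(mean_reward([1.0, 0.0, 2.0, 1.0]))        # 1.0 (per-step mean)
print(mean_episode_return(episodes))            # 3.0 (mean of returns 2.0 and 4.0)
print(moving_mean([2.0, 4.0, 6.0, 8.0], 2))     # [3.0, 5.0, 7.0]
```

The moving average is the form usually plotted during training, since averaging over a window of recent episodes smooths out episode-to-episode noise and makes trends in learning easier to see.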