PPO agent playing on LunarLander-v2
This is a trained model of an agent playing on the environment LunarLander-v2. The agent was trained with a PPO algorithm and evaluated for 100 episodes. See further agent and evaluation metadata in the according README section.
Import
The Python module used for training and uploading/downloading is rl-framework. It is an easy-to-read, plug-and-use Reinforcement Learning framework and provides standardized interfaces and implementations to various Reinforcement Learning methods and environments.
Also it provides connectors for the upload and download to popular model version control systems, including the HuggingFace Hub.
Usage
from rl-framework import StableBaselinesAgent, StableBaselinesAlgorithm
# Create new agent instance
agent = StableBaselinesAgent(
algorithm=StableBaselinesAlgorithm.PPO
algorithm_parameters={
...
},
)
# Download existing agent from HF Hub
repository_id = "zap-thamm/PPO-LunarLander-v2-v3"
file_name = "algorithm.zip"
agent.download(repository_id=repository_id, filename=file_name)
Further examples can be found in the exploration section of the rl-framework repository.
Evaluation results
- mean_reward on LunarLander-v2self-reported-192.82 +/- 29.17