PPO Agent for LunarLander-v2
This is a trained model of a PPO agent for LunarLander-v2. It was trained as part of the Hugging Face Deep RL Course.
Evaluation results
- mean_reward on LunarLander-v2self-reported250.000
This is a trained model of a PPO agent for LunarLander-v2. It was trained as part of the Hugging Face Deep RL Course.