| tags: | |
| - LunarLander-v2 | |
| - ppo | |
| - deep-reinforcement-learning | |
| - reinforcement-learning | |
| - custom-implementation | |
| - deep-rl-course | |
| model-index: | |
| - name: PPO | |
| results: | |
| - task: | |
| type: reinforcement-learning | |
| name: reinforcement-learning | |
| dataset: | |
| name: LunarLander-v2 | |
| type: LunarLander-v2 | |
| metrics: | |
| - type: mean_reward | |
| value: -158.03 +/- 64.23 | |
| name: mean_reward | |
| verified: false | |
| # PPO Agent Playing LunarLander-v2 | |
| This is a trained model of a PPO agent playing LunarLander-v2. | |
| # Hyperparameters | |