| library_name: sample-factory | |
| tags: | |
| - deep-reinforcement-learning | |
| - reinforcement-learning | |
| - sample-factory | |
| model-index: | |
| - name: APPO | |
| results: | |
| - metrics: | |
| - type: mean_reward | |
| value: 9350.13 +/- 1.31 | |
| name: mean_reward | |
| task: | |
| type: reinforcement-learning | |
| name: reinforcement-learning | |
| dataset: | |
| name: mujoco_doublependulum | |
| type: mujoco_doublependulum | |
| A(n) **APPO** model trained on the **mujoco_doublependulum** environment. | |
| This model was trained using Sample Factory 2.0: https://github.com/alex-petrenko/sample-factory | |