PPO Agent playing Walker2DBulletEnv-v0

This is a trained PPO agent for the Walker2DBulletEnv-v0 environment, built with the Stable-Baselines3 library.

Usage (with Stable-Baselines3)
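
The card does not include a usage snippet, so the following is only a minimal sketch of how a checkpoint like this is commonly loaded and evaluated. It assumes the `huggingface_sb3` helper package, `pybullet`, and a Stable-Baselines3 release that still uses the classic `gym` API (the Bullet environments register themselves with `gym`). The function name `load_and_evaluate` is hypothetical, and `repo_id` and `filename` are placeholders that must be supplied by the caller because this card does not state them.

```python
def load_and_evaluate(repo_id, filename, n_eval_episodes=10):
    """Download a PPO checkpoint from the Hub and evaluate it on Walker2DBulletEnv-v0.

    repo_id and filename are NOT given in the model card; pass the values
    shown on the model page. Returns (mean_reward, std_reward).
    """
    # Imports are deferred so that this function can be defined even when the
    # optional dependencies are not installed yet.
    import gym
    import pybullet_envs  # noqa: F401  (importing registers the Bullet envs with gym)
    from huggingface_sb3 import load_from_hub
    from stable_baselines3 import PPO
    from stable_baselines3.common.evaluation import evaluate_policy

    # Fetch the saved model archive and get its local path.
    checkpoint_path = load_from_hub(repo_id=repo_id, filename=filename)
    model = PPO.load(checkpoint_path)

    env = gym.make("Walker2DBulletEnv-v0")
    mean_reward, std_reward = evaluate_policy(
        model, env, n_eval_episodes=n_eval_episodes, deterministic=True
    )
    env.close()
    return mean_reward, std_reward
```

If the agent was trained with observation normalization (for example through a `VecNormalize` wrapper), the saved normalization statistics would also have to be loaded to reproduce the reported score; the card does not say whether that is the case.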


Evaluation results

  • mean_reward on Walker2DBulletEnv-v0 (self-reported): 1968.90 +/- 16.24
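
A figure in the form "mean +/- spread" is typically the mean and standard deviation of the total reward over several evaluation episodes. The sketch below shows that arithmetic under the assumption that the spread is a population standard deviation; the card does not say how many episodes were used, and the helper name `summarize_returns` and the sample returns are made up for illustration.

```python
from statistics import mean, pstdev


def summarize_returns(episode_returns):
    """Format per-episode returns as 'mean +/- std' (population std)."""
    return f"{mean(episode_returns):.2f} +/- {pstdev(episode_returns):.2f}"


# Illustrative values only, not the returns behind the reported score.
print(summarize_returns([1.0, 2.0, 3.0]))  # prints "2.00 +/- 0.82"
```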