Reinforcement Learning
stable-baselines3
Hopper-v3
deep-reinforcement-learning
Hopper-v4
Eval Results (legacy)
How to use qgallouedec/trpo-Hopper-v3-1865505172 with stable-baselines3:

```python
from huggingface_sb3 import load_from_hub

# Downloads the checkpoint from the Hub and returns its local file path.
checkpoint = load_from_hub(
    repo_id="qgallouedec/trpo-Hopper-v3-1865505172",
    filename="{MODEL FILENAME}.zip",
)
```
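`load_from_hub` only returns a local path to the downloaded archive; turning it into a usable agent is a separate step. The helper below is a minimal sketch of that step, assuming the `sb3_contrib` package is installed (TRPO ships there rather than in core stable-baselines3). The function name `load_trpo_checkpoint` is hypothetical, and the checkpoint filename is left as a parameter because the card does not state it.

```python
def load_trpo_checkpoint(repo_id: str, filename: str):
    """Download a checkpoint from the Hub and load it as a TRPO model.

    Hypothetical helper: `filename` must be the actual .zip name in the repo,
    which this card leaves as a placeholder.
    """
    # Imported lazily so that defining the helper does not require the packages.
    from huggingface_sb3 import load_from_hub
    from sb3_contrib import TRPO

    # load_from_hub returns the local path of the downloaded file.
    checkpoint_path = load_from_hub(repo_id=repo_id, filename=filename)
    # TRPO.load restores the policy weights and saved hyperparameters.
    return TRPO.load(checkpoint_path)
```

Because the hyperparameters list `normalize: true`, the policy was presumably trained on normalized observations, so evaluating it on a raw environment without the matching normalization statistics may give misleading returns.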
| Hyperparameter | Value |
|---|---|
| batch_size | 128 |
| cg_damping | 0.1 |
| cg_max_steps | 25 |
| gae_lambda | 0.95 |
| gamma | 0.99 |
| learning_rate | 0.001 |
| n_critic_updates | 20 |
| n_envs | 2 |
| n_steps | 1024 |
| n_timesteps | 1000000.0 |
| normalize | true |
| policy | MlpPolicy |
| sub_sampling_factor | 1 |
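To make the numbers above easier to interpret, the sketch below copies them into a plain dictionary, separates the entries that look like training-script settings from those that match TRPO constructor arguments in `sb3_contrib`, and derives a few quantities from them. The split in `SETUP_KEYS` and the helper names `split_hyperparams` and `rollout_stats` are assumptions made for illustration, not something the card specifies.

```python
# Values copied from the hyperparameter listing above.
hyperparams = {
    "batch_size": 128,
    "cg_damping": 0.1,
    "cg_max_steps": 25,
    "gae_lambda": 0.95,
    "gamma": 0.99,
    "learning_rate": 0.001,
    "n_critic_updates": 20,
    "n_envs": 2,
    "n_steps": 1024,
    "n_timesteps": 1000000.0,
    "normalize": True,
    "policy": "MlpPolicy",
    "sub_sampling_factor": 1,
}

# Assumption: these keys configure the training run (environment count, budget,
# observation normalization, policy class) rather than the TRPO constructor.
SETUP_KEYS = {"n_envs", "n_timesteps", "normalize", "policy"}


def split_hyperparams(params):
    """Split into (training-setup settings, algorithm keyword arguments)."""
    setup = {k: v for k, v in params.items() if k in SETUP_KEYS}
    algo_kwargs = {k: v for k, v in params.items() if k not in SETUP_KEYS}
    return setup, algo_kwargs


def rollout_stats(params):
    """Derive per-rollout quantities from the raw settings."""
    # Each rollout gathers n_steps transitions from every parallel environment.
    rollout_size = params["n_steps"] * params["n_envs"]
    return {
        "rollout_size": rollout_size,
        # Minibatches of size batch_size in one pass over a rollout.
        "minibatches_per_pass": rollout_size // params["batch_size"],
        # Complete rollouts that fit inside the n_timesteps budget.
        "full_rollouts": int(params["n_timesteps"]) // rollout_size,
    }


setup, algo_kwargs = split_hyperparams(hyperparams)
stats = rollout_stats(hyperparams)
print(stats)
```

Under these assumptions each rollout holds 2048 transitions, one pass over it yields 16 minibatches of 128, and 488 complete rollouts fit in the one-million-step budget.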