Proximal Policy Optimization Head-to-Head
HalfCheetah
Hopper
Walker2d
Swimmer
Ant