latest

User Documentation

Introduction
Installation
Algorithms
Running Experiments
Experiment Outputs
Plotting Results

Introduction to RL

Part 1: Key Concepts in RL
Part 2: Kinds of RL Algorithms
Part 3: Intro to Policy Optimization

Resources

Spinning Up as a Deep RL Researcher
Key Papers in Deep RL
Exercises
Benchmarks for Spinning Up Implementations

Algorithms Docs

Vanilla Policy Gradient
Trust Region Policy Optimization
Proximal Policy Optimization
Deep Deterministic Policy Gradient
Twin Delayed DDPG
Soft Actor-Critic

Utilities Docs

Logger
Plotter
MPI Tools
Run Utils

Etc.

Acknowledgements
About the Author

Spinning Up

Docs »
Proximal Policy Optimization Head-to-Head
Edit on GitHub

Proximal Policy Optimization Head-to-Head¶

HalfCheetah¶

../_images/ppo_halfcheetah_performance.svg

Hopper¶

../_images/ppo_hopper_performance.svg

Walker2d¶

../_images/ppo_walker2d_performance.svg

Swimmer¶

../_images/ppo_swimmer_performance.svg

Ant¶

../_images/ppo_ant_performance.svg

© Copyright 2018, OpenAI. Revision 038665d6.

Built with Sphinx using a theme provided by Read the Docs.