Logo
latest

User Documentation

  • Introduction
  • Installation
  • Algorithms
  • Running Experiments
  • Experiment Outputs
  • Plotting Results

Introduction to RL

  • Part 1: Key Concepts in RL
  • Part 2: Kinds of RL Algorithms
  • Part 3: Intro to Policy Optimization

Resources

  • Spinning Up as a Deep RL Researcher
  • Key Papers in Deep RL
  • Exercises
  • Benchmarks for Spinning Up Implementations

Algorithms Docs

  • Vanilla Policy Gradient
  • Trust Region Policy Optimization
  • Proximal Policy Optimization
  • Deep Deterministic Policy Gradient
  • Twin Delayed DDPG
  • Soft Actor-Critic

Utilities Docs

  • Logger
  • Plotter
  • MPI Tools
  • Run Utils

Etc.

  • Acknowledgements
  • About the Author
Spinning Up
  • Docs »
  • Overview: module code

All modules for which code is available

  • spinup.algos.pytorch.ddpg.ddpg
  • spinup.algos.pytorch.ppo.ppo
  • spinup.algos.pytorch.sac.sac
  • spinup.algos.pytorch.td3.td3
  • spinup.algos.pytorch.vpg.vpg
  • spinup.algos.tf1.ddpg.ddpg
  • spinup.algos.tf1.ppo.ppo
  • spinup.algos.tf1.sac.sac
  • spinup.algos.tf1.td3.td3
  • spinup.algos.tf1.trpo.trpo
  • spinup.algos.tf1.vpg.vpg
  • spinup.utils.logx
  • spinup.utils.mpi_pytorch
  • spinup.utils.mpi_tf
  • spinup.utils.mpi_tools
  • spinup.utils.run_utils

© Copyright 2018, OpenAI. Revision 038665d6.

Built with Sphinx using a theme provided by Read the Docs.