This guide is still under construction

General Resources

Supervised Learning for RL

Iterative Linear Quadratic Regulator (iLQR)

Guided Policy Search

Trust Region Policy Optimization

  • Full paper
  • CS 294-112 Lecture
  • Blog post on why Hessian of KL Divergence = Fisher Information Matrix

Q-Function Methods

Hierarchical RL/Meta Learning

Inverse Reinforcement Learning

Reward Shaping

Gaussian Processes & Bayesian Optimization