This guide is still under construction

General Resources

Supervised Learning for RL

Iterative Linear Quadratic Regulator (iLQR)

Guided Policy Search

  • Trust Region Policy Optimization
    • Full paper
    • CS 294-112 Lecture
    • Blog post on why Hessian of KL Divergence = Fisher Information Matrix

Q-Function Methods

Hierarchical RL/Meta Learning

Inverse Reinforcement Learning

Reward Shaping

Gaussian Processes & Bayesian Optimization