Skip to main content

Prerequisite chain

Prerequisites for Policy Optimization: PPO and TRPO

Topics you need before working through Policy Optimization: PPO and TRPO. Direct prerequisites are listed first; transitive prerequisites (the chain reachable through them) follow.

Direct prerequisites (5)

  1. Policy Gradient Theoremlayer 3, tier 1
  2. Actor-Critic Methodslayer 3, tier 2
  3. DDPG: Deep Deterministic Policy Gradientlayer 3, tier 2
  4. Offline Reinforcement Learninglayer 3, tier 2
  5. TD3: Twin Delayed Deep Deterministic Policy Gradientlayer 3, tier 2

Reachable through the chain (257)

These topics are not directly cited as prerequisites but are reached transitively by following the chain upward. Working through the direct prerequisites pulls these in.