Skip to main content

Prerequisite chain

Prerequisites for Policy Gradient Theorem

Topics you need before working through Policy Gradient Theorem. Direct prerequisites are listed first; transitive prerequisites (the chain reachable through them) follow.

Direct prerequisites (7)

  1. Markov Decision Processeslayer 2, tier 1
  2. Convex Optimization Basicslayer 1, tier 1
  3. Multi-Armed Bandits Theorylayer 2, tier 2
  4. Online Learning and Banditslayer 3, tier 2
  5. Q-Learninglayer 2, tier 1
  6. Temporal Difference Learninglayer 2, tier 2
  7. Value Iteration and Policy Iterationlayer 2, tier 1

Reachable through the chain (249)

These topics are not directly cited as prerequisites but are reached transitively by following the chain upward. Working through the direct prerequisites pulls these in.