Prerequisite chain

Prerequisites for Residual Stream and Transformer Internals

Topics you need before working through Residual Stream and Transformer Internals. Direct prerequisites are listed first; transitive prerequisites (the chain reachable through them) follow.

Direct prerequisites (3)

  1. Transformer Architecture: layer 4, tier 2
  2. Forgetting Transformer (FoX): layer 4, tier 2
  3. Gradient Flow and Vanishing Gradients: layer 2, tier 1

Reachable through the chain (172)

These topics are not directly cited as prerequisites but are reached transitively by following the chain upward. Working through the direct prerequisites pulls these in.