Skip to main content

Prerequisite chain

Prerequisites for Mechanistic Interpretability: Features, Circuits, and Causal Faithfulness

Topics you need before working through Mechanistic Interpretability: Features, Circuits, and Causal Faithfulness. Direct prerequisites are listed first; transitive prerequisites (the chain reachable through them) follow.

Direct prerequisites (5)

  1. Transformer Architecturelayer 4, tier 2
  2. Principal Component Analysislayer 1, tier 1
  3. Kolmogorov-Arnold Networks (KANs)layer 4, tier 2
  4. Residual Stream and Transformer Internalslayer 4, tier 2
  5. RLHF and Alignmentlayer 4, tier 2

Reachable through the chain (261)

These topics are not directly cited as prerequisites but are reached transitively by following the chain upward. Working through the direct prerequisites pulls these in.