Unlock: Post-Training Overview

How post-training turns a pretrained language model into a deployable assistant: SFT, preference optimization, safety tuning, verifiable rewards, evaluation gates, and the failure modes each stage introduces.

389 Prerequisites, 0 Mastered, 0 Working, 264 Gaps

Prerequisite mastery: 32%

Recommended probe

Residual Stream and Transformer Internals is your weakest prerequisite with available questions. You haven't been assessed on this topic yet.

Post-Training Overview (TARGET)

Hardware for ML Practitioners (Foundations)

No quiz

Residual Stream and Transformer Internals (Research, WEAKEST)

Not assessed, 1 question

Truth Directions and Linear Probes (Research)

No quiz

Agentic RL and Tool Use (Frontier)

Not assessed, 3 questions

BERT and the Pretrain-Finetune Paradigm (Research)

Not assessed, 3 questions

Policy Optimization: PPO and TRPO (Advanced)

No quiz

RLHF and Alignment (Research)

Not assessed, 3 questions

Test-Time Compute and Search (Frontier)

No quiz

Transformer Architecture (Research)

Not assessed, 11 questions