Unlock: Policy Representations

How to parameterize policies in reinforcement learning: categorical for discrete actions, Gaussian for continuous actions, and why the choice affects gradient variance and exploration.
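As a rough illustration of the two parameterizations named above, the following minimal sketch samples an action and computes its log-probability for a categorical policy (from logits) and a one-dimensional Gaussian policy (from a mean and log standard deviation). All function names are hypothetical and chosen only for this example; a real implementation would normally take the logits, mean, and log standard deviation from a neural network rather than from fixed numbers.

```python
import math
import random


def softmax(logits):
    """Turn unnormalized logits into probabilities (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def categorical_sample(logits, rng):
    """Discrete actions: sample an index and return (action, log_prob)."""
    probs = softmax(logits)
    action = rng.choices(range(len(probs)), weights=probs, k=1)[0]
    return action, math.log(probs[action])


def gaussian_log_prob(action, mean, log_std):
    """Log-density of a 1-D Gaussian with the given mean and log std."""
    std = math.exp(log_std)
    z = (action - mean) / std
    return -0.5 * z * z - log_std - 0.5 * math.log(2.0 * math.pi)


def gaussian_sample(mean, log_std, rng):
    """Continuous actions: sample a real value and return (action, log_prob)."""
    action = rng.gauss(mean, math.exp(log_std))
    return action, gaussian_log_prob(action, mean, log_std)


rng = random.Random(0)  # fixed seed, an assumption made for repeatability
a_disc, lp_disc = categorical_sample([2.0, 0.5, -1.0], rng)
a_cont, lp_cont = gaussian_sample(0.0, -0.5, rng)
```

In this sketch, the Gaussian policy's `log_std` sets how widely sampled actions spread around the mean, which is one way the parameterization shapes exploration; for the categorical policy, the spread of the logits plays a similar role.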

251 Prerequisites | 0 Mastered | 0 Working | 192 Gaps

Prerequisite mastery: 24%

Recommended probe

Natural Language Processing Foundations is your weakest prerequisite with available questions. You haven't been assessed on this topic yet.

Policy Representations (Target)

Natural Language Processing Foundations (Core, Weakest)

Not assessed, 5 questions

Markov Decision Processes (Core)

Not assessed, 3 questions