Policy Evaluation

Policy Evaluation (Prediction)

Computing the state-value function for a given policy . Also called the prediction problem.

Iterative update:

Converges to as . Used as a subroutine in Policy Iteration.

Appears In