First-Visit MC

First-Visit MC

A Monte Carlo prediction method that averages returns only from the first time a state is visited in each episode. Each return is an independent, identically distributed estimate of .

Converges to by the Law of Large Numbers. Contrast with Every-Visit MC, which uses all visits.

See RL-L03 - Monte Carlo Methods for the full algorithm.