First-Visit MC
First-Visit MC
A Monte Carlo prediction method that averages returns only from the first time a state is visited in each episode. Each return is an independent, identically distributed estimate of .
Converges to by the Law of Large Numbers. Contrast with Every-Visit MC, which uses all visits.
See RL-L03 - Monte Carlo Methods for the full algorithm.