Linear Function Approximation

Linear Function Approximation

The value function is approximated as a linear combination of features:

where is a feature vector and is a weight vector.

Gradient

The gradient is simply the feature vector:

This makes updates simple:

Convergence Guarantee

Why Linear Is Special

Semi-gradient TD(0) with linear FA converges to the TD Fixed Point:

This guarantee does not hold for non-linear (e.g., neural network) approximators.

Feature Construction

The power of linear FA depends entirely on the feature vector . See Feature Construction:

  • Tile Coding: Binary features from overlapping tilings
  • Polynomials:
  • Fourier basis: Cosine functions at different frequencies
  • Radial Basis Functions: Gaussian bumps centered at prototypes
  • One-hot (tabular): Each state gets its own feature → recovers tabular case

Connections

Appears In