Neural Network Function Approximation

Neural Network Function Approximation

Using neural networks as non-linear function approximators for value functions or policies: where is a neural network with parameters .

Advantages over Linear FA

  • Automatic feature learning: No manual Feature Construction needed
  • Representational power: Can approximate any continuous function (universal approximation theorem)
  • Handles raw inputs: Can process pixels, text, etc. directly

Challenges in RL

  • No convergence guarantees for Semi-Gradient Methods with non-linear FA
  • Deadly Triad becomes more dangerous — non-linear + bootstrapping + off-policy
  • Non-stationarity: Target values change as policy improves
  • Catastrophic forgetting: Updating for new states can degrade performance on old states

Stabilization techniques: Experience Replay, Target Network (as in Deep Q-Network (DQN))

Appears In