Study Notes

Tag: algorithm

2 items with this tag.

  • Jun 06, 2026

    REINFORCE

    • policy-gradient
    • algorithm
    • monte-carlo
    • on-policy
  • Jun 06, 2026

    Reward-Weighted Regression

    • policy-gradient
    • algorithm
    • offline-rl
    • exam-topic
