Study Notes
Search
Search
Dark mode
Light mode
Explorer
Tag: policy-gradient
14 items with this tag.
Mar 20, 2026
Actor-Critic
policy-gradient
actor-critic
exam-topic
Mar 20, 2026
Advantage Function
policy-gradient
actor-critic
value-function
temporal-difference
Mar 20, 2026
Baseline
variance-reduction
policy-gradient
reinforcement-learning
Mar 20, 2026
Deterministic Policy Gradient
policy-gradient
off-policy
continuous-control
actor-critic
Mar 20, 2026
GRPO
policy-gradient
deep-rl
llm-training
Mar 20, 2026
Gaussian Policy
policy-gradient
continuous-actions
stochastic
Mar 20, 2026
Generalized Advantage Estimation
advantage-function
temporal-difference
policy-gradient
bias-variance
Mar 20, 2026
Maximum Entropy RL
deep-rl
policy-gradient
Mar 20, 2026
Natural Policy Gradient
policy-gradient
optimization
fisher-information
geometry
Mar 20, 2026
PPO
policy-gradient
deep-rl
exam-topic
Mar 20, 2026
Policy Gradient Methods
policy-gradient
exam-topic
Mar 20, 2026
Policy Gradient Theorem
policy-gradient
theoretical-foundation
gradient-ascent
Mar 20, 2026
REINFORCE
policy-gradient
algorithm
monte-carlo
on-policy
Mar 20, 2026
Softmax Policy
policy-gradient
discrete-actions
stochastic
exploration