Study Notes
Search
Search
Dark mode
Light mode
Explorer
Home
❯
Concepts
Folder: Concepts
183 items under this folder.
Mar 20, 2026
Actor-Critic
policy-gradient
actor-critic
exam-topic
Mar 20, 2026
Adam
optimization
Mar 20, 2026
Advantage Function
policy-gradient
actor-critic
value-function
temporal-difference
Mar 20, 2026
Agentic Search
rag
agents
retrieval
reasoning
Mar 20, 2026
Algorithmic Fairness
ir-society
Mar 20, 2026
AlphaGo Zero
deep-rl
planning
exam-topic
Mar 20, 2026
Approximate Nearest Neighbor
neural-ir
vector-search
algorithms
Mar 20, 2026
Autoregressive Retrieval
neural-ir
generative-retrieval
Mar 20, 2026
BERT for IR
neural-ir
Mar 20, 2026
BM25
retrieval-models
key-formula
exam-topic
Mar 20, 2026
Bag of Words
foundations
Mar 20, 2026
Baseline
variance-reduction
policy-gradient
reinforcement-learning
Mar 20, 2026
Belief State
foundations
exam-topic
Mar 20, 2026
Bellman Equation
foundations
key-formula
exam-topic
Mar 20, 2026
Bellman Error
approximation
exam-topic
Mar 20, 2026
Bellman Optimality Equation
foundations
key-formula
Mar 20, 2026
Bi-Encoder
neural-ir
exam-topic
Mar 20, 2026
Bias-Variance Trade-off
foundations
Mar 20, 2026
Binary Independence Model
retrieval-models
Mar 20, 2026
Boolean Retrieval
foundations
Mar 20, 2026
Bootstrapping
foundations
Mar 20, 2026
COIL
neural-ir
learned-sparse-retrieval
Mar 20, 2026
Cascading Position Bias
click-models
position-bias
user-behavior
Mar 20, 2026
Click Models
evaluation
Mar 20, 2026
ColBERT
neural-ir
exam-topic
Mar 20, 2026
Conservative Q-Learning (CQL)
deep-rl
exam-topic
Mar 20, 2026
Contrastive Learning
neural-ir
training
deep-learning
representation-learning
Mar 20, 2026
Convolutional Neural Networks
deep-learning
Mar 20, 2026
Counterfactual Learning to Rank
learning-to-rank
causal-inference
offline-evaluation
Mar 20, 2026
Cranfield Paradigm
evaluation
exam-topic
Mar 20, 2026
Critical Information Theory
ir-society
critical-theory
Mar 20, 2026
Cross-Encoder
neural-ir
exam-topic
Mar 20, 2026
DPR
neural-ir
dense-retrieval
embedding-search
Mar 20, 2026
DSI
neural-ir
generative-retrieval
Mar 20, 2026
Deadly Triad
approximation
exam-topic
key-formula
Mar 20, 2026
Decision Diffuser
deep-rl
offline-rl
generative-models
Mar 20, 2026
Decision Transformer
deep-rl
offline-rl
sequence-modeling
Mar 20, 2026
Deep Q-Network (DQN)
deep-rl
exam-topic
key-formula
Mar 20, 2026
Deep Recurrent Q-Learning
deep-rl
Mar 20, 2026
Deep Reinforcement Learning
deep-rl
Mar 20, 2026
DeepCT
neural-ir
Mar 20, 2026
DeepImpact
neural-ir
Mar 20, 2026
DeepSeek-R1
llm
reasoning
reinforcement-learning
emergent-behavior
Mar 20, 2026
Dense Retrieval
neural-ir
exam-topic
Mar 20, 2026
Deterministic Policy Gradient
policy-gradient
off-policy
continuous-control
actor-critic
Mar 20, 2026
Discount Factor
foundations
Mar 20, 2026
DocT5Query
neural-ir
document-expansion
Mar 20, 2026
Document Expansion
foundations
Mar 20, 2026
Document Identifiers
neural-ir
generative-retrieval
Mar 20, 2026
Doubly Robust Estimation
causal-inference
unbiased-estimation
ranking
Mar 20, 2026
Dyna
planning
exam-topic
Mar 20, 2026
Dynamic Programming
tabular-methods
exam-topic
Mar 20, 2026
Emancipatory IR
ir-society
Mar 20, 2026
Episodic Semi-Gradient Control
approximation
exam-topic
Mar 20, 2026
Epsilon-Greedy Policy
foundations
exam-topic
Mar 20, 2026
Every-Visit MC
tabular-methods
Mar 20, 2026
Examination Hypothesis
click-models
user-behavior
ranking
Mar 20, 2026
Expected SARSA
tabular-methods
key-formula
Mar 20, 2026
Experience Replay
deep-rl
exam-topic
Mar 20, 2026
Explainability
ir-society
Mar 20, 2026
Exploration vs Exploitation
foundations
exam-topic
Mar 20, 2026
Exploring Starts
tabular-methods
Mar 20, 2026
Exposure Fairness
ir-society
Mar 20, 2026
F-Measure
evaluation
key-formula
exam-topic
Mar 20, 2026
Feature Construction
approximation
Mar 20, 2026
First-Visit MC
tabular-methods
Mar 20, 2026
Function Approximation
approximation
exam-topic
Mar 20, 2026
GENRE
neural-ir
entity-retrieval
autoregressive
Mar 20, 2026
GPU Architecture
gpu
efficiency
Mar 20, 2026
GRPO
policy-gradient
deep-rl
llm-training
Mar 20, 2026
Gaussian Policy
policy-gradient
continuous-actions
stochastic
Mar 20, 2026
Generalized Advantage Estimation
advantage-function
temporal-difference
policy-gradient
bias-variance
Mar 20, 2026
Generalized Policy Iteration
foundations
exam-topic
Mar 20, 2026
Generative Retrieval
neural-ir
exam-topic
Mar 20, 2026
Gradient Descent
optimization
Mar 20, 2026
Gradient-TD Methods
approximation
exam-topic
Mar 20, 2026
Hard Negative Mining
neural-ir
training
optimization
Mar 20, 2026
Importance Sampling
tabular-methods
approximation
key-formula
exam-topic
Mar 20, 2026
Information Retrieval
foundations
Mar 20, 2026
Inverse Propensity Weighting
unbiased-estimation
causal-inference
ranking
Mar 20, 2026
Inverted Index
foundations
exam-topic
Mar 20, 2026
Item Selection Bias
bias
ranking
position-bias
Mar 20, 2026
Kernel Fusion
gpu
efficiency
Mar 20, 2026
LSTD
approximation
key-formula
exam-topic
Mar 20, 2026
Language Model for IR
retrieval-models
Mar 20, 2026
Learned Sparse Retrieval
neural-ir
exam-topic
Mar 20, 2026
Learning to Rank
neural-ir
Mar 20, 2026
Linear Function Approximation
approximation
key-formula
exam-topic
Mar 20, 2026
Listwise LTR
ltr
ranking
machine-learning
listwise-loss
neural-ranking
ranking-metrics
Mar 20, 2026
MAP
evaluation
key-formula
exam-topic
Mar 20, 2026
MRR
evaluation
key-formula
Mar 20, 2026
Markov Decision Process
foundations
exam-topic
Mar 20, 2026
Maximum Entropy RL
deep-rl
policy-gradient
Mar 20, 2026
Mean Squared Value Error
approximation
key-formula
exam-topic
Mar 20, 2026
Misinformation
ir-society
Mar 20, 2026
Model of the Environment
foundations
Mar 20, 2026
Model-Based Reinforcement Learning
planning
exam-topic
Mar 20, 2026
MonoBERT
neural-ir
transformer
bert
Mar 20, 2026
Monte Carlo Control
tabular-methods
Mar 20, 2026
Monte Carlo Methods
tabular-methods
exam-topic
Mar 20, 2026
Monte Carlo Tree Search (MCTS)
planning
deep-rl
exam-topic
Mar 20, 2026
Multi-Armed Bandit
foundations
exam-topic
Mar 20, 2026
Multi-Stage Ranking
neural-ir
architecture
efficiency
Mar 20, 2026
NDCG
evaluation
key-formula
exam-topic
Mar 20, 2026
Natural Policy Gradient
policy-gradient
optimization
fisher-information
geometry
Mar 20, 2026
Neural Network Function Approximation
approximation
deep-rl
Mar 20, 2026
Neural Networks
foundations
Mar 20, 2026
Neural Reranking
neural-ir
reranking
deep-learning
Mar 20, 2026
Off-Policy Divergence
approximation
exam-topic
Mar 20, 2026
Off-Policy Learning
foundations
exam-topic
Mar 20, 2026
Offline Reinforcement Learning
deep-rl
Mar 20, 2026
On-Policy Distribution
approximation
Mar 20, 2026
On-Policy Learning
foundations
Mar 20, 2026
On-Policy vs Off-Policy
foundations
exam-topic
Mar 20, 2026
Optimal Policy
foundations
exam-topic
Mar 20, 2026
Optimality and Approximation
approximation
Mar 20, 2026
Optimistic Initial Values
foundations
exam-topic
Mar 20, 2026
Ordinary Least Squares
optimization
Mar 20, 2026
Outlier Bias
bias
click-models
user-behavior
Mar 20, 2026
POMDP
foundations
exam-topic
Mar 20, 2026
PPO
policy-gradient
deep-rl
exam-topic
Mar 20, 2026
Pairwise LTR
ltr
ranking
machine-learning
pairwise-loss
neural-ranking
Mar 20, 2026
Partial Observability
foundations
exam-topic
Mar 20, 2026
Pointwise LTR
ltr
ranking
machine-learning
regression
classification
Mar 20, 2026
Policy Evaluation
tabular-methods
Mar 20, 2026
Policy Gradient Methods
policy-gradient
exam-topic
Mar 20, 2026
Policy Gradient Theorem
policy-gradient
theoretical-foundation
gradient-ascent
Mar 20, 2026
Policy Improvement
tabular-methods
exam-topic
Mar 20, 2026
Policy Iteration
tabular-methods
exam-topic
Mar 20, 2026
Policy
foundations
exam-topic
Mar 20, 2026
Pooling
evaluation
Mar 20, 2026
Position Bias
bias
ranking
user-behavior
Mar 20, 2026
Precision at K
evaluation
key-formula
Mar 20, 2026
Precision
evaluation
key-formula
Mar 20, 2026
Predictive State Representation
foundations
Mar 20, 2026
Product Quantization
neural-ir
efficiency
Mar 20, 2026
Q-Learning
tabular-methods
key-formula
exam-topic
Mar 20, 2026
Query Expansion
neural-ir
Mar 20, 2026
Query Likelihood Model
retrieval-models
key-formula
exam-topic
Mar 20, 2026
REINFORCE
policy-gradient
algorithm
monte-carlo
on-policy
Mar 20, 2026
RMSProp
optimization
Mar 20, 2026
Recall
evaluation
key-formula
Mar 20, 2026
Regularization
foundations
Mar 20, 2026
Reinforcement Learning
foundations
Mar 20, 2026
Reparameterization Trick
deep-rl
optimization
Mar 20, 2026
Retrieval-Augmented Generation
neural-ir
Mar 20, 2026
Return
foundations
key-formula
Mar 20, 2026
Reward Signal
foundations
Mar 20, 2026
Rocchio Algorithm
retrieval-models
relevance-feedback
key-formula
Mar 20, 2026
Rollout Algorithm
planning
Mar 20, 2026
SARSA
tabular-methods
key-formula
exam-topic
Mar 20, 2026
SEARCH-R1
rag
agentic-search
reinforcement-learning
llm
Mar 20, 2026
SPLADE
neural-ir
exam-topic
Mar 20, 2026
Semi-Gradient Methods
approximation
key-formula
exam-topic
Mar 20, 2026
Smoothing
retrieval-models
exam-topic
Mar 20, 2026
Soft Actor-Critic (SAC)
deep-rl
exam-topic
actor-critic
Mar 20, 2026
Softmax Policy
policy-gradient
discrete-actions
stochastic
exploration
Mar 20, 2026
Sparton
gpu
efficiency
Mar 20, 2026
State Space
foundations
Mar 20, 2026
Stemming
foundations
Mar 20, 2026
Stochastic Gradient Descent
optimization
Mar 20, 2026
Stop Words
foundations
Mar 20, 2026
TD Error
tabular-methods
key-formula
Mar 20, 2026
TD Fixed Point
approximation
key-formula
exam-topic
Mar 20, 2026
TD(0)
tabular-methods
key-formula
Mar 20, 2026
TF-IDF
retrieval-models
key-formula
exam-topic
Mar 20, 2026
Tabular RL
foundations
Mar 20, 2026
Target Network
deep-rl
exam-topic
Mar 20, 2026
Temporal Difference Learning
tabular-methods
exam-topic
key-formula
Mar 20, 2026
Term Weighting
foundations
Mar 20, 2026
Tile Coding
approximation
exam-topic
Mar 20, 2026
Tokenization
foundations
Mar 20, 2026
Transformers
foundations
deep-learning
nlp
key-formula
exam-topic
Mar 20, 2026
Triton
gpu
efficiency
Mar 20, 2026
Trust Bias
bias
click-models
user-behavior
Mar 20, 2026
Upper Confidence Bound
foundations
Mar 20, 2026
Value Function
foundations
key-formula
exam-topic
Mar 20, 2026
Value Iteration
tabular-methods
exam-topic
Mar 20, 2026
Vector Space Model
retrieval-models
foundations
Mar 20, 2026
Word Embeddings
foundations
nlp
embeddings
Mar 20, 2026
duoT5
neural-ir
reranking
Mar 20, 2026
monoT5
neural-ir
reranking
Mar 20, 2026
uniCOIL
neural-ir