Study Notes
Folder: Concepts
275 items under this folder.
Jun 06, 2026
A
Jun 06, 2026
A3C
policy-gradient
actor-critic
deep-rl
exam-topic
Jun 06, 2026
Action-Value Methods
foundations
evaluation
exam-topic
Jun 06, 2026
Actor-Critic
policy-gradient
actor-critic
exam-topic
Jun 06, 2026
Adagrad
optimization
deep-rl
exam-topic
Jun 06, 2026
Adam
optimization
Jun 06, 2026
Advantage Actor-Critic (A2C)
policy-gradient
actor-critic
deep-rl
exam-topic
Jun 06, 2026
Advantage Function
policy-gradient
actor-critic
value-function
temporal-difference
Jun 06, 2026
Agentic Search
rag
agents
retrieval
reasoning
Jun 06, 2026
Algorithmic Fairness
ir-society
Jun 06, 2026
AlphaGo Zero
deep-rl
planning
exam-topic
Jun 06, 2026
Approximate Nearest Neighbor
neural-ir
vector-search
algorithms
Jun 06, 2026
Atlas
neural-ir
dense-retrieval
exam-topic
Jun 06, 2026
Atomic Item IDs
generative-rec
sequential-rec
exam-topic
Jun 06, 2026
Autoregressive Generation
generative-rec
sequential-rec
llm
exam-topic
Jun 06, 2026
Autoregressive Retrieval
neural-ir
generative-retrieval
Jun 06, 2026
BERT for IR
neural-ir
Jun 06, 2026
BERT4Rec
sequential-rec
collaborative-filtering
exam-topic
Jun 06, 2026
BM25
retrieval-models
key-formula
exam-topic
Jun 06, 2026
Background Planning
model-based-rl
planning
exam-topic
Jun 06, 2026
Bag of Words
foundations
Jun 06, 2026
Baseline
variance-reduction
policy-gradient
reinforcement-learning
Jun 06, 2026
Bayesian Personalized Ranking (BPR)
collaborative-filtering
sequential-rec
evaluation
exam-topic
Jun 06, 2026
Beam Search
generative-rec
sequential-rec
exam-topic
Jun 06, 2026
Belief State
foundations
exam-topic
Jun 06, 2026
Bellman Equation
foundations
key-formula
exam-topic
Jun 06, 2026
Bellman Error
approximation
exam-topic
Jun 06, 2026
Bellman Optimality Equation
foundations
key-formula
Jun 06, 2026
Beyond-Accuracy Metrics
evaluation
fairness
exam-topic
groups
Jun 06, 2026
Bi-Encoder
neural-ir
exam-topic
Jun 06, 2026
Bias-Variance Trade-off
foundations
Jun 06, 2026
Binary Independence Model
retrieval-models
Jun 06, 2026
Boolean Retrieval
foundations
Jun 06, 2026
Bootstrapping
foundations
Jun 06, 2026
COIL
neural-ir
learned-sparse-retrieval
Jun 06, 2026
Cascading Position Bias
click-models
position-bias
user-behavior
Jun 06, 2026
Catalog Coverage
evaluation
fairness
exam-topic
Jun 06, 2026
Classifier-Free Guidance
deep-rl
offline-rl
generative-models
exam-topic
Jun 06, 2026
Click Models
evaluation
Jun 06, 2026
ColBERT
neural-ir
exam-topic
Jun 06, 2026
Cold Start Problem
collaborative-filtering
generative-rec
llm
exam-topic
Jun 06, 2026
Collaborative Filtering
collaborative-filtering
evaluation
exam-topic
Jun 06, 2026
Compatible Function Approximation
policy-gradient
actor-critic
exam-topic
Jun 06, 2026
Conservative Q-Learning (CQL)
deep-rl
exam-topic
Jun 06, 2026
Constrained Decoding
generative-rec
llm
exam-topic
Jun 06, 2026
Content-Based Recommendation
collaborative-filtering
exam-topic
Jun 06, 2026
Contrastive Learning
neural-ir
training
deep-learning
representation-learning
Jun 06, 2026
Convolutional Neural Networks
deep-learning
Jun 06, 2026
Counterfactual Learning to Rank
learning-to-rank
causal-inference
offline-evaluation
Jun 06, 2026
Cranfield Paradigm
evaluation
exam-topic
Jun 06, 2026
Critical Information Theory
ir-society
critical-theory
Jun 06, 2026
Cross-Domain Recommendation
collaborative-filtering
llm
exam-topic
Jun 06, 2026
Cross-Encoder
neural-ir
exam-topic
Jun 06, 2026
DPR
neural-ir
dense-retrieval
embedding-search
Jun 06, 2026
DSI
neural-ir
generative-retrieval
Jun 06, 2026
Data Sparsity
collaborative-filtering
sequential-rec
exam-topic
Jun 06, 2026
Deadly Triad
approximation
exam-topic
key-formula
Jun 06, 2026
Decision Diffuser
deep-rl
offline-rl
generative-models
Jun 06, 2026
Decision Transformer
deep-rl
offline-rl
sequence-modeling
Jun 06, 2026
Decision-Time Planning
model-based-rl
planning
exam-topic
Jun 06, 2026
Deep Deterministic Policy Gradient
policy-gradient
deep-rl
actor-critic
exam-topic
Jun 06, 2026
Deep Q-Network (DQN)
deep-rl
exam-topic
key-formula
Jun 06, 2026
Deep Recurrent Q-Learning
deep-rl
Jun 06, 2026
Deep Reinforcement Learning
deep-rl
Jun 06, 2026
DeepCT
neural-ir
Jun 06, 2026
DeepImpact
neural-ir
Jun 06, 2026
DeepSeek-R1
llm
reasoning
reinforcement-learning
emergent-behavior
Jun 06, 2026
Dense Retrieval
neural-ir
exam-topic
Jun 06, 2026
Deterministic Policy Gradient
policy-gradient
off-policy
continuous-control
actor-critic
Jun 06, 2026
Diffusion Models
generative-rec
sequential-rec
exam-topic
Jun 06, 2026
Direct Preference Optimization (DPO)
generative-rec
llm
exam-topic
Jun 06, 2026
Discount Factor
foundations
Jun 06, 2026
Diversity
evaluation
exam-topic
Jun 06, 2026
DocT5Query
neural-ir
document-expansion
Jun 06, 2026
Document Expansion
neural-ir
document-expansion
Jun 06, 2026
Document Identifiers
neural-ir
generative-retrieval
Jun 06, 2026
Doubly Robust Estimation
causal-inference
unbiased-estimation
ranking
Jun 06, 2026
Dyna
planning
exam-topic
Jun 06, 2026
Dynamic Programming
tabular-methods
exam-topic
Jun 06, 2026
Emancipatory IR
ir-society
Jun 06, 2026
Entropy
policy-gradient
exploration
deep-rl
exam-topic
Jun 06, 2026
Episodic Semi-Gradient Control
approximation
exam-topic
Jun 06, 2026
Epsilon-Greedy Policy
foundations
exam-topic
Jun 06, 2026
Every-Visit MC
tabular-methods
Jun 06, 2026
Examination Hypothesis
click-models
user-behavior
ranking
Jun 06, 2026
Expected SARSA
tabular-methods
key-formula
Jun 06, 2026
Experience Replay
deep-rl
exam-topic
Jun 06, 2026
Explainability
ir-society
Jun 06, 2026
Exploration vs Exploitation
foundations
exam-topic
Jun 06, 2026
Exploring Starts
tabular-methods
Jun 06, 2026
Exposure Fairness
ir-society
Jun 06, 2026
F-Measure
evaluation
key-formula
exam-topic
Jun 06, 2026
Factorized Personalized Markov Chains (FPMC)
collaborative-filtering
sequential-rec
exam-topic
Jun 06, 2026
Fairness in Recommendation
evaluation
fairness
exam-topic
groups
Jun 06, 2026
Feature Construction
approximation
Jun 06, 2026
FiD
neural-ir
generative-retrieval
exam-topic
Jun 06, 2026
Filter Bubble
evaluation
fairness
exam-topic
Jun 06, 2026
First-Visit MC
tabular-methods
Jun 06, 2026
Fisher Information
policy-gradient
optimization
deep-rl
exam-topic
Jun 06, 2026
Fourier Basis
approximation
key-formula
exam-topic
Jun 06, 2026
Function Approximation
approximation
exam-topic
Jun 06, 2026
GENRE
neural-ir
entity-retrieval
autoregressive
Jun 06, 2026
GPU Architecture
gpu
efficiency
Jun 06, 2026
GRPO
policy-gradient
deep-rl
llm-training
Jun 06, 2026
GRU4Rec
sequential-rec
collaborative-filtering
exam-topic
Jun 06, 2026
Gated Recurrent Unit (GRU)
sequential-rec
exam-topic
Jun 06, 2026
Gaussian Policy
policy-gradient
continuous-actions
stochastic
Jun 06, 2026
Generalized Advantage Estimation
advantage-function
temporal-difference
policy-gradient
bias-variance
Jun 06, 2026
Generalized Policy Iteration
foundations
exam-topic
Jun 06, 2026
Generative Recommendation
generative-rec
sequential-rec
llm
exam-topic
Jun 06, 2026
Generative Retrieval
neural-ir
exam-topic
Jun 06, 2026
Gradient Descent
optimization
Jun 06, 2026
Gradient-TD Methods
approximation
exam-topic
Jun 06, 2026
HSTU
sequential-rec
generative-rec
llm
exam-topic
Jun 06, 2026
Hard Negative Mining
neural-ir
training
optimization
Jun 06, 2026
Hierarchical Reinforcement Learning
temporal-abstraction
exploration
exam-topic
Jun 06, 2026
Hit Rate
evaluation
collaborative-filtering
sequential-rec
exam-topic
Jun 06, 2026
Hybrid Recommendation
collaborative-filtering
exam-topic
Jun 06, 2026
Implicit and Explicit Feedback
collaborative-filtering
evaluation
exam-topic
Jun 06, 2026
Importance Sampling
tabular-methods
approximation
key-formula
exam-topic
Jun 06, 2026
In-Context Learning
llm
generative-rec
exam-topic
Jun 06, 2026
Information Retrieval
foundations
Jun 06, 2026
Inverse Dynamics Model
model-based
offline-rl
deep-rl
exam-topic
Jun 06, 2026
Inverse Propensity Weighting
unbiased-estimation
causal-inference
ranking
Jun 06, 2026
Inverted Index
foundations
exam-topic
Jun 06, 2026
Item Selection Bias
bias
ranking
position-bias
Jun 06, 2026
Item Tokenization
generative-rec
llm
exam-topic
Jun 06, 2026
Kernel Fusion
gpu
efficiency
Jun 06, 2026
LLM-based Recommendation
llm
generative-rec
collaborative-filtering
exam-topic
Jun 06, 2026
LSTD
approximation
key-formula
exam-topic
Jun 06, 2026
LSTM
deep-rl
neural-ir
partial-observability
exam-topic
Jun 06, 2026
LambdaMART
ltr
ranking
listwise-loss
exam-topic
Jun 06, 2026
Language Model for IR
retrieval-models
Jun 06, 2026
Large Language Models (LLM)
llm
generative-rec
exam-topic
Jun 06, 2026
Large Recommendation Models (LRM)
generative-rec
sequential-rec
llm
exam-topic
Jun 06, 2026
Learned Sparse Retrieval
neural-ir
exam-topic
Jun 06, 2026
Learning to Rank
neural-ir
Jun 06, 2026
Linear Function Approximation
approximation
key-formula
exam-topic
Jun 06, 2026
Listwise LTR
ltr
ranking
machine-learning
listwise-loss
neural-ranking
ranking-metrics
Jun 06, 2026
LoRA
llm
generative-rec
exam-topic
Jun 06, 2026
Long-Tail Distribution
evaluation
fairness
collaborative-filtering
exam-topic
Jun 06, 2026
MAP
evaluation
key-formula
exam-topic
Jun 06, 2026
MRR
evaluation
key-formula
Jun 06, 2026
Markov Chain
sequential-rec
collaborative-filtering
exam-topic
Jun 06, 2026
Markov Decision Process
foundations
exam-topic
Jun 06, 2026
Matrix Factorization
collaborative-filtering
exam-topic
Jun 06, 2026
Maximal Marginal Relevance (MMR)
evaluation
sequential-rec
generative-rec
exam-topic
Jun 06, 2026
Maximum Entropy RL
deep-rl
policy-gradient
Jun 06, 2026
Mean Squared Value Error
approximation
key-formula
exam-topic
Jun 06, 2026
Misinformation
ir-society
Jun 06, 2026
Model of the Environment
foundations
Jun 06, 2026
Model-Based Reinforcement Learning
planning
exam-topic
Jun 06, 2026
Momentum
optimization
deep-rl
exam-topic
Jun 06, 2026
MonoBERT
neural-ir
transformer
bert
Jun 06, 2026
Monte Carlo Control
tabular-methods
Jun 06, 2026
Monte Carlo Methods
tabular-methods
exam-topic
Jun 06, 2026
Monte Carlo Tree Search (MCTS)
planning
deep-rl
exam-topic
Jun 06, 2026
Multi-Armed Bandit
foundations
exam-topic
Jun 06, 2026
Multi-Stage Ranking
neural-ir
architecture
efficiency
Jun 06, 2026
Multiple Additive Regression Trees
ltr
ranking
gradient-boosting
exam-topic
Jun 06, 2026
NDCG
evaluation
key-formula
exam-topic
Jun 06, 2026
Natural Policy Gradient
policy-gradient
optimization
fisher-information
geometry
Jun 06, 2026
Negative Sampling
collaborative-filtering
sequential-rec
evaluation
exam-topic
Jun 06, 2026
Neighborhood-based Collaborative Filtering
collaborative-filtering
exam-topic
Jun 06, 2026
Neural Collaborative Filtering
collaborative-filtering
exam-topic
Jun 06, 2026
Neural Network Function Approximation
approximation
deep-rl
Jun 06, 2026
Neural Networks
foundations
Jun 06, 2026
Neural Reranking
neural-ir
reranking
deep-learning
Jun 06, 2026
Next-Item Prediction
sequential-rec
generative-rec
collaborative-filtering
exam-topic
Jun 06, 2026
Novelty
evaluation
generative-rec
exam-topic
Jun 06, 2026
Off-Policy Divergence
approximation
exam-topic
Jun 06, 2026
Off-Policy Learning
foundations
exam-topic
Jun 06, 2026
Offline Reinforcement Learning
deep-rl
Jun 06, 2026
On-Policy Distribution
approximation
Jun 06, 2026
On-Policy Learning
foundations
Jun 06, 2026
On-Policy vs Off-Policy
foundations
exam-topic
Jun 06, 2026
OneRec
generative-rec
sequential-rec
llm
exam-topic
Jun 06, 2026
Online and Offline Evaluation
evaluation
exam-topic
Jun 06, 2026
Optimal Policy
foundations
exam-topic
Jun 06, 2026
Optimality and Approximation
approximation
Jun 06, 2026
Optimistic Initial Values
foundations
exam-topic
Jun 06, 2026
Ordinary Least Squares
optimization
Jun 06, 2026
Outlier Bias
bias
click-models
user-behavior
Jun 06, 2026
P5
generative-rec
llm
exam-topic
Jun 06, 2026
POMDP
foundations
exam-topic
Jun 06, 2026
PPO
policy-gradient
deep-rl
exam-topic
Jun 06, 2026
Pairwise LTR
ltr
ranking
machine-learning
pairwise-loss
neural-ranking
Jun 06, 2026
Partial Observability
foundations
exam-topic
Jun 06, 2026
Plackett-Luce Model
ltr
listwise-loss
ranking
exam-topic
Jun 06, 2026
Pointwise LTR
ltr
ranking
machine-learning
regression
classification
Jun 06, 2026
Policy Evaluation
tabular-methods
Jun 06, 2026
Policy Gradient Methods
policy-gradient
exam-topic
Jun 06, 2026
Policy Gradient Theorem
policy-gradient
theoretical-foundation
gradient-ascent
Jun 06, 2026
Policy Improvement
tabular-methods
exam-topic
Jun 06, 2026
Policy Iteration
tabular-methods
exam-topic
Jun 06, 2026
Policy
foundations
exam-topic
Jun 06, 2026
Pooling
evaluation
Jun 06, 2026
Popularity Bias
evaluation
fairness
exam-topic
Jun 06, 2026
Position Bias
bias
ranking
user-behavior
Jun 06, 2026
Position-Based Click Model
click-models
unbiased-ltr
user-behavior
exam-topic
Jun 06, 2026
Precision at K
evaluation
key-formula
Jun 06, 2026
Precision
evaluation
key-formula
Jun 06, 2026
Predictive State Representation
foundations
Jun 06, 2026
Product Quantization
neural-ir
efficiency
Jun 06, 2026
Q-Learning
tabular-methods
key-formula
exam-topic
Jun 06, 2026
Query Expansion
neural-ir
Jun 06, 2026
Query Likelihood Model
retrieval-models
key-formula
exam-topic
Jun 06, 2026
REINFORCE
policy-gradient
algorithm
monte-carlo
on-policy
Jun 06, 2026
RMSProp
optimization
Jun 06, 2026
RQ-VAE
generative-rec
sequential-rec
exam-topic
Jun 06, 2026
Recall
evaluation
key-formula
Jun 06, 2026
Recommender System
collaborative-filtering
evaluation
exam-topic
Jun 06, 2026
Recurrent Neural Network (RNN)
sequential-rec
exam-topic
Jun 06, 2026
Regularization
foundations
Jun 06, 2026
Reinforcement Learning from Human Feedback
policy-gradient
deep-rl
exam-topic
Jun 06, 2026
Reinforcement Learning
foundations
Jun 06, 2026
Reparameterization Trick
deep-rl
optimization
Jun 06, 2026
Retrieval-Augmented Generation
neural-ir
Jun 06, 2026
Return
foundations
key-formula
Jun 06, 2026
Reward Signal
foundations
Jun 06, 2026
Reward-Weighted Regression
policy-gradient
algorithm
offline-rl
exam-topic
Jun 06, 2026
Rocchio Algorithm
retrieval-models
relevance-feedback
key-formula
Jun 06, 2026
Rollout Algorithm
planning
Jun 06, 2026
SARSA
tabular-methods
key-formula
exam-topic
Jun 06, 2026
SASRec
sequential-rec
collaborative-filtering
exam-topic
Jun 06, 2026
SEARCH-R1
rag
agentic-search
reinforcement-learning
llm
Jun 06, 2026
SPLADE
neural-ir
exam-topic
Jun 06, 2026
Scaling Laws
generative-rec
llm
exam-topic
Jun 06, 2026
Self-Attention
sequential-rec
generative-rec
exam-topic
Jun 06, 2026
Self-RAG
neural-ir
exam-topic
Jun 06, 2026
Semantic IDs
generative-rec
sequential-rec
llm
exam-topic
Jun 06, 2026
Semi-Gradient Methods
approximation
key-formula
exam-topic
Jun 06, 2026
Sequential Recommendation
collaborative-filtering
sequential-rec
exam-topic
Jun 06, 2026
Serendipity
evaluation
collaborative-filtering
exam-topic
Jun 06, 2026
Session-based Recommendation
sequential-rec
collaborative-filtering
exam-topic
Jun 06, 2026
Smoothing
retrieval-models
exam-topic
Jun 06, 2026
Soft Actor-Critic (SAC)
deep-rl
exam-topic
actor-critic
Jun 06, 2026
Softmax Policy
policy-gradient
discrete-actions
stochastic
exploration
Jun 06, 2026
Sparton
gpu
efficiency
Jun 06, 2026
State Aggregation
approximation
function-approximation
exam-topic
Jun 06, 2026
State Space
foundations
Jun 06, 2026
Stemming
foundations
Jun 06, 2026
Stochastic Gradient Descent
optimization
Jun 06, 2026
Stop Words
foundations
Jun 06, 2026
Supervised Fine-Tuning (SFT)
generative-rec
llm
exam-topic
Jun 06, 2026
Surrounding Item Bias
bias
click-models
unbiased-ltr
exam-topic
Jun 06, 2026
TD Error
tabular-methods
key-formula
Jun 06, 2026
TD Fixed Point
approximation
key-formula
exam-topic
Jun 06, 2026
TD(0)
tabular-methods
key-formula
Jun 06, 2026
TD3
policy-gradient
deep-rl
actor-critic
exam-topic
Jun 06, 2026
TF-IDF
retrieval-models
key-formula
exam-topic
Jun 06, 2026
TIGER
generative-rec
sequential-rec
exam-topic
Jun 06, 2026
Tabular RL
foundations
Jun 06, 2026
Target Network
deep-rl
exam-topic
Jun 06, 2026
Temporal Difference Learning
tabular-methods
exam-topic
key-formula
Jun 06, 2026
Term Weighting
foundations
Jun 06, 2026
Tile Coding
approximation
exam-topic
Jun 06, 2026
Tokenization
foundations
Jun 06, 2026
Top-N Recommendation
collaborative-filtering
evaluation
exam-topic
Jun 06, 2026
Transformer Kernel (TK)
neural-ir
reranking
exam-topic
Jun 06, 2026
Transformers
foundations
deep-learning
nlp
key-formula
exam-topic
Jun 06, 2026
Triton
gpu
efficiency
Jun 06, 2026
Trust Bias
bias
click-models
user-behavior
Jun 06, 2026
Trust Region Policy Optimization (TRPO)
policy-gradient
deep-rl
optimization
exam-topic
Jun 06, 2026
Unbiased Learning to Rank
unbiased-ltr
evaluation
neural-ir
exam-topic
Jun 06, 2026
Upper Confidence Bound
foundations
Jun 06, 2026
Upside-Down RL
deep-rl
offline-rl
policy-gradient
exam-topic
Jun 06, 2026
User-Item Interaction Matrix
collaborative-filtering
exam-topic
Jun 06, 2026
Value Function
foundations
key-formula
exam-topic
Jun 06, 2026
Value Iteration
tabular-methods
exam-topic
Jun 06, 2026
Vector Space Model
retrieval-models
foundations
Jun 06, 2026
Word Embeddings
foundations
nlp
embeddings
Jun 06, 2026
duoT5
neural-ir
reranking
Jun 06, 2026
monoT5
neural-ir
reranking
Jun 06, 2026
uniCOIL
neural-ir