Study Notes

Folder: Concepts

275 items under this folder.

Jun 06, 2026
A
Jun 06, 2026
A3C
Jun 06, 2026
Action-Value Methods
Jun 06, 2026
Actor-Critic
Jun 06, 2026
Adagrad
Jun 06, 2026
Adam
- optimization
Jun 06, 2026
Advantage Actor-Critic (A2C)
Jun 06, 2026
Advantage Function
Jun 06, 2026
Agentic Search
Jun 06, 2026
Algorithmic Fairness
- ir-society
Jun 06, 2026
AlphaGo Zero
Jun 06, 2026
Approximate Nearest Neighbor
Jun 06, 2026
Atlas
Jun 06, 2026
Atomic Item IDs
Jun 06, 2026
Autoregressive Generation
Jun 06, 2026
Autoregressive Retrieval
- neural-ir
- generative-retrieval
Jun 06, 2026
BERT for IR
- neural-ir
Jun 06, 2026
BERT4Rec
Jun 06, 2026
BM25
Jun 06, 2026
Background Planning
Jun 06, 2026
Bag of Words
- foundations
Jun 06, 2026
Baseline
Jun 06, 2026
Bayesian Personalized Ranking (BPR)
Jun 06, 2026
Beam Search
Jun 06, 2026
Belief State
- foundations
- exam-topic
Jun 06, 2026
Bellman Equation
Jun 06, 2026
Bellman Error
- approximation
- exam-topic
Jun 06, 2026
Bellman Optimality Equation
- foundations
- key-formula
Jun 06, 2026
Beyond-Accuracy Metrics
Jun 06, 2026
Bi-Encoder
- neural-ir
- exam-topic
Jun 06, 2026
Bias-Variance Trade-off
- foundations
Jun 06, 2026
Binary Independence Model
- retrieval-models
Jun 06, 2026
Boolean Retrieval
- foundations
Jun 06, 2026
Bootstrapping
- foundations
Jun 06, 2026
COIL
- neural-ir
- learned-sparse-retrieval
Jun 06, 2026
Cascading Position Bias
Jun 06, 2026
Catalog Coverage
Jun 06, 2026
Classifier-Free Guidance
Jun 06, 2026
Click Models
- evaluation
Jun 06, 2026
ColBERT
- neural-ir
- exam-topic
Jun 06, 2026
Cold Start Problem
Jun 06, 2026
Collaborative Filtering
Jun 06, 2026
Compatible Function Approximation
Jun 06, 2026
Conservative Q-Learning (CQL)
- deep-rl
- exam-topic
Jun 06, 2026
Constrained Decoding
Jun 06, 2026
Content-Based Recommendation
- collaborative-filtering
- exam-topic
Jun 06, 2026
Contrastive Learning
Jun 06, 2026
Convolutional Neural Networks
- deep-learning
Jun 06, 2026
Counterfactual Learning to Rank
Jun 06, 2026
Cranfield Paradigm
- evaluation
- exam-topic
Jun 06, 2026
Critical Information Theory
- ir-society
- critical-theory
Jun 06, 2026
Cross-Domain Recommendation
Jun 06, 2026
Cross-Encoder
- neural-ir
- exam-topic
Jun 06, 2026
DPR
Jun 06, 2026
DSI
- neural-ir
- generative-retrieval
Jun 06, 2026
Data Sparsity
Jun 06, 2026
Deadly Triad
Jun 06, 2026
Decision Diffuser
Jun 06, 2026
Decision Transformer
Jun 06, 2026
Decision-Time Planning
Jun 06, 2026
Deep Deterministic Policy Gradient
Jun 06, 2026
Deep Q-Network (DQN)
Jun 06, 2026
Deep Recurrent Q-Learning
- deep-rl
Jun 06, 2026
Deep Reinforcement Learning
- deep-rl
Jun 06, 2026
DeepCT
- neural-ir
Jun 06, 2026
DeepImpact
- neural-ir
Jun 06, 2026
DeepSeek-R1
Jun 06, 2026
Dense Retrieval
- neural-ir
- exam-topic
Jun 06, 2026
Deterministic Policy Gradient
Jun 06, 2026
Diffusion Models
Jun 06, 2026
Direct Preference Optimization (DPO)
Jun 06, 2026
Discount Factor
- foundations
Jun 06, 2026
Diversity
- evaluation
- exam-topic
Jun 06, 2026
DocT5Query
- neural-ir
- document-expansion
Jun 06, 2026
Document Expansion
- neural-ir
- document-expansion
Jun 06, 2026
Document Identifiers
- neural-ir
- generative-retrieval
Jun 06, 2026
Doubly Robust Estimation
Jun 06, 2026
Dyna
- planning
- exam-topic
Jun 06, 2026
Dynamic Programming
- tabular-methods
- exam-topic
Jun 06, 2026
Emancipatory IR
- ir-society
Jun 06, 2026
Entropy
Jun 06, 2026
Episodic Semi-Gradient Control
- approximation
- exam-topic
Jun 06, 2026
Epsilon-Greedy Policy
- foundations
- exam-topic
Jun 06, 2026
Every-Visit MC
- tabular-methods
Jun 06, 2026
Examination Hypothesis
Jun 06, 2026
Expected SARSA
- tabular-methods
- key-formula
Jun 06, 2026
Experience Replay
- deep-rl
- exam-topic
Jun 06, 2026
Explainability
- ir-society
Jun 06, 2026
Exploration vs Exploitation
- foundations
- exam-topic
Jun 06, 2026
Exploring Starts
- tabular-methods
Jun 06, 2026
Exposure Fairness
- ir-society
Jun 06, 2026
F-Measure
Jun 06, 2026
Factorized Personalized Markov Chains (FPMC)
Jun 06, 2026
Fairness in Recommendation
Jun 06, 2026
Feature Construction
- approximation
Jun 06, 2026
FiD
Jun 06, 2026
Filter Bubble
Jun 06, 2026
First-Visit MC
- tabular-methods
Jun 06, 2026
Fisher Information
Jun 06, 2026
Fourier Basis
Jun 06, 2026
Function Approximation
- approximation
- exam-topic
Jun 06, 2026
GENRE
Jun 06, 2026
GPU Architecture
- gpu
- efficiency
Jun 06, 2026
GRPO
Jun 06, 2026
GRU4Rec
Jun 06, 2026
Gated Recurrent Unit (GRU)
- sequential-rec
- exam-topic
Jun 06, 2026
Gaussian Policy
Jun 06, 2026
Generalized Advantage Estimation
Jun 06, 2026
Generalized Policy Iteration
- foundations
- exam-topic
Jun 06, 2026
Generative Recommendation
Jun 06, 2026
Generative Retrieval
- neural-ir
- exam-topic
Jun 06, 2026
Gradient Descent
- optimization
Jun 06, 2026
Gradient-TD Methods
- approximation
- exam-topic
Jun 06, 2026
HSTU
Jun 06, 2026
Hard Negative Mining
Jun 06, 2026
Hierarchical Reinforcement Learning
Jun 06, 2026
Hit Rate
Jun 06, 2026
Hybrid Recommendation
- collaborative-filtering
- exam-topic
Jun 06, 2026
Implicit and Explicit Feedback
Jun 06, 2026
Importance Sampling
Jun 06, 2026
In-Context Learning
Jun 06, 2026
Information Retrieval
- foundations
Jun 06, 2026
Inverse Dynamics Model
Jun 06, 2026
Inverse Propensity Weighting
Jun 06, 2026
Inverted Index
- foundations
- exam-topic
Jun 06, 2026
Item Selection Bias
Jun 06, 2026
Item Tokenization
Jun 06, 2026
Kernel Fusion
- gpu
- efficiency
Jun 06, 2026
LLM-based Recommendation
Jun 06, 2026
LSTD
Jun 06, 2026
LSTM
Jun 06, 2026
LambdaMART
Jun 06, 2026
Language Model for IR
- retrieval-models
Jun 06, 2026
Large Language Models (LLM)
Jun 06, 2026
Large Recommendation Models (LRM)
Jun 06, 2026
Learned Sparse Retrieval
- neural-ir
- exam-topic
Jun 06, 2026
Learning to Rank
- neural-ir
Jun 06, 2026
Linear Function Approximation
Jun 06, 2026
Listwise LTR
Jun 06, 2026
LoRA
Jun 06, 2026
Long-Tail Distribution
Jun 06, 2026
MAP
Jun 06, 2026
MRR
- evaluation
- key-formula
Jun 06, 2026
Markov Chain
Jun 06, 2026
Markov Decision Process
- foundations
- exam-topic
Jun 06, 2026
Matrix Factorization
- collaborative-filtering
- exam-topic
Jun 06, 2026
Maximal Marginal Relevance (MMR)
Jun 06, 2026
Maximum Entropy RL
- deep-rl
- policy-gradient
Jun 06, 2026
Mean Squared Value Error
Jun 06, 2026
Misinformation
- ir-society
Jun 06, 2026
Model of the Environment
- foundations
Jun 06, 2026
Model-Based Reinforcement Learning
- planning
- exam-topic
Jun 06, 2026
Momentum
Jun 06, 2026
MonoBERT
Jun 06, 2026
Monte Carlo Control
- tabular-methods
Jun 06, 2026
Monte Carlo Methods
- tabular-methods
- exam-topic
Jun 06, 2026
Monte Carlo Tree Search (MCTS)
Jun 06, 2026
Multi-Armed Bandit
- foundations
- exam-topic
Jun 06, 2026
Multi-Stage Ranking
Jun 06, 2026
Multiple Additive Regression Trees
Jun 06, 2026
NDCG
Jun 06, 2026
Natural Policy Gradient
Jun 06, 2026
Negative Sampling
Jun 06, 2026
Neighborhood-based Collaborative Filtering
- collaborative-filtering
- exam-topic
Jun 06, 2026
Neural Collaborative Filtering
- collaborative-filtering
- exam-topic
Jun 06, 2026
Neural Network Function Approximation
- approximation
- deep-rl
Jun 06, 2026
Neural Networks
- foundations
Jun 06, 2026
Neural Reranking
Jun 06, 2026
Next-Item Prediction
Jun 06, 2026
Novelty
Jun 06, 2026
Off-Policy Divergence
- approximation
- exam-topic
Jun 06, 2026
Off-Policy Learning
- foundations
- exam-topic
Jun 06, 2026
Offline Reinforcement Learning
- deep-rl
Jun 06, 2026
On-Policy Distribution
- approximation
Jun 06, 2026
On-Policy Learning
- foundations
Jun 06, 2026
On-Policy vs Off-Policy
- foundations
- exam-topic
Jun 06, 2026
OneRec
Jun 06, 2026
Online and Offline Evaluation
- evaluation
- exam-topic
Jun 06, 2026
Optimal Policy
- foundations
- exam-topic
Jun 06, 2026
Optimality and Approximation
- approximation
Jun 06, 2026
Optimistic Initial Values
- foundations
- exam-topic
Jun 06, 2026
Ordinary Least Squares
- optimization
Jun 06, 2026
Outlier Bias
Jun 06, 2026
P5
Jun 06, 2026
POMDP
- foundations
- exam-topic
Jun 06, 2026
PPO
Jun 06, 2026
Pairwise LTR
Jun 06, 2026
Partial Observability
- foundations
- exam-topic
Jun 06, 2026
Plackett-Luce Model
Jun 06, 2026
Pointwise LTR
Jun 06, 2026
Policy Evaluation
- tabular-methods
Jun 06, 2026
Policy Gradient Methods
- policy-gradient
- exam-topic
Jun 06, 2026
Policy Gradient Theorem
Jun 06, 2026
Policy Improvement
- tabular-methods
- exam-topic
Jun 06, 2026
Policy Iteration
- tabular-methods
- exam-topic
Jun 06, 2026
Policy
- foundations
- exam-topic
Jun 06, 2026
Pooling
- evaluation
Jun 06, 2026
Popularity Bias
Jun 06, 2026
Position Bias
Jun 06, 2026
Position-Based Click Model
Jun 06, 2026
Precision at K
- evaluation
- key-formula
Jun 06, 2026
Precision
- evaluation
- key-formula
Jun 06, 2026
Predictive State Representation
- foundations
Jun 06, 2026
Product Quantization
- neural-ir
- efficiency
Jun 06, 2026
Q-Learning
Jun 06, 2026
Query Expansion
- neural-ir
Jun 06, 2026
Query Likelihood Model
Jun 06, 2026
REINFORCE
Jun 06, 2026
RMSProp
- optimization
Jun 06, 2026
RQ-VAE
Jun 06, 2026
Recall
- evaluation
- key-formula
Jun 06, 2026
Recommender System
Jun 06, 2026
Recurrent Neural Network (RNN)
- sequential-rec
- exam-topic
Jun 06, 2026
Regularization
- foundations
Jun 06, 2026
Reinforcement Learning from Human Feedback
Jun 06, 2026
Reinforcement Learning
- foundations
Jun 06, 2026
Reparameterization Trick
- deep-rl
- optimization
Jun 06, 2026
Retrieval-Augmented Generation
- neural-ir
Jun 06, 2026
Return
- foundations
- key-formula
Jun 06, 2026
Reward Signal
- foundations
Jun 06, 2026
Reward-Weighted Regression
Jun 06, 2026
Rocchio Algorithm
Jun 06, 2026
Rollout Algorithm
- planning
Jun 06, 2026
SARSA
Jun 06, 2026
SASRec
Jun 06, 2026
SEARCH-R1
Jun 06, 2026
SPLADE
- neural-ir
- exam-topic
Jun 06, 2026
Scaling Laws
Jun 06, 2026
Self-Attention
Jun 06, 2026
Self-RAG
- neural-ir
- exam-topic
Jun 06, 2026
Semantic IDs
Jun 06, 2026
Semi-Gradient Methods
Jun 06, 2026
Sequential Recommendation
Jun 06, 2026
Serendipity
Jun 06, 2026
Session-based Recommendation
Jun 06, 2026
Smoothing
- retrieval-models
- exam-topic
Jun 06, 2026
Soft Actor-Critic (SAC)
Jun 06, 2026
Softmax Policy
Jun 06, 2026
Sparton
- gpu
- efficiency
Jun 06, 2026
State Aggregation
Jun 06, 2026
State Space
- foundations
Jun 06, 2026
Stemming
- foundations
Jun 06, 2026
Stochastic Gradient Descent
- optimization
Jun 06, 2026
Stop Words
- foundations
Jun 06, 2026
Supervised Fine-Tuning (SFT)
Jun 06, 2026
Surrounding Item Bias
Jun 06, 2026
TD Error
- tabular-methods
- key-formula
Jun 06, 2026
TD Fixed Point
Jun 06, 2026
TD(0)
- tabular-methods
- key-formula
Jun 06, 2026
TD3
Jun 06, 2026
TF-IDF
Jun 06, 2026
TIGER
Jun 06, 2026
Tabular RL
- foundations
Jun 06, 2026
Target Network
- deep-rl
- exam-topic
Jun 06, 2026
Temporal Difference Learning
Jun 06, 2026
Term Weighting
- foundations
Jun 06, 2026
Tile Coding
- approximation
- exam-topic
Jun 06, 2026
Tokenization
- foundations
Jun 06, 2026
Top-N Recommendation
Jun 06, 2026
Transformer Kernel (TK)
Jun 06, 2026
Transformers
Jun 06, 2026
Triton
- gpu
- efficiency
Jun 06, 2026
Trust Bias
Jun 06, 2026
Trust Region Policy Optimization (TRPO)
Jun 06, 2026
Unbiased Learning to Rank
Jun 06, 2026
Upper Confidence Bound
- foundations
Jun 06, 2026
Upside-Down RL
Jun 06, 2026
User-Item Interaction Matrix
- collaborative-filtering
- exam-topic
Jun 06, 2026
Value Function
Jun 06, 2026
Value Iteration
- tabular-methods
- exam-topic
Jun 06, 2026
Vector Space Model
- retrieval-models
- foundations
Jun 06, 2026
Word Embeddings
Jun 06, 2026
duoT5
- neural-ir
- reranking
Jun 06, 2026
monoT5
- neural-ir
- reranking
Jun 06, 2026
uniCOIL
- neural-ir
