Position-Based Click Model
Definition
Position-Based Click Model (PBM)
The Position-Based Click Model is the simplest practical click model. It assumes a user clicks a document shown at rank iff the user examines rank and the document is relevant — and crucially that examination depends only on the rank , never on the document, the query, or the surrounding results. It is the operational instantiation of the Examination Hypothesis in which the examination probability is a per-rank constant.
Intuition
Think of each result position as having a fixed “visibility” determined purely by where it sits on the page. Rank 1 is almost always looked at; rank 8 is looked at far less. The PBM bakes this into a single number per rank, the propensity , and then treats whether the user clicks given that they looked as a clean measurement of relevance.
This factorization is what makes the model so useful: the two latent causes of a click — did they see it? (position) and did they like it? (relevance) — are assumed independent. Once you know the per-rank examination probabilities, an observed click becomes a noisy but unbiased-after-reweighting signal of relevance, which is exactly what IPW exploits.
The contrast to keep in mind: PBM says examination of rank is the same regardless of what is above it. The cascade model / Cascading Position Bias says the opposite — examination depends on the relevance of everything above .
Mathematical Formulation
A click random variable for document at rank factors into two independent Bernoulli events:
where:
- — observed click on document displayed at rank
- — latent examination event for rank ; is the propensity
- — latent relevance of to query ;
- — depends only on (the defining PBM assumption), monotonically decreasing in
- — depends only on , never on
So the per-(document, rank) click probability is simply the product . The latent variables are unobserved; only is observed, which is what forces an EM-style inference.
Likelihood and EM estimation
Given a click log of sessions , each presenting document at rank with observed click , the data log-likelihood is:
Because is latent, parameters are fit by Expectation-Maximization:
- E-step — for a non-click () we infer the posterior that the rank was nonetheless examined (the click failed because the doc was irrelevant): (for a click both and are certain).
- M-step — re-estimate each parameter as the average of its inferred posterior over the relevant sessions:
Iterating E and M to convergence yields the propensities used downstream.
Key Properties / Variants
- Two latent factors, one product — the entire model is ; everything else (EM, IPW) is bookkeeping on top of this factorization.
- Propensities for counterfactual LTR — the fitted are exactly the inverse weights used by Inverse Propensity Weighting: a click at rank counts as units of relevance evidence, debiasing Counterfactual Learning to Rank objectives.
- Estimating without full EM — propensities can also be recovered by result randomization (swap a document across ranks and watch how its click rate scales) or intervention harvesting from naturally occurring rank changes, avoiding the joint EM fit.
- Identifiability caveat — if documents rarely change rank, the data cannot separate “clicked because examined” from “clicked because relevant”; multiple explain the log equally well. Randomization breaks this degeneracy.
- Position-only assumption is the weakness — PBM ignores that earlier results affect later examination. When users scan-and-stop, the cascade model / Cascading Position Bias is the correct alternative.
- Does not model trust or outlier effects — top ranks attracting extra clicks (Trust Bias) and visually distinctive items grabbing attention (Outlier Bias) both violate PBM’s clean factorization and require extended models.
Algorithm: PBM Parameter Estimation via EM
──────────────────────────────────────────────
Input: click log {(d_s, k_s, c_s)} over sessions s
Initialize θ_k, γ_d ∈ (0,1) for all ranks k, docs d
Repeat until convergence:
# E-step: posteriors over latent E, R for non-clicks
For each session s:
if c_s == 1:
P(E=1) ← 1 ; P(R=1) ← 1
else: # c_s == 0
denom ← 1 - θ_{k_s} * γ_{d_s}
P(E=1) ← θ_{k_s} * (1 - γ_{d_s}) / denom
P(R=1) ← (1 - θ_{k_s}) * γ_{d_s} / denom
# M-step: re-estimate as posterior averages
For each rank k:
θ_k ← mean over {s : k_s = k} of [ c_s + (1-c_s)*P(E=1)_s ]
For each doc d:
γ_d ← mean over {s : d_s = d} of [ c_s + (1-c_s)*P(R=1)_s ]
Return propensities {θ_k} → feed as 1/θ_k weights to IPWConnections
- Instantiates: Examination Hypothesis (examination probability made a per-rank constant)
- Member of: Click Models (simplest member of the family)
- Quantifies: Position Bias via the propensities
- Feeds: Inverse Propensity Weighting and Counterfactual Learning to Rank / Unbiased Learning to Rank
- Contrasted with: cascade model / Cascading Position Bias (examination depends on items above)
- Violated by: Trust Bias, Outlier Bias, Surrounding Item Bias
- Robust alternative when assumptions break: Doubly Robust Estimation