Examination Hypothesis

Definition

The examination hypothesis states that:

A user clicks on an item if and only if they examine it AND find it relevant.

Formally:

$P (Click_{d, k}) = P (Exam_{d} ∣ position k) \cdot P (Rel_{d} ∣ q)$

This is the foundational assumption of Click Models, particularly the Position-Based Click Model.

Intuition

The hypothesis decomposes the click event into two independent components:

Examination: Did the user even see the item?
- Depends on position (users scan top-to-bottom)
- Depends on context (surrounding items, layout)
- Can depend on user behavior (e.g., did they find something already?)
Relevance: Given that they saw it, would they click?
- Depends on item quality
- Depends on query intent
- Independent of presentation (in the hypothesis)

Visual: A user must pass both filters to produce a click:

User
  ↓
[Examines position k?] ← Position bias
  ↓ Yes
[Relevant to query?] ← True relevance
  ↓ Yes
CLICK

Mathematical Formulation

Decomposition

Under the examination hypothesis, the joint probability of a click factors:

$P (Click_{d, k}) = P (Exam_{k}, Rel_{d})$

By conditional independence (the hypothesis):

$P (Exam_{k}, Rel_{d}) = P (Exam_{k}) \cdot P (Rel_{d})$

Note: $Rel_{d}$ is independent of $k$ (position).

Logistic Formulation

In probabilistic models, often expressed:

$P (Click_{d, k}) = σ (exam (k) + rel (d))$

where $σ$ is the sigmoid function, and:

$exam (k)$ captures position effects
$rel (d)$ captures relevance effects

In Click Models

The hypothesis enables clean graphical models. For the Position-Based Click Model:

        Exam_k
         /   \
        /     \
       /       \
      ↓         ↓
    Click ← Rel_d
      ↑
    observed

The examination node and relevance node conditionally determine the click.

Implications

1. Invertibility of Position Bias

Under the hypothesis, if we know clicks and true relevance, we can back out position bias:

$P (Exam_{k}) = \frac{P ( Click _{d, k} )}{P ( Rel _{d} ∣ q )}$

This is crucial for Inverse Propensity Weighting.

2. No Interaction Effects

The hypothesis assumes no interaction between examination and relevance:

A highly relevant item is equally likely to be clicked at any position (conditional on examination)
A position-1 item is equally likely to be clicked at any relevance level (conditional on relevance)

In reality, this is violated (e.g., Trust Bias, outlier effects).

3. Separability of Factors

Because of independence, we can:

Estimate position bias from clicks alone (with sufficient diverse data)
Estimate relevance from data that varies position (e.g., randomized experiments)
Fit separate models for examination and relevance

Key Assumptions

1. Examination is Binary

An item is either examined or not; there’s no “partial examination.”

Violation: Users might scan an item title (partial exam) without clicking body text.

2. Examination Only Depends on Position

$P (Exam_{d, k}) = P (Exam_{k})$

The probability of examining item $d$ at position $k$ is the same for all $d$ .

Violations:

Cascading Position Bias: Examination depends on previous items’ relevance
Outlier Bias: Distinctive items attract attention regardless of position
Surrounding Item Bias: Examination of item $k$ depends on neighbors

3. Relevance is Independent of Position

$P (Rel_{d} ∣ q, k) = P (Rel_{d} ∣ q)$

Whether item $d$ is relevant doesn’t change based on where it’s displayed.

Violations:

Trust Bias: Items appear more relevant when displayed at high positions
Context Bias: Relevance judgment depends on surrounding items

4. No Examination = No Click

Users cannot accidentally click unseen items, and there are no random clicks.

Violations:

Misdirected clicks (clicking the wrong item)
Spam clicks
Bot activity

Estimation Under the Hypothesis

Expectation-Maximization

The hypothesis enables the EM algorithm for Click Models:

E-step: Given current estimates of $exam_{k}$ and $rel_{d}$ , infer the latent examination events from clicks.

M-step: Update estimates of $exam_{k}$ and $rel_{d}$ to maximize data likelihood.

Probabilistic Inference

With the examination event as a latent variable:

$P (Exam_{k} = 1∣ Click = 1) = \frac{P ( Click = 1∣ Exam = 1 ) \cdot P ( Exam = 1 )}{P ( Click = 1 )}$

This allows inference:

From clicks → estimates of examination
From examination → estimates of relevance

Violations in Practice

1. Cascading Behavior

Users examine items sequentially and stop when satisfied:

$P (Exam_{k}) = \prod_{j < k} (1 - P (Rel_{j}))$

Impact: IPS using PBM assumptions becomes severely biased.

Solution: Use Cascading Position Bias models instead.

2. Trust Bias

Items at top positions get more clicks than justified:

$P (Click) = P (Exam) \cdot [P (Rel) + (1 - P (Rel)) \cdot τ_{k}]$

Introduces false positives at high ranks.

Impact: Learned models overestimate relevance of top-ranked items.

Solution: Model trust bias explicitly.

3. Context Effects

Examination and relevance are not independent:

Outlier items attract more attention
Relevant neighbors make an item harder to notice
Visual distinctiveness increases examination

Impact: Separability assumption fails; click models become less identifiable.

Solution: Include context features in the model.

Identifiability

A critical issue: under the examination hypothesis, identifiability is not guaranteed.

For certain data distributions, multiple combinations of $exam$ and $rel$ equally well explain the clicks:

Data: Clicks on
Search (all items at various positions)

Explanation 1:
  exam = [1.0, 0.9, 0.7, ...]
  rel = [0.8, 0.9, 0.6, ...]
  Likelihood: L₁

Explanation 2:
  exam = [1.0, 1.0, 1.0, ...]
  rel = [0.8, 0.81, 0.42, ...]
  Likelihood: L₁ (same!)

Both explain the data equally well!

Why? If items always appear at similar positions, we can’t distinguish:

“Item A gets clicks because position 1 is examined” (high exam, low rel)
“Item A gets clicks because it’s relevant” (low exam, high rel)

Consequence: Model retraining might converge to different solutions.

Connections

Foundation: Basis for Click Models, Position-Based Click Model
Violation: Cascading Position Bias, Trust Bias, Outlier Bias
Estimation: Inverse Propensity Weighting assumes hypothesis
Alternative: Doubly Robust Estimation if hypothesis is violated

Study Notes

Explorer

Examination Hypothesis

Examination Hypothesis

Definition

Intuition

Mathematical Formulation

Decomposition

Logistic Formulation

In Click Models

Implications

1. Invertibility of Position Bias

2. No Interaction Effects

3. Separability of Factors

Key Assumptions

1. Examination is Binary

2. Examination Only Depends on Position

3. Relevance is Independent of Position

4. No Examination = No Click

Estimation Under the Hypothesis

Expectation-Maximization

Probabilistic Inference

Violations in Practice

1. Cascading Behavior

2. Trust Bias

3. Context Effects

Identifiability

Connections

Appears In

Graph View

Table of Contents

Backlinks