The Predictive Information Index (PII) quantifies how much outcome-relevant information is retained when reducing a set of predictors (e.g., items) to a composite score.
One version of PII, the variance-based form, is defined as:
\text{PII}_{v} = 1 - \frac{\text{Var}(\hat{Y}_{\text{Full}} - \hat{Y}_{\text{Score}})}{\text{Var}(\hat{Y}_{\text{Full}})}
Where: - \(\hat{Y}_{\text{Full}}\): predictions from a full model (e.g., all items or predictors) - \(\hat{Y}_{\text{Score}}\): predictions from a reduced score (e.g., mean or sum)
A PII of 1 means no predictive information was lost. A PII near 0 means the score loses most predictive information.
pii()
library(piiR)
# Simulate an outcome and two prediction vectors
set.seed(123)
y <- rnorm(100) # observed outcome
full <- y + rnorm(100, sd = 0.3) # full-model predictions
score <- y + rnorm(100, sd = 0.5) # score-based predictions
# Compute the three PII variants
pii(y, score, full, type = "r2") # variance explained
## [1] 0.7248883
## [1] -1.690292
## [1] 0.6619032