Linear
Quadratic
A quadratic model seems like it might be a better fit.
But how can we measure that?
Deviance (
* Note: a common definition of deviance requires a comparison to a ‘saturated’ model. For clarity, we use this simpler definition.
Underfit
Overfit
Training data
Fit the model on a subset of the data (e.g. 50%)
Test data
Asses model fit on the held-out portion of the data
Interpretation 1
Penalize deviance score for each added parameter by some ‘reasonable’ value.
Interpretation 2
Model the average difference in deviance between training and test data.
Assumptions:
Pick the model with the lowest value
WAIC(M1) = 209.0; WAIC(M2) = 208.1
→ M2 is the winner
Report several models along with values
Multi-model table showing estimates for different combinations of coefficients, along with WAIC
Average predictions across models
Simultaneous posterior predictions of new data from all models, weighted by WAIC
Figures by Peter McMahan (source code)
Still from The Hudsucker Proxy (1994)
David Byrne by Deborah Feingold