How do AIC and BIC help with model selection, and why can't I just use R-squared?
I'm studying CFA Level II and confused about why we need information criteria like AIC and BIC when R-squared already tells us how well a model fits. What do these criteria add, and when would they give different recommendations?
R-squared measures goodness of fit — how much variance your model explains. The problem: R-squared ALWAYS increases when you add variables, even useless ones. This makes it a terrible tool for model selection because it rewards complexity without penalizing overfitting.
The Overfitting Problem:
Adding a random noise variable to your model will increase R-squared slightly (in-sample) but degrade out-of-sample prediction. You need a metric that balances fit against complexity.
AIC (Akaike Information Criterion):
AIC = 2k - 2ln(L)
Where k = number of parameters, L = maximum likelihood. Lower AIC is better. The 2k term penalizes model complexity.
BIC (Bayesian Information Criterion):
BIC = k*ln(n) - 2ln(L)
Where n = number of observations. Lower BIC is better. BIC penalizes complexity MORE heavily than AIC because ln(n) > 2 for n > 8 (which is always true in practice).
Key Differences:
| Criterion | Penalty for Complexity | Tends to Select | Best For |
|---|---|---|---|
| R-squared | None | Most complex model | Never use alone for selection |
| Adjusted R-squared | Mild penalty | Slightly simpler models | Quick comparison |
| AIC | Moderate (2k) | Best predictive model | Forecasting accuracy |
| BIC | Strong (k*ln(n)) | Most parsimonious model | Finding the 'true' model |
When They Disagree:
AIC and BIC often recommend different models. AIC prefers slightly more complex models (better predictions), while BIC prefers simpler ones (more interpretable).
Practical Example:
You're modeling quarterly corporate earnings growth using potential predictors: GDP growth, interest rates, credit spreads, oil prices, and consumer confidence.
| Model | Variables | R-squared | Adj R-sq | AIC | BIC |
|---|---|---|---|---|---|
| A | GDP only | 0.35 | 0.34 | -120 | -116 |
| B | GDP + Rates | 0.42 | 0.40 | -128 | -122 |
| C | GDP + Rates + Spreads | 0.44 | 0.41 | -130 | -121 |
| D | All 5 variables | 0.46 | 0.40 | -126 | -112 |
- R-squared picks Model D (always picks the fullest model)
- AIC picks Model C (best balance of fit and parsimony)
- BIC picks Model B (most parsimonious adequate model)
For forecasting, you'd likely go with Model C (AIC). For understanding the key drivers, Model B (BIC) provides a cleaner story.
Exam Tip: If a CFA exam question asks which model to select using a given criterion, simply pick the model with the LOWEST AIC or BIC value. No calculation needed — just comparison.
Practice model selection problems in our CFA Level II question bank.
Master Level II with our CFA Course
107 lessons · 200+ hours· Expert instruction
Related Questions
How do I map a CFA Ethics vignette to the right standard?
When does a duty to clients override pressure from an employer?
Do conflicts have to be disclosed before making a recommendation?
Why do CFA Ethics answers focus so much on the action taken?
What does a high-water mark actually do in a hedge fund fee calculation?
Join the Discussion
Ask questions and get expert answers.