Back to CFA Level II

CFA Level II · Cheat Sheet

Quantitative Methods

Tip: Use your browser's print function (Ctrl+P / Cmd+P) to save as PDF for offline study.

QUANTITATIVE METHODS: MULTIPLE REGRESSION CHEAT SHEET

CLRM ASSUMPTIONS & VIOLATIONS

|---|---|---|---|---|---|

---

MODEL FIT & SIGNIFICANCE TESTS

R-squared: SSR/SST = 1 − SSE/SST Adjusted R-squared: 1 − [(1 − R²)(n−1)/(n−k−1)]

  • *Use to compare models with different k*

F-statistic: (SSR/k) / (SSE/(n−k−1)) = MSR/MSE

  • *H₀: all slope coefficients = 0*
  • *Significant F ≠ all variables individually significant (see Q1)*

t-statistic: (b̂ − b₀) / SE(b̂), df = n−k−1

  • *Significant at 5% if p < 0.05 or |t| > ~2 (large n)*

---

HETEROSKEDASTICITY

|---|---|

---

SERIAL CORRELATION

|---|---|

---

MULTICOLLINEARITY

|---|---|

---

COMMON TRAPS

|---|---|

AssumptionViolatesEffect on CoefficientsEffect on InferenceDetectionFix
1. Linear in parameters
2. No perfect collinearityMulticollinearityUnbiased↑ Std. errorsVIF, t vs F patternRemove/combine variables
3. E(ε) = 0
4. Constant varianceHeteroskedasticityUnbiased↓ Reliability of SE, t, pBreusch-Pagan, residual plotWhite (HC) SE, transform
5. No serial correlationSerial CorrelationUnbiased↓ Reliability of SE, t, pDurbin-Watson, ACFNewey-West SE, lag variables
6. Normality of εUnbiased (large n)↓ InferenceQ-Q plot, histogramLarge sample, robust methods
CueAnswer
DefinitionVariance of ε varies across observations
Root causesConditional on variable size; omitted variables
ConsequenceSE typically understated → inflated t-stats → spurious significance
Durbin-Watson?NO—DW tests serial correlation
Breusch-Pagan?YES—regress squared residuals on X's; chi-square test
Visual testPlot residuals or squared residuals vs fitted; fan shape = heteroskedasticity
FixWhite (HC) std. errors; log/root transform; robust regression
CueAnswer
DefinitionCov(εᵢ, εⱼ) ≠ 0 for i ≠ j; common in time series
Durbin-WatsonDW ≈ 2(1 − r), where r = first-lag correlation
DW = 2No serial correlation
DW → 0Positive serial correlation
DW → 4Negative serial correlation
ConsequenceSE typically understated → inflated t-stats
FixNewey-West (HAC) SE; include lagged Y or residuals; generalized LS
CueAnswer
DefinitionTwo or more X's highly correlated with each other
Effect on coefficientsUnbiased but unstable
Effect on SE↑↑ Standard errors inflate
Classic patternHigh F-stat + low individual t-stats (see Q1)
VIF formulaVIF_j = 1/(1 − R²_j) where R²_j from regressing X_j on other X's
VIF thresholdVIF > 5 → concern; VIF > 10 → severe
Other signsLarge coefficient swings when variables added/removed; high R² but few sig. t's
FixRemove redundant variable; combine into composite; domain knowledge
MistakeReality
Adjusted R² < R² means misspecificationAlways true by design—not a problem
High F-stat means all variables matterNo—F tests joint hypothesis; individual t's show relevance
Multicollinearity biases coefficientsFalse—it inflates SE, not coefficients
Heteroskedasticity biases coefficientsFalse—it distorts SE, t-stats, and p-values
Serial correlation detected by Breusch-PaganWrong test—BP is for heteroskedasticity; use DW or ACF
---

QUICK DECISION TREE

High F, low individual t-stats? → Multicollinearity or irrelevant variable Residuals fan-shaped vs fitted? → Heteroskedasticity (use White SE) DW << 2? → Positive serial correlation (use Newey-West SE) VIF > 5 for a variable? → Consider removing/combining Need to compare two models with different # of predictors? → Use adjusted R², not R²

Aligned to the CFA Institute Level II curriculum.

Make this cheat sheet yours

Personalize this sheet — focus it however you study, or build one from the exact questions you keep getting wrong.

Sign up free to create a personalized cheat sheet.