Model building strategy for logistic regression: purposeful selection

Ann Transl Med. 2016 Mar;4(6):111. doi: 10.21037/atm.2016.02.15.

Abstract

Logistic regression is one of the most commonly used models to account for confounders in medical literature. The article introduces how to perform purposeful selection model building strategy with R. I stress on the use of likelihood ratio test to see whether deleting a variable will have significant impact on model fit. A deleted variable should also be checked for whether it is an important adjustment of remaining covariates. Interaction should be checked to disentangle complex relationship between covariates and their synergistic effect on response variable. Model should be checked for the goodness-of-fit (GOF). In other words, how the fitted model reflects the real data. Hosmer-Lemeshow GOF test is the most widely used for logistic regression model.

Keywords: Hosmer-Lemeshow; Logistic regression; R; interaction; linearity; purposeful selection.