Multicollinearity in prognostic factor analyses using the EORTC QLQ-C30: identification and impact on model selection

Stat Med. 2002 Dec 30;21(24):3865-84. doi: 10.1002/sim.1358.

Abstract

Clinical and quality of life (QL) variables from an EORTC clinical trial of first line chemotherapy in advanced breast cancer were used in a prognostic factor analysis of survival and response to chemotherapy. For response, different final multivariate models were obtained from forward and backward selection methods, suggesting a disconcerting instability. Quality of life was measured using the EORTC QLQ-C30 questionnaire completed by patients. Subscales on the questionnaire are known to be highly correlated, and therefore it was hypothesized that multicollinearity contributed to model instability. A correlation matrix indicated that global QL was highly correlated with 7 out of 11 variables. In a first attempt to explore multicollinearity, we used global QL as dependent variable in a regression model with other QL subscales as predictors. Afterwards, standard diagnostic tests for multicollinearity were performed. An exploratory principal components analysis and factor analysis of the QL subscales identified at most three important components and indicated that inclusion of global QL made minimal difference to the loadings on each component, suggesting that it is redundant in the model. In a second approach, we advocate a bootstrap technique to assess the stability of the models. Based on these analyses and since global QL exacerbates problems of multicollinearity, we therefore recommend that global QL be excluded from prognostic factor analyses using the QLQ-C30. The prognostic factor analysis was rerun without global QL in the model, and selected the same significant prognostic factors as before.

Publication types

  • Comparative Study

MeSH terms

  • Antineoplastic Agents / therapeutic use
  • Breast Neoplasms / drug therapy
  • Clinical Trials as Topic / methods*
  • Doxorubicin / therapeutic use
  • Female
  • Humans
  • Paclitaxel / therapeutic use
  • Principal Component Analysis / methods*
  • Prognosis*
  • Quality of Life / psychology
  • Regression Analysis*
  • Surveys and Questionnaires

Substances

  • Antineoplastic Agents
  • Doxorubicin
  • Paclitaxel