When can categorical variables be treated as continuous? A comparison of robust continuous and categorical SEM estimation methods under suboptimal conditions

Mijke Rhemtulla; Patricia É Brosseau-Liard; Victoria Savalei

doi:10.1037/a0029315

Psychol Methods. 2012 Sep;17(3):354-73. doi: 10.1037/a0029315. Epub 2012 Jul 16.

Authors

Mijke Rhemtulla¹, Patricia É Brosseau-Liard, Victoria Savalei

Affiliation

¹ Center for Research Methods and Data Analysis, University of Kansas, 1425 Jayhawk Boulevard, Watson Library 470, Lawrence, KS 66045, USA. mijke@ku.edu

PMID: 22799625

Abstract

A simulation study compared the performance of robust normal theory maximum likelihood (ML) and robust categorical least squares (cat-LS) methodology for estimating confirmatory factor analysis models with ordinal variables. Data were generated from 2 models with 2-7 categories, 4 sample sizes, 2 latent distributions, and 5 patterns of category thresholds. Results revealed that factor loadings and robust standard errors were generally most accurately estimated using cat-LS, especially with fewer than 5 categories; however, factor correlations and model fit were assessed equally well with ML. Cat-LS was found to be more sensitive to sample size and to violations of the assumption of normality of the underlying continuous variables. Normal theory ML was found to be more sensitive to asymmetric category thresholds and was especially biased when estimating large factor loadings. Accordingly, we recommend cat-LS for data sets containing variables with fewer than 5 categories and ML when there are 5 or more categories, sample size is small, and category thresholds are approximately symmetric. With 6-7 categories, results were similar across methods for many conditions; in these cases, either method is acceptable.
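
The data-generating idea behind a study of this kind (continuous underlying variables cut into ordered categories by thresholds) can be illustrated with a minimal sketch. This is not the authors' simulation design: the loading of 0.7, the equally spaced symmetric thresholds, the two-indicator setup, and the function names `simulate_ordinal` and `pearson` are all assumptions chosen for illustration. It only shows why treating coarsely categorized variables as continuous tends to attenuate correlations, and it does not fit any SEM.

```python
import numpy as np


def simulate_ordinal(n, loading, n_categories, seed=0):
    """Generate two indicators of one normal factor, then categorize them.

    Hypothetical setup: standardized indicators with a common loading and
    symmetric, equally spaced thresholds (an assumption, not the paper's).
    """
    rng = np.random.default_rng(seed)
    factor = rng.standard_normal(n)
    noise = rng.standard_normal((n, 2))
    # Underlying continuous variables with unit variance.
    latent = loading * factor[:, None] + np.sqrt(1 - loading**2) * noise
    # n_categories - 1 interior cut points, symmetric around zero.
    thresholds = np.linspace(-2.5, 2.5, n_categories + 1)[1:-1]
    # Category codes 0 .. n_categories - 1.
    ordinal = np.digitize(latent, thresholds)
    return latent, ordinal


def pearson(x):
    """Pearson correlation between the two columns of x."""
    return np.corrcoef(x, rowvar=False)[0, 1]


if __name__ == "__main__":
    for k in (2, 3, 5, 7):
        latent, ordinal = simulate_ordinal(n=200_000, loading=0.7, n_categories=k)
        print(k, round(pearson(latent), 3), round(pearson(ordinal), 3))
```

Under these assumptions the underlying correlation is 0.7² = 0.49. With 2 categories split at zero, the Pearson correlation of the categorized variables should fall to roughly (2/π)·arcsin(0.49) ≈ 0.33, and it should move back toward 0.49 as the number of categories grows, which is consistent with the abstract's point that continuous methods become more acceptable with more categories.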

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Bias
Factor Analysis, Statistical*
Humans
Least-Squares Analysis*
Likelihood Functions*
Models, Statistical*
Sample Size