The Equivalence of SF-36 Summary Health Scores Estimated Using Standard and Country-Specific Algorithms in 10 Countries: Results from the IQOLA Project

doi:10.1016/S0895-4356(98)00108-5

Journal of Clinical Epidemiology

Volume 51, Issue 11, November 1998, Pages 1167-1170

https://doi.org/10.1016/S0895-4356(98)00108-5 Get rights and content

Abstract

Data from general population surveys (n = 1771 to 9151) in nine European countries (Denmark, France, Germany, Italy, the Netherlands, Norway, Spain, Sweden, and the United Kingdom) were analyzed to test the algorithms used to score physical and mental component summary measures (PCS-36/MCS-36) based on the SF-36 Health Survey. Scoring coefficients for principal components were estimated independently in each country using identical methods of factor extraction and orthogonal rotation. PCS-36 and MCS-36 scores were also estimated using standard (U.S.-derived) scoring algorithms, and results were compared. Product-moment correlations between scores estimated from standard and country-specific scoring coefficients were very high (0.98 to 1.00) for both physical and mental health components in all countries. As hypothesized for orthogonal components, correlations between physical and mental components within each country were very low (0.00 to 0.12) for both estimation methods. Mean scores for PCS-36 differed by as much as 3.0 points across countries using standard scoring, and mean scores for MCS-36 differed across countries by as much as 6.4 points. In view of the high degree of equivalence observed within each country, using standard and country-specific algorithms, we recommend use of standard scoring algorithms for purposes of multinational studies involving these 10 countries.

Introduction

The scoring and interpretation of physical and mental summary measures from the SF-36 Health Survey has been shown to achieve a number of advantages 1, 2, 3, 4. Compared with the eight SF-36 scales, scores for physical and mental health summary measures can be estimated with smaller confidence intervals, expand the range of health states measured, and greatly increase the number of levels distinguished, in comparison with any one of the eight scales [3]. While the summary measures do not reproduce all of the reliable variance in the eight-scale SF-36 profile, they have the advantage of reducing the number of statistical comparisons required when analyzing SF-36 data. Empirical tests suggest that they do so without a substantial loss of information 3, 5.

Construction of the summary measures in the United States was based on a number of findings. First, two physical and mental factors were shown to account for 80% to 85% of the reliable variance in the eight SF-36 scales in patient and general populations 3, 4. As hypothesized, scales measuring physical functioning, role limitations due to physical health, bodily pain, and general health correlated highest with the physical component and lowest with the mental component, whereas mental health, role limitations due to emotional problems, social functioning, and vitality correlated highest with the mental factor and lowest with the physical. This pattern of correlations between scales and summary component scores was also quite robust, suggesting that each summary has a comparable interpretation across population subgroups 3, 4, 5. The summary measures have also been shown to be valid in discriminating between physical and mental health status and outcomes in both cross-sectional and longitudinal tests 3, 5, 6, 7, 8.

The two-component SF-36 model of health was first described in the United States 3, 4, 5. It has been replicated across large general population samples from nine Western European countries (Denmark, France, Germany, Italy, the Netherlands, Norway, Spain, Sweden, and the United Kingdom) 6, 9. These replications suggest that it may be feasible to score and interpret physical and mental health summary measures in these countries. It is not clear, however, how such summary scores should be estimated. In this study, we compare country-specific versus standard (U.S.-derived) scoring algorithms for the SF-36 physical and mental health summary measures to evaluate their equivalence and explore the implications of using one scoring method or the other in international analyses.

Section snippets

Data

Data come from 10 general population surveys, which have been described in detail elsewhere [10]. In brief, samples were selected to be nationally representative in nine countries (Denmark, France, Germany, Italy, the Netherlands, Norway, Spain, the United Kingdom, and the United States). Data from Sweden were collected through seven mail surveys conducted in various regions of Sweden [11]. Self-administration of the SF-36 was used in six countries; the exceptions were Italy (50% personal

Analyses

The correlation between each pair of SF-36 summary components scored using standard (U.S.) and country-specific algorithms was examined to test their equivalence in each country. We hypothesized that these correlations would be positive and very high and accepted correlations greater than 0.90 as satisfactory evidence of equivalence. In addition, we examined correlations between physical and mental summary components that were scored using the same methods (e.g., PCS-36/MCS-36); we hypothesized

Results

Correlations between the SF-36 summary measures scored using standard (U.S.) scoring algorithms and country-specific scoring algorithms were very high, ranging from 0.980 to 0.998 across countries for the PCS-36/CPCS-36 and 0.984 to 0.998 for the MCS-36/CMCS-36 (Table 2). Thus, the correlational standard of equivalence was satisfied for both physical and mental health summary measures in all countries. Correlations between SF-36 physical and mental summary measures scored using standard

Discussion

For both physical and mental health, we observed substantial relative agreement between SF-36 summary measures estimated using standard and country-specific scoring algorithms in all countries. Specifically, product-moment correlations between SF-36 summary measures scored using standard (U.S.) scoring and country-specific scoring ranged from 0.980 to 0.998 across countries. On the basis of the strength of these findings, we recommend use of standard scoring, using U.S.-derived scoring

References (15)

J.E. Ware et al.
The factor structure of the SF-36 Health Survey in ten countriesResults from the IQOLA Project
J Clin Epidemiol
(1998)
B. Gandek et al.
Methods for validating and norming translations of health status questionnairesThe IQOLA project approach
J Clin Epidemiol
(1998)
B. Gandek et al.
Tests of data quality, scaling assumptions, and reliability of the SF-36 in eleven countriesResults from the IQOLA Project
J Clin Epidemiol
(1998)
S. Fukuhara et al.
Psychometric and clinical tests of validity of the Japanese SF-36 Health Survey
J Clin Epidemiol
(1998)
J.E. Ware et al.
The MOS 36-Item Short-Form Health Survey (SF-36)I. Conceptual framework and item selection
Med Care
(1992)
J.E. Ware et al.
SF-36 Health Survey Manual and Interpretation Guide
(1993)
J.E. Ware et al.
SF-36 Physical and Mental Health Summary ScalesA User’s Manual
(1994)

There are more references available in the full text version of this article.

Cited by (517)

Recovery and functional outcome after radial nerve palsy in adults with a humeral shaft fracture: a multicenter prospective case series
2023, JSES International
The consequences of radial nerve palsy associated with a humeral shaft fracture are unclear. The aim of this study was to examine the functional recovery of radial nerve palsy, at presentation or postoperatively, in patients with a humeral shaft fracture.
Data from patients who participated in the HUMeral shaft fractures: measuring recovery after operative versus non-operative treatment (HUMMER) study, a multicenter prospective cohort study including adults with a closed humeral shaft fracture Arbeitsgemeinschaft für Osteosynthesefragen (AO) type 12A or 12B, and had radial nerve palsy at presentation or postoperatively, were extracted from the HUMMER database. The primary outcome measure was clinically assessed recovery of motor function of the radial nerve. Secondary outcomes consisted of treatment, functional outcome (Disabilities of the Arm, Shoulder, and Hand and Constant–Murley Score), pain level, quality of life (Short Form-36 and EuroQoL-5D-3L), activity resumption, and range of motion of the shoulder and elbow joint at 12 months after trauma.
Three of the 145 nonoperatively treated patients had radial nerve palsy at presentation. One recovered spontaneously and 1 after osteosynthesis. Despite multiple surgical interventions, the third patient had no recovery after entrapment between fracture fragments. Thirteen of the 245 operatively treated patients had radial nerve palsy at presentation; all recovered. Nine other patients had postoperative radial nerve palsy; 8 recovered. One had ongoing recovery at the last follow-up, after nerve release and suture repair due to entrapment under the plate. At 12 months, the functional outcome scores of all patients suggested full recovery regarding functional outcome, pain, quality of life, activity resumption, and range of motion.
Radial nerve palsy in patients with a humeral shaft fracture at presentation or postoperatively functionally recovers in 94% and 89%, respectively.
Quality of life and functional limitations in persons with epilepsy
2023, Epilepsy Research
Epilepsy can reduce quality of life (QOL), functionality, and social participation, but these effects have not been adequately quantified in large, population-based, controlled studies. We sought to evaluate the impact of epilepsy on patients’ QOL and employment outcomes.
In this cross-sectional study we used nationally representative, pooled data from the Medical Expenditure Panel Survey (MEPS) household component files for 2010–2018. MEPS is a population-based survey of U.S. community-dwelling persons. We included respondents with condition file records for epilepsy. We also analyzed respondents with records for seizure. The primary outcomes were short form-12 physical and mental health scores. Secondary outcomes included self-rated health status, employment status, educational attainment, school/household/work limitations, and missed workdays. We compared these outcomes between persons with epilepsy (PWE) and age- and gender-matched controls.
We identified 1078 people with epilepsy, 2344 seizure cases, and 3422 cases of either condition (persons with epilepsy and/or seizures). Epilepsy was associated with a decrease of − 4.0 (95% CI: −5.1 to −2.8) points in SF-12 physical health scores and − 3.1 (95% CI: −4.2 to −1.9) in SF-12 mental health scores. Epilepsy was also associated with decreases in the likelihood of reporting good/very good/excellent health status (−13.3 [95% CI: −16.1 to −10.4] percentage points). Epilepsy was also associated with adverse employment-related outcomes. Specifically, PWE were 17.9 (95% CI: 14.3–21.4) percentage points more likely to report that they had work or household limitations. The associations between outcomes and epilepsy were, in most cases, larger than those between outcomes and other common, chronic conditions.
Epilepsy is associated with worse quality of life and employment-related outcomes. Interventions should aim to improve functioning and patients’ ability to maintain employment.
Health-related quality of life in hoarding: A comparison to chronic conditions with high disease burden
2022, Journal of Psychiatric Research
Citation Excerpt :
Standardization and weighted aggregation of the eight multi-item domains results in two summary scales: the Physical Component Summary (PCS) and the Mental Component Summary (MCS). Methodology for calculation of component summary scores is described elsewhere (Ware et al., 1998). The QoL of individuals with CHS was compared to that of those diagnosed with other conditions for which data were available in the BHR and were known to have a high disease burden.
Hoarding disorder often results in debilitating functional impairment and may also compromise health-related quality of life (QoL). This study investigated the association between hoarding behavior and QoL relative to six highly impairing medical and psychiatric disorders in a sample of 20,722 participants enrolled in the internet-based Brain Health Registry. Nearly 1 in 8 participants (12.2%) endorsed clinically relevant hoarding symptoms (CHS). In separate multivariable linear regression models, hoarding was more strongly associated with mental QoL than diabetes ( $S t a n d a r d i z e d β$ = −0.21, 95% CI: [-0.22, −0.20] vs. −0.01 [-0.02, 0.0]), heart disease (−0.22 [-0.23, −0.20] vs. 0.00 [-0.02, 0.01]), chronic pain (−0.18 [-0.19, −0.16] vs. −0.12 [-0.13, −0.10]), post-traumatic stress disorder (PTSD; −0.20 [-0.22, −0.19] vs. −0.07 [-0.09, −0.06]), and substance use disorder (SUD; −0.21 [-0.23, −0.20] vs. −0.04 [-0.05, −0.03]). Similarly, CHS was more strongly negatively associated with physical QoL than diabetes (−0.11 [-0.10, −0.12] vs. −0.08 [-0.06, −0.09]), major depressive disorder (−0.09 [-0.10, −0.08] vs. −0.05 [-0.06, 0.03]), PTSD (−0.11 [-0.12, −0.10] vs. −0.08 [-0.09, −0.07]), and SUD (−0.12 [-0.13, −0.09] vs. −0.01 [-0.02, 0.00]). Higher hoarding severity was associated with reductions in both mental ( $S t a n d a r d i z e d β$ = −0.28, ΔR² = 0.08, p < 0.0001) and physical ( $β$ = −0.12, ΔR² = 0.02, p < 0.0001) QoL, though the strength of the relationship between hoarding symptoms and QoL varied with depression severity. Efforts to improve the overall QoL and well-being of those with CHS are needed.
Evolution of Bowel Complaints after Laparoscopic Endometriosis Surgery: A 1497 Women Comparative Study
2022, Journal of Minimally Invasive Gynecology
To assess to what degree can digestive symptoms improve after endometriosis surgery for different localizations.
A comparative retrospective study employing data prospectively recorded in the North-West Inter-Regional Female Cohort for Patients with Endometriosis (CIRENDO) from June 2009 to November 2018.
Two referral centers.
A total of 1497 women undergoing surgery because of pelvic endometriosis were divided into 3 groups: superficial endometriosis (Group 1, n = 396), deep endometriosis sparing the bowel (Group 2, n = 337), and deep endometriosis involving the bowel (Group 3, n = 764).
Surgery for endometriosis.
Preoperative and postoperative gastrointestinal symptoms were evaluated with standardized questionnaires, including the Gastrointestinal Quality of Life Index (GIQLI) and Knowles-Eccersley-Scott-Symptom questionnaire (KESS). The degree of postoperative improvement in digestive symptoms was compared between the groups. The women in Group 3 were significantly symptomatic in terms of cycle-related gastrointestinal symptoms and scores of standardized questionnaires GIQLI and KESS. According to the 1-year postoperative evaluation, women in Group 3 experienced the most significant improvement in their gastrointestinal symptoms.
Women with severe bowel symptoms and deep endometriosis infiltrating the bowel should be informed about the high probability of symptom improvement after the removal of bowel nodules. Conversely, in women without deep endometriosis, postoperatively, there is less improvement in baseline digestive complaints.
COMPREHENSIVE EVALUATION OF HYDROXYUREA TREATMENT IMPACT ON QUALITY OF LIFE IN ADULT PATIENTS WITH SICKLE CELL DISEASE AND SICKLE CELL DISEASE WITH BETA-THALASSEMIA: A DUAL ASSESSMENT USING CHQ-PF50 IN CHILDREN AND SF-36 QUESTIONNAIRES IN ADULTS
2024, Community Practitioner
Long-term health-related quality-of-life and psychosocial outcomes after uterus transplantation: A 5-year follow-up of donors and recipients
2024, Human Reproduction

View all citing articles on Scopus

View full text

Original ArticleThe Equivalence of SF-36 Summary Health Scores Estimated Using Standard and Country-Specific Algorithms in 10 Countries: Results from the IQOLA Project

Abstract

Introduction

Section snippets

Data

Analyses

Results

Discussion

J Clin Epidemiol

J Clin Epidemiol

J Clin Epidemiol

J Clin Epidemiol

The MOS 36-Item Short-Form Health Survey (SF-36)I. Conceptual framework and item selection

Med Care

SF-36 Health Survey Manual and Interpretation Guide

SF-36 Physical and Mental Health Summary ScalesA User’s Manual

Original Article
The Equivalence of SF-36 Summary Health Scores Estimated Using Standard and Country-Specific Algorithms in 10 Countries: Results from the IQOLA Project