Introduction

Neck pain is a highly prevalent condition with about two thirds of the adult population affected at some time in their lives [6]. Unspecific neck pain usually resolves within days, but in 10% neck pain recurs or becomes chronic. Quantification of neck pain is necessary not only for the evaluation of current and future therapies, but also for assessing clinical outcomes of impairment and disability [2]. Although several measures of generalised pain and disability were available (i.e. Oswestry Disability Questionnaire and Pain Disability Index) [13], there was no well-developed measure that dealt specifically with neck pain until 1999.

In order to provide a new comprehensive measure of neck pain and disability the neck pain and disability scale (NPAD) has been developed by Wheeler and colleagues specifically to be used in this condition [15]. The NPAD measures problems with neck movements, neck pain intensity, effect of neck pain on emotion and cognition, and the level of interference in daily life activities. The NPAD has been found easy to complete for patients and simple to score, and it provides a validated measure to evaluate outcomes in patients with neck pain [9, 15]. Originally developed in the USA to date, validated French [16], Portuguese [5], and Turkish [2] translations are available showing adequate validity and reliability. The NPAD has been regularly used in previous studies [11].

The aim of this study was to develop a German version of the NAPD (NPAD-d) and evaluate its internal consistency as well as its construct and discriminant validity in a primary care setting.

Methods

Study design

This was a cross-sectional survey including patients from a GP setting in Germany with at least one onset of neck pain between March 2005 and April 2006. The study was approved by the local research ethics committee.

Instruments

Participants received a comprehensive self-administered questionnaire covering multiple domains such as socio-demographic information, anxiety, depression, social support, and neck pain. Participants received the questionnaire from primary care physicians together with written instructions. Due to budgetary constraints no mail or telephone follow-up was done when persons did not or incompletely return the questionnaire.

Neck and pain disability scale [9, 15]

The NPAD is a 20-item measure that was specifically developed for patients with neck pain. It measures the intensity of pain; its interference with vocational, recreational, social, and functional aspects of living; and the extent of associated emotional factors. Patients responded to each item by marking along a 10 cm visual analogue scale. Item scores range from 0 to 5, and the total score (possible range 0–100) is the sum of the item scores. A valid NPAD score can be generated if no more than 15% of the items are missing. The NPAD has been shown to have validity in comparison to other self-reported pain measures [9] as well as supporting constructs of mood and neuroticism [15]. Previous studies have shown that NPAD items clustered into three [5, 16] or four to five [9] subscales.

Development of the neck and pain disability scale German version

Originally developed in an Anglo-American context, the rather simply worded NAPD was culturally adapted and translated into German. Two authors (MS, WH) independently translated the original version with the help of an English native speaker and critically reviewed translation versions reciprocally. All authors in the group resolved disagreements if any through discussion. Due to the simple wording of questionnaire and the unitary structure of the questions we abstained from back translation of the questionnaire (Appendix 1). According to the original version, item scores range from 0 to 5, and the total score (possible range 0–100) is the sum of the item scores.

Hospital anxiety and depression scale [3, 10, 17]

The hospital anxiety and depression scale (HADS) is a widely used short self-assessment questionnaire, especially developed for physically ill patients. Its items mainly ask for psychological manifestations of (generalised) anxiety and depressive mood. Each of the two subscales consists of seven items. Possible subscale scores range from 0 to 21.

Social support questionnaire [8]

The 14-item short form (five-point scale from “relevant” to “not relevant”) assesses perceived social support (F-SozU). The items refer to different aspects of experienced support (emotional support, instrumental support and social integration), resulting in a global scale with higher scores indicating better social support.

The German versions of the HADS and the FSozU have been previously validated and were considered to be suitable for research in physically ill patients [3, 8, 10, 17].

Recruitment of patients

As part of a project on the quality of medical care in general practice (MedViP), a network of 104 general practices has been established [14]. Fifteen of these within a radius of 30 km around Göttingen were selected for participation and provided anonymised electronic patient data (date of birth, sex, diagnosis). Patients were included in a list of potentially eligible persons if at least one consultation because of neck pain was documented in the electronic patient record during the period from March 2005 to April 2006. All GPs were asked to exclude patients from a list of 1,308 potentially eligible persons, if they had their neck pain consultation because of a new trauma, were terminally ill, suffered from cancer, were in need of nursing care or had severe cognitive impairment. Additionally, patients seen by locums only, patients who had moved to a region outside of the study area or who were not able to speak German were excluded from the study. Eighty persons did not fulfil the inclusion criteria; 1,228 persons were invited to participate in the study. Of those, 745 were not willing to participate in the study. In fact, 483 persons gave their informed consent to participate and received the comprehensive questionnaire. Of those, 22 (5%) did not return or complete the questionnaire. In 13 of 461 persons with completed questionnaires (3%), no NPAD-d score was available because those persons had more than three NPAD-d items missing. The final analytic sample consisted of 448 persons with a valid NPAD-d score (Fig. 1).

Fig. 1
figure 1

Flowchart of participants

Statistical analyses

First, summary statistics including simple counts, percentages, and mean values were computed to describe the demographic and health-related characteristics of the samples. Then NPAD-d total scores were calculated as previously described using the same method as the original NPAD. Up to three missing item values were imputed by value substitution based on each subject’s valid responses to NPAD-d items. Specifically, imputed values for missing NPAD items were calculated by dividing the sum of the non-missing NPAD-d items by the number of the non-missing items.

The descriptive properties of the normally distributed NPAD-d were evaluated using frequency distributions, percentages, and mean values. Internal consistency of the NPAD-d was examined using Cronbach’s alpha. The range of coefficient alpha (α) values is generally categorised as 0.60–0.64 minimally acceptable, 0.65–0.70 acceptable, 0.70–0.74 good, 0.75–0.80 very good, and 0.80 and above excellent [7]. Then, standardised item-to-total score correlations were analysed by calculating correlation coefficients between each item and the sum of all other NPAD-d items excluding the item investigated. Factor structure was explored using unrestricted principal factor analysis and unrestricted principal component analysis with varimax rotation. Eigenvalues greater than 1.0 were used to identify clustering of items. The factor structure was determined by attributing any items that had factor loadings > 0.5 to a specific factor.

Construct validity of the NPAD-d was explored using simple correlations of the NPAD-d with constructs known to be associated with neck pain (HADS depression subscale, HADS anxiety subscale, FSozU social support scale, number of visits with GPs, orthopaedists, and anaesthetists respectively) and with non-NAPD-d neck pain related characteristics (number of days with neck pain in previous year) using Pearson’s rho coefficients (convergent validity) [4, 12]. Using t tests for mean scores, the discriminative abilities of the NPAD-d were examined by comparing differences between subgroups stratified on psychological characteristics known to be associated with neck pain [2, 16] (depression according to HADS depression subscale, anxiety according to HADS anxiety subscale) and non-NPAD-d neck pain related characteristics (neck pain on the day of questionnaire completion).

All P values reported were two-sided and all analyses were performed using Stata 9.2 (Stata Corporation, College Station, TX/USA).

Results

Demographic and clinical characteristics of the study sample

Of the 448 patients included in the analysis, 56% reported to have neck pain on the day of questionnaire completion. Mean age was 49 ± 16 years. Almost 80% of the study participants were female. One third had basic education, and about one third were unemployed or retired. The mean value of the HADS depression subscale (range 0–21) was 5.4 ± 3.8. The mean HADS anxiety subscale (range 0–21) was 8.0 ± 4.1. In the study population perceived social support was generally high with a mean social support score (range 1–5) of 4.2 ± 0.7 (Table 1).

Table 1 Demographic and clinical characteristics of the sample (n = 448)

Descriptive analysis of the neck and pain disability scale German version

Ranging from 0 to 100, mean NPAD-d was 48.6 ± 18.6 (Table 1). Of the 448 persons included in the analyses, 350 (78%) had no missing NPAD-d items, 80 persons (18%) had one missing item, 14 persons (3%) had two missing items, and four persons (<1%) had three missing items. The frequency of missing items was evenly distributed in the NPAD-d items with the exception of items #7 (pain while driving or riding a car) and #20 (effect of pain pills). Those questions had a notably higher number of missing items, probably because some of the patients didn’t drive or take pain pills, respectively.

Psychometric properties of the neck and pain disability scale German version

Cronbach’s alpha of NPAD-d was 0.94 denoting excellent internal consistency of the scale. The item to total correlation of the individual NPAD-d items with the total scale ranged from 0.461 to 0.817 showing consistently significant correlations of the single items with the total scale (Table 2).

Table 2 Item to total score correlations of NPAD-d items (N = 448)
Table 3 Factor structure of the NPAD-d (unrotated 1-factor solution and varimax rotation of 3-factor solution)

Exploratory principal component analysis indicated that the NPAD-d items rather uniformly load on a single factor. The only exception was item #1 which had a borderline loading of 0.414 on the single factor, which explained 48% of the variance (Table 3, left column).

Using the Eigenvalue criterion, three factors as compared to one factor were retained. However, the two additional factors only led to a moderate improvement in explained variance as each of them explained only 6% of the NPAD-d variance in the unrotated model. In the three-factor model, 19 of the 20 NPAD-d items could be attributed to at least one factor or had borderline factor loadings of between 0.4 and 0.5. Only item #20 (effect of pain pills) failed to load on any of the three factors. Cross-loadings or non-loading of items was not uncommon. Several items seemed to measure more than one construct because they tended to load on more than one factor (items # 8–11, 13) (Table 3, right column).

Construct validity

Correlation analysis showed highly significant correlations with criterion variables. Specifically, measures of depression (HADS depression subscale: Pearson’s rho = 0.436, P < 0.001), anxiety (HADS anxiety subscale: Pearson’s rho = 0.410, P < 0.001), and social support (FSozU: Pearson’s rho = −0.168, P < 0.001) were highly correlated with NPAD-d values. Number of days with neck pain in the previous year, a non-NPAD-d neck pain related characteristic, showed also a strong correlation with NPAD-d (Pearson’s rho = 0.363, P < 0.001). Additionally, measures of health care use (number of visits with a GP: Pearson’s rho = 0.246, P < 0.001; number of visits with an orthopaedist: Pearson’s rho = 0.213, P = 0.026; number of visits with an anaesthetist: Pearson’s rho = 0.597, P = 0.015) were relevantly correlated with NPAD-d.

Discriminative validity

An analysis of the NAPD-d and its ability to discriminate between subgroups of patients with different neck pain-related characteristics are shown in Table 4. Based on depression symptoms, anxiety symptoms, as well as presence of neck pain on the day of questionnaire completion the NPAD-d distinguished between these groups with a high level of significance.

Table 4 Comparison of the NPAD-d across groups with different social and clinical characteristics

Discussion

This newly developed German version of the neck pain and disability scale (NPAD-d) demonstrated good validity and reliability in a general practice setting. The NPAD-d exhibited stable internal consistency, good construct and discriminant validity. It can thus be reliably administered using standard population survey techniques albeit with some uncertainty regarding the factor structure.

This is the first study to develop and validate a German version of the NPAD in a primary care setting. It was conducted in a relatively large group recruited by a defined algorithm from the whole patient population of various practices and is therefore largely representative of the typical neck pain patients seen in general practice. The quite large number of exclusions can be traced back to predefined reasons according to this algorithm (Fig. 1) so that it seems unlikely that our sample was subject to an unintentional selection bias.

There are several limitations to consider in evaluating this research. First, the population consisted largely of subjects with mild or moderate neck pain indicated by mean NPAD-d scores of 48.6. Although this may be expected in this adult population, the NPAD-d should also be tested in populations with severe pain and/or disability. Second, although the NPAD compared favourably to other pain measures in a previous study [9], no comparisons of the German version to other pain instruments were possible in this project. Third, the study data were derived cross-sectionally. Therefore, the instrument has not been tested for sensitivity to change, to detect change over time, or for feasibility and utility in clinical settings. It is worth noting that test–retest reliability of the NPAD was estimated to be good in the original publication.

The most precise and reliable evidence of the presence and intensity of pain is the patients’ self-report [1]. However, there is no established “gold standard” for pain assessment with which to compare a pain scale for a specific condition. Therefore, comparisons must rely on other constructs known to be associated with pain or other methods of pain assessment. Our study results consistently showed high correlations of the NPAD-d with constructs of mood, anxiety, ambulatory health care utilisation, and non-NPAD-d related pain indicators. These highly desirable measurement properties have previously been demonstrated for the original version of the NPAD and for the French [16], Portuguese [5], and Turkish [2] versions. High internal consistency values, as found in this study for the NPAD-d, have also been reported for the original version [15] and for the other three adapted versions [2, 5, 16].

However, in factor analyses of former studies differing numbers of factors on which NPAD version items were loading have been identified. For the French [16] and the Portuguese [5] versions, NPAD items loaded on three factors, whereas in the original publication NPAD items clustered into four to five [9, 15] subscales, although that publication was based on a relatively small sample and the stability of the observed factor solution may be questioned. In this study, we found slightly differing results in factor analysis. Our model derived from principal factor analysis indicated a one-factor solution for the NPAD-d; when using the Eigenvalue criterion, three factors were retained. However, when comparing the results of the factor analysis of the NPAD as they have been presented in the original publication [15] with the results of our three-factor model, clustering of items showed high concordance. Additionally, several original NPAD items tended to load on more than one factor which is also consistent with our findings. Therefore, due to cross-loading or non-loading of items in this model and due to the small amount of variance explained by two of the three factors retained in the principal factor model, we assume that the single-factor model reflects the content structure of the NPAD-d more robustly. We conclude that the NPAD is a multidimensional assessment instrument measuring different facets of one construct, neck pain, in a stable manner. Additionally, the number of factors identified in the French and Portuguese version studies was also different from the original publication which is further evidence that the factor structure presented in the original publication may be too sample-specific to be reproducible in separate samples.

The NPAD-d is the first self-administered questionnaire for the assessment of neck pain available in German. Cronbach’s alpha values of higher than 0.90 may be indicative of redundancy of the scale [7]. Given the high Cronbach’s alpha of 0.94, previous research and our results indicate that it may be possible to shorten the 20-item NPAD-d instrument without losing significant reliability and validity.

This study provides a first step in furthering knowledge of self-administered neck pain assessment in the German-speaking countries and highlights some important concerns with regard to the factor structure and usability of the NPAD. Further studies on the effects of clinical assessment process and the management of neck pain are necessary before widespread use of the NPAD-d in the clinical context.