Research reportThe reliability and validity of the Seasonal Pattern Assessment Questionnaire: a comparison between patient groups
Introduction
It is remarkable that one instrument, i.e., the Seasonal Pattern Assessment Questionnaire (SPAQ; Rosenthal et al., 1987) has such a prominent place in the study of Seasonal Affective Disorder (SAD), the more so since data on the psychometric qualities of the SPAQ are relatively scarce. The SPAQ has two objectives. The original purpose of the SPAQ was to study a number of characteristics of SAD like seasonal variation of mood, food intake and weight gain, seasonal sleep duration, and sensitivity to weather conditions. Although not developed as such, the SPAQ was introduced as a diagnostic or screening instrument for SAD by Kasper et al. (1989a), who were the first to formulate the SPAQ criteria for SAD. Since 1989 the SPAQ has been widely used as a screening instrument. In fact, all over the world the prevalence figures of SAD are based on the ‘Kasper criteria’ of the SPAQ (Mersch et al., 1999a).
Nevertheless, in only a few prevalence studies responders who met the criteria of the SPAQ for SAD were interviewed. Kasper et al. (1989a) concluded that the SPAQ underestimated the number of subjects that met the clinical criteria of SAD as established in an interview (50% false negatives, 0% false positives), while Magnusson (1996), evaluating the results of the Magnusson and Stefansson (1993) study, found that the clinical interview and the SPAQ reached approximately the same number of SAD subjects. The SPAQ in the latter study reached a sensitivity of 94% and a specificity of 73%. Michalak et al. (2001) interviewed 66 SAD cases detected by the SPAQ in a prevalence study on 1999 residents of the United Kingdom. No less than 55% of the SAD cases were false positives. No false negatives were detected in 23 interviewed non-SAD cases. In a study by Raheja et al. (1996) the SPAQ only missed three out of 47 SAD patients showing a very good sensitivity (94%). Since only SAD cases were studied, the specificity could not be assessed. At a follow-up assessment 5–8 years later the sensitivity was 74%, while the specificity was 46%.
There is accumulating evidence that the ‘Kasper criteria’ of the SPAQ lead to overestimation of the percentage of SAD cases in the general population. In the Michalak et al. (2001) study the prevalence of SAD was estimated by the SPAQ as 5.3% while a DSM-IV interview estimated the prevalence at 2.4%. In a population survey in the USA Blazer et al. (1998) studied SAD in a sample of 8098 subjects by means of a structured interview. Only 0.4% of the respondents met the criteria for SAD. This figure is considerably lower than the mean prevalence figure of SAD in North America (6.2%), based on the SPAQ criteria (Mersch et al., 1999a). Two studies investigated the prevalence in Canada, both using a clinical interview and the SPAQ in a telephone survey. Results of the Levitt et al. (2000) study on a sample of 781 respondents showed that the SPAQ detected almost twice as many SAD subjects (5.0 versus 2.9%) as a clinical interview based on the DSM-IV (American Psychiatric Association, 1994). In a second study on 1605 respondents designed to test the latitude hypothesis of SAD (Levitt and Boyle, 2002), the SPAQ overestimated the mean prevalence three to four times, compared to a ‘golden-standard’ clinical interview (7.4 versus 1.9%). So, validity studies that have been performed show mixed results and do not unequivocally support the confidence in the SPAQ’s validity suggested by its widespread use (Mersch, 2001).
Another way to establish the validity is to study the ability of the SPAQ to discriminate between different groups (e.g., SAD and sub-SAD subjects, different patient groups and normal control groups), composed in a way other than by means of SPAQ criteria (contrast validity). Regretfully, the few studies that have compared different patient groups with the SPAQ (Kasper et al., 1989b, Thompson et al., 1988, Hardin et al., 1991) fail to provide data on the sensitivity and specificity of the SAD criteria of the SPAQ.
In the present study the ratings on the SPAQ of SAD patients were compared with non-seasonal depressed outpatients, non-depressed outpatients and a control group of healthy volunteers. The main objective was to establish the validity of the SAD criteria of the SPAQ. A secondary goal was to establish the reliability and factor structure of the seasonality scale (a replication of the Magnusson et al., 1997 study) and the climate scale. Also, the differences between the groups with respect to atypical SAD symptoms will be studied.
Section snippets
Subjects
Four groups of subjects participated in the study.
Seasonality
The level of seasonal variation of the four groups on the seasonality items is calculated by multiplying the score on each item with the difference of the corresponding items of the calendar section, a method similar as used by Thompson et al. (1988) and Wirz-Justice et al. (1992). For instance, the score of each subject on the seasonality item ‘sleep’ is multiplied by the difference between the scores on the question ‘when do you sleep most’ (0 or 1) and ‘when do you sleep least’ (0 or 1) on
Discussion
The main goal of the present study was to establish the capability of the SPAQ and the SAD criterion developed by Kasper et al. (1989a) to discriminate between different patient groups and a normal control group. In general, the SPAQ performed quite well. The specificity was excellent, but the sensitivity is less impressive. Discriminant analysis showed that the criteria of the SPAQ were reasonably able to place subjects in the right group. However, compared to the Magnusson (1996) and the
References (24)
Measuring depression: the CES-D scale
- et al.
Evaluation of seasonality in six clinical populations and two normal populations
J. Psychiatr. Res.
(1991) - et al.
Reliability of checklist-guided diagnoses for DSM-IIIR affective and anxiety disorders
J. Affect. Disord.
(1990) Validation of the Seasonal Pattern Assessment Questionnaire (SPAQ)
J. Affect. Disord.
(1996)- et al.
Internal consistency of the Seasonal Pattern Assessment Questionnaire (SPAQ)
J. Affect. Disord.
(1997) - et al.
Seasonal Affective Disorder and latitude: a review of the literature
J. Affect. Disord.
(1999) - et al.
The prevalence of Seasonal Affective Disorder in The Netherlands: a prospective and retrospective study of seasonal mood variation in the general population
Biol. Psychiatry
(1999) - et al.
The seasonal pattern assessment questionnaire for identifying seasonal affective disorders
J. Affect. Disord.
(1996) - et al.
A comparison of normal, bipolar and seasonal affective disorder subjects using the Seasonal Pattern Assessment Questionnaire
J. Affect. Disord.
(1988) Diagnostic and Statistical Manual of Mental Disorders
(1987)