Article Text

Original research
Efficacy of positive psychotherapy in reducing negative and enhancing positive psychological outcomes: a meta-analysis of randomised controlled trials
  1. Thole Hilko Hoppen,
  2. Nexhmedin Morina
  1. Institute of Psychology, University of Münster, Münster, Germany
  1. Correspondence to Dr Thole Hilko Hoppen; thoppen{at}uni-muenster.de

Abstract

Objective Positive psychotherapy (PPT) aims at increasing positive affect, meaning and engagement. We aimed to synthesise the available evidence on PPT efficacy.

Design We conducted a preregistered systematic literature search and meta-analysis of randomised controlled trials examining the efficacy of PPT for increasing positive (eg, satisfaction with life) or decreasing negative psychological outcomes (eg, depression).

Data sources Medline, PsycINFO and Web of Science from 2006 (ie, inception of PPT) to February 2020 as well as related systematic reviews and meta-analyses.

Results We included 20 randomised controlled trial with a total of 1360 participants. Moderate effect sizes were found for increasing positive outcomes (g=−0.72, 95% CI: −1.31 to −0.14, k=10, numbers needed to treat (NNT)=2.55) and reducing negative outcomes (g=0.48, 95% CI: 0.18 to 0.78, k=8, NNT=3.76) when PPT was compared with waitlist control conditions at post-treatment assessment. When compared with active control conditions, PPT yielded large effect sizes for increasing positive outcomes (g=−0.92, 95% CI: −1.74 to −0.11, k=6, NNT=2.05) and reducing depression (g=0.94, 95% CI: 0.18 to 1.70, k=6, NNT=2.03) at post-treatment assessment. No significant differences in efficacy were found when compared with established treatments such as cognitive–behavioural therapy. Evidence was found to support an association between trial quality and effect sizes. For positive outcomes, higher trial quality was related to larger effect size. Whereas higher trial quality was associated with smaller effect size for depression. Follow-up assessments remained too scarce for most planned analyses.

Conclusions Our findings support the short-term efficacy of PPT. However, results are to be regarded with due caution in the light of low number of trials. More high-quality trials that assess efficacy at follow-ups are needed to draw firmer conclusions on the long-term efficacy of PPT.

PROSPERO registration number CRD42020173567.

  • depression & mood disorders
  • schizophrenia & psychotic disorders
  • cancer pain
  • adult psychiatry
  • psychiatry

Data availability statement

Data sharing not applicable as no datasets generated and/or analysed for this study. We performed a meta-analysis on published und publicly accessible data. No additional data available.

http://creativecommons.org/licenses/by-nc/4.0/

This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

Strengths and limitations of this study

  • This meta-analysis was preregistered and conducted in line with the Preferred Reporting Items for Systematic Reviews and Meta-analyses guidelines.

  • Data synthesis was based on a broad systematic literature search including broad secondary manual searches.

  • Potential moderators including trial quality, treatment lengths and alliance were analysed.

  • Scarcity of available trials precluded many (sub)analyses and asks for due caution in interpreting the present findings.

  • Due to lacking follow-up assessment, long-term efficacy could not be determined.

Introduction

Positive psychotherapy (PPT) is theoretically grounded in the field of positive psychology and proposes that psychopathology such as depression can be effectively treated by directly and primarily building and strengthening pleasure (ie, positive emotions), meaning (ie, belonging to and serving something greater than the self) and engagement (ie, active involvement in daily life).1 PPT presumes that by means of fostering positive resources, negative symptoms will be successfully dampened. While the founders believed from inception that PPT might be an effective treatment for various disorders, they started off by investigating its efficacy in treating depression. PPT consists of single positive interventions such as Using Your Strength, the Three Good Things and the Gratitude Visit. In Using Your Strength, for instance, participants are asked to fill out the Values in Action Inventory of Strengths2 and to think of ways to use their top five strengths more in daily life. Seligman et al ended up including 26 positive exercises in their final PPT manual. In their first randomised controlled trial (RCT) on the efficacy of PPT, they offered a 6-week, 2-hour per week group intervention with 8–11 mildly to moderately depressed students per group and found that PPT was effective in lowering depressive symptoms and increasing satisfaction with life compared with waitlist controls.1 They also conducted a second RCT were they offered a 14-session individual PPT over 12 weeks in a sample of adults suffering from major depressive disorder. Again, PPT was found effective in decreasing depression and increasing happiness, in this RCT compared with treatment-as-usual.1 Since then, numerous other RCTs have assessed the efficacy of PPT.3 Apart from further research on populations suffering depressive symptoms or depressive disorders, PPT has been investigated in various other contexts including patients with psychosis4 and multiple other mental disorders5 as well as in patients with several somatic complaints such as cancer6 7 or multiple sclerosis.8 In their systematic review of the PPT literature, Walsh et al summarised the findings of 12 publications (from 9 individuals trials) published before May 2015.3 The authors conclude that the application of PPT in intervention research is heterogeneous in terms of both, the modifications of the original manual as well as the conditions targeted by PPT as intended by the PPT developers.1 9 To the best of our knowledge, no meta-analysis with an exclusive focus on the efficacy of PPT has been published to this date. Against this background, we performed a systematic literature review and meta-analysis of RCTs assessing the efficacy of PPT.

Methods

Following the recommendations by the Preferred Reporting Items for Systematic Reviews and Meta-analyses group,10 we defined the main structured research question describing the Population, Intervention, Comparison, Outcome and Study design as ‘In individuals with mental or physical health complaints, does PPT (I), compared with control conditions (C), improve psychological outcomes (O) in RCTs (S)?’.

Patient and public involvement

Not applicable. We performed a meta-analysis on published data.

Literature search strategy

Inclusion criteria for the meta-analysis consisted of: (1) RCT, (2) evaluation of the efficacy of PPT as developed by Seligman et al1 and (3) a minimum of 10 participants per treatment arm at post-treatment assessment with available data on at least one relevant outcome. No restrictions were placed on age of participants, comparison conditionor publication type. Studies that only applied a mixture of PPT with another intervention, such as a mixture of PPT and cognitive–behavioural therapy (CBT) in comparison to a control condition,9 were excluded due to our narrow focus on the efficacy of PPT, as founded by Seligman et al.1 We searched the following databases: PsycINFO, MEDLINE and Web of Science from 2006 up to 13 February 2020. The year 2006 represents the year where the theoretical underpinnings of the PPT were first published.1 No other limits or filters were applied. MeSH terms for Ebscohost (regarding MEDLINE and PsycINFO) were as follows: ‘SU positive psychotherapy OR TI positive psychotherapy OR AB positive psychotherapy’ (see also online supplemental materials eList 1). In Web of Science, a similar search string to Ebscohost was chosen to search for ‘positive psychotherapy’ in titles, abstracts and keywords. To retrieve additional publications, the reference lists of all included papers and relevant (ie, related) meta-analyses and systematic reviews were manually screened.11–19 Secondary handsearches were conducted using Google Scholar. The study synthesis was performed independently by both authors.

Coding of studies

The publications were independently coded by both authors. From each publication, the following study, intervention and participant characteristics were coded and extracted: country the trial was conducted in, clinical population targeted (ie, any physical or mental health condition), experimental intervention type (ie, original PPT manual or modified version), intervention format (ie, individual or group), comparison group(s), session number and session duration in minutes, follow-up duration in months for the longest reported follow-up assessment of the relevant outcome(s), number of participants at post-treatment assessment, age of participants (ie, mean and SD or range), proportion of sample with female sex in per cent, applied statistical analysis (ie, completer or intention-to-treat (ITT) analyses) and relevant outcome(s) targeted by PPT. The post-treatment assessment experimental group and control group means, SD and sample sizes on the relevant outcome(s) (see in more detail below) were extracted. When reported, follow-up assessment data on relevant outcomes per group were also extracted. When multiple follow-up assessments were reported, the data from the longest follow-up assessment were retrieved. When relevant data were not reported, it was either calculated from given data (eg, SD from SEs) or the corresponding author of the respective publication was contacted via email twice with 1 month in between. In one case, we contacted authors due to unusual results. Mohamadi et al potentially reported the means and SDs for a relevant outcome (ie, quality of life) in wrong order (ie, means where SDs should be placed and vice versa).20 We contacted the authors twice via email and were left with no response. Consequently, we calculated two analyses; one with changed order of means and SD and one with unchanged order.

We divided control conditions into passive control conditions, which turned out to exclusively consist of waitlist control conditions (WLC), active control conditions (ie, treatment-as-usual and placebo exercises) and other active treatment conditions (ie, CBT, dialectic behavioural therapy (DBT) and Mindfulness-Based CBT (MBCT)). Note that included trials included different physical or mental health conditions and, therefore, TAU may involve various different treatment regimens.

Quality assessment

Both authors independently rated the quality of the included trials by using a quality assessment constructed by Cuijpers et al and adjusted in two subsequent meta-analyses.21–23 After independent rating, regular digital meetings were held to discuss disagreements. This scale assesses the following nine quality criteria: (1) Were symptoms/diagnoses assessed with a semistructured diagnostic interview?, (2) Was a treatment manual used?, (3) Were therapists trained either specifically for the study or in a general training?, (4) Was treatment integrity checked by supervision and/or recordings and/or standardised instruments?, (5) Was data analysed with ITT analysis?, (6) Was group allocation performed with a true randomisation technique?, (7) Was randomisation done by an independent third person (or computer or sealed envelopes)?, (8) Were blinded assessors used for interviews? and (9) Were drop-outs adequately reported? Items for each of the nine quality criteria were scored on a four-point scale, where 3 indicates high quality (eg, a published treatment manual was used), 2 indicates limited quality (eg, an unpublished treatment manual was used), 1 indicates lack of required quality (eg, no treatment manual was used) and 0 indicates unknown (ie, required information not reported). When self-report measures were used to assess outcomes in a given trial, a score of 3 was given on the quality item concerning blinded assessments. In case of technology-based interventions, a trial received a score of 3 on the quality items concerning trained therapists and formal fidelity checks due to the technology-based standardised procedure. The nine ratings were then summed up to yield the respective trial quality sum score and used as a potential moderator in meta-regressions.

Data extraction of outcome measures

For each study, a maximum of two outcomes were selected, one positive psychological outcome (if available) and one negative (if available). Choice of extracted positive and/or negative psychological outcome(s) was data driven. That is, we first extracted all negative and positive psychological outcomes per trial and then analysed across all included trials which positive and negative psychological outcomes were most frequently assessed and reported in the PPT trial literature. For the negative outcomes, depression was by far the most frequently assessed outcome (k=14) and the sole negative outcome extracted. Assessment of positive outcomes was more heterogeeous. Satisfaction with life was assessed most often (k=11), consecutively followed by happiness (k=9), well-being (k=5), hope (k=5), positive affect (k=4), quality of life (k=3), self-efficacy (k=2) and meaning in life (k=1). As such, we prioritised satisfaction with life first in the data extraction phase when several positive outcomes were reported in a given trial, happiness second and so forth. We planned to conduct two overarching analyses across included negative and positive outcomes, respectively, as well as subanalyses on all individual outcomes with a sufficient number of independent trials (ie, k≥4). Data were extracted by both authors and regular digital meetings were held to discuss disagreements.

Statistical analysis

Analyses were completed with the metafor package (V.1.9.8) in R V.3.5. using random-effects models given that we expected large heterogeneity in reported effect sizes .24–26 We prioritised ITT data when available (k=3) over completer data (k=17, including k=3 with insufficient information on participant flow, see table 1 for further information). To obtain the effect size Hedges’s g, R first calculates the standardised mean difference d (ie, control group mean subtracted from the experimental group mean and then divided by the pooled SD). The standardised mean difference is then multiplied by a sample size correction factor J=1-(3/ (4df – 1)) to yield Hedges’s g.27 Analyses were conducted if four or more trials were available for a given (sub)analysis.28 Effect sizes g may be conservatively interpreted with Cohen’s convention of small (±0.2), medium (±0.5) and large (±0.8) effects.29 As a test of homogeneity of effect sizes, we calculated the Q-statistic and the corresponding p value. We also calculated the I2 statistic, as a measure of heterogeneity of effect sizes across trials in per cent. It has been suggested that I2-statistics of 25, 50, and 75% may be interpreted as referring to low, moderate and high levels of heterogeneity, respectively.30 Because we expected large heterogeneity, we also calculated prediction intervals.31 Prediction intervals, unlike I2-statistics, present a heterogeneity estimate in the same metric as the original effect size measure (ie, g). As such, prediction intervals provide a predicted range for the true treatment effect in similar future trials.32 When the prediction interval excludes the null, it is likely that similar future trials will also find significant effects. To check for potential effects of outliers on meta-analytic outcomes, we aimed at repeating analyses without identified outliers. Outliers were defined as effect sizes departing 3.3 SD away from the pooled mean effect in both directions.33 34 However, no outliers were identified in any of the performed analyses. When analyses consisted of at least 10 trials,35 we assessed risk of publication bias through visual inspection of funnel plots, Egger’s test of asymmetry and number of missing studies using the trim-and fill procedure.36 The trim-and-fill procedure yields an asymmetry-corrected estimate of the effect size (ie, taking publication bias into account). We calculated the numbers needed to treat (NNT) as a measure of efficacy that is easily interpretable from a clinical perspective. It informs about the numbers of patients that need to be treated until one adverse event is prevented.37 NNTs were calculated with the NNT function of the dmetar package and are based on the pooled effect sizes (ie, Hedges’ g). Lastly, we performed moderator analyses in R with trial quality sum score and treatment length (in minutes) as continuous variables (ie, meta-regressions) and alliance as a dichotomous variable (ie, trials with vs without the involvement of the founders of PPT1) to check for potential moderating effects on efficacy outcomes. Since too few trials were available to check for alliance, we performed sensitivity analyses with trials involving the founders omitted.1 Moreover, we performed more general sensitivity analyses with the leaving1out function of the metafor package.

Table 1

Basic characteristics of included trials

Results

Study characteristics

Figure 1 describes the flow of hits during the study synthesis. Of the initial 5501 hits, a total of 17 publications that described 20 trials met our inclusion criteria. Basic characteristics of the included trials can be found in table 1. Nine trials (45%) compared the efficacy of PPT with WLC. Five trials (25%) compared PPT with an active control condition (eg, treatment-as-usual, control exercises). Three trials (15%) compared PPT with another psychological intervention (eg, CBT, DBT). Lastly, three trials (15%) compared PPT with more than one control conditions.1 21 38 Fourteen trials (70%) applied PPT in a group setting and the remaining six trials in an individual setting. Two of the latter trials described in one publication applied an internet-based PPT.39 Treatment lengths was 917.06 min on average (unweighted mean across trials reporting on both, number and duration of sessions, k=17) with an SD of 374.79 min. Note that the pioneering manual of Seligman et al1 constitutes of a 720 min (ie, 12 sessions á 60 min). Average number of sessions was 9.17 (SD=2.71) and average session length was 101.76 min (SD=22.03). Ten trials (50%) conducted follow-up assessments on relevant outcomes whereas nine trials failed to do so. The remaining study assessed data on a relevant outcome 2 weeks after the post-treatment assessment,40 which we excluded from the follow-up data due to too short amount of time between postassessment and follow-up assessment. The average follow-up period was 7.10 months (SD=4.21). Most trials were conducted in Iran (k=10) and the USA (k=5). The remaining trials were conducted in Austria (k=1), South Korea (k=1), Canada (k=1), China (k=1) and the UK (k=1). One publication entailing three trials was a PhD dissertation,39 whereas the remaining trials constituted articles published in peer-reviewed journals. Study quality was moderate overall with a mean of 17.85 out of the possible range from 0 to 27. Study quality varied considerably across included trials with an SD of 4.69. The detailed quality assessment per trial can be found in table 2.

Figure 1

PRISMA flow chart depicting synthesis of included randomised controlled trials. PPT, positive psychotherapy; PRISMA, Preferred Reporting Items for Systematic Reviews and Meta-analyses; RCT, randomised controlled trial.

Table 2

Quality assessment of included trials

Participant characteristics

Basic characteristics of included participants per trial can be found in table 1. A total of 1360 participants participated in the included trials. Most of the participants were female (unweighted mean across included trials=71.75%) with a range from 23.63%41 to 100%.42 The patients had a pooled weighted mean age of 39.97 with a pooled SD of 10.18. It is worth noting, however, that several studies only reported age ranges rather than means and SD43 or did not report on age altogether.39

The efficacy of PPT in increasing positive outcomes

Results on the efficacy of PPT are displayed in table 3. In terms of increasing various positive outcomes such as satisfaction with life (SWL) and happiness, PPT was found moderately more effective than WLC at post-treatment assessment (g=−0.72, 95% CI: −1.31 to −0.14, k=10, NNT=2.55). See figure 2 for the corresponding forest plot. Results remained similar, when the results of Mohamadi et al20 were entered as reported in their publication (g=−0.82, 95% CI: −1.39 to −0.25, k=10, NNT=2.27). Number of available trials allowed for a publication bias check. While a visual inspection of the funnel plot led to the suspicion of publication bias (ie, missing trials to the left and a potential outlier to the far left, see online supplemental eFigure 1), Egger’s test did not indicate significant asymmetry (t=−1.91, p=0.093). The sensitivity analysis yielded that one trial had particular influence on the pooled effect size. When Abdeyan et al (ie, assessed positive outcome=hope) was omitted, pooled effect size decreased to g=−0.44 (see online supplemental eTable 1). No evidence was found for the efficacy of PPT in increasing positive outcomes compared with WLC at follow-up assessment (g=−0.36, 95% CI: −0.83 to 0.11, k=4, NNT=5.01). See online supplemental eFigure 2 for the corresponding forest plot. Follow-up assessment results are to be scrutinised with due caution in the light of low number of available trials (k=4), large heterogeneity in effect sizes (I2=74.34) and the wide range of the prediction interval (PI=−1.29; 0.57). Satisfaction with life was the only positive outcome with enough trials to warrant a meta-analytical subanalysis. In comparison to WLC at post-treatment assessment, PPT was not found more effective in increasing satisfaction with life (g=−0.15, 95% CI: −0.40 to 0.09, k=4, NNT=11.55). See online supplemental eFigure 3 for the corresponding forest plot. Heterogeneity in outcomes was low (I2=11.20). The sensitivity analysis did not yield that one of the four studies was particularly influential on the pooled effect with all leaving1out analyses yielding a non-significant pooled g (see online supplemental eTable 1). In comparison to active control conditions (ie, treatment-as-usual and placebo exercises) at post-treatment assessment, PPT yielded a large effect size in increasing positive outcomes (g=−0.92, 95% CI: −1.74 to −0.11, k=6, NNT=2.05). See online supplemental eFigure 4 for the corresponding forest plot. However, heterogeneity in outcomes was large (I2=92.51) and the prediction interval included the null (PI=−2.98 to 1.13) illustrating large variability in findings. When compared with other active treatment conditions (ie, CBT, DBT, MBCT and Neurofeedback-aided Meditation), no differences in efficacy at post-treatment assessment were found for increasing positive outcomes (g=−0.29, 95% CI: −0.89 to 0.32, k=6, NNT=6.24). See online supplemental eFigure 5 for the corresponding forest plot. Again, heterogeneity in outcomes was large (I2=79.57) and the prediction interval included the null (PI=−1.71 to 1.13). Results remained insignificant when results of Mohamadi et al20 were entered as reported in their publication (g=−0.65, 95% CI: −1.31 to 0.01, k=6). Lastly, when trials with alliance (ie, involvement of the founder) were omitted, results for the comparison with WLC at post-treatment assessment remained similar (g=−1.04, 95% CI: −1.79 to −0.28, k=7, NNT=1.87, see table 3).

Figure 2

Forest plots—efficacy of PPT versus waitlist control (WC) conditions in increasing positive (left) and decreasing negative (right) psychological outcomes at post-treatment assessment. PPT, positive psychotherapy.

Table 3

Efficacy of PPT for increasing positive outcomes and decreasing negative outcomes

The efficacy of PPT in decreasing negative outcomes

PPT was found moderately more effective in reducing depression, negative affect and stress than WLC at post-treatment assessment (g=0.48, 95% CI: 0.18 to 0.78, k=8). See figure 2 for the corresponding forest plot. To avoid one adverse event (ie, depression, negative affect or stress), a little less than four patients needed to be treated (NNT=3.76). The sensitivity analysis did not yield that one of the eight studies was particularly influential on the pooled effect with all leaving1out analyses yielding moderate pooled effect sizes between 0.40 and 0.58 (see online supplemental eTable 1). Results on decreasing depression were similar (g=0.57, 95% CI: 0.21 to 0.92, k=6, NNT=3.22). See online supplemental eFigure 6 for the corresponding forest plot. Again, the sensitivity analysis did not yield that one of the six studies was particularly influential with moderate pooled effect sizes between 0.47 and 0.68 for the leaving1out analyses (see online supplemental eTable 1). Prediction intervals for both analyses (ie, all negative outcomes and depression only) excluded the null (PI=−0.17 to 1.13; PI=−0.18 to 1.31, respectively) highlighting substantial levels of heterogeneity in efficacy outcomes and remaining uncertainty about the true efficacy when similar future trials accumulate. In comparison to active control conditions (ie, treatment-as-usual with or without medication and placebo exercises) at post-treatment assessment, PPT yielded large effect sizes in reducing depression (g=0.94, 95% CI: 0.18 to 1.70, k=6, NNT=2.03). Please find the corresponding forest plot in online supplemental eFigure 7. Again, heterogeneity was large (I2=90.28) and the PI excluded the null (PI=−0.96 to 2.83). When compared with other active treatment conditions (ie, CBT, DBT, MBCT and Neurofeedback-aided Meditation), no differences in efficacy at post-treatment assessment were found for decreasing negative outcomes (g=0.08, 95% CI: −0.48 to 0.64, k=6, NNT=22.22). Please find the corresponding forest plot in online supplemental eFigure 8. Trials that included follow-up assessments on the efficacy of PPT in decreasing negative outcomes were too few to allow for meta-analytical review for all included comparisons (k<4). Lastly, when trials with alliance (ie, involvement of the founder) were omitted, results for the comparison with WLC at post-treatment assessment remained similar (g=0.63, 95% CI: 0.20 to 1.07, k=5, NNT=2.89, see table 3).

Moderator analyses

Moderator analyses revealed that trial quality as a continuous variable was associated with effect sizes in most of the above-mentioned analyses. See table 4 for an overview of results. With regard to the efficacy of PPT in increasing positive outcomes in comparison to WLC at post-treatment assessment, trial quality was found to be a significant moderator with higher trial quality being associated with larger effect sizes (b=0.17, p=0.003). A similar result was found for the follow-up assessment results (b=0.12, p=0.036). In terms of the comparison with active control conditions at post-treatment assessment, trial quality was also found to moderate effect sizes with higher trial quality being associated with larger effect sizes (b=0.18, p=0.015). No significant moderation of trial quality was found for the comparison with other active treatment conditions (b=−0.01, p=0.907) nor for the sub-analysis on satisfaction with life (b=−0.01, p=0.915).

Table 4

Subanalyses on trial quality and treatment length as potential moderators

In terms of the efficacy of PPT in decreasing negative outcomes in comparison to WLC at post-treatment assessment, trial quality was found to be a significant moderator with higher trial quality being associated with smaller effect sizes (b=−0.08, p=0.003). A similar result was found for the sub-analyses on depression (b=−0.11, p<0.001). Similarly, the subanalysis on depression for the comparison of PPT and active control conditions yielded a significant moderation of trial quality with higher trial quality being associated with smaller effect sizes (b=−0.17, p=0.005). However, a significant moderation was found for the comparison with other active treatment conditions with higher trial quality being related to larger effect sizes in decreasing negative outcomes (b=0.13, p<0.001). No evidence was found for a moderation of treatment length in any of the analyses (see table 4).

Discussion

Our systematic search resulted in 20 RCTs that assessed the efficacy of PPT. The results of the meta-analysis indicate that PPT can effectively increase positive psychological outcomes and decrease depression at post-treatment assessment. Both comparisons with WLC and active control groups support the short-term efficacy of PPT. Overall, there is too few data on the long-term efficacy of PPT. Additionally, moderator analyses yielded that trial quality was significantly associated with effect size. For positive outcomes, higher quality of trials was related to larger effect sizes. Whereas for depression, higher quality of trials was related to smaller effect sizes. However, the low number of available trials, large heterogeneities, identification of some influential single trials in the sensitivity analyses and wide prediction intervals call for cautious statements on the efficacy.

The findings support the short-term efficacy of PPT in increasing positive psychological outcomes. However, the larger magnitude in effect sizes for comparisons with active control conditions (pooled g=−0.92) compared with WLC (pooled g=−0.72) is surprising and counterintuitive. Usually the opposite pattern is found in clinical research.21 28 Unplanned post hoc investigations on potential reasons hint towards the effect of an almost outlier in the analysis involving active comparison groups.7 This trial offered either PPT or treatment-as-usual to patients with cancer and yielded a strikingly large effect size at post-treatment assessment favouring PPT (g=−2.79) for increasing meaning in life. Furthermore, a second trial on cancer patients also produced a large effect size for increasing happiness (g=−1.80) as compared with waitlist at post-treatment assessment.6 While these two trials on cancer patients suggest that PPT might be highly effective in increasing positive outcomes in this population, two trials remain of course a slim evidence base. It should be noted, however, that the analysis on passive control conditions (ie, waitlist controls) also involved an almost outlier.40 This study offered PPT to depressed patients and yielded a strikingly large effect size at post-treatment assessment (g=−2.98) favouring PPT in increasing hope. Both almost outlier studies involved a moderate sample size (see table 1). All this suggests that more trials are needed to allow for firmer conclusions.

When PPT was compared with other established psychological interventions such as CBT, current data did not suggest any significant difference in efficacy. Accordingly, the results of the six RCTs included in this comparison suggests that PPT is similarly effective in increasing positive psychological outcomes. However, due to the low number of trials for this comparison these findings need to be viewed with due caution.

The first and foremostly assessed negative outcome in the PPT literature remains depression. As suggested and intended by its developers, PPT was found moderately to largely effective in lowering depressive symptoms. Again, the counterintuitive pattern was found with larger effect sizes in lowering depression for PPT in comparison to active control conditions (pooled g=0.94) as opposed to WLC (pooled g=0.57). Once more, unplanned post hoc investigations were performed in an attempt to find potential reasons for the counterintuitive finding. Again, we found that an almost outlier might explain the difference. The analysis involving active control groups involved an almost outlier with an effect size of g=2.45,44 whereas the analysis involving WLC did not involve such an almost outlier.

Data on the efficacy at follow-up assessments altogether were scarce. The only feasible analysis on follow-up assessment data (ie, PPT vs WLC in increasing positive outcomes) yielded a non-significant effect size. The current available literature does not allow for any other valid follow-up analyses and, thus, conclusions on the long-term efficacy of PPT cannot not yet be made. This represents perhaps the main limitation of the literature on the efficacy of PPT. For the same reason, additional sensitivity analyses (eg, group vs individual PPT, or PPT efficacy by health condition vs mental health condition) were not feasible.

Trial quality overall was moderate and, therefore, leaves room for improvement. Results overall are comparable to related meta-analyses on positive psychology interventions (PPIs) more generally which report moderate effect sizes in increasing positive outcomes and decreasing negative outcomes.11–19 A recent meta-analysis on PPIs further also reports on a significant relation between trial quality and the efficacy of PPIs.15 However, PPIs vary considerably and generalisations from meta-analyses on PPIs on PPT are, therefore, not straightforward.

This represents the first meta-analysis with an exclusive focus on the efficacy of PPT. Several limitations need to be considered. First and foremost, the number of included trials is relatively small and accordingly more research is needed to draw firmer conclusions. Second, depression and SWL were the only two outcomes with enough trials to warrant subanalyses. More research is needed to allow for more homogeneous analyses on PPT efficacy for specific outcomes. Third and related to the second limitation, the two overarching analyses on various positive and negative outcomes involved large heterogeneity, respectively. The decision to conduct such overarching analyses on heterogeneous outcomes was based on the overall scarcity of trials. We aimed at conducting more homogeneous subanalyses were possible which were, as mentioned, only feasible for depression and SWL. As more trials accumulate, more fine-grained analyses will become feasible. Fourthly and lastly, the long-term efficacy of PPT remains uncertain due to lack of follow-up assessments.

Conclusion

Our findings indicate that PPT can effectively increase positive outcomes and decrease negative outcomes at post-treatment assessment. However, there is lack of follow-up data and the number of available trials altogether remains scarce precluding many of the planned subanalyses. More research with high methodological rigour and including follow-up assessments is needed to draw firmer and more precise conclusions on PPT efficacy.

Data availability statement

Data sharing not applicable as no datasets generated and/or analysed for this study. We performed a meta-analysis on published und publicly accessible data. No additional data available.

Ethics statements

Patient consent for publication

References

Supplementary materials

  • Supplementary Data

    This web only file has been produced by the BMJ Publishing Group from an electronic file supplied by the author(s) and has not been edited for content.

Footnotes

  • Twitter @HoppenDr

  • Contributors THH and NM conceptualised the meta-analysis conducted the systematic literature search and coding of studies. THH performed the statistical analyses. THH and NM wrote the manuscript and agreed to be accountable for all aspects of the work.

  • Funding The authors have not declared a specific grant for this research from any funding agency in the public, commercial or not-for-profit sectors.

  • Competing interests None declared.

  • Provenance and peer review Not commissioned; externally peer reviewed.

  • Supplemental material This content has been supplied by the author(s). It has not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been peer-reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and responsibility arising from any reliance placed on the content. Where the content includes any translated material, BMJ does not warrant the accuracy and reliability of the translations (including but not limited to local regulations, clinical guidelines, terminology, drug names and drug dosages), and is not responsible for any error and/or omissions arising from translation and adaptation or otherwise.