Objective To evaluate the evidence on the efficacy of psychosocial interventions for improving pregnancy rates and reducing distress for couples in treatment with assisted reproductive technology (ART).
Design Systematic review and meta-analysis.
Data sources PsycINFO, PubMed, EMBASE, CINAHL, Web of Science and The Cochrane Library between 1978 and April 2014.
Study selection Studies were considered eligible if they evaluated the effect of any psychosocial intervention on clinical pregnancy and/or distress in infertile participants, used a quantitative approach and were published in English.
Data extraction Study characteristics and results were extracted and the methodological quality was assessed. Effect sizes (ES; Hedges g) were pooled using a random effects model. Heterogeneity was assessed using the Q statistic and I², and publication bias was evaluated using Egger’s method. Possible moderators and mediators were explored with meta-analyses of variances (ANOVAs) and meta-regression.
Results We identified 39 eligible studies (total N=2746 men and women) assessing the effects of psychological treatment on pregnancy rates and/or adverse psychological outcomes, including depressive symptoms, anxiety, infertility stress and marital function. Statistically significant and robust overall effects of psychosocial intervention were found for both clinical pregnancy (risk ratio=2.01; CI 1.48 to 2.73; p<0.001) and combined psychological outcomes (Hedges g=0.59; CI 0.38 to 0.80; p=0.001). The pooled ES for psychological outcomes were generally larger for women (g: 0.51 to 0.73) than men (0.13 to 0.34), but the difference only reached statistical significance for depressive symptoms (p=0.004). Meta-regression indicated that larger reductions in anxiety were associated with greater improvement in pregnancy rates (Slope 0.19; p=0.004). No clear-cut differences were found between effects of cognitive–behavioural therapy (CBT; g=0.84), mind–body interventions (0.61) and other intervention types (0.50).
Conclusions The present meta-analysis suggests that psychosocial interventions for couples in treatment for infertility, in particular CBT, could be efficacious, both in reducing psychological distress and in improving clinical pregnancy rates.
- Psychosocial intervention
This is an Open Access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/
Strengths and limitations of this study
A major strength of this study is the extensive search of various databases from 1978 to April 2014, as well as a comprehensive methodological assessment.
Further analyses were performed to account for publication bias, yielding conservative effect sizes and thus strengthening the robustness of the estimates.
Heterogeneity and indications of publication bias were observed for several of the outcomes.
A substantial variation of the methodological quality and missing information on fertility and assisted reproductive technology (ART) treatment may limit the interpretability of the outcomes.
Impaired fecundity has become a growing problem for many couples trying to conceive a child, and although not all couples choose to seek medical assistance, more than 10% of the childbearing population has resorted to assisted reproductive technology (ART) to conceive.1–5 Being involuntarily childless and going through various ART procedures imposes considerable stress on the couple, and childlessness is often perceived as a life crisis where the emotional strain equals that found for traumatic events.2 ,6–10 Although infertile couples may be considered mentally healthy in general,11 several studies indicate that coping with infertility is associated with periodically heightened levels of psychological symptoms of distress, depression and anxiety.12 ,13 Feelings of loss, grief, anger and sadness are not uncommon, and women often report bodily disparagement, lack of femininity, shame and self-blame.2 ,14 There is some evidence to suggest that dysregulation in the uterus microenvironment may influence the ability to conceive, for example, oxidative stress and inflammation,15 ,16 which may be promoted by psychological distress.17 ,18 Such findings have led several studies to investigate possible links between mental state and pregnancy outcome.10 ,19–24 Although the results have been mixed, reviews of the literature have generally reached the conclusion that psychosocial factors such as depressive symptoms, anxiety, distress and certain coping strategies are linked to reduced chances of pregnancy.12 ,25 ,26 Two recently published meta-analyses, however, report conflicting results.27 ,28 Whereas one meta-analysis supported the conclusion that emotional distress may be critical to the success of fertility treatment outcome,27 the other did not find sufficient support for this hypothesis.28 The different conclusions could be due to between-study methodological differences, for example, in the chosen measures of distress and definitions of pregnancy (eg, serum positive test, clinical pregnancy or live birth).
Nonetheless, the evidence indicating a considerable psychosocial burden associated with infertility and its treatment has inspired several researchers to explore the effect of various psychosocial interventions in reducing distress, improving quality of life, and thereby possibly optimising the chances of pregnancy. So far, three meta-analyses have reviewed effects of psychological interventions on mental health and pregnancy outcome. Again, the results have been mixed. The first meta-analysis, published in 2003, concluded that psychological intervention appeared to have a beneficial effect on negative emotions,29 particularly anxiety. An effect of counselling was also found for infertility-related distress, whereas no clear effect was seen on pregnancy rates. Although the original systematic review identified 25 independent studies, the final meta-analysis only included 8–10 studies selected on the basis of their methodological quality. The second meta-analysis published in 2005 focused on differences in effects related to intervention format, for example, individual/couple versus group setting.30 Overall, the results suggested that both individual/couple and group interventions were effective in reducing emotional distress as well as increasing the conception rate. In contrast to the two first meta-analyses, which had investigated both controlled and uncontrolled studies, the third meta-analysis from 2009, which only included controlled studies,31 found no evidence for an effect of psychological interventions on emotional distress. An effect, however, was found for pregnancy rates, but only for infertile couples not in ART.
Taken together, while showing promising results, the findings of existing quantitative systematic reviews, the most recent published in 2009, are mixed. The literature within this field is expanding, and studies of new psychosocial intervention approaches building on existing knowledge and targeting specific problems of infertile patients, for example, mind/body interventions (MBIs), internet-based treatments and online psychoeducation programmes, have since been published. Furthermore, the more recently published studies have generally used randomised controlled trial (RCT) designs, a notable strength reducing the risk of bias and making the studies more easily comparable.32 An updated review and meta-analysis is needed to determine to what degree psychosocial interventions may reduce infertility-related distress and improve the chances of pregnancy during fertility treatment.
The present study was conducted in accordance with the preferred reporting items for systematic reviews and meta-analyses (PRISMA) recommendations.33 ,34 An a priori designed study protocol guided the literature search, study selection and data synthesis.
Search strategy and criteria
A comprehensive and systematic search of the literature published between 1978 (first baby born after in vitro fertilisation (IVF)) and April 2014 was conducted, using a sensitive search strategy recommended for reviews by Higgins and Green.35 When conducting the searches, we combined keywords representing the two primary concepts, infertility and psychosocial treatment: (1) “infertil*”, “childlessness”, “IVF”, “ICSI”, “fertility treatment/problems”, “assisted reproduction” and (2) “psychological/psychosocial intervention”, “social support”, “couples therapy”, “psycho-education”, “internet-based intervention” and “behavioral therapy” (for a full search history, see online supplementary appendix 1). We identified relevant records by electronic searches in general medical and psychological databases: PubMed, PsycINFO, The Cochrane Library, EMBASE, CINAHL and Web of Science. Furthermore, we cross-examined reference lists of the retrieved papers and reviews for additional relevant studies. We did not pursue the grey literature or trial registries, and limited our search to include only peer-reviewed articles published in English.
Studies were considered eligible if they (1) reported data on infertile participants; (2) presented data on a psychosocial intervention or a supportive programme; (3) included baseline and postintervention measures of stress, distress or pregnancy outcome and (4) used a quantitative research approach. In general terms, infertility refers to not being able to conceive for more than 1 year without contraception (WHO, 2002). Despite this standard definition, a recent review has found considerable between-study variation in definitions.36 Furthermore, infertility can be graded in relation to clinical diagnosis and duration. The present meta-analysis reviews studies using several different definitions of the term ‘infertile’, and includes all studies of patients diagnosed with different types of infertility and in different types and stages of ART treatments, for example, intrauterine insemination (IUI), IVF and intracytoplasmic sperm injection (ICSI). ‘Psychosocial interventions or supportive programmes’ were defined as all interventions with a psychosocial aim that did not include the prescription of medication or have a primary physical focus, for example, acupuncture or massage therapy. However, studies using ‘psychophysiological’ approaches, for example, relaxation, guided imagery or meditation exercises as part of a psychosocial programme, were included. The interventions could be delivered in an individual-based, group-based, couples-based or internet-based format. We included controlled and uncontrolled trial (UCT) studies, but chose to exclude expert opinion, magazines, commentaries, case reports, editorials, newspaper articles, newsletters and book chapters. Nor did we include abstract-only publications, doctoral theses or conference presentations. Our primary outcome was pregnancy rate, defined as clinical pregnancy. This clinical definition implies a visualisation of at least one gestational sac and fetal heartbeat in approximately the fifth week after fertilisation.
Secondary outcome measures were psychological ratings of depressive symptoms, anxiety, generalised stress, specific infertility stress and interpersonal functioning assessed through self-reported questionnaires.
Data extraction and quality assessment
All full-text articles were read by two independent review authors (IF-V, NGS) and the data were extracted according to predefined criteria. Disagreements were discussed with a third author (YF) and resolved by consensus. If information on any outcome was missing or if clarifications were needed, authors were contacted for further information. Each study was assessed for methodological quality using the Jadad criteria,37 a commonly used tool to evaluate methodological quality, for example, use and adequate description of randomisation and blinding procedures, and description of dropout rates (score range 0–5). In addition to the 0–5 points possible on the original Jadad scale, 1 additional point was given for each of the following: (1) was a control group included, in order to acknowledge whether the intervention group was compared with another group, although randomisation was not used; (2) were both predata and postdata presented, as including preintervention and postintervention data will provide more accurate results; (3) was any form of blinding or masking of conditions to patients or (4) blinding of researchers attempted, acknowledging if the study had attempted to mask the active condition; (5) was a standardised and reliable outcome measure used, a criterion increasing the validity and comparability of the outcomes and (6) were pre–post correlations provided, which could provide better estimates of the effect size (ES). The modified scale yielded a total quality score ranging from 0 to 11. With respect to the modified quality score, the mean score difference between raters 1 and 2 (means (SD) 5.2 (1.8) and 5.6 (2.0)) did not reach statistical significance (t(77)=1.1; p=0.28), and the inter-rater score correlation was r=0.83 (p<0.001). The κ statistic was not used, as it assumes nominal data with no natural ordering of ratings.
Quality ratings were not used as weights when calculating aggregated ES as this is generally discouraged due to the risk of introducing additional bias.38 Instead, associations between ES and study quality indicators were explored with meta-analyses of variances (ANOVAs) and meta-regression (modified quality score). In cases where we were unable to retrieve articles from the authorised databases, authors were contacted between 1 and 3 times in order to amend the data collected.
The ES used were the risk ratio (RR) for pregnancy and Hedges g for psychological outcomes. Hedges g is a variation of Cohen's d which enables correction of potential bias due to small sample sizes.39 ,40 A positive Hedges g indicates a result in the expected direction, for example, a reduction in distress in the intervention group compared with controls. An RR>1.0 indicates a greater proportion of pregnancies in the intervention group. RRs were based on pregnancy rates and total N in the intervention and control groups. When possible, Hedges g was calculated on the basis of reported means and SDs at preintervention and postintervention or means and SDs of change scores. This was possible for 50 of 61 ES. When required and available, the reported pre–post correlations were used in the calculation. This was the case for five ES. When unavailable, the pre–post correlation was set to 0.50. When SDs were unavailable, two approaches were used. For STAI (State-Trait Anxiety Inventory) state anxiety scores, the average pre-SDs and post-SDs (10.9 and 10.8, respectively) for the studies which reported the SD were used, as the SDs appeared to be highly comparable across the remaining studies. For other measures, ES were estimated on the basis of sample size and either the p value or η². In one study reporting only medians,41 the means and SDs were estimated following a previously suggested approach.42
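The two effect size measures described above can be illustrated with a minimal sketch. It uses the standard textbook formulas for Hedges g (two independent groups) and for the risk ratio with a log-scale 95% CI; the function names and the simplification to a two-group comparison of scores are assumptions made for illustration, not the exact routines of the software used in the review.

```python
import math

def change_score_sd(sd_pre, sd_post, r=0.50):
    """SD of pre-to-post change scores. r is the pre-post correlation;
    0.50 is the fallback value stated in the text."""
    return math.sqrt(sd_pre**2 + sd_post**2 - 2 * r * sd_pre * sd_post)

def hedges_g(mean_int, sd_int, n_int, mean_ctl, sd_ctl, n_ctl):
    """Hedges g and its variance for two independent groups.
    Inputs are distress scores (or change scores); g > 0 means
    lower distress in the intervention group than in controls."""
    df = n_int + n_ctl - 2
    sd_pooled = math.sqrt(
        ((n_int - 1) * sd_int**2 + (n_ctl - 1) * sd_ctl**2) / df
    )
    d = (mean_ctl - mean_int) / sd_pooled      # Cohen's d
    j = 1 - 3 / (4 * df - 1)                   # small-sample correction
    var_d = (n_int + n_ctl) / (n_int * n_ctl) + d**2 / (2 * (n_int + n_ctl))
    return j * d, j**2 * var_d

def risk_ratio(preg_int, n_int, preg_ctl, n_ctl):
    """Risk ratio for pregnancy with a 95% CI computed on the log scale.
    RR > 1 means a larger proportion of pregnancies in the intervention group."""
    rr = (preg_int / n_int) / (preg_ctl / n_ctl)
    se_log = math.sqrt(1 / preg_int - 1 / n_int + 1 / preg_ctl - 1 / n_ctl)
    lower = math.exp(math.log(rr) - 1.96 * se_log)
    upper = math.exp(math.log(rr) + 1.96 * se_log)
    return rr, lower, upper
```

For example, with hypothetical counts of 20 pregnancies among 100 intervention participants and 10 among 100 controls, `risk_ratio(20, 100, 10, 100)` gives an RR of 2 with a CI that lies on either side of it.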
Heterogeneity was assessed using Q and I² statistics. Heterogeneity tests are aimed at determining whether results reflect genuine between-study differences (heterogeneity), or whether the variation is due to chance (homogeneity).43 In accordance with recommendations, a p value ≤0.10 was used to determine significant heterogeneity due to the general low statistical power of heterogeneity tests.44 The I² quantity provides a measure of the degree of inconsistency by estimating the amount of variance in a pooled ES that can be accounted for by heterogeneity in the sample of studies.45 I² values of 0%, 25%, 50% and 75% indicate no, low, moderate and high heterogeneity, respectively.
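As a hedged illustration of the two heterogeneity statistics, the sketch below computes Cochran's Q and I² from a list of study effect sizes and their variances using the usual inverse-variance definitions. The function name and the input format are assumptions for this example; the p value for Q (a χ² test with K−1 degrees of freedom) is omitted to keep the sketch dependency-free.

```python
def heterogeneity(effects, variances):
    """Return Cochran's Q, its degrees of freedom, and I2 (in percent).

    effects   -- per-study effect sizes (e.g. Hedges g or log RR)
    variances -- the corresponding sampling variances
    """
    weights = [1.0 / v for v in variances]
    pooled = sum(w * e for w, e in zip(weights, effects)) / sum(weights)
    # Q: weighted sum of squared deviations from the fixed-effect mean
    q = sum(w * (e - pooled) ** 2 for w, e in zip(weights, effects))
    df = len(effects) - 1
    # I2: share of total variation beyond what chance alone would produce
    i2 = max(0.0, (q - df) / q) * 100 if q > 0 else 0.0
    return q, df, i2
```

With two hypothetical studies, `heterogeneity([0.2, 0.8], [0.04, 0.04])` yields Q=4.5 on 1 degree of freedom and an I² of about 78%, which would fall in the 'high heterogeneity' band described above.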
All ES were weighted with the inverse variance and combined with a random effects model. First, the overall ES of the effect of psychosocial interventions on pregnancy rates was calculated. Then the overall ES for the combined psychological outcomes was calculated together with the overall ES for the individual outcome measures of depression, state anxiety, infertility-related distress and marital function. This was performed for the combined sample (women+men). If the results indicated study heterogeneity, and if the number of studies in each category was sufficient (K≥3), possible between-study differences in ES were explored by comparing the ES of studies according to the following study characteristics: gender, study design, intervention type and intervention format (mixed effect meta-ANOVAs), methodological quality (modified quality score), mean age of the sample, intervention duration and number of sessions (mixed effect meta-regression).
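The inverse-variance random effects pooling described here can be sketched as follows. The text does not name the between-study variance estimator, so the DerSimonian-Laird estimator used below is an assumption, as are the function name and input format. For pregnancy outcomes, the inputs would be log RRs, with the pooled value exponentiated afterwards.

```python
import math

def random_effects_pool(effects, variances):
    """Pool effect sizes with a random effects model.

    Assumes the DerSimonian-Laird estimate of the between-study
    variance (tau2); returns (pooled ES, standard error, tau2).
    """
    w = [1.0 / v for v in variances]
    sw = sum(w)
    fixed = sum(wi * e for wi, e in zip(w, effects)) / sw
    q = sum(wi * (e - fixed) ** 2 for wi, e in zip(w, effects))
    df = len(effects) - 1
    c = sw - sum(wi**2 for wi in w) / sw
    tau2 = max(0.0, (q - df) / c) if c > 0 else 0.0
    # Random effects weights add tau2 to each study's sampling variance
    w_star = [1.0 / (v + tau2) for v in variances]
    pooled = sum(wi * e for wi, e in zip(w_star, effects)) / sum(w_star)
    se = math.sqrt(1.0 / sum(w_star))
    return pooled, se, tau2
```

A 95% CI follows as the pooled ES ± 1.96 × SE. Because τ² is added to every study's variance, the random effects weights are more even across studies than fixed-effect weights, and the CI is wider whenever heterogeneity is present.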
Prior to the search, statistical power analyses were conducted as previously recommended.46 On the basis of the findings of the earlier meta-analysis,31 we expected to find an RR of 1.4 for pregnancy rates and an average sample size of N=76. We expected to be able to detect a similar small ES (Hedges g=0.28 or RR=1.4) with an α of 5% and a statistical power of 80%, with a total of only nine studies, using a random effects model. On the basis of these results, we considered it worthwhile to conduct the meta-analysis. The calculations were conducted using Comprehensive Meta-Analysis, V.2 (http://www.meta-analysis.com), IBM SPSS V.20 and various formulas in Microsoft Excel.
The possibility of publication bias, a widespread problem when conducting meta-analyses, was evaluated with funnel plots,47 Egger's method and by calculating fail-safe numbers.48 ,49 A funnel plot is a graphic illustration of study ES in relation to study size or precision. Egger's test provides a statistic for the skewness of results.50 Calculation of fail-safe numbers is aimed at achieving an indication of the number of unpublished studies with null findings that would reduce the result to statistical non-significance (p>0.05). It has been suggested that a reasonable level is achieved if the fail-safe number exceeds 5K+10 (K=N studies in the meta-analysis).51 If the results were suggestive of publication bias, an adjusted ES was calculated using Duval and Tweedie's52 trim and fill method, which imputes ES of missing studies and recalculates the ES accordingly.
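The fail-safe criterion can be made concrete with a short sketch. The text does not specify which fail-safe formula was used, so Rosenthal's classic version (based on summed study z scores) is assumed here; the 5K+10 threshold is the one quoted above. Function names and the default critical value are illustrative assumptions.

```python
def fail_safe_n(z_scores, z_crit=1.645):
    """Rosenthal-type fail-safe N (assumed formula): the number of
    unpublished null-result studies needed to make the combined result
    non-significant. z_crit=1.645 is the one-tailed 5% critical value;
    pass 1.96 for a two-tailed criterion."""
    k = len(z_scores)
    return max(0.0, sum(z_scores) ** 2 / z_crit**2 - k)

def exceeds_tolerance(n_fs, k):
    """True if the fail-safe number exceeds the 5K+10 level,
    where k is the number of studies in the meta-analysis."""
    return n_fs > 5 * k + 10
```

For instance, with K=10 studies the tolerance level is 60, so a pooled result would be described as robust under this rule only if more than 60 hidden null studies would be needed to overturn it.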
In a first screening, duplicates were identified, and titles and abstracts reviewed. A total of 157 studies were found potentially relevant and reviewed independently by two raters. Four articles could not be retrieved due to the university's ‘no access’ policy, and the authors did not respond to our enquiries.53–56 Initially, the raters were uncertain or disagreed on 13 (8.3%) articles (inter-rater agreement 0.78; p<0.001 (κ statistic)) indicating ‘substantial agreement’.57 After negotiation, 5 of these were included, resulting in 41 potentially eligible articles. One additional study was excluded due to the combination of psychological intervention with a psychoactive drug, and one study had insufficient statistical data and the authors did not respond to our enquiry. We thus included a total of 39 studies in the present review. On three occasions, the authors provided unpublished additional data.58–60 Figure 1 shows a flow chart of the study selection process.
The study characteristics are summarised in table 1. Based on the outcome, 29 of the studies were aimed at reducing negative emotional distress,41 ,58–85 with the targeted outcomes being infertility-related distress (k=10), depression (k=21), anxiety (k=25) and marital function (k=5). Five studies focused solely on the outcome of pregnancy,86–90 and five had included distress as well as pregnancy as the outcome.78 ,91–94 Twenty-one studies were RCTs,58 ,61 ,65–72 ,74 ,75 ,83 ,85 ,89–95 and 10 were non-RCTs (NRCTs),41 ,59 ,60 ,76 ,79 ,80 ,86–88 ,96 with most control groups receiving standardised care or being waiting list controls. Only three studies had included an active/attention control condition, for example, non-emotional writing or receiving an information booklet.70 ,71 ,74 One study offered gift certificates to the control group participants if they responded to the follow-up questionnaires.89 Relatively few studies were UCTs (k=8).62–64 ,73 ,77 ,81 ,82 ,84 The reporting of the participants’ medical treatment status was inconsistent. Five studies did not provide information on treatment status (whether or not in current ART treatment), 3 reported that some, but not how many, of the participants were in treatment, and 31 reported that their participants were currently in ART treatment, although not what kind of treatment, for example, IUI, IVF/ICSI or treatment cycle. The cause of infertility was also inconsistently reported, and some participants may still have been under evaluation during the study period. Twenty-five studies had included only women, while the remaining 14 had included both women and men. The included studies had reported data for a total of 3401 participants (3064 women and 347 men). The mean age and mean duration of infertility for intervention group participants were 32.7 years (SD 2.2) and 4.6 years (SD 2.1), and for control group participants 32.6 years (SD 1.7) and 5.1 years (SD 3.0), respectively.
The specific intervention strategies mostly employed were cognitive–behavioural therapy (CBT; k=8) and MBI (k=12). The remaining studies had used a variety of interventions, including stress management, hypnosis, art therapy, expressive writing intervention, crisis intervention and various types of counselling. Some studies had included more than one approach, for example, cognitive–behavioural approaches supplemented with mind–body techniques such as relaxation. To be categorised as MBI, a study had to use such strategies as the general approach over the course of intervention. Thus, if studies had mainly used CBT strategies and only incorporated other approaches, for example, relaxation exercises, in one or two sessions, they were categorised as CBT interventions. The number of sessions ranged from 1 to 24, lasting approximately from 20 min to 3 h, and the duration of psychosocial intervention ranged from 1 week to 28 months.
A total of 15 studies reported the number of participants at baseline and then again at follow-up, and as seen in table 1, the number of dropouts varied across studies. Although the dropout rates in the intervention groups were somewhat higher (mean 30.5% (SD 20.2)) than in controls (24.9% (24.8)), the difference did not reach statistical significance (t(28)=0.68, p=0.50). Furthermore, only four studies explicitly stated that the analysis was based on an intention-to-treat (ITT) approach.70 ,72 ,83 ,92 Two additional studies used methods comparable to ITT, for example, carrying last (baseline) observations forward or use of multilevel linear modelling.69 ,97 Four studies stated that there were no differences between completers and dropouts without specifying this further,41 ,64 ,81 ,85 and the remaining studies failed to report whether there were dropouts or how such missing data were dealt with. The possible association between ES and uneven dropouts in the intervention and control groups was analysed for the 15 studies that reported dropouts by regressing the difference in dropout rates on the overall ES across all outcomes. The result indicated that larger dropout rates in the intervention group compared with the control group were generally associated with smaller ES (Slope=−0.02), but the association did not reach statistical significance (p=0.268).
All included studies were methodologically assessed with the original Jadad scale and the additional methodological criteria. The original Jadad scores ranged from 0 to 4 with a mean of 2.28 (SD 1.36), and the modified total quality scores ranged from 1 to 10 with a mean of 5.36 (SD 2.05). The main methodological issue was that only very few studies attempted to blind or mask the intervention conditions to either patients or researchers. The quality ratings for each criterion for each study and total scores are shown in table 2.
Effects of psychosocial intervention
The results of the meta-analyses are shown in table 3.
A statistically significant and robust ES (RR=2.01) was found for the 10 studies which had investigated effects of psychosocial intervention on clinical pregnancy rates, with the chance of becoming pregnant being doubled in the intervention group. Adjusting for possible publication bias, the RR was somewhat lower (1.57). A forest plot of the effects of psychological intervention on pregnancy outcomes is shown in figure 2.
Combined psychological outcomes
Combining the ES of the 35 studies which had included one or more psychological outcomes revealed a statistically significant, robust,51 medium39 ES (g=0.59). The results indicated possible publication bias (skewed funnel plot, Egger's test (p<0.05)) in favour of larger published ES. When imputing missing ES,52 the resulting adjusted pooled ES was smaller (0.31) but remained statistically significant. Taking gender into consideration, the ES (0.51) remained statistically significant for women, still suggesting a robust effect. The ES was smaller for men (0.34) and did not reach statistical significance. A forest plot of the effects of psychological intervention on the combined psychological outcomes is shown in figure 3.
Only 10 studies had included infertility-related distress as an outcome. Small ES were found for women and men combined (0.24) and women alone (0.37) and did not reach statistical significance.
Twenty-one studies had assessed depressive symptoms. A statistically significant ES (1.00) was found for women and men combined. However, when adjusting for possible publication bias, the results changed dramatically to a small, non-significant ES of 0.31. Similar results were found for women alone with a statistically significant ES of 0.73, which was reduced to a non-significant 0.29 after adjusting for possible publication bias. For men alone, the ES (0.13) did not reach statistical significance.
Twenty-five studies had included state anxiety as an outcome. A statistically significant, robust medium ES (0.51) was found for women and men combined. Adjusting for possible publication bias led to a smaller but statistically significant ES (0.31). For women, the ES of 0.53 was statistically significant, but smaller (0.32) and non-significant when adjusting for publication bias. For men only, the analysis produced a small, non-significant ES of 0.32.
Only five studies (N=633) had included measures of marital function, but only very small (ES 0.09–0.08), non-significant effects were found.
As the Q statistics were generally statistically significant (p<0.10) and the I² statistic indicated low-to-medium heterogeneity, when a sufficient number of studies were available for each analysis, we explored possible sources of heterogeneity and analysed whether the ES for pregnancy and combined psychological outcomes varied according to between-study differences in study design and intervention characteristics (type and format). The results are shown in table 3.
The ES found for pregnancy outcomes were statistically significant for RCTs (RR=1.7) and NRCTs (2.8), with the ES for NRCTs being considerably smaller (1.9) when adjusting for publication bias. The difference did not reach statistical significance. For psychological outcomes, statistically significant results were found for RCTs (g=0.70) and UCTs (0.55), but not for NRCTs (0.28). When adjusting for publication bias, the ES for RCTs was considerably reduced (0.26). Furthermore, between-group differences did not reach statistical significance.
The number of studies for each intervention type was insufficient to explore differences in pregnancy outcomes. For the combined psychological outcomes, statistically significant and, as indicated by the large fail-safe numbers, robust effects were found for all three intervention categories with the largest ES found for CBT (g=0.84), followed by MBI (0.61) and other intervention types (0.50). The between-group differences did not reach statistical significance. Furthermore, the results suggested the possibility of publication bias, and when adjusting for publication bias, all three ES were reduced from medium to small.
For pregnancy outcomes, the number of studies was sufficient for group and individual formats. Both formats yielded statistically significant ES (RR 2.03 and 1.65), and the between-group difference did not reach statistical significance. For the combined psychological outcomes, a statistically significant effect was found for the group format (g=0.76; p<0.001). The ES for the individual, couples and online formats did not reach statistical significance. The overall between-group difference for intervention formats was statistically significant (p<0.001).
Other study characteristics
The possible moderating influence of the continuously assessed study characteristics of mean age, intervention duration, number of sessions and study quality (modified quality scores) was analysed with meta-regression. As seen in table 4, no significant effects were found for any of the moderators for either pregnancy or the combined psychological outcomes. A total of six studies had examined the effects on pregnancy and anxiety. When examining the possible role of anxiety reduction as a mediator of the effect on pregnancy outcome with meta-regression, a statistically significant association was found between the ES for anxiety and pregnancy, indicating that the greater the reduction in anxiety, the greater the likelihood of achieving pregnancy (see table 4).
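The idea behind the meta-regression slope can be sketched as a weighted least squares fit of one study-level quantity on another, for example the pregnancy ES on the anxiety ES. This is a simplified illustration only: it uses fixed weights supplied by the caller and returns no standard error or p value, whereas the review reports a mixed effect meta-regression. The function name and inputs are hypothetical.

```python
def weighted_slope(x, y, weights):
    """Weighted least squares fit of y on x.

    x       -- study-level predictor (e.g. ES for anxiety)
    y       -- study-level outcome (e.g. log RR for pregnancy)
    weights -- study weights (e.g. inverse variances of y)
    Returns (slope, intercept).
    """
    sw = sum(weights)
    x_bar = sum(w * xi for w, xi in zip(weights, x)) / sw
    y_bar = sum(w * yi for w, yi in zip(weights, y)) / sw
    sxy = sum(w * (xi - x_bar) * (yi - y_bar)
              for w, xi, yi in zip(weights, x, y))
    sxx = sum(w * (xi - x_bar) ** 2 for w, xi in zip(weights, x))
    slope = sxy / sxx
    return slope, y_bar - slope * x_bar
```

A positive slope in such a fit corresponds to the pattern reported here, where studies with larger reductions in anxiety also tended to show larger effects on pregnancy. With only six studies contributing, any such slope should be read cautiously.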
Our meta-analysis of the available evidence suggests that women who receive some form of psychological intervention are approximately twice as likely to become pregnant when compared with controls receiving standardised care or active control intervention. Although the results of the 10 currently available studies taken together appeared robust, there were some indications of publication bias in favour of studies with larger positive ES. It should also be noted that the precision of the ES estimate is limited, with possible RRs ranging from approximately 1.5 to 2.7. Furthermore, although the between-group difference did not reach statistical significance when disregarding the possibility of publication bias, NRCTs yielded greater effects (RR 2.8 (95% CI 1.55 to 5.06)) than RCTs (RR 1.7 (95% CI 1.17 to 2.40)). Compared with other types of interventions that historically have been introduced to improve pregnancy rates in ART (improved culture media, new hormone stimulation regimens, etc), even an effect corresponding to the lower limit of the CI is substantial. While the results could be considered surprising, the available data do not provide any clear-cut reasons to reject this finding, which is further supported by the results of the meta-regression showing that larger reductions in anxiety were associated with improved pregnancy outcomes. With respect to the psychological outcomes currently reported in the literature, the results suggest that psychological intervention could be effective in reducing anxiety (25 studies) as well as depressive symptoms (21 studies) with the effects corresponding to medium and large ES (0.5 and 1.0). As seen for pregnancy outcomes, there were indications of publication bias in the direction of larger positive effects, and adjusting for publication bias resulted in a considerably smaller, statistically non-significant, ES for depressive symptoms. 
The pooled results did not reach statistical significance for the 10 studies which had investigated effects on infertility-related distress and the 5 studies which had included measures of marital function.
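The comparison between NRCTs and RCTs reported above can be roughly checked from the published figures alone. The sketch below assumes that the reported CIs are symmetric on the log scale and uses a simple z test on the difference in log risk ratios; this is an approximation based on rounded values, not the meta-ANOVA the authors describe, and the function names are hypothetical.

```python
import math

def log_rr_and_se(rr, lower, upper, z_crit=1.96):
    """Recover log(RR) and its standard error from a reported 95% CI."""
    se = (math.log(upper) - math.log(lower)) / (2.0 * z_crit)
    return math.log(rr), se

def compare_subgroups(a, b):
    """z test for the difference between two independent log risk ratios.

    a and b are (RR, lower CI limit, upper CI limit) tuples.
    """
    log_a, se_a = log_rr_and_se(*a)
    log_b, se_b = log_rr_and_se(*b)
    diff = log_a - log_b
    se_diff = math.sqrt(se_a ** 2 + se_b ** 2)
    z = diff / se_diff
    p = math.erfc(abs(z) / math.sqrt(2.0))  # two-sided normal p value
    return z, p

# Values as reported in the text: NRCTs versus RCTs
nrct = (2.8, 1.55, 5.06)
rct = (1.7, 1.17, 2.40)
z, p = compare_subgroups(nrct, rct)
print(f"z={z:.2f}, p={p:.3f}")
```

Under these assumptions the test gives approximately z=1.4 and p=0.16, which is consistent with the statement that the difference between NRCTs and RCTs did not reach statistical significance.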
Comparing with results of previous reviews
The present review included 39 studies with a total of 3401 women (3064) and men (347). The participants received various psychosocial interventions lasting from 1 week to 6 months, including CBT, emotional disclosure, psychoeducation and MBIs. The present review evaluates almost twice the number of studies included in the most recent previous review,31 which reported mixed results of the efficacy of psychosocial intervention. Whereas the former review found no evidence for attenuating distress, there was promising support of psychological intervention increasing pregnancy chances for women not receiving ART.31 In line with the second review from 2005,30 we found more credible results for group intervention than for other formats, for example, online, individual and couples interventions.30 The first review published in 2003 also highlighted group interventions as more effective, especially if the interventions emphasised education and skills training, such as relaxation. Our results concurred with these earlier observations, suggesting that interventions delivered in groups may be more effective in reducing distress. Moreover, although the comparison did not reach statistical significance, prior to adjusting for publication bias, the intervention type of CBT appeared to be more effective than MBI and other types of interventions. It should be noted that the categorisation of interventions may be somewhat ambiguous. For example, the study by Cousineau et al83 could have been categorised as an MBI, as the authors had provided a website that directed attention towards relaxation exercises. However, as there was no reporting of whether the participants were engaged in weekly or daily training, we chose to interpret relaxation as an optional feature, and hence the study was not categorised as an MBI. 
The possible ambiguity and considerable variability in interventions forced us to categorise many studies as ‘other’, which limits our understanding of the possible mechanisms in psychosocial interventions. Taken together, the available data do not provide a clear basis for understanding the possible differences between effects of different intervention types, and the results should be interpreted with caution. The more recently conducted studies included in the present review have contributed by increasing the size of the available data set considerably, and taken together, the currently available evidence suggests that offering psychosocial interventions may improve both the chances of pregnancy and the quality of life for infertile patients going through fertility treatment.
Strengths and limitations
Our systematic review and meta-analysis has several strengths. We conducted a comprehensive search and performed the review in accordance with the recommended guidelines.34 In order to limit the possibility of selection bias, we encouraged authors of eligible studies to elaborate on their results if the data reported were insufficient, and asked authors of papers written in a foreign language to submit their results to us in English. The included studies represented a range of different countries, used comparable outcome measures, and provided fairly comprehensive descriptions of the interventions studied. In addition, we conducted a detailed evaluation of the methodological quality in order to detect any issues that could possibly affect the accuracy of the ES calculated. While not all characteristics, in particular reproductive characteristics, could be assessed, most general methodological aspects were covered. We also explored heterogeneity and made adjustments for possible publication bias, when required.
Some limitations of the currently available data should also be noted. First, the samples investigated may not have been as homogeneous as could be wished for. A small number of infertile participants did not receive treatment with ART, and furthermore, it was not consistently reported what type of ART procedure the participants received, what phase of treatment they were in, or the causes of infertility. This information is clearly important when interpreting the outcomes, and unknown between-study and within-study between-group differences, for example, in numbers of cycles, idiopathic infertility and embryo transfer, may have influenced the results, in particular for pregnancy outcomes. However, such differences are likely to be less important in RCTs, where randomisation is expected to reduce their influence. Although the difference did not reach statistical significance, RCTs reported smaller ES for pregnancy outcomes than NRCTs, which could be interpreted as supporting the concern that infertility and treatment characteristics may have been unevenly distributed between psychological treatment arms, thus increasing the risk for misattribution of outcomes to intervention, at least for NRCTs. On the other hand, we found no statistically significant associations between study quality scores and either pregnancy or psychological outcomes, no statistically significant differences in dropout rates between intervention and control groups, and, as suggested by the large fail-safe numbers, the improvements generally appeared quite robust. A second possible limitation is the high level of heterogeneity indicated by the Q and I2 statistics, and the pooled ES reported in the present review should thus be viewed as an estimate of the average expected effect across a wide range of different settings.
A third issue is that the considerable dropout rates and lack of ITT analyses may have influenced the results, and it cannot be excluded that fertility-related and treatment-related factors such as non-optimal fertilisation, small number of eggs, etc, may have demotivated some participants and made them drop out of the study, while individuals who progressed through the treatment phases with more satisfactory outcomes were more likely to complete the study. Fourth, the indications of publication bias found for several results suggest the possibility of a ‘file drawer problem’, that is, the existence of relevant unpublished null findings, a common problem when conducting systematic reviews. Finally, owing to inconsistencies in the reporting of causes of infertility, we are unable to evaluate the possible associations between ES and causes of infertility. Although meta-analysis remains the gold standard when evaluating the current evidence within a field of research, as is often the case with systematic reviews, qualitative as well as quantitative, the overall level of the evidence reported in our review may be challenged by publication bias and the heterogeneity and methodological limitations of the available published studies.
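The heterogeneity statistics referred to above (Q and I2) and a random effects pooled estimate can be sketched as follows. The sketch assumes the DerSimonian-Laird estimator of the between-study variance, which is a common choice but is not confirmed by the text as the authors' method. The function name and the effect sizes are hypothetical and do not come from the included studies.

```python
import math

def random_effects_summary(effects, variances):
    """Cochran's Q, I2 (%) and a DerSimonian-Laird random effects estimate."""
    if len(effects) < 2:
        raise ValueError("at least two studies are required")
    w = [1.0 / v for v in variances]
    sw = sum(w)
    fixed = sum(wi * yi for wi, yi in zip(w, effects)) / sw
    # Q: weighted squared deviations from the fixed-effect estimate
    q = sum(wi * (yi - fixed) ** 2 for wi, yi in zip(w, effects))
    df = len(effects) - 1
    i2 = max(0.0, (q - df) / q) * 100.0 if q > 0 else 0.0
    # Between-study variance (tau^2), truncated at zero
    c = sw - sum(wi ** 2 for wi in w) / sw
    tau2 = max(0.0, (q - df) / c)
    w_re = [1.0 / (v + tau2) for v in variances]
    pooled = sum(wi * yi for wi, yi in zip(w_re, effects)) / sum(w_re)
    se = math.sqrt(1.0 / sum(w_re))
    return {"Q": q, "df": df, "I2": i2, "tau2": tau2,
            "pooled": pooled, "se": se,
            "ci": (pooled - 1.96 * se, pooled + 1.96 * se)}

# Hypothetical Hedges g values and variances (illustration only)
g = [0.15, 0.40, 0.62, 0.95, 1.30]
var_g = [0.04, 0.05, 0.03, 0.06, 0.08]
summary = random_effects_summary(g, var_g)
print(summary)
```

When I2 is high, the random effects weights become more equal across studies and the CI widens, which is why the pooled ES is best read as an average across differing settings and not as a single common effect.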
Clinical and practical implications
We found evidence for improvement in general psychological symptoms such as anxiety and depression, but not for infertility-specific distress. A possible explanation for the latter could be the lack of sensitivity of the infertility-related distress measures used. The questions used in these measures are directly concerned with thoughts and feelings about involuntary childlessness, and rumination about the involuntary childlessness may persist even when psychosocial intervention improves general psychological well-being. Of particular interest is the result of our meta-regression analysis of the six studies which had included both pregnancy and anxiety as outcomes, showing that larger reductions in anxiety were associated with greater chances of pregnancy. Anxiety is a state of arousal, which over time is physically and mentally stressful for the individual.17 Reducing distress, anxiety in particular, may increase the physiological ability to cope with stress and advance the possibility of impregnation. We found no association between mean age and pregnancy outcomes, which may seem surprising, since age is the most important predictor of pregnancy outcomes of ART.99,100 However, our meta-regression was conducted for the mean age of the samples, and the mean age across study samples showed little variation (mean age 32.7; SD 2.4). The rather narrow age interval across study samples may explain the apparent lack of association between age and chance of pregnancy. Our findings also suggest that group interventions appear to be more efficacious than individual, couples or online interventions. There could be various reasons for this.
First, group interventions had a longer duration (mean 9.5 weeks) and involved more sessions (8.3) than individual interventions (mean 5.3 and 4.4), and second, there is evidence of a positive impact of ‘group settings’, that is, the sense of community between participants, reducing the feelings of isolation or alienation and sharing with individuals in the same life situation, etc.101–104
Recommendations for future research
Despite the overall positive effects of psychosocial interventions found in the literature, our results suggest a need for further studies with more rigorous methodology, including more strict reporting of causes of infertility, the types of ART used, and which phases of treatment the participants are in. Also, most of the studies were conducted in high-income countries; it is therefore important to note that the assertions made here cannot be generalised to low-income and developing countries. There is thus a need for research in low-income or developing countries as well. Another aspect pertaining to generalisability is the challenge of comparing volunteering infertile participants in psychosocial efficacy studies with the general population of infertile individuals. The response rates in this area are moderate, and it seems important in future studies to explore and compare characteristics of dropouts and completers, as well as of non-responders and responders. Furthermore, it would be of importance to develop clinically meaningful categories of distress with the purpose of improving interventions targeted to the various types and levels of distress experienced by the participants. Psychological well-being/distress fluctuates over time during fertility treatment and a stepped care approach could be potentially valuable in this population.105 It is also possible that interventions aimed at relieving distress conducted at different phases in treatment may obtain different psychological outcome results. This calls for improved reporting and comparability of the timing of psychosocial interventions and greater precision and comparability of the timing of outcome assessments. Also needed are studies testing specific hypotheses concerning possible moderating and mediating mechanisms of the effects of interventions on distress and pregnancy outcomes. 
For example, which psychosocial factors do we need to target to optimise effects on distress and pregnancy rates, and which biomarkers affected by psychosocial interventions, for example, oxidative stress, inflammatory processes, can best explain the observed effects? This could assist in developing a more solid evidence base providing better guidance for patients, health professionals and policymakers about ‘what works for whom’ in infertile patients.
In conclusion, the present meta-analysis of 39 studies suggests that psychosocial interventions, in particular CBT and MBIs, are beneficial for reducing distress and for improving pregnancy outcomes of ART. Moreover, there is some preliminary evidence to suggest that reduction in anxiety achieved through psychological intervention may improve the chance of pregnancy. Despite the robust overall effect found, the considerable heterogeneity of the available studies with respect to methodological quality, intervention type and format still warrants caution as to the conclusions which can be drawn.
The authors would like to thank Gina Bay, Librarian at the Library, Department of Psychology and Behavioral Sciences, School of Business and Social Sciences, Aarhus University for valuable and tireless assistance throughout the database search process. They would also like to give their most grateful thanks to the researchers who provided them with additional data on their studies or aided them in other ways.
Contributors RZ and YF designed the protocol. YF developed search strategies and IF-V, NGS and YF performed the searches and study selection. YF and IF-V undertook data extraction and quality assessments. RZ was responsible for analysing the data. YF drafted the manuscript and IF-V, NGS, HJI and RZ contributed to the manuscript throughout the process. All authors provided inputs and were involved with the interpretation of this review. YF is the study guarantor.
Funding YF was supported by a research grant from The Danish Agency for Science Technology and Innovation.
Competing interests None.
Provenance and peer review Not commissioned; externally peer reviewed.
Data sharing statement Extra data can be accessed via the Dryad data repository at http://datadryad.org/ with the doi:10.5061/dryad.kv50v.