Objectives Evidence comparing the effectiveness of surgical and conservative treatment of symptomatic lumbar disc herniation is controversial. We sought to compare short-term and long-term effectiveness of surgical and conservative treatment in sciatica symptom severity and quality of life in patients with lumbar disc herniation in a routine clinical setting.
Methods A prospective cohort study of a routine clinical practice registry consisting of 370 patients. Outcome measures were the North American Spine Society questionnaire and the 36-Item Short-Form Health Survey to assess patient-reported back pain, physical function, neurogenic symptoms and quality of life. Primary outcomes were back pain at 6 and 12 weeks. Standard open discectomy was assessed versus conservative interventions at 6, 12, 52 and 104 weeks. We filled in missing outcome variable values with multiple imputation, accounted for repeated measures within patients with mixed-effects models and adjusted baseline group differences in relevant prognostic indicators by inverse probability of treatment weighting.
Results Surgical treatment patients reported less back pain at 6 weeks than those receiving conservative therapy (−0.97; 95% CI −1.89 to −0.09), were more likely to report ≥50% decrease in back pain symptoms from baseline to 6 weeks (48% vs 17%, risk difference: 0.34; 95% CI 0.16 to 0.47) and reported less physical function disability at 52 weeks (−3.7; 95% CI −7.4 to −0.1). The other assessments showed minimal between-group differences with CIs, including the null effect.
Conclusions Compared with conservative therapy, surgical treatment provided faster relief from back pain symptoms in patients with lumbar disc herniation, but did not show a benefit over conservative treatment in midterm and long-term follow-up.
This is an Open Access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/
Statistics from Altmetric.com
If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.
Strengths and limitations of this study
We included in the present study consecutively sampled patients from a routine clinical practice registry who were followed-up until 2 years after treatment. Thus, as opposed to randomised controlled trials of head-to-head comparisons between surgical and conservative treatment for lumbar disc herniation, which are hampered by a large number of crossovers and dropouts, our results are directly generalised to routine clinical settings.
By using inverse probability weighting, we were able to minimise the risk of confounding by indication to mimic results of a randomised controlled experiment, while maintaining the generalisability of observational studies.
Because of the observational nature of our investigation, results of the present investigation must be carefully interpreted because of the risk of residual confounding by indication.
Sciatica is one of the most debilitating types of pain emanating from the low back, with a lifetime incidence of ∼30%.1 ,2 Sciatica is a disorder caused by pressure on or irritation of the nerve root. Main symptoms and signs include unilateral leg pain that is worse than concomitant low back pain, pain radiating beyond the knee, decreased muscle strength in a myotomal distribution and sensory deficits in a dermatomal distribution.3 ,4 Compared with patients with localised low back pain only, those with sciatica generally have more persistent and severe pain, worse prognosis, consume more healthcare resources and are disabled and absent from work for a longer period of time.1
Lumbar disc herniation is considered to be one of the main causes of sciatica, and lumbar discectomy is the most popular surgical procedure performed in patients with sciatica in the USA.1 ,2 ,5–7 Lumbar disc herniation also occurs in asymptomatic patients, and often spontaneously regresses without surgery.8–10 Conservative treatment, including physical therapy, pharmacological treatment and infiltrations, is an alternative approach for symptomatic patients; 90% of sciatica cases due to lumbar disc herniation resolve with conservative measures.5 ,9 Conservative treatment of lumbar disc herniation has a lower risk of complications than surgery and is preferred by the vast majority of patients.11
Several studies have compared the effectiveness of surgical and conservative treatment in patients with lumbar disc herniation associated with sciatica, but methodological aspects limit interpretation of their results. Observational cohort studies have typically differed in important baseline prognostic indicators between treatment group and their results were thus more prone to confounding.12–14 Randomised controlled trials (RCTs) are less prone to generate confounded results. However, in RCTs comparing surgical with conservative treatment, a large proportion of patients randomly allocated to conservative treatment actually received surgical treatment right after randomisation or after an initial period of conservative treatment (26–54%).7 ,15–17 Therefore, RCTs are actually mainly comparing early surgery with conservative treatment and delayed surgery in selected patients, as was referred to by Peul et al.18 ,19 In addition, some researchers have questioned whether patients willing to participate in RCTs of surgery versus conservative treatment are representative of patients commonly seen in clinical practice.20–23
In order to present results that are more representative of routine clinical care while minimising the risk of confounded results, we conducted a properly sized observational cohort study in a routine clinical setting using consecutive sampling, in which baseline differences in prognostic indicators were accounted for in an analysis with inverse probability weighting closely mimicking an RCT, with the aim of comparing the effect of surgical and conservative treatment on sciatica symptom severity and quality of life in patients with lumbar disc herniation.
This was a prospective observational cohort study based on the routine clinical practice registry of the Neurosurgery and Rheumatology Departments of the Cantonal Hospital in Aarau, Switzerland. All eligible patients were consecutively invited to participate in the study. Recruitment took place from May 2003 to December 2007.
Ethics and consent
This study is part of a quality management programme on anonymised patients; therefore, no institutional review board approval is required in Switzerland.
Patients were considered eligible if they were at least 18 years old, were diagnosed with symptomatic low-back pain due to lumbar disk herniation and associated radicular pain and showed signs of nerve root irritation (positive straight leg raise or femoral nerve tension tests) and/or neurological deficits (asymmetrical depressed reflexes or motor or sensory deficits in corresponding myotomal or dermatomal distribution) requiring hospitalisation. Diagnosis was verified by advanced spinal imaging (MRI or CT) with disk herniation at a level and side corresponding to the clinical symptoms and physical findings. The study population included all patients willing to participate in a standardised clinical follow-up programme comprising consultations and patient-based outcome measures, and who had outcome data available at 6 or 12-week follow-up measurement.
The assignment to treatment interventions was decided by physicians based on patients' clinical indications. Surgical treatment was a standard open discectomy as described by Delamarter and McCulloch and by Spengler, with examination of the involved nerve root performed using a microscope, with the patient under general anaesthesia and in the knee–chest position.24 ,25 After a midline incision, the paraspinous muscles were reflected and the interlaminar space was entered.24 If necessary, the medial border of the superior facet was removed to provide an unobstructed view of the involved nerve root. Using a small annular incision, the herniated disc fragment was removed, the spinal canal was inspected and the foramen and recessus probed for residual disk or bony pathology.25 The nerve root was then decompressed, with the purpose of leaving it freely mobile.
The conservative treatment consisted of ergonomic instruction, active physical therapy, education/counselling with instructions for home-based exercise, and non-steroidal anti-inflammatory drugs if tolerated. Patients with insufficient analgesic response were prescribed additional opioids. Those with an inadequate response to opioids were offered epidural infiltrations, CT-guided periradicular infiltrations26 and, in the case of continued inadequate response or recurrence, CT-guided pulsed radiofrequency therapy of the affected nerve root.27 If conservative treatment failed, which was ascertained on a case-by-case basis, surgery was provided as an option.
Sciatica symptom severity was assessed using the North American Spine Society (NASS) questionnaire, and quality of life was assessed using the 36-Item Short-Form Health Survey (SF-36). The primary outcome measures were the changes in score from baseline to weeks 6 and 12 as assessed by the back pain subscale of the NASS questionnaire using a scale from 0 to 10.28 Secondary outcome measures included the NASS neurogenic symptoms subscale (which addresses leg or foot pain, numbness and tingling on a scale from 0 to 30), the NASS function subscale (which addressed disability because of pain on a scale from 0 to 45), the SF-36 V.1 physical and mental subscales (scale from 0 to 100)29 and the proportion of patients responding to treatment, defined as a 50% reduction in baseline scores of the NASS pain subscale. Lower scores indicate a better outcome for the NASS questionnaire, and a worse outcome for the SF-36. The validated German language versions of the NASS30 and SF-3631 questionnaires were used in an audiovisual touchscreen version (Qualitouch, Zürich, Switzerland).32
All outcomes were prospectively assessed at baseline and at 6, 12, 52 and 104 weeks. Outcome measures were specified prior to statistical analysis.
Baseline and efficacy data are presented as counts and percentages for dichotomous variables and as means and SDs for continuous variables. Between-group comparisons of baseline data were performed using Pearson's χ2 for dichotomous variables and Student's t-test for continuous variables. Only patients with complete primary outcome data, that is, NASS back pain assessed at 6 or 12-week follow-up, were considered in the analysis. We accounted for missing data by using multiple imputation with baseline efficacy variables, age, body mass index, gender, social status, employment status, country of origin and treatment group as explanatory variables in the imputation model, to create 20 imputed data sets. For each patient, we estimated propensity scores for receiving surgical treatment using a probit model that included baseline efficacy variables, age, body mass index, gender, social status, employment status and country of origin as explanatory variables. Propensity scores were then used to derive inverse probability of treatment weights, with the inverse of the propensity score as analytic weights for patients in the surgical group and the inverse of 1 minus the propensity score for patients in the conservative group.33 ,34 To account for repeated measures within patients across multiple follow-up assessments, we used linear or logistic mixed-effects models adjusted for the inverse probability of treatment weighting to derive, for each outcome measure at each follow-up time, group-specific means or proportions with 95% CIs and between-group differences in means or proportions with 95% CI. Statistical analyses were performed with STATA release 12.1 (Stata Corp, College Station, Texas, USA). All p values are two-sided.
Study flow and patient characteristics
Three hundred and seventy patients were consecutively sampled and assigned to surgical (n=297) or conservative (n=73) treatment (figure 1). Table 1 shows baseline clinical characteristics; patients receiving surgical treatment tended to have more severe neurogenic symptoms at baseline (p=0.098) and were more likely to be Swiss citizens (p≤0.001) from a higher social class (p=0.065). Adjusted p values in table 1 indicate that there is no evidence of significant differences between groups for all variables at baseline after adjustment using inverse probability weighting (p≥0.72).
Table 2 shows that 6 weeks after the end of treatment, patients in the surgical group had less pain than patients in the conservative group (−1.0, 95% CI −1.9 to −0.1)). However, we observed a constant decrease in between-group differences in pain scores in all subsequent follow-up assessments, with CIs overlapping the null effect (table 2 and figure 2). Similarly, 34% (95% CI 16% to 47%) more patients in the surgical group responded to treatment at 6 weeks after the end of treatment, but the 95% CI for between-group comparison in all subsequent follow-up assessments included the null effect (table 2).
NASS neurogenic symptoms and NASS function
The neurogenic symptoms of patients in the surgical group tended to improve faster (6 and 12 weeks: −3.5, 95% CI −7.7 to 0.7), but we observed no difference in the long term (2 years: −1.3, 95% CI −6.3 to 3.7). There was no difference between groups in physical function in the first follow-up assessment at 6 weeks (0.7, 95% CI −2.8 to 4.2). Patients in the surgical group reported lower functional impairment at 1 year (−3.7, 95% CI −7.4 to −0.1), but this difference was not sustained at the 2-year follow-up assessment (−1.1, 95% CI −5.2 to 2.9) (table 2 and figure 2).
36-Item Short-Form Health Survey
There was little evidence of a difference in quality of life between groups throughout the study. Patients in the surgical group tended to score better on the SF-36 physical subscale in the short term (6 weeks: −3.1, 95% CI −6.4 to 0.1), but the difference was minimal in the long term (2 years: −0.6, 95% CI −4.7 to 3.5) (table 2). Scores of the SF-36 mental subscale were similar in both groups in all follow-up assessments (table 2 and figure 3).
We found no evidence that surgical treatment, when compared with conservative treatment, reduced the severity of sciatica symptoms or improved the quality of life of patients with lumber disc herniation in the medium or long term. Pain was relieved more quickly in patients who received surgical treatment (evident at the 3-week follow-up), but the difference between groups was no longer present after 3 months. Patients in the surgical group did report less physical impairment at the 1-year follow-up, but not in previous or subsequent assessments. Surgery was not more effective for the treatment of neurogenic symptoms or the improvement of quality of life over the course of the study.
Faster improvement in pain symptoms with surgical treatment is a common finding in comparisons with conservative treatment in patients with lumbar disc herniation. Previous observational studies have also found that back pain is reduced more quickly with surgical treatment.12–14 ,35 Findings regarding neurogenic symptoms, physical function, and quality of life, however, are not as consistently reported by other observational studies. As opposed to our findings, previous observational studies have found benefits of surgical treatment in these outcomes at short-term and long-term follow-up. The discrepancy between our findings and those of previous studies may be due to differences in eligibility criteria and methods of outcome assessment, a more effective control intervention and a different approach to statistical analysis to control for confounding by indication.
Interestingly, results of our observational cohort conducted in a routine care setting more closely resemble those reported by previous RCTs.7 ,15–19 RCTs have also typically reported quicker pain reduction in patients who received surgery, but no clear benefit of surgery over conservative treatment at long-term assessments of neurogenic symptoms, physical function or quality of life. However, in the classic trial by Weber,15 the beneficial effect of surgical treatment lasted longer than in other trials; treatment effects of surgical and conservative treatment only became similar after 4 years of follow-up, and remained similar until the 10-year final follow-up.
The observational nature of our investigation limits our ability to interpret its findings. In observational clinical studies, results are likely to be influenced by confounding by indication. Patients with a worse prognosis at baseline are more likely to be allocated by physicians to surgical intervention, and indeed, this was the case in our study, in which patients in the surgical group showed a trend towards worse neurogenic symptoms at baseline (p=0.098). However, the methods we used for statistical analysis allowed us to mimic a randomised controlled experiment.29 This method of analysis, that is, inverse probability weighting, assumes that the probability of being allocated to surgical or conservative treatment depends mainly on the prognostic indicators we included in our analysis. Although this assumption may be inaccurate in some cases, our results are remarkably similar to those reported in previous RCTs. Surgical RCTs are commonly criticised for lack of generalisability because patients who agree to be randomised in these trials may not be representative of those seen in clinical practice. The results of the present investigation do not suffer from this limitation, because no randomisation took place. Moreover, a significant number of patients dropped out of our study due to loss to follow-up, especially by latter time points. We conducted multiple imputation as an attempt to include in our analysis patients with missing outcome data; however, no statistical technique is likely to completely solve the problem of missing data, and it is always better to have observed data as opposed to imputed data for all patients included in the analysis.
Surgical and conservative treatments had long-term beneficial effects on sciatica symptoms in patients with lumbar disc herniation. Compared with conservative treatment, surgical treatment relieved back pain faster, but no relevant clinical difference was observed after 3 months. Surgical treatment may thus be attractive to patients with debilitating pain symptoms who seek quick relief, or who did not experience satisfactory improvement with conservative treatment.
MG and BRdC contributed equally and share first authorship.
Contributors PH, HL, RT and MG conceived and designed the experiments; MG, ED, RT, HL and PH performed the experiments; BRdC, PJ and SR analysed the data; MG, ED, RT, HL, PH, BRdC, PJ and SR contributed to the writing of the manuscript.
Funding This study was funded by the Hugo and Elsa Isler Foundation, Aarau, Switzerland.
Competing interests PJ has received research grants to the institution from Astra Zeneca, Biotronik, Biosensors International, Eli Lilly and The Medicines Company, and serves as unpaid member of the steering group of trials funded by Astra Zeneca, Biotronik, Biosensors, St Jude Medical and The Medicines Company. The other authors report no conflict of interest.
Provenance and peer review Not commissioned; externally peer reviewed.
Data sharing statement No additional data are available.