Short communicationClinical trials and the response rate illusion
Introduction
There are various ways of comparing the performance of drug and placebo in clinical trials and systematic reviews of those trials. Outcome data for conditions that are assessed on continuous scales are typically presented as mean change scores (or sometimes as post treatment scores adjusted for baseline scores). Categorical results can also be calculated from continuous data, in the form of response rates or improvement rates (i.e., the percent of patients deemed to have responded or improved) and statistics derived from them including odds ratios, relative risk ratios, and number needed to treat. These categorical outcomes consist of the proportion of people who meet a (usually) predefined level of improvement or fall below a predefined threshold score on a continuous measure. They do not reflect natural categories, but are simply a way of dividing up a continuous distribution.
Ideally, these different methods of assessing outcome should produce similar conclusions, since they are derived from the same data. In practice, they can be divergent. This is particularly striking in reviews of antidepressant medication, where mean improvement scores indicate very small drug-placebo differences [1], [2]. whereas response rate data suggest that these differences are more substantial [1], [3]. The purpose of this article is to explore the reasons for the apparently divergent outcomes produced by comparisons of mean improvement and response rates and to consider their implications for the analysis and reportage of clinical trial data. Although our concern extends to all clinical trials in which data derived from continuous scales are reported as response rates, we illustrate the issue with data from the antidepressant literature.
Section snippets
Response rates and continuous distributions
Response rates depend on the criterion used to define a response, as well as on the magnitude of drug and placebo effects. When a response rate of 50% has been found, as has been reported in the published antidepressant trial data [3], it means that the criterion that had been chosen for defining a response has coincidentally turned out to be the median of the distribution of improvement scores. This is true regardless of the shape of that distribution. In normal distributions, it is also the
How the response rate illusion is produced
The response rate illusion is not due to response rate statistics, but rather to our interpretation of them. We think of a responder as a person who improves and a non-responder as a person who does not improve, but this is not necessarily accurate when response to treatment has been derived from continuous scores, rather than defined by natural categories (e.g., survival). A patient who is classified as a non-responder on the basis of a criterion of improvement on a continuous scale, may have
Conclusions
Some outcome data (e.g., death and pregnancy) can only be expressed in terms of response rates. Other outcomes do not fall into natural categories and can be assessed meaningfully with continuous scores. Imposing categories on such data is hazardous. It creates the impression of discrete patterns of response where the data does not suggest any, it obscures the arbitrary nature of criteria used to form the categories, and as we have shown, it can spuriously inflate the differences between groups
References (5)
- NICE; National Institute for Clinical Excellence, 2004; Vol....
- et al.
Prev Treat
(2002)
Cited by (69)
Toxicity profile of epidermal growth factor receptor tyrosine kinase inhibitors for patients with lung cancer: A systematic review and network meta-analysis
2021, Critical Reviews in Oncology/HematologyCitation Excerpt :Other exclusion criteria were: 1) trials in which EGFR-TKIs were used in a combination, maintenance, neoadjuvant, or adjuvant treatment strategy; 2) trials comparing EGFR-TKIs with other treatments like monoclonal antibodies, immunotherapies, and some pathway inhibitors; 3) trials comparing treatments unapproved by any Food and Drug Administration (FDA); and 4) trials whose safety outcomes subsequently published updated data in a mature or longer duration of follow-up. Placebo-controlled trials were excluded in the original analysis as they mainly recruit patients with a mild form of disease to meet ethical and safety requirements for regulatory approval (Kirsch and Moncrieff, 2007); however, these trials were added to a validation analysis. The primary outcome was systemic AEs reported as those of all-grade and grade ≥3, representing incidence and severity of toxicity, respectively; the secondary outcome was specific AEs (all-grade) including those from skin, gastrointestinal tract, lung, etc.
Network meta-analysis of antidepressants
2018, The LancetDepression: why drugs and electricity are not the answer
2022, Psychological MedicineMapping actions to improve access to medicines for mental disorders in low and middle income countries
2017, Epidemiology and Psychiatric Sciences