Minimal clinically important difference in means in vulnerable populations: challenges and solutions

Janet L Peacock; Jessica Lo; Judith R Rees; Odile Sauzet

doi:10.1136/bmjopen-2021-052338

Article Text

PDF

PDF +
Supplementary
Material

Research methods

Communication

Minimal clinically important difference in means in vulnerable populations: challenges and solutions

http://orcid.org/0000-0002-0310-2518Janet L Peacock1,
Jessica Lo2,
Judith R Rees1,
http://orcid.org/0000-0002-1029-8846Odile Sauzet3

¹Department of Epidemiology, Geisel School of Medicine at Dartmouth, Hanover, New Hampshire, USA
²Centre for Healthy Brain Ageing, University of New South Wales, Sydney, New South Wales, Australia
³Epidemiology and International Public Health, Bielefeld School of Public Health, Bielefeld University, Bielefeld, Germany

Correspondence to Professor Janet L Peacock; janet.peacock{at}dartmouth.edu

Abstract

Introduction and motivation Many health studies measure a continuous outcome and compare means between groups. Since means for biological data are often difficult to interpret clinically, it is common to dichotomise using a cut-point and present the ‘percentage abnormal’ alongside or in place of means. Examples include birthweight where ‘abnormal’ is defined as <2500 g (low birthweight), systolic blood pressure with abnormal defined as >140 mm Hg (high blood pressure) and lung function with varying definitions of the ‘limit of normal’. In vulnerable populations with low means, for example, birthweight in a population of preterm babies, a given difference in means between two groups will represent a larger difference in the percentage with low birthweight than in a general population of babies where most will be full term. Thus, in general, the difference in percentage of patients with abnormal values for a given difference in means varies according to the reference population’s mean value. This phenomenon leads to challenges in interpreting differences in means in vulnerable populations and in defining an outcome-specific minimal clinically important difference (MCID) in means since the proportion abnormal, which is useful in interpreting means, is not constant—it varies with the population mean. This has relevance for study power calculations and data analyses in vulnerable populations where a small observed difference in means may be difficult to interpret clinically and may be disregarded, even if associated with a relatively large difference in percentage abnormal which is clinically relevant.

Methods To address these issues, we suggest both difference in means and difference in percentage (proportion) abnormal are considered when choosing the MCID, and that both means and percentages abnormal are reported when analysing the data.

Conclusions We describe a distributional approach to analyse proportions classified as abnormal that avoids the usual loss of precision and power associated with dichotomisation.

statistics & research methods
epidemiology
public health

http://creativecommons.org/licenses/by-nc/4.0/

This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.

https://doi.org/10.1136/bmjopen-2021-052338

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

Strengths and limitations of this study

Addresses a challenging issue in study design and analysis.
Motivated by real study data and illustrated using hypothetical data to allow generalisation.
Describes a statistically robust methodological solution.
A formal review of dichotomisation was not included.
Facilitates the reporting of clinically meaningful results.

Introduction

When we design a new research study, such as a clinical trial, we usually calculate the sample size required to answer the key research questions with adequate precision and/or statistical power. When two groups are being compared, we normally consider what is the minimum difference between the groups that is clinically important. This minimum difference, sometimes referred to as the ‘minimal clinically important difference’ (MCID)1, is usually identified using clinical experience together with data from patient studies. Recommended MCIDs have been published for different outcomes in different settings such as pain scores,2 health-related quality of life,3 chronic obstructive pulmonary disease (COPD) outcomes,4 6 min walk distance5 and more. In this paper, we discuss the challenges in interpreting MCIDs which have been designed for a general population when they are applied to a vulnerable population.

Clinically meaningful effect size

Cohen suggested the use of a standardised effect size, calculated as the difference in means divided by the SD at baseline.6 He attached descriptors to different sizes of these quantities such that an effect size of 0.8 is ‘large’, 0.5 is ‘medium’ and 0.2 is ‘small’. These descriptors implicitly assume that MCID is the same for all populations, but this is not necessarily so, as we later illustrate.

When a study’s primary outcome is continuous and the clinically meaningful difference is uncertain, statisticians and researchers may express the difference to be detected as a multiple of the SD, as described above after Cohen. Hence studies sometimes set out to be able to detect, for example, a difference of 0.33 or 0.5 SD. This seems reasonable if there are no other data to inform the decisions, but behind this seemingly objective measure, the question arises of what is the meaning of a small difference? More importantly, we consider whether the clinical impact of a given difference is the same in different populations. A simple example is to consider the effect of maternal smoking in pregnancy which is associated with reduced mean birthweight of around 180 g.7 However, it is not straightforward to interpret this difference because in full-term pregnancies with mean birthweight of 3400 g, a mean reduction of 180g to 3220 g has much less clinical impact on overall pregnancy outcome than if the population were preterm babies with a mean of 2600 g reduced to 2420 g. If we think at the population level and consider a difference in means as a shift in mean,8 then translating this shift into the risk for an individual is not intuitive unless we define the percentages who are ‘abnormal’ using a clinically meaningful cut-point, such as here using birthweight <2500 g to define ‘abnormal’.

Motivating example: lung function z-scores

The phenomenon described above came to our attention while analysing and interpreting lung function z-score outcomes in children who had been born very prematurely and so from hereon we will use lung function z-scores to illustrate the problem introduced above. Z-scores are commonly used for standardising lung function measurements for age/sex/height and sometimes ethnicity. By standardising, a patient’s individual lung function measurement can be compared against a known expected value, usually in a general population. Z-scores are used in assessing an individual patient’s measurements to allow clinicians to identify abnormal values that might indicate the need for further intervention. However, in addition to their clinical use on an individual basis, z-scores are also used in research studies to compare groups, so that key demographic characteristics do not confound group-level comparisons. The simplest version of an individual’s z-score is:

where the expected value (ie, the population mean value) and SD refer to a general population with similar characteristics to the observed individual. Z-scores may be calculated using simple or complex regression modelling, for example, in standardising a child’s lung function measurement for their age, sex, height and ethnicity.9 In a sample of healthy patients, we expect their mean z-score to be close to the population mean of all individuals and so the sample mean z-score is expected to be 0 with SD equal to 1.

Hypothetical example: interpreting differences in z-scores

We conducted a brief descriptive review of the use of z-scores to see how researchers analysed and reported lung function z-scores. This indicated that where authors discussed the magnitude of group differences in mean z-scores, they tended to provide an estimate of the percentage with abnormal measurements using a clinically meaningful cut-point. Abnormal lung function z-score may be defined using one of the lower centiles of a general (normal) population, such as below 2.5th (z< -1.96) or below fifth centiles (z< -1.64).9 To illustrate these ideas, we will use the fifth centile in a general population (z< -1.64) so the cut-point is −1.64, meaning that someone with a lung function z-score less than −1.64 will be categorised as ‘abnormal’, sometimes known as ‘below the limit of normal’.

In figures 1–3, we consider three hypothetical z-score populations. Each is assumed to follow a Normal distribution with the same SD but with different means: Figure 1 mean=0 (‘general population’), figure 2 mean=−0.5 (‘moderately vulnerable population’), figure 3 mean=−1.0 (‘vulnerable population’). We see that the percentage of the population that is classed ‘abnormal’ (z< -1.64) varies with the population mean.

Figure 1

Hypothetical distribution showing the percentage abnormal (z< -1.64) with population mean=0.

Figure 2

Hypothetical distribution showing the percentage abnormal (z< -1.64) with population mean=−0.5.

Figure 3

Hypothetical distribution showing the percentage abnormal (z< -1.64) with population mean=−1.

We now consider lung function z-scores in three populations that are Normally distributed with means 0 (general population), −0.5 (moderately vulnerable) and −1 (vulnerable) and compare each with another three populations that have means that are 0.25 lower (figures 4–6 and table 1). The fifth centile (z< -1.64) is used to define ‘abnormal’ and the difference is expressed as percentage points. We see that the percentage abnormal is affected by the means of the pairs of populations even though the difference in means is 0.25 in each case. Where the pairs of means are 0 and −0.25, the difference in the percentage abnormal is 3.2 percentage points (figure 4), for populations with means −0.5 and −0.75, the difference is 6.0 percentage points (figure 5), and for means −1 and −1.25 it rises to 8.7 percentage points (figure 6 and table 1). In other words, the same effect size, 0.25, has different consequences in different populations, so that a small difference in means has a greater impact in vulnerable populations. These results are tabulated in table 1.

View this table:

Table 1

Illustration of the effect of a small difference in mean (0.25 z-scores) in different populations using the fifth centile for a general population to define abnormal

Figure 4

Two hypothetical distributions with means 0 and −0.25 showing the difference in percentage abnormal (z< -1.64) between the two populations.

Figure 5

Two hypothetical distributions with means −0.5 and −0.75 showing the difference in percentage abnormal (z< -1.64) between the two populations.

Figure 6

Two hypothetical distributions with means −1 and −1.25 showing the difference in percentage abnormal (z< -1.64) between the two populations.

Real study example: United Kingdom Oscillation Study (UKOS)

The UKOS was a multicentre randomised controlled trial that compared outcomes in extremely premature babies allocated to either conventional mechanical ventilation (CV) or to high frequency oscillation (HFO) at birth. The trial found no evidence for any differences in respiratory or neurological outcome at hospital discharge or at follow-up at age 1 year or age 2 years.10–12 The ex-preterm children were followed up at age 11–14 years to explore possible effects of ventilation at birth on lung function in adolescence; a small but statistically significant difference was found in mean lung function z-score for the primary outcome, forced expiratory flow at 75% of forced vital capacity, and for the majority of secondary lung function outcomes, all in favour of HFO.13 This finding was in keeping with limited non-randomised evidence but the small effect size alongside a difference in mean z-scores was challenging to interpret clinically. We, therefore, calculated the difference in the proportion of patients with abnormal values using the fifth centile as the lower limit of normal (z< -1.64), which showed that the mean difference of 0.23 z-scores corresponds to a difference of 8.2 percentage points in the percentage of children with abnormal lung function (table 2). This relatively large difference arises because the populations were vulnerable prematurely-born children with mean lung function z-scores around −1.0. The clinical relevance of the percentages abnormal is more apparent than the means of lung function scores, since abnormal lung function in childhood is associated with poor respiratory health in adulthood.

View this table:

Table 2

Interpretation of results using the two approaches for high frequency oscillation (HFO) and conventional mechanical ventilation (CV)

Tools to help

We have shown with hypothetical and real data that a given effect size expressed as a difference in means, has different clinical implications in vulnerable populations compared with healthy populations. It is therefore helpful to report not only the difference in means but also the difference in the proportions (or percentages) of patients with abnormal values. In addition, it follows that when designing studies with two groups and a continuous outcome, it is important to consider the difference in the proportion abnormal as well as the difference in means when deciding on the appropriate MCID. In the next sections, we describe how to calculate the difference in proportions efficiently and how to incorporate these ideas into the calculation of sample size for a new study.

Calculation of the difference in proportions with abnormal values

In the example above (table 2), we did not calculate the proportion with abnormal measurements by dichotomisation using the usual formula, number abnormal divided by the total number. Instead, we have used the distributional approach14–18 to estimate the proportions and their difference. This methodology allows the difference in the proportions of individuals with abnormal measurements to be estimated with the same precision as the original analysis conducted with the mean z-scores, but without the usual loss in power associated with dichotomisation. A fuller description of the distributional approach methodology is given in online supplemental S1 with a full list of associated publications. Software is available in Stata19 and R (https://cran.r-project.org/web/packages/distdichoR/index.html).

Supplemental material

[bmjopen-2021-052338supp001.pdf]

Calculation of sample sizes for study design

When deciding on the sample size for a new study, the issues described in this paper are particularly important if we are studying vulnerable patient populations. We need to ensure that the sample size is adequate to detect the minimum meaningful difference in the proportions of patients with abnormal values (eg, below fifth centile). In brief the following steps are needed:

Assume that the ‘control’ or normative population mean and SD are known and that the distribution is approximately Normal or can be transformed to Normal (as commonly assumed in many power calculations).
Identify the MCID for proportions, ‘MCID_prop’, from the proportion abnormal in the control population and the minimum meaningful difference as described above.
Compute the equivalent difference in means given the anticipated control population mean and SD, ‘MCID_means’ using standard formulae relating the mean and SD to the proportion beyond a given cut-point in a Normal distribution.
Calculate the target sample size based on the MCID_means.

In online supplemental, we have provided illustrative tables for z-scores (S2–S4) that allow researchers to estimate differences in the percentages with abnormal values for a range of mean population z-score values. In this way, researchers designing studies can explore the potential effect sizes for percentages at risk associated with a range of differences in mean z-scores and so make more informed decisions at the study design stage or when analysing an existing dataset where the sample size is fixed. By following this approach, it should be less likely that real differences will be missed or dismissed as clinically unimportant. The principles and calculations can easily be applied to other continuous variables where a cut-point can be used to define an abnormal value.

Discussion

In this paper, we have discussed the interpretation of difference in means in terms of the difference in percentages of individuals with abnormal values and we have shown that specific differences in means may have a different clinical importance depending on the patient population. There is an implicit recognition of this in Make et al’s paper where they give specific MCIDs for various outcomes relevant to patients with COPD.4 They suggest that 100 mL is a reasonable MCID for FEV₁ They do not specifically discuss FEV z-scores. Similarly Jones et al20 discuss MCIDs for a range of COPD outcomes including lung function and while they do not discuss z-scores either, they acknowledge that baseline FEVR₁R in a patient affects the degree of possible response so that a cross-the-board absolute value for MCID may not be appropriate. A Health Technology Review and associated papers helpfully explored in depth the methods used to specify the target difference for randomised trials but they did not discuss whether the MCID is constant across all populations.21–23

We have estimated the percentage (proportion) of patients with abnormal measurements using the distributional approach, which has the advantage that the whole distribution of data is used. Therefore, statistical power is not lost as it usually is when we dichotomise,14–18 and the comparison of means and percentages are estimated with the same precision. Thus, this dual approach addresses the difficulty in interpreting means, uses a method that is statistically robust, and provides estimates that are clinically meaningful.

We emphasise that we are not advocating replacing means with percentages or proportions alone. This would discard a substantial amount of information. Means must be computed, reported and interpreted, and where helpful, alongside the equivalent percentages of patients with abnormal values. Means, difference in means and difference in percentages should be presented with confidence intervals.

Using the distributional approach has a further benefit—it avoids the difficulty of multiple testing when both differences in means and differences in percentages are reported as the main outcome. Since the difference in percentages is a function of the difference in means, the two estimated differences provide equivalent inferences—they are alternative ways of presenting the same data in the same way, just as data from a 2×2 Chi-squared test can be presented as a difference in proportions and/or a ratio of proportions.

Limitations of our paper include that we have presented only one real-world example to keep the report short. While the example given is easily generalisable, further examples would be useful and we are planning future work that will illustrate the specific impact of these issues in a range of real-world situations. We also note that differences in means may be clinically meaningful such as when a study observes a large difference in means that has clear clinical importance.

In conclusion, we recommend that new studies using a continuous outcome where a cut-point may be used to define an abnormal measurement consider carefully what size of difference would be clinically meaningful in their population and to calculate both means and the percentage with abnormal values using an appropriate cut-point. This is particularly important in vulnerable populations. We recommend using the distributional approach to calculate the difference in percentages, since it retains statistical power and avoid issues with multiple testing. The researcher gets two-for-the-price-of-one.

Ethics statements

Patient consent for publication

References

↵
1. Jaeschke R,
2. Singer J,
3. Guyatt GH
. Measurement of health status. ascertaining the minimal clinically important difference. Control Clin Trials 1989;10:407–15.doi:10.1016/0197-2456(89)90005-6pmid:http://www.ncbi.nlm.nih.gov/pubmed/2691207
OpenUrl CrossRef PubMed Web of Science
↵
1. Dworkin RH,
2. Turk DC,
3. Wyrwich KW, et al
. Interpreting the clinical importance of treatment outcomes in chronic pain clinical trials: IMMPACT recommendations. J Pain 2008;9:105–21.doi:10.1016/j.jpain.2007.09.005pmid:http://www.ncbi.nlm.nih.gov/pubmed/18055266
OpenUrl CrossRef PubMed Web of Science
↵
1. Norman GR,
2. Sloan JA,
3. Wyrwich KW
. Interpretation of changes in health-related quality of life: the remarkable universality of half a standard deviation. Med Care 2003;41:582–92.doi:10.1097/01.MLR.0000062554.74615.4Cpmid:http://www.ncbi.nlm.nih.gov/pubmed/12719681
OpenUrl CrossRef PubMed Web of Science
↵
1. Make B,
2. Casaburi R,
3. Leidy NK
. Interpreting results from clinical trials: understanding minimal clinically important differences in COPD outcomes. COPD 2005;2:1–5.doi:10.1081/COPD-200051363pmid:http://www.ncbi.nlm.nih.gov/pubmed/17136955
OpenUrl CrossRef PubMed
↵
1. Bohannon RW,
2. Crouch R
. Minimal clinically important difference for change in 6-minute walk test distance of adults with pathology: a systematic review. J Eval Clin Pract 2017;23:377–81.doi:10.1111/jep.12629pmid:http://www.ncbi.nlm.nih.gov/pubmed/27592691
OpenUrl CrossRef PubMed
↵
1. Cohen J
. Statistical power analysis for the behavioral sciences. 2nd ed. Hillsdale, N.J: L. Erlbaum Associates, 1988.
↵
1. Brooke OG,
2. Anderson HR,
3. Bland JM, et al
. Effects on birth weight of smoking, alcohol, caffeine, socioeconomic factors, and psychosocial stress. BMJ 1989;298:795–801.doi:10.1136/bmj.298.6676.795pmid:http://www.ncbi.nlm.nih.gov/pubmed/2496859
OpenUrl Abstract/FREE Full Text
↵
1. Rose G
. Sick individuals and sick populations. Int J Epidemiol 1985;14:32–8.doi:10.1093/ije/14.1.32pmid:http://www.ncbi.nlm.nih.gov/pubmed/3872850
OpenUrl CrossRef PubMed Web of Science
↵
1. Quanjer PH,
2. Stanojevic S,
3. Cole TJ, et al
. Multi-Ethnic reference values for spirometry for the 3-95-yr age range: the global lung function 2012 equations. Eur Respir J 2012;40:1324–43.doi:10.1183/09031936.00080312pmid:http://www.ncbi.nlm.nih.gov/pubmed/22743675
OpenUrl Abstract/FREE Full Text
↵
1. Johnson AH,
2. Peacock JL,
3. Greenough A, et al
. High-Frequency oscillatory ventilation for the prevention of chronic lung disease of prematurity. N Engl J Med 2002;347:633–42.doi:10.1056/NEJMoa020432pmid:http://www.ncbi.nlm.nih.gov/pubmed/12200550
OpenUrl CrossRef PubMed Web of Science
↵
1. Thomas MR,
2. Rafferty GF,
3. Limb ES, et al
. Pulmonary function at follow-up of very preterm infants from the United Kingdom oscillation study. Am J Respir Crit Care Med 2004;169:868–72.doi:10.1164/rccm.200310-1425OCpmid:http://www.ncbi.nlm.nih.gov/pubmed/14693671
OpenUrl CrossRef PubMed Web of Science
↵
1. Marlow N,
2. Greenough A,
3. Peacock JL, et al
. Randomised trial of high frequency oscillatory ventilation or conventional ventilation in babies of gestational age 28 weeks or less: respiratory and neurological outcomes at 2 years. Arch Dis Child Fetal Neonatal Ed 2006;91:F320–6.doi:10.1136/adc.2005.079632pmid:http://www.ncbi.nlm.nih.gov/pubmed/16690640
OpenUrl Abstract/FREE Full Text
↵
1. Zivanovic S,
2. Peacock J,
3. Alcazar-Paris M, et al
. Late outcomes of a randomized trial of high-frequency oscillation in neonates. N Engl J Med 2014;370:1121–30.doi:10.1056/NEJMoa1309220
OpenUrl CrossRef PubMed
↵
1. Peacock JL,
2. Sauzet O,
3. Ewings SM, et al
. Dichotomising continuous data while retaining statistical power using a distributional approach. Stat Med 2012;31:3089–103.doi:10.1002/sim.5354
OpenUrl CrossRef PubMed
↵
1. Sauzet O,
2. Ofuya M,
3. Peacock JL
. Dichotomisation using a distributional approach when the outcome is skewed. BMC Med Res Methodol 2015;15:40. doi:10.1186/s12874-015-0028-8
↵
1. Sauzet O,
2. Peacock JL
. Estimating dichotomised outcomes in two groups with unequal variances: a distributional approach. Stat Med 2014;33:4547–59.doi:10.1002/sim.6255
OpenUrl
↵
1. Sauzet O,
2. Breckenkamp J,
3. Borde T, et al
. A distributional approach to obtain adjusted comparisons of proportions of a population at risk. Emerg Themes Epidemiol 2016;13:8. doi:10.1186/s12982-016-0050-2
↵
1. Ofuya M,
2. Sauzet O,
3. Peacock JL
. Dichotomisation of a continuous outcome and effect on meta-analyses: illustration of the distributional approach using the outcome birthweight. Syst Rev 2014;3:63. doi:10.1186/2046-4053-3-63
↵
1. Sauzet O,
2. Kleine M
. Distributional estimates for the comparison of proportions of a Dichotomized continuous outcome. Stata J 2016;16:880–99.doi:10.1177/1536867X1601600405
OpenUrl
↵
1. Jones PW,
2. Beeh KM,
3. Chapman KR, et al
. Minimal clinically important differences in pharmacological trials. Am J Respir Crit Care Med 2014;189:250–5.doi:10.1164/rccm.201310-1863PP
OpenUrl CrossRef PubMed Web of Science
↵
1. Cook J,
2. Hislop J,
3. Adewuyi T, et al
. Assessing methods to specify the target difference for a randomised controlled trial: delta (difference elicitation in trials) review. Health Technol Assess 2014;18.doi:10.3310/hta18280
↵
1. Cook JA,
2. Hislop J,
3. Altman DG, et al
. Specifying the target difference in the primary outcome for a randomised controlled trial: guidance for researchers. Trials 2015;16:12. doi:10.1186/s13063-014-0526-8
↵
1. Cook JA,
2. Julious SA,
3. Sones W, et al
. Delta2 guidance on choosing the target difference and undertaking and reporting the sample size calculation for a randomised controlled trial. Trials 2018;19:606. doi:10.1186/s13063-018-2884-0

Supplementary materials

Supplementary Data

This web only file has been produced by the BMJ Publishing Group from an electronic file supplied by the author(s) and has not been edited for content.

Data supplement 1

Footnotes

Contributors JLP and OS conceived this study and initiated a discussion with the other authors. JL conducted the brief descriptive review and contributed to the analysis. All authors, JLP, JL, JRR and OS, contributed to the interpretation of the results and to the drafting of the manuscript. All authors agreed the final submission.
Funding The authors have not declared a specific grant for this research from any funding agency in the public, commercial or not-for-profit sectors.
Competing interests None declared.
Provenance and peer review Not commissioned; externally peer reviewed.
Supplemental material This content has been supplied by the author(s). It has not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been peer-reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and responsibility arising from any reliance placed on the content. Where the content includes any translated material, BMJ does not warrant the accuracy and reliability of the translations (including but not limited to local regulations, clinical guidelines, terminology, drug names and drug dosages), and is not responsible for any error and/or omissions arising from translation and adaptation or otherwise.

[1] ↵
Jaeschke R,
Singer J,
Guyatt GH
. Measurement of health status. ascertaining the minimal clinically important difference. Control Clin Trials 1989;10:407–15.doi:10.1016/0197-2456(89)90005-6pmid:http://www.ncbi.nlm.nih.gov/pubmed/2691207
OpenUrl CrossRef PubMed Web of Science

[2] Jaeschke R,

[3] Singer J,

[4] Guyatt GH

[5] ↵
Dworkin RH,
Turk DC,
Wyrwich KW, et al
. Interpreting the clinical importance of treatment outcomes in chronic pain clinical trials: IMMPACT recommendations. J Pain 2008;9:105–21.doi:10.1016/j.jpain.2007.09.005pmid:http://www.ncbi.nlm.nih.gov/pubmed/18055266
OpenUrl CrossRef PubMed Web of Science

[6] Dworkin RH,

[7] Turk DC,

[8] Wyrwich KW, et al

[9] ↵
Norman GR,
Sloan JA,
Wyrwich KW
. Interpretation of changes in health-related quality of life: the remarkable universality of half a standard deviation. Med Care 2003;41:582–92.doi:10.1097/01.MLR.0000062554.74615.4Cpmid:http://www.ncbi.nlm.nih.gov/pubmed/12719681
OpenUrl CrossRef PubMed Web of Science

[10] Norman GR,

[11] Sloan JA,

[12] Wyrwich KW

[13] ↵
Make B,
Casaburi R,
Leidy NK
. Interpreting results from clinical trials: understanding minimal clinically important differences in COPD outcomes. COPD 2005;2:1–5.doi:10.1081/COPD-200051363pmid:http://www.ncbi.nlm.nih.gov/pubmed/17136955
OpenUrl CrossRef PubMed

[14] Make B,

[15] Casaburi R,

[16] Leidy NK

[17] ↵
Bohannon RW,
Crouch R
. Minimal clinically important difference for change in 6-minute walk test distance of adults with pathology: a systematic review. J Eval Clin Pract 2017;23:377–81.doi:10.1111/jep.12629pmid:http://www.ncbi.nlm.nih.gov/pubmed/27592691
OpenUrl CrossRef PubMed

[18] Bohannon RW,

[19] Crouch R

[20] ↵
Cohen J
. Statistical power analysis for the behavioral sciences. 2nd ed. Hillsdale, N.J: L. Erlbaum Associates, 1988.

[21] Cohen J

[22] ↵
Brooke OG,
Anderson HR,
Bland JM, et al
. Effects on birth weight of smoking, alcohol, caffeine, socioeconomic factors, and psychosocial stress. BMJ 1989;298:795–801.doi:10.1136/bmj.298.6676.795pmid:http://www.ncbi.nlm.nih.gov/pubmed/2496859
OpenUrl Abstract/FREE Full Text

[23] Brooke OG,

[24] Anderson HR,

[25] Bland JM, et al

[26] ↵
Rose G
. Sick individuals and sick populations. Int J Epidemiol 1985;14:32–8.doi:10.1093/ije/14.1.32pmid:http://www.ncbi.nlm.nih.gov/pubmed/3872850
OpenUrl CrossRef PubMed Web of Science

[27] Rose G

[28] ↵
Quanjer PH,
Stanojevic S,
Cole TJ, et al
. Multi-Ethnic reference values for spirometry for the 3-95-yr age range: the global lung function 2012 equations. Eur Respir J 2012;40:1324–43.doi:10.1183/09031936.00080312pmid:http://www.ncbi.nlm.nih.gov/pubmed/22743675
OpenUrl Abstract/FREE Full Text

[29] Quanjer PH,

[30] Stanojevic S,

[31] Cole TJ, et al

[32] ↵
Johnson AH,
Peacock JL,
Greenough A, et al
. High-Frequency oscillatory ventilation for the prevention of chronic lung disease of prematurity. N Engl J Med 2002;347:633–42.doi:10.1056/NEJMoa020432pmid:http://www.ncbi.nlm.nih.gov/pubmed/12200550
OpenUrl CrossRef PubMed Web of Science

[33] Johnson AH,

[34] Peacock JL,

[35] Greenough A, et al

[36] ↵
Thomas MR,
Rafferty GF,
Limb ES, et al
. Pulmonary function at follow-up of very preterm infants from the United Kingdom oscillation study. Am J Respir Crit Care Med 2004;169:868–72.doi:10.1164/rccm.200310-1425OCpmid:http://www.ncbi.nlm.nih.gov/pubmed/14693671
OpenUrl CrossRef PubMed Web of Science

[37] Thomas MR,

[38] Rafferty GF,

[39] Limb ES, et al

[40] ↵
Marlow N,
Greenough A,
Peacock JL, et al
. Randomised trial of high frequency oscillatory ventilation or conventional ventilation in babies of gestational age 28 weeks or less: respiratory and neurological outcomes at 2 years. Arch Dis Child Fetal Neonatal Ed 2006;91:F320–6.doi:10.1136/adc.2005.079632pmid:http://www.ncbi.nlm.nih.gov/pubmed/16690640
OpenUrl Abstract/FREE Full Text

[41] Marlow N,

[42] Greenough A,

[43] Peacock JL, et al

[44] ↵
Zivanovic S,
Peacock J,
Alcazar-Paris M, et al
. Late outcomes of a randomized trial of high-frequency oscillation in neonates. N Engl J Med 2014;370:1121–30.doi:10.1056/NEJMoa1309220
OpenUrl CrossRef PubMed

[45] Zivanovic S,

[46] Peacock J,

[47] Alcazar-Paris M, et al

[48] ↵
Peacock JL,
Sauzet O,
Ewings SM, et al
. Dichotomising continuous data while retaining statistical power using a distributional approach. Stat Med 2012;31:3089–103.doi:10.1002/sim.5354
OpenUrl CrossRef PubMed

[49] Peacock JL,

[50] Sauzet O,

[51] Ewings SM, et al

[52] ↵
Sauzet O,
Ofuya M,
Peacock JL
. Dichotomisation using a distributional approach when the outcome is skewed. BMC Med Res Methodol 2015;15:40. doi:10.1186/s12874-015-0028-8

[53] Sauzet O,

[54] Ofuya M,

[55] Peacock JL

[56] ↵
Sauzet O,
Peacock JL
. Estimating dichotomised outcomes in two groups with unequal variances: a distributional approach. Stat Med 2014;33:4547–59.doi:10.1002/sim.6255
OpenUrl

[57] Sauzet O,

[58] Peacock JL

[59] ↵
Sauzet O,
Breckenkamp J,
Borde T, et al
. A distributional approach to obtain adjusted comparisons of proportions of a population at risk. Emerg Themes Epidemiol 2016;13:8. doi:10.1186/s12982-016-0050-2

[60] Sauzet O,

[61] Breckenkamp J,

[62] Borde T, et al

[63] ↵
Ofuya M,
Sauzet O,
Peacock JL
. Dichotomisation of a continuous outcome and effect on meta-analyses: illustration of the distributional approach using the outcome birthweight. Syst Rev 2014;3:63. doi:10.1186/2046-4053-3-63

[64] Ofuya M,

[65] Sauzet O,

[66] Peacock JL

[67] ↵
Sauzet O,
Kleine M
. Distributional estimates for the comparison of proportions of a Dichotomized continuous outcome. Stata J 2016;16:880–99.doi:10.1177/1536867X1601600405
OpenUrl

[68] Sauzet O,

[69] Kleine M

[70] ↵
Jones PW,
Beeh KM,
Chapman KR, et al
. Minimal clinically important differences in pharmacological trials. Am J Respir Crit Care Med 2014;189:250–5.doi:10.1164/rccm.201310-1863PP
OpenUrl CrossRef PubMed Web of Science

[71] Jones PW,

[72] Beeh KM,

[73] Chapman KR, et al

[74] ↵
Cook J,
Hislop J,
Adewuyi T, et al
. Assessing methods to specify the target difference for a randomised controlled trial: delta (difference elicitation in trials) review. Health Technol Assess 2014;18.doi:10.3310/hta18280

[75] Cook J,

[76] Hislop J,

[77] Adewuyi T, et al

[78] ↵
Cook JA,
Hislop J,
Altman DG, et al
. Specifying the target difference in the primary outcome for a randomised controlled trial: guidance for researchers. Trials 2015;16:12. doi:10.1186/s13063-014-0526-8

[79] Cook JA,

[80] Hislop J,

[81] Altman DG, et al

[82] ↵
Cook JA,
Julious SA,
Sones W, et al
. Delta2 guidance on choosing the target difference and undertaking and reporting the sample size calculation for a randomised controlled trial. Trials 2018;19:606. doi:10.1186/s13063-018-2884-0

[83] Cook JA,

[84] Julious SA,

[85] Sones W, et al

Log in using your username and password

Main menu

Log in using your username and password

You are here

Abstract

Statistics from Altmetric.com

Request Permissions

Strengths and limitations of this study

Introduction

Clinically meaningful effect size

Motivating example: lung function z-scores

Hypothetical example: interpreting differences in z-scores

Real study example: United Kingdom Oscillation Study (UKOS)

Tools to help

Calculation of the difference in proportions with abnormal values

Supplemental material

Calculation of sample sizes for study design

Discussion

Ethics statements

Patient consent for publication

References

Supplementary materials

Supplementary Data

Footnotes

Read the full text or download the PDF:

Log in using your username and password