Relative risks and confidence intervals were easily computed indirectly from multivariable logistic regression

doi:10.1016/j.jclinepi.2006.12.001

Journal of Clinical Epidemiology

Volume 60, Issue 9, September 2007, Pages 874-882

Abstract

Objective

To assess alternative statistical methods for estimating relative risks and their confidence intervals from multivariable binary regression when outcomes are common.

Study Design and Setting

We performed simulations on two hypothetical groups of patients in a single-center study, either randomized or cohort, and reanalyzed a published observational study. Outcomes of interest were the bias of relative risk estimates, coverage of 95% confidence intervals, and the Akaike information criterion.

Results

According to simulations, a commonly used method of computing confidence intervals for relative risk substantially overstates statistical significance in typical applications when outcomes are common. Generalized linear models other than logistic regression sometimes failed to converge, or produced estimated risks that exceeded 1.0. Conditional or marginal standardization using logistic regression and bootstrap resampling estimated risks within the [0,1] bounds and relative risks with appropriate confidence intervals.

Conclusion

Especially when outcomes are common, relative risks and confidence intervals are easily computed indirectly from multivariable logistic regression. Log-linear regression models, by contrast, are problematic when outcomes are common.

Introduction

Relative risk (RR) is a common measure of the effect of treatment or exposure on outcome in cohort studies. Estimating this simple ratio of the disease risk among the treated (or exposed) to that among the untreated (or unexposed), together with an appropriate confidence interval, is a routine application of Mantel–Haenszel methods [1], provided the investigator needs to adjust for only one or two categorical factors. More commonly, however, the study calls for simultaneous adjustment of several factors, some of which are continuous, via multivariable regression modeling. As described in texts on biomedical statistics [2], logistic regression for binary outcome data produces an adjusted odds ratio (OR), not a relative risk. Although the OR has attractive mathematical properties, clinicians rarely think in terms of odds of disease or the OR as a measure of effect [3], [4]. If the outcome event is rare (risk under 10%) and the OR is small, the OR approximates the relative risk. But with more common outcomes, the OR is well known to be more extreme (farther from 1.0) than the relative risk for the same data [5], [6]. The controversy generated by the report of Schulman et al. on the effects of race and sex on physician referrals exemplifies this distortion [7], [8].

Some authors [9] have converted ORs to relative risks by the simple relationship RR = OR/([1 − p₀] + [p₀ × OR]), where the OR came from the estimate of the logistic regression model, while the value of the baseline risk (p₀) was estimated as the unadjusted risk in the reference group (in that case a hospital). The authors estimated the upper and lower bounds for the confidence interval by substituting for OR the upper and lower confidence bounds for the OR from the logistic regression. This method of estimating a confidence interval, known as the “method of substitution”, has been applied to other measures of association [10]. Subsequent criticism suggested, however, that the proposed confidence interval for relative risk would be too narrow because of its failure to account for variability in the baseline risk [11], [12]. Others arrived at the same conclusion independently in similar contexts [13].
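The conversion and the method of substitution described above can be sketched in a few lines of Python. This is only an illustration: the function names and the numeric inputs (baseline risk, OR, and OR confidence bounds) are hypothetical and do not come from any study discussed here.

```python
def zhang_yu_rr(odds_ratio: float, p0: float) -> float:
    """Convert an odds ratio to a relative risk: RR = OR / ((1 - p0) + p0 * OR)."""
    return odds_ratio / ((1.0 - p0) + p0 * odds_ratio)


def substitution_limits(or_lower: float, or_upper: float, p0: float) -> tuple:
    """Method of substitution: plug the OR confidence bounds into the same formula.

    p0 is treated as a fixed constant here, which is exactly the simplification
    criticized in the text (variability in the baseline risk is ignored).
    """
    return zhang_yu_rr(or_lower, p0), zhang_yu_rr(or_upper, p0)


# Hypothetical inputs for illustration only.
p0 = 0.30                 # unadjusted risk in the reference group (assumed)
or_hat = 2.5              # adjusted OR from a logistic model (assumed)
or_lo, or_hi = 1.5, 4.0   # 95% confidence bounds for the OR (assumed)

rr_hat = zhang_yu_rr(or_hat, p0)
rr_lo, rr_hi = substitution_limits(or_lo, or_hi, p0)
print(f"RR = {rr_hat:.2f}, substituted limits = ({rr_lo:.2f}, {rr_hi:.2f})")
```

In an unadjusted two-group comparison the formula recovers the relative risk exactly; for example, risks of 0.2 and 0.4 give an OR of about 2.67 and a converted RR of 2.0. The limits, however, inherit no uncertainty from p₀.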

Estimating relative risk is also possible by means of alternative generalized linear models.

One proposed option, the log-binomial model, replaces the logit link in logistic regression with a log link but maintains the specification of a binomial distribution [12], [14]. Although this functional form estimates relative risk directly by simple exponentiation of the regression coefficient for the exposure of interest, the log link permits estimates of risk within the broader bounds [0,∞), whereas probabilities must fall within the bounds [0,1]. Because of this mismatch between the bounds of the model and the allowable outcome, Wacholder [15] proposed constraining the fitting algorithm to respect the [0,1] bounds. His algorithm has been incorporated into the Stata statistical package in the function “binreg” (Stata Corp., College Station, TX, 2001). Owing to the known problem of convergence with the log-binomial model, several authors [16], [17], [18] recently proposed Poisson regression, a generalized linear model with a log link and a Poisson distribution, combined with the sandwich variance estimator, to produce confidence intervals with correct coverage. The issue of expected risks exceeding the [0,1] bounds remains, however. Greenland reviewed these articles in the broader context of the literature on standardization of estimates [19].
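The bounds problem can be made concrete with a small numeric sketch. The coefficients below are hypothetical (a baseline risk of 0.30 and two binary factors that each double the risk, or the odds); they are chosen only to show how a log link can return a "risk" above 1 while a logit link cannot.

```python
import math


def risk_log_link(intercept, coefs, covariates):
    """Predicted risk under a log link: exp(linear predictor). Unbounded above."""
    eta = intercept + sum(b * x for b, x in zip(coefs, covariates))
    return math.exp(eta)


def risk_logit_link(intercept, coefs, covariates):
    """Predicted risk under a logit link: always strictly between 0 and 1."""
    eta = intercept + sum(b * x for b, x in zip(coefs, covariates))
    return 1.0 / (1.0 + math.exp(-eta))


# Hypothetical model: baseline risk 0.30, two binary factors with ratio 2.0 each.
profile = [1, 1]  # patient with both factors present
log_coefs = [math.log(2.0), math.log(2.0)]    # interpreted as log relative risks
logit_coefs = [math.log(2.0), math.log(2.0)]  # interpreted as log odds ratios

p_log = risk_log_link(math.log(0.30), log_coefs, profile)
p_logit = risk_logit_link(math.log(0.30 / 0.70), logit_coefs, profile)

print(f"log link:   {p_log:.3f}  (0.30 * 2 * 2, exceeds 1)")
print(f"logit link: {p_logit:.3f}  (stays inside the unit interval)")
```

The same arithmetic applies to Poisson regression with a log link, which is why changing the variance estimator does not remove the out-of-bounds predictions.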

In spite of the shortcomings of simplistic methods, they continue to appear in the literature. As of October 2005, almost 200 articles had cited and used the method of substitution outlined in 1998 by Zhang and Yu [9]. These applications, including those published in major medical journals [20], [21], involved common outcomes. The log-linear model with sandwich variance estimates outlined by Zou in 2004 has also begun to gain use, again in leading medical journals [22], for applications with common outcomes. Nevertheless, both methods, as we shall point out, suffer from theoretical as well as methodological problems.

We first demonstrate that confidence intervals generated from logistic regression and the method of substitution, at least as promulgated by Zhang and Yu, exhibit poor coverage for the intended applications of common outcomes. Then we explain why the log-binomial and the log-linear (Poisson) model options might also fail when outcomes are common. Finally, we build upon methodological literature on conditional and marginal standardization to demonstrate several options for using logistic regression to estimate relative risk.
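To make the idea of standardization concrete, the following Python sketch computes a conditional and a marginal relative risk from an already fitted logistic model. It rests on stated assumptions: the coefficients, the single covariate (age), and the small list of ages are all hypothetical, and the model-fitting step and the bootstrap resampling used for confidence intervals are omitted.

```python
import math


def expit(eta: float) -> float:
    """Inverse logit: maps a linear predictor to a risk in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-eta))


def predicted_risk(coefs, exposed: int, age: float) -> float:
    """Risk from a logistic model with intercept, exposure, and age terms."""
    b0, b_exposure, b_age = coefs
    return expit(b0 + b_exposure * exposed + b_age * age)


def conditional_rr(coefs, age: float) -> float:
    """Conditional standardization: RR at one fixed covariate profile."""
    return predicted_risk(coefs, 1, age) / predicted_risk(coefs, 0, age)


def marginal_rr(coefs, ages) -> float:
    """Marginal standardization: average predicted risks over the observed
    covariate values with everyone set to exposed, then to unexposed."""
    risk_exposed = sum(predicted_risk(coefs, 1, a) for a in ages) / len(ages)
    risk_unexposed = sum(predicted_risk(coefs, 0, a) for a in ages) / len(ages)
    return risk_exposed / risk_unexposed


# Hypothetical fitted coefficients (intercept, log OR for exposure, log OR per year).
coefs = (-3.0, 0.9, 0.04)
ages = [45, 52, 60, 67, 71, 80]  # hypothetical covariate values in the sample

print("odds ratio:           ", round(math.exp(coefs[1]), 3))
print("conditional RR, age 60:", round(conditional_rr(coefs, 60), 3))
print("marginal RR:          ", round(marginal_rr(coefs, ages), 3))
```

Because every predicted risk passes through the inverse logit, both ratios are built from risks inside the [0,1] bounds. For a positive exposure coefficient, both relative risks lie between 1 and the odds ratio.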

Section snippets

Simulations

We simulated data sets with known values for baseline risks (0.1, 0.2, and 0.3) and relative risk (1.25, 1.5, 1.75, and 2.0), and with 100, 500, and 1000 hypothetical patients split equally into two hypothetical groups, unexposed and exposed. Additional simulations assumed two unbalanced data sets: 100 unexposed and 400 exposed patients, or 100 exposed and 400 unexposed patients. The program next simulated the occurrence of disease at an expected rate among the unexposed (untreated) patients

Method of substitution

In our simulations, the method of substitution advocated by Zhang and Yu generally produced inappropriately narrow 95% confidence intervals for relative risk (Table 1). Even for a low baseline risk (0.2) and a modest relative risk (2.0), the confidence intervals intended to have 95% coverage actually produced less than 90% coverage. For a given relative risk, coverage of the confidence intervals worsened as the baseline risk (p₀) increased. For a given baseline risk, coverage deteriorated with
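A coverage check of this kind can be sketched as follows. This is a simplified stand-in, not the authors' simulation program: it assumes an unadjusted two-group comparison, in which the Wald interval for the OR from logistic regression coincides with the usual log-OR interval for a 2×2 table, and it uses hypothetical settings (250 patients per group, 1000 replicates, an arbitrary seed).

```python
import math
import random


def simulate_events(n, risk, rng):
    """Count simulated outcome events among n patients with the given risk."""
    return sum(rng.random() < risk for _ in range(n))


def substitution_ci(d0, n0, d1, n1, z=1.96):
    """RR limits from the method of substitution for a 2x2 table.

    d0/n0: events/total among unexposed; d1/n1: events/total among exposed.
    Returns None when a zero cell makes the log OR undefined.
    """
    a, b, c, d = d1, n1 - d1, d0, n0 - d0
    if min(a, b, c, d) == 0:
        return None
    log_or = math.log((a * d) / (b * c))
    se = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    p0_hat = d0 / n0  # crude baseline risk, treated as if known exactly

    def to_rr(odds_ratio):
        return odds_ratio / ((1 - p0_hat) + p0_hat * odds_ratio)

    return to_rr(math.exp(log_or - z * se)), to_rr(math.exp(log_or + z * se))


def coverage(p0, rr, n_per_group, n_sims, seed=2007):
    """Fraction of simulated intervals that contain the true relative risk."""
    rng = random.Random(seed)
    hits = used = 0
    for _ in range(n_sims):
        d0 = simulate_events(n_per_group, p0, rng)
        d1 = simulate_events(n_per_group, p0 * rr, rng)
        ci = substitution_ci(d0, n_per_group, d1, n_per_group)
        if ci is None:
            continue  # skip replicates with an empty cell
        used += 1
        hits += ci[0] <= rr <= ci[1]
    return hits / used if used else float("nan")


if __name__ == "__main__":
    for p0 in (0.1, 0.2, 0.3):
        print(f"p0={p0}: coverage {coverage(p0, 2.0, 250, 1000):.3f}")
```

Comparing the printed fractions with the nominal 0.95 across baseline risks gives a rough sense of the pattern described above; the exact values depend on the seed and the number of replicates.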

Discussion

Confidence intervals are essential to support estimates of relative risk from multivariable regression models [32], [33]. Our simulations demonstrate why the method of substitution outlined by Zhang and Yu [9] and finding common use in leading journals fails in the very situations for which it was designed—when baseline risk and relative risk are not small. Confidence intervals are too narrow and therefore precision of estimates is overstated. By contrast, confidence intervals based on either

Acknowledgment

Funding: Support was provided in part by an Agency for Healthcare Research and Quality (AHRQ), Centers for Education and Research on Therapeutics cooperative agreement (U18 HS10399), and by Agency for Healthcare Research and Quality, Grant No. R03 HS 11481-01.

Competing interests: Dr. Berlin is employed by Johnson & Johnson, which markets products for treatment of wounds. Johnson & Johnson has provided no input to or support for this study.

References (41)

J.C. Sinclair et al. Clinically useful measures of effect in binary analyses of randomized trials. J Clin Epidemiol (1994)

A.S. Robbins et al. What's the relative risk? A method to directly estimate risk ratios in cohort studies of common outcomes. Ann Epidemiol (2002)

W.D. Flanders et al. Large sample confidence intervals for regression standardized risks, risk ratios, and risk differences. J Chronic Dis (1987)

S. Greenland et al. Estimation of a common effect parameter from sparse follow-up data. Biometrics (1985)

D.G. Altman. Practical statistics for medical research (1991)

J.M. Bland et al. The odds ratio. BMJ (2000)

J. Deeks. When can odds ratio mislead? [letter]. BMJ (1998)

D.G. Altman et al. Odds ratios should be avoided when events are common. BMJ (1998)

K.A. Schulman et al. The effect of race and sex on physicians' recommendations for cardiac catheterization. N Engl J Med (1999)

F. Davidoff. Race, sex, and physicians' referral for cardiac catheterization [letter]. N Engl J Med (1999)

J. Zhang et al. What's the relative risk? A method of correcting the odds ratio in cohort studies of common outcomes. JAMA (1998)

L.E. Daly. Confidence limits made easy: interval estimation using a substitution method. Am J Epidemiol (1998)

L.A. McNutt et al. Correcting the odds ratio in cohort studies of common outcomes. JAMA (1999)

L.A. McNutt et al. Estimating the relative risk in cohort studies and clinical trials of common outcomes. Am J Epidemiol (2003)

L.M. Bjerre et al. Expressing the magnitude of adverse effects in case–control studies: “the number of patients needed to be treated for one additional patient to be harmed”. BMJ (2000)

S. Wacholder. Binomial regression in GLIM: estimating risk ratios and risk differences. Am J Epidemiol (1986)

G. Zou. A modified Poisson regression approach to prospective studies with binary data. Am J Epidemiol (2004)

R.E. Carter et al. Quasi-likelihood estimation for relative risk regression models. Biostatistics (2005)

D. Spiegelman et al. Easy SAS calculations for risk and prevalence ratios and differences. Am J Epidemiol (2005)

S. Greenland. Model-based estimation of relative risks and other epidemiologic measures in studies of common outcomes and in case–control studies. Am J Epidemiol (2004)

