Article Text

Original research
Comparative efficacy and complications of long-acting and intermediate-acting insulin regimens for adults with type 1 diabetes: an individual patient data network meta-analysis
  1. Areti Angeliki Veroniki1,2,
  2. Georgios Seitidis3,
  3. Lesley Stewart4,
  4. Mike Clarke5,
  5. Catrin Tudur-Smith6,
  6. Dimitris Mavridis3,
  7. Catherine H Yu1,7,
  8. Lorenzo Moja8,9,
  9. Sharon E Straus1,2,10,
  10. Andrea C Tricco1,11
  1. 1Knowledge Translation Program, Li Ka Shing Knowledge Institute, St. Michael’s Hospital, Toronto, Ontario, Canada
  2. 2Institute of Health Policy Management and Evaluation, University of Toronto, Toronto, Ontario, Canada
  3. 3Department of Primary Education, University of Ioannina, Ioannina, Greece
  4. 4Centre for Reviews and Dissemination, University of York, York, UK
  5. 5Northern Ireland Methodology Hub, Queen's University Belfast, Belfast, UK
  6. 6Biostatistics, University of Liverpool, Liverpool, UK
  7. 7Department of Medicine, University of Toronto, Toronto, Ontario, Canada
  8. 8Department of Biomedical Sciences for Health, University of Milan, Milan, Italy
  9. 9Essential Medicines and Health Products, WHO, Geneva, Switzerland
  10. 10Department of Geriatric Medicine, University of Toronto, Toronto, Ontario, Canada
  11. 11Dalla Lana School of Public Health, University of Toronto, Toronto, Ontario, Canada
  1. Correspondence to Dr Areti Angeliki Veroniki; Areti-Angeliki.Veroniki{at}unityhealth.to

Abstract

Objective To examine the comparative efficacy and complications of long-acting and intermediate-acting insulin for different patient characteristics for type 1 diabetes mellitus (T1DM).

Design Systematic review and individual patient data (IPD) network meta-analysis (NMA).

Data sources MEDLINE, EMBASE and the Cochrane Central Register of Controlled Trials were searched through June 2015.

Eligibility criteria Randomised controlled trials (RCTs) on adults with T1DM assessing glycosylated haemoglobin (A1c) and severe hypoglycaemia in long-acting and intermediate-acting insulin regimens.

Data extraction and synthesis We requested IPD from authors and funders. When IPD were not available, we used aggregate data. We conducted a random-effects model, and specifically a one-stage IPD-NMA for those studies providing IPD and a two-stage IPD-NMA to incorporate those studies not providing IPD.

Results We included 28 RCTs plus one companion report, after screening 6680 titles/abstracts and 205 full-text articles. Of the 28 RCTs, 27 studies provided data for the NMA with 7394 participants, of which 12 RCTs had IPD on 4943 participants. The IPD-NMA for A1c suggested that glargine once daily (mean difference [MD]=−0.31, 95% confidence interval [CI]: −0.48 to −0.14) and detemir once daily (MD=−0.25, 95% CI: –0.41 to −0.09) were superior to neutral protamine Hagedorn (NPH) once daily. NPH once/two times per day improved A1c compared with NPH once daily (MD=−0.30, 95% CI: –0.50 to −0.11). Results regarding complications in severe hypoglycaemia should be considered with great caution due to inconsistency in the evidence network. Accounting for missing data, there was no evidence of inconsistency and long-acting insulin regimens ranked higher regarding reducing severe hypoglycaemia compared with intermediate-acting insulin regimens (two-stage NMA: glargine two times per day SUCRA (Surface Under the Cumulative Ranking curve)=89%, detemir once daily SUCRA=77%; one-stage NMA: detemir once daily/two times per day SUCRA=85%). Using multiple imputations and IPD only, complications in severe hypoglycaemia increased with diabetes-related comorbidities (regression coefficient: 1.03, 95% CI: 1.02 to 1.03).

Conclusions Long-acting insulin regimens reduced A1c compared with intermediate-acting insulin regimens and were associated with lower severe hypoglycaemia. Of the observed differences, only glargine once daily achieved a clinically significant reduction of 0.30%. Results should be interpreted with caution due to very low quality of evidence.

PROSPERO registration number CRD42015023511.

  • DIABETES & ENDOCRINOLOGY
  • EPIDEMIOLOGY
  • PUBLIC HEALTH

Data availability statement

All data relevant to the study are included in the article or uploaded as supplemental information. The IPD database was only available for a certain period through the proprietary-specific online platform. Not applicable.

http://creativecommons.org/licenses/by-nc/4.0/

This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

STRENGTHS AND LIMITATIONS OF THIS STUDY

  • This systematic review with individual patient data network meta-analysis (IPD-NMA) followed the methods guidelines in the Cochrane Handbook, applied Confidence in NMA (CINeMA) to assess certainty of evidence, and reported findings using Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) extensions for NMA and IPD meta-analysis.

  • Compared with the findings of our previous systematic review that only included aggregate data, the inclusion of IPD allowed us to increase certainty in the treatment effects and include randomised controlled trials (RCTs) with non-reported published outcome data or participant characteristics.

  • Limitations of the NMA include the age of the included RCTs, high risk of within-study bias, small-study effects in severe hypoglycaemia, and intransitivity because of sex and participant age.

  • IPD were available for fewer than half of the eligible RCTs from a single funder, highlighting potential retrieval bias.

  • The literature search was conducted 7 years ago, but obtaining IPD in a timely manner was very challenging; further eligible studies may have been published since then.

Introduction

A previous systematic review and network meta-analysis (NMA) showed that long-acting insulin (glargine, detemir) is superior to intermediate-acting insulin (neutral protamine Hagedorn (NPH), lente), providing better glycaemic control and reducing the occurrence of severe hypoglycaemia in patients with type 1 diabetes mellitus (T1DM).1 Also, evidence base on multiple study designs and any type of insulin (ultra-long, long, intermediate-acting and human/animal insulin) in adults with T1DM showed that both ultra-long-acting and long-acting insulin were superior to intermediate-acting insulin in reducing glycosylated haemoglobin (A1c).2 However, the relative efficacy and safety of these formulations for different patient subpopulations (eg, baseline A1c, diabetes-related comorbidities) are unknown.3–6 Organisations including the American Diabetes Association, Diabetes Canada and the European Association for the Study of Diabetes recommend tailoring insulin regimens according to an individual’s needs.7–9 However, there is a lack of high-quality evidence to support tailoring of insulin regimens to these characteristics. For example, a recent Cochrane review found detemir reduced severe hypoglycaemia compared with NPH and that there were no clinical differences between children and adults, but evidence was of moderate certainty.10

Standard NMAs use aggregate data (AD), that is, intervention effects and associated CIs, usually obtained from primary study publications or authors. Alternatively, NMAs can use individual patient data (IPD) from each eligible study, specifically, data from each patient enrolled in the original study. There are multiple advantages with using IPD in NMA, such as adjusting and exploring treatment effect modifiers and including treatment-by-covariate interactions to allow for treatment effects to vary by patient-level characteristics. The use of IPD provides greater statistical power than AD to detect the effect of an interaction between treatment and covariate, particularly when patient-level covariates are of interest, avoids dichotomisation of continuous variables (eg, age) and decreases the chances of aggregation bias due to the use of group-level information in the analysis.11 12 An IPD-NMA allows for estimating treatment effects conditional on patient characteristics and particular populations, and can reduce heterogeneity and inconsistency between different sources of evidence in a network of randomised controlled trials (RCTs). Therefore, an IPD-NMA can provide more precise, confident and informative results compared with an AD NMA.13

Our previous systematic review with NMA using AD 1 evaluated long-acting versus intermediate-acting insulin, but definitive conclusions could not be provided about whether the treatment effect changed for patients with different characteristics. To inform health professionals and clinical practice guidelines, we sought to examine the comparative efficacy and complications of different presentations of insulin regimens, that is, hypoglycaemic agents, for patients with T1DM with different characteristics. In this systematic review, we informed our NMA using IPD, which allows for tailoring of results to specific patient characteristics.

Methods

Protocol

Our protocol was prospectively registered (PROSPERO; registration # CRD42015023511) and published.14 Additional information is also provided in the online supplemental files 1 and 2. Below, we briefly summarise our methods.

We reported our findings using the Preferred Items for Systematic Reviews and Meta-analysis (PRISMA) statement for NMA of healthcare interventions, and the PRISMA extension to IPD.15 16

Eligibility criteria

We updated our previous systematic review1 of RCTs comparing long-acting versus other long-acting or intermediate-acting insulin regimens. Eligible studies were RCTs including adults with T1DM assessing A1c change and/or severe hypoglycaemia. A severe hypoglycaemic event was defined as reported in the individual trials and generally included a medical emergency in which patients needed healthcare assistance to quickly ingest sugar or receive a glucose injection. We selected the A1c outcome, as it is a validated surrogate endpoint for reduction of cardiovascular complications.17 We selected the severe hypoglycaemic event outcome, as it is associated with significant use of healthcare resources and higher risk of mortality.18

Search strategy and study selection

Experienced librarians designed and ran our literature searches (LP, BS).1 The following databases were searched: MEDLINE, EMBASE and the Cochrane Central Register of Controlled Trials up to 16 June 2015. We scanned references of included studies and relevant reviews to identify additional relevant RCTs.

After conducting a pilot test on a random sample of RCTs, pairs of reviewers (FY, JDI, MG and PAK) screened titles, abstracts and full texts, independently. The same reviewers extracted data. Any conflicts were resolved through discussion. Overall, there was 90% agreement between reviewers during screening.

IPD collection process and data abstraction

Two individuals (a senior scientist ACT and a research assistant Susan Le) contacted authors to request IPD. Briefly, we followed the strategies outlined below. More details can be found in our RCT protocol.19 We sent (a) an email requesting IPD, (b) email reminders to the authors (four in total) at 2-week, 6-week, 10-week and 14-week intervals after the initial email, (c) reminder letters in week 7 and (d) telephone reminders in week 15. We offered coauthorship on our updated systematic review provided that the authors shared their anonymised IPD and met the International Committee of Medical Journal Editors (ICMJE) authorship criteria.20 Two individuals (AAV and Susan Le) also contacted funders of the eligible RCTs. If funding was not reported in an RCT, we confirmed funding with the original author (whom we emailed during the RCT). We contacted industry funders after navigating the data sharing process through the relevant websites, online portals, email or phone inquiry. Two follow-up reminders were sent to the funders.

To retrieve IPD, we contacted the corresponding author for each trial and, if unsuccessful, followed with the next-in-order author, as described elsewhere.21 We also contacted all industry funders of RCTs.

We requested the following IPD: (a) participant characteristics including participant age, sex, pregnancy status, baseline A1c level, presence of comorbid conditions, history of hypoglycaemia, other medications used, patient retention along with reasons for drop-out and number of participants; (b) interventions, including treatment allocated, and dosage; (c) outcomes, A1c values and severe hypoglycaemia, including time to development and measurement or event dates; and (d) study-level characteristics, such as year conducted and method of randomisation.

From each RCT, we extracted the following study characteristics: year of publication, country and continent as reported or according to the first author, and journal in which the study was published. We categorised each RCT according to funding source: (1) industry-funded, (2) publicly funded, (3) mixed and (4) non-funded studies. For trials where IPD were not available, we also extracted: (a) patient characteristics: study size and percentage of men; (b) outcome data: study data (eg, events or mean and SDs and sample size per arm) and (c) treatments compared.

Risk of bias and quality appraisal

We used the Cochrane risk of bias tool to assess within-study bias,22 and checked consistency between the IPD and trial publications. We also checked baseline imbalance for important characteristics across the treatment groups.

When at least 10 studies were available, small-study effects were examined visually using the comparison-adjusted funnel plot under the fixed-effect model.23 The NMA findings were assessed for each outcome using the CINeMA tool (Confidence in NMA; see online supplemental file 1 for more details).20

Synthesis

We described the characteristics of the included patients and treatments using frequencies and distributions. Since IPD were available through a single funder-specific platform, we conducted a one-stage analysis for the retrieved IPD (12 studies). To include all eligible RCTs in the meta-analysis, we conducted a two-stage analysis, whereby each patient was analysed separately in each RCT in the first stage and then the RCT aggregate estimates were synthesised in a random-effects standard meta-analysis or NMA in the second stage (main analysis). We graphically summarise the geometry of the included trials in a network plot.

One-stage analysis of trials providing IPD

We conducted a one-stage IPD-NMA based on data provided to us by trial funders and used a Bayesian approach adjusting for all of the available requested patient characteristics and including treatment-by-covariate interactions.24 We considered two different assumptions regarding the treatment-by-covariate interactions: (a) independent regression coefficients per study and treatment comparison (ie, assuming that all treatment-by-covariate interactions are different for each treatment vs the control), and (b) identical regression coefficients across studies and comparisons (ie, assuming that all treatment-by-covariate interactions are common for each treatment vs the control). Assuming identical regression coefficients assumption is a strong assumption but is a less data-demanding approach compared with the independent regression coefficients assumption, where the number of parameters increases in the model per treatment comparison.25 26 Our IPD analyses are based on the intention-to-treat principle including patients who had been previously excluded from a trial’s analyses where data were available. We also used imputation techniques as described in online supplemental appendix 1.

Two-stage analysis

For those studies providing IPD, we fitted a logistic regression model for severe hypoglycaemia and a linear regression model for A1c within the funder’s portal to get adjusted treatment effects. We conducted a two-stage analysis for IPD only, as well as combined adjusted IPD with AD treatment effects in a standard meta-analysis or NMA model (main analysis) using a frequentist approach. We included patient-level covariates with main terms in the model including all of the available requested patient characteristics. Unadjusted estimates from retrieved IPD and AD were also combined in a joint NMA model as a subsequent analysis.

A common-within network between-study variance was assumed across comparisons for all NMA models.27 We estimated the between-study variance (τ2) using the DerSimonian and Laird28 method for the two-stage analyses and used a τ~U(0,2) prior to the one-stage analyses. We compared τ2 with the relevant distributions provided by Turner et al29 and Rhodes et al30 to assess heterogeneity. We calculated I2 on the combined IPD and AD NMA, to quantify overall heterogeneity and inconsistency.

We visually inspected similarity of the distribution of prespecified effect modifiers, such as age and sex, across treatment comparisons to assess for transitivity.31–33 Consistency was assessed globally and locally with the design-by-treatment interaction model34 35 and the loop-specific method,36 37 respectively. We monitored changes in between-study heterogeneity and network inconsistency using subgroup NMA and network meta-regression analyses on the prespecified potential effect modifiers. More details are provided in online supplemental appendix 1.

We present the summary OR and mean difference (MD) along their corresponding 95% CIs and predictive intervals (PIs).38 We assessed all MDs using the minimal clinically important difference (MCID), adopting a cut-off point greater than 0.3039 percentage units (U.S. Food and Drug Administration [FDA] considered a clinically meaningful threshold of 0.30 or 0.40). We ranked the interventions under the consistency assumption using p-scores and the surface under the cumulative ranking curves (SUCRAs), and present them in a rank-heat plot.40–42 The SUCRA is the numerical summary of the rank distribution for each treatment, with range 0%–100%, where 100% reflects the best treatment and 0% the worst treatment. A frequentist statistic equivalent to the SUCRA statistic is the p-score statistic. All analyses were conducted in the RStudio43 using R V.3.6.2 and V.3.4.3 (available in funder’s platform) and the R2jags,44 meta45 and netmeta46 packages, respectively.

Patient and public involvement

Not applicable.

Results

Literature search, study selection and IPD obtained

We screened 6680 titles and abstracts and 205 full-text articles. We included 29 trials (28 unique studies and 1 companion report; figure 1A and online supplemental appendix 2).

Figure 1

Flow diagram for study inclusion in the review (A) and studies retrieved with individual patient data (IPD) (B). NPH, neutral protamine Hagedorn; T1DM, type 1 diabetes mellitus.

Of the 28 RCTs, 23 were industry funded, 22 were funded by one industry funder and 1 RCT jointly by two industry funders. Of the remaining studies, two were publicly funded and three did not report funding information. None of the authors of the 28 RCTs (7428 participants) and the identified companion report47 shared their study IPD.21 We contacted two funders for these RCTs. We requested data for 15 RCTs (5052 participants) from Novo Nordisk and obtained IPD for 12 RCTs (4943 participants), shared data through a proprietary-specific platform after 1058 total waiting days for a response (up to 9 March 2020; online supplemental appendix 3). The supplied IPD included data for severe hypoglycaemia, which were not reported in a published RCT.48 We contacted Sanofi-Aventis for nine studies, but none of the IPD were provided. Figure 1B shows the study flow for retrieving IPD.

Study and patient characteristics

The mean age of patients across the 28 trials ranged from 26 to 47 years. The majority of the RCTs were conducted in Europe or Oceania and included patients with A1c ranging from 6.85% to 9.70% (table 1 and online supplemental appendix 4). Study duration (defined from first patient recruited to last patient followed up) ranged between 10 and 19 years (online supplemental appendix 5). Across RCTs with available IPD, we observed no imbalance in participant characteristics in the intervention and comparison groups (online supplemental appendix 6). For studies that were industry funded, we found that smaller studies and studies of populations with lower baseline A1c were less likely to supply IPD (study size, studies with no supplied IPD: median 68 participants, IQR 47–147 participants, studies with supplied IPD: median 400 participants, IQR 288–468 participants; and A1c baseline, studies with no supplied IPD: median 7.68, IQR 7.11–7.93, studies with supplied IPD: median 8.14, IQR 7.92–8.35) (online supplemental appendix 7). Non-industry-funded studies did not supply IPD.

Table 1

Study and patient characteristics

Risk of bias and IPD integrity

Based on the publication of the 28 RCTs, at a high risk of bias were 3 (11%) for allocation concealment and 21 (75%) for blinding of participants and personnel (online supplemental appendix 8). Ten (36%) RCTs had an unclear or high risk of incomplete outcome data bias and 16 (57%) RCTs had a high potential risk of ‘other’ bias. Overall risk of bias was comparable in studies with available and unavailable IPD (online supplemental appendix 8).

Overall, IPD were consistent with results from published RCTs (online supplemental appendix 9). The median dropout rate in IPD was 6% (IQR 3%–9%). No detection of publication bias was observed in the comparison-adjusted funnel plot for A1c, but small-study effects were detected in the finding of severe hypoglycaemia pointing towards NPH being better (online supplemental appendix 10).

Network meta-analysis

In total, 27 studies (7394 participants) provided eligible data for the NMA. No evidence for intransitivity was observed in A1c or severe hypoglycaemia outcomes regarding study duration and A1c baseline (online supplemental appendix 11). However, the distribution of sex differed between the comparison NPH once daily versus detemir once daily (mean per cent male: 73%) and the remaining network (mean per cent male: 52%) in A1c. Although the distribution of the year of publication was on average similar across all comparisons in the A1c network, an imbalance was evident in severe hypoglycaemia for the treatment comparison glargine once daily versus NPH once/two times per day that included studies published before 2005 compared with the remaining network. While no inconsistency was detected between direct and indirect evidence for A1c, evidence of inconsistency was found for severe hypoglycaemia (online supplemental appendix 12). Figure 2 shows the network plots for the two outcomes.

Figure 2

Network diagrams for (A) A1c and (B) severe hypoglycaemia outcomes. Each node and line are weighted according to the number of studies. Numerical values represent the number of studies per treatment comparison overall and with individual patient data (IPD). Orange-coloured edges include at least one study with available IPD, and black-coloured edges include aggregate data only. A1c, glycosylated haemoglobin; bid, two times per day; NPH, neutral protamine Hagedorn; od, once daily; qid, four times a day.

Glycosylated haemoglobin (A1c)

The NMA for A1c included 27 RCTs with 8 treatment nodes and 7394 participants, of which 12 RCTs (4943 participants) contributed IPD.

Network of studies with aggregate and fully adjusted treatment effect estimates from IPD

All available treatment options were included in this network. Glargine once daily reduced A1c compared with NPH once daily (MD=−0.31, 95% CI: −0.48 to −0.14) and NPH four times a day (MD=−0.40, 95% CI: −0.76 to −0.04). NPH once daily/two times per day improved A1c compared with NPH four times a day (MD=−0.40, 95% CI: −0.77 to −0.02). Detemir once daily (MD=−0.25, 95% CI: −0.41 to −0.09) and NPH once daily/two times per day (MD=−0.30, 95% CI: −0.50 to −0.11) were superior to NPH once daily (table 2 and online supplemental appendix 13). Glargine once daily and NPH once daily/two times per day achieved an MCID. However, results were imprecise according to PIs only for glargine once daily versus NPH once daily.

Table 2

NMA results in A1c and severe hypoglycaemia

Long-acting analogues were likely superior to intermediate-acting insulin regimens with detemir four times a day (p-score=71%) having the greatest likelihood of being the most effective in improving A1c followed by glargine once daily and detemir once daily/two times per day (p-score=67%, figure 3A).

Figure 3

Rank-heat plot of p-scores for (A) A1c and (B) severe hypoglycaemia. Circles from inside out present results for two-stage network meta-analyses including: (1) fully adjusted results from studies with individual patient data (IPD), (2) crude results from studies with IPD, (3) crude aggregate data (AD), (4) AD and crude results from studies with IPD, (5) AD and fully adjusted results from studies with IPD. A1c, glycosylated haemoglobin; bid, two times per day; NPH, neutral protamine Hagedorn; od, once daily; qid, four times a day.

Network of studies with fully adjusted treatment effect estimates from IPD

Studies in this two-stage NMA compared detemir once daily, once daily/two times per day and four times a day, glargine once daily, and NPH once daily and once daily/two times per day. Detemir once daily ranked best (p-score=76%), followed by detemir once daily/two times per day (p-score=63%; online supplemental appendix 14). However, results were imprecise.

Network of studies with AD

Studies in this NMA compared all available treatments. NPH once daily/two times per day improved A1c compared with NPH once daily (MD=−0.30 95% CI: −0.48 to −0.12) and NPH four times a day (MD=−0.40, 95% CI: −0.76 to −0.04). Although results were associated with an MCID, they were imprecise according to PIs apart from the comparison NPH once daily/two times per day versus NPH once daily. The most effective treatments were most likely detemir once daily/two times per day (p-score=78%) and glargine once daily (p-score=68%).

Additional analyses: network of studies with AD and treatment effect estimates from IPD

The analyses using crude IPD and AD suggested that NPH once daily/two times per day improved A1c compared with NPH once daily (MD=−0.30 95% CI: −0.49 to −0.12) and NPH four times a day (MD=−0.40, 95% CI: −0.76 to −0.04) (online supplemental appendix 14). Results were also associated with an MCID. The most efficacious treatment was likely detemir once daily/two times per day (p-score=71%). Excluding the nodes with insulin regimen four times a day from the network, detemir once daily/two times per day (p-score=65%) and glargine once daily (p-score=65%) ranked best. In contrast to studies with serious overall risk of bias, studies without serious overall risk of bias suggested no difference between NPH once daily/two times per day and NPH four times a day (MD=−0.21 95% CI: −0.67 to 0.25). Adjusting for differences in baseline A1c, sex, RCT duration and age, NPH once daily and four times a day were the least effective treatments. However, the treatment effect for A1c did not significantly change with a unit increase in any of the aforementioned potential effect modifiers.

Additional analyses: network of studies with treatment effect estimates from IPD

Studies in the one-stage NMA of studies with IPD only compared detemir once daily, two times per day, once daily/two times per day and four times a day, glargine once daily, and NPH once daily, two times per day, once daily/two times per day (online supplemental appendix 15). Detemir two times per day ranked best (SUCRA=67%), followed by detemir once daily/two times per day (SUCRA=66%). However, results were imprecise (online supplemental appendix 15). Analyses using IPD only estimated detemir once daily and two times per day as two of the best insulin regimens. This agreed with sensitivity analyses on prior to between-study variance, imputation methods, and crude or fully adjusted IPD. Across all adjusted IPD analyses, A1c baseline was positively and clinically importantly associated with the treatment effect. The multiple imputation by chained equations suggested that detemir once daily/two times per day best improved A1c. However, results were imprecise due to the sparsity of the network.

Heterogeneity across all analyses was low according to the empirical distribution (median 0.03, 95% range: 0.00 to 4.95) by Rhodes et al.30 On average, confidence in NMA results was moderate and ranged between low and high across treatment comparisons in the network (online supplemental appendix 15).

Severe hypoglycaemia

The NMA for severe hypoglycaemia included 17 RCTs with 8 treatment nodes and 6438 participants, of which 10 RCTs (4436 participants) contributed IPD. Four studies included zero events in all arms and were excluded from the analysis.49–52 Study definitions on severe hypoglycaemia are provided in online supplemental appendix 16. The average time to at least one serious hypoglycaemic event across studies with available IPD ranged between 21 and 272 days (online supplemental appendix 17).

Network of studies with AD and fully adjusted treatment effect estimates from IPD

Studies in this NMA compared all available treatments. Although glargine two times per day (p-score=84%) and detemir once daily (p-score=78%) had the highest ranking score with the least complications regarding severe hypoglycaemia (table 2, figure 3B and online supplemental appendix 13), results were imprecise, and the network of evidence was inconsistent. Thus, these observations should be interpreted with great caution. Estimated ORs for direct and indirect evidence of the comparisons detemir once daily/two times per day versus glargine once daily, and glargine once daily versus NPH once daily/two times per day pointed to the opposite direction (online supplemental appendix 18). These results were explored further in additional analyses (see also online supplemental appendix 14 and 19).

Network of studies with fully adjusted treatment effect estimates from IPD

Studies in this two-stage NMA compared detemir two times per day, once daily/two times per day and four times per day, glargine once daily, and NPH two times per day and once daily/two times per day (online supplemental appendix 19). Detemir once daily/two times per day ranked best among all insulin regimens (SUCRA=97%, online supplemental appendix 14). Although heterogeneity decreased to I2=0%, results were imprecise and firm conclusions could not be drawn. The network included no loops, and inconsistency could not be assessed.

Network of studies with AD

Studies in this NMA compared all available treatments. Similar to NMA of adjusted IPD and AD, despite glargine two times per day (p-score=87%) and detemir once daily (p-score=78%) had the highest likelihood of causing the least complications for severe hypoglycaemia (online supplemental appendix 14), direct and indirect evidence in the network were inconsistent. Hence, results should be considered with great caution.

Additional analyses: network of studies with AD and treatment effect estimates from IPD

The analyses using crude IPD and AD agreed with the main analysis (adjusted IPD and AD), but different sources of evidence in the network were still inconsistent. Subsequent analyses using adjusted IPD and AD did not explain inconsistency in the evidence network. Only sensitivity analysis accounting for missing data and using AD to apply the informative missingness OR showed no evidence of inconsistency in the network, which may suggest that missing data impact on consistency between direct and indirect evidence. Of the 17 studies with AD, 16 reported the number of dropouts with median dropout rate 3% (IQR 2%–10%) across groups. Heterogeneity was comparable with the main analysis (τ2=0·14), and glargine two times per day (SUCRA=89%) and detemir once daily (SUCRA=77%) were most likely the insulin regimens with the least complications regarding severe hypoglycaemia. However, results were imprecise.

Additional analyses: network of studies with treatment effect estimates from IPD

Studies in the one-stage NMA of studies with IPD only (crude or fully adjusted) compared detemir two times per day, once daily/two times per day and four times a day, glargine once daily, and NPH two times per day and once daily/two times per day (online supplemental appendix 19). Analyses suggested that detemir once daily/two times per day was the insulin regimen with least complications in severe hypoglycaemia. The network included no loops, and inconsistency could not be assessed. Accounting for missing data and adjusting for potential effect modifiers in the one-stage NMA, we found that the likelihood of experiencing a severe hypoglycaemic event increased with diabetes-related comorbidities (regression coefficient in sensitivity analysis with multiple imputations: 1.03, 1.02–1.03; online supplemental appendix 19). The credibility of claimed effect modifications remains limited.

Heterogeneity across all analyses was moderate compared with the Turner et al29 empirical distribution (median 0.04, 95% range: 0.00 to 1.58). Confidence in the NMA results was very low (online supplemental appendix 15).

Discussion

In this review, we evaluated efficacy and complications from each type of long-acting versus intermediate-acting insulin regimens for patients with T1DM for the outcomes A1c and severe hypoglycaemia using studies with both AD and IPD. Our results showed that long-acting insulin regimens were more efficacious for reducing A1c than intermediate-acting insulins. Among the intermediate-acting insulin regimens, we found that NPH once daily/two times per day performed best. Despite the higher efficacy of the long-acting insulin regimens, of the observed differences only glargine once daily achieved an MCID. Overall, NMA results were imprecise with moderate confidence.

Long-acting insulin analogues were associated with lower severe hypoglycaemia. Patients receiving glargine two times per day were less likely to experience a hypoglycaemic event, followed by detemir once daily and detemir once daily/two times per day. Our results were imprecise, yet our findings showed that among all insulin regimens, the therapy with the most complications regarding severe hypoglycaemia was NPH once daily/two times per day. Adjusting for differences by study or patient characteristics, including sex, RCT duration and age, we found that severe hypoglycaemia occurred less often for patients receiving long-acting versus intermediate-acting insulin regimens. Also, our NMA accounting for missing data showed that the likelihood of severe hypoglycaemic events was associated with diabetes-related comorbidities. However, results on severe hypoglycaemia should be interpreted with caution due to very low quality of evidence, including network inconsistency, imprecision and potential reporting bias.

To our knowledge, this is the first IPD-NMA of long-acting and intermediate-acting insulin regimens for patients with T1DM. We were able to obtain IPD for 43% of RCTs and 67% of known randomised participants from a single funder. Data sharing was indeed disappointing. We followed the methods guidelines in the Cochrane Handbook,53 applied the CINeMA tool,20 and reported in accordance with the PRISMA extensions for NMA and IPD meta-analysis.15 16 Compared with the findings of our previous systematic review,1 including IPD allowed us to: (a) increase certainty in the treatment effect estimates; for example, glargine once daily was superior to NPH once daily, and NPH once daily and four times a day were inferior to NPH once daily/two times per day, in regard to A1c; (b) include trials that we previously were unable to include since outcome data were not reported in their publication54; (c) explore for treatment-by-covariate interactions that were not reported in the original publications (eg, comorbidities, additional medications); and (d) observe minor differences between published results and reanalysis of IPD (most likely because we adopted the intention-to-treat principle, whereas RCTs excluded some patients from their published analyses). It should be noted that NPH or detemir four times a day is not a label recommendation and should be administered once daily or two times per day; however, in this systematic review, we only report the eligible studies and what they applied.

There are some limitations in this study. First, we were able to include IPD for fewer than half of the eligible RCTs, highlighting potential retrieval bias for IPD. Empirical evidence shows that it is inevitable to obtain only part of the requested IPD from the RCTs included in a systematic review.21 55–57 Unavailable IPD can be in part of a study dataset (eg, missing information on a specific outcome or participant characteristics) or in a whole study. However, as shown in simulation studies,58 the use of both IPD and AD in the same model has improved precision in treatment effects and power to detect a true treatment effect. A key advantage of using IPD (over AD alone) is the potential to improve completeness and validity of data in each RCT, and hence reduce bias due to reporting in publication. Second, the available IPD did not allow analyses that could have informed personalisation of medicine, tailoring insulin regimens to the individual’s characteristics (ie, lifestyle, diet, general health, motivation, hypoglycaemia awareness status and ability for self-management), as planned in our protocol due to absence of these data.3–6 To inform guideline recommendations, future RCTs should collect data that will enable tailor efficacy and complications of long-acting and intermediate-acting insulin regimens to these characteristics. Third, the comparison-adjusted funnel plot for severe hypoglycaemia suggested there is an indication for small-study effects pointing to NPH being better, and hence results should be interpreted with caution. Fourth, the network of evidence for severe hypoglycaemia was inconsistent. The network was only consistent when we accounted for missing data in the analysis, suggesting that attrition bias may explain disagreement between direct and indirect evidence. Other potential reasons for inconsistency may include the presence of small-study effects, and the intransitivity due to the imbalance in the distribution of publication year across treatment comparisons. In particular, the treatment comparison glargine once daily versus NPH once daily/two times per day was informed by studies published before 2005. It should be noted that in 2005, the ICMJE made registration of clinical trials obligatory.59 Hence, we expect that there are differences in the tactic and quality of studies published before and after 2005. Intransitivity was also observed in A1c regarding sex. Fifth, on average, confidence in NMA findings in A1c was moderate, which was due to the high risk of within-study bias, imprecision and heterogeneity across studies. Confidence in the network of severe hypoglycaemia was very low, because of high risk of within-study and reporting bias, imprecision and incoherence. Sixth, the literature search was conducted 7 years ago, and further eligible studies may have been published. However, obtaining IPD in a timely manner was very challenging (eg, 740 days were required to access IPD from initial inquiry). Seventh, in this review, we included the long-acting insulins glargine and detemir, but we did not identify studies assessing the (ultra)long-acting insulin degludec. However, it should be considered that degludec was only approved in September 2015 by the FDA and in August 2017 by Health Canada, and our literature search was conducted in June 2015. Eighth, we considered studies by De Leeuw et al60 and Vague et al61 as two independent studies, while these publications report on the same RCT. However, the contribution of the Vague et al61 study to the network estimates was on average 1.81% to the A1c treatment effect estimates and 1.27% to the severe hypoglycaemia treatment effect estimates, suggesting that the findings of this study only minimally impacted the NMA findings.20

We expect that our findings will be of interest to patients, health professionals and other end-users, such as the American Diabetes Association and Diabetes Canada, the European Association for the Study of Diabetes and other national and international organisations. To optimise healthcare for patients with T1DM, high-quality and well-conducted RCTs measuring patient-important outcomes including glycaemic variability and time-in-range are of immediate need, particularly in low/middle-income countries.62 Also, sharing of IPD from such future trials and those RCTs already completed is crucial to decrease the uncertainty surrounding our findings and facilitating tailored decision-making. Our study showed that industry-funded studies shared IPD for large RCTs (>200 patients) with patients of poor control of A1c baseline (>8%). IPD retrieval is not yet well established in the field of T1DM. We need to advocate to ensure that scientists have routine access to these data for reanalysis and meta-analysis.

Data availability statement

All data relevant to the study are included in the article or uploaded as supplemental information. The IPD database was only available for a certain period through the proprietary-specific online platform. Not applicable.

Ethics statements

Patient consent for publication

Ethics approval

Not applicable.

Acknowledgments

We thank Laure Perrier for conducting the initial literature search, Alissa Epworth for updating it and Becky Skidmore for peer reviewing the literature search. We thank Dr. Robert Peterson and McAuley Glenn for their support on this study as knowledge users. We would also like to thank Dr. Paul A Khan, Fatemeh Yazdi, Marco Ghassemi and John D Ivory for screening some of the citations, data abstracting included studies or both. Finally, we thank Huda Ashoor and Patricia Rios for helping coordinate the review at an early stage, screen citations and full-text articles, abstract data, appraise quality, and clean data, and for editing a previous version of the manuscript. We thank Susan Le for helping with contacting the authors and sponsors of the included studies in this review, as well as for assisting with the manuscript submission. We would like to thank Novo Nordisk for sharing individual patient data with us through their online platform.

References

Supplementary materials

  • Supplementary Data

    This web only file has been produced by the BMJ Publishing Group from an electronic file supplied by the author(s) and has not been edited for content.

Footnotes

  • Contributors AAV, SES and ACT conceived and designed the study. AAV coordinated and conducted the analyses, contacted funders, analysed data, interpreted results, appraised quality of results and wrote a draft manuscript. AAV is the guarantor of the study. GS conducted analysis and edited the manuscript. SES and ACT interpreted results and edited the manuscript. ACT contacted authors. LS, MC, CT-S, DM and CHY provided input into the design, interpreted results and edited the manuscript. All authors read and approved the final manuscript.

  • Funding This research was funded by the Knowledge Synthesis Canadian Institutes of Health Research (CIHR) competition (No. 351143). AAV was previously funded by the CIHR Banting Postdoctoral Fellowship Program (No. 139157) and the European Union’s Horizon 2020 (No. 754936). GS and DM were funded from the European Union’s Horizon 2020 (No. 754936). SES was funded by a Tier 1 Canada Research Chair in Knowledge Translation. ACT was funded by a Tier 2 Canada Research Chair in Knowledge Synthesis.

  • Disclaimer The funder contributed to defining the scope of the systematic review but otherwise had no role in study design, data collection, analysis and interpretation of data. Data sharing sponsor was provided the ability to review at the time of submission of the manuscript for publication.

  • Competing interests None declared.

  • Patient and public involvement Patients and/or the public were not involved in the design, or conduct, or reporting, or dissemination plans of this research.

  • Provenance and peer review Not commissioned; externally peer reviewed.

  • Supplemental material This content has been supplied by the author(s). It has not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been peer-reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and responsibility arising from any reliance placed on the content. Where the content includes any translated material, BMJ does not warrant the accuracy and reliability of the translations (including but not limited to local regulations, clinical guidelines, terminology, drug names and drug dosages), and is not responsible for any error and/or omissions arising from translation and adaptation or otherwise.