Article Text

Download PDFPDF

Protocol
Summarising and synthesising regression coefficients through systematic review and meta-analysis for improving hypertension prediction using metamodelling: protocol
  1. Mohammad Ziaul Islam Chowdhury1,
  2. Iffat Naeem1,
  3. Hude Quan1,
  4. Alexander A Leung2,
  5. Khokan C Sikdar3,
  6. Maeve O'Beirne4,
  7. Tanvir C Turin4
  1. 1 Department of Community Health Sciences, University of Calgary Cumming School of Medicine, Calgary, Alberta, Canada
  2. 2 Department of Medicine, University of Calgary Cumming School of Medicine, Calgary, Alberta, Canada
  3. 3 Health Status Assessment, Surveillance and Reporting, Public Health Surveillance and Infrastructure, Population, Public and Indigenous Health, Alberta Health Services, Calgary, Alberta, Canada
  4. 4 Department of Family Medicine, University of Calgary Cumming School of Medicine, Calgary, Alberta, Canada
  1. Correspondence to Mohammad Ziaul Islam Chowdhury; mohammad.chowdhury{at}ucalgary.ca

Abstract

Introduction Hypertension is one of the most common medical conditions and represents a major risk factor for heart attack, stroke, kidney disease and mortality. The risk of progression to hypertension depends on several factors, and combining these risk factors into a multivariable model for risk stratification would help to identify high-risk individuals who should be targeted for healthy behavioural changes and/or medical treatment to prevent the development of hypertension. The risk prediction models can be further improved in terms of accuracy by using a metamodel updating technique where existing hypertension prediction models can be updated by combining information available in existing models with new data. A systematic review and meta-analysis will be performed of hypertension prediction models in order to identify known risk factors for high blood pressure and to summarise the magnitude of their association with hypertension.

Methods and analysis MEDLINE, Embase, Web of Science, Scopus and grey literature will be systematically searched for studies predicting the risk of hypertension among the general population. The search will be based on two key concepts: hypertension and risk prediction. The summary statistics from the individual studies will be the regression coefficients of the hypertension risk prediction models, and random-effect meta-analysis will be used to obtain pooled estimates. Heterogeneity and publication bias will be assessed, along with study quality, which will be assessed using the Prediction Model Risk of Bias Assessment Tool checklist.

Ethics and dissemination Ethics approval is not required for this systematic review and meta-analysis. We plan to disseminate the results of our review through journal publications and presentations at applicable platforms.

  • hypertension
  • statistics & research methods
  • risk management
http://creativecommons.org/licenses/by-nc/4.0/

This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

Strengths and limitations of this study

  • One of the big strengths of this study is the comprehensiveness of the systematic review, which will include a search of four different databases and extensive use of reference lists.

  • This study will provide a comprehensive summary of hypertension risk prediction models and the risk factors they employ in the general population, and will summarise evidence as to their performance.

  • The study will lay a foundation for further improvement on hypertension risk prediction using metamodelling, where pooled common regression coefficients obtained through meta-analysis will be used in a new population.

  • This review has some limitations due to inevitable heterogeneity caused by study differences in modelling methods, adjustment factors and methods of measurements.

Introduction

Hypertension or high blood pressure is one of the most common medical conditions affecting almost one in four individuals worldwide1 and represents a major risk factor for heart attack, stroke, kidney disease and mortality.2 Hypertension has been attributed to be responsible for 13% of global deaths.3 Prevention of hypertension, as well as control of blood pressure in hypertensive patients, is considered a major public health and primary care concern.4 Current population health research is increasingly integrating aspects of precision public health, which emphasises targeted health interventions by identifying people who are at greatest risk of disease.5

The risk of developing hypertension depends on many factors, including age, body mass index (BMI), blood pressure, smoking, family history and level of physical inactivity.6 Combining these known risk factors into a multivariable model for risk stratification would help to identify high-risk individuals who should be targeted for healthy behavioural changes and/or medical treatment to prevent the development of hypertension.7 8 Inaccurate risk estimation can lead to ineffective interventions in patients at low risk, as well as missed opportunities to intervene in those who are most susceptible to develop hypertension.

The accuracy of the prediction model can be improved by using a metamodel updating technique.9 Existing hypertension prediction models can be updated by combining information from other published models. This metamodel updating approach works from the ‘middle ground’ in which existing prediction models designed for the specific populations and endpoints of interest are then used and revised to fit a new population.9 10 The updated model is then based on both the new and existing data, with the potential of improving model performance in the new population of interest.

Many prediction models have been developed to predict the risk of hypertension in the general population. Although there are multiple models, their predictive ability varies due to a lack of consistency in estimating risk by different models. Two prior reviews examined hypertension risk prediction models in adults.11 12 Both reviews conducted a narrative synthesis of the evidence to summarise the existing knowledge and performance of hypertension prediction models. In addition, a systematic review on prediction models to identify children at increased risk of future hypertension was also conducted.13 However, none of the previous studies performed a meta-analysis. Our review is different from the past reviews and adds to knowledge on predicting risk of hypertension and identifying associated risk factors in the following ways: (1) we plan to synthesise common regression coefficients of the existing hypertension risk prediction models through meta-analysis; (2) we propose pooled regression coefficients to be further used in developing a new hypertension risk prediction model for improved accuracy; (3) we plan to provide a detailed assessment of quality of studies among models developed; and (4) we hope to identify several additional and recently derived models. A systematic review and meta-analysis will be performed to identify existing prediction models for incident hypertension and risk factors they employ in the general adult population and to summarise evidence as to their performance. This will allow for model updating, which has the potential of improving the predictive ability for hypertension.

Objectives

The specific objectives for this systematic review and meta-analysis are

  1. To summarise the predictive ability and quality of all existing risk prediction models for incident hypertension.

  2. To identify a list of risk factors/predictors considered in existing hypertension risk prediction models and get their corresponding parameter values (regression coefficients).

  3. To lay a foundation for further improvement on hypertension risk prediction through metamodel updating technique using pooled common regression coefficients identified in objective 2.

Method

Protocol design

A systematic review and meta-analysis study design will be used to summarise information about all known published hypertension risk prediction models.To summarise and synthesise research evidence from multiple studies that are combinable, this study design is most applicable. This study protocol is reported in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses Protocol.14

Data sources and searches

We will systematically search MEDLINE, Embase, Web of Science and Scopus (each from inception to March 2020) for studies predicting the risk of incident hypertension in the general adult population. We will also search the reference lists of all identified relevant articles and will contact experts in the field of hypertension for information about other potential ongoing or unpublished studies. These experts will be identified from the review process. For the grey literature, Google Scholar and ProQuest (theses and dissertations) will be searched. The systematic search will be broken into two key concepts: hypertension and risk prediction. For each key concept, proper free-text words and Medical Subject Headings (MeSH) terms will be used. In addition, certain text words will be truncated, or wildcards will be used to capture all relevant articles from all databases. Finally, all the words and MeSH terms will be combined using the Boolean operators ‘AND’, ‘OR’ and ‘NOT’ as appropriate. The detailed search strategy for MEDLINE is provided in box 1.

Box 1

Keywords used to search in MEDLINE

Keywords

  • prediction model*.mp

  • risk function*.mp

  • risk prediction*.mp

  • risk table*.mp

  • predictive model*.mp

  • exp ‘Predictive Value of Tests’/

  • risk chart*.mp

  • risk equation*.mp

  • risk engine*.mp

  • risk calculat*.mp

  • risk score*.mp

  • prediction tool*.mp

  • prediction rule*.mp

  • risk model*.mp

  • prognostic tool*.mp

  • prognostic model*.mp

  • exp Risk Assessment/

  • risk algorithm*.mp

  • risk ind*.mp

  • prediction algorithm*.mp

  • (hypertension adj2 (risk score or risk model or prediction model or risk prediction model or risk assessment)).mp

  • (high blood pressure adj2 (risk score or risk model or prediction model or risk prediction model or risk assessment)).mp

  • OR/1–22

  • validation.mp.

  • exp Validation Studies/

  • validate*.mp

  • OR/24–26

  • 23 AND 27

  • exp Hypertension/

  • hypertens*.mp

  • (high adj2 blood pressure).mp

  • high blood pressure.mp

  • elevated blood pressure.mp

  • blood pressure.mp

  • OR/29–34

  • 28 AND 35

Eligibility criteria

We plan to use broad inclusion criteria to provide a comprehensive systematic review of the topic. Only primary studies with extractable data will be included in this review. This excludes reviews, editorials, commentaries and letters to the editor. We will only consider cohort studies considering the risk prediction models for incident hypertensions are generally developed using cohort study design with a follow-up information. However, studies will not be considered for inclusion if they were written in languages other than English and French. The Population, Prognostic Factors (or Models of Interest) and Outcome15 framework will used to outline eligibility criteria.

Population

The study population consists of general people who are free from hypertension at baseline and those for which hypertension risk prediction models are developed. There will be no restrictions in geographical region, time period or gender of the study participants. However, only the adult population, those who are aged 18 years or older, will be considered as our outcome is essential hypertension common in adults.

Prognostic Factors (or Models of Interest)

Studies that developed risk prediction models for incidence hypertension in the general adult population will be included. However, studies focusing on the additive predictive value of new risk factors to an existing prediction model, or studies presenting a prediction model developed in patients with previous hypertension, or studies that derived risk prediction tools other than score-type tools, such as risk charts, will not be included. In addition, we will not consider studies that have only assessed bivariate associations between predictors and hypertension incidence. Rather, we will be focusing on those studies where hypertension prediction models were built incorporating predictors that demonstrated significant prognostic contribution in predicting incident hypertension. Further, we will only consider predictors that are non-invasive, considering their feasibility in routine setting. A hypertension risk prediction model is defined as one combining two or more predictors to obtain estimates of the predicted risk of developing hypertension. Regression coefficients associated with the risk prediction models are constants (that are estimated) in the prediction model that tells about the change in outcome corresponding to the change in predictors and describe the relationship between a predictor and the outcome. If a model is assessed on more than one external population, information from all reported models will be considered. However, if the model is presented both in a derivation cohort and in a validation cohort, only information from the validation cohort will be considered.

Outcome

Outcomes of our study are hypertension and will be primarily defined as a systolic blood pressure (SBP) of ≥140 mm Hg or a diastolic blood pressure (DBP) of ≥90 mm Hg or taking antihypertensive medication. Although the 2017 American College of Cardiology (ACC)/American Heart Association (AHA) Hypertension Guideline lowers the cut-off level to define hypertension (SBP≥130 mm Hg or DBP≥80 mm Hg or taking antihypertensive medication),16 we will consider the former definition bearing in mind prediction models developed before appearance of new definition considered the former definition. Nevertheless, if we identify any study that reported a different formal definition of hypertension (eg, new ACC/AHA definition), we will also consider those definitions to capture the maximum number of studies. In the event of more than one hypertension definition, meta-analysis will be performed separately for each different definition of hypertension (outcome).

Study selection

Title–abstract screening

Initially, all articles identified through our search strategy will be exported to EndNote (Clarivate Analytics) (a software for managing bibliographies, citations and references) for duplicate removal. Level 1 screening will consist of an independent review by two researchers of the title and abstract of deduplicated records. Studies will be included if they indicate assessment of prediction models for hypertension. Records will be included if they discuss either validation or development of such a model. Further, records will be included if they assess the predictive ability of individual risk factors associated with hypertension. Inter-rater reliability (kappa coefficient) will be calculated to measure disagreement between independent reviewers, which will be resolved by consensus. Studies kept during level 1 screening will go through level 2 screening.

Full-text screening

Full text articles will be retrieved for all records meeting level 1 review criteria. Level 2 screening will consist of full-text review by two independent reviewers. Articles will be included if they contain extractable data on hypertension prediction models and regression coefficients for hypertension risk factors. The articles identified eligible after full-text review will be included for data extraction. Agreement between reviewers will be quantified using the kappa statistic. Any disagreement between reviewers will be resolved through consensus.

Data extraction and synthesis

First, the numbers of studies identified, excluded (with the reason for exclusion) and included in the systematic review and subsequent meta-analysis will be summarised using the Preferred Reporting Items for Systematic Reviews and Meta-Analyses flow diagram17 (figure 1). Next, two reviewers will independently extract data from each study using a standardised form (table 1) that will include study name, location where the model was developed and participants’ ethnicity, study design used, age and gender of the study participants, predictors included in the model, number of events and total participants, definition used for hypertension, duration of follow-up, modelling method used, measures of discrimination and calibration of the prediction model, and the validation of the prediction model. In a separate form (table 2), information about the parameters (regression coefficients) of the identified hypertension risk prediction models will be extracted, which includes study name, sample size, modelling method, mathematical model and corresponding regression coefficients, number of predictors, list of predictors, values of the regression coefficients and statistical significance of the regression coefficients. In this review, we will primarily focus on a traditional regression type’s prediction modelling (eg, logistic regression model and Cox proportional hazard regression model), anticipating that it is the most common form of prediction modelling. Recently emerged more complicated modelling strategies (eg, machine learning tools) will not be considered in this review. It is very likely that some of the studies will report ORs or HRs (depending on the modelling methods) instead of the regression coefficients for their model. In that case, we will back-calculate the regression coefficients from the OR or HR using appropriate formula. For example, the OR formula for any single generally defined exposure variable E , whether dichotomous, ordinal or interval, controlling for a collection of control variables is given by

Figure 1

Preferred Reporting Items for Systematic Reviews and Meta-Analyses diagram for systematic review of studies presenting hypertension prediction models developed in the general population.

Table 1

Extracted information about existing hypertension prediction models from the selected studies

Table 2

Extracted information about the parameters of the existing hypertension prediction models

Embedded Image

where Embedded Image and Embedded Image are the two levels of the exposure variable Embedded Image is the regression coefficient associated with exposure Embedded Image and the deltas Embedded Image are coefficients of potential interaction effects involving E separately with each of the p potential effect modifiers Embedded Image .18 When there is no interaction case, the OR formula reduces to the quantity Embedded Image and becomes Embedded Image when the difference between exposure levels equals 1. This formula can be used to convert ORs to the regression coefficients. Similarly, the HRs can also be converted to the regression coefficients. As relation between OR and HR is not yet established, pooling regression coefficients derived from them will be separate.

Meta-analysis of regression coefficients associated with the risk factors in prediction models involved several challenges, including incomparable effect measures (eg, OR and HRs); different methods of measurement for predictors and outcome; different approaches to handling outcome and predictors (eg, age can be continuous or categorical with different levels of categories and cut-offs to define categories); differences in measurement units; lack of proper reporting of estimates (eg, SE of the effect measures may not be reported); and use of a different set of adjustment predictors in different models.19 These issues can lead to a substantial amount of heterogeneity.

A recent guideline has provided some recommendations to overcome these issues.19 A separate meta-analysis on similar effect measures (eg, meta-analysis of ORs and HRs separately), similar associations (eg, meta-analysis of unadjusted and adjusted predictors separately), similar cut-offs for predictors (eg, separate meta-analysis on distinct cut-offs of predictors), and similar method of measurement for predictors and outcome (eg, separate meta-analysis for each measurement method) are among the recommendations.

Although ideal but very unlikely, the estimates (eg, regression coefficients or ORs or HRs) derived from different models are adjusted for the same set of other predictors. Estimates adjusted for a different set of predictors create difficulty in interpreting meta-analysis results. To overcome this issue, the guideline recommended considering meta-analysis only on those estimates that are adjusted for at least a predefined minimum core set of established predictors.19 This core set of predictors for the outcome can be defined in consultation with the experts. For the hypertension outcome, a potential core set of predictors to adjust can be age, gender, smoking status and BMI.

Standardisation of the effect measures to ensure they all relate to the same scale of measurement will also be considered if necessary. Regression coefficients will be converted into standardised regression coefficients and then meta-analysis will be performed. Most of the existing standardisation techniques20 21 were built with continuous response variables in mind, and complication arises when response variable become binary as in our case. We will adopt the latent-theoretical method proposed by Grace et al 22 for standardisation as this method offers a suitable approach for standardisation in a binary response variable.

We will follow the above steps and recommendations provided by the guideline19 in our study to make our meta-analysis results (pooled estimates) more interpretable. Although our initiatives may not provide complete remedy and heterogeneity may remain, we anticipate performing a random-effect meta-analysis will allow for inevitable heterogeneity in effect measures across studies.

Quality assessment

Study quality will be assessed by each reviewer, according to the Prediction Model Risk of Bias Assessment Tool (PROBAST) checklist23 (table 3). The PROBAST tool is designed for assessing the risk of bias and concerns regarding the applicability of both diagnostic and prognostic prediction model studies. The PROBAST contains 20 questions under four domains: participants, predictors, outcome and analysis that facilitates judgement of risk of bias and applicability. Overall risk of bias of the prediction models will be judged as ‘low’, ‘high’ or ‘unclear’, and overall applicability of the prediction models will be judged as ‘low concern’, ‘high concern’ and unclear according to the PROBAST checklist.

Table 3

Prediction Model Risk of Bias Assessment Tool checklist

Data analysis

The summary statistics, also known as the effect measure, from the individual studies will be the regression coefficients of the hypertension risk prediction models. Studies will be grouped based on the type of modelling method used to develop the model as regression coefficients will have different meanings based on different modelling methods (eg, in a logistic regression model, a coefficient represents the change in the log odds that would result from a one unit change in one specific predictor, holding all other predictors constant while in a Cox proportional hazard regression model, the coefficients represent the change in the expected log of the HR relative to a one unit change in one specific predictor, holding all other predictors constant). We will use random-effect meta-analysis to obtain the pooled weighted average of the common regression coefficients with 95% CIs for groups of models using the DerSimonian and Laird method.24 Forest plots will be also generated to show the pooled regression coefficient, together with 95% CI, the author’s name, publication year and study weights. In studies that provided only regression coefficients (β) but no measure of its variance or CIs, the SE and 95% CI of the regression coefficients (β) will be calculated using the formula

Embedded Image

where Embedded Image the number of patients with hypertension and Embedded Image the number of patients without hypertension and the upper 95% CIEmbedded Image , and lower 95% CIEmbedded Image .25 If the CIs of the regression coefficients are available, SEs of the regression coefficients will be derived from the CIs. Information about the pooled regression coefficients will be presented in a tabular form (table 4). STATA V.13.1 will be used to perform statistical analysis using the following commands: metan, metareg, metabias, metatrim and metafunnel.

Table 4

Information about the pooled regression coefficients

Testing for heterogeneity

Studies that will be included in our systematic review are likely to be diverse methodologically, and as a result, heterogeneity in the results is to be expected. Heterogeneity arises when differences between study results are beyond those attributable to chance alone; potential sources include the study setting, the study participant type and the study methodology, among others. Heterogeneity will be assessed using Cochran’s Q statistic and quantified with the I2 statistic. A p value of less than 0.05 will be considered statistically significant heterogeneity and will be categorised as low, moderate and high when the I2 values are below 25%, between 25% and 75%, and above 75%, respectively.26 Sources of heterogeneity will be further explored using metaregression and stratified analyses according to the modelling method and study characteristics (sample size, duration of follow-up, number of predictors included within the prediction models and geographical location).

Assessment of publication bias

Publication bias, a tendency from authors to publish favourable or significant results, threatens the validity of a review’s findings. Synthesised results through meta-analysis from the systematic review can produce overstated conclusions due to publication bias.27 Publication bias will be examined using a funnel plot and Egger’s test. Funnel plots provide visual aid for detecting publication bias in systematic reviews. A symmetrical inverted funnel plot will indicate absence of publication bias, whereas an asymmetrical funnel plot will indicate presence of publication bias.28 We will apply the funnel plot to visually describe the publication bias. Egger’s test will be performed to formally test/confirm the existence of publication bias.28 If publication bias is identified, we will apply Duval and Tweedie’s trim and fill method as a form of sensitivity analysis to see how bias in the pooled estimates can potentially be reduced.29

Patient and public involvement

Patients and the public will not be involved in the development, design, conduct or reporting of the study.

Ethics and dissemination

There is no involvement of human subjects in this study as there will be no primary data collection. So ethical approval is not required for this systematic review. A manuscript will be prepared for peer-reviewed journals after completion of the full analysis. Also, findings will be presented at relevant conferences and seminars.

Discussion

This systematic review will provide information on available hypertension risk prediction models, their performances and their study quality. Additionally, it will provide information on risk factors/predictors that were considered during the model development and their corresponding parameter values (regression coefficients). Together, this information will deliver a comprehensive summary of the predictive ability of these models based on the reported data. Previous hypertension risk prediction models consider various demographic and clinical characteristics, and blood biomarkers, including SBP, paternal history of hypertension, BMI, age and sex.30 31 The Framingham risk score is one of the most widely used for hypertension risk prediction modelling and risk assessment.32 However, it is imperative to regularly review and synthesise risk prediction models for hypertension as research in risk factors of hypertension improve and data for further clinical and demographic biomarkers become available. Using methodologies to evaluate prediction model performance and accuracy, we found that the metamodelling technique allows for improvement on existing hypertension prediction models. This compilation of both existing and new information allows for an effective technique to improve precision public health in the corresponding populations.33 34

This systematic review has its strengths in an effective and systematic search strategy. With a broad inclusion criterion, we hope to obtain all relevant literature on hypertension risk models. Further, the unique pooled analysis of regression coefficient provides an opportunity to synthesise large amounts of information’s on risk factors for hypertension. This review also has some limitations. Although the English language is generally perceived to be the universal language of science, selection of research findings in a particular language (eg, English and French) can introduce language bias and may lead to erroneous conclusions. With this in mind, readers should be cautious when interpreting the findings of our results. Pooling regression coefficients through meta-analysis is also challenging, considering inevitable heterogeneity caused by study differences in modelling methods, adjustment factors and methods of measurements. We urge readers to be cautious while interpreting the findings of the summary results.

In conclusion, this systematic review has the potential to synthesise the overall evidence on the performance and effectiveness of hypertension prediction models and summarise the current development status. This will provide updated tools for healthcare providers and clinical practice, along with advancing the field of hypertension research.

References

Footnotes

  • Contributors All authors contributed to this work. MZIC and TCT contributed to the conception and design of the review. MZIC and IN read and screened abstracts and titles of potentially relevant studies. MZIC and IN read the retained papers and were responsible for extracting data and rating their quality independently. MZIC drafted the paper and IN, HQ, AAL, KCS, MOB and TCT critically reviewed it and suggested amendments prior to submission. All authors approved the final version of the manuscript and take responsibility for the integrity of the reported findings.

  • Funding The authors have not declared a specific grant for this research from any funding agency in the public, commercial or not-for-profit sectors.

  • Competing interests None declared.

  • Patient consent for publication Not required.

  • Provenance and peer review Not commissioned; externally peer reviewed.