Article Text

Original research
Validating risk models versus age alone for atrial fibrillation in a young Dutch population cohort: should atrial fibrillation risk prediction be expanded to younger community members?
  1. Jelle C L Himmelreich1,
  2. Ralf E Harskamp1,
  3. Bastiaan Geelhoed2,
  4. Saverio Virdone3,
  5. Wim A M Lucassen1,
  6. Ron T Gansevoort4,
  7. Michiel Rienstra2
  1. 1Department of General Practice, Amsterdam Public Health, Amsterdam UMC Locatie AMC, Amsterdam, The Netherlands
  2. 2Department of Cardiology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
  3. 3Department of Statistics, Thrombosis Research Institute, London, UK
  4. 4Department of Nephrology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
  1. Correspondence to Jelle C L Himmelreich; j.c.himmelreich{at}amsterdamumc.nl

Abstract

Background Advancing age is the primary selection criterion for community screening for atrial fibrillation (AF), with selection often restricted to those aged ≥65 years. If multivariable models were shown to have considerable additional value over age alone in predicting AF risk among younger individuals, AF screening could be expanded to patients with lower age, but with high AF risk as per a validated risk model.

Methods We validated risk models CHARGE-AF (Cohorts for Heart and Aging Research in Genomic Epidemiology model for AF) and FHS-AF (Framingham Heart Study model for AF), and risk scores CHA2DS2-VASc and CHA2DS2-VA, and presented their predictive abilities for 5-year and 10-year AF risk versus that of age alone in a young Dutch population cohort (PREVEND) free from AF at baseline. We assessed discrimination by the C-statistic and calibration by the calibration plot and stratified Kaplan-Meier plot using survey-weighted Cox models.

Results During 5-year and 10-year follow-up there were n=98 (2.46/1000 person-years) and n=249 (3.29/1000 person-years) new AF cases, respectively, among 8265 participants with mean age 49±13 years. CHARGE-AF and FHS-AF both showed good discrimination for 5-year and 10-year AF (C-statistic range 0.83–0.86) with accurate calibration for 5-year AF, but overestimation of 10-year AF risk in highest-risk individuals. CHA2DS2-VASc and CHA2DS2-VA relatively underperformed. Age alone showed similar discrimination to that of CHARGE-AF and FHS-AF both in the overall, young PREVEND cohort and in subgroups for lower age and lower stroke risk.

Conclusion Multivariable models accurately discriminate for 5-year and 10-year AF risk among young European community-dwelling individuals. However, their additional discriminatory value over age alone was limited. Selection strategies for primary AF screening using multivariable models should not be expanded to younger individuals.

  • thromboembolism
  • epidemiology
  • pacing & electrophysiology

Data availability statement

Data are available upon reasonable request. Requests for and information on usage of the de-identified PREVEND data set can be directed to m.rienstra@umcg.nl.

http://creativecommons.org/licenses/by-nc/4.0/

This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

Strengths and limitations of this study

  • Validation for incident atrial fibrillation of risk models and comparison with age alone, performed specifically in a young cohort.

  • Applied appropriate weighting to account for oversampling of patients with microalbuminuria in the PREVEND data set.

  • Absence of established methods for use in the presence of survey weights prevented adjustment for the competing risk of death or formal testing for difference in C-statistic.

  • Ethnic diversity was minimal in the PREVEND data set; findings mainly extend to a European-white population.

Background

Atrial fibrillation (AF) is the most commonly encountered cardiac arrhythmia in the community with a lifetime risk of over 30%.1 Timely diagnosis of AF is crucial in order to decrease the risk of complications such as heart failure and most notably ischaemic stroke.2 In this regard, the importance of efficiently identifying patients at high risk of AF, eligible for screening, is increasingly recognised.3 4

Previous community AF screening strategies often selected only for advancing age.5–8 However, multivariable risk models have been shown to more accurately discriminate for elevated AF risk than selecting for high age alone in older community populations.9 In a recent systematic review and meta-analysis of prediction models validated for incident AF in community settings, three models showed significant summary discrimination: CHARGE-AF (Cohorts for Heart and Aging Research in Genomic Epidemiology model for AF),10 FHS-AF (Framingham Heart Study model for AF)11 and CHA2DS2-VASc (originally developed to assess stroke risk in individuals with AF, but with demonstrated merits in discriminating for incident AF risk as well).4 12 The systematic review concluded that CHARGE-AF, which was developed to predict patients’ absolute 5-year AF risk based on 11 easily obtainable variables, seemed most promising as a risk stratification tool for community AF screening.4

An important finding from the systematic review4 was that validation cohorts of AF prediction models largely constituted older populations, while there was evidence for higher model discrimination among subgroups of younger patients.13–15 Although younger AF patients would be less likely to reach the threshold for an anticoagulation indication, it can be argued that early AF detection is still important in these patient to increase long-term health and quality of life through, for example, lifestyle interventions and rate/rhythm control.2 And while the number needed to screen for AF using single time-point measurements rise steeply with lower age,16 it remains unanswered whether multivariable models could have increased this efficiency among younger individuals over a primarily age-based cut-off. Further research into the relative value of multivariable models over age alone in predicting AF specifically among younger community-dwelling individuals is thus warranted. Such work could help researchers decide whether to expand their scope to younger patients, or to adhere to the current convention of selecting primarily for advanced age—whether or not followed by triage based on a risk factor17 or risk model.18

In the current study, we aim to assess whether the current convention of selecting primarily for advanced age in AF screening selection could be challenged by using multivariable prediction models or risk scores to discriminate for high AF risk among younger patients as well. We will do so by comparing predictive performance for 5-year and 10-year AF risk of two multivariable models (CHARGE-AF and FHS-AF) and two risk scores (CHA2DS2-VASc and CHA2DS2-VA) with that of age alone, in a young community-based cohort from The Netherlands.

Methods

Study population

The Prevention of Renal and Vascular End-Stage Disease (PREVEND) study was designed to evaluate the natural course of microalbuminuria and its relation to renal and cardiovascular disease in the general population. Details of this community-based cohort study have been outlined previously.19–21 In summary, participants in the PREVEND study involved inhabitants of the city of Groningen, The Netherlands, between 28 and 75 years of age who responded to the request to participate in a one-time morning urine albumin screening test. All respondents with urine albumin ≥10 mg/L (n=7768) and a random sample of those with urine albumin <10 mg/L (n=3394) were invited for baseline screening, excluding those who were pregnant or who used insulin. Baseline screening involved assessment of demographic factors, anthropometric measurements, cardiovascular and metabolic risk factors, health behaviour assessment, blood sample collection and two 24-hours urine samples. All persons who completed baseline screening were invited for follow-up visits at the outpatient clinic at 3-year intervals. A standard 12-lead ECG was performed during each visit.

Diagnosing AF

Incident AF was diagnosed when either AF or atrial flutter was present on the ECG obtained at one of the PREVEND follow-up visits or at an outpatient visit or hospital admission to one of the hospitals in the city of Groningen (University Medical Center Groningen or Martini Hospital). All ECGs were electronically screened for the following criteria: PR interval absence, AF, atrial flutter or atrial ectopy. This method was validated with complete manual screening by two independent observers of all ECGs from the PREVEND baseline visit, and showed 100% sensitivity for AF. All ECGs with suspected AF as determined by electronic screening were manually reviewed by two independent observers. Disagreements or inconsistencies were resolved by two independent cardiologists.20

Clinical follow-up data

Follow-up in PREVEND ranged up to 14 years. We calculated follow-up duration as the time between the baseline screening visit and the last contact date in case of loss to follow-up, death, diagnosis of AF or the end of the 5-year or 10-year window of our respective analyses, whichever came first. Information on death was obtained through a national government vital statistics databases.20

Models and risk scores

CHARGE-AF is a multivariable Cox model that uses the variables age, race/ethnicity, height, weight systolic blood pressure (SBP), diastolic blood pressure, current smoker status, antihypertensive medication use, diabetes, heart failure and myocardial infarction to derive a person’s absolute 5-year risk of AF.10 The coefficients and calculations used to derive CHARGE-AF as well as the other original models used in our analyses are shown in the online supplemental methods.

FHS-AF is a multivariable Cox model that was originally derived and validated to predict 10-year AF risk, and was later recalibrated and validated to also predict 5-year AF risk.11 22 Both models require the variables age, sex, body mass index, SBP, hypertension treatment, PR interval and heart failure. The original, 10-year model additionally requires significant cardiac murmur.

CHA2DS2-VASc (Congestive heart failure, Hypertension, Age>75, Stroke/transient ischaemic attack/thromboembolism, Vascular disease, Age 65–74, Sex category) was originally developed to assess stroke risk in individuals with AF, and is commonly used to assess the indication for anticoagulation as stroke prophylaxis in patients with AF.2 12 The risk score has also been demonstrated to have merits in discriminating for incident AF risk in community cohort.4

CHA2DS2-VA is a modified CHA2DS2-VASc omitting sex category as variable. It is recommended by the Cardiac Society of Australia and New Zealand to assess stroke risk,23 and is analogous to international recommendations to differentiate between men and women in choosing a CHA2DS2-VASc cut-off for stroke risk assessment.2 24 The model was recently shown to have merits in selecting patients for single time-point AF screening as well as prolonged monitoring for AF, and—like CHA2DS2-VASc—has the advantage of not requiring data such as height, weight or blood pressure which are often missing in routine primary care data.15 18 25 CHA2DS2-VA has not yet been validated for incident AF over an extended time period in community cohorts, warranting further assessment.

Extending CHARGE-AF to 10-year follow-up

While CHARGE-AF’s discrimination for 10-year AF risk has been validated, the model has never been calibrated to predict absolute AF risk over a 10-year window.9 13 26–28 We recalibrated CHARGE-AF’s coefficients using the PREVEND data set censored at 10-year follow-up in order to enable an assessment of its calibration for 10-year AF, to investigate its use in younger community cohorts and to facilitate a direct comparison of calibration with the FHS-AF model for 10-year AF risk prediction.11 We used the newly recalibrated 10-year CHARGE-AF model in our analyses on 10-year AF risk.

Multivariable predictors versus age alone

We presented predictive ability for 5-year and 10-year AF risk among risk models and risk scores, followed by a discussion of relative performance between multivariable predictors and age alone. For CHARGE-AF and FHS-AF we were able to assess both discrimination and calibration for 5-year and 10-year AF using their original and recalibrated models. In validating CHA2DS2-VASc, CHA2DS2-VA and age as predictors for AF risk we were only able to assess discrimination since these models have never been formally calibrated to predict absolute AF risk in the community.

Additional analyses

We performed stratified analyses by age <65 versus ≥65 years of age as well as by CHA2DS2-VASc <2 versus ≥2 in order to replicate previous studies that showed differences in performance among these clinically relevant subgroups,9 13 14 and in order to further assess the merits of multivariable risk models over age alone in younger and lower-risk participants.

In our analysis on the original CHARGE-AF simple model for 5-year AF risk we also assessed model performance when routinely scoring all participants’ race as ‘Caucasian/white’. We added this analysis because information on race is not routinely scored in Dutch health records, and to validate previous research that used this approach to deal with systematically missing ethnicity data.15

Statistical analysis

We presented descriptive statistics as means±SD, median (IQR) or number (percentage). We compared baseline parameters between those who did and did not develop AF during follow-up using the unpaired t-test for continuous variables and Pearson’s χ2 test for categorical variables. We assessed statistical significance at the 0.05 level in all analyses.

In our validation and recalibration analyses, we used a survey-weighted Cox proportional hazards model to account for the oversampling of individuals with microalbuminuria at baseline, and to enable generalisation of results to the general population. Here, we applied a weighing factor of 11.92 to people with urinary albumin excretion <10 mg/L and a weighing factor of 1.66 to people with urinary albumin excretion >10 mg/L. The weighting factors were derived based on the unequal inclusion probabilities.29

We assessed discrimination in all analyses using the C-statistic and 95% CI. We assessed calibration in analyses on CHARGE-AF and FHS-AF by the calibration plot as well as the calibration slope and 95% CI where a 95% CI not containing one indicated statistically insufficient calibration.30 We additionally assessed calibration of CHARGE-AF and FHS-AF by the Kaplan-Meier (KM) plot grouped according to baseline risk in categories <2.5%, 2.5%–5% and >5% for 5-year risk and categories <5%, 5%–10% and >10% for 10-year risk.

We assessed missing data and found that this was <5% for all variables in the CHARGE-AF model, except for use of antihypertensive medication (n=1460, 17.7% missing data). We assumed no antihypertensive medication use in patients who did not have a documented history of hypertension, which left 363 patients with hypertension (4.4%) with missing data on antihypertensive medication use. We assumed absence of significant murmur when data for this variable was missing. The percentage of missing data among ECG was <2.5%. There was no missing data for the outcomes AF or death. To account for missing data in our validation analyses we used single imputation under the missing at random assumption.

We performed statistical analyses using SPSS V.26,31 and R using the haven, dplyr, rms, survey and survival packages.32

Patient and public involvement

No patients were involved in the development of the research question and outcome measures.

Results

Study population

A total of 8592 persons completed baseline screening, from which we excluded participants without an ECG at enrolment (n=248), as well as participants with prevalent AF at the baseline screening (n=79). Among the study population of 8265 participants, mean age at baseline was 49±13 years (range 29–74), the proportion of women was 50.2% and 95% were Caucasian/white. Patients who developed AF at 5-year and 10-year follow-up were significantly older, were more likely to be men and had a higher burden of cardiovascular risk factors and clinically manifested cardiovascular disease (table 1).

Table 1

Baseline characteristics of study participants in PREVEND cohort stratified by presence of atrial fibrillation at 5 and 10 years of follow-up

Outcomes and follow-up

Mean follow-up among the total cohort was 11.2 years. Figure 1 illustrates the occurrence of AF and death events and cumulative AF incidence over 10 years. There were n=98 (2.46/1000 person-years) and n=249 (3.29/1000 person-years) incident AF cases at 5 and 10 years of follow-up, respectively. There were n=226 (5.68/1000 person-years) and n=490 (6.48/1000 person-years) deaths at 5 and 10 years of follow-up, respectively.

Figure 1

(A) cumulative number of events over 10-year follow-up; (B) cumulative AF incidence during 10-year follow-up. AF, atrial fibrillation.

CHARGE-AF and FHS-AF: validation for 5-year and 10-year AF risk

Validation of CHARGE-AF and FHS-AF for 5-year AF risk both resulted in a C-statistic (95% CI) of 0.86 (0.82 to 0.90). with calibration slopes not significantly deviating from 1 in either analysis (table 2). The calibration plots for both models showed accurate absolute risk prediction throughout deciles of predicted 5-year AF risk (figure 2, Panels A-B), with KM plots for both models showing accurate predictions for all predefined risk groups (figure 2, Panels C-D).

Figure 2

(A and B) Calibration plots for 5-year AF risk of the CHARGE-AF and FHS-AF models, respectively, in the PREVEND cohort; (C and D) Kaplan-Meier plots or 5-year AF risk according to subgroups <2.5% (green line), 2.5%–5% (blue line) and >5% (red line) predicted 5-year risk for 5-year AF risk of the CHARGE-AF and FHS-AF models, respectively, in the PREVEND cohort. Calibration plot interpretation: Points with brackets indicate intersects of mean observed and mean CHARGE-AF predicted AF risk per decile of predicted CHARGE-AF risk, with 95% CI of observed risk per decile. Data exceeding the neutral (dotted) line indicates underestimation of absolute AF risk by the CHARGE-AF simple model in our sample, while data below the neutral line indicates overestimation of absolute AF risk by the model. Spikes on the x-axis represent the distribution of AF-free survivors by baseline predicted risk. AF, atrial fibrillation; CHARGE-AF, Cohorts for Heart and Aging Research in Genomic Epidemiology model for AF; FHS-AF, Framingham Heart Study model for AF.

Table 2

Validation of CHARGE-AF, FHS-AF, CHA2DS-VASc2, CHA2DS-VA and age alone for 5-year and 10-year AF risk

Recalibration of the CHARGE-AF simple model for 10-year AF risk resulted in the coefficients shown in online supplemental table S1). In predicting 10-year AF risk, the C-statistic (95% CI) for the newly recalibrated CHARGE-AF model was 0.85 (0.82 to 0.88), and that of FHS-AF was 0.83 (0.79 to 0.86). Calibration slopes did not significantly deviate from 1 in either analysis (table 2). Calibration plots of both models showed accurate risk prediction in lower-risk subgroups but overestimation of absolute 10-year AF risk in those with highest predicted risk (online supplemental figure S1, Panels A-B). The stratified KM plots showed accurate risk estimation for all predefined risk groups by FHS-AF, while the newly recalibrated CHARGE-AF overestimated absolute risk for the middle (5%–10% 10-year risk) group (online supplemental figure S1, Panels C-D).

CHARGE-AF and FHS-AF: additional analyses

In the subgroups analyses for both 5-year and 10-year AF risk, C-statistics of CHARGE-AF and FHS-AF were higher in younger (<65 years) versus higher age (≥65 years), and higher in lower (CHA2DS2-VASc<2) versus higher (CHA2DS2-VASc≥2) stroke risk (table 2). Calibration slopes did not significantly differ from 1 in the analysed subgroups except for FHS-AF’s 5-year analysis in patients with CHA2DS2-VASc<2 (table 2). Routinely scoring race as Caucasian/white in the 5-year analysis for CHARGE-AF gave similar results as the overall analysis with a C-statistic of 0.86 (95%CI: 0.82 to 0.90) and a calibration slope of 1.09 (95%CI: 0.86 to 1.32).

CHA2DS2VASc and CHA2DS2VA: overall and subgroup validation

The C-statistic (95% CI) of CHA2DS2VASc and CHA2DS2VA in the overall PREVEND cohort were 0.72 (0.63 to 0.81) and 0.80 (0.73 to 0.86), respectively, for 5-year AF, and 0.68 (0.62 to 0.73) and 0.74 (0.70 to 0.79), respectively, for 10-year AF risk (table 2). Like in the overall analysis, CHA2DS2VA showed higher C-statistic (95% CI) for AF risk than CHA2DS2VASc in all subgroup analyses, with CHA2DS2VASc’s C-statistic for AF being non-significant in all subgroup analyses except those CHA2DS2VASc ≥2 for 5-year and 10-year AF risk.

Multivariable predictors versus age alone

Discrimination of age alone for 5-year and 10-year AF in the overall PREVEND cohort was almost on par with that of CHARGE-AF and FHS-AF, and considerably higher than that of CHA2DS2VASc and—to a lesser extent—CHA2DS2VA (table 2).

In the subgroup analyses, age alone as predictor shared CHARGE-AF and FHS-AF’s pattern of higher C-statistic for AF among younger and lower-CHA2DS2VASc subgroups. In most subgroup analyses, including those for participants <65 years of age at baseline (n=7012), C-statistic point estimates of the multivariable models CHARGE-AF and FHS-AF were higher than age alone, but with considerable overlap in confidence intervals. The multivariable models’ advantage versus age in terms of C-statistic was most pronounced in older and higher-CHA2DS2VASc subgroups in the analyses on 10-year AF risk.

When assessing discrimination of age alone versus that of CHA2DS2VASc and CHA2DS2VA in the subgroup analyses, age’s C-statistic point estimate was higher in all subgroup analyses except when comparing to CHA2DS2VA in those with CHA2DS2VASc ≥2 (n=1568). As in the overall analysis, however, age alone considerably outperformed CHA2DS2VASc in the subgroups analyses, while CHA2DS2VA showed intermediate predictive ability.

Discussion

Our study indicates that the discriminatory advantage of multivariable models over age alone assumed from earlier work appears to be limited in a younger, lower-risk cohort. While multivariable models CHARGE-AF and FHS-AF were shown to have good predictive properties for 5-year and 10-year AF risk, their discrimination was only marginally better than age alone as linear predictor in the young PREVEND cohort. The risk scores CHA2DS2VASc, and to a lesser extent CHA2DS2VA, even underperformed relative to age alone as well as to the multivariable models.

Comparison to previous studies

Previous external validation of CHARGE-AF showed that CHARGE-AF has an overall robust performance in older community cohorts.4 Our study elaborated on these findings by validating CHARGE-AF in a young population with relatively low prevalence of cardiovascular risk factors and relatively low AF incidence. We hereby confirmed that discrimination of multivariable models for new AF is high among younger community members, while showing that the discriminatory advantage of multivariable models versus age alone assumed from earlier work appears to be limited in a younger, lower-risk cohort.15

A recent study validating risk models for prevalent AF on 12-lead ECG confirmed the high potential of risk models in efficient patient selection for AF screening.33 As in our work, CHARGE-AF was among the top performing models. In their cohort with mean age 64.8 years (72.9 among AF-positive cases), the authors concluded that diagnostic accuracy of multivariable models was superior to the age cut-off ≥65 years. The authors did not provide an analysis comparing multivariable models to age as continuous linear predictor for patients under 65 years of age, as we did in the current work. Although generalisability to true community screening was limited by the nature of their sample (self-referred, self-funded attendees to a private vascular screening clinic), it was a first large-scale indication that multivariable models could be used for AF screening selection.33

The PREVEND cohort’s extensive follow-up and rich data set including ECG parameters allowed us to recalibrate CHARGE-AF for 10-year AF risk, and to directly compare the model with the 10-year FHS-AF model. Our finding that discrimination for new AF was similar for CHARGE-AF and FHS-AF, while both performed better than CHA2DS2-VASc, concurred with previous work.4 9

We were to our knowledge the first to validate CHA2DS2-VA for 5-year and 10-year AF risk in the community. The higher C-statistics seen using CHA2DS2-VA compared with its counterpart, CHA2DS2-VASc, are not surprising given the omission of female sex category, which is known to be negatively associated with incident AF.2

The association of individual determinants and biomarkers with AF in the PREVEND data set outside the context of risk prediction models can be found in previous work.20 Addition of biomarkers to CHARGE-AF as done by previous authors, whether through CHARGE-AF’s established augmented model or by addition of individual biomarkers,9 10 showed no meaningful discriminatory improvement in our sample (data not shown).

Clinical implications and future directions

Our findings are relevant for patient selection for AF screening, an aim for which risk models are now increasingly looked at.3 The similarity in discrimination between multivariable models and age alone, both in the overall young PREVEND cohort and in the subgroup analyses of patients <65 years of age, suggest limited additional value of multivariable models for AF screening triage among younger community-dwelling adults. The current analysis therefore supports the current convention of applying patient selection algorithms for AF screening primarily to those with advanced age.7 17

For implementation in community-based medicine, CHARGE-AF holds an edge over other existing risk models derived for AF in that it does not require ECG or laboratory data.11 28 34 However, recent work has indicated that even for CHARGE-AF to be used as a remote, low-cost triage test for AF screening selection, there are still considerable limitations from incomplete routine care data.15 Until such limitations are overcome, for example, by valid imputation techniques applicable in routine primary care data, researchers may need to look for more universally applicable risk models to assess risk of AF. However both models in our analysis that do not require laboratory, ECG or physical measurements (CHA2DS2-VA and CHA2DS2-VASc) were still outperformed by age as linear predictor for AF risk in the young PREVEND cohort. Recent work has indeed indicated that age ≥65 years is almost on par with CHA2DS2-VA in sensitivity for AF on single time-point screening.25 Future work on AF risk prediction using routine care data, rather than prospective cohorts with high data completeness, could further work towards the optimal selection strategy for AF screening in settings with a high degree of incomplete data. We advise to incorporate age as single predictor in such analyses as well.

A future direction from this work’s recalibration of CHARGE-AF for 10-year AF risk is to investigate integration of the model with established cardiovascular risk calculators such as SCORE in Europe, QRISK in the UK or ASCVD in North America.35–37 Many of the variables used in these tools (eg, age, blood pressure, smoking status) as well as treatment recommendations (eg, antihypertensive medication use) are variables that can also be used to calculate the CHARGE-AF model. Physicians can use these data to discuss working on modifiable risk factors that may not only positively affect the risk of (cardio)vascular disease, but perhaps also the risk of AF and associated complications, such as heart failure and stroke.2 We note that further external validation of the newly recalibrated CHARGE-AF model for 10-year AF risk is warranted before researching integration with other cardiovascular risk calculators.

Strengths and limitations

Primary strength of this community-based validation study is that it is the first to specifically validate risk models for incident AF in a young cohort. With 85% of its participants being below the age of 65, the PREVEND cohort allowed us to be the first to thoroughly test whether current conventions in AF screening selection could be challenged by the use of multivariable models in younger patients, to which the answer seems to be negative. Further strengths are the cohort’s detailed clinical assessment, the availability of ECG measurements, the extensive follow-up period and robust validation of AF by 3 yearly study visit including ECG and linkage of data from local hospitals. This allowed for validation of multiple models in one data set as well as for recalibration of the original CHARGE-AF model for 10-year data. Where there was missing data, we resolved this by imputation, as recommended over complete case analysis in risk model validation.38 We note that CHARGE-AF validation in complete cases was similar to the imputed data set (data not shown). Finally, the analysis on routinely scoring race as Caucasian/white allowed us to validate this approach to deal with structural missingness of this variable in a predominantly Caucasian/white cohort, as has been done before in routine care data.15

There are a number of limitations to consider. First, the PREVEND cohort’s oversampling of patients with microalbuminuria impairs the cohort’s generalisability to the community at large and to cohorts previously used in risk model validation studies. We ameliorated this limitation by using a weighting scheme based on the known oversampling rate in the PREVEND data set, as has been used previously.29 In the absence of established methods for use in the presence of survey weights, we were unable to adjust for the competing risk of death or perform a formal test for difference in C-statistic. Third, while a significant effort was made to capture AF by performing 3 yearly ECG and by linkage of data from local hospitals, PREVEND did not include continuous ECG recordings and may thus have overlooked asymptomatic episodic/paroxysmal forms of AF. Also, AF cases diagnosed by general practitioners may not have been included when the AF diagnosis had not been made in one of the local hospitals. Fourth, ethnic diversity was minimal in PREVEND, and as such our findings mainly extend to a European-white population. While CHARGE-AF has been assessed in African-American, Hispanic and East-Asian populations,10 the performance of CHARGE-AF in other ethnic groups should be an area of future research. Fifth, due to the demographic make-up of the PREVEND cohort, the sample of participants aged 65 years or older was limited in size. While this resulted in relatively wide C-statistic confidence intervals for this subgroup, we note that our results coincide with aggregated results from previous validation studies performed in older community cohorts.4 Sixth, we were unable to externally validate our recalibrated CHARGE-AF coefficients for predicting 10-year AF risk. We chose not to perform split-data set validation as this has been advised against,39 and as such, external validation has to be awaited before definitive conclusions can be drawn as of the performance of the recalibrated 10-year CHARGE-AF model.

Conclusion

In this validation study of multivariable predictors and age alone for 5-year and 10-year AF, we showed that there is little additional prognostic value of multivariable models over age alone in younger individuals. Discrimination of age alone for new AF was on par with that of two multivariable models (CHARGE-AF and FHS-AF), and exceeded that of two multivariable risk scores (CHA2DS2-VASc and CHA2DS2-VA). Our work suggests that selection strategies for primary AF screening using multivariable models should not be expanded to younger individuals.

Data availability statement

Data are available upon reasonable request. Requests for and information on usage of the de-identified PREVEND data set can be directed to m.rienstra@umcg.nl.

Ethics statements

Patient consent for publication

Ethics approval

The PREVEND study was approved by the medical ethics committee of the University Medical Center Groningen (METc code 01/039) and was conducted in accordance with the Declaration of Helsinki. All participants provided written informed consent.

Acknowledgments

The authors thank the staff and participants of the PREVEND study for their important contributions. We thank Karen S. Pieper, PhD, for valuable input in the statistical analyses.

References

Supplementary materials

  • Supplementary Data

    This web only file has been produced by the BMJ Publishing Group from an electronic file supplied by the author(s) and has not been edited for content.

Footnotes

  • JCLH and REH are joint first authors.

  • Contributors JCLH and REH performed data analysis and data presentation, were primarily responsible for manuscript preparation, and were guarantors of this work. RTG and MR supervised data collection and data preparation at UMCG, and provided valuable input to the manuscript. BG and SV assisted in data analysis, and provided valuable input to the manuscript. WAML provided valuable input to the manuscript.

  • Funding This work was supported by the Netherlands Organisation for Health Research and Development (ZonMw) (80-83910-98-13046) and the European Research Council under the European Union’s Horizon 2020 research and innovation programme (648 131).

  • Competing interests None declared.

  • Provenance and peer review Not commissioned; externally peer reviewed.

  • Supplemental material This content has been supplied by the author(s). It has not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been peer-reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and responsibility arising from any reliance placed on the content. Where the content includes any translated material, BMJ does not warrant the accuracy and reliability of the translations (including but not limited to local regulations, clinical guidelines, terminology, drug names and drug dosages), and is not responsible for any error and/or omissions arising from translation and adaptation or otherwise.