Article Text

Original research
Parametric and non-parametric estimation of reference intervals for routine laboratory tests: an analysis of health check-up data for 260 889 young men in the South Korean military
  1. Taeyun Kim1,
  2. Hyunji Choi2,3,
  3. Sun Min Lee3,4
  1. 1Internal Medicine, The Armed Forces Goyang Hospital, Goyang, Republic of Korea
  2. 2Laboratory Medicine, Pusan National University Yangsan Hospital, Yangsan, Republic of Korea
  3. 3Research Institute for Convergence of Biomedical Science and Technology, Pusan National University Yangsan Hospital, Yangsan, Gyeongnam, Republic of Korea
  4. 4Laboratory Medicine, Pusan National University School of Medicine, Busan, Republic of Korea
  1. Correspondence to Dr Sun Min Lee; yaong97{at}paran.com

Abstract

Objectives Determination of reference intervals (RIs) using big data faces several obstacles due to heterogeneity in analysers, period and ethnicity. The present study aimed to establish the RIs for routine common blood count (CBC) and biochemistry laboratory tests in homogeneous, healthy, male Korean soldiers in their 20s using a large health check-up data set, comparing parametric and non-parametric estimation.

Design A multicentre, cross-sectional study.

Setting Seven armed forces hospitals in South Korea.

Participants A total of 609 649 men underwent health examination when promoted to corporal between January 2015 and September 2021. 260 889 eligible individuals aged 20–25 were included in the analysis.

Main outcomes and measures The RIs were established by parametric and non-parametric methods. In the parametric approach, maximum likelihood estimation was applied to measure the Box-Cox transformation parameter and the values at the 2.5th and 97.5th percentiles were recalculated. The non-parametric approach adopted the Tukey’s exclusion test and the values at the 2.5th and 97.5th percentiles were obtained. Classification by body mass index was also performed.

Results The obtained RIs for haematology parameters were comparable between devices. If the values followed a Gaussian distribution, parametric and non-parametric methods were well matched for haematology and biochemical markers. When the values were right-skewed, the upper limits were higher with parametric than with non-parametric methods. Participants with obesity showed higher RIs for CBC, some liver function tests and some lipid profiles than participants without obesity.

Conclusions Using data from healthy, male Korean soldiers in their 20s, we proposed the RIs for CBC and biochemical parameters, comparing parametric and non-parametric estimation. As such approaches based on large data sets become more prevalent, further studies are needed to discriminate eligible individuals and determine RIs in an extrapolated sample.

  • Clinical chemistry
  • Biochemistry
  • INTERNAL MEDICINE

Data availability statement

Data are available upon reasonable request. The data that support the findings of this study are available from the corresponding author or upon reasonable request.

http://creativecommons.org/licenses/by-nc/4.0/

This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

Strengths and limitations of this study

  • This study was based on 260 889 healthy, male Korean soldiers in their 20s, representing one of the largest samples in the field of reference intervals.

  • The reference intervals for haematology and biochemistry markers were estimated and determined using both parametric and non-parametric estimation.

  • The reference intervals were compared between obese and non-obese individuals.

  • This study was limited to analysis of reference intervals only in young, healthy military soldiers.

Introduction

Establishing the reference intervals (RIs) and the reference values for laboratory markers is important in supporting decision-making in clinical practice.1 RIs have been widely calculated by estimating the 0.025 and 0.975 percentiles, 95% CI, from a healthy population.2 When using non-parametric values, according to the Clinical Laboratory Standards Institute guidelines, it is considered that a minimum of 120 healthy individuals is required to calculate the 90% CI of the upper and lower reference limits. However, since a large volume of data is easily accessible nowadays, analytical approaches based on high-volume data have been conducted in clinical laboratory medicine.3 In this regard, extension from a few examinees to a real-world population to overcoming the limitation in numbers might be worthy, although appropriate harmonisation of medical records might be challenging.

Direct sampling is a traditional approach that refers to generating RIs from a preselected population with measurement and determination in order. On the contrary, indirect sampling selects results from routine pathology results in a mixed population, which may contain healthy and unhealthy individuals together. Appropriate statistical techniques, such as the truncated maximum likelihood method, are then applied to determine RIs.4 Data mining, a process of extracting and uncovering new information from a large volume of data, is an indirect method of establishing RIs. However, data from diseased individuals may overlap and this heterogeneity is one of the major hurdles of an indirect method.4

Several real-world approaches using ‘big data’ have been performed to build the RIs for blood sampling results. For example, a single-centre study in Korea evaluated the RI for common blood count (CBC) in 804 623 individuals aged 3–99 using a single haematological device.5 However, whether unhealthy subjects have been excluded is questionable and the study only estimated the CBC values. Another single ‘big data’ approach in the Netherlands recruited more than seven million test results and rigorously analysed the RIs for 18 commonly used clinical parameters.6 However, they presumed a healthy population not based on clinical information but the distribution of test results. The heterogeneity originating from the intralaboratory differences and the mixed distribution of healthy and non-healthy populations made calculating RIs from ‘big data’ challenging.7

In this regard, establishing RIs using a large volume of data in a homogeneous healthy adult group might be worthy. Therefore, the present study aims to analyse the health check-up data of about 260 000 healthy Korean soldiers in their 20s, suggest the RIs for CBC and several chemical parameters, partition the samples by body mass index (BMI), and compare the RIs between parametric and non-parametric methods and between analytical instruments.

Methods

Study participants

As a result of conscription in South Korea, all male citizens perform compulsory military service and undergo physical examination at the Military Manpower Administration before enlistment in the army. Cases of severe physical or mental illnesses are exempted from mandatory military service. Therefore, men without significant illnesses are likely to be serving in the Korean military. After joining the army, about 1 year later, all Korean soldiers undergo additional physical examination when they are promoted to the rank of a corporal from a private first class.

Electronic medical records between January 2015 and September 2021 from the New Defense Medical Information System of the Korean armed forces database were extracted. The medical data of 609 349 men who underwent medical examination before their promotion to the corporal across 18 armed forces hospitals were acquired. Of the hospitals, three (n=41 113) were excluded due to lack of information on haematological and biochemical test devices. Due to heterogeneity in analytical instruments, we then excluded eight hospitals (n=293 799) and selected seven hospitals that use the biochemical devices of the same company and the haematological equipment from two different companies. We selected soldiers aged 20–25 with a low probability of having a disease and excluded 13 548 aged <20 or >25. Finally, 260 889 healthy Korean soldiers in their 20s were included in the analysis.

Informed consent was waived due to the study’s retrospective nature and because anonymous clinical data were used for analysis.

Laboratory tests and analytical instruments

The results for the following haematology test parameters were collected: haemoglobin (Hb, g/L), haematocrit (Hct, %), red blood cell count (RBC, 1012/L), white cell count (WCC, 109/L) and platelet (PLT, 109/L). The XN series (Sysmex, Kobe, Japan) were used in five hospitals. Four of them used an XN-1000 analyser and one hospital used an XN-3000. ADVIA 2120i (Siemens Healthcare Diagnostics, Eschborn, Germany) was used in two hospitals.

The results for the following chemistry parameters were collected: creatinine (mg/dL), aspartate aminotransferase (AST, IU/L), alanine aminotransferase (ALT, IU/L), gamma glutamyl transferase (GGT, IU/L), total cholesterol (mg/dL), triglyceride (mg/dL), high-density lipoprotein (HDL) cholesterol (mg/dL), low-density lipoprotein (LDL) cholesterol (mg/dL) and fasting glucose. The Beckman Coulter AU series (Beckman Coulter, Miami, USA) were used to measure these biochemical parameters in seven hospitals. Of the hospitals, four, two and one used AU-480, AU-680 and AU-5800, respectively.

Routine external quality control management procedures for laboratory tests were performed under the Korean Association of External Quality Assessment Service. All hospitals performed internal quality controls and the analytical devices are calibrated in line with the instructions provided by the manufacturers.

Calculation of RIs

Both parametric and non-parametric methods were used to calculate the RIs for haematological and chemical parameters. Considering there is no consensus on which method is appropriate to draw RIs from large-volume data, we adopted both parametric and non-parametric methods. The non-parametric method was applied using steps similar to the process used in the Canadian Laboratory Initiative on Pediatric Reference Intervals (CALIPER) study8: (1) distributions of the individual parameters were graphically, preliminary inspected using a quantile-quantile graph and a histogram; (2) the Tukey’s exclusion test, where values below Q1–1.5×IQR or above Q3+1.5×IQR are excluded, was used to identify outliers; and (3) the values at the 2.5th and 97.5th percentiles were then defined as the lower and upper limits. The parametric method was applied using the sequences adopted in previous studies by Arzideh et al9 and Zierk et al10: (1) a Box-Cox transformation based on kernel density estimation, which may maximise the likelihood function, was used to transform the data into a symmetric distribution; (2) the distribution’s 2.5th and 97.5th percentiles were obtained; and (3) the log-transformed value was then recalculated into the real value. In addition, classification according to the presence of obesity, defined by the Korean Society for the Study of Obesity guideline, was performed11: non-obese (<25.0 kg/m2) and obese (25.0–30.0 kg/m2) groups. The Mann-Whitney U test was used to compare differences between obese and non-obese individuals. All statistical analyses were performed using R V.4.1.2 software for Windows (R Development Core Team) and Analyse-it (Analyse-it Software, Leeds, UK).

Patient and public involvement

There is no patient and public involvement.

Results

CBC parameters (eg, Hb, Hct, RBC, WCC and PLT) and biochemical markers (eg, creatinine, AST, ALT, GGT, lipid profiles and fasting glucose) are depicted in figures 1 and 2. WCC and PLT count and most liver function parameters (eg, AST, ALT and GGT) revealed a non-Gaussian distribution. In terms of lipid profile, triglyceride distribution violated the normality and presented right skewness. These distribution patterns are ascertained in online supplemental tables S1 and S2. The mean value was higher than the median value for WCC, PLT, AST, ALT, GGT and lipid profiles, reflecting a right-skewed distribution.

Figure 1

Distribution of haematological markers.

Figure 2

Distribution of creatinine, liver function tests and lipid profiles. ALT, alanine aminotransferase; AST, aspartate aminotransferase; GGT, gamma glutamyl transferase; HDL, high-density lipoprotein; LDL, low-density lipoprotein.

The RIs for haematology tests determined and established by both parametric and non-parametric methods in healthy Korean soldiers aged 20–25 are tabulated in table 1. Similar RIs between the XN series and ADVIA 2120i were observed in all study subjects. In terms of WCC and PLT, which were distributed with a right tail, the values at the 97.5th percentile were higher in the parametric than in the non-parametric method. In addition, in all examined soldiers, 26.4% were obese and partitioning of RIs according to BMI revealed the values of Hb, Hct, RBC, WCC and PLT to be higher in obese than in non-obese individuals (p<0.001). This level of difference was irrespective of analytical devices and methods.

Table 1

Comparison of RIs for haematological tests in healthy Korean soldiers aged 20–25 years old determined by non-parametric and parametric methods

The established RIs for common chemistry examinations in healthy Korean soldiers in their 20s according to the parametric and non-parametric methods are summarised in table 2. The parametric method presented higher RIs than the non-parametric approach for AST, ALT, GGT and triglyceride levels. Some lipid profiles (eg, total cholesterol, triglyceride and LDL cholesterol) were higher in obese than in non-obese soldiers (p<0.001), while HDL cholesterol level was lower in obese soldiers (p<0.001).

Table 2

Comparison of RIs for biochemistry tests in healthy Korean soldiers aged 20–25 years old determined by non-parametric and parametric methods

Discussion

The present study inspected a large volume of data from haematological and biochemical laboratory tests in 260 889 healthy Korean soldiers aged 20–25, with the individuals classified by obesity status. Both parametric and non-parametric methods were used to calculate the RIs. Within the study period, no significant differences by year were observed. The analytical devices and the population were stable over the study period. However, the distribution of data among the seven hospitals was not equal (online supplemental figure 1). In CBC tests, the estimated RIs were comparable between parametric and non-parametric methods. The estimated RIs showed concordance between analytical instruments, and obesity was associated with the difference in the RIs for haematological tests. The upper limit values of some chemistry markers were higher in the parametric than in the non-parametric method. This might have originated from the right-skewed distribution of raw data, where the Tukey’s test excludes a bunch of data in the right tail. In addition, we showed that the RIs for biochemical parameters differed between obese and non-obese individuals.

This study is the largest to suggest RIs, with the strength of evaluating a group of non-diseased adults of a single ethnicity in their 20s. Of all enlisted men within the study period, about 2% were multicultural.12 Considering more than 3000 individuals are required to derive the RIs for each decade of life in their 20s, 30s and 40s,13 there is a paucity of data establishing the RIs for the age group in their 20s from a large population. The current results from a large number of samples could be advantageous, considering that a small sample size may produce selection bias14 and that variation in estimating the RIs for a population is largely affected by the sample size. The established RIs align with the previous results from big data approaches.

An analysis using the data set of the Canadian Health Measures Survey used the non-parametric rank method to establish the RIs for common haematological parameters and showed comparable values for Hb, Hct, RBC, WCC and PLT with the current study.15 In terms of partitioning, we observed that the RIs for haematological values were higher in obese than in non-obese individuals. Several studies support this observation, linking obesity, proinflammatory cytokines and increased erythropoiesis. Waist circumference was positively correlated with Hb, Hct, RBC, WCC and PLT.16 A nationwide population-based cohort study revealed that high Hb level was associated with increased OR for metabolic syndrome.17 We also obtained positive correlation coefficients between BMI and all examined haematological indices with p<0.001 (data not shown). Given that a prediction model proposed an increasing trend in obesity in Korean adults, estimating 62% of men would have obesity by 2030,18 a careful interpretation of laboratory tests by BMI might be necessary.

The established RIs for AST, ALT, GGT, total cholesterol, triglyceride, LDL cholesterol and fasting glucose levels were higher in obese than in non-obese, while HDL cholesterol was lower in obese participants. Elevated liver enzyme levels are associated with obesity in the adult population.19 However, due to interlaboratory variability in the upper limit of normal liver enzyme levels, an RI based on strictly defined reference population by different analysers should be established.20 This study agrees with the previous report regarding lipid profiles, where total cholesterol, LDL cholesterol and triglyceride levels were higher in obese than in non-obese individuals.21 Therefore, it might be plausible to determine the RIs for liver function tests and lipid profiles based on an individual’s clinical condition.

A big data-driven approach to determining RIs has been introduced to clinical laboratory medicine for several years. The application of big medical data has also facilitated advances in bioinformatics, precision medicine, public health informatics and artificial intelligence in imaging.22 Using stored laboratory results has an advantage in terms of data availability. This merit makes collecting data from specific subpopulations more accessible. For example, Tyler et al23 showed that the RIs in critically ill patients were significantly different from those in healthy outpatients, indicating cautious decision-making in the intensive care unit based on the RIs from a healthy control. Another merit lies in the representativeness of the RIs from big data. A truly large number of study participants may permit generation of population-representative reference limits. A Canadian nationally representative health survey enrolled 11 999 samples of citizens aged 3–79, and robust RIs spanning from children to the elderly were extracted.15

However, several hurdles exist when estimating RIs using big data. This approach does not guarantee high-quality RIs due to mixed, incomplete clinical data. Several statistical methods have been introduced to improve the reliability of RIs. For example, Ammer et al24 developed the refineR algorithm, which can be freely accessible by downloading open-source R package. Other approaches, such as kosmic algorithm25 or the truncated minimum χ2 method,26 might improve the generalisability of indirect methods in calculating RIs from real-world data. In addition, data used in the indirect approach may involve noise.27 It might be very challenging to separate healthy from non-healthy individuals. The intersection point between those individuals could be determined using Box-Cox transformation following kernel density estimation,10 which was adopted in the current study. Although the range of age covered in this study was only 6 years, the study participants were likely healthy individuals. We used health check-up data from the outpatient setting, which is recommended because this sample selection could minimise the enrolment of patients in acute pathophysiological conditions.14

In terms of methods for RI determination, both direct and indirect approaches have been used. The direct method calculates RIs from predefined healthy individuals, whether selecting them before collecting blood samples, a priori, or after, a posteriori. True randomisation would be important in this method, but requires extensive cost, time and effort to represent the target population. Generally, the direct method recruits small numbers of the population, which could cause selection bias.4 The indirect method selects a large number of individuals from a mixed population, which may contain not only non-diseased but also diseased individuals, and measures RIs based on robust statistical methods.4 Because the method could not select healthy individuals using predetermined specific criteria, individuals who are diseased or in a subclinical state could influence the RIs. In this regard, individuals in the current study are large in number, are unlikely to have a disease and are of a single race. This may be advantageous in combining the merits of direct and indirect methods.

This study has several limitations. First, because only healthy military soldiers in their 20s were included in the analysis, data classification according to parameters such as age, sex or ethnicity was not feasible. Second, although individuals in the current study were considered healthy, noise due to unrecognised or subclinical health status could exist. For example, a soldier was healthy at enlistment, but a disease might newly occur before the physical examination. Third, an examination of CBC differential profiles was not performed. Despite these limitations, with sufficiently large sample size, the study showed the RIs for CBC and common clinical chemistry parameters in healthy Korean men in their 20s. Considering sample size planning has been crude in many studies28 and several obstacles in recruiting appropriate participants may preclude a sufficiently large sample size, this study has merit in providing RIs within homogeneous healthy young adults using an adequately large sample and in showing differences in RIs for haematological markers and common chemical tests between obese and non-obese individuals.

Conclusions

By analysing a large sample size of healthy soldiers, we suggest the RIs for CBC and common clinical chemistry parameters in homogeneous Korean men in their 20s using parametric but non-parametric methods. We observed differences in the RIs for haematology markers and common biochemical tests between obese and non-obese individuals, and clinicians should interpret the test results cautiously according to their anthropometric values.

Data availability statement

Data are available upon reasonable request. The data that support the findings of this study are available from the corresponding author or upon reasonable request.

Ethics statements

Patient consent for publication

Ethics approval

The study protocol was approved by the Institutional Review Board of the Armed Forces Medical Command (Seongnam, South Korea; AFMC-202109-HR-064-01) and was conducted in accordance with the principles of the Declaration of Helsinki. All procedures were performed in accordance with the relevant guidelines and regulations. Informed consent was waived due to the study's retrospective nature and because anonymous clinical data were used for analysis.

References

Supplementary materials

  • Supplementary Data

    This web only file has been produced by the BMJ Publishing Group from an electronic file supplied by the author(s) and has not been edited for content.

Footnotes

  • Contributors TK: conceptualisation, methodology, software, formal analysis, investigation, data curation, writing - original draft preparation, visualisation. HC: validation. SML: project administration and writing- review and editing. All authors substantially revised, commented on and approved the final manuscript. TK acts as guarantor.

  • Funding The study was supported for the publication by a 2021 research grant from Pusan National University Yangsan Hospital.

  • Competing interests None declared.

  • Patient and public involvement Patients and/or the public were not involved in the design, or conduct, or reporting, or dissemination plans of this research.

  • Provenance and peer review Not commissioned; externally peer reviewed.

  • Supplemental material This content has been supplied by the author(s). It has not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been peer-reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and responsibility arising from any reliance placed on the content. Where the content includes any translated material, BMJ does not warrant the accuracy and reliability of the translations (including but not limited to local regulations, clinical guidelines, terminology, drug names and drug dosages), and is not responsible for any error and/or omissions arising from translation and adaptation or otherwise.