Article Text

Original research
Evaluation of cytokines as a biomarker to distinguish active tuberculosis from latent tuberculosis infection: a diagnostic meta-analysis
  1. Beibei Qiu,
  2. Qiao Liu,
  3. Zhongqi Li,
  4. Huan Song,
  5. Dian Xu,
  6. Ye Ji,
  7. Yan Jiang,
  8. Dan Tian,
  9. Jianming Wang
  1. Department of Epidemiology, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, China
  1. Correspondence to Jianming Wang; jmwang{at}


Objectives With a marginally effective vaccine and no significant breakthroughs in new treatments, a sensitive and specific method to distinguish active tuberculosis from latent tuberculosis infection (LTBI) would allow for early diagnosis and limit the spread of the pathogen. The analysis of multiple cytokine profiles provides the possibility to differentiate the two diseases.

Design Systematic review and meta-analysis.

Data sources PubMed, Cochrane Library, Clinical Key and EMBASE databases were searched on 31 December 2019.

Eligibility criteria We included case–control studies, cohort studies and randomised controlled trials considering IFN-γ, TNF-α, IP-10, IL-2, IL-10, IL-12 and VEGF as biomarkers to distinguish active tuberculosis and LTBI.

Data extraction and synthesis Two students independently extracted data and assessed the risk of bias. Diagnostic OR, sensitivity, specificity, positive and negative likelihood ratios and area under the curve (AUC) together with 95% CI were used to estimate the diagnostic value.

Results Of 1315 records identified, 14 studies were considered eligible. IL-2 had the highest sensitivity (0.84, 95% CI: 0.72 to 0.92), while VEGF had the highest specificity (0.87, 95% CI: 0.73 to 0.94). The highest AUC was observed for VEGF (0.85, 95% CI: 0.81 to 0.88), followed by IFN-γ (0.84, 95% CI: 0.80 to 0.87) and IL-2 (0.84, 95% CI: 0.81 to 0.87).

Conclusion Cytokines, such as IL-2, IFN-γ and VEGF, can be utilised as promising biomarkers to distinguish active tuberculosis from LTBI.

PROSPERO registration number CRD42020170725.

  • tuberculosis
  • infectious diseases & infestations
  • diagnostic microbiology
  • public health
  • immunology

This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See:

Statistics from

Strengths and limitations of this study

  • All stages of the study were conducted by two researchers independently and supervised by a third reviewer.

  • This study was performed with the methods of the Cochrane Handbook for Systematic Reviews of Interventions and provided evidence regarding the diagnostic value of cytokines in the differentiation of active tuberculosis and latent tuberculosis infection.

  • The heterogeneity was relatively high. Study design, reference standard and cytokine determination method were the primary sources of heterogeneity.


Tuberculosis is caused by Mycobacterium tuberculosis that often affects the lungs. Globally, an estimated 10.0 million people fell ill with tuberculosis in 2018, a number that has been relatively stable in recent years.1 Coinfection with tuberculosis and AIDS,2 tuberculosis and diabetes,3 liver damage caused by antituberculosis drugs4 and ambient air pollution5 are all huge obstacles to achieve the ‘End Tuberculosis Goal’. According to the WHO, the number of persons with both incident and prevalent tuberculosis remains the highest in the South-East Asian and African regions.6

It is estimated that approximately 1.7 billion individuals in the world are latently infected with M. tuberculosis.7 Among them, 5%–10% will develop active tuberculosis (ATB) during their lifetime, especially when their immune system is weak. On the country level, China and India had the highest latent tuberculosis infection (LTBI) burden, followed by Indonesia.7 With reasonable assumptions for reactivation risks, incident tuberculosis cases arising from the LTBI reservoir would prohibit reaching the ‘End Tuberculosis Strategy’ goal. Accurate and rapid diagnosis would allow the medications to be allocated appropriately, and actions can be taken to curtail M. tuberculosis spread more effectively. The traditional tuberculin skin test (TST) and the recently developed interferon-gamma release assay (IGRA) can assist in the diagnosis of LTBI, but they neither distinguish between infection and active disease nor predict the risk of activation from latent infection.8–10

IGRAs are blood tests that detect the secretion of IFN-γ by sampled lymphocytes after stimulation with proteins that are relatively specific for M. tuberculosis.11 As IFN-γ is produced by memory T cells,12 it is not surprising that the measurement of this cytokine alone cannot accurately distinguish LTBI subjects from those with active disease.13 Detecting other cytokines and adopting separate or combined methods can significantly improve diagnostic accuracy. With a marginally effective vaccine and no apparent breakthrough in new treatments, a sensitive and specific method to distinguish the active disease from LTBI would allow for an early diagnosis and limit the spread of the pathogen. Thus, we performed this meta-analysis through an extensive and in-depth search for relevant studies to analyse the possibility of multiple cytokine profiles to differentiate these two diseases.



Our protocol was performed using the methods of the Cochrane Handbook for Systematic Reviews of Interventions. We performed this meta-analysis followed the Preferred Reporting Items for Systematic Reviews and Meta-Analyses.14

Data sources and searches

We selected PubMed, Cochrane Library, Clinical Key databases and EMBASE for systematic and comprehensive searches. Articles published on 31 December 2019 were searched. The primary search process had no language restrictions. We further read the references cited in the selected articles to identify other relevant studies and improve the search sensitivity. The search terms are listed in online supplementary table S1.

Study selection

We selected articles describing pathological changes of cytokines, including IFN-γ, TNF-α, IP-10, IL-2, IL-10, IL-12 and VEGF, stimulated by M. tuberculosis antigen, among patients with ATB and LTBI. Cytokines were analysed quantitatively or qualitatively. The ability of cytokines as biomarkers to discriminate ATB from LTBI was evaluated. We included articles using the designs of either case–control studies, cohort studies or randomised controlled trials (RCTs). The exclusion criteria were as follows: editorial, correspondence, narrative review or system review; the number of ATB or LTBI cases was less than 10; studies did not report any follow-up outcomes and studies did not report true positive (TP), false positive (FP), false negative (FN) and true negative (TN) or did not provide sufficient data to calculate them. Two researchers conducted rigorous and independent assessments of the articles. Differences were resolved through negotiation. We did not find any quantitative and qualitative differences between them in the article search and data extraction phase. Their interagreement was 100%.

Data extraction

Two independent extractors extracted the data. We retrieved and read the entire content of the selected articles and extracted data including the first author, publication date, study area, sample size, sample type, reference standard, demographics (age and gender), clinical characteristics (HIV infection, diabetes, liver or kidney injury, drug resistance, previous history of tuberculosis, extrapulmonary tuberculosis and lung cavity), TP, FP, FN and TN. All data were summarised and processed in the form of a feature table.

Risk of bias assessment

We used the Quality Assessment of Diagnostic Accuracy Studies (QUADAS-2) to assess the quality and risk of bias of each study.15 The items of QUADAS-2 covered the disease spectrum, gold standard, disease progression bias, verification bias, evaluation bias, clinical evaluation bias, pooling bias, trial implementation, case withdrawal and uncertain results. The evaluation results were defined as ‘yes’, ‘no’ or ‘unclear’.


The sensitivity, specificity, diagnostic OR (DOR), positive likelihood ratio (PLR) and negative likelihood ratio (NLR), together with 95% CI, were used to estimate the diagnostic value of the cytokines.

Statistical analysis

We used Excel 2010 to draw feature tables and STATA V.15 (StataCorp, College Station, Texas, USA) to perform the meta-analysis. The pooled sensitivity, specificity, PLR, NLR, DOR and 95% CI for each cytokine were calculated. A forest plot was drawn to visually show the difference in the point estimates of each study. A summary receiver operating characteristic (SROC) curve was plotted, and the overall diagnostic value of cytokines was displayed by the area under the curve (AUC). The fixed or random-effect model was applied based on the heterogeneity test. If I2 >50% or p<0.10, we selected the random-effects model; otherwise, we applied the fixed-effects model. Meta-regression analysis was used to explore the causes of heterogeneity. Egger’s test and Begg’s test were applied to detect possible publication bias.

Patient and public involvement

Patients and public were not involved in this study.


Search results

Preliminary searching yielded 1362 records. Then, we removed 382 duplicated records, 824 irrelevant articles by reading titles and abstracts and 142 irrelevant articles that did not meet the inclusion criteria after reading the full text. Finally, there were 14 articles included in the meta-analysis (figure 1).8 10 16–27

Figure 1

Flow diagram of the search process. PRISMA, Preferred Reporting Items for Systematic Reviews and Meta-Analyses.

Characteristics of the studies

Articles were published during 2012–2019. They were performed in China (5), India (1), Australia (1), South Korea (5), Japan (1) and Italy (1), respectively. Except for Australia and Italy, all countries had a relatively high burden of tuberculosis. The total number of study subjects was 959, including 476 ATB cases and 483 LTBI cases. One study used the T-spot as the reference standard for ATB,16 while the others applied the M. tuberculosis pathogenic test. One study defined LTBI based on positive TST results and close contact with ATB patients for more than 1 month without clinical symptoms,17 two studies defined LTBI based on a positive TST and IGRA,18 19 and the other 11 studies used QuantiFERON-TB Gold In-Tube (QFT-IT), chest X-ray examinations and clinical symptoms as reference standards. Seven studies reported Bacillus Calmette-Guerin vaccination history. Four articles explicitly reported whether the patients had extrapulmonary tuberculosis. The characteristics of the included studies are listed in table 1.

Table 1

Baseline characteristics of the studies

Study quality

As shown in figure 2, two studies had a high risk of bias with flow and timing concerns. We found that the applicability concerns were low for ‘patient selection’ in seven studies, ‘index tests’ in six studies, and ‘reference standard’ in one study.

Figure 2

Quality assessment of the studies.

Pooled diagnostic value of cytokines in distinguishing ATB and LTBI

Seven cytokines, IFN-γ, TNF-α, IP-10, IL-2, IL-10, IL-12 and VEGF, were selected as indicators to calculate the accuracy and ability of their use as biomarkers to differentiate ATB and LTBI. Cytokines and related indicators included in every study are shown in table 2. One study23 applied the FluoroSpot, five studies19 21 22 25 27 applied an ELISA assay and eight studies used Luminex to measure the cytokines. The forest plots and SROC curves are shown in online supplementary figures S1–14. The pooled sensitivity, specificity, PLR, NLR, DOR, AUC and heterogeneity index I2 and p-value are summarised in table 3. The numbers of study subjects in each study are listed in table 4. IL-2 had the highest sensitivity (0.84, 95% CI: 0.72 to 0.92) and VEGF had the highest specificity (0.87, 95% CI: 0.73 to 0.94). IFN-γ had the highest DOR (12, 95% CI: 5 to 26). After drawing the SROC curves for seven cytokines, the highest AUC was 0.85 (95% CI: 0.81 to 0.88) for VEGF, followed by IFN-γ (0.84, 95% CI: 0.80 to 0.87) and IL-2 (0.84, 95% CI: 0.81 to 0.87).

Table 2

Cytokines and related indicators included in every study

Table 3

Summary of the meta-analysis for each cytokine

Table 4

The number of subjects in each study

Meta-regression analysis

The meta-regression analysis results are shown in online supplementary tables S2–8 and figures S15–21. Regression models included joint models and models for sensitivity and specificity that were independently established. We identified five factors that may have caused the heterogeneity, including study design, inclusion and exclusion of study subjects, reference standard, independence of the index test and reference standard and the method of the index test.

Publication bias evaluation

Publication bias was judged by Egger’s and Begg’s test and is shown in online supplementary table S9. IP-10 had an apparent publication bias (Egger’s test p=0.078; Begg’s test p=0.016). The other six cytokines did not show a significant publication bias. The funnel plots are illustrated in online supplementary figures S22–28.


The advantage and originality of this meta-analysis lay in its search of major databases, considering as many cytokines as possible, and including various types of professional studies. We evaluated seven cytokines (IFN-γ, TNF-α, IP-10, IL-2, IL- 10, IL-12 and VEGF) in the scope of the meta-analysis and probed their capacity as biomarkers to distinguish ATB and LTBI, which is unprecedented in previous studies. We observed that IL-2 had the highest sensitivity, and VEGF had the highest specificity. Although the alternative test using smear microscopy suggested a sensitivity of at least 80% and a specificity of at least 98%,28 cytokines such as IL-2 and VEGF also have potential discrimination abilities. As expected, IFN-γ had the highest DOR value.

To explore factors that may cause heterogeneity and bias in this meta-analysis, we first stratified the articles by the study design. Except for four studies using cohort or case–control designs,8 16 20 26 the other 10 studies were RCTs. The RCT has distinct advantages and can effectively prevent selective bias. Then, we performed a subgroup analysis by the reference standard. Although TST and IGRA are commonly used as screening tools, there is no unified and clear reference standard for LTBI. In this meta-analysis, one study defined LTBI based on TST,17 two studies comprehensively considered the results of TST and IGRA,18 19 and the other 11 studies relied on IGRA to determine M. tuberculosis infection.

In addition to the study design and reference standard, cytokine detection methods may also affect the results. For 14 studies included in this meta-analysis, one study used FluoroSpot,23 five studies used traditional ELISA or capillary-based ELISA19 21 22 25 27 and the other eight studies used Luminex. The FluoroSpot applies selective filters for emission, which can analyse each analyte separately and then identify the double-stained and triple-stained spots. It can detect two or three cytokines at the same time with high sensitivity and specificity.29 ELISA is widely used in the determination of cytokines in various body fluids with high repeatability. However, traditional ELISA has the disadvantages of complicated operation, long measurement time and large sample consumption. Capillary-based ELISA significantly improves the above disadvantages, shortening the measurement time to 16 min and reducing the sample volume to 20 µL.30 Luminex is now a vital tool for the quantitative determination of cytokines. It is possible to measure multiple cytokines simultaneously with a small sample in a short time by using hundreds of micrometer-scale specially prepared microspheres.31 Also, the precision of the equipment used to measure the cytokines and the choice of cytokine threshold would affect the diagnostic value. In most cases, the threshold is determined by the receiver operating characteristic curve with maximised sensitivity and specificity.32 33 However, in areas with a low burden of tuberculosis, the threshold may be set at a lower level in order to better distinguish the active and latent tuberculosis.34

To improve the diagnostic value, multiple cytokines are usually used in combination. Won et al found that a combination of five biomarkers (IL-5, IL-10, TNF-α, VEGF and IL-2/IFN-γ) can predict 95.5% of ATB and 93.3% of LTBI.8 In another study, the combination of ESAT-6/CFP-10-specific EGF and Rv2032-specific VEGF correctly discriminated against all participants (100%).35 Kim et al reported that the combination of IFN-γ, TNF-α and IL-2R had a sensitivity of 100% and a specificity of 86.36%.19 Wang et al found that six cytokines in combination (tuberculosis antigen-stimulated IFN-γ, IP-10 and IL-1Ra; unstimulated cytokines of IP-10, VEGF and IL-12) had a sensitivity of 85.7% and a specificity of 91.3%.20 Our analysis showed that the combination of cytokines represented by IL-2, VEGF and IFN has potential value in screening for patients with ATB and LTBI. However, the immune response to M. tuberculosis infection is complex and multifaceted. The impact of coinfection with HIV and other iatrogenic causes on test performance in immunocompromised patients needs to be determined to understand the full benefits and limitations of this technology.

Millions of patients with LTBI are underdiagnosed every year,36 37 and there is an urgent need for better diagnostic tools.38 The quick differentiation and correct identification of ATB from LTBI is the current focus of global tuberculosis prevention and control. Blood and urine are good sources of samples for diagnosis without causing harm to the human body.39 Findings from our meta-analysis have particular guiding significance and a theoretical basis for clinical practice, which could provide clues for developing new methods and techniques to screen for tuberculosis and LTBI.

Our study has several limitations. First, as mentioned above, the differences in study design, reference standards and cytokine determination method may be sources of bias. Second, the studies involved in the analysis were mainly conducted in countries with a high burden of tuberculosis. The diagnostic value of cytokines in low prevalence areas is uncertain. Third, there are differences in the quality of different research groups, which may also contribute to heterogeneity. Although we used QUADAS-2 to assess the quality and risk of bias of each study, it could not fully consider all kinds of causes of bias and heterogeneity.

Although this meta-analysis has several limitations mentioned above, the findings of this study are valuable and provide evidence regarding cytokines, such as IL-2, IFN-γ and VEGF, to be utilised as promising biomarkers to distinguish ATB from LTBI.


Supplementary materials

  • Supplementary Data

    This web only file has been produced by the BMJ Publishing Group from an electronic file supplied by the author(s) and has not been edited for content.


  • Contributors BQ and JW conceived, initiated and led the study. BQ, QL, ZL, HS, DX, YeJ, YaJ and DT collected the data. BQ and QL analysed the data with input from all of the authors. BQ and JW prepared the manuscript. All authors reviewed and approved the manuscript.

  • Funding This study was supported by the National Natural Science Foundation of China (81973103), National Key R&D Program of China (2017YFC0907000), Qing Lan Project of Jiangsu Province (2019) and Priority Academic Program Development of Jiangsu Higher Education Institutions (PAPD). The funding agencies had no role in the study design, data collection, analysis, decision to publish or preparation of the manuscript.

  • Competing interests None declared.

  • Patient consent for publication Not required.

  • Ethics approval The ethical approval can be exempted as this is a systematic review and meta-analysis of data from already published studies.

  • Provenance and peer review Not commissioned; externally peer reviewed.

  • Data availability statement All data relevant to the study are included in the article or uploaded as supplementary information. All data generated or analysed during this study are included in this published article.

  • Supplemental material This content has been supplied by the author(s). It has not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been peer-reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and responsibility arising from any reliance placed on the content. Where the content includes any translated material, BMJ does not warrant the accuracy and reliability of the translations (including but not limited to local regulations, clinical guidelines, terminology, drug names and drug dosages), and is not responsible for any error and/or omissions arising from translation and adaptation or otherwise.

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.