Rapid Diagnostic Algorithms as a Screening Tool for Tuberculosis: An Assessor Blinded Cross-Sectional Study

Franz Ratzinger; Harald Bruckschwaiger; Martin Wischenbart; Bernhard Parschalk; Delmiro Fernandez-Reyes; Heimo Lagler; Alexandra Indra; Wolfgang Graninger; Stefan Winkler; Sanjeev Krishna; Michael Ramharter

doi:10.1371/journal.pone.0049658

Abstract

Background

A major obstacle to effectively treat and control tuberculosis is the absence of an accurate, rapid, and low-cost diagnostic tool. A new approach for the screening of patients for tuberculosis is the use of rapid diagnostic classification algorithms.

Methods

We tested a previously published diagnostic algorithm based on four biomarkers as a screening tool for tuberculosis in a Central European patient population using an assessor-blinded cross-sectional study design. In addition, we developed an improved diagnostic classification algorithm based on a study population at a tertiary hospital in Vienna, Austria, by supervised computational statistics.

Results

The diagnostic accuracy of the previously published diagnostic algorithm for our patient population consisting of 206 patients was 54% (CI: 47%–61%). An improved model was constructed using inflammation parameters and clinical information. A diagnostic accuracy of 86% (CI: 80%–90%) was demonstrated by 10-fold cross validation. An alternative model relying solely on clinical parameters exhibited a diagnostic accuracy of 85% (CI: 79%–89%).

Conclusion

Here we show that a rapid diagnostic algorithm based on clinical parameters is only slightly improved by inclusion of inflammation markers in our cohort. Our results also emphasize the need for validation of new diagnostic algorithms in different settings and patient populations.

Citation: Ratzinger F, Bruckschwaiger H, Wischenbart M, Parschalk B, Fernandez-Reyes D, Lagler H, et al. (2012) Rapid Diagnostic Algorithms as a Screening Tool for Tuberculosis: An Assessor Blinded Cross-Sectional Study. PLoS ONE 7(11): e49658. https://doi.org/10.1371/journal.pone.0049658

Editor: Olivier Neyrolles, Institut de Pharmacologie et de Biologie Structurale, France

Received: June 14, 2012; Accepted: October 11, 2012; Published: November 21, 2012

Copyright: © 2012 Ratzinger et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Funding: The study was in part supported by the Landsteiner Gesellschaft. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Introduction

Tuberculosis is causing an estimated 1.7 million deaths per year and the highest burden of disease is found in regions of high HIV prevalence. [1] One of the main obstacles to effective treatment and control of tuberculosis is a lack of accurate, rapid, point-of-care and low-cost diagnostic tools. [2] Radiology and microscopy of sputum samples remain the most important diagnostic tools in low-income regions and culture, PCR, histology, and radiology are additional valuable diagnostic tools in high-income countries. Recently, the development of automated molecular tests for the diagnosis of pulmonary tuberculosis showed promising results, however this diagnostic approach is less useful for extra-pulmonary infections. [3] To date no diagnostic method is therefore able to provide high diagnostic accuracy in a timely manner for pulmonary and extra-pulmonary tuberculosis. Other diagnostic tools including the Mendel Mantoux skin test or interferon gamma release assays cannot reliably discriminate between latent infection and active disease. [4], [5] PCR based diagnostic tools are rapid and show promising diagnostic accuracy in sputum positive tuberculosis, however cost and extrapulmonary infections are limiting its usefulness.[6]–[8] Recently developed FACS based diagnostic methods for extra-pulmonary tuberculosis show promising diagnostic results but necessitate advanced technical equipment and skills, and wait for prospective evaluation in different patient populations. [9] Similarly, current efforts to identify novel biomarkers or screening rules for tuberculosis have not yet resulted in a reliable candidate molecule for further clinical assessment.[10]–[14].

Based on proteomic fingerprinting of serum Agranoff and colleagues proposed a rapid screening test for active tuberculosis based on the measurement of inflammation parameters including C-reactive protein, transthyretin, serum amyloid A, and neopterin. [15] The proposed classification-model was established by machine learning methods to obtain the best diagnostic algorithm. [16], [17] In that publication a diagnostic accuracy of up to 84% was reported in a prospectively obtained data set for the detection of active cases of primarily pulmonary tuberculosis. Although this test performance is far from perfect, a reliable algorithm might considerably help in classifying patients in high or low probability for tuberculosis. This might help to focus more time consuming and resource intensive investigations only on persons with high pre-test probability. To better appreciate the diagnostic potential of the previously published algorithm external validity needs to be assessed in different patient populations.[18]–[20].

The aim of this study was to assess the external validity of the initially reported diagnostic algorithm for the diagnosis of active pulmonary and extra-pulmonary tuberculosis in a Central European cohort. In addition we aimed to establish improved screening algorithms by machine learning methodology. For this purpose we aimed to construct two models – one including all useful laboratory and clinical parameters, and another model relying entirely on clinical information. The development of a diagnostic algorithm based on clinical information only was judged to being particularly useful for low-income regions.

Materials and Methods

Study Design and Outcome Parameters

This study was designed as a cross-sectional study. The study population consisted of 439 patients with clinical suspicion for active tuberculosis. All patients attending as out- or in-patients the Department of Infectious Disease at the Vienna General Hospital, Medical University of Vienna, between October 2001 and June 2008 were considered eligible, if the treating physician had requested laboratory testing of any biologic samples for mycobacterial culture.

Cases were classified as suffering from active tuberculosis by either a positive culture result for M. tuberculosis or a diagnosis based on either histology or radiology results suggestive for active tuberculosis and clinical cure following administration of specific anti-tuberculosis treatment. Non-tuberculosis patients were defined as subjects for whom biological samples had been sent for mycobacterial culture, but for whom an alternative diagnosis was established. Patients with HIV infection and paediatric patients were excluded from further analysis.

Patients being evaluated for tuberculosis routinely underwent assessment of serum inflammation markers at our institution. Those individuals for whom no results for acute phase parameters were available were excluded from further analysis. The inflammation parameters C-reactive protein, serum amyloid A, fibronectin, haptoglobin, and interleukin 6 were assessed routinely by nephelometry (Siemens DADE BN II). Similarly haematology, clinical biochemistry and blood sedimentation rates were performed routinely. Neopterin and transthyretin were analysed for the purpose of this study by ELISA (neopterin, Enzyme Linked Immunoassay, IBL Hamburg, Germany) and nephelometry (transthyretin, Siemens DADE Behring BN II) using frozen serum samples. Clinical information, microbiologic culture results, and results of histopathology and radiology were obtained from electronic patient records.

Ethics Statement and Statistical Analysis

All participants provided written consent for the use and analysis of data and archived specimens. The study protocol was approved by the Ethics Committee of the Medical University of Vienna (EK: 724/2007). All data were pseudonymized and were entered into an electronic database and statistical analysis was performed using a commercially available software package (SPSS Statistics 16.0, SPSS Inc.). For comparison between groups Pearson’s χ²-test or a Mann-Whitney-U-test was applied as appropriate. Statistical significance was defined at a level of α = 0.05 and the Bonferroni-Holm approach was used for correction for multiple testing. For the purpose of validating a previously published diagnostic algorithm, outcome information was masked and data were sent for outcome prediction to the trial statistician of the previous study. [15] Classification of patients was performed by the blinded statistician and the outcomes were returned for the evaluation of the diagnostic accuracy for this data set. Further analysis was performed using various supervised machine learning techniques. We applied different such methods for classifying the feature based data into classes (TB, not TB), as desired.

Briefly describing the used methods, (I) a support vector machine (SVM) generates a discriminant function from training samples, based on so-called support vectors, maximizing the margin between classes. [21] Furthermore, (II) the ADTree + AdaBoost algorithm iteratively improves “weak” decision trees to a “strong” model, i.e., focusing on those instances that were misclassified in the previous iteration. [22] Furthermore, different prediction models were established using the (III) naïveBayes algorithm, calculating prior-, conditional- and posterior-probabilities, (IV) the logistic regression classifier, characterized by membership function for each class, and (V) the multi-layer perceptron (an artificial neural network), combining various linear models for non-linear classification.[21], [23]–[25] In this context attribute evaluators serve the purpose to skip irrelevant parameters of the data set prior to classification. Further optimization was performed by the discretization filter that converts continuous to nominal values and the principal components analysis (PCA), which transforms conceivably correlated parameters to an uncorrelated set of variables (i.e., transforms the variables to a different space, using the principal components as basis).[23], [26]–[28].

The Java based software suite WEKA (Waikato Environment for Knowledge Analysis, version 3.6.2, URL: http://www.cs.waikato.ac.nz/ml/weka/,licensed under GNU General Public License) was applied for the construction of improved diagnostic algorithms. [29] Missing values were not imputed in our data set. Optimization results of the models were assessed in internal validation. All training sets were trained with all major supervised classifying algorithms, maximizing the accuracy. When equal accuracy was rated, better Receiver Operating Characteristic (ROC) curve was used as selection criteria. [30] The outcome of the machine learning process was evaluated in a stratified 10-fold cross validation. [31], [32].

Results

Following inclusion and exclusion criteria on all subjects being consecutively screened for tuberculosis a study population of 206 patients was constituted. 233 patients were excluded from further analysis, due to the unavailability of stored blood specimens (172 patients), missing data or loss of follow up (35 patients), diagnostic uncertainty or patients already receiving tuberculostatic therapy at the time of first physician’s contact (18 patients), infection with Mycobacteria other than tuberculosis (MOTT, 3 patients), HIV infection (4 patients) and age (1 patient).

Among those individuals 36 had a definitive diagnosis of active tuberculosis and 170 patients were suffering from other conditions (see: table 1). Distribution of diagnostic test for establishing diagnosis of active tuberculosis is presented in table 2. Clinical and laboratory characteristics of the study population are depicted in table 3. Median age, body mass index, C-reactive protein, serum amyloid A and were all significantly lower in tuberculosis than in non-tuberculosis patients in univariate analysis after adjustment for multiple testing using the Bonferroni-Holm procedure.

Download:

Table 1. Baseline characteristics of study population.

https://doi.org/10.1371/journal.pone.0049658.t001

Download:

Table 2. Type of confirmation of active tuberculosis.

https://doi.org/10.1371/journal.pone.0049658.t002

Download:

Table 3. Clinical and laboratory characteristics of tuberculosis and non-tuberculosis patients.

https://doi.org/10.1371/journal.pone.0049658.t003

Evaluation of Diagnostic Algorithm

The data set was masked for outcomes and sent to the authors of the previously published study for analysis. Predicted outcomes were used for computation of diagnostic accuracy of the diagnostic algorithm in our patient population. One patient had to be excluded in this evaluation due to missing transthyretin values. Predicted outcomes are depicted in table 4.

Download:

Table 4. Diagnostic performance of tested diagnostic algorithms.

https://doi.org/10.1371/journal.pone.0049658.t004

In summary, the Gaussian kernel based support vector machine model (SVM 1) yielded a moderate diagnostic accuracy of 54% (47%–61%) when applied to our patient population showing sensitivity and specificity of 19% (8%–36%) and 62% (52%–71%), respectively. The second evaluated model, the meta-classifier model (AD 2) reached a diagnostic test accuracy of 42% (35%–49%) sensitivity: 58% (40%–75%), specificity: 38% (31%–46%).

Development of Extended Diagnostic Algorithms

We aimed to develop two new diagnostic models by a machine learning approach – one making use of all available parameters (“Optimal Performance Algorithm”) and an alternative restricted to the use of clinical parameters (“Clinical Data Algorithm”). Firstly, most potent feature sets were identified to maximize the performance of the classification model. The feature selection process was started with single attribute evaluators, combined with a ranker search. All standard single attribute evaluators led to similar results, identifying the following six parameters: age, body mass index, C-reactive protein, serum amyloid A, weight loss, and night sweat. In an additional step, attribute subset evaluators were used on the original feature set and age, body mass index, C-reactive protein, and serum amyloid A were identified as evaluators. These results were consistent with the univariate analysis of variables. Two training sets were created with the aim to obtain two distinct diagnostic algorithms. Firstly we aimed to maximize test performance by including all useful parameters. Secondly we intended to construct a model that entirely relies on clinical information and may therefore prove particularly useful in low-income regions lacking the infrastructure to perform laboratory analysis of inflammation markers.

We tested the parameter sets with principal component analysis, the entropy based discretization method of Fayyad and Irani and a combination of both methods. [27], [28] The approach resulting in the best outcome in a stratified 10-fold cross validation was chosen. These included the following attributes for the “optimal performance set”: age, body mass index, C-reactive protein, night sweat. The discretization method of Fayyad and Irani, which yielded into improved models in this training set, was not able to establish discrete counterparts of serum amyloid A. For the clinical data model the parameters age, body mass index, and night sweats were identified.

All major supervised machine learning techniques were applied and evaluated by an internal 10-fold cross validation. According to these results, a logistic regression based classifier, the Naïve Bayes algorithm and a multilayer preceptor were identified as superior in our data set. Logistic regression based classification was performed with the ridge estimator of leChessie and van Houwelingen to establish an improved diagnostic model. [33] Naïve Bayes was used in standard settings. The multilayer preceptor was performed in standard settings using 4 hidden layers. [29].

The “Optimal Performance Algorithm” evaluated those parameters with best data pre-processing performance. The logistic regression based classifier was enhanced by the use of the discretization filter, and the Naïve Bayes was improved by the application of principal component analysis and the discretization filter.

Employing these settings a diagnostic accuracy of 86% (80%–90%) was achieved for our patient population with an area under the curve (AUC) of the receiver operating characteristic (ROC) of 0.78. In this analysis the sensitivity was 42% (26%–59%) and the specificity was 95% (91%–98%). The true positive rate for tuberculosis cases in our study population was between 42% and 61% (see: table 4).

For the evaluation of the “Clinical Data Algorithm” the multilayer preceptor employing in standard settings showed the best accuracy. A diagnostic accuracy of 85% (79%–89%) could be achieved. Sensitivity [31% (16%–48%), specificity: 96%, (92%–98%)] and the AUC of the ROC curve (0.7) was lower than the previously established model. The logistic model combined with discretization and principal components analysis led to a similar result but to a lower ROC curve (see: table 4).

Discussion

Rapid and reliable diagnostic tests for tuberculosis are urgently needed and the previously published diagnostic algorithms showed highly encouraging accuracy. Provided that this good diagnostic precision is reproducible for diverse patient populations and settings, such a rapid assessment tool, which could be part of a point-of-care test for active tuberculosis, would constitute a major improvement in the diagnosis, management, and control of tuberculosis. [34].

In this study the previously published diagnostic classification model was evaluated in a Central European patient population. The diagnostic accuracy was disappointingly low at 54% and 42%. This poor diagnostic performance may be explained by various factors. Whereas the analysis of data was identical in both studies – and classification of cases was performed by the same person as in the initial study – the patient population under investigation differed considerably between the two studies.

Whereas Agranoff and colleagues worked with a study population predominantly suffering from pulmonary tuberculosis, our study population included a significant proportion of patients with extra-pulmonary tuberculosis. However, in our data set the diagnostic accuracy of the Agranoff model was not significantly improved when restricting the analysis to only those individuals suffering from pulmonary tuberculosis [n = 19; accuracy: 58% (SVM) and 41% (AdaBoost) see: table 4].

Contrary to the previous study, no HIV seropositive patients were included in our study. Other potential differences may include variations in treatment seeking behaviour, diagnostic approaches of caring physicians, differences in ethnicity of patients, and a discrepancy in pre-test probability based on an unequal numeric distribution of cases and controls.

Whereas an equal number of cases and controls was selected in Agranoff´s study following a case-control study design, we used a cross-sectional study design applying predefined inclusion and exclusion criteria in order to avoid an artificially high proportion of tuberculosis patients in our data set. Therefore all consecutive patient with clinical suspicion for tuberculosis were included leading to a 1∶4 distribution of tuberculosis and non-tuberculosis patients, respectively. Differences in the pre-test probability invariably affect the performance of diagnostic models and may be an explanation of impaired generalizability of both the previously published and the newly established model.

In addition, the ethnic origin of patients was unevenly distributed in Agranoff’s training set. Whereas 79% of tuberculosis patients originated from sub-Saharan Africa in the training set, the proportion Africans was only 34% in the control group. Furthermore the control group in that study was heterogeneous consisting of both patients suffering from inflammatory conditions and healthy volunteers. Whereas limitations of our study are the retrospective identification of this patient cohort, a limited sample size, and exclusion of potential participants due to missing data for a proportion of identified subjects, a great emphasis was laid on the constitution of a homogenous comparator that was entirely chosen based on the exposure (suspicion for tuberculosis) and not for the outcome under investigation (diagnosis of tuberculosis). All these factors may explain the lower than expected diagnostic accuracy of the initially published model and stress the need for further improvement and prospective evaluation of this diagnostic algorithm in various clinical settings.

Following our goal to develop an improved diagnostic algorithm, we used machine learning methodology to obtain an improved diagnostic algorithm. The “Optimal Performance Algorithm”, including age, body mass index, night sweat, C-reactive protein led to a diagnostic accuracy of 86% (80%–90%) with an AUC of the ROC-curve of 0.78 in an internal 10-fold cross validation. Similarly the “Clinical Data Algorithm”, consisting of age, body mass index and night sweat, had a diagnostic accuracy of 85% (79%–89%) and an AUC-ROC of 0.70. Considering the ease of obtaining the respective clinical parameters and the variability in the model estimation the Clinical Data Algorithm seems particularly useful. This finding may also be interpreted in that way that the inclusion of inflammation parameters does not significantly improve diagnostic models in tuberculosis. However further prospective evaluation in these diverse clinical settings and comparative evaluation to the diagnostic accuracy by a skilled physician is warranted in future prospective studies.

Considering the presented results, no final judgment may therefore be given whether machine learning based diagnostic algorithms are an appropriate screening method for tuberculosis or not. Arguably clinical parameters of patients suffering from tuberculosis may vary considerably and other parameters than inflammation parameters may prove more suitable as markers for the screening of patients. These markers may include serum concentrations of calcium [35]–[37], iron [38], vitamin D [39]–[41] or orosomucoid [42], [43] and it may prove rewarding to evaluate those alone or in combination in future diagnostic algorithms.

In summary this study demonstrates low external validity of the previously published machine learning based diagnostic algorithm when evaluated for our patient population. Although diagnostic algorithms with improved diagnostic precision were established based on data of a Central European patient population, further independent prospective evaluation of these models is needed to better appreciate the potential of machine learning based diagnostic algorithms for the rapid screening of patients for active tuberculosis.

Author Contributions

Conceived and designed the experiments: FR HB WG SW SK MR. Performed the experiments: MW BP DFR HL AI MR. Analyzed the data: FR HB MW DFR MR. Contributed reagents/materials/analysis tools: FR HB MW DFR MR. Wrote the paper: FR HB MW BP DFR HL AI WG SW SK MR.

References

1. World Health O (2010) Global tuberculosis control : WHO report 2010: World Health Organization.
2. Perkins MD, Kritski AL (2002) Diagnostic testing in the control of tuberculosis. Bull World Health Organ 80: 512–513.
- View Article
- Google Scholar
3. Boehme CC, Nabeta P, Hillemann D, Nicol MP, Shenai S, et al. (2010) Rapid molecular detection of tuberculosis and rifampin resistance. N Engl J Med 363: 1005–1015.
- View Article
- Google Scholar
4. Papay P, Eser A, Winkler S, Frantal S, Primas C, et al. (2011) Factors impacting the results of interferon-gamma release assay and tuberculin skin test in routine screening for latent tuberculosis in patients with inflammatory bowel diseases. Inflamm Bowel Dis 17: 84–90.
- View Article
- Google Scholar
5. Sester M, Sotgiu G, Lange C, Giehl C, Girardi E, et al. (2011) Interferon-gamma release assays for the diagnosis of active tuberculosis: a systematic review and meta-analysis. Eur Respir J 37: 100–111.
- View Article
- Google Scholar
6. Glennon M, Cormican M (2001) Detection and diagnosis of mycobacterial pathogens using PCR. Expert Rev Mol Diagn 1: 163–174.
- View Article
- Google Scholar
7. Balasingham SV, Davidsen T, Szpinda I, Frye SA, Tonjum T (2009) Molecular diagnostics in tuberculosis: basis and implications for therapy. Mol Diagn Ther 13: 137–151.
- View Article
- Google Scholar
8. Miller MB, Popowitch EB, Backlund MG, Ager EP (2011) Performance of Xpert MTB/RIF RUO assay and IS6110 real-time PCR for Mycobacterium tuberculosis detection in clinical samples. J Clin Microbiol 49: 3458–3462.
- View Article
- Google Scholar
9. Nemeth J, Winkler HM, Zwick RH, Rumetshofer R, Schenk P, et al. (2009) Recruitment of Mycobacterium tuberculosis specific CD4+ T cells to the site of infection for diagnosis of active tuberculosis. J Intern Med 265: 163–168.
- View Article
- Google Scholar
10. Getahun H, Kittikraisak W, Heilig CM, Corbett EL, Ayles H, et al. (2011) Development of a Standardized Screening Rule for Tuberculosis in People Living with HIV in Resource-Constrained Settings: Individual Participant Data Meta-analysis of Observational Studies. PLoS Med 8: e1000391.
- View Article
- Google Scholar
11. Cain KP, McCarthy KD, Heilig CM, Monkongdee P, Tasaneeyapan T, et al. (2010) An Algorithm for Tuberculosis Screening and Diagnosis in People with HIV. New England Journal of Medicine 362: 707–716.
- View Article
- Google Scholar
12. Marais BJ, Gie RP, Hesseling AC, Schaaf HS, Lombard C, et al. (2006) A refined symptom-based approach to diagnose pulmonary tuberculosis in children. Pediatrics 118: e1350–1359.
- View Article
- Google Scholar
13. Wilson D, Badri M, Maartens G (2011) Performance of Serum C-Reactive Protein as a Screening Test for Smear-Negative Tuberculosis in an Ambulatory High HIV Prevalence Population. Plos One 6.
14. van’t Hoog AH, Meme HK, Laserson KF, Agaya JA, Muchiri BG, et al. (2012) Screening Strategies for Tuberculosis Prevalence Surveys: The Value of Chest Radiography and Symptoms. Plos One 7.
15. Agranoff D, Fernandez-Reyes D, Papadopoulos MC, Rojas SA, Herbster M, et al. (2006) Identification of diagnostic markers for tuberculosis by proteomic fingerprinting of serum. Lancet 368: 1012–1021.
- View Article
- Google Scholar
16. Thiede B, Hohenwarter W, Krah A, Mattow J, Schmid M, et al. (2005) Peptide mass fingerprinting. Methods 35: 237–247.
- View Article
- Google Scholar
17. Vapnik V (1998) Statistical learning theory: Wiley.
18. Tanaka T, Sakurada S, Kano K, Takahashi E, Yasuda K, et al. (2011) Identification of tuberculosis-associated proteins in whole blood supernatant. BMC Infect Dis 11: 71.
- View Article
- Google Scholar
19. Lange C (2007) [Year in review: tuberculosis 2006]. Pneumologie 61: 518–521.
- View Article
- Google Scholar
20. Walzl G, Ronacher K, Hanekom W, Scriba TJ, Zumla A (2011) Immunological biomarkers of tuberculosis. Nat Rev Immunol 11: 343–354.
- View Article
- Google Scholar
21. Kotsiantis SB, Zaharakis ID, Pintelas PE (2006) Machine learning: a review of classification and combining techniques. Artif Intell Rev 26: 159–190.
- View Article
- Google Scholar
22. Freund Y, Mason L (1999) The Alternating Decision Tree Learning Algorithm. Proceedings of the Sixteenth International Conference on Machine Learning: Morgan Kaufmann Publishers Inc. 124–133.
23. John GHL, P. (1995) Estimating Continuous Distributions in Bayesian Classifiers. In Proceedings of UAI. 338–345.
24. Mitchell TM (1997) Machine Learning: McGraw-Hill, Inc. 154–184.
25. Alpaydin E (2010) Introduction to Machine Learning: The MIT Press. 220–231.
26. Guyon I, Elisseeff A (2006) An Introduction to Feature Extraction Feature Extraction. In: Guyon I, Nikravesh M, Gunn S, Zadeh L, editors: Springer Berlin/Heidelberg. 1–25.
27. Fayyad UM, Irani KB (1993) Multi-Interval Discrertiation of Continuous-Valued Attributes for Classification Learning. Proc Thirteenth International Joint Conference on Artificial Intelligence. Los Altos, CA: Morgan Kaufman Publ. Inc. 1022–1027.
28. Jolliffe IT (2002) Introduction. Principal Component Analysis: Springer New York. 1–9.
29. Witten I, Frank E (2005) Data Mining: Practical Machine Learning Tools and Techniques: Morgan Kaufmann.
30. Fawcett T (2006) An introduction to ROC analysis. Pattern Recogn Lett 27: 861–874.
- View Article
- Google Scholar
31. Refaeilzadeh P, Tang L, Liu H (2009) Cross Validation. Encyclopedia of Database Systems: Springer.
32. Kohavi R (1995) A study of cross-validation and bootstrap for accuracy estimation and model selection. Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2. Montreal, Quebec, Canada: Morgan Kaufmann Publishers Inc. 1137–1143.
33. Le Cessie S, Van Houwelingen JC (1992) Ridge Estimators in Logistic Regression. Applied Statistics 41: 191–201.
- View Article
- Google Scholar
34. McNerney R, Daley P (2011) Towards a point-of-care test for active tuberculosis: obstacles and opportunities. Nat Rev Micro 9: 204–213.
- View Article
- Google Scholar
35. Liam CK, Lim KH, Srinivas P, Poi PJ (1998) Hypercalcaemia in patients with newly diagnosed tuberculosis in Malaysia. Int J Tuberc Lung Dis 2: 818–823.
- View Article
- Google Scholar
36. Ali-Gombe A, Onadeko BO (1997) Serum calcium levels in patients with active pulmonary tuberculosis. Afr J Med Med Sci 26: 67–68.
- View Article
- Google Scholar
37. Sharma OP (2000) Hypercalcemia in granulomatous disorders: a clinical review. Curr Opin Pulm Med 6: 442–447.
- View Article
- Google Scholar
38. Harju E (1989) Clinical pharmacokinetics of iron preparations. Clin Pharmacokinet 17: 69–89.
- View Article
- Google Scholar
39. Fares A (2011) Seasonality of tuberculosis. J Glob Infect Dis 3: 46–55.
- View Article
- Google Scholar
40. Sita-Lumsden A, Lapthorn G, Swaminathan R, Milburn HJ (2007) Reactivation of tuberculosis and vitamin D deficiency: the contribution of diet and exposure to sunlight. Thorax 62: 1003–1007.
- View Article
- Google Scholar
41. Ustianowski A, Shaffer R, Collin S, Wilkinson RJ, Davidson RN (2005) Prevalence and associations of vitamin D deficiency in foreign-born persons with tuberculosis in London. J Infect 50: 432–437.
- View Article
- Google Scholar
42. Zhang J, Wu X, Shi L, Liang Y, Xie Z, et al. (2012) Diagnostic serum proteomic analysis in patients with active tuberculosis. Clinica Chimica Acta 413: 883–887.
- View Article
- Google Scholar
43. Fassbender K, Fassbender M, Schaberg T, Sobieska M, Müller W (1995) Glycosylation of alpha 1-acid glycoprotein in bacterial lung infections: distinct pattern in tuberculosis. Clinical Chemistry 41: 472–473.
- View Article
- Google Scholar
44. Clopper CJ, Pearson ES (1934) The use of confidence or fiducial limits illustrated in the case of the binomal. Biometrika 26: 404–413.
- View Article
- Google Scholar

[ref1] 1. World Health O (2010) Global tuberculosis control : WHO report 2010: World Health Organization.

[ref2] 2. Perkins MD, Kritski AL (2002) Diagnostic testing in the control of tuberculosis. Bull World Health Organ 80: 512–513.
View Article
Google Scholar

[3] View Article

[4] Google Scholar

[ref3] 3. Boehme CC, Nabeta P, Hillemann D, Nicol MP, Shenai S, et al. (2010) Rapid molecular detection of tuberculosis and rifampin resistance. N Engl J Med 363: 1005–1015.
View Article
Google Scholar

[6] View Article

[7] Google Scholar

[ref4] 4. Papay P, Eser A, Winkler S, Frantal S, Primas C, et al. (2011) Factors impacting the results of interferon-gamma release assay and tuberculin skin test in routine screening for latent tuberculosis in patients with inflammatory bowel diseases. Inflamm Bowel Dis 17: 84–90.
View Article
Google Scholar

[9] View Article

[10] Google Scholar

[ref5] 5. Sester M, Sotgiu G, Lange C, Giehl C, Girardi E, et al. (2011) Interferon-gamma release assays for the diagnosis of active tuberculosis: a systematic review and meta-analysis. Eur Respir J 37: 100–111.
View Article
Google Scholar

[12] View Article

[13] Google Scholar

[ref6] 6. Glennon M, Cormican M (2001) Detection and diagnosis of mycobacterial pathogens using PCR. Expert Rev Mol Diagn 1: 163–174.
View Article
Google Scholar

[15] View Article

[16] Google Scholar

[ref7] 7. Balasingham SV, Davidsen T, Szpinda I, Frye SA, Tonjum T (2009) Molecular diagnostics in tuberculosis: basis and implications for therapy. Mol Diagn Ther 13: 137–151.
View Article
Google Scholar

[18] View Article

[19] Google Scholar

[ref8] 8. Miller MB, Popowitch EB, Backlund MG, Ager EP (2011) Performance of Xpert MTB/RIF RUO assay and IS6110 real-time PCR for Mycobacterium tuberculosis detection in clinical samples. J Clin Microbiol 49: 3458–3462.
View Article
Google Scholar

[21] View Article

[22] Google Scholar

[ref9] 9. Nemeth J, Winkler HM, Zwick RH, Rumetshofer R, Schenk P, et al. (2009) Recruitment of Mycobacterium tuberculosis specific CD4+ T cells to the site of infection for diagnosis of active tuberculosis. J Intern Med 265: 163–168.
View Article
Google Scholar

[24] View Article

[25] Google Scholar

[ref10] 10. Getahun H, Kittikraisak W, Heilig CM, Corbett EL, Ayles H, et al. (2011) Development of a Standardized Screening Rule for Tuberculosis in People Living with HIV in Resource-Constrained Settings: Individual Participant Data Meta-analysis of Observational Studies. PLoS Med 8: e1000391.
View Article
Google Scholar

[27] View Article

[28] Google Scholar

[ref11] 11. Cain KP, McCarthy KD, Heilig CM, Monkongdee P, Tasaneeyapan T, et al. (2010) An Algorithm for Tuberculosis Screening and Diagnosis in People with HIV. New England Journal of Medicine 362: 707–716.
View Article
Google Scholar

[30] View Article

[31] Google Scholar

[ref12] 12. Marais BJ, Gie RP, Hesseling AC, Schaaf HS, Lombard C, et al. (2006) A refined symptom-based approach to diagnose pulmonary tuberculosis in children. Pediatrics 118: e1350–1359.
View Article
Google Scholar

[33] View Article

[34] Google Scholar

[ref13] 13. Wilson D, Badri M, Maartens G (2011) Performance of Serum C-Reactive Protein as a Screening Test for Smear-Negative Tuberculosis in an Ambulatory High HIV Prevalence Population. Plos One 6.

[ref14] 14. van’t Hoog AH, Meme HK, Laserson KF, Agaya JA, Muchiri BG, et al. (2012) Screening Strategies for Tuberculosis Prevalence Surveys: The Value of Chest Radiography and Symptoms. Plos One 7.

[ref15] 15. Agranoff D, Fernandez-Reyes D, Papadopoulos MC, Rojas SA, Herbster M, et al. (2006) Identification of diagnostic markers for tuberculosis by proteomic fingerprinting of serum. Lancet 368: 1012–1021.
View Article
Google Scholar

[38] View Article

[39] Google Scholar

[ref16] 16. Thiede B, Hohenwarter W, Krah A, Mattow J, Schmid M, et al. (2005) Peptide mass fingerprinting. Methods 35: 237–247.
View Article
Google Scholar

[41] View Article

[42] Google Scholar

[ref17] 17. Vapnik V (1998) Statistical learning theory: Wiley.

[ref18] 18. Tanaka T, Sakurada S, Kano K, Takahashi E, Yasuda K, et al. (2011) Identification of tuberculosis-associated proteins in whole blood supernatant. BMC Infect Dis 11: 71.
View Article
Google Scholar

[45] View Article

[46] Google Scholar

[ref19] 19. Lange C (2007) [Year in review: tuberculosis 2006]. Pneumologie 61: 518–521.
View Article
Google Scholar

[48] View Article

[49] Google Scholar

[ref20] 20. Walzl G, Ronacher K, Hanekom W, Scriba TJ, Zumla A (2011) Immunological biomarkers of tuberculosis. Nat Rev Immunol 11: 343–354.
View Article
Google Scholar

[51] View Article

[52] Google Scholar

[ref21] 21. Kotsiantis SB, Zaharakis ID, Pintelas PE (2006) Machine learning: a review of classification and combining techniques. Artif Intell Rev 26: 159–190.
View Article
Google Scholar

[54] View Article

[55] Google Scholar

[ref22] 22. Freund Y, Mason L (1999) The Alternating Decision Tree Learning Algorithm. Proceedings of the Sixteenth International Conference on Machine Learning: Morgan Kaufmann Publishers Inc. 124–133.

[ref23] 23. John GHL, P. (1995) Estimating Continuous Distributions in Bayesian Classifiers. In Proceedings of UAI. 338–345.

[ref24] 24. Mitchell TM (1997) Machine Learning: McGraw-Hill, Inc. 154–184.

[ref25] 25. Alpaydin E (2010) Introduction to Machine Learning: The MIT Press. 220–231.

[ref26] 26. Guyon I, Elisseeff A (2006) An Introduction to Feature Extraction Feature Extraction. In: Guyon I, Nikravesh M, Gunn S, Zadeh L, editors: Springer Berlin/Heidelberg. 1–25.

[ref27] 27. Fayyad UM, Irani KB (1993) Multi-Interval Discrertiation of Continuous-Valued Attributes for Classification Learning. Proc Thirteenth International Joint Conference on Artificial Intelligence. Los Altos, CA: Morgan Kaufman Publ. Inc. 1022–1027.

[ref28] 28. Jolliffe IT (2002) Introduction. Principal Component Analysis: Springer New York. 1–9.

[ref29] 29. Witten I, Frank E (2005) Data Mining: Practical Machine Learning Tools and Techniques: Morgan Kaufmann.

[ref30] 30. Fawcett T (2006) An introduction to ROC analysis. Pattern Recogn Lett 27: 861–874.
View Article
Google Scholar

[65] View Article

[66] Google Scholar

[ref31] 31. Refaeilzadeh P, Tang L, Liu H (2009) Cross Validation. Encyclopedia of Database Systems: Springer.

[ref32] 32. Kohavi R (1995) A study of cross-validation and bootstrap for accuracy estimation and model selection. Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2. Montreal, Quebec, Canada: Morgan Kaufmann Publishers Inc. 1137–1143.

[ref33] 33. Le Cessie S, Van Houwelingen JC (1992) Ridge Estimators in Logistic Regression. Applied Statistics 41: 191–201.
View Article
Google Scholar

[70] View Article

[71] Google Scholar

[ref34] 34. McNerney R, Daley P (2011) Towards a point-of-care test for active tuberculosis: obstacles and opportunities. Nat Rev Micro 9: 204–213.
View Article
Google Scholar

[73] View Article

[74] Google Scholar

[ref35] 35. Liam CK, Lim KH, Srinivas P, Poi PJ (1998) Hypercalcaemia in patients with newly diagnosed tuberculosis in Malaysia. Int J Tuberc Lung Dis 2: 818–823.
View Article
Google Scholar

[76] View Article

[77] Google Scholar

[ref36] 36. Ali-Gombe A, Onadeko BO (1997) Serum calcium levels in patients with active pulmonary tuberculosis. Afr J Med Med Sci 26: 67–68.
View Article
Google Scholar

[79] View Article

[80] Google Scholar

[ref37] 37. Sharma OP (2000) Hypercalcemia in granulomatous disorders: a clinical review. Curr Opin Pulm Med 6: 442–447.
View Article
Google Scholar

[82] View Article

[83] Google Scholar

[ref38] 38. Harju E (1989) Clinical pharmacokinetics of iron preparations. Clin Pharmacokinet 17: 69–89.
View Article
Google Scholar

[85] View Article

[86] Google Scholar

[ref39] 39. Fares A (2011) Seasonality of tuberculosis. J Glob Infect Dis 3: 46–55.
View Article
Google Scholar

[88] View Article

[89] Google Scholar

[ref40] 40. Sita-Lumsden A, Lapthorn G, Swaminathan R, Milburn HJ (2007) Reactivation of tuberculosis and vitamin D deficiency: the contribution of diet and exposure to sunlight. Thorax 62: 1003–1007.
View Article
Google Scholar

[91] View Article

[92] Google Scholar

[ref41] 41. Ustianowski A, Shaffer R, Collin S, Wilkinson RJ, Davidson RN (2005) Prevalence and associations of vitamin D deficiency in foreign-born persons with tuberculosis in London. J Infect 50: 432–437.
View Article
Google Scholar

[94] View Article

[95] Google Scholar

[ref42] 42. Zhang J, Wu X, Shi L, Liang Y, Xie Z, et al. (2012) Diagnostic serum proteomic analysis in patients with active tuberculosis. Clinica Chimica Acta 413: 883–887.
View Article
Google Scholar

[97] View Article

[98] Google Scholar

[ref43] 43. Fassbender K, Fassbender M, Schaberg T, Sobieska M, Müller W (1995) Glycosylation of alpha 1-acid glycoprotein in bacterial lung infections: distinct pattern in tuberculosis. Clinical Chemistry 41: 472–473.
View Article
Google Scholar

[100] View Article

[101] Google Scholar

[ref44] 44. Clopper CJ, Pearson ES (1934) The use of confidence or fiducial limits illustrated in the case of the binomal. Biometrika 26: 404–413.
View Article
Google Scholar

[103] View Article

[104] Google Scholar

Figures

Abstract

Background

Methods

Results

Conclusion

Introduction

Materials and Methods

Study Design and Outcome Parameters

Ethics Statement and Statistical Analysis

Results

Evaluation of Diagnostic Algorithm

Development of Extended Diagnostic Algorithms

Discussion

Author Contributions

References