Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

A Model to Discriminate Malignant from Benign Thyroid Nodules Using Artificial Neural Network

  • Lu-Cheng Zhu ,

    Contributed equally to this work with: Lu-Cheng Zhu, Yun-Liang Ye

    Affiliation Department of Radiation Oncology and Chemotherapy, The First Affiliated Hospital of Wenzhou Medical College, Wenzhou, China

  • Yun-Liang Ye ,

    Contributed equally to this work with: Lu-Cheng Zhu, Yun-Liang Ye

    Affiliation Department of Oncology, The First Affiliated Hospital of Wenzhou Medical College, Wenzhou, China

  • Wen-Hua Luo,

    Affiliation Department of Radiation Oncology and Chemotherapy, The First Affiliated Hospital of Wenzhou Medical College, Wenzhou, China

  • Meng Su,

    Affiliation Department of Radiation Oncology and Chemotherapy, The First Affiliated Hospital of Wenzhou Medical College, Wenzhou, China

  • Hang-Ping Wei,

    Affiliation Department of Radiation Oncology and Chemotherapy, The First Affiliated Hospital of Wenzhou Medical College, Wenzhou, China

  • Xue-Bang Zhang,

    Affiliation Department of Radiation Oncology and Chemotherapy, The First Affiliated Hospital of Wenzhou Medical College, Wenzhou, China

  • Juan Wei,

    Affiliation Department of Radiation Oncology and Chemotherapy, The First Affiliated Hospital of Wenzhou Medical College, Wenzhou, China

  • Chang-Lin Zou

    zcl19670115@163.com

    Affiliation Department of Radiation Oncology and Chemotherapy, The First Affiliated Hospital of Wenzhou Medical College, Wenzhou, China

Abstract

Objective

This study aimed to construct a model for using in differentiating benign and malignant nodules with the artificial neural network and to increase the objective diagnostic accuracy of US.

Materials and methods

618 consecutive patients (528 women, 161 men) with 689 thyroid nodules (425 malignant and 264 benign nodules) were enrolled in the present study. The presence and absence of each sonographic feature was assessed for each nodule - shape, margin, echogenicity, internal composition, presence of calcifications, peripheral halo and vascularity on color Doppler. The variables meet the following criteria: important sonographic features and statistically significant difference were selected as the input layer to build the ANN for predicting the malignancy of nodules.

Results

Six sonographic features including shape (Taller than wide, p<0.001), margin (Not Well-circumscribed, p<0.001), echogenicity (Hypoechogenicity, p<0.001), internal composition (Solid, p<0.001), presence of calcifications (Microcalcification, p<0.001) and peripheral halo (Absent, p<0.001) were significantly associated with malignant nodules. A three-layer 6-8-1 feed-forward ANN model was built. In the training cohort, the accuracy of the ANN in predicting malignancy of thyroid nodules was 82.3% (AUROC = 0.818), the sensitivity and specificity was 84.5% and 79.1%, respectively. In the validation cohort, the accuracy, sensitivity and specificity was 83.1%, 83.8% and 81.8%, respectively. The AUROC was 0.828.

Conclusion

ANN constructed by sonographic features can discriminate benign and malignant thyroid nodules with high diagnostic accuracy.

Introduction

Nodular thyroid disease is a common finding in the general population, particularly in iodine-deficient areas. The prevalence of palpable nodules in population is 3% to 4%, and the prevalence of nonpalpable nodules incidentally identified on imaging approaches 40% to 50% after the age of 60 years[1][5]. The diagnosis of thyroid cancer relies on cervical ultrasound and fine-needle aspiration (FNA) biopsy, which collects cells for cytological examination [6], [7]. FNA cytology is currently the most reliable diagnostic tool for evaluation of thyroid nodules. It provides a definitive diagnosis of benign or malignant thyroid disease in most cases. However, in 20% to 30% of nodules, FNA cytology cannot reliably rule out cancer, and such cases are reported as indeterminate for malignancy [8], [9]. To improve the diagnosis accuracy, new diagnostic approaches combined FNA cytology and molecular biomarkers were proposed in recent years [10][12]. In additional, CT and MRI have a limited role in the initial evaluation of solitary nodule and their indications include suspected tracheal involvement, either by invasion or compression, extension into the mediastinum, or recurrent disease[5], [13][15].

Though FNA biopsy can differentiate malignant and benign nodules in most cases, it is an invasive procedure after all and uncomfortable for the patient [14], [15]. Ultrasonography (US) is a powerful imaging technique for identifying thyroid nodules, which are very common in clinical practice. It is a cost-effective, noninvasive, portable, and safe imaging modality in the evaluation for detection of nonpalpable thyroid cancers, it has barely drawbacks expect for its low sensitivity. The incidence of thyroid nodules detected by US ranges from 10% to 67% [16][19]. The great majority of nodules are benign, yet the clinical importance lies in the detection of malignancy, which comprises approximately 2.7–17% of all thyroid nodules. US has been widely used to distinguish benign from malignant nodules using several sonographic characteristics. However, no single ultrasound feature has the adequate diagnostic accuracy for diagnosing malignant nodules.

The artificial neural network (ANN) is a novel computer model inspired by the working of the human brain. It can build nonlinear statistical models to deal with complex biological systems. ANN models have several advantages of over statistical methods. It can rapidly recognize linear patterns, non-linear patterns with threshold impacts, categorical, step-wise linear, or even contingency effects[20]. Analyses by ANN need not start with a hypothesis or a priori identification of potentially key variables, so undocumented or quantitated potential prognostic factors can be determinate if they already exist in the masses of datasets, though they might have been overlooked in the past. It can build nonlinear statistical models to deal with complex biological systems. In recent years, ANN models have been introduced in clinical medicine for clinical validations [21][23]. In this study, we aimed to construct a model for using in differentiating benign and malignant nodules with the artificial neural network and to increase the objective diagnostic accuracy of US.

Materials and Methods

Patients

We conducted our retrospective study extend from January 2010 and December 2012. 618 consecutive patients with 689 thyroid nodules were enrolled in the present study. All the patients had undergone US and US-guided FNAB preoperatively and subsequently undertook surgery. Pathological results were used as the reference standards. The enrollment criteria for the patients were as follows: (1) there was no thyroid diseases history. (2) There was no radiation history on neck. (3) All patients underwent US and FNAB examinations. All the patients were from eastern China (most of the patients were from Wenzhou City) and received primary treatment in our hospital.

Ethics

Written informed consent was obtained from the patient for publication of this report. The study was approved by the Ethics Committee of The First Affiliated Hospital of Wenzhou Medical College, Wenzhou, China.

Clinicopathological and US features

Thyroid ultrasound examinations were performed by two experience technicians with an Acuson Sequoia and 128XP sonographic scanners (Siemens Medical Solutions, Mountain View, CA) equipped with commercially available 8- to 13-MHz linear probes. The following sonographic features were assessed for each nodule: shape, margin, echogenicity, internal composition, presence of calcifications, peripheral halo and vascularity on color Doppler. The shape of the nodule was classified as taller than width measured in transverse dimension or wider than tall. Margins of nodules were categorized as well circumscribed when clear demarcation with normal thyroid was noted, and as not well circumscribed, which included irregular and microlobulated margins. The echogenicity of each nodule was classified as hypo-, iso- or hyperechoic in comparison with the normal background thyroid tissue. A nodule was defined as marked hypoechoic, when a nodule was hypoechoic relative to adjacent strap muscles. The echo structure was defined as solid, cystic or predominantly cystic. Predominantly cystic nodules were those containing cystic components that constituted more than an estimated 50% of the lesion. The presence of micro- and macrocalcifications was documented. Microcalcifications were defined as tiny, punctuate echogenic foci of 1 mm or less either with or without posterior shadowing. Microcalcifications were defined as larger than 1 mm. The vascularity on color Doppler was classified as absent, present flow. The status of nodules was confirmed by a final histological examination after surgery.

Statistical Analysis and Neural Network analysis

Statistical analysis was performed using SPSS 20.0 software (SPSS Inc., Chicago, IL, USA). Continuous variables were expressed by mean ± standard deviation and compared using student's t-test when necessary. Categorical variables were described by proportions or count and compared using proportions chi-square test or the Fisher's exact test when necessary. Univariate analysis was applied to assess the relationship between sonographic features (input variables) and malignancy (output variables). The variables we selected as the input layer to build the ANN for predicting the malignancy of nodules were required to meet the following criteria: important sonographic features and statistically significant difference.

In this study, we built an ANN by using the Matlab 8.0 (The Match Works Inc., Natick, USA) Variables found to be significantly related to the malignancy of nodules were selected to build the ANN. 689 eligible nodules were assigned to a training cohort (n = 464; 67%) and a validation cohort (n = 225; 33%) randomly using rv.bernoulli method. One of the major limitations of ANN is over-training, which can lead to good performance on training sets but poor performance on relatively independent validation sets. To avoid over-training during building of the ANN, 332 patients (72%) were again randomly selected from the training group to train the network and the remaining 132 (28%) were used for cross-validation. The learning mechanism applied on this ANN was BP by calculating the errors between output value and desired output value. Then, the weight of the connections was altered between neurons to decrease the overall errors of the network. Training was terminated when the sum of square errors was at minimum, compared with the cross-validation data set. The activation function, representing the outcomes of ANN, was used with continuous outputs on the interval from 0 to 1, in which 0 = benign, 1 = malignant. To avoid different inter nodules variability, we repeated the process only excluding nodules derived from the same patient and total 561 nodules were taking in the study.

Results

Baseline characteristic of thyroid nodules

A total of 689 thyroid nodules were enrolled in the study. The clinicopathologic data of all patients was listed in Table 1. The size of the nodules ranged from 4 mm to 52 mm (mean size 13.3 mm ±6.5). We found no statistical difference between the benign and malignant nodules with regard to size. A taller than wide shape was found more frequently in malignant nodules (56.5%) than in benign nodules (23.5%). Hypoechogenicity (including the subgroup of markedly hypoechoic nodules) was a sonographic feature to be found in a substantial number of malignant nodules (81.4%). The frequency of hypoechogenicity in benign nodules was low (50.8%). The presence of microcalcifications and intranodular vascularity on Doppler examination in malignant nodules was significantly higher than in benign ones. However, no significant difference was found on intranodular vascularity on Doppler examination between benign and malignant nodules (Table 1). Specific characteristics of the training and validation cohorts used to build and test the ANN are described in Table 2.

thumbnail
Table 1. Patients characteristics and ultrasonographic Features of Benign and Malignant Thyroid Nodules.

https://doi.org/10.1371/journal.pone.0082211.t001

thumbnail
Table 2. Baseline characteristics of the study nodules stratified by ANN cohorts.

https://doi.org/10.1371/journal.pone.0082211.t002

Construction of ANN

As shown in Table 1, the sonographic features including shape (Taller than wide, p<0.001), margin (Not Well-circumscribed, p<0.001), echogenicity (Hypoechogenicity, p<0.001), internal composition (Solid, p<0.001), presence of calcifications (Microcalcification, p<0.001) and peripheral halo (Absent, p<0.001) were significantly associated with malignant nodules at the Univariate analysis which were all used to build the ANN.

Multilayer perceptron (MLP) is one of the most popular and mature ANN architectures with a feed forward neural network where processing neurons are grouped into layers and connected by weighted links. We therefore established an ANN model using MLP. In this present study, MLP included the input, hidden and output layers. Neurons were linked with weighted connections (Fig. 1). In general, the number of input variables and output variables were respectively equal to the number of sonographic features and malignancy of thyroid nodules we set. As Fig. 1 shows, the MLP has six input neurons and one output neuron. After the debugging and testing five times, eight hidden neurons were added to the hidden layer to increase the MLP's performance.

thumbnail
Figure 1. Schematic representation of the artificial neural network developed to distinguish malignancy of thyroid nodules.

https://doi.org/10.1371/journal.pone.0082211.g001

Assessment of the predictive accuracy of ANN

In the training cohort, the accuracy of the ANN in predicting malignancy of thyroid nodules was 82.3% (AUROC = 0.818, 95% CI: 0.780–0.852, p<0.001), the sensitivity, specificity malignancy predictive value (positive predictive value, PPV) and benignity predictive value (negative predictive value, NPV) was 84.5%, 79.1%, 85.7% and 77.5%, respectively (Table 3, Fig. 2). When the ANN was finally evaluated in the validation cohort, the accuracy, sensitivity, specificity, PPV and NPV was 83.1%, 83.8%, 81.8%, 89.9% and 72.4%, respectively (Table 3). The AUROC was 0.828, 95% CI: 0.772–0.875, p<0.001 (Fig. 2). When excluding nodules derived from the same patient, total 561 nodules were enrolled in study, we obtained a similar results (Table 4).

thumbnail
Figure 2. Receiver operating characteristic curve analysis of the predictive accuracy of the models to predict malignancy of thyroid nodules in the training and validation cohorts.

https://doi.org/10.1371/journal.pone.0082211.g002

thumbnail
Table 3. Classification Accuracy of ANN in Training and Validation Groups (689 nodules).

https://doi.org/10.1371/journal.pone.0082211.t003

thumbnail
Table 4. Classification Accuracy of ANN in Training and Validation Groups (561 nodules).

https://doi.org/10.1371/journal.pone.0082211.t004

Discussion

It is well known that none of the single sonographic features allows to differentiate malignant from benign thyroid lesions. However, finding in US image of nodule one or more than one suspicious features, correlates well with the risk of malignancy[19]. In our study, we found that six sonographic features, including shape, margin, echogenicity, internal composition, presence of calcifications and peripheral halo, could be used for the discrimination of the thyroid nodules.

In the study of Kim et al [24], suspicious sonographic features were defined as irregular or microlobulated margin, marked hypoechogenicity, microcalcifications and a shape that was more tall than it was wide. In the presence of even one of these sonographic findings the sensitivity, specificity and accuracy were 93.8%, 66% and 74.8%, respectively. Moon et al [25] evaluated the diagnostic accuracy of US for the depiction of benign and malignant thyroid nodules and found that the US criteria including a shape taller than wide, a spiculated margin, marked hypoechogenicity, microcalcification and macrocalcification were helpful for discrimination of malignant nodules from benign ones. According to their results, the diagnostic accuracy for the nodules one centimeter or less in size was 77% when one of the five malignant findings was used. Other studies [26][28] found the same sonographic features.

Color Doppler sonography can aid in the prediction of thyroid malignancy. Internal flow is suggestive of malignancy [29], [30], but this technique cannot be used to exclude malignancy. According to a previous study[30], 14% of solid non-hypervascular nodules were malignant. In our study, there was no difference found in vascularity between benign and malignant nodules. Therefore, Color Doppler imaging was not used in our study.

Many authors [31][34] reported that the combination of ultrasound features makes the diagnosis of a malignant nodule more probable. In these previous study, each suspicious US feature was summed as the same weight, even though each US feature has a different probability of malignancy. Therefore, the risk of malignancy was higher in a thyroid nodule with one suspicious US feature, such as a microcalcification or microlobulated margin than for a thyroid module with 2 suspicious malignant US features (solid composition and hypoechogenicity). Kwak et al [35] developed a model that the suspicious US feature had a different risk score according to their ORs in thyroid malignancy. However, most of the sonographic features have multidimensional and nonlinear relationship. So it is ideally difficult to predict the malignancy with a conventional statistical technique. Neural networks offer a number of advantages, including requiring less formal statistical training, ability to implicitly detect complex nonlinear relationships between dependent and independent variables, ability to detect all possible interactions between predictor variables, and the availability of multiple training algorithms. However, ANN also requires large amounts of training data and there was no uniform standard in choosing network structure.

In the present study, six sonographic features including shape, margin, echogenicity, internal composition, presence of calcifications and peripheral halo were significantly associated with malignant nodules. Then we built a three-layer 6-8-1 feed-forward ANN model including these six sonographic features as input neurons. In the training cohort, the accuracy of the ANN in predicting malignancy of thyroid nodules was 82.3% (AUROC = 0.818), the sensitivity and specificity was 84.5% and 79.1%, respectively. In the validation cohort, the accuracy, sensitivity and specificity was 83.1%, 83.8% and 81.8%, respectively. The AUROC was 0.828.

In conclusion, ANN constructed by sonographic features can discriminate benign and malignant thyroid nodules with high diagnostic accuracy.

Author Contributions

Conceived and designed the experiments: CLZ. Performed the experiments: LCZ. Analyzed the data: LCZ YLY. Contributed reagents/materials/analysis tools: YLY WHL MS HPW XBZ JW. Wrote the paper: LCZ.

References

  1. 1. Mazzaferri EL (1992) Thyroid cancer in thyroid nodules: finding a needle in the haystack. The American journal of medicine 93: 359–362.
  2. 2. Gharib H (2004) Changing trends in thyroid practice: understanding nodular thyroid disease. Endocrine Practice 10: 31–39.
  3. 3. Rojeski MT, Gharib H (1985) Nodular thyroid disease: evaluation and management. The New England journal of medicine 313: 428–436.
  4. 4. Desforges JF, Mazzaferri EL (1993) Management of a solitary thyroid nodule. New England Journal of Medicine 328: 553–559.
  5. 5. Wiest PW, Hartshorne MF, Inskip PD, Crooks LA, Vela BS, et al. (1998) Thyroid palpation versus high-resolution thyroid ultrasonography in the detection of nodules. Journal of Ultrasound in Medicine 17: 487–496.
  6. 6. Cooper DS, Doherty GM, Haugen BR, Kloos RT, Lee SL, et al. (2009) Revised american thyroid association management guidelines for patients with thyroid nodules and differentiated thyroid cancer: the american thyroid association (ATA) guidelines taskforce on thyroid nodules and differentiated thyroid cancer. Thyroid 19: 1167–1214.
  7. 7. Gharib H, Papini E, Paschke R, Duick D, Valcavi R, et al. (2009) American Association of Clinical Endocrinologists, Associazione Medici Endocrinologi, and European Thyroid Association Medical guidelines for clinical practice for the diagnosis and management of thyroid nodules: executive summary of recommendations. Endocrine practice: official journal of the American College of Endocrinology and the American Association of Clinical Endocrinologists 16: 468–475.
  8. 8. Ohori NP, Schoedel KE (2011) Variability in the atypia of undetermined significance/follicular lesion of undetermined significance diagnosis in the Bethesda System for Reporting Thyroid Cytopathology: sources and recommendations. Acta cytologica 55: 492–498.
  9. 9. Baloch ZW, LiVolsi VA, Asa SL, Rosai J, Merino MJ, et al. (2008) Diagnostic terminology and morphologic criteria for cytologic diagnosis of thyroid lesions: A synopsis of the National Cancer Institute Thyroid Fine-Needle Aspiration State of the Science Conference. Diagnostic cytopathology 36: 425–437.
  10. 10. Tomei S, Marchetti I, Zavaglia K, Lessi F, Apollo A, et al. (2012) A molecular computational model improves the preoperative diagnosis of thyroid nodules. BMC Cancer 12: 396.
  11. 11. Prasad NB, Kowalski J, Tsai HL, Talbot K, Somervell H, et al. (2012) Three-gene molecular diagnostic model for thyroid cancer. Thyroid 22: 275–284.
  12. 12. Nikiforov YE, Nikiforova MN (2011) Molecular genetics and diagnosis of thyroid cancer. Nature Reviews Endocrinology 7: 569–580.
  13. 13. Popowicz B, Klencki M, Lewiński A, Słowińska-Klencka D (2009) The usefulness of sonographic features in selection of thyroid nodules for biopsy in relation to the nodule's size. European Journal of Endocrinology 161: 103–111.
  14. 14. Peccin S, De Castro J, Furlanetto TW, Furtado A, Brasil B, et al. (2002) Ultrasonography: is it useful in the diagnosis of cancer in thyroid nodules? Journal of endocrinological investigation 25: 39–43.
  15. 15. Lin J-D, Chao T-C, Huang B-Y, Chen S-T, Chang H-Y, et al. (2005) Thyroid cancer in the thyroid nodules evaluated by ultrasonography and fine-needle aspiration cytology. Thyroid 15: 708–717.
  16. 16. Harach HR, Franssila KO, Wasenius VM (1985) Occult papillary carcinoma of the thyroid. A “normal” finding in Finland. A systematic autopsy study. Cancer 56: 531–538.
  17. 17. Brander A, Viikinkoski P, Nickels J, Kivisaari L (1991) Thyroid gland: US screening in a random adult population. Radiology 181: 683–687.
  18. 18. Tan GH, Gharib H (1997) Thyroid incidentalomas: management approaches to nonpalpable nodules discovered incidentally on thyroid imaging. Ann Intern Med 126: 226–231.
  19. 19. Frates MC, Benson CB, Charboneau JW, Cibas ES, Clark OH, et al. (2005) Management of Thyroid Nodules Detected at US: Society of Radiologists in Ultrasound Consensus Conference Statement1. Radiology 237: 794–800.
  20. 20. Levine RF (2001) Clinical problems, computational solutions: a vision for a collaborative future. Cancer 91: 1595–1602.
  21. 21. Hong WD, Ji YF, Wang D, Chen TZ, Zhu QH (2011) Use of artificial neural network to predict esophageal varices in patients with HBV related cirrhosis. Hepat Mon 11: 544–547.
  22. 22. Vahdani P, Alavian SM, Aminzadeh Z, Raoufy MR, Gharibzadeh S, et al. (2009) Using Artificial Neural Network to Predict Cirrhosis in Patients with Chronic Hepatitis B Infection with Seven Routine Laboratory Findings. Hepat Mon 9: 271–275.
  23. 23. Lai KC, Chiang HC, Chen WC, Tsai FJ, Jeng LB (2008) Artificial neural network-based study can predict gastric cancer staging. Hepatogastroenterology 55: 1859–1863.
  24. 24. Kim EK, Park CS, Chung WY, Oh KK, Kim DI, et al. (2002) New sonographic criteria for recommending fine-needle aspiration biopsy of nonpalpable solid nodules of the thyroid. AJR Am J Roentgenol 178: 687–691.
  25. 25. Moon W-J, Jung SL, Lee JH, Na DG, Baek J-H, et al. (2008) Benign and Malignant Thyroid Nodules: US Differentiation–Multicenter Retrospective Study1. Radiology 247: 762–770.
  26. 26. Cappelli C, Castellano M, Pirola I, Cumetti D, Agosti B, et al. (2007) The predictive value of ultrasound findings in the management of thyroid nodules. QJM 100: 29–35.
  27. 27. Cappelli C, Castellano M, Pirola I, Gandossi E, De Martino E, et al. (2006) Thyroid nodule shape suggests malignancy. Eur J Endocrinol 155: 27–31.
  28. 28. Chan BK, Desser TS, McDougall IR, Weigel RJ, Jeffrey RB Jr (2003) Common and uncommon sonographic features of papillary thyroid carcinoma. J Ultrasound Med 22: 1083–1090.
  29. 29. Papini E, Guglielmi R, Bianchini A, Crescenzi A, Taccogna S, et al. (2002) Risk of malignancy in nonpalpable thyroid nodules: predictive value of ultrasound and color-Doppler features. J Clin Endocrinol Metab 87: 1941–1946.
  30. 30. Frates MC, Benson CB, Doubilet PM, Cibas ES, Marqusee E (2003) Can color Doppler sonography aid in the prediction of malignancy of thyroid nodules? J Ultras Med 22: 127–131.
  31. 31. Tae HJ, Lim DJ, Baek KH, Park WC, Lee YS, et al. (2007) Diagnostic value of ultrasonography to distinguish between benign and malignant lesions in the management of thyroid nodules. Thyroid 17: 461–466.
  32. 32. Ozel A, Erturk SM, Ercan A, Yilmaz B, Basak T, et al. (2012) The diagnostic efficiency of ultrasound in characterization for thyroid nodules: how many criteria are required to predict malignancy? Med Ultrason 14: 24–28.
  33. 33. Paschke R, Hegedus L, Alexander E, Valcavi R, Papini E, et al. (2011) Thyroid nodule guidelines: agreement, disagreement and need for future research. Nat Rev Endocrinol 7: 354–361.
  34. 34. Bo YH, Ahn HY, Lee YH, Lee YJ, Kim JH, et al. (2011) Malignancy rate in sonographically suspicious thyroid nodules of less than a centimeter in size does not decrease with decreasing size. J Korean Med Sci 26: 237–242.
  35. 35. Kwak JY, Jung I, Baek JH, Baek SM, Choi N, et al. (2013) Image reporting and characterization system for ultrasound features of thyroid nodules: multicentric Korean retrospective study. Korean J Radiol 14: 110–117.