Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Multiclass classification of autofluorescence images of oral cavity lesions based on quantitative analysis

  • Ming-Jer Jeng,

    Roles Conceptualization, Investigation, Methodology, Writing – review & editing

    Affiliations Department of Electronic Engineering, Chang Gung University, Taoyuan, Taiwan, Department of Otolaryngology-Head and Neck Surgery, Chang Gung Memorial Hospital, Linkou, Taiwan

  • Mukta Sharma,

    Roles Data curation, Formal analysis, Methodology, Software, Writing – original draft

    Affiliation Department of Electronic Engineering, Chang Gung University, Taoyuan, Taiwan

  • Ting-Yu Chao,

    Roles Data curation, Resources

    Affiliation Department of Electronic Engineering, Chang Gung University, Taoyuan, Taiwan

  • Ying-Chang Li,

    Roles Investigation, Resources

    Affiliation Green Technology Research Center, Chang Gung University, Taoyuan, Taiwan

  • Shiang-Fu Huang ,

    Roles Data curation, Investigation, Resources, Supervision, Writing – review & editing

    shiangfu.huang@gmail.com (SFH); liann@mail.cgu.edu.tw (LBC)

    Affiliations Department of Otolaryngology-Head and Neck Surgery, Chang Gung Memorial Hospital, Linkou, Taiwan, Department of Public Health, Chang Gung University, Taoyuan, Taiwan

  • Liann-Be Chang ,

    Roles Investigation, Project administration, Supervision, Writing – review & editing

    shiangfu.huang@gmail.com (SFH); liann@mail.cgu.edu.tw (LBC)

    Affiliations Department of Otolaryngology-Head and Neck Surgery, Chang Gung Memorial Hospital, Linkou, Taiwan, Green Technology Research Center, Chang Gung University, Taoyuan, Taiwan

  • Lee Chow

    Roles Project administration, Supervision

    Affiliation Department of Physics, University of Central Florida, Orlando, FL, United States of America

Abstract

Background

Oral cancer is one of the most common diseases globally. Conventional oral examination and histopathological examination are the two main clinical methods for diagnosing oral cancer early. VELscope is an oral cancer-screening device that exploited autofluorescence. It yields inconsistent results when used to differentiate between normal, premalignant and malignant lesions. We develop a new method to increase the accuracy of differentiation.

Materials and methods

Five samples (images) of each of 21 normal mucosae, as well as 31 premalignant and 16 malignant lesions of the tongue and buccal mucosa were collected under both white light and autofluorescence (VELscope, 400-460 nm wavelength). The images were developed using an iPod (Apple, Atlanta Georgia, USA).

Results

The normalized intensity and standard deviation of intensity were calculated to classify image pixels from the region of interest (ROI). Linear discriminant analysis (LDA) and quadratic discriminant analysis (QDA) classifiers were used. The performance of both of the classifiers was evaluated with respect to accuracy, precision, and recall. These parameters were used for multiclass classification. The accuracy rate of LDA with un-normalized data was increased by 2% and 14% and that of QDA was increased by 16% and 25% for the tongue and buccal mucosa, respectively.

Conclusion

The QDA algorithm outperforms the LDA classifier in the analysis of autofluorescence images with respect to all of the standard evaluation parameters.

Introduction

Oral cancer is the sixth common malignancy globally; it is closely associated with smoking, drinking alcohol, chewing tobacco and chewing betel quid. The most common histology of oral cancer is squamous cell carcinoma [1]. In males, it is the most commonly found cancer in the head and neck, accounting for 90% of the 300,000 newly diagnosed oral malignancies every year [2]. According to Stewart [3], about 60% of the new oral cancer cases and 68% of the deaths associated with oral cancer are reported in Asia. In Taiwan, oral cancer is the fifth most common cause of death and the fourth most common among males [4]. Oral potentially malignant disorders (OPMDs) are mucosal lesions that have high potential to develop oral cavity squamous cell carcinomas (OCSCCs), including leukoplakia, erythroplakia, erythroleukoplakia, and submucosal fibrosis [5]. OCSCCs are typically diagnosed late, resulting in an overall five-year survival rate of 50% [6]. The early detection and timely treatment of premalignancy may prevent the transformation of OPMDs into oral cancer [7]. During this transformation process, tissue structures in the squamous epithelium and metabolism are distorted [8]. Collagen and elastin are degraded during the tumor development and invasion. This distortion is observed in histopathological examinations [9, 10]. The most frequently used method for early diagnosis is oral screening [1113]. Although biopsies and histopathological examinations are the gold standard for diagnosing oral cancer [14], biopsy is an invasive and time consuming. It requires an incision in the tissues. These shortcomings have led clinicians to shift to non-invasive techniques such as vital staining, light-based detection and the use of optical diagnostic techniques [15].

This work focuses on a non-invasive and light-based detection device, VELscope, which exploits autofluorescence. Autofluorescence is the natural emission of light by biological structures. These structures assimilate light and can be differentiated by the light that originates from artificially added fluorescent markers. These markers become fluorescent upon excitation by light with wavelength 375-460 nm [16, 17]. Autofluorescence phenomena can easily be used to detect metabolic transformation in tissue structures and are therefore useful in screening for oral cancer [11]. In this study, premalignant and malignant lesions are differentiated to avoid the need for invasive biopsies, this non-invasive technique is better used for surveillance to determine whether a biopsy is needed but these methods can never replace the biopsy.

The VELscope (Visually Enhanced Lesion Scope) is a handheld device that increases the visibility of the oral membrane abnormalities by activating tissue fluorescence. It uses direct tissue autofluorescence with a wavelength of blue light (between 400 nm and 460 nm) to enhance the visibility of oral mucosal abnormalities, which are never visible under white light [18]. At these wavelengths, the normal oral mucosa is associated with a pale green fluorescence as viewed through a filter and abnormal tissue is associated with a loss of autofluorescence and appears dark. Many works [1926] have evaluated the efficacy of the VELscope by direct comparing VELscope results with the biopsy reports. VELscope has been shown to have high sensitivity and to assist in the detection of oral lesions. However, it can not effectively differentiate between high-risk and low-risk lesions. Awan [19] demonstrated the relatively high sensitivity, 84% and a low specificity 15%, in differentiating high-risk lesions from benign lesions. Similarly, Ganga [26] obtained sensitivity and specificity values of 76% and 66.29%, respectively. They suggested that VELscope has reasonable sensitivity, but yields many false-positive results. Huang et al. [27] proposed the use of quantitative analysis(quadratic discriminant analysis, QDA) to quantify the classification of VELscope images (autofluorescence images) by their intensity and heterogeneity. They used QDA as a method of discriminant analysis classification to differentiate between normal and abnormal (malignant/premalignant) lesions of oral mucosa. They successfully differentiated between abnormal and normal lesions with higher specificity and good sensitivity. One major limitation of their study was evident in differentiating between malignant and premalignant lesions. Therefore, in this work, three groups of patients were recruited, having normal, premalignant and malignant lesions. Information in the image was normalized first and then linear discriminant analysis (LDA) and QDA were used for classification.

Materials and methods

Patients and samples

This study was approved by the Institutional Review Board (IRB) of Chang Gung Medical Foundation (IRB No: 201800420B0), Taiwan. It was performed in the Department of Otolaryngology-Head and Neck Surgery, with the written and informed consent of the enrolled participants. Clinical data and pathological reports were collected at Chang Gung Hospital for analysis. The patients with malignancy and premalignancy received a biopsy after the VELscope autofluorescence images were captured. Thirty-one and sixteen patients had premalignant and malignant lesions, respectively. Images of the tongues 11 healthy students at Chang Gung University and of the buccal mucosa of 10 such students were obtained. None of these healthy people habitually smoked, consumed alcohol or chewed betel quid. A total of 340 (68*5) images were analyzed, as shown in Table 1. Table 2 summarizes the demographics of the 47 patients who were classified by their lesions at subsites of the tongue and buccal mucosa. The tongues had 11 premalignant and eight malignant lesions and the buccal mucosae had 20 premalignant and eight malignant lesions.

thumbnail
Table 1. Total fluorescence images studied under VELscope.

https://doi.org/10.1371/journal.pone.0228132.t001

The premalignant lesions included leukoplakia and erythroplakia, which have higher risks of malignant transformation. The malignant lesions were all squamous cell carcinoma, as established by the biopsies. Five autofluorescence images were obtained under white and blue light at various angles and intensities in each case to increase the accuracy of our analysis. The images under blue light were captured using a VELscopeVX (LED Dental and Apteryx, Atlanta Georgia, USA) with an iPod (Apple, Atlanta Georgia, USA) touch, which had eight million pixels.

Methodology

The clinician(Dr. Huang, S.F.) identified the region of interest(ROI) was selected within the lesion(premalignant/malignant) in each image. The analysis considered the same ROIs in the images obtained from the healthy persons. As shown in Fig 1, in the images of the tongue and buccal mucosa, the ROI is shown in a circle. The RGB images are converted into the gray-level images. The reduction of complexity (from 3D pixel to 1D pixel) of the calculation without changing the intensity of the image is important consideration. The following steps were taken and shown in Fig 2.

  • Convert the RGB image into a gray-level image.
  • Calculate the average intensity and standard deviation of the selected ROI.
  • Normalization of ROI by calculating the normalized average intensity and standard deviation of intensity.
  • Feed the above parameters into the LDA and QDA classifier to identify normal, premalignant and malignant lesions.
thumbnail
Fig 1. The white light and VELscope autofluorescence images of the tongue and buccal mucosa in selected ROI with (a) Normal, (b) Premalignant and (c) Malignant lesion.

https://doi.org/10.1371/journal.pone.0228132.g001

Under VELscope, the intensity of normal lesions is commonly higher than that of malignant lesions. In the work of Ganga [26], three fluorescence phenomena were observed under VELscope; they were FVI(fluorescence visualization increase), FVR(fluorescence visualization retained) and FVL(fluorescence visualization loss). All three phenomena were observed in the obtained images. The normal mucosa appears as pale green and fluorescence visualization is retained, so the mucosa is referred to as FVR. The malignant lesions exhibit FVL due to a loss of autofluorescence. The premalignant lesions exhibit either FVI or FVL. The images under VELscope were associated with green color. Fluorescence is the emission of light, causing energy to be lost. The tissue absorbs blue light but emits green light. Therefore, a quantitative analysis of green pixel values was performed, as described below. The average intensity and standard deviation of intensity provide information on the brightness and heterogeneity of selected ROIs. The surface of a malignant lesion was not as homogeneous as that of normal mucosa. Therefore, the standard deviation was calculated along with the intensity to determine the heterogeneity of each ROI. Eqs 1 and 2 give the mean intensity (μ) of an image and the standard deviation of intensity (σ). Where Ai is the array of R, G, B or gray pixel values of in the ROI and N is the number of pixels in the ROI. To normalize the average intensity and the standard deviation, Eqs 3 and 4 are used, where (μN) and (σN) are the normalized average intensity and standard deviation of intensity, respectively. The μROI and σROI are the average intensity and standard deviation in the ROI, respectively. (1) (2) (3) (4)

Normalization

The normalized intensity and the standard deviation of intensity were used to neutralize the autofluorescence in the ROI, which have arisen from the parts of the image outside the ROI. Non-ROI and unwanted autofluorescence, which may affect the ROI in the image; it can include teeth, supporting device, prosthesis, or filling materials. Normalization eliminated the redundant data and increased the variability and separability of the classifiers. The results of the quantitative analysis are validated by comparing normalized and un-normalized data.

Data analysis

Many statistical methods, such as principal component analysis (PCA), LDA and QDA are used in pattern discrimination or classification, dimension reduction and pattern recognition. PCA finds the directions of maximal variance and can be used only in two-class classification. LDA targets class separability and maximizes the component axis for class separation [28]. LDA is the enhanced version of PCA and is frequently used in medical imaging, especially in pathological examinations to find a linear combination of features that separates several categories of objects [29]. QDA is an extension of, and more flexible than, LDA. LDA assumes a single variance-covariance matrix over all classes whereas the QDA assumes different variance-covariance matrices for each class. LDA yields a linear boundary whereas QDA produces a quadratic boundary of the classifier, resulting in substantially greater variance because it is quadratic [30]. Both classifiers are very helpful in medical applications [31, 32]. Therefore, the effectiveness of both in distinguishing among normal, premalignant and malignant lesions is studied herein. QDA and LDA assume that every category of information exhibits multivariate Gaussian distributions. The equation for multivariate Gaussian distribution for class n is Eq 5, where k is a dimension; ∑n is the covariance of class n and μn is the mean of class n. Eqs 6 and 7 are the LDA and QDA functions. (5) (6) (7) Sensitivity and specificity are commonly used parameters in binary classification. However, multiclass classification is used herein. The performance of each classifier for tongue and buccal mucosa was evaluated using such parameters as precision, recall, f1-score and accuracy. Precision is the positive predictive value (PPV), which is defined as the number of relevant cases as a proportion of the retrieved cases. Recall is the true positive rate (TPR) or sensitivity, which is defined as the number of relevant cases that have been retrieved divided by the total number of relevant cases. The F1-score is the harmonic mean of precision and recall, accuracy is the proportional precision in a classification system. The definitions are given by Eqs 8 to 11. Here, TP is True Positive (an abnormal lesion is categorized correctly as abnormal); TN is True Negative (a normal is categorized correctly as normal); and FP is False Positive (a normal is categorized wrongly as abnormal); FN is False Negative (an abnormal is categorized wrongly as normal). The confusion table provides the numbers of correct and incorrect predictions. Abnormalities include any premalignant or malignant lesion. Therefore, TP and TN correspond to true or correct classification and FP and FN correspond to false or incorrect classification. (8) (9) (10) (11)

Results

All the participants were registered between July 2017 and September 2018, having provided written and informed consent. Biopsies confirmed that 16 and 31 patients had malignant and premalignant, respectively. Twenty-one healthy participants were enrolled. Table 1 provides information about the patients. Fig 1 shows the white-light and autofluorescence images. The clinician (Dr. Huang S.F.) selected all ROIs. The intensity and standard deviation of intensity of all autofluorescence images were recorded and analyzed. The standard deviations revealed the heterogeneity of lesions. The intensity and standard deviation of intensity among adjacent non-ROIs were recorded and used in normalization. Intensity and standard deviation of intensity were analyzed using multiclass classification algorithms.

For multiclass classification, LDA and QDA classifiers were used. To prevent over-fitting, 75% of the data were used in training and the remaining 25% were used in testing. k-Fold cross-validation (k = 2) were used with the LDA and QDA classifiers to categorize the lesions in autofluorescence images. In k-fold cross-validation, the data were divided into k equal folds. Then, the algorithm was trained for using k-1 folds, while the remaining fold was used as the test set. The performance was aggregated across k folds. Each classifier yielded confusion and performance tables. Decision boundary curves were developed separately for the tongue and buccal mucosa. Each row of the confusion table represented instances in a predicted class, and each column represented instances in an actual class. The diagonal and non-diagonal elements represented true/correct and false/incorrect predictions, respectively. Normalized and un-normalized data were analyzed. The un-normalized data yielded poorer differentiation than the normalized data, indicating that the autofluorescence of the ROI was influenced by the non-ROI regions. Precision and recall are the two important parameters in multiclass classification. Accuracy is not an adequate parameter for evaluating the performance evaluation of a predictive model, owing to the accuracy paradox [33]. Therefore, precision and recall are used as the critical performance parameters in our analysis. Both LDA and QDA had the potential to differentiate the three classes (N, M, and PM) but the QDA classifier yielded better results than the LDA perhaps because the latter is data-independent, whereas the former is data-dependent.

One hundred and fifty images of the tongue subsite were obtained, of which 112 were used for training and 38 were used for testing. LDA yielded a total of 74 (24+12+38) and 97 (36+31+30) true or correctly classified cases, and 38 (15+18+5) and 15 (4+3+8) false or incorrectly classified cases, with un-normalized and normalized training data, respectively, as shown in Table 3. QDA yielded a total of as 81 (28+17+36) and 99 (36+31+32) true or correctly classified cases, and 31 (11+13+7) and 13 (4+3+6) false or incorrectly classified cases, with un-normalized and normalized training data, respectively. Similar results were obtained with testing data [Table 3]. In evaluating the LDA, normalized training and testing data yielded 21% and 27% greater precision than un-normalized data, respectively [Table 4]. For QDA, the corresponding values were 16% and 20%, respectively. QDA with normalized testing and training data differentiated between tongue lesions with 86% and 88% accuracy, respectively; with un-normalized training and testing data, the accuracy rates were 72% and 68%, respectively. The decision boundary curves in Fig 3(a) to 3(d) revealed more misclassification with un-normalized data than with normalized data. Fig 3(a) and 3(b) show linear boundaries among three classes, based on differences in intensity and standard deviation of intensity, obtained using normalized and un-normalized data. Fig 3(c) and 3(d) show quadratic boundaries, which clearly differentiated the three categorizes.

thumbnail
Fig 3. For tongue (a) Un-normalized and (b) Normalized data were plotted using LDA classifier, whereas (c) Un-normalized and (d) normalized data were plotted using QDA classifier.

The misclassification exhibited as a star and the big dark circles represented the mean of each class. The results showed that the linear boundary ((a) and (b)) and quadratic boundary ((c) and (d)) can discriminate among normal (N), malignant (M) and premalignant lesions (PM).

https://doi.org/10.1371/journal.pone.0228132.g003

For the buccal mucosa, 142 and 48 images (total 190 images) were used for training and testing data, respectively. LDA yielded a total of 78 (10+11+57) and 111 (21+22+68) true or correctly classified cases and 64 (30+ 19+ 15) and 31 (17+7+7) false or incorrectly classified cases, using un-normalized and normalized training data, respectively, as shown in Table 5. QDA yielded 85 (29+ 4+ 52) and 120(35+20+65) true or correctly classified cases and 57 (11+26+20) and 22 (3+9+10) false or incorrectly classified cases, using un-normalized and normalized training data, respectively. Similar results were obtained using the testing data [Table 5]. For LDA, the normalized training and testing data yielded 25% and 9% greater precision, respectively, than the un-normalized data [Table 6]. For QDA, the corresponding values are 28% and 16%, respectively, as shown in Table 6. The QDA differentiated tongue lesions with 72% and 84% accuracy with normalized testing and training data, respectively, and 56% and 59% accuracy with un-normalized training and testing data, respectively. In Fig 4(a) to 4(d), the decision boundary curves reveal more misclassification with un-normalized data than with normalized data. Fig 4(a) and 4(b) show linear boundaries among three classes, based on differences in intensity and standard deviation of intensity, with un-normalized and normalized data. Fig 4(c) and 4(d) show quadratic boundaries, which clearly differentiated among the three categorizes.

thumbnail
Fig 4. For buccal mucosa (a) Un-normalized and (b) Normalized data were plotted using LDA classifier, whereas (c) Un-normalized and (d) normalized data were plotted using QDA classifier.

The misclassification exhibited as a star and the big dark circles represented the mean of each class. The results showed that the linear boundary ((a) and (b)) and quadratic boundary ((c) and (d)) can discriminate among normal (N), malignant (M) and premalignant lesions (PM).

https://doi.org/10.1371/journal.pone.0228132.g004

thumbnail
Table 5. Confusion table of LDA and QDA for buccal mucosa.

https://doi.org/10.1371/journal.pone.0228132.t005

To visualize the multiclass classification performance, ROC (Receiver Operating Characteristic) curves were generated for both classifiers and drawn between the true positive rate and the false positive rate (1-specificity). An ROC curve is a probability curve and the AUC (Area Under the Curve) represents the degree or measure of separability. This can distinguish among classes. In Fig 5, ROC curves are obtained using the LDA and QDA classifiers and normalized data for the tongue and buccal mucosa. They show that the classification separability achieved using QDA exceeds that achieved using LDA. This study focused on the early diagnosis of oral cancer to identify premalignancy. For the tongue, a premalignant classification separability of 93% was achieved using the LDA classifier and of 95% using the QDA classifier, as shown in Fig 5(a) and 5(b), respectively. For the buccal mucosa, a premalignant classification separability of 81% was achieved using the LDA classifier and of 84% using the QDA classifier, as shown in Fig 5(c) and 5(d), respectively.

thumbnail
Fig 5. ROC curves were plotted (a) For normalized data of tongue using LDA and (b) QDA classifier (c) For normalized data of buccal mucosa using LDA and (d) QDA classifier.

Where N is normal, PM is premalignant and M is malignant. Macro-average ROC curve gives the average of the three classes.

https://doi.org/10.1371/journal.pone.0228132.g005

Discussion

Autofluorescence is used in a fast and non-invasive method for detecting malignant and premalignant lesions. Many new diagnostic techniques and instruments are routinely used; they include Vizi-Lite (Zila Pharmaceuticals, Phoenix, AZ), Identafi (DentalEZ, PA, USA), Narrow band imaging (NBI; Olympus Medical Systems Corporation, Tokyo, Japan) and VELscope (LED Medical Diagnostics Inc., Burnaby Canada) [34, 35]. Vizi-Lite is an old device that is based on chemical autofluorescence. Identafi uses multispectral fluorescence to facilitate intraoral examination. NBI is an endoscopic technique to enhance the visualization of oral mucosal abnormalities and underlying vasculature. It uses a specific wavelength of light to detect the vasculature in submucosa [36, 37]. Scientists from the British Columbia Cancer Agency (BCCA) developed the VELscope, in which an extrinsic light source (400-460nm) is used to excite endogenous fluorophores. When this exogenous light interacts with the oral mucosa, the fluorophores absorb a photon of this light and emit a lower-energy photon; this process is known as fluorescence. Under this light, normal oral mucosa emits green autofluorescence, whereas the abnormal mucosa appears dark owing to the loss of autofluorescence(LAF) [38]. The two most diagnostically important endogenous fluorophores are Nicotinamide Adenine Dinucleotide (NADH) and Flavin Adenine Dinucleotide (FAD). They are used to monitor dramatic metabolic changes in cells and tissues. The oxidation-reduction (redox) ratio or NADH/FAD is used to measure cellular metabolism [39]. Each fluorophore has a specific excitation and emission spectrum. Fluorophores are not distributed uniformly in tissues; rather, each tissue contains a greatly varying concentration of fluorophores [40]. Although many optical diagnostic methods exist, none has yet been identified as having the best identification rate.

This work focused on diagnosing oral lesions in the preliminary stage before they become malignant. It presented a multiclass classification approach, which successfully differentiates among premalignant, malignant and normal tissues. VELscope autofluorescence images were captured and the normalized average intensities and standard deviation of intensities in ROIs were calculated. Intensity-based classification alone is not good enough. Huang TT et al. developed a method that VELscope autofluorescence images by QDA [27]. They calculated the intensity and heterogeneity of the ROI in the images. They used the QDA classifier to distinguish normal tissues from the premalignant and malignant lesions. They effectively classified normal and abnormal tissues. However, they did not differentiate between the premalignancies and the malignancies. In the present study, in contrast, premalignant and malignant lesions were successfully differentiated, as were normal and abnormal tissues, using a novel normalization technique. Currently, no quantitative analytical method of multiclass classification method exists for diagnosing oral cancer. Our method is helpful for detecting early-stage oral cancer. The detection of more OPMDs corresponds to a higher survival rate. The five-year survival rate for patients with late-stage oral cancer is only 20% and that for those with early-stage premalignant-stage oral cancer is about 82% [41].

In this work, most premalignant cases are related to leukoplakia and erythroplakia, which are the most common OPMDs. The fluorescent intensity of leukoplakia lesions exceeded that of the normal tissues, owing to FVI autofluorescence. Erythroleukoplakia lesions are associated with both FVL and FVI autofluorescences. They yielded a mixed autofluorescence signal and appeared dark brown in images. Erythroplakia is associated with a loss of fluorescence while leukoplakia in the periphery presents as FVI. Erythroleukoplakia can present as dark brown under VELscope. Malignant lesions usually exhibit a loss of autofluorescence (fluorescence visualization loss or FVL) and have appeared dark black. Sometimes, malignant lesions did not appear as darker they were formed by transformation from leukoplakia, but they then fluoresced like a premalignant lesion. Therefore, different types of lesion do not exhibit the same autofluorescence because they vary in location and morphology [42]. Quantitative analysis is required because VELscope alone does not provide appropriate useful information about the type of lesion. Heterogeneity is also important in analyzing an image, as it can be used to distinguish between normal and abnormal lesions. Therefore, variations in intensity and standard deviation of intensity were quantitatively analyzed for normal, premalignant and malignant lesions. The quantitative analysis involved multiclass classification with normalization. The two subsites (tongue and buccal mucosa) of oral mucosa yielded separate results, unlike in other research. The method successfully differentiated among normal (N), premalignant (PM) and malignant (M) lesions. LDA and QDA were used for classification. In this study, intensity and standard deviation of intensity in the malignant lesions differed between the subsites. Generally, tumor lesions exhibited variation in intensity and heterogeneity owing to the different locations of the tumors, but for the tongue (Fig 3(b) and 3(d)), more of the tumors were in stage T1 than in stage T2 or T4 (as indicated in the patient demographics table). Therefore, the intensity and standard deviation vary greatly on tumor subsites on the tongue. For the buccal mucosa (Fig 4(b) and 4(d)), more of the tumors were in stage T2 than in stage T1 or T4. Owing to the variation in the distribution of tumor stages, the standard deviation of intensity in malignant regions exceeded those in normal and premalignant regions. Image normalization has an integral role in any image based analysis and reduces variability among samples even when the experimental conditions are perfect [43]. The normalization technique that was used herein adequately improves the evaluation parameters and feature extraction without compromising the basic features in the images. The results of the quantitative analysis are validated by comparing normalized and un-normalized samples. We hope that this approach can further improve the differentiation between premalignant and malignant lesions. However, VELscope without quantitative analysis is not effective for such differentiation. The quantitative and multiclass classification approaches herein successfully distinguish premalignant lesions from normal and malignant lesions. Based on an earlier study, we can conclude that different parts of oral mucosa (tongue, buccal, gingiva, hard palate, soft palate) have different percentages of collagen and elastin [44]. Variations in collagen and elastin compositions among subsites of the oral mucosa are analyzed quantitatively. The quantitative analysis at a specific subsite cannot be generalized to all subsites of the oral mucosa. In this work, tongue and buccal mucosa were analyzed because they are the most frequent tumor subsites in patients in Taiwan owing to the popularity of smoking and the chewing of areca quid. All cases that involve the tongue involve the lateral subsite. The alteration at other subsites requires further investigation.

The first limitation of this study is that it is retrospective. A limited number of cases of premalignancy/malignancy were used. To increase sensitivity and specificity, a huge data set must be analyzed. Another inherent limitation of VELscope is that it cannot easily capture images of all interior subsites of the oral mucosa, including the hard palate and the retromolar.

This study provides a baseline definition of the boundary between surgical and tumor-free resection margin. This work also can help surgeons to determine the type of lesion. In the future, we will use other optical technologies to identify different stages (more than two) of oral cancer.

Conclusion

Without quantitative analysis, VELscope cannot be used definitively to identify dysplastic tissue changes. This work applied normalization to autofluorescence images to calculate its normalized average intensity and standard deviation of its intensity. Multiclass classifiers (LDA and QDA) were used to generate decision boundary curves. QDA classifies normal, premalignant and malignant lesions with greater accuracy, precision, recall and f1-score, which have seldom been considered in the literature. Although these methods cannot completely replace biopsies, they are helpful to clinicians in detecting oral cancer early. In the future, we will introduce other classifiers with light sources with many wavelengths to increase the accuracy of detection.

Acknowledgments

The authors would like to thank Chang Gung University for support and the students who participated in this study.

References

  1. 1. Reddy SS, Sharma S, Mysorekar V. Expression of Epstein–barr virus among oral potentially malignant disorders and oral squamous cell carcinomas in the South Indian tobacco-chewing population. Journal of Oral Pathology … Medicine. 2017;46(6):454–459.
  2. 2. Kumar R, Samal SK, Routray S, Dash R, Dixit A. Identification of oral cancer related candidate genes by integrating protein-protein interactions, gene ontology, pathway analysis and immunohistochemistry. Scientific Reports. 2017;7(1):2472. pmid:28559546
  3. 3. Stewart B, Wild CP, et al. World cancer report 2014. Self. 2018
  4. 4. Lingen MW, Kalmar JR, Karrison T, Speight PM. Taiwan’s Leading Causes of Death in 2016. Ministry of Health and Welfare in Taiwan. 2016.
  5. 5. Lingen MW, Abt E, Agrawal N, Chaturvedi AK, Cohen E, D’Souza G, et al. Evidence-based clinical practice guideline for the evaluation of potentially malignant disorders in the oral cavity: a report of the American Dental Association. The Journal of the American Dental Association. 2017;148(10):712–727. pmid:28958308
  6. 6. Allen CT, Law JH, Dunn GP, Uppaluri R. Emerging insights into head and neck cancer metastasis. Head … neck. 2013;35(11):1669–1678.
  7. 7. Mortazavi H, Baharvand M, Mehdipour M. Oral potentially malignant disorders: an overview of more than 20 entities. Journal of dental research, dental clinics, dental prospects. 2014;8(1):6. pmid:25024833
  8. 8. Hsu PP, Sabatini DM. Cancer cell metabolism: Warburg and beyond. Cell. 2008;134(5):703–707. pmid:18775299
  9. 9. Velidandla S, Gaikwad P, Ea lla KKR, Bhorgonde KD, Hunsingi P, Kumar A. Histochemical analysis of polarizing colors of collagen using Picrosirius Red staining in oral submucous fibrosis. Journal of international oral health: JIOH. 2014;6(1):33. pmid:24653600
  10. 10. Tosios KI, Economou I, Vasilopoulos NN, Koutlas IG. Elastofibromatous changes and hyperelastosis of the oral mucosa. Head and neck pathology. 2010;4(1):31–36. pmid:20237986
  11. 11. Burzynski NJ, Firriolo FJ, Butters JM, Sorrell CL. Evaluation of oral cancer screening. Journal of Cancer Education. 1997;12(2):95–99. pmid:9229272
  12. 12. Rethman MP, Carpenter W, Cohen EE, Epstein J, Evans CA, Flaitz CM, et al. Evidence-based clinical recommendations regarding screening for oral squamous cell carcinomas. The Journal of the American Dental Association. 2010; 141 (5):509–520. pmid:20436098
  13. 13. Skandarajah A, Sunny SP, Gurpur P, Reber CD, D’Ambrosio MV, Raghavan N, et al. Mobile microscopy as a screening tool for oral cancer in India: A pilot study. PloS one. 2017;12(11):e0188440. pmid:29176904
  14. 14. Siddiqui MM, Rais-Bahrami S, Turkbey B, George AK, Rothwax J, Shakir N, et al. Comparison of MR/ultrasound fusion–guided biopsy with ultrasound-guided biopsy for the diagnosis of prostate cancer. Jama. 2015;313(4):390–397. pmid:25626035
  15. 15. Liu D, Zhao X, Zeng X, Dan H, Chen Q. Non-invasive techniques for detection and diagnosis of oral potentially malignant disorders. The Tohoku journal of experimental medicine. 2016;238(2):165–177. pmid:26888696
  16. 16. Lingen MW, Kalmar JR, Karrison T, Speight PM. Critical evaluation of diagnostic aids for the detection of oral cancer. Oral oncology. 2008;44(1):10–22. pmid:17825602
  17. 17. Kanniyappan U, Prakasarao A, Dornadula K, Singaravelu G. An in vitro diagnosis of oral premalignant lesion using time-resolved fluorescence spectroscopy under UV excitation—a pilot study. Photodiagnosis and photodynamic therapy. 2016;14:18–24. pmid:26853635
  18. 18. Wu Y, Qu JY. Autofluorescence spectroscopy of epithelial tissues. Journal of biomedical optics. 2006;11(5):054023. pmid:17092172
  19. 19. Awan K, Morgan P, Warnakulasuriya S. Evaluation of an autofluorescence based imaging system (VELscope) in the detection of oral potentially malignant disorders and benign keratoses. Oral oncology. 2011;47(4):274–277. pmid:21396880
  20. 20. Bhatia N, Lalla Y, Vu AN, Farah CS. Advances in optical adjunctive aids for visualisation and detection of oral malignant and potentially malignant lesions. International journal of dentistry. 2013;2013. pmid:24078812
  21. 21. Burian E, Schulz C, Probst F, Palla B, Tröltzsch M, Maglitto F, et al. Fluorescence based characterization of early oral squamous cell carcinoma using the Visually Enhanced Light Scope technique. Journal of Cranio-Maxillo-Facial Surgery. 2017;45(9):1526–1530. pmid:28688861
  22. 22. Bhatia N, Matias MAT, Farah CS. Assessment of a decision making protocol to improve the efficacy of VELscope in general dental practice: a prospective evaluation. Oral oncology. 2014;50(10):1012–1019. pmid:25096826
  23. 23. Nagi R, Reddy-Kantharaj YB, Rakesh N, Janardhan-Reddy S, Sahu S. Efficacy of light based detection systems for early detection of oral cancer and oral potentially malignant disorders: systematic review. Medicina oral, patologia oral y cirugia bucal. 2016;21(4):e447. pmid:26946209
  24. 24. Farah CS, Mclntosh L, Georgiou A, McCullough MJ. Efficacy of tissue autofluo-rescence imaging (VELScope) in the visualization of oral mucosal lesions. Head … neck. 2012;34(6):856–862.
  25. 25. Sawan D, Mashlah A. Evaluation of premalignant and malignant lesions by fluorescent light (VELscope). Journal of International Society of Preventive … Community Dentistry. 2015;5(3):248.
  26. 26. Ganga RS, Gundre D, Bansal S, Shirsat PM, Prasad P, Desai RS. Evaluation of the diagnostic efficacy and spectrum of autofluorescence of benign, dysplastic and malignant lesions of the oral cavity using VELscope. Oral oncology. 2017;75:67–74. pmid:29224826
  27. 27. Huang TT, Huang JS, Wang YY, Chen KC, Wong TY, Chen YC, et al. Novel quantitative analysis of autofluorescence images for oral cancer screening. Oral oncology. 2017;68:20–26. pmid:28438288
  28. 28. Fisher RA. The use of multiple measurements in taxonomic problems. Annals of human genetics. 1936;7(2):179–188.
  29. 29. Talmor A, Daemen A, Murdoch E, Missfelder-Lobos H, Timmerman D, Bourne T, et al. Defining the relationship between fetal Doppler indices, abdominal circumference and growth rate in severe fetal growth restriction using functional linear discriminant analysis. Journal of The Royal Society Interface. 2013;10(88):20130376.
  30. 30. Gardner-Lubbe S, Dube FS. Visualisation of quadratic discriminant analysis and its application in exploration of microbial interactions. BioData mining. 2015;8(1):8. pmid:25798196
  31. 31. Siqueira LF, Júnior RFA, de Araújo AA, Morais CL, Lima KM. LDA vs. QDA for FT-MIR prostate cancer tissue classification. Chemometrics and Intelligent Laboratory Systems. 2017;162:123–129.
  32. 32. Jeng MJ, Sharma M, Sharma L, Chao TY, Huang SF, Chang LB, et al. Raman Spectroscopy Analysis for Optical Diagnosis of Oral Cancer Detection. Journal of clinical medicine. 2019;8(9):1313.
  33. 33. Zhu X. Knowledge Discovery and Data Mining: Challenges and Realities: Challenges and Realities. Igi Global; 2007.
  34. 34. Yan YJ, Huang TW, Cheng NL, Hsieh YF, Tsai MH, Chiou JC, et al. Portable LED-induced autofluorescence spectroscopy for oral cancer diagnosis. Journal of biomedical optics. 2017;22(4):045007.
  35. 35. Kordbacheh F, Bhatia N, Farah C. Patterns of differentially expressed genes in oral mucosal lesions visualised under autofluorescence (VELscope). Oral diseases. 2016;22(4):285–296. pmid:26749103
  36. 36. Gono K, Yamazaki K, Doguchi N, Nonami T, Obi T, Yamaguchi M, et al. En-doscopic observation of tissue by narrowband illumination. Optical Review. 2003;10(4):211–215.
  37. 37. Song LMWK, Adler DG, Conway JD, Diehl DL, Farraye FA, Kantsevoy SV, et al. Narrow band imaging and multiband imaging. Gastrointestinal Endoscopy. 2008;67(4):581–589. pmid:18374021
  38. 38. Poh CF, Zhang L, Anderson DW, Durham JS, Williams PM, Priddy RW, et al. Fluorescence visualization detection of field alterations in tumor margins of oral cancer patients. Clinical Cancer Research. 2006;12(22):6716–6722. pmid:17121891
  39. 39. Druzhkova IN, Shirmanova MV, Lukina MM, Dudenkova VV, Mishina NM, Zagaynova EV. The metabolic interaction of cancer cells and fibroblasts-coupling between NAD (P) H and FAD, intracellular pH and hydrogen peroxide. Cell cycle. 2016;15(9):1257–1266. pmid:26986068
  40. 40. Wagnieres GA, Star WM, Wilson BC. In vivo fluorescence spectroscopy and imaging for oncological applications. Photochemistry and photobiology. 1998;68(5):603–632. pmid:9825692
  41. 41. Horner M, Ries L, Krapcho M, Neyman N, Aminou R, Howlader N, et al. SEER Cancer Statistics Review, 1975-2006, National Cancer Institute. Bethesda, MD; 2009.
  42. 42. Messadi DV, Younai FS, Liu HH, Guo G, Wang CY. The clinical effectiveness of reflectance optical spectroscopy for the in vivo diagnosis of oral lesions. International journal of oral science. 2014;6(3):162. pmid:25059250
  43. 43. Yayon N, Dudai A, Vrieler N, Amsalem O, London M, Soreq H. Intensify3D: Normalizing signal intensity in large heterogenic image stacks. Scientific reports. 2018;8(1):4311. pmid:29523815
  44. 44. Ciano J, Beatty BL. Regional Quantitative Histological Variations in Human Oral Mucosa. The Anatomical Record. 2015;298(3):562–578. pmid:25402000