A New Approach of Fatigue Classification Based on Data of Tongue and Pulse With Machine Learning

Shi, Yulin; Yao, Xinghua; Xu, Jiatuo; Hu, Xiaojuan; Tu, Liping; Lan, Fang; Cui, Ji; Cui, Longtao; Huang, Jingbin; Li, Jun; Bi, Zijuan; Li, Jiacai

doi:10.3389/fphys.2021.708742

ORIGINAL RESEARCH article

Front. Physiol., 07 February 2022

Sec. Computational Physiology and Medicine

Volume 12 - 2021 | https://doi.org/10.3389/fphys.2021.708742

This article is part of the Research Topic Artificial Intelligence in Traditional Medicine View all 9 articles

A New Approach of Fatigue Classification Based on Data of Tongue and Pulse With Machine Learning

$\r\nYulin Shi$ Yulin Shi¹

Xinghua Yao¹

Jiatuo Xu^1*

Xiaojuan Hu²

Liping Tu¹

Fang Lan¹

Ji Cui¹

Longtao Cui¹

Jingbin Huang¹

Jun Li¹

Zijuan Bi¹ $Jiacai Li\r\n$ Jiacai Li¹

¹Basic Medical College, Shanghai University of Traditional Chinese Medicine, Pudong, China
²Shanghai Innovation Center of TCM Health Service, Shanghai University of Traditional Chinese Medicine, Pudong, China

Background: Fatigue is a common and subjective symptom, which is associated with many diseases and suboptimal health status. A reliable and evidence-based approach is lacking to distinguish disease fatigue and non-disease fatigue. This study aimed to establish a method for early differential diagnosis of fatigue, which can be used to distinguish disease fatigue from non-disease fatigue, and to investigate the feasibility of characterizing fatigue states in a view of tongue and pulse data analysis.

Methods: Tongue and Face Diagnosis Analysis-1 (TFDA-1) instrument and Pulse Diagnosis Analysis-1 (PDA-1) instrument were used to collect tongue and pulse data. Four machine learning models were used to perform classification experiments of disease fatigue vs. non-disease fatigue.

Results: The results showed that all the four classifiers over “Tongue & Pulse” joint data showed better performances than those only over tongue data or only over pulse data. The model accuracy rates based on logistic regression, support vector machine, random forest, and neural network were (85.51 ± 1.87)%, (83.78 ± 4.39)%, (83.27 ± 3.48)% and (85.82 ± 3.01)%, and with Area Under Curve estimates of 0.9160 ± 0.0136, 0.9106 ± 0.0365, 0.8959 ± 0.0254 and 0.9239 ± 0.0174, respectively.

Conclusion: This study proposed and validated an innovative, non-invasive differential diagnosis approach. Results suggest that it is feasible to characterize disease fatigue and non-disease fatigue by using objective tongue data and pulse data.

Introduction

Fatigue refers to the state that the body cannot endure certain physical intensity with both physiological and pathological manifestation (Chaudhuri and Behan, 2004). Fatigue is subjective uncomfortableness. It can be either mental or physical, and can be of different degrees depending on the health conditions (Persson and Bondke Persson, 2016). Studies have shown that chronic fatigue syndrome (CFS) (Wang et al., 2014; Sandler and Lloyd, 2020), depression (Kim et al., 2019), cancer (Lawrence et al., 2004), and other diseases have obvious fatigue manifestations, and various treatment modalities, such as radiotherapy (Hickok et al., 2005; Dhruva et al., 2010), chemotherapy (Minton et al., 2013), and hormone and biological therapy (Phillips et al., 2013) can aggravate fatigue. Fatigue is one of the most common subjective symptoms of abnormal health state and can be further categorized as disease fatigue and non-disease fatigue. Due to the lack of objective diagnostic tool of fatigue, there is still no reliable and stable evaluation method to distinguish disease fatigue and non-disease fatigue.

Traditional Chinese medicine (TCM) leverages symptoms, physical signs, tongue, and pulse as one of the ways to characterize patient health status. With rapid development in computer science, various machine learning methodologies, such as logistic regression (Bucur et al., 2017; Zhang et al., 2018), support vector machine (SVM) (Li X. et al., 2019), random forest (Ozçift, 2011; Kong and Yu, 2018), convolutional neural network (Shin et al., 2016; Yu et al., 2017), and deep neural network (Ben-Bassat et al., 2018) have been widely applied in the field of medical research. Using artificial intelligence methods in understanding the diagnostic data and syndromes or diseases can help improve the accuracy and precision of diagnosis in an objective and efficient manner. In TCM, fatigue is believed to be related to decline of the whole or local functional state of the human body-the performance of Qi deficiency. Tongue diagnosis and pulse diagnosis are recognized diagnostic methods which are based on overall evaluation of human body; and this is suitable in functional states evaluation, forming important foundation for the evaluation of health status and disease diagnosis. Tongue and pulse manifestations are closely related to heart, lung, spleen, stomach, liver, and kidney functioning, just as the old saying goes: “Tongue reflecting sign of heart,” “The tongue is the external phenology of the spleen and stomach,” “Heart dominating blood and vessel,” “The pulse is the house of blood,” and “Lung connecting all vessels.” Tongue and pulse conditions can reflect the function of Qi, blood, and viscera. Therefore, when fatigue occurs, the changes in functions of the heart, lungs, or other viscera will be reflected in tongue and pulse manifestations. Thus, tongue and pulse conditions can be used to understand the severity and cause of fatigue. Using a large amount of patient level data collected by modern tongue diagnostic or pulse diagnostic instruments, a number of diagnostic models have been developed using machine learning in other disease areas (Wang et al., 2013, 2020; Zhang et al., 2019). Based on modern tongue (Ding et al., 2015; Li W.L. et al., 2019) and pulse diagnosis (Shi et al., 2017; Kung et al., 2020) technology, research on fatigue has made great progress.

Fatigue is an early sign of abnormal health status, which plays a very important role in understanding the health status and early prevention and diagnosis of disease. However, due to lack of objective evidence for fatigue, especially in the early stage of the disease, fatigue is often neglected, which delays diagnosis and timely intervention. A reliable and consistent method to distinguish disease fatigue and non-disease fatigue can effectively assist differentiation of disease fatigue and non-disease fatigue in early diagnosis. This study aims to establish a method for early differential diagnosis of fatigue, to facilitate early diagnosis, prevention, and treatment of disease. This is an interdisciplinary work in which we interpret the scientific rules of disease diagnosis based on objective data of tongue and pulse.

Materials and Methods

Study Subjects

A total of 486 fatigue patients were included in this study from January 2015 to December 2018 at Medical Examination Center of Shuguang Hospital affiliated to Shanghai University of TCM. Patients were divided into two groups by experienced clinicians according to disease diagnostic guidelines and fatigue diagnostic criteria: non-disease fatigue subjects (n = 242), and disease fatigue subjects (n = 244). The study included a group of healthy population (n = 250) as controls. Patient selection and classification is shown in Figure 1. All patients have signed informed consent form.

FIGURE 1

Figure 1. Overall flowchart.

Inclusion and Exclusion Criteria

Specific diagnosis of disease for patients with disease fatigue was made by four experienced clinicians following diagnostic criteria of Western medicine. Most common diseases included Chinese Diabetes and Society (2018), Hypertension et al. (2019), and hyperlipidemia (Yan et al., 2017). Health Status Assessment Questionnaire Scale (H20 Scale) and the Information Record Form of Four Diagnosis of TCM (Copyright No.: 2016Z11L025702) (Shi et al., 2021) (as shown in Supplementary Material 1) were used to further define the state of fatigue.

Inclusion criteria: (1) meeting the diagnostic criteria of disease or there are obvious abnormal physical signs for diseases. (2) Have symptom of fatigue.

Exclusion criteria: (1) pregnant or lactating women. (2) Psychopath. (3) Patients with poor compliance.

Collecting Clinical Tongue Data and Pulse Data

Tongue and Face Diagnosis Analysis-1 (TFDA-1) instrument and Pulse Diagnosis Analysis-1 (PDA-1) instrument were used to collect tongue data and pulse data. TFDA-1 instrument is shown in Figure 2 and its corresponding analysis software, named tongue diagnosis analysis system (TDAS) V2.0, is shown in Figure 3. The corresponding indices of tongue body and tongue coating could be obtained via TDAS. All these indices reflect the tongue characteristics from different perspectives, which served as important objective basis for health status evaluation and syndrome diagnosis. PDA-1 instrument and its corresponding sphygmogram are shown in Figure 4. All investigators were specialized medical students who had been trained for standard study operating procedures to ensure consistency and accuracy of data collection.

FIGURE 2

Figure 2. Tongue and Face Diagnosis Analysis-1 (TFDA-1) tongue diagnosis instrument. (A) Front view. (B) Profile view.

FIGURE 3

Figure 3. The corresponding software analysis interface of TFDA-1 equipment.

FIGURE 4

Figure 4. Pulse Diagnosis Analysis-1 (PDA-1) pulse diagnosis instrument and sphygmogram. (A) PDA-1 pulse diagnosis instrument. (B) Sphygmogram and its parameters.

The tongue indices were from three color spaces, Lab, HIS, and YCrCb (Qi et al., 2016; Sun et al., 2016; Schiller et al., 2018), each tongue and pulse index had its medical meaning (Qi et al., 2016; Luo et al., 2018; Li et al., 2021b; Shi et al., 2021). The indices of tongue diagnosis and pulse diagnosis and their corresponding clinical meaning are shown in Supplementary Table 1.

Statistical Analysis

The SPSS 25.0 software was used for statistical analysis. Continuous data with normal distribution are presented as mean and SD, and those with abnormal distribution are presented using median and interquartile range (IQR). Comparisons between groups were conducted using ANOVA or Kruskal–Wallis H-test for continuous variables. A p < 0.05 (two-tailed) was considered to be statistically significant in comparisons.

Classification by Machine Learning Approach

In this study, four machine learning methods, such as logistic regression, SVM, neural network, and random forest were used. The random forest is an ensemble learning method for classification and other tasks, which does not utilize the gradient decent. When modeling data by the random forest, no operations of normalizing data were performed. In our experiments by using the three models of logistic regression, SVM, and neural network, the data were normalized using the method of Z-score. The preprocessing-data method of Z-score is described as the following Eq. 1.

Z = \frac{X - μ}{σ} (1)

where X denotes an element in a data vector, μ for mean value, and σ for SD.

Logistic regression, a multivariate analysis method for studying the relationship between categorical variables and influencing factors, is usually used to construct prediction models for exploring risk factors and predicting the probability of a certain disease. Its accuracy of prediction can be improved by adjusting regression model parameters (Bucur et al., 2017; Zhang et al., 2018). Logistic regression model is described by the following Eq. 2.

ln \frac{y}{1 - y} = W^{T} X + b (2)

where X denotes a vector for sample, W denotes a vector for the linear parameters, and b and y are scalars.

Support vector machine is one of the most important supervised learning models, used to solve classification or regression problems. Its essence is to find a hyperplane between different data types to create a boundary, which maximizes the interval between data points in different classes. SVM is widely used in face recognition and disease patterns (Li et al., 2012; Zhang et al., 2017).

Random forest is a classifier that uses multiple decision trees to train and predict sample. Though it is not the most accurate classification algorithm, it runs efficiently on large datasets and can handle thousands of input variables without variable deletion (Kong and Yu, 2018). In our random forest, two metrics, i.e., Gini index and information gain, were separately taken as criterion to select partition attributes. The Gini index was calculated according to Eqs 3, 4, and the information gain by using Eqs 5, 6.

G i n i (D) = 1 - \sum_{k = 1}^{n} p_{k}^{2} (3)

G i n i_i n d e x (D, a) = \sum_{v = 1}^{v} \frac{| D^{v} |}{| D |} G i n i (D^{v}) (4)

E n t (D) = - \sum_{k = 1}^{n} p_{k} l o g_{2} p_{k} (5)

G a i n (D, a) = Ent (D) - \sum_{v = 1}^{v} \frac{| D^{v} |}{| D |} E n t (D^{v}) (6)

where D denotes a data set, n for the total number of categories in the data set D. Symbol p_k is a probability of a sample being classified to be the k-th category. In other words, p_k means a ratio that the k-th category accounts for in the dataset. Symbol a represents an attribute, V for the number of sets obtained by partitioning the set D according to the attribute a, D^v for a subset of the set D corresponding to a value of the attribute a.

Neural network is another important machine learning method. It can simulate human brain to achieve artificial intelligence. Our neural network contained one hidden layer with activation function. Three activation functions, such as Tanh, Sigmoid, and ReLU, were selected respectively in the hidden layer. The computation in the hidden layer with activation Tanh is presented in Eq. 7, Eq. 8 is for computations in the type of hidden layer with activation Sigmoid, and Eq. 9 for the type of hidden layer with activation ReLU. Two optimizers, i.e., adaptive moment estimation (Adam) and stochastic gradient decent optimizer (SGD), are taken, respectively.

y = \tanh (W^{T} \times X - θ) = \frac{e^{(W^{T} \times X - θ)} - e^{- (W^{T} \times X - θ)}}{e^{(W^{T} \times X - θ)} + e^{- (W^{T} \times X - θ)}} (7)

y = σ (W^{T} \times X - θ) = \frac{1}{1 + e^{- (W^{T} \times X - θ)}} (8)

y = max (0, W^{T} \times X - θ) (9)

where X is an input vector, W for a weight vector, and θ for a threshold.

We used SPSS 25.0 to detect outliers or extreme values of tongue and pulse data, the sample who had outliers or extreme values were deleted. All tongue and pulse data were extracted in batches by specialized tongue and pulse diagnosis analysis software, at the same time, we conducted a manual check of all data to ensure that there was no artificial input errors and missing values. All the experiments were performed in Python 3.6. The metric of area under the curve (AUC) was calculated as an area under the receiver operating characteristic curve (ROC). Accuracy, Precision, Sensitivity, Specificity, and F1 were formally defined in the following Eqs 10–14. The accuracy was defined as a ratio between the number of correctly classified samples and the total number of samples. Precision was defined as a ratio of correctly predicted positive samples out of predicted positive samples. F1-score is the harmonic mean of Precision and Sensitivity (Yang et al., 2018). Sensitivity was defined as the proportion of positive samples which are correctly identified, which measures the ability of classifier to correctly identify positive samples. Specificity is the proportion of negatives which are correctly predicted (Handelman et al., 2018).

Accuracy = \frac{TP + TN}{TP + TN + FP + FN} \times 100 % (10)

Precision = \frac{TP}{TP + FP} \times 100 (11)

Sensitivity = \frac{TP}{TP + FN} \times 100 % (12)

Specificity = \frac{TN}{TN + FP} \times 100 % (13)

F = \frac{2 \times Precision \times Sensitivity}{Precision + Sensitivity} (14)

True Positive (TP) is the number of positive samples which are correctly predicted. True Negative (TN) is the number of negative samples which are correctly predicted. False Positive (FP) denotes the number of negative samples which are predicted to be positive. False Negative (FN) is the number of positive samples predicted to be negative.

Visualization of Machine Learning

Predicted results of machine learning models were visualized by using t-distributed stochastic neighbor embedding (t-SNE). The visualization intuitively showed predicted results and capabilities of machine learning models. The t-SNE algorithm was deployed to reduce the high-dimensional data collected in this study into two-dimensional data. The features in each dimension of the obtained two-dimensional data were rescaled to the range of by using min-max normalization. A general formula for the min-max normalization was given as Eq. 15, where an original value in a dimension was the normalized value. Normalized data were then scattered on a two-dimensional plane.

x^{'} = \frac{x - min (x)}{\max (x) - min (x)} (15)

Results

Basic Statistics

The baseline characteristics of the subjects are presented in Table 1.

TABLE 1

Table 1. Baseline characteristic [median (P25, P75)].

There were statistically significant differences in age and body mass index (BMI) between disease fatigue and non-disease fatigue group subjects (p < 0.01). Patients with disease fatigue who were older are associated with higher BMI.

Statistical Analysis Over Tongue Data

We selected the widely recognized tongue indices for statistical analysis based on experience from previous studies. The result of tongue indices among three groups are depicted in Table 2. The prefix TB-represents the tongue body, and TC-represents the tongue coating.

TABLE 2

Table 2. Statistical analysis of tongue body and tongue coating index [mean (SD), median (P₂₅, P₇₅)].

Statistical results of tongue data showed that TB-a, TB-b, TB-H, TB-S, TB-I, TB-Cb, TC-L, TC-H, TC-I, TC-Y, TC-Cr, TC-Cb, perAll, and perPart showed significant differences among three groups. The numerical distribution trend of the indices of TB-L, TB-a, TB-S, TB-I, TB-Y, TB-Cb, TC-L, TC-I, TC-Y, TC-Cb, and perAll was as follows: healthy subjects < non-disease fatigue subjects < disease fatigue subjects; the numerical distribution trend of the indices of TB-b, TC-b, TB-Cr, TC-Cr, TB-H, TC-H, and perPart had the following order: disease fatigue subjects < non-disease fatigue subjects < healthy subjects.

Statistical Analysis Over Pulse Data

Similar as in tongue data analysis, the widely used pulse indices were selected for statistical analysis. Results of pulse indices among healthy subjects, non-disease fatigue subjects, and disease fatigue subjects are depicted in Table 3.

TABLE 3

Table 3. Statistical analysis of pulse index [median (P₂₅, P₇₅)].

Statistical results of pulse indices showed that t₁, t₄, h₅, w₁, w₂, w₁/t, and w₂/t showed significant difference among three groups (p < 0.05 and p < 0.01), and the numerical distribution trend of the indices of t₁, t₄, w₁, w₂, w₁/t, and w₂/t was that the group of disease fatigue was larger than the group of non-disease fatigue and the health controls, the numerical distribution trend of h₅ was as follows: disease fatigue subjects < non-disease fatigue subjects < healthy subjects.

Results Using Machine Learning and Visualization

Based on the statistical analysis over tongue data and pulse data (Tables 2, 3), such tongue indices and pulse indices showing significant statistic inferences were utilized to characterize disease fatigue and non-disease fatigue. Logistic regression, SVM, random forest, and neural network were deployed as classification models over the datasets, respectively, such as “Tongue,” “Pulse,” “Tongue & Pulse,” and “Tongue & Pulse & Age & BMI.” A dataset in each of our experiments was randomly split into training set and testing set according to a ratio of 8:2. For each of the four models, a procedure of adjusting model parameters was performed separately for each of the four datasets. A setting of parameters with best performances was selected for a model over a dataset. Based on the selected parameters setting, experiments were conducted for 10 times over the corresponding dataset by using the selected model. Classification results of 10 experiments were described in the form of “mean ± SD” for each model over each dataset. They are depicted in Table 4. The results from 10 times repeated modeling of the best parameters of each model are depicted in Supplementary Tables 2–5.

TABLE 4

Table 4. Classification results of disease fatigue against non-disease fatigue over four datasets using four classifiers.

Each subfigure in Figures 5–8 plotted 10 ROC curves which were obtained in 10 repeated experiments using a machine model over a dataset, and it gave 10 AUC results corresponding to area under each one of 10 ROC curves. The 10 ROC curves were in different colors, each color represented an ROC result achieved in one experiment. The ROCs of 10 times repeated experiments obtained using logistic regression, SVM, random forest, and neural network over four datasets were depicted in Figures 5–8, respectively. The accuracy rate over four datasets for four machine learning models are depicted in Figure 9.

FIGURE 5

Figure 5. Receiver operating characteristics (ROCs) of 10 times repeated experiments obtained using logistic regression over four datasets. (A) ROCs over “Tongue” dataset. (B) ROCs over “Pulse” dataset. (C) ROCs over “Tongue & Pulse” dataset. (D) ROCs over “Tongue & Pulse & Age & BMI” dataset.

FIGURE 6

Figure 6. Receiver operating characteristics of 10 times repeated experiments obtained using support vector machine (SVM) over four datasets. (A) ROCs over “Tongue” dataset. (B) ROCs over “Pulse” dataset. (C) ROCs over “Tongue & Pulse” dataset. (D) ROCs over “Tongue & Pulse & Age & BMII” dataset.

FIGURE 7

Figure 7. Receiver operating characteristics (ROCs) of 10 times repeated experiments obtained using random forest over four datasets. (A) ROCs over “Tongue” dataset. (B) ROCs over “Pulse” dataset. (C) ROCs over “Tongue & Pulse” dataset. (D) ROCs over “Tongue & Pulse & Age & BMI” dataset.

FIGURE 8

Figure 8. Receiver operating characteristics (ROCs) of 10 times repeated experiments obtained using neural network over four datasets. (A) ROCs over “Tongue” dataset. (B) ROCs over “Pulse” dataset. (C) ROCs over “Tongue & Pulse” dataset. (D) ROCs over “Tongue & Pulse & Age & BMI” dataset.

FIGURE 9

Figure 9. The accuracy rate of four classifiers over four datasets.

For all four classifiers, performance over the “Tongue & Pulse” dataset were better than those only using tongue data or pulse data. After adding age and BMI data, the classification efficiency was improved for each of the four models. Over “Tongue & Pulse” dataset, neural network and logistic regression had better classification effects than other classifiers. Overall, the distribution trend of the average accuracy of different classifiers except for random forest based on different datasets had the following order: “Tongue” < “Pulse” < “Tongue & Pulse” < “Tongue & Pulse &Age & BMI.”

There are many different indices of the same diagnosis method, data of a single dimension tends to have a high consistency, so its visualization effect is better. As the data dimension increases, the data complexity increases, and the visualization effect decreases. The visualization of modeling classification results of tongue and pulse sets based on different classifiers in this study are shown in Figures 10, 11. In each subfigure in Figure 10, either blue point or red point represents a two-dimension data point, which was obtained by performing dimensional reduction operation over original testing data and by executing min-max normalization. The abscissa and ordinate were the two dimensions of the two-dimension data obtained by dimensional reduction, respectively.

FIGURE 10

Figure 10. Visualization of “Tongue” data based on different classifiers. (A) Logistic regression. (B) Neural network. (C) Random forest. (D) SVM.

FIGURE 11

Figure 11. Visualization of “Pulse” data based on different classifiers. (A) Logistic regression. (B) Neural network. (C) Random forest. (D) SVM.

Discussion

The purpose of this study was to determine whether general fatigue was caused by diseases and to provide a convenient and reliable method for early screening of fatigue. To achieve this, we enrolled patients undergoing routine physical examination as the research subjects, rather than patients with confirmed disease diagnoses, such as heart disease, cancer, and neurological degenerative diseases, because these patients typically would have definite diagnoses and thus would not meet our research objective to understand early screening for atypical disease fatigue. This study primarily leveraged basic health information and data of tongue and pulse to screen for fatigue population for diseases and non-disease reasons. According to Tables 2, 3, tongue and pulse data of the healthy population overlaps with the two groups of patients with fatigue to a certain extent. The healthy population was selected to serve as baseline to understand general data of tongue and pulse and was not used in modeling for classification.

Our research team has been continuously working on research related to tongue diagnosis technology and has established a relatively reliable analysis methodology for tongue and index, and has also published findings on tongue diagnosis (Zhang et al., 2017; Qi et al., 2018; Qiao et al., 2018; Li et al., 2021a,b; Shi et al., 2021). The index of tongue diagnosis mainly included the color and texture of tongue body and tongue coating and proportion of tongue coating. According to the distribution law of perAll, perPart, TB-Cb, TC-Cb, TB-Cr, TC-Cr, TB-I, TC-I, TB-Y, TC-Y, TB-L, and TC-L, the increase of TC-I, TB-I, TB-L, TC-L, TB-Y, and TC-Y in disease fatigue population indicated white tongue coating, and high perAll and low perPart indicated thick tongue coating. White greasy or white thick coating is generally seen in dampness syndrome or cold syndrome, which were commonly seen in patients with qi deficiency of spleen and stomach or poor transportation function of spleen and stomach (Zhang et al., 2013). The increased TB-Cb and TC-Cb, decreased TB-Cr and TC-Cr indicate purple or more cyanotic tongue body, which is generally seen in qi stagnation and blood stasis syndrome or cold syndrome. Generally speaking, patients with coronary heart disease (Zi et al., 2021), or chronic liver disease (Liu et al., 2003), or vasculitis (Xu et al., 2020), or cancer (Hao et al., 2016), often have purple or more cyanotic tongue body. All the indices were quantified by special TDAS software (TDAS V2.0), and the conclusions were made through statistical analysis. In addition, studies have shown that pulse was closely related to cardiovascular function (Hu et al., 2018; Luo et al., 2018). In our study, the statistical result of pulse indices showed that compared with non-disease fatigue and healthy subjects, disease fatigue subjects had more severe functional decline in left ventricular function, peripheral resistance, aortic compliance, vascular wall elasticity, blood viscosity, and other cardiovascular functions. In addition, pulse was influenced by with these indices.

In the section of modeling using machine learning methods, age and BMI, as recognized prognostic factors, were closely related to diseases. Age and BMI were basic information related to human health, which were closely related to diseases. Studies have shown a correlation between age and the incidence of diseases (Wolff et al., 2020), with the increase in age, the risk of disease gradually increased. Previous studies had shown that BMI (Komatsu et al., 2020) was a key factor of diseases, it played an important role in the diagnostic process. Generally speaking, with the increase of age and BMI, the risk of disease gradually increased. In this study, classification models were constructed over “Age & BMI” datasets, and related experimental results showed that age and BMI had a good classification effect for classifying disease fatigue and non-disease fatigue. However, our focus in this study was that whether data of tongue and pulse or tongue and pulse combined with basic information of age and BMI could distinguish different fatigue states well. For classification models only based on “Age & BMI” datasets and that whether age and BMI had any effect on tongue and pulse, they were not our focus. In conclusion, models based on “Tongue & Pulse” datasets had good classification performances for classifying disease fatigue and non-disease fatigue, and adding age and BMI could help improve the classification performances of models. The classification performances of models over “Tongue & Pulse & Age & BMI” datasets were better than models based on datasets of “Tongue,” “Pulse,” “Tongue & Pulse,” and “Age & BMI,” respectively. Because pulse can reflect cardiovascular function and was closely related to health status. It was convincible that the accurate diagnosis rate of pulse was higher than that of tongue. Therefore, age, BMI, tongue, and pulse were important factors for the fatigue classification model.

Limitations and Future Work

This study still had some limitations. First, this study mainly focused on tongue and pulse data differences between two “fatigue” groups (disease and non-disease) from a holistic perspective. However, there are a wide range of diseases that require further analysis. Second, the baseline clinical characteristics of the subjects were not comprehensive enough. In the future, narrowing down the research scope of disease, a large-scale and multicenter epidemiological investigation should be combined, and more complete baseline demographic and clinical characteristics data would be useful in further understanding tongue and pulse data for other diseases.

Data Availability Statement

The datasets generated and analyzed during the current study are not publicly available due to the confidentiality of the data, which is an important component of the National Key Technology R&D Program of the 13th Five-Year Plan (No. 2017YFC1703301) in China, but are available from the corresponding author on reasonable request.

Ethics Statement

The study protocol was approved by the IRB of Shuguang Hospital affiliated with Shanghai University of TCM (No. 2018-626-55-01). The patients/participants provided their written informed consent to participate in this study. Written informed consent was obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.

Author Contributions

YS and JX designed the study. YS and XY wrote the manuscript. XH and LT performed the data analysis. FL, JC, LC, and JH performed the data collection. JuL, ZB, and JiL contributed to the critical discussion and manuscript revision. All authors contributed to the article and approved the submitted version.

Funding

This research was funded by the National Key Research and Development Program of China (2017YFC1703301), the National Natural Science Foundation of China (81873235, 81973750, and 81904094), and 1226 Major Project (BWS17J028). They were not involved in the preparation of this manuscript or in the decision to submitting it for publication.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Acknowledgments

The authors were especially thankful for the positive support received from the Medical Examination Center of Shuguang Hospital Affiliated to Shanghai University of Traditional Chinese Medicine and all medical staff involved.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fphys.2021.708742/full#supplementary-material

References

Ben-Bassat, I., Chor, B., and Orenstein, Y. (2018). A deep neural network approach for learning intrinsic protein-RNA binding preferences. Bioinform 34, i638–i646.

Google Scholar

Bucur, E., Danet, A. F., Lehr, C. B., Lehr, E., and Nita-Lazar, M. (2017). Binary logistic regression-Instrument for assessing museum indoor air impact on exhibits. J. Air Waste Manag. Assoc. 67, 391–401. doi: 10.1080/10962247.2016.1231724

PubMed Abstract | CrossRef Full Text | Google Scholar

Chaudhuri, A., and Behan, P. O. (2004). Fatigue in neurological disorders. Lancet 363, 978–988. doi: 10.1016/s0140-6736(04)15794-2

CrossRef Full Text | Google Scholar

Yan, C., Ya-Bei, C., and Rong-Fang, T. (2017). Interpretation of “Guideline for Prevention and Treatment of Dyslipidemia in Chinese Adults in 2016. Chin. J. Pract. Intern. Med. 37, 38–42.

Google Scholar

Chinese Diabetes and Society (2018). Guidelines for the prevention and control of type 2 diabetes in China. Chin. J. Pract. Intern. Med. 38, 292–344.

Google Scholar

Dhruva, A., Dodd, M., Paul, S. M., Cooper, B. A., Lee, K., West, C., et al. (2010). Trajectories of fatigue in patients with breast cancer before, during, and after radiation therapy. Cancer Nurs. 33, 201–212. doi: 10.1097/NCC.0b013e3181c75f2a

PubMed Abstract | CrossRef Full Text | Google Scholar

Ding, T., Feng, L., Rong, L., and Xi, L. D. (2015). “Tongue inspection on Fatigue,”, in The 10th Annual Conference of Rehabilitation Committee of Traditional Chinese Medicine of China Disabled Persons, (Shangai:Rehabilitation Association) 4.

Google Scholar

Handelman, G. S., Kok, H. K., Chandra, R. V., Razavi, A. H., Lee, M. J., and Asadi, H. (2018). eDoctor: machine learning and the future of medicine. J. Intern. Med. 284, 603–619. doi: 10.1111/joim.12822

PubMed Abstract | CrossRef Full Text | Google Scholar

Hao, J., Zhu, C., Cao, R., Yang, X., Ding, X., Man, Y., et al. (2016). [Purple-bluish tongue is associated with platelet counts, and the recurrence of epithelial ovarian cancer]. J. Tradit. Chin. Med. 36, 321–325. doi: 10.1016/s0254-6272(16)30044-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Hickok, J. T., Roscoe, J. A., Morrow, G. R., Mustian, K., Okunieff, P., and Bole, C. W. (2005). Frequency, severity, clinical course, and correlates of fatigue in 372 patients during 5 weeks of radiotherapy for cancer. Cancer 104, 1772–1778. doi: 10.1002/cncr.21364

PubMed Abstract | CrossRef Full Text | Google Scholar

Hu, X. J., Zhang, L., Xu, J. T., Liu, B. C., Wang, J. Y., Hong, Y. L., et al. (2018). Pulse Wave Cycle Features Analysis of Different Blood Pressure Grades in the Elderly. Evid. Based. Complement. Alternat. Med. 2018:1976041. doi: 10.1155/2018/1976041

PubMed Abstract | CrossRef Full Text | Google Scholar

Hypertension. WGoCGftMo, League. Ch, Cardiology. CSo, Committee. Cmdah, Care. HBoCIEaPAfMaH, Association. HBoCGM.. (2019). 2018 Chinese guidelines for the management of hypertension. Chin. J. Cardiovasc. Med. 24, 24–56.

Google Scholar

Kim, S., Jang, H. J., Myung, W., Kim, K., Cha, S., Lee, H., et al. (2019). Heritability estimates of individual psychological distress symptoms from genetic variation. J. Affect. Disord. 252, 413–420. doi: 10.1016/j.jad.2019.04.011

PubMed Abstract | CrossRef Full Text | Google Scholar

Komatsu, T., Fujihara, K., Yamada, M. H., Sato, T., Kitazawa, M., Yamamoto, M., et al. (2020). 449-P: impact of Body Mass Index (BMI) and Waist Circumference (WC) on Coronary Artery Disease (CAD) in Japanese with and without Diabetes Mellitus (DM). Diabetes 69:449. doi: 10.2337/db20-449-p

CrossRef Full Text | Google Scholar

Kong, Y., and Yu, T. (2018). A Deep Neural Network Model using Random Forest to Extract Feature Representation for Gene Expression Data Classification. Sci. Rep. 8:16477.

Google Scholar

Kung, Y. Y., Kuo, T. B. J., Lai, C. T., Shen, Y. C., Su, Y. C., and Yang, C. C. H. (2020). Disclosure of suboptimal health status through traditional Chinese medicine-based body constitution and pulse patterns. Complement. Ther. Med. 56:102607. doi: 10.1016/j.ctim.2020.102607

PubMed Abstract | CrossRef Full Text | Google Scholar

Lawrence, D. P., Kupelnick, B., Miller, K., Devine, D., and Lau, J. (2004). Evidence report on the occurrence, assessment, and treatment of fatigue in cancer patients. J. Natl. Cancer. Inst. Monogr. 2004, 40–50. doi: 10.1093/jncimonographs/lgh027

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, F., Zhao, C., Xia, Z., Wang, Y., Zhou, X., and Li, G. Z. (2012). Computer-assisted lip diagnosis on Traditional Chinese Medicine using multi-class support vector machines. BMC. Complement. Altern. Med. 12:127. doi: 10.1186/1472-6882-12-127

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, J., Chen, Q., Hu, X., Yuan, P., Cui, L., Tu, L., et al. (2021a). Establishment of noninvasive diabetes risk prediction model based on tongue features and machine learning techniques. Int. J. Med. Inform. 149:104429. doi: 10.1016/j.ijmedinf.2021.104429

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, J., Yuan, P., Hu, X., Huang, J., Cui, L., Cui, J., et al. (2021b). A tongue features fusion approach to predicting prediabetes and diabetes with machine learning. J. Biomed. Inform. 115:103693. doi: 10.1016/j.jbi.2021.103693

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, W. L., Yi, Z. X., and Min, P. (2019). Objective analysis of complexion and tongue color in patients with chronic fatigue syndrome. Shandong Med. J. 59, 81–83.

Google Scholar

Li, X., Zhang, Y., Cui, Q., Yi, X., and Zhang, Y. (2019). Tooth-Marked Tongue Recognition Using Multiple Instance Learning and CNN Features. IEEE. Trans. Cybern. 49, 380–387. doi: 10.1109/tcyb.2017.2772289

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, Q., Yue, X. Q., Deng, W. Z., and Ren, R. Z. (2003). [Quantitative study on tongue color in primary liver cancer patients by analysis system for comprehensive information of tongue diagnosis]. Zhong Xi Yi Jie He Xue Bao 1, 180–183.

Google Scholar

Luo, Z. Y., Cui, J., Hu, X. J., Tu, L. P., Liu, H. D., Jiao, W., et al. (2018). A Study of Machine-Learning Classifiers for Hypertension Based on Radial Pulse Wave. Biomed. Res. Int. 2018:2964816.

Google Scholar

Minton, O., Berger, A., Barsevick, A., Cramp, F., Goedendorp, M., Mitchell, S. A., et al. (2013). Cancer-related fatigue and its impact on functioning. Cancer 119, 2124–2130.

Google Scholar

Ozçift, A. (2011). Random forests ensemble classifier trained with data resampling strategy to improve cardiac arrhythmia diagnosis. Comput. Biol. Med. 41, 265–271.

Google Scholar

Persson, P. B., and Bondke Persson, A. (2016). Fatigue. Acta. Physiol. 218, 3–4.

Google Scholar

Phillips, K. M., Pinilla-Ibarz, J., Sotomayor, E., Lee, M. R., Jim, H. S., Small, B. J., et al. (2013). Quality of life outcomes in patients with chronic myeloid leukemia treated with tyrosine kinase inhibitors: a controlled comparison. Support. Care Cancer 21, 1097–1103.

Google Scholar

Qi, Z., Tu, L. P., Chen, J. B., Hu, X. J., Xu, J. T., and Zhang, Z. F. (2016). The Classification of Tongue Colors with Standardized Acquisition and ICC Profile Correction in Traditional Chinese Medicine. Biomed. Res. Int. 2016:3510807.

Google Scholar

Qi, Z., Tu, L. P., Luo, Z. Y., Hu, X. J., Zeng, L. Z., Jiao, W., et al. (2018). Tongue Image Database Construction Based on the Expert Opinions: assessment for Individual Agreement and Methods for Expert Selection. Evid. Based Complement. Alternat. Med. 2018:8491057.

Google Scholar

Qiao, L. J., Qi, Z., Tu, L. P., Zhang, Y. H., Zhu, L. P., Xu, J. T., et al. (2018). The Association of Radial Artery Pulse Wave Variables with the Pulse Wave Velocity and Echocardiographic Parameters in Hypertension. Evid. Based Complement. Alternat. Med. 2018:5291759. doi: 10.1155/2018/5291759

PubMed Abstract | CrossRef Full Text | Google Scholar

Sandler, C. X., and Lloyd, A. R. (2020). Chronic fatigue syndrome: progress and possibilities. Med. J. Aust. 212, 428–433. doi: 10.5694/mja2.50553

PubMed Abstract | CrossRef Full Text | Google Scholar

Schiller, F., Valsecchi, M., and Gegenfurtner, K. R. (2018). An evaluation of different measures of color saturation. Vision. Res. 151, 117–134. doi: 10.1016/j.visres.2017.04.012

PubMed Abstract | CrossRef Full Text | Google Scholar

Shi, H. Z., Fan, Q. C., Gao, J. Y., Liu, J. L., Bai, G. E., Mi, T., et al. (2017). Evaluation of the health status of six volunteers from the Mars 500 project using pulse analysis. Chin. J. Integr. Med. 23, 574–580. doi: 10.1007/s11655-016-2539-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Shi, Y., Hu, X., Cui, J., Cui, L., Huang, J., Ma, X., et al. (2021). Clinical data mining on network of symptom and index and correlation of tongue-pulse data in fatigue population. BMC. Med. Inform. Decis. Mak. 21:72. doi: 10.1186/s12911-021-01410-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Shin, H. C., Roth, H. R., Gao, M., Lu, L., Xu, Z., Nogues, I., et al. (2016). Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning. IEEE. Trans. Med. Imaging 35, 1285–1298. doi: 10.1109/TMI.2016.2528162

PubMed Abstract | CrossRef Full Text | Google Scholar

Sun, X., Young, J., Liu, J. H., Bachmeier, L., Somers, R. M., Chen, K. J., et al. (2016). Prediction of pork color attributes using computer vision system. Meat. Sci. 113, 62–64. doi: 10.1016/j.meatsci.2015.11.009

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, X., Liu, J., Wu, C., Liu, J., Li, Q., Chen, Y., et al. (2020). Artificial intelligence in tongue diagnosis: using deep convolutional neural network for recognizing unhealthy tongue with tooth-mark. Comput. Struct. Biotechnol. J. 18, 973–980. doi: 10.1016/j.csbj.2020.04.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, X., Zhang, B., Yang, Z., Wang, H., and Zhang, D. (2013). Statistical analysis of tongue images for feature extraction and diagnostics. IEEE. Trans. Image. Process. 22, 5336–5347. doi: 10.1109/TIP.2013.2284070

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, Y. Y., Li, X. X., Liu, J. P., Luo, H., Ma, L. X., and Alraek, T. (2014). Traditional Chinese medicine for chronic fatigue syndrome: a systematic review of randomized clinical trials. Complement. Ther. Med. 22, 826–833. doi: 10.1016/j.ctim.2014.06.004

PubMed Abstract | CrossRef Full Text | Google Scholar

Wolff, B., Macioce, V., Vasseur, V., Castelnovo, L., Michel, G., Nguyen, V., et al. (2020). Ten-year outcomes of anti-vascular endothelial growth factor treatment for neovascular age-related macular disease: a single-centre French study. Clin. Exp. Ophthalmol. 48, 636–643. doi: 10.1111/ceo.13742

PubMed Abstract | CrossRef Full Text | Google Scholar

Xu, M., Chen, H., Shi, Z. X., Da, Y. W., Luo, Y. M., Gao, L., et al. (2020). Pathological Observation of Blood Stasis Syndrome in Non-diabetic Peripheral Neuropathies: a Retrospective Analysis Based on Nerve Biopsy. Chin. J. Integr. Med. 26, 776–782. doi: 10.1007/s11655-019-3045-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Yang, K., Wang, N., Liu, G., Wang, R., Yu, J., Zhang, R., et al. (2018). Heterogeneous network embedding for identifying symptom candidate genes. J. Am. Med. Inform. Assoc. 25, 1452–1459. doi: 10.1093/jamia/ocy117

PubMed Abstract | CrossRef Full Text | Google Scholar

Yu, L., Chen, H., Dou, Q., Qin, J., and Heng, P. A. (2017). Automated Melanoma Recognition in Dermoscopy Images via Very Deep Residual Networks. IEEE. Trans. Med. Imaging 36, 994–1004. doi: 10.1109/TMI.2016.2642839

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, J., Qian, J., Yang, T., Dong, H. Y., and Wang, R. J. (2019). Analysis and recognition of characteristics of digitized tongue pictures and tongue coating texture based on fractal theory in traditional Chinese medicine. Comput. Assist. Surg. 24, 62–71. doi: 10.1080/24699322.2018.1560081

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, J., Xu, J., Hu, X., Chen, Q., Tu, L., Huang, J., et al. (2017). Diagnostic Method of Diabetes Based on Support Vector Machine and Tongue Images. Biomed. Res. Int. 2017:7961494.

Google Scholar

Zhang, K., Geng, W., and Zhang, S. (2018). Network-based logistic regression integration method for biomarker identification. BMC. Syst. Biol. 12:135. doi: 10.1186/s12918-018-0657-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, S. S., Zhao, L. Q., Wang, H. B., Wu, B., Wang, C. J., Huang, S. P., et al. (2013). Efficacy of Gastrosis No.1 compound on functional dyspepsia of spleen and stomach deficiency-cold syndrome: a multi-center, double-blind, placebo-controlled clinical trial. Chin. J. Integr. Med. 19, 498–504. doi: 10.1007/s11655-013-1503-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Zi, M., Li, R., Lu, F., Li, Q., Zhao, Y., Jia, H., et al. (2021). Clinical Study for Safety Evaluation of GXN Tablets Combined with Aspirin in Long-Term Treatment of Coronary Heart Disease. Evid. Based Complement. Alternat. Med. 2021:6658704.

Google Scholar

Keywords: fatigue, tongue diagnosis, pulse diagnosis, machine learning, intelligent diagnosis

Citation: Shi Y, Yao X, Xu J, Hu X, Tu L, Lan F, Cui J, Cui L, Huang J, Li J, Bi Z and Li J (2022) A New Approach of Fatigue Classification Based on Data of Tongue and Pulse With Machine Learning. Front. Physiol. 12:708742. doi: 10.3389/fphys.2021.708742

Received: 12 May 2021; Accepted: 03 November 2021;
Published: 07 February 2022.

Edited by:

Xu Wang, Beijing University of Chinese Medicine, China

Reviewed by:

Jun Zhang, Institute of Microelectronics, Chinese Academy of Sciences (CAS), China
Tsung-Lin Cheng, National Changhua University of Education, Taiwan

Copyright © 2022 Shi, Yao, Xu, Hu, Tu, Lan, Cui, Cui, Huang, Li, Bi and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Jiatuo Xu, xjt@fudan.edu.cn

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.