Expand this Topic clickable element to expand a topic
Skip to content
Optica Publishing Group

Classification accuracy improvement of laser-induced breakdown spectroscopy based on histogram of oriented gradients features of spectral images

Open Access Open Access

Abstract

To improve the classification accuracy of laser-induced breakdown spectroscopy (LIBS), image histogram of oriented gradients (HOG) features method (IHFM) for materials analysis was proposed in this work. 24 rice (Oryza sativa L.) samples were carried out to verify the proposed method. The results showed that the classification accuracy of rice samples by the full-spectra intensities method (FSIM) and IHFM were 60.25% and 81.00% respectively. The classification accuracy was obviously improved by 20.75%. Universality test results showed that this method also achieved good results in the plastics, steel, rock and minerals classification. This study provides an effective method to improve the classification performance of LIBS.

© 2018 Optical Society of America under the terms of the OSA Open Access Publishing Agreement

1. Introduction

Laser-induced breakdown spectroscopy (LIBS) is a convenient and efficient elemental analysis technology. It has been widely used in many fields, such as metals [1–4], mineral [5], rocks [6,7], and water [8,9] etc. Traditional classification methods can be divided into two categories: full-spectra intensities method (FSIM) and multi-spectral lines intensities method (MSLIM) in LIBS analysis. In-depth research and extensive applications on these methods have been reported in recent years. For example, Gottfried et al. [10]studied the classification of simulant residues based on full-spectra feature model and the intensity/ratio model, and their classification accuracy were 54.8% and 47.4%. Ji et al. [11] adopted LIBS assisted FSIM with partial least squares discriminant analysis (PLS-DA) algorithm for Tegillarca granosa discrimination, the classification accuracy was less than 30%. Zhang et al. [12] employed LIBS coupled the FSIM with support vector machine (SVM) and radial basis function neural network (RBFNN) algorithm to classified the coffee. The classification accuracies of the two methods reached about 83.59 and 82.03%, respectively. To improve the generalization performance of the algorithm, some researchers studied the line selection method of MSLIM. For example, Kong et al. [13] proposed three data reduction methods for LIBS, including background normalization, the internal standard method, and the standard normal variate. N. Labbé et al. [14] proposed multivariate analysis method to extract information from LIBS spectral data. In our previous work [15,16], LIBS assisted MSLIM with decision tree (DT), random forest (RF), PLS-DA, linear discriminant analysis (LDA) and SVM algorithms on rice geographic origins classification, and the classification accuracies were 86.8%, 96.3%, 96.8%, 98.6%, and 99.2%, respectively.

Above researches proved the feasibility of FSIM and MSLIM in LIBS classification, but their classification accuracies for some complex samples with uneven surface such as simulant residues [10] and Tegillarca granosa [11] and different varieties of rice samples still can’t meet some important application requirements. Taking rice sample as an example, the annual output of Wuchang high-quality rice in its origin is about 714,000 tons (official data in 2014), but this variety of rice on the market has reached 10 million tons. The identification and classification of rice producing areas or varieties by LIBS technology can effectively prevent food safety [17,18] and seed adulteration problems [19,20] as described above. The reason for unsatisfactory classification performance should be the feature extraction way of the traditional classification methods cannot effectively extract the features difference between samples [21]. Taking MSLIM and FSIM as examples, their classification features are intensities of multiple spectral lines and full-spectral lines, respectively. Both features are one-dimensional with relatively weak characterization ability. However, the inherent relationships between multi-dimensional spectral intensities can also be used to characterize the samples, but few research have referred on this.

The histogram of oriented gradient (HOG) [22] feature is a kind of feature descriptor for images. It has been widely used in computer vision [23] and image pattern recognition [24] due to its powerful ability on pedestrian detection [25]. Its feature extraction of LIBS images can indirectly realize the extraction of multi-dimensional spectral features.

In this work, a classification method IHFM, which combined with image features and SVM algorithm was proposed. A classification of 24 rice samples was conducted to verify the proposed method. To test the universal applicability of IHFM, classification experiment were also carried out on a variety of plastic, steel, rock and mineral samples.

2. Experimental setup and samples

2.1 Experimental setup

The experimental setup of this study is shown in Fig. 1. A Q-switched Nd: YAG laser (Quantel Brilliant, energy: 40 mJ/pulse, wavelength: 532 nm, frequency: 10Hz, pulse width:8ns) was focused at 1.25 mm below the target surface by a 100 mm focal length lens togenerate plasma. The plasma emission was collected by a collector and then transmitted to a spectrometer (Andor Tech., Mechelle 5000, spectral range from 200 to 950 nm with a resolution of λ/Δλ = 5000, wavelength accuracy of ± 0.05 nm) through an optical fiber. The light signal is converted into electrical signal by an ICCD camera (Andor Tech., iStar DH-334T) and output to computer.

 figure: Fig. 1

Fig. 1 LIBS experimental setup.

Download Full Size | PDF

The samples were placed on a 3-axis displacement stage in this experiment. The scanning area of the laser was the upper surface of the sample, with a diameter of about 40.98mm (the edge thickness of the container is ignored) and an area of 1318.3mm2. To collect stable spectra, the displacement stage moved at a speed of 4 mm/s. The time delay between laser pulse signal and spectrometer acquisition was 1.5 μs, and gate width was 3 μs. To reduce spectral fluctuations, 50 pulses were accumulated for each spectra, and 100 spectra were collected for each sample.

2.2 Samples

In experiments, 24 different varieties rice samples were bred by different research institutes in China, as shown in Table 1.

Tables Icon

Table 1. Rice samples information list

There is parental relationship among some of the varieties in the sample. For example, both YX725 and YX3728’s male parents are YX 1A, and the female parent is MiHui series varieties. The male parents of JR 372, SR 16, TR 99 and LJ 8 are all Zhen Dao 88. Due to the similarity of the parental relationship of each samples, it is difficult to classify them by the existing classification methods.

3. Images HOG feature extraction and optimization of LIBS parameters

3.1 Extraction procedure and principle

  • (1) Image generation of spectrum

    Schematic diagram of characteristics spectrum conversion to characteristics image is shown in Fig. 2. In this experiment, the actual effective range of the spectrometer was 200.331-894.514 nm. The effective pixel number of each spectrum was 24262 which were sequentially arranged and written into a data structure of image and then saved as a 16-bit greyscale image in portable network graphics (PNG) format using Matlab's relevant library functions. There were 2400 spectra in this experiment, so 2400 LIBS images could be generated. Based on the optimization result, the number of rows and columns was set to 87 and 275 (the optimization process for this parameter was shown in section 3.2(2)), respectively.

 figure: Fig. 2

Fig. 2 Schematic diagram of characteristics spectrum conversion to characteristics images

Download Full Size | PDF

  • (2) Extraction of HOG features from LIBS images

    HOG feature is a kind of feature descriptor used for object detection in computer vision and image processing. It is constructed by counting histogram of gradient direction in local area of image [22]. In this study, HOG features of LIBS images were acquired using library function features = extractHOGFeatures() of Matlab [26]. The parameters of CellSize (the number of pixels contained in each cell), BlokSize (the number of cells contained in each block), BolckOverlap (the number of block overlaps), and NumBins (the number of histogram channels) were set to 28, 2, 1, 9, respectively.

3.2 Optimization of image column number and image grayscale coefficient

  • (1) Optimization of image column number

    Figure 3(a) showed the relationship between the number of images columns and the classification accuracy. From the principle of extracting the HOG features, the changes of the image pixels arrangement would cause the direction changes of the cell gradient. To obtain the optimal arrangement of pixels, the image generation of the spectrum was optimized firstly. The range of the optimization for image column number was 50-350, and the change steps of image column number was 5. As is shown in Fig. 3(a), with the increase of the number of columns, the classification accuracy increases first and reaches a maximum at 275, and then decreases gradually. Thus, column parameter c was set to 275 in the following classification of rice samples.

  • (2) Optimization of image grayscale coefficient

    Grayscale coefficient is the ratio of the spectral intensity and the gray value of the image. Its mathematical definition is shown as Eq. (1), where g is the grayscale coefficient, and I(X, Y) is the actual spectral intensity corresponding to the pixel image gray value H(X, Y).

    g=I(X,Y)H(X,Y)

    In the image generation procedure, when the spectral intensity was greater than 65535, the 16-bit gray PNG image [27] would be distorted. Because in the library function of Matlab, if the gray value of the written image is greater than 65535, the portion larger than 65535 will be automatically discarded. Therefore, the spectral intensity greater than 65535 cannot be displayed correctly in equal proportion in the image. To solve this problem, the parameter of grayscale coefficient was introduced and optimized in the range of 1-50 and 50-350, and the change steps was 2 and 5 respectively. Figure 3(b) showed the relationship between grayscale coefficient and the classification accuracy. As can be seen from Fig. 3(b), the maximum value appeared when g = 65, and at this point the classification accuracy is 80.83%. Thus, grey coefficient parameter g was set to 65 in the following classification of rice samples.

 figure: Fig. 3

Fig. 3 (a). The relationship between the number of image columns and the classification accuracy (b). The relationship between grayscale coefficient and the classification accuracy.

Download Full Size | PDF

4. Results and discussion

4.1 Comparison of IHFM and FSIM for rice samples classification

As can be seen from the section 2.1 and 3.1(1), a total of 2400 spectra and images would be used for the classification. During the classification, 1200 spectra or images among them were used for training the SVM model [28], and the rest 1200 spectra were used for testing. In addition, the SVM algorithm here and its parameters optimization was implemented by the LIBSVM [29–31] tool in the Matlab environment. The classification results of IHFM and FSIM for rice samples were shown in Fig. 4. Classification accuracies of every sample were listed in Table 2.

 figure: Fig. 4

Fig. 4 Comparison of classification results using FSIM (a) and IHFM (b).

Download Full Size | PDF

Tables Icon

Table 2. Classification accuracies of FSIM and IHFM for 24 varieties of rice samples

From Fig. 4, the coincidence rate between the predicted label and the actual label using IHFM (Fig. 4(b)) was bigger than that of FSIM (Fig. 4(a)). From Table 2, the classification accuracies of the IHFM were obviously better than those of the FSIM for 24 varieties of rice samples and it increased from 60.25% of FSIM to 81.00% of IHFM.

To further verify the classification accuracies of rice samples, a 10-fold cross-validation was adopted in this work. The results of the cross-validation were 60.25% of FSIM and 76.33% of IHFM, respectively.

The main reason that the classification accuracy of IHFM was better than that of FSIM on rice classification may as follows: when HOG features was extracted from LIBS images in IHFM, it essentially obtained the gradient direction relationship of the multi-dimensional spectral intensities, but only one-dimensional spectral intensities was extracted in FSIM. Due to the weak stability (average RSDs of matrix elements C, H, O were over 31.43%) of the LIBS spectra of these rice samples, and the characterization ability of one-dimensional spectral intensity for each sample were weaker than those of the multi-dimensional spectra, so the classification accuracies of FSIM were relatively poor.

The correlation between the characteristic matrices of the samples can indirectly reflect the feasibility of classification. Generally, the higher the correlation of characteristic matrices, the more difficult the classification, and vice versa [32]. The correlation between the characteristic matrices X and Y was expressed by the correlation coefficient ρXY, which was defined as shown in Eq. (2) and Eq. (3).

ρXY=Cov(X,Y)σXσY
Cov(X,Y)=E[(X-μX)(Y-μY)]

In the above equations, ρXY and Cov(X,Y)are the correlation coefficient and the covariance respectively. σX and σY are the standard deviations, and μX and μY are the means of the characteristic matrices.

Taking the GD1 and ZD2 rice samples as examples, the correlation coefficient ρIHFM between the image HOG feature matrices and ρFSIM between the full-spectrum characteristic matrices are 0.8527 and 0.9969, respectively. That is to say, ρIHFM < ρFSIM, so the classification accuracy of IHFM is better than FSIM under same algorithm and their respective optimal parameters.

4.2 Universality test for IHFM

To verify the universality of IHFM, further classification tests were carried out on a variety of plastic [33], steel, rock and mineral samples. Categories numbers of the plastic, steel, rock and mineral samples were 20, 18, 15 and 16, respectively. The number of collected spectra of these samples was 150, 100, 100 and 50 respectively. The values of gray parameters and column parameters were fixed to 10 and 183 for plastic, 1 and 64 for steel, 100 and 64 for rock, 100 and 64 for mineral, respectively.

Figure 5 shows that the classification accuracies of IHFM has reached a high level for all the above samples. Among them, the classification accuracies are more than 99% for both plastic and steel samples, and 97.47% and 91.25% for rock and mineral samples.

 figure: Fig. 5

Fig. 5 Classification result comparison of FSIM and IHFM for other samples.

Download Full Size | PDF

To further verify the classification accuracies of these samples, here is the same as the rice classification with a 10-fold cross-validation. For plastics, steel, rocks and minerals, the cross-validation results of FSIM were 99.44, 97.11, 96.67 and 89.00%, respectively, and the results of IHFM were 99.60, 98.78, 97.60 and 91.50%, respectively.

It was shown that IHFM could achieve good classification results not only for the rice samples of different varieties, but for those samples such as plastics, steels, rocks and minerals. The above indicates that, IHFM has good universality.

5. Conclusions

In summary, IHFM was proposed to improve the classification accuracies of rice samples in LIBS technology. The classification accuracy of rice samples was increased by 20.75%. This method also achieved good results in the plastics, steel, rocks and minerals classification, with accuracies of 99.87, 99.44, 97.47, 91.25%, respectively. Compared to the traditional classification method such as FSIM, whose classification features could only be extracted from one-dimensional spectral intensities. However, IHFM could indirectly extract classification features from multi-dimensional spectral intensities by LIBS image. The features extraction way of the IHFM could effectively reduce the correlation between the characteristic matrices of the samples, so that the classification accuracies could be improved. IHFM provided an effective solution to the problem of qualitative analysis based on LIBS technology. The results demonstrated that the combination of LIBS technology with image processing technology and pattern recognition technology may promote LIBS analytical performance.

Funding

National Special Fund for the Development of Major Research Equipment and Instruments (No. 2011YQ160017); National Natural Science Foundation of China (No. 61575073); China Postdoctoral Science Foundation (No.2018M632838).

References

1. Z. Chen, H. Li, M. Liu, and R. Li, “Fast and sensitive trace metal analysis in aqueous solutions by laser-induced breakdown spectroscopy using wood slice substrates,” Spectrochim. Acta B At. Spectrosc. 63(1), 64–68 (2008). [CrossRef]  

2. R. Yi, X. Yang, R. Zhou, J. Li, H. Yu, Z. Hao, L. Guo, X. Li, Y. Lu, and X. Zeng, “Determination of Trace Available Heavy Metals in Soil Using Laser-Induced Breakdown Spectroscopy Assisted with Phase Transformation Method,” Anal. Chem. 90(11), 7080–7085 (2018). [CrossRef]   [PubMed]  

3. C. Li, Z. Hao, Z. Zou, R. Zhou, J. Li, L. Guo, X. Li, Y. Lu, and X. Zeng, “Determinations of trace boron in superalloys and steels using laser-induced breakdown spectroscopy assisted with laser-induced fluorescence,” Opt. Express 24(8), 7850–7857 (2016). [CrossRef]   [PubMed]  

4. P. Zhang, L. Sun, H. Yu, P. Zeng, L. Qi, and Y. Xin, “An Image Auxiliary Method for Quantitative Analysis of Laser-Induced Breakdown Spectroscopy,” Anal. Chem. 90(7), 4686–4694 (2018). [CrossRef]   [PubMed]  

5. M. Sweetapple, S. Tassios, and D. Body, “In situ analysis and mapping of lithium in Li-bearing pegmatite minerals By Laser-Induced Breakdown Spectroscopy (LIBS),” in PEG2015 poster, pp.105–106.

6. G. S. Senesi, B. Campanella, E. Grifoni, S. Legnaioli, G. Lorenzetti, S. Pagnotta, F. Poggialini, V. Palleschi, and O. De Pascale, “Elemental and mineralogical imaging of a weathered limestone rock by double-pulse micro-Laser-Induced Breakdown Spectroscopy,” Spectrochim. Acta B At. Spectrosc. 143, 91–97 (2018). [CrossRef]  

7. W. T. Li, Y. N. Zhu, X. Li, Z. Q. Hao, L. B. Guo, X. Y. Li, X. Y. Zeng, and Y. F. Lu, “In situ classification of rocks using stand-off laser-induced breakdown spectroscopy with a compact spectrometer,” J. Anal. At. Spectrom. 33(3), 461–467 (2018). [CrossRef]  

8. T. Karpate, S. K. M. Muhammed, R. Nayak, V. K. Unnikrishnan, and C. Santhosh, “LIBS: a potential tool for industrial/agricultural waste water analysis,” Proc. SPIE 9893, 989317 (2016). [CrossRef]  

9. J. Kang, R. Li, Y. Wang, Y. Chen, and Y. Yang, “Ultrasensitive detection of trace amounts of lead in water by LIBS-LIF using a wood-slice substrate as a water absorber,” J. Anal. At. Spectrom. 32(11), 2292–2299 (2017). [CrossRef]  

10. J. L. Gottfried, “Discrimination of biological and chemical threat simulants in residue mixtures on multiple substrates,” Anal. Bioanal. Chem. 400(10), 3289–3301 (2011). [CrossRef]   [PubMed]  

11. G. Ji, P. Ye, Y. Shi, L. Yuan, X. Chen, M. Yuan, D. Zhu, X. Chen, X. Hu, and J. Jiang, “Laser-Induced Breakdown Spectroscopy for Rapid Discrimination of Heavy-Metal-Contaminated Seafood Tegillarca granosa,” Sensors (Basel) 17(11), 2655 (2017). [CrossRef]   [PubMed]  

12. C. Zhang, T. Shen, F. Liu, and Y. He, “Identification of Coffee Varieties Using Laser-Induced Breakdown Spectroscopy and Chemometrics,” Sensors (Basel) 18(2), 95 (2017). [CrossRef]   [PubMed]  

13. H. Y. Kong, L. X. Sun, J. T. Hu, Y. Xin, and Z. B. Cong, “Quantitative Analysis of Steels Using PLS with Three Data Reduction Methods Based on LIBS,” Adv. Mat. Res. 997, 578–582 (2014).

14. N. Labbé, I. M. Swamidoss, N. André, M. Z. Martin, T. M. Young, and T. G. Rials, “Extraction of information from laser-induced breakdown spectroscopy spectral data by multivariate analysis,” Appl. Opt. 47(31), G158–G165 (2008). [CrossRef]   [PubMed]  

15. P. Yang, R. Zhou, W. Zang, S. S. Tang, Z. Q. Hao, X. Y. Li, Y. F. Lu, and X. Y. Zeng, “Laser-induced breakdown spectroscopy assisted chemometric methods for rice geographic origin classification,” Appl. Opt. 57(28), 8297–8302 (2018).

16. P. Yang, Y. N. Zhu, X. Y. Yang, J. M. Li, S. S. Tang, Z. Q. Hao, L. B. Guo, X. Y. Li, X. Y. Zeng, and Y. F. Lu, “Evaluation of sample preparation methods for rice geographic origin classification using laser-induced breakdown spectroscopy,” J. Cereal Sci. 80, 111–118 (2018). [CrossRef]  

17. S. Moncayo, S. Manzoor, J. D. Rosales, J. Anzano, and J. O. Caceres, “Qualitative and quantitative analysis of milk for the detection of adulteration by Laser Induced Breakdown Spectroscopy (LIBS),” Food Chem. 232, 322–328 (2017). [CrossRef]   [PubMed]  

18. P. Yang, R. Zhou, W. Zhang, R. Yi, S. Tang, L. Guo, Z. Hao, X. Li, Y. Lu, and X. Zeng, “High-sensitivity determination of cadmium and lead in rice using laser-induced breakdown spectroscopy,” Food Chem. 272, 323–328 (2019). [CrossRef]  

19. H. Li, Y. E. Wang, Q. Liu, and D. H. Xu, “Varieties discrimination of rice seed based on laser induced breakdown spectroscopy of subsection combination,” Laser Journal 38, 8–12 (2017).

20. Z. Q. Ke, Y. E. Wang, H. Fan, Q. Li, Liu, and J. H. Lin,”Identification of Rice Seed Varieties Based on LIBS,” Laser Journal 37, 56–60 (2016).

21. S. J. M. Rad, F. A. Tab, and K. Mollazade, “Application of imperialist competitive algorithm for feature selection a case study on bulk rice classification,” Int. J. Comput. Appl. 40(1), 41–48 (2012).

22. N. Dalal and B. Triggs, “Histograms of oriented gradients for human detection,” IEEE Computer Society Conference on Computer Vision and Pattern Recognition (IEEE, 2005), pp. 886–893.

23. N. Dalal, B. Triggs, and C. Schmid, “Human Detection Using Oriented Histograms of Flow and Appearance,” in Computer Vision – ECCV 2006 (Springer Berlin Heidelberg, 2006), 428–441.

24. S. A. M. Al-Sumaidaee, M. A. M. Abdullah, R. R. O. Al-Nima, S. S. Dlay, and J. A. Chambers, “Multi-gradient features and elongated quinary pattern encoding for image-based facial expression recognition,” Pattern Recognit. 71, 249–263 (2017). [CrossRef]  

25. C. Cai, Y. Wang, Z. Cao, Y. Zhang, T. Lu, and D. Wang, “HOG pedestrian detection based on edge symmetry and trilinear interpolation,” 7 (2018).

26. R. Kadota, H. Sugano, M. Hiromoto, H. Ochi, R. Miyamoto, and Y. Nakamura, “Hardware Architecture for HOG Feature Extraction,” in 2009 Fifth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2009), 1330–1333.

27. Y. Wang, J. Du, X. Cheng, Z. Liu, and K. Lin, “Degradation and encryption for outsourced PNG images in cloud storage,” International Journal of Grid and Utility Computing 7(1), 22–28 (2016). [CrossRef]  

28. A. Sophian and D. Aini, “Fingertip Detection Using Histogram of Gradients and Support Vector Machine,” Indonesian Journal of Electrical Engineering and Computer Science 8, 482–486 (2017).

29. C. W. Hsu and C. J. Lin, “A comparison of methods for multiclass support vector machines,” IEEE Trans. Neural Netw. 13(2), 415–425 (2002). [CrossRef]   [PubMed]  

30. C. W. Hsu and C. J. Lin, “A pratical guide for support vector matchine,” 1–16 (2003).

31. H. T. Lin and C. J. Lin, “A Study on Sigmoid Kernels for SVM and the Training of non-PSD Kernels by SMO-type Methods,” Neural Comput. 3, 1–32 (2003).

32. M. Zhang, “Eeg feature extraction based on non-negative matrix factorization,” (Yanshan University 2014).

33. E. Negre, V. Motto-Ros, F. Pelascini, and J. Yu, “Classification of plastic materials by imaging laser-induced ablation plumes,” Spectrochim. Acta B At. Spectrosc. 122, 132–141 (2016). [CrossRef]  

Cited By

Optica participates in Crossref's Cited-By Linking service. Citing articles from Optica Publishing Group journals and other participating publishers are listed here.

Alert me when this article is cited.


Figures (5)

Fig. 1
Fig. 1 LIBS experimental setup.
Fig. 2
Fig. 2 Schematic diagram of characteristics spectrum conversion to characteristics images
Fig. 3
Fig. 3 (a). The relationship between the number of image columns and the classification accuracy (b). The relationship between grayscale coefficient and the classification accuracy.
Fig. 4
Fig. 4 Comparison of classification results using FSIM (a) and IHFM (b).
Fig. 5
Fig. 5 Classification result comparison of FSIM and IHFM for other samples.

Tables (2)

Tables Icon

Table 1 Rice samples information list

Tables Icon

Table 2 Classification accuracies of FSIM and IHFM for 24 varieties of rice samples

Equations (3)

Equations on this page are rendered with MathJax. Learn more.

g = I ( X , Y ) H ( X , Y )
ρ X Y = C o v ( X , Y ) σ X σ Y
C o v ( X , Y ) = E [ ( X - μ X ) ( Y - μ Y ) ]
Select as filters


Select Topics Cancel
© Copyright 2024 | Optica Publishing Group. All rights reserved, including rights for text and data mining and training of artificial technologies or similar technologies.