The purpose of this paper is to use terahertz (THz) spectroscopy combined with manifold learning and improved support vector machine (SVM) model to identify the coumarin-based food additives. The 216 THz absorbance spectra (144 for calibration set and 72 for prediction set) of six coumarin-based food additives are measured by using THz time-domain spectroscopy (THz-TDS) in the range of 0.5-2.0 THz. The method (P-t-SNE) combined principal component analysis (PCA) with manifold learning t-distributed stochastic neighbor embedding (t-SNE) is used for feature extraction of the THz spectra. Then, an improved SVM using differential evolution (DE) to improve gray wolf optimization (GWO) to optimize parameters is proposed. Finally, the result shows that the prediction set accuracy of PCA-DEGWO-SVM, P-t-SNE-DEGWO-SVM, and P-t-SNE-GWO-SVM models are 97.22%, 98.61%, and 95.83%, respectively, indicating that the accuracy by P-t-SNE is increased by about 1.39% compared with that processed by PCA, and the accuracy by DEGWO is also increased by about 2.78% compared with that processed by GWO. In conclusion, the improved model (Pt-SNE-DEGWO-SVM) has the best identification effect, and it is proved to be an effective method to identify coumarin-based food additives.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.