Data fusion of multiple‐information strategy based on Fourier transform near infrared spectroscopy and Fourier‐transform mid infrared for geographical traceability of Wolfiporia cocos combined with chemometrics
Abstract:Owing to the widespread concern relating to herb safety and quality, there is a momentum to discriminate the geographical traceability of fungus with multiple‐information technologies. In this study, we attempted to evaluate the fusion strategy of multiple‐information for the geographical traceability of this fungus based on Fourier transform near infrared spectroscopy (FT‐NIR) and Fourier‐transform mid infrared spectroscopy (FT‐MIR) with chemometrics. From all results, (1) comparative visualization of t‐distr… Show more
“…When it comes to diagnosing cancer, these numbers are simply too low not to mention that other studies dealing with colorectal cancer data showed a stable higher than 90% accuracy for as the result of classification, albeit with different machine learning methods 41 . Meanwhile, projects revolving around FT‐IR techniques and spectrum level classification via traditional models or modified neural networks managed to reach similar or higher efficiency scores as well 42–47 . However, in some cases, these studies put a lot more focus on the preprocessing of the spectra or even using the different derivatives as input for the machine learning models, which could definitely be a viable option for us to better our results.…”
Section: Resultsmentioning
confidence: 99%
“…41 Meanwhile, projects revolving around FT-IR techniques and spectrum level classification via traditional models or modified neural networks managed to reach similar or higher efficiency scores as well. [42][43][44][45][46][47] However, in some cases, these studies put a lot more focus on the preprocessing of the spectra or even using the different derivatives as input for the machine learning models, which could definitely be a viable option for us to better our results. The accuracy can be improved by adding new slides to the cohort and increasing the number of spectra.…”
In this project, we used formalin‐fixed paraffin‐embedded (FFPE) tissue samples to measure thousands of spectra per tissue core with Fourier transform mid‐infrared spectroscopy using an FT‐IR imaging system. These cores varied between normal colon (NC) and colorectal primer carcinoma (CRC) tissues. We created a database to manage all the multivariate data obtained from the measurements. Then, we applied classifier algorithms to identify the tissue based on its yielded spectra. For classification, we used the random forest, a support vector machine, XGBoost, and linear discriminant analysis methods, as well as three deep neural networks. We compared two data manipulation techniques using these models and then applied filtering. In the end, we compared model performances via the sum of ranking differences (SRD).
“…When it comes to diagnosing cancer, these numbers are simply too low not to mention that other studies dealing with colorectal cancer data showed a stable higher than 90% accuracy for as the result of classification, albeit with different machine learning methods 41 . Meanwhile, projects revolving around FT‐IR techniques and spectrum level classification via traditional models or modified neural networks managed to reach similar or higher efficiency scores as well 42–47 . However, in some cases, these studies put a lot more focus on the preprocessing of the spectra or even using the different derivatives as input for the machine learning models, which could definitely be a viable option for us to better our results.…”
Section: Resultsmentioning
confidence: 99%
“…41 Meanwhile, projects revolving around FT-IR techniques and spectrum level classification via traditional models or modified neural networks managed to reach similar or higher efficiency scores as well. [42][43][44][45][46][47] However, in some cases, these studies put a lot more focus on the preprocessing of the spectra or even using the different derivatives as input for the machine learning models, which could definitely be a viable option for us to better our results. The accuracy can be improved by adding new slides to the cohort and increasing the number of spectra.…”
In this project, we used formalin‐fixed paraffin‐embedded (FFPE) tissue samples to measure thousands of spectra per tissue core with Fourier transform mid‐infrared spectroscopy using an FT‐IR imaging system. These cores varied between normal colon (NC) and colorectal primer carcinoma (CRC) tissues. We created a database to manage all the multivariate data obtained from the measurements. Then, we applied classifier algorithms to identify the tissue based on its yielded spectra. For classification, we used the random forest, a support vector machine, XGBoost, and linear discriminant analysis methods, as well as three deep neural networks. We compared two data manipulation techniques using these models and then applied filtering. In the end, we compared model performances via the sum of ranking differences (SRD).
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.