A new graphically oriented local modeling procedure called interval partial least-squares ( iPLS) is presented for use on spectral data. The iPLS method is compared to full-spectrum partial least-squares and the variable selection methods principal variables (PV), forward stepwise selection (FSS), and recursively weighted regression (RWR). The methods are tested on a near-infrared (NIR) spectral data set recorded on 60 beer samples correlated to original extract concentration. The error of the full-spectrum correlation model between NIR and original extract concentration was reduced by a factor of 4 with the use of iPLS ( r = 0.998, and root mean square error of prediction equal to 0.17% plato), and the graphic output contributed to the interpretation of the chemical system under observation. The other methods tested gave a comparable reduction in the prediction error but suffered from the interpretation advantage of the graphic interface. The intervals chosen by iPLS cover both the variables found by FSS and all possible combinations as well as the variables found by PV and RWR, and iPLS is still able to utilize the first-order advantage.
It is nowadays widely accepted that genetic algorithms (GAs) are powerful tools in variable selection and that after suitable modifications they can also be powerful in detecting the most relevant spectral regions for multivariate calibration. One of the main limitations of GAs is related to the fact that when spectral intensities are measured at a very large number of wavelengths the search domain increases correspondingly and therefore the detection of the relevant regions is much more difficult. A modification of interval partial least squares (iPLS), designated backward interval PLS (biPLS), is developed and studied such that it can detect and remove the least relevant regions, thereby reducing the search domain to a size that GAs can handle easily. In this paper the application to two different spectroscopic data sets will be shown: infrared spectroscopic analysis of polymer film additives and determination of the contents of erucic acid and total fatty acids in brassica seeds by near-infrared spectroscopy. The developed method is compared with model performances based on expert selection of variables as well as with results from application of the previously developed GA-PLS method. The sequential application of biPLS and GA-PLS has proven successful, and comparable or better results have been obtained, introducing a more automatic region selection procedure and a substantial decrease in computation time.
In this study, near-infrared (NIR) transmittance and Raman spectroscopy chemometric calibrations of the active substance content of a pharmaceutical tablet were developed using partial least-squares regression (PLS). Although the active substance contained the strongly Raman active C≡N functional group, the best results were obtained with NIR transmittance, which highlights the difference between (microscopic) surface sampling and whole tablet diffuse transmittance sampling. The tablets exist in four dosages with only two different concentrations of active substance (5 mg (5.6% w/w), and 10, 15, and 20 mg (8.0% w/w) active substance per tablet). A calibration on all four dosages resulted in a prediction error expressed as the root mean squared error of cross-validation (RMSECV) of 0.30% w/w for the NIR transmittance calibration. The corresponding error when using Raman spectra was 0.56% w/w. Specially prepared calibration batches covering the range 85–115% of the nominal content for each dosage were added to the first sample set, and NIR transmittance calibrations on this set—containing coated as well as uncoated tablets—gave a further reduction in prediction errors to 0.21–0.289% w/w. This corresponds to relative prediction errors (RMSECV/ynom) of 2.6–3.7%. This is a reasonably low error when compared to the error of the chromatographic reference method, which was estimated to 3.5%.
A cytotoxic T lymphocyte (CTL) clone generated in vitro from the peripheral blood of a healthy HLA-A2-positive individual against a synthetic p53 protein-derived wild-type peptide (L9V) was shown to kill squamous carcinoma cell lines derived from two head and neck carcinomas, which expressed mutant p53 genes, in a L9V͞HLA-A2 specific and restricted fashion. Thus, the normal tolerance against endogenously processed p53 protein-derived self-epitopes can be broken by peptide-specific in vitro priming. p53 proteinderived wild-type peptides might thus represent tumor associated target molecules for immunotherapeutical approaches.
A modification of the standard Canonical Variates Analysis (CVA) method to cope with collinear high-dimensional data is developed. The method utilizes Partial Least Squares regression as an engine for solving an eigenvector problem involving singular covariance matrices. Three data sets are analyzed to demonstrate the properties of the method: a two-group problem with near infrared spectroscopic data consisting of 60 samples and 376 variables, a multi-group problem with fluorescence spectroscopic data (1023 variables) consisting of 83 samples from six groups and a three-group problem with physical-chemical data (10 variables) consisting of 41 samples from three groups. It is demonstrated that the modified CVA method forces the discriminative information into the first canonical variates as expected. The weight vectors found in the modified CVA method possess the same properties as weight vectors of the standard CVA method. By combination of the suggested method with, for example, Linear Discriminant Analysis (LDA) as a classifier, an operational tool for classification and discrimination of collinear data is obtained.
Export of cocoa beans is of great economic importance in Ghana and several other tropical countries. Raw cocoa has an astringent, unpleasant taste, and flavor, and has to be fermented, dried, and roasted to obtain the characteristic cocoa flavor and taste. In an attempt to obtain a deeper understanding of the changes in the cocoa beans during fermentation and investigate the possibility of future development of objective methods for assessing the degree of fermentation, a novel combination of methods including cut test, colorimetry, fluorescence spectroscopy, NIR spectroscopy, and GC-MS evaluated by chemometric methods was used to examine cocoa beans sampled at different durations of fermentation and samples representing fully fermented and dried beans from all cocoa growing regions of Ghana. Using colorimetry it was found that samples moved towards higher a* and b* values as fermentation progressed. Furthermore, the degree of fermentation could, in general, be well described by the spectroscopic methods used. In addition, it was possible to link analysis of volatile compounds with predictions of fermentation time. Fermented and dried cocoa beans from the Volta and the Western regions clustered separately in the score plots based on colorimetric, fluorescence, NIR, and GC-MS indicating regional differences in the composition of Ghanaian cocoa beans. The study demonstrates the potential of colorimetry and spectroscopic methods as valuable tools for determining the fermentation degree of cocoa beans. Using GC-MS it was possible to demonstrate the formation of several important aroma compounds such 2-phenylethyl acetate, propionic acid, and acetoin and the breakdown of others like diacetyl during fermentation. Practical Application: The present study demonstrates the potential of using colorimetry and spectroscopic methods as objective methods for determining cocoa bean quality along the processing chain. Development of objective methods for determining cocoa bean quality will be of great importance for quality insurance within the fields of cocoa processing and raw material control in chocolate producing companies.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.