“…Prior to performing PCA, each spectrum was pre-processed by calculating 2nd derivatives using a 9-point Savitzky-Golay algorithm (Savitzky & Golay, 1964), and subsequently corrected by extended multiplicative scatter correction (EMSC) method (Kohler, Afseth, & Martens, 2010; Kohler, Kirschner, Oust, & Martens, 2005; Martens, Nielsen, & Engelsen, 2003) using only the spectral ranges that contain band information relevant to the lipid and protein components (i.e., 3100–2800 and 1800–950 cm⁻¹). Outliers were removed based upon the samples with high residual variance across all principal components (PCs).…”
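
The derivative step described above can be illustrated with a minimal sketch. It is not the authors' implementation: the excerpt gives the window size (9 points) but not the polynomial order, so the sketch assumes a quadratic/cubic fit, for which the second-derivative weights are (3i² − 20)/462 for i = −4…4. It also assumes evenly spaced wavenumbers. The names `SG9_D2`, `second_derivative`, and `spacing` are hypothetical, and the EMSC and PCA outlier steps are not shown.

```python
# Assumption: quadratic/cubic Savitzky-Golay fit over a 9-point window.
# The second-derivative weights are then (3*i**2 - 20) / 462 for i = -4..4,
# i.e. [28, 7, -8, -17, -20, -17, -8, 7, 28] / 462.
SG9_D2 = [(3 * i * i - 20) / 462 for i in range(-4, 5)]


def second_derivative(spectrum, spacing=1.0):
    """Return the 9-point Savitzky-Golay second derivative of a spectrum.

    `spectrum` is a sequence of absorbance values sampled at evenly spaced
    wavenumbers; `spacing` is the distance between neighbouring points
    (for example in cm^-1). Only points with a full 9-point window are
    returned, so the result is 8 values shorter than the input.
    """
    half = len(SG9_D2) // 2
    if len(spectrum) < len(SG9_D2):
        raise ValueError("spectrum must contain at least 9 points")

    derivative = []
    for centre in range(half, len(spectrum) - half):
        window = spectrum[centre - half : centre + half + 1]
        # Weighted sum gives the second derivative per squared index step.
        value = sum(w * y for w, y in zip(SG9_D2, window))
        # Rescale from index units to the actual sampling interval.
        derivative.append(value / spacing ** 2)
    return derivative
```

For a purely quadratic signal such as y = x² sampled every 2 cm⁻¹, calling `second_derivative(y, spacing=2.0)` returns values equal to 2 up to floating-point rounding, which is a quick sanity check on the weights. How the edge points are handled (dropped here) is a design choice that the excerpt does not specify.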