SummaryAnimal models and archived human biobank tissues are useful resources for research in disease development, diagnostics and therapeutics. For the preservation of microscopic anatomical features and to facilitate long‐term storage, a majority of tissue samples are denatured by the chemical treatments required for fixation, paraffin embedding and subsequent deparaffinization. These aggressive chemical processes are thought to modify the biochemical composition of the sample and potentially compromise reliable spectroscopic examination useful for the diagnosis or biomarking. As a result, spectroscopy is often conducted on fresh/frozen samples. In this study, we provide an extensive characterization of the biochemical signals remaining in processed samples (formalin fixation and paraffin embedding, FFPE) and especially those originating from the anatomical layers of a healthy rat colon. The application of chemometric analytical methods (unsupervised and supervised) was shown to eliminate the need for tissue staining and easily revealed microscopic features consistent with goblet cells and the dense populations of cells within the mucosa, principally via strong nucleic acid signals. We were also able to identify the collagenous submucosa‐ and serosa‐ as well as the muscle‐associated signals from the muscular regions and blood vessels. Applying linear regression analysis to the data, we were able to corroborate this initial assignment of cell and tissue types by confirming the biological origin of each layer by reference to a subset of authentic biomolecular standards. Our results demonstrate the potential of using label‐free Raman microspectroscopy to obtain superior imaging contrast in FFPE sections when compared directly to conventional haematoxylin and eosin (H&E) staining.
Raman Spectroscopy has long been anticipated to augment clinical decision making, such as classifying oncological samples. Unfortunately, the complexity of Raman data has thus far inhibited their routine use in clinical settings. Traditional machine learning models have been used to help exploit this information, but recent advances in deep learning have the potential to improve the field. However, there are a number of potential pitfalls with both traditional and deep learning models. We conduct a literature review to ascertain the recent machine learning methods used to classify cancers using Raman spectral data. We find that while deep learning models are popular, and ostensibly outperform traditional learning models, there are many methodological considerations which may be leading to an over-estimation of performance; primarily, small sample sizes which compound sub-optimal choices regarding sampling and validation strategies. Amongst several recommendations is a call to collate large benchmark Raman datasets, similar to those that have helped transform digital pathology, which researchers can use to develop and refine deep learning models.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.