Completely labeled pathology datasets are often challenging and time-consuming to obtain. Semi-supervised learning (SSL) methods are able to learn from fewer labeled data points with the help of a large number of unlabeled data points. In this paper, we investigated the possibility of using clustering analysis to identify the underlying structure of the data space for SSL. A cluster-then-label method was proposed to identify high-density regions in the data space which were then used to help a supervised SVM in finding the decision boundary. We have compared our method with other supervised and semi-supervised state-of-the-art techniques using two different classification tasks applied to breast pathology datasets. We found that compared with other state-of-the-art supervised and semi-supervised methods, our SSL method is able to improve classification performance when a limited number of labeled data instances are made available. We also showed that it is important to examine the underlying distribution of the data space before applying SSL techniques to ensure semi-supervised learning assumptions are not violated by the data.
Neoadjuvant treatment (NAT) of breast cancer (BCa) is an option for patients with the locally advanced disease. It has been compared with standard adjuvant therapy with the aim of improving prognosis and surgical outcome. Moreover, the response of the tumor to the therapy provides useful information for patient management. The pathological examination of the tissue sections after surgery is the gold-standard to estimate the residual tumor and the assessment of cellularity is an important component of tumor burden assessment. In the current clinical practice, tumor cellularity is manually estimated by pathologists on hematoxylin and eosin (H&E) stained slides, the quality, and reliability of which might be impaired by inter-observer variability which potentially affects prognostic power assessment in NAT trials. This procedure is also qualitative and time-consuming. In this paper, we describe a method of automatically assessing cellularity. A pipeline to automatically segment nuclei figures and estimate residual cancer cellularity from within patches and whole slide images (WSIs) of BCa was developed. We have compared the performance of our proposed pipeline in estimating residual cancer cellularity with that of two expert pathologists. We found an intra-class agreement coefficient (ICC) of 0.89 (95% CI of [0.70, 0.95]) between pathologists, 0.74 (95% CI of [0.70, 0.77]) between pathologist #1 and proposed method, and 0.75 (95% CI of [0.71, 0.79]) between pathologist #2 and proposed method. We have also successfully applied our proposed technique on a WSI to locate areas with high concentration of residual cancer. The main advantage of our approach is that it is fully automatic and can be used to find areas with high cellularity in WSIs. This provides a first step in developing an automatic technique for post-NAT tumor response assessment from pathology slides.
BackgroundThe molecular etiology of thyroid carcinoma (TC) and other thyroid diseases which may present malignant precursor lesions is not fully explored yet. The purpose of this study was to estimate frequency, type and clinicopathological value of BRAF exon 15 mutations in different types of cancerous and non-cancerous thyroid lesions originating in an ethnically diverse population.MethodsBRAF exon 15 was sequenced in 381 cases of thyroid lesions including Hashimoto´s thyroiditis, nodular goiters, hyperplastic nodules, follicular adenomas (FA), papillary TC (PTC), follicular variant PTC (FVPTC), microcarcinomas of PTC (micro PTC; tumor size ≤ 1 cm), follicular TC (FTC), and non-well differentiated TC (non-WDTC).ResultsWe identified BRAF mutations in one of 69 FA, 72 of 115 (63%) PTC, seven of 42 (17%) FVPTC, 10 of 56 (18%) micro PTC, one of 17 (6%) FTC, and one of eight (13%) non-WDTC. Most of the cases showed the common V600E mutation. One case each of PTC, FVPTC, and FTC harbored a K601E mutation. A novel BRAF mutation was identified in a FA leading to deletion of threonine at codon 599 (p.T599del). A rare 3-base pair insertion was detected in a stage III PTC resulting in duplication of threonine at codon 599 (p.T599dup). Patients with PTC harboring no BRAF mutation (BRAFwt) were on average younger than those with a BRAF mutation (BRAFmut) in the PTC (36.6 years vs. 43.8 years). Older age (≥ 45 years) in patients with PTC was significantly associated with tumor size ≥ 4 cm (P = 0.018), vessel invasion (P = 0.004), and distant metastasis (P = 0.001). Lymph node (LN) involvement in PTC significantly correlated with tumor size (P = 0.044), and vessel invasion (P = 0.013). Of notice, taken the whole TC group, family history of thyroid disease positively correlated with capsular invasion (P = 0.025).ConclusionsOlder age is manifold associated with unfavorable tumor markers in our series. The K601E identified in a PTC, FVPTC, and FTC seems to be more distributed among different histological types of TC than previously thought. The T599del is a yet undescribed mutation and the rare T599dup has not been reported as a mutation in PTC so far.
The residual cancer burden index is an important quantitative measure used for assessing treatment response following neoadjuvant therapy for breast cancer. It has shown to be predictive of overall survival and is composed of two key metrics: qualitative assessment of lymph nodes and the percentage of invasive or in situ tumour cellularity (TC) in the tumour bed (TB). Currently, TC is assessed through eye-balling of routine histopathology slides estimating the proportion of tumour cells within the TB. With the advances in production of digitized slides and increasing availability of slide scanners in pathology laboratories, there is potential to measure TC using automated algorithms with greater precision and accuracy. We describe two methods for automated TC scoring: 1) a traditional approach to image analysis development whereby we mimic the pathologists’ workflow, and 2) a recent development in artificial intelligence in which features are learned automatically in deep neural networks using image data alone. We show strong agreements between automated and manual analysis of digital slides. Agreements between our trained deep neural networks and experts in this study (0.82) approach the inter-rater agreements between pathologists (0.89). We also reveal properties that are captured when we apply deep neural network to whole slide images, and discuss the potential of using such visualisations to improve upon TC assessment in the future.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.