2017
DOI: 10.1007/s11042-017-4699-5
|View full text |Cite
|
Sign up to set email alerts
|

Learning laparoscopic video shot classification for gynecological surgery

Abstract: Videos of endoscopic surgery are used for education of medical experts, analysis in medical research, and documentation for everyday clinical life. Hand-crafted image descriptors lack the capabilities of a semantic classification of surgical actions and video shots of anatomical structures. In this work, we investigate how well single-frame convolutional neural networks (CNN) for semantic shot classification in gynecologic surgery work. Together with medical experts, we manually annotate hours of raw endoscopi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
21
1

Year Published

2017
2017
2022
2022

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 53 publications
(22 citation statements)
references
References 29 publications
0
21
1
Order By: Relevance
“…While the aforementioned methods on video classification have achieved state-of-the-art performance on benchmark video recognition datasets like UCF-101 and Sports-1M, work on surgical video recognition has received little attention primarily due to the dearth of large-scale datasets in this domain. Previous works in surgical video analysis have addressed problems like surgical phase recognition [14], surgical gesture classification [15], tool tracking [16], classification of anatomical structures and surgical actions from video shots [17]. The most relevant to this work is the paper by Twinanda et al [4] in which they propose a pipeline for classification of the type of Laparoscopic video, which consists of frame rejection, feature extraction, feature encoding and classification.…”
Section: B Surgery Classificationmentioning
confidence: 99%
“…While the aforementioned methods on video classification have achieved state-of-the-art performance on benchmark video recognition datasets like UCF-101 and Sports-1M, work on surgical video recognition has received little attention primarily due to the dearth of large-scale datasets in this domain. Previous works in surgical video analysis have addressed problems like surgical phase recognition [14], surgical gesture classification [15], tool tracking [16], classification of anatomical structures and surgical actions from video shots [17]. The most relevant to this work is the paper by Twinanda et al [4] in which they propose a pipeline for classification of the type of Laparoscopic video, which consists of frame rejection, feature extraction, feature encoding and classification.…”
Section: B Surgery Classificationmentioning
confidence: 99%
“…With the comparison of adjacent color histograms and thresholds for significant motion changes, they are able to detect such keypoint moments in laparoscopic surgeries. Content classification has also been addressed recently by Petscharnig and Schoeffmann [37,38], who evaluate well-known convolutional neural network architectures for the purpose of semantic segment annotation.…”
Section: Related Workmentioning
confidence: 99%
“…AlexNet is an architecture based on CNNs that has proven successful in scene classification tasks. It is recognized as an excellent basic level, automatic scene classification technology . While a typical CNN pooling process is non‐overlapping, AlexNet, dose in fact have an overlapping pooling process.…”
Section: Introductionmentioning
confidence: 99%
“…It is recognized as an excellent basic level, automatic scene classification technology. 53,54 While a typical CNN pooling process is non-overlapping, AlexNet, dose in fact have an overlapping pooling process. This contributes to a higher classification accuracy because more original information is retained.…”
Section: Introductionmentioning
confidence: 99%