2022
DOI: 10.1007/978-3-031-16449-1_38

Instrument-tissue Interaction Quintuple Detection in Surgery Videos

Abstract: Instrument-tissue interaction detection task, which helps understand surgical activities, is vital for constructing computer-assisted surgery systems but with many challenges. Firstly, most models represent instrument-tissue interaction in a coarse-grained way which only focuses on classification and lacks the ability to automatically detect instruments and tissues. Secondly, existing works do not fully consider relations between intra- and inter-frame of instruments and tissues. In the paper, we propose to repr…

Cited by 5 publications (2 citation statements)
References 33 publications
“…Xu et al (2021) employed a transformer model along with adversarial learning to generate captions, akin to triplets, depicting semantic relationships between components involved in a surgical scene. Lin et al (2022) assigned instrument and target bounding boxes to triplet information and utilized a spatio-temporal graph for instrument-target interaction detection in cataract surgery.…”
Section: Action Triplet: From Recognition To Detection (mentioning)
confidence: 99%
“…Actual action localization was offered by the SARAS-ESAD dataset (Bawa et al, 2021; Lin et al, 2022), with bounding boxes pointing to action verbs being performed.…”
Section: Datasets: From Recognition To Detection (mentioning)
confidence: 99%