2022
DOI: 10.1007/978-3-031-16449-1_42
|View full text |Cite
|
Sign up to set email alerts
|

Towards Holistic Surgical Scene Understanding

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
11
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 17 publications
(13 citation statements)
references
References 22 publications
0
11
0
Order By: Relevance
“…In the same way, we design class-specific score thresholds to counter the high inter-class score variability caused by the class frequency miss-balance. The following section explains how we use the video analysis methodology of [33] to improve our segment classification.…”
Section: Matismentioning
confidence: 99%
See 4 more Smart Citations
“…In the same way, we design class-specific score thresholds to counter the high inter-class score variability caused by the class frequency miss-balance. The following section explains how we use the video analysis methodology of [33] to improve our segment classification.…”
Section: Matismentioning
confidence: 99%
“…Temporal Consistency: Following TAPIR [33], our temporal consistency module uses a Multi-Scale Vision Transformer (MViT) [31] as a backbone for video analysis. TAPIR uses a time window centered on a keyframe to compute global spatio-temporal features that encode the complex temporal context of the middle frame.…”
Section: Matismentioning
confidence: 99%
See 3 more Smart Citations