2022
DOI: 10.1007/s11432-021-3396-7
|View full text |Cite
|
Sign up to set email alerts
|

Fast target-aware learning for few-shot video object segmentation

Abstract: Few-shot video object segmentation (FSVOS) aims to segment a specific object throughout a video sequence when only the first-frame annotation is given. In this study, we develop a fast target-aware learning approach for FSVOS, where the proposed approach adapts to new video sequences from its firstframe annotation through a lightweight procedure. The proposed network comprises two models. First, the meta knowledge model learns the general semantic features for the input video image and up-samples the coarse pr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6
1

Relationship

2
5

Authors

Journals

citations
Cited by 15 publications
(3 citation statements)
references
References 44 publications
(105 reference statements)
0
1
0
Order By: Relevance
“…Recent research in matching-based video object segmentation (VOS) [1][2][3][4][5][6][7][8][9][17][18][19][20][21][22][23] has primarily focused on improving network structures, including building more efficient memory [5,6], adopting local matching [7][8][9], and incorporating background context [3,4]. Among them, STM [1] is the pioneer in the field of video object segmentation, which first proposed the design of the memory bank, enabling VOS methods to effectively utilize the temporal information in video sequences.…”
Section: Matching-based Vos Networkmentioning
confidence: 99%
“…Recent research in matching-based video object segmentation (VOS) [1][2][3][4][5][6][7][8][9][17][18][19][20][21][22][23] has primarily focused on improving network structures, including building more efficient memory [5,6], adopting local matching [7][8][9], and incorporating background context [3,4]. Among them, STM [1] is the pioneer in the field of video object segmentation, which first proposed the design of the memory bank, enabling VOS methods to effectively utilize the temporal information in video sequences.…”
Section: Matching-based Vos Networkmentioning
confidence: 99%
“…Convolutional Neural Networks (CNNs) [11,12,18,19] have long been recognized as powerful tools for image analysis due to their ability to learn hierarchical feature representations from raw image data. They have been particularly successful in various segmentation tasks, thanks to their robustness in extracting spatial features from images.…”
Section: Cnn-based Remote Sensing Image Segmentationmentioning
confidence: 99%
“…The self-attention mechanism computes the response at a position as a weighted sum of the features at all positions in the data. This global context-awareness allows the Transformer to better capture intricate spatial structures and long-range dependencies that are characteristic of remote sensing images [19,[29][30][31].…”
Section: Remote Sensing Image Segmentation Based On Self-attention Me...mentioning
confidence: 99%