In this paper, we propose a novel feature map compression method for Video Coding for Machines (VCM). The proposed method performs a principal component analysis (PCA)-based transform on feature pyramid network (FPN) feature maps using predefined basis and mean vectors. In addition, the proposed method reduces redundancy between different resolution levels within FPN feature maps based on redundancy between FPN layers. The fixed predefined basis and mean are employed through PCA with a set of training data set. For any input videos, transform coefficients are obtained by performing transform with the fixed basis and compressed using Versatile Video Coding (VVC). Experimental results show that the proposed method achieves 89.22% and 86.57% BD-rate gain compared to the VCM feature anchor in instance segmentation, and object detection, respectively.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.