2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP) 2021
DOI: 10.1109/mlsp52302.2021.9596389
|View full text |Cite
|
Sign up to set email alerts
|

Detecting Cover Songs with Pitch Class Key-Invariant Networks

Abstract: Deep Learning (DL) has recently been applied successfully to the task of Cover Song Identification (CSI). Meanwhile, neural networks that consider music signal data structure in their design have been developed. In this paper, we propose a Pitch Class Key-Invariant Network, PiCKINet, for CSI. Like some other CSI networks, PiCKINet inputs a Constant-Q Transform (CQT) pitch feature. Unlike other such networks, large multi-octave kernels produce a latent representation with pitch class dimensions that are maintai… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 23 publications
(45 reference statements)
0
2
0
Order By: Relevance
“…• CQTNet [63]: CQTNet uses carefully designed kernels and dilated convolutions to extend the receptive field, which can improve the model's representation learning capacity. • PICKiNet [33]: PICKiNet devises pitch class blocks to obtain the key-invariant musical features.…”
Section: Comparison Baselinesmentioning
confidence: 99%
“…• CQTNet [63]: CQTNet uses carefully designed kernels and dilated convolutions to extend the receptive field, which can improve the model's representation learning capacity. • PICKiNet [33]: PICKiNet devises pitch class blocks to obtain the key-invariant musical features.…”
Section: Comparison Baselinesmentioning
confidence: 99%
“…Byte-Cover [6] and ByteCover2 [7] achieved state-of-the-art results with a ResNet-IBN50 [8] backbone and multi-loss training with cross-entropy and triplet loss. PiCKINet [9] proposed Pitch Class Blocks in order to maintain the key-invariance features of music. LyraC-Net [10] utilized WideResNet as the backbone, and combined classification and metric learning for optimization.…”
Section: Introductionmentioning
confidence: 99%