2017
DOI: 10.1109/tmm.2017.2723846
|View full text |Cite
|
Sign up to set email alerts
|

Audio Identification by Sampling Sub-fingerprints and Counting Matches

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
14
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(15 citation statements)
references
References 13 publications
1
14
0
Order By: Relevance
“…In terms of robustness analysis, this experiment uses the feature dimension reduction algorithms of Section 3.2.2 and Section 3.2.3 to extract MFCC feature and LPCC feature of the original speech to construct audio fingerprint, this paper compares the audio fingerprint based on the combined features with the audio fingerprints of two features and the existing audio fingerprint methods of [6,7,22,23,25,27], where [6,7] is an improved method based on Shazam fingerprint, [22,23,25,27] is an improved method based on Philips fingerprint. As shown in Table 1, the combined feature is more robust than MFCC and LPCC that under the same feature dimension reduction process.…”
Section: Robustness and Retrieval Performance Analysismentioning
confidence: 99%
See 3 more Smart Citations
“…In terms of robustness analysis, this experiment uses the feature dimension reduction algorithms of Section 3.2.2 and Section 3.2.3 to extract MFCC feature and LPCC feature of the original speech to construct audio fingerprint, this paper compares the audio fingerprint based on the combined features with the audio fingerprints of two features and the existing audio fingerprint methods of [6,7,22,23,25,27], where [6,7] is an improved method based on Shazam fingerprint, [22,23,25,27] is an improved method based on Philips fingerprint. As shown in Table 1, the combined feature is more robust than MFCC and LPCC that under the same feature dimension reduction process.…”
Section: Robustness and Retrieval Performance Analysismentioning
confidence: 99%
“…As shown in Table 1, the combined feature is more robust than MFCC and LPCC that under the same feature dimension reduction process. The proposed method has good retrieval performance under different CPOs, compared with the audio fingerprint algorithm based on short speech segments in [6,7,22,23,25] under several CPOs, the proposed method can get similar or even better recall rate and precision rate, and compared with the audio fingerprint algorithm based on long speech segments that is robust to noise proposed in [27], recall rate and precision rate of the proposed method in terms of background noise and narrowband Gaussian noise are higher than [27].…”
Section: Robustness and Retrieval Performance Analysismentioning
confidence: 99%
See 2 more Smart Citations
“…A long frame length resulted in the spectrogram having low time resolution, which makes the representation insensitive to time variations [15]. In addition, large overlap is an advantage when dealing with using short query music against the long original signal [16][17].…”
Section: Spectrogram Constructionmentioning
confidence: 99%