The platform will undergo maintenance on Sep 14 at about 9:30 AM EST and will be unavailable for approximately 1 hour.
2018 IEEE International Workshop on Information Forensics and Security (WIFS) 2018
DOI: 10.1109/wifs.2018.8630783
|View full text |Cite
|
Sign up to set email alerts
|

Revealing the processing history of pitch-shifted voice using CNNs

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
4
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(5 citation statements)
references
References 17 publications
1
4
0
Order By: Relevance
“…Fig. 3 reveals that the estimation of negative α is more accurate than that of the positive, which is consistent with the conclusion in [6,7] that lowing pitch is easier to detect than raising pitch. Besides, the tiny α is prone to be estimatied as zero, resulting a linear deviation in the neighbourhood of zero.…”
Section: Evaluation Of Estimation Accuracysupporting
confidence: 86%
See 1 more Smart Citation
“…Fig. 3 reveals that the estimation of negative α is more accurate than that of the positive, which is consistent with the conclusion in [6,7] that lowing pitch is easier to detect than raising pitch. Besides, the tiny α is prone to be estimatied as zero, resulting a linear deviation in the neighbourhood of zero.…”
Section: Evaluation Of Estimation Accuracysupporting
confidence: 86%
“…Prior works and limitaion: Early works [5][6][7] typically estimate the approximate range of pitch shifting rather than the precise degree of disguise, rendering them incapable of accurately restoring the pitch-shifted voice. Later, Pilia et al propose a method achieving more accurate estimation results than previous work [8].…”
Section: Introduction Motivationmentioning
confidence: 99%
“…(1) Frequency domain features are used to identify audio post-processing operations. Wang et al [25] used the features of audio after STFT transformation as the input of the convolutional neural network (CNN) to identify the post-processing operation of audio pitch transformation. (2) ENF is applied for audio recapture detection.…”
Section: Detection Methods Based On Deep Featuresmentioning
confidence: 99%
“…Frequency domain features are used to identify audio post-processing operations. Wang et al [25] used the features of audio after STFT and CQT transformation as the input of CNN of the convolutional neural network to identify the post-processing operation of audio pitch transformation. 2).…”
Section: Detection Methods Based On Deep Featuresmentioning
confidence: 99%