2019
DOI: 10.1250/ast.40.170
|View full text |Cite
|
Sign up to set email alerts
|

Representation of complex spectrogram via phase conversion

Abstract: As importance of the phase of complex spectrogram has been recognized widely, many techniques have been proposed for handling it. However, several definitions and terminologies for the same concept can be found in the literature, which has confused beginners. In this paper, two major definitions of the short-time Fourier transform and their phase conventions are summarized to alleviate such complication. A phase-aware signal-processing scheme based on phase conversion is also introduced with a set of executabl… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
25
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
3
1

Relationship

5
3

Authors

Journals

citations
Cited by 30 publications
(26 citation statements)
references
References 47 publications
(67 reference statements)
0
25
0
Order By: Relevance
“…1. To avoid the sensitivity problem, DNNs estimate phase derivatives instead of phase itself since phase derivatives have a relation to the amplitude spectrogram [24][25][26]. Then, the proposed method recursively reconstructs phase from the estimated phase derivatives, which is named RPU.…”
Section: Amplitude Spectrogrammentioning
confidence: 99%
“…1. To avoid the sensitivity problem, DNNs estimate phase derivatives instead of phase itself since phase derivatives have a relation to the amplitude spectrogram [24][25][26]. Then, the proposed method recursively reconstructs phase from the estimated phase derivatives, which is named RPU.…”
Section: Amplitude Spectrogrammentioning
confidence: 99%
“…Recently, the phase-conversion method that modifies the time sensitivity of phases was proposed [1,2]. In the present study, we applied it to speech analyses and verified its efficiency.…”
Section: Introductionmentioning
confidence: 74%
“…As the conventional method, DNN-based speech enhancement in STFT domain was considered. STFT with the 512 points (32 ms) Hann window, 256 points time-shifting and 512 points discrete Fourier transform length was used, and the inverse STFT was implemented by its canonical dual [29] to make the STFT as perfect reconstruction filterbank. To estimate real-valued T-F mask, U-Net illustrated in Fig.…”
Section: Dnn Architecture Loss Function and Training Setupmentioning
confidence: 99%