2020
DOI: 10.1016/j.sigpro.2019.107368
|View full text |Cite
|
Sign up to set email alerts
|

Phase reconstruction from amplitude spectrograms based on directional-statistics deep neural networks

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
26
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
6
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 21 publications
(27 citation statements)
references
References 7 publications
1
26
0
Order By: Relevance
“…Directional-statistics DNN [27,28] Quantizing approach [13,14] Griffin-Lim algorithm [20] Fast Griffin-Lim algorithm [21] Consistency-based approach Fig. 1.…”
Section: Dnn-based Approachmentioning
confidence: 99%
See 2 more Smart Citations
“…Directional-statistics DNN [27,28] Quantizing approach [13,14] Griffin-Lim algorithm [20] Fast Griffin-Lim algorithm [21] Consistency-based approach Fig. 1.…”
Section: Dnn-based Approachmentioning
confidence: 99%
“…DNNs can automatically discover the structures of signals in the training dataset and utilize the obtained knowledge for phase reconstruction. This approach has achieved promising results in speech synthesis [27], [28], and it has a high potential for further improvements in various applications because DNNs can acquire the knowledge of any signals [42]. The application of the approach is not restricted to specially structured signals like a sum of sinusoids.…”
Section: Dnn-based Approachmentioning
confidence: 99%
See 1 more Smart Citation
“…Whereas the lifter of the minimum-phase filter is fixed, that of our method is trained from speech data to determine the phases of a truncated filter. Our lifter-training method can be viewed as a framework of DNN-based phase reconstruction from the amplitude spectrum [15]. Second, for fullband VC, we also propose a frequency-band-wise modeling method based on sub-band multi-rate signal processing (hereafter, "sub-band modeling method") [16].…”
Section: Introductionmentioning
confidence: 99%
“…Moreover, besides issues of computational complexity the above methods are mostly applied for the detection of a single target, while general solutions should also support efficient detection of multiple targets (see References [12,13] for recent approaches tackling this issue). A promising emerging technology to cope with these limitations is represented by deep learning [14], which has recently achieved state-of-the-art performance in a variety of difficult pattern recognition tasks, ranging from image classification [15] to speech recognition [16,17], without requiring domain-specific expert knowledge about the signal characteristics.…”
Section: Introductionmentioning
confidence: 99%