2021
DOI: 10.1109/jstsp.2020.3034486
|View full text |Cite
|
Sign up to set email alerts
|

Deep Griffin–Lim Iteration: Trainable Iterative Phase Reconstruction Using Neural Network

Abstract: In this paper, we propose a phase reconstruction framework, named Deep Griffin-Lim Iteration (DeGLI). Phase reconstruction is a fundamental technique for improving the quality of sound obtained through some process in the timefrequency domain. It has been shown that the recent methods using deep neural networks (DNN) outperformed the conventional iterative phase reconstruction methods such as the Griffin-Lim algorithm (GLA). However, the computational cost of DNNbased methods is not adjustable at the time of i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
14
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 30 publications
(14 citation statements)
references
References 62 publications
0
14
0
Order By: Relevance
“…PR has also been tackled using signal modeling [11], [30], [31] or deep neural networks [32]. However, optimization-based approaches remain efficient, provide theoretical guarantees and may still be used with model-based approaches [33], [34].…”
Section: Introductionmentioning
confidence: 99%
“…PR has also been tackled using signal modeling [11], [30], [31] or deep neural networks [32]. However, optimization-based approaches remain efficient, provide theoretical guarantees and may still be used with model-based approaches [33], [34].…”
Section: Introductionmentioning
confidence: 99%
“…However, it is often set to values close to 1, chosen experimentally as its optimal value strongly depends on signal characteristics. Note that changing the sign of β has the effect of interchanging the projections in ( 11)- (13). For β = 1, the RAAR iteration and DM iteration are identical (this case is also equivalent to several other phase retrieval algorithms [21]).…”
Section: Difference Map (Dm)mentioning
confidence: 99%
“…Interestingly, GLA was later rediscovered in ptychography as the Error Reduction algorithm [5], [8]. Several extensions of GLA have been proposed over the years, including an accelerated version [9], online variants [10], [11] an adaptation to the source separation context [12] and even augmentation by a neural network [13], but the basic algorithm is still widely used (other approaches not based on iterative projection exist [1] but are outside the scope of this paper). In this work we examine how modern methods from optics perform when applied to speech signals.…”
Section: Introductionmentioning
confidence: 99%
“…Afterward, the phase information can be effectively modified based on previous results by introducing a global residual connection. The rationale is that there exists implicit relation between magnitude and phase and superb magnitude estimation can profit better recovery for phase [12,19].…”
Section: Introductionmentioning
confidence: 99%