“…The SE process has two goals: to improve the intelligibility and quality of the processed speech, and to reduce background noise. Well-established algorithms have improved SE for CI users [37], [38], [29], [39], [40], [41], [42], [43], but only a few studies have used newer deep-learning-based algorithms. Traditional SE methods are based on identifying the difference between clean and noisy speech [44], [45], [46], [47], [48], [49].…”