2019
DOI: 10.1109/access.2019.2956772
|View full text |Cite
|
Sign up to set email alerts
|

Estimating Number of Speakers via Density-Based Clustering and Classification Decision

Abstract: It is crucial to robustly estimate the number of speakers (NoS) from the recorded audio mixtures in a reverberant environment. Some popular time-frequency (TF) methods approach this NoS estimation problem by assuming that only one of the speech components is active at each TF slot. However, this condition is violated in many scenarios where the speeches are convolved with long length of room impulse response coefficients, which causes degenerated performance of NoS estimation. To tackle this problem, a density… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4

Citation Types

0
9
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(9 citation statements)
references
References 38 publications
0
9
0
Order By: Relevance
“…In a real scenario, the radar echoes are mixed signals that contain amplitude and phase information for each target. The traditional mixed signal obtained based on a mixing matrix only contains amplitude information, but not phase information [16]. Therefore, the processed radar echo in this paper is closer to the actual mixed signal compared with the classical parameter estimation method. From the view of solving process, the computation complexity of this proposed method is similar compared with the traditional direction of arrival estimation method (Such as Akaike information criterion (AIC), minimum description length (MDL)) [27].…”
Section: Introductionmentioning
confidence: 98%
See 2 more Smart Citations
“…In a real scenario, the radar echoes are mixed signals that contain amplitude and phase information for each target. The traditional mixed signal obtained based on a mixing matrix only contains amplitude information, but not phase information [16]. Therefore, the processed radar echo in this paper is closer to the actual mixed signal compared with the classical parameter estimation method. From the view of solving process, the computation complexity of this proposed method is similar compared with the traditional direction of arrival estimation method (Such as Akaike information criterion (AIC), minimum description length (MDL)) [27].…”
Section: Introductionmentioning
confidence: 98%
“…In a real scenario, the radar echoes are mixed signals that contain amplitude and phase information for each target. The traditional mixed signal obtained based on a mixing matrix only contains amplitude information, but not phase information [16]. Therefore, the processed radar echo in this paper is closer to the actual mixed signal compared with the classical parameter estimation method.…”
Section: Introductionmentioning
confidence: 98%
See 1 more Smart Citation
“…Because the echo information in this paper contains amplitude and phase information. The signal separation technology is based on the mixing matrix to get the target echo (Yang et al., 2019), which does not contain phase information but only amplitude information. The proposed method in this paper has a wider application range compared with other matrix pencil methods, such as forward‐backward matrix pencil method (FBMPM). FBMPM is only applicable to the targets arranged in linear array (Sun et al., 2022).…”
Section: Introductionmentioning
confidence: 99%
“…Because the echo information in this paper contains amplitude and phase information. The signal separation technology is based on the mixing matrix to get the target echo (Yang et al, 2019), which does not contain phase information but only amplitude information. 2.…”
mentioning
confidence: 99%