2023
DOI: 10.3390/aerospace10100876
|View full text |Cite
|
Sign up to set email alerts
|

An Automatic Speaker Clustering Pipeline for the Air Traffic Communication Domain

Driss Khalil,
Amrutha Prasad,
Petr Motlicek
et al.

Abstract: In air traffic management (ATM), voice communications are critical for ensuring the safe and efficient operation of aircraft. The pertinent voice communications—air traffic controller (ATCo) and pilot—are usually transmitted in a single channel, which poses a challenge when developing automatic systems for air traffic management. Speaker clustering is one of the challenges when applying speech processing algorithms to identify and group the same speaker among different speakers. We propose a pipeline that depl… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
0
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 45 publications
0
0
0
Order By: Relevance
“…The ATCO2 platform [9] aims at collecting, pre-processing, and pseudoanonymizing ATC communications' audio databases of more than 5000 h of audio data with the objective of increasing robustness of speech recognition in the air traffic management domain. The ATCO2 corpus has also been used to detect speaker roles in voice communication, i.e., pilot or ATCO, and clustering speakers [10]. Given enough training data, automatic speech recognition and understanding systems also build the base to train ATCOs [11].…”
Section: Introductionmentioning
confidence: 99%
“…The ATCO2 platform [9] aims at collecting, pre-processing, and pseudoanonymizing ATC communications' audio databases of more than 5000 h of audio data with the objective of increasing robustness of speech recognition in the air traffic management domain. The ATCO2 corpus has also been used to detect speaker roles in voice communication, i.e., pilot or ATCO, and clustering speakers [10]. Given enough training data, automatic speech recognition and understanding systems also build the base to train ATCOs [11].…”
Section: Introductionmentioning
confidence: 99%