2019
DOI: 10.1109/access.2019.2924014
|View full text |Cite
|
Sign up to set email alerts
|

Distinctive Phonetic Features Modeling and Extraction Using Deep Neural Networks

Abstract: Feature extraction is a critical stage of digital speech processing systems. Quality of features is of great importance to provide a solid foundation upon which the subsequent stages stand. Distinctive phonetic features (DPFs) are one of the most representative features of the speech signals. The significance of DPFs is in their ability to provide abstract description of the places and manners of articulation of the language phonemes. A phoneme's DPF element reflects unique articulatory information about that … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
26
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
6

Relationship

3
3

Authors

Journals

citations
Cited by 7 publications
(27 citation statements)
references
References 24 publications
(39 reference statements)
0
26
0
Order By: Relevance
“…To achieve this goal, we propose the AFD-Obj system, which is a single network for AF sequence detection in Arabic and English speech. We do not use a different network for each AF, as done in some state-of-the-art systems [ 3 , 4 , 15 ], where a neural network is used for each AF to detect the presence or absence of an AF in speech frames or group of frames. We select the YOLOv3-tiny [ 27 ] detector because of its simplicity, fast computation property, and the fact that it supports multi-label detection.…”
Section: Methodsmentioning
confidence: 99%
See 4 more Smart Citations
“…To achieve this goal, we propose the AFD-Obj system, which is a single network for AF sequence detection in Arabic and English speech. We do not use a different network for each AF, as done in some state-of-the-art systems [ 3 , 4 , 15 ], where a neural network is used for each AF to detect the presence or absence of an AF in speech frames or group of frames. We select the YOLOv3-tiny [ 27 ] detector because of its simplicity, fast computation property, and the fact that it supports multi-label detection.…”
Section: Methodsmentioning
confidence: 99%
“…Each phoneme has a unique vector of AFs; hence, we can use our system for phoneme recognition by mapping the detected AFs to the corresponding phonemes, as suggested in Ref. [ 3 ]. Moreover, we propose PD-Obj, which is an end-to-end system for direct Arabic sequence phoneme recognition from the spectrogram without AF usage.…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations