2017
DOI: 10.1007/978-3-319-64206-2_44
|View full text |Cite
|
Sign up to set email alerts
|

Phonetic Segmentation Using Knowledge from Visual and Perceptual Domain

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 18 publications
0
2
0
Order By: Relevance
“…The methods for boundary detection can be based on using bidirectional LSTM networks, 39,40 wavelet analysis, [42][43][44] graph-based structural analysis, 45 rules describing the power spectrum 46 or formants 47 and various features extracted from the spectrogram, for example, visual features 48,49 or auditory attention features. 50 The methods for boundary detection also have a relevant application in the task of segmentation with orthographic or phonetic transcription provided, where they can be used as additional boundary correction procedures. 51 A common system for speech segmentation is language-dependent, that is, it is trained and run on the same language.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…The methods for boundary detection can be based on using bidirectional LSTM networks, 39,40 wavelet analysis, [42][43][44] graph-based structural analysis, 45 rules describing the power spectrum 46 or formants 47 and various features extracted from the spectrogram, for example, visual features 48,49 or auditory attention features. 50 The methods for boundary detection also have a relevant application in the task of segmentation with orthographic or phonetic transcription provided, where they can be used as additional boundary correction procedures. 51 A common system for speech segmentation is language-dependent, that is, it is trained and run on the same language.…”
Section: Related Workmentioning
confidence: 99%
“…The methods for boundary detection can be based on using bidirectional LSTM networks, 39,40 wavelet analysis, 42‐44 graph‐based structural analysis, 45 rules describing the power spectrum 46 or formants 47 and various features extracted from the spectrogram, for example, visual features 48,49 or auditory attention features 50 …”
Section: Introductionmentioning
confidence: 99%