Interspeech 2016 2016
DOI: 10.21437/interspeech.2016-850
|View full text |Cite
|
Sign up to set email alerts
|

Automated Screening of Speech Development Issues in Children by Identifying Phonological Error Patterns

Abstract: A proof of concept system is developed to provide a broad assessment of speech development issues in children. It has been designed to enable non-experts to complete an initial screening of children's speech with the aim of reducing the workload on Speech Language Pathology services. The system was composed of an acoustic model trained by neural networks with split temporal context features and a constrained HMM encoded with the knowledge of Speech Language Pathologists. Results demonstrated the system was abl… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
16
0

Year Published

2017
2017
2021
2021

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 15 publications
(17 citation statements)
references
References 16 publications
1
16
0
Order By: Relevance
“…Other approaches have exploited the fact that many assessments of speech and articulation are made with known texts. The improvement in recognition this can yield for clinical ASR has been demonstrated previously by this group [17], where an improvement of 33.1% in phoneme recognition was achieved when the target text was exploited by the constrained decoder. Other groups have had similar success with this approach, improving detection of pathological voice by 20% [18].…”
Section: Introductionsupporting
confidence: 55%
See 4 more Smart Citations
“…Other approaches have exploited the fact that many assessments of speech and articulation are made with known texts. The improvement in recognition this can yield for clinical ASR has been demonstrated previously by this group [17], where an improvement of 33.1% in phoneme recognition was achieved when the target text was exploited by the constrained decoder. Other groups have had similar success with this approach, improving detection of pathological voice by 20% [18].…”
Section: Introductionsupporting
confidence: 55%
“…The input to these models is speech made up of target words in isolation, elicited through a picture naming task. The acoustic model used in the previous system [17] used a hierarchical neural network (HNN) with long temporal context features over a 310ms window. This HNN cascaded a pair of neural networks (NN) into a third NN, with the third NN trained on the concatenated posterior probabilities of the first two NN.…”
Section: Acoustic Modelmentioning
confidence: 99%
See 3 more Smart Citations