2016
DOI: 10.1016/j.specom.2016.08.004

On structured sparsity of phonological posteriors for linguistic parsing

Abstract: The speech signal conveys information on different time scales, from the short (20-40 ms) or segmental time scale, associated with phonological and phonetic information, to the long (150-250 ms) or suprasegmental time scale, associated with syllabic and prosodic information. Linguistic and neurocognitive studies recognize the phonological classes at the segmental level as the essential and invariant representations used in speech temporal organization. In the context of speech processing, a deep neural network (DNN) is an effec…

Cited by 11 publications (16 citation statements)
References 43 publications

“…This result was expected due to the binary nature of phonological posteriors [18,21]. Moreover, if the dewarped posteriors are quantized into binary vectors and Jaccard similarity is used for binary pattern matching [21], similar results as the Spearman similarity measure is achieved. This observation again confirms that the space of phonological posteriors is highly structured and the structures bear more information than the exact posterior values.…”
Section: QbE-STD Results (mentioning)
confidence: 68%
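
The binary pattern matching mentioned in the statement above can be illustrated with a minimal sketch. This is not the cited authors' implementation: the 0.5 threshold, the function names `binarize` and `jaccard`, and the posterior values are all assumptions chosen for illustration.

```python
# Hypothetical illustration: quantize phonological posteriors into binary
# vectors, then compare them with Jaccard similarity.

def binarize(posterior, threshold=0.5):
    """Quantize a posterior vector to 0/1 (the threshold is an assumption)."""
    return tuple(1 if p >= threshold else 0 for p in posterior)

def jaccard(a, b):
    """Jaccard similarity of two binary vectors: |A and B| / |A or B|."""
    intersection = sum(1 for x, y in zip(a, b) if x and y)
    union = sum(1 for x, y in zip(a, b) if x or y)
    # Two all-zero vectors are treated as identical by convention here.
    return 1.0 if union == 0 else intersection / union

# Made-up posteriors over five phonological classes.
query = [0.97, 0.02, 0.91, 0.05, 0.88]
frame_a = [0.80, 0.10, 0.75, 0.20, 0.60]  # same active classes, different values
frame_b = [0.05, 0.93, 0.10, 0.90, 0.85]  # mostly different active classes

sim_a = jaccard(binarize(query), binarize(frame_a))
sim_b = jaccard(binarize(query), binarize(frame_b))
print(sim_a, sim_b)
```

In this toy case `frame_a` matches the query perfectly after quantization even though its posterior values differ, which is the sense in which the structure, rather than the exact values, carries the information.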
“…The permissible combinations are highly constrained due to articulatory mechanisms governing speech production. Therefore, the probabilities constituting a posterior are confined to a small number of components where the indices of high probabilities determine the unique structure of the vocal machinery in speech production [21]. [1,27].…”
Section: QbE-STD Results (mentioning)
confidence: 99%
“…In addition, we exploit phonological structures [9] to enable automatic analysis of duration and trajectory without any need for automatic alignment. Prior work on phonological structures demonstrate their relation to articulatory postures [9], thus considering the structure of multiple consecutive segments enables quantification of the dynamic and trajectory of articulatory movements and co-articulation. The studies presented in this paper exploit this structural property of phonological posteriors to obtain speech-based markers of PAoS severity.…”
Section: Introduction (mentioning)
confidence: 99%