2022
DOI: 10.21203/rs.3.rs-779995/v1
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Speech Quefrency Transform (SQT)

Abstract: Human speech consists mainly of three components: a glottal signal, a vocal tract response, and a harmonic shift. The three respectively correlate with the intonation (pitch), the formants (timbre), and the speech resolution (depth). Adding the intonation of the Fundamental Frequency (FF) to Automatic Speech Recognition (ASR) systems is necessary. First, the intonation conveys a primitive para-language. Second, its speaker-tuning reduces background noises to clarify acoustic observations. Third, extracting the… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 15 publications
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?