7th International Conference on Spoken Language Processing (ICSLP 2002) 2002
DOI: 10.21437/icslp.2002-133
|View full text |Cite
|
Sign up to set email alerts
|

DARPA communicator evaluation: progress from 2000 to 2001

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
4
0

Year Published

2005
2005
2020
2020

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 20 publications
(5 citation statements)
references
References 9 publications
0
4
0
Order By: Relevance
“…For example, well-known Shazam music recognition method uses only spectral peaks [162]. Commonly used mel-cepstral coefficients, line spectral frequencies, and RASTA features [134,135] do not have any temporal information, either. We believe that temporal information is not fully utilized in current methods.…”
Section: Beyond the State Of The Artmentioning
confidence: 99%
“…For example, well-known Shazam music recognition method uses only spectral peaks [162]. Commonly used mel-cepstral coefficients, line spectral frequencies, and RASTA features [134,135] do not have any temporal information, either. We believe that temporal information is not fully utilized in current methods.…”
Section: Beyond the State Of The Artmentioning
confidence: 99%
“…HMMs and GMMs use features extracted from temporal speech windows. Current speech and sound feature extraction schemes are based on Fourier analysis [131,134,135]. Temporal information is only incorporated to the automatic speech recognition systems by only dividing speech into temporal analysis windows.…”
Section: State Of the Artmentioning
confidence: 99%
“…For example, well-known Shazam music recognition method uses only spectral peaks [162]. Commonly used mel-cepstral coefficients, line spectral frequencies, and RASTA features [134,135] do not have any temporal information, either. We believe that temporal information is not fully utilized in current methods.…”
Section: Beyond the State Of The Artmentioning
confidence: 99%
“…If however one in fifty callers are changing bank so that they can talk to a human, then perhaps there is an actual commercial reason to look at why people swear at machines. In the work described here, I look at data from the DARPA Communicator project [21] in which high profile research institutions looked at the nature of human-computer conversation in the context of making travel plans. With these stateof-the-art systems, over a quarter of the calls resulted in the user not wanting to "use the system on a regular basis".…”
Section: Introductionmentioning
confidence: 99%