Proceedings of the 8th International Conference on Multimodal Interfaces 2006
DOI: 10.1145/1180995.1181060

Using redundant speech and handwriting for learning new vocabulary and understanding abbreviations

Abstract: New language constantly emerges from complex, collaborative human-human interactions like meetings, for instance when a presenter handwrites a new term on a whiteboard while saying it. Fixed-vocabulary recognizers fail on such new terms, which are often critical to dialogue understanding. We present a proof-of-concept multimodal system that combines information from handwriting and speech recognition to learn the spelling, pronunciation and semantics of out-of-vocabulary terms from single instances o…

Cited by 14 publications (8 citation statements)
References 37 publications
“…For the ACI we have implemented for testing SHACER, the perceived interactions occur in public spaces that are shared by the participants, e.g., (a) a shared interactive whiteboard or a piece of digital paper for public sketching and handwriting [5], (b) a shared conversational space for speech captured by close-talking microphones [18]. Participants in this shared public space can be co-located or remotely distributed.…”
Section: Multimodal Understanding Of Human-human Interaction (mentioning)
confidence: 99%
“…Neither of these handwritten abbreviations (CB [17] that combining information from redundant handwriting and speech is significantly more reliable for the recognition of Gantt chart labels than depending on either mode alone.…”
Section: Study Implications (mentioning)
confidence: 99%
“…Once a term has been introduced in full (via handwriting and speech), the system is able to determine both its spelling and pronunciation. Further spoken references in the temporal vicinity of an abbreviated handwritten term can then be recovered via the application of heuristics (see [13] for details of this mechanism).…”
Section: Propagation Via Abbreviations (mentioning)
confidence: 99%
“…handwriting and speech), which naturally occur redundantly, we can apply cross-stream correlation at the recognizer level to improve the system's understanding, as evidenced by the significant 37.5% relative reduction in abbreviation labeling error seen in the held-out test set [20].…”
Section: Applications (mentioning)
confidence: 99%