We present a model for handling a user's designation activities in multimodal systems. The model associates a well-defined language with each modality (natural language, gesture, visual) as well as a mediator language, and takes several semantic features of the modalities into account. Mapping functions link objects from each modality language to the others, enabling reasoning and referent identification. Processing algorithms for each language are developed.
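The core idea can be illustrated with a minimal sketch, not the paper's implementation: each modality yields candidate referents in its own representation, hypothetical mapping functions translate them into a shared mediator language, and the referent is identified by intersecting the mediator-level candidate sets. All names and data here (the object catalog, the zones) are invented for illustration.

```python
def nl_to_mediator(nl_expr):
    # Hypothetical mapping: a noun phrase selects objects by type.
    catalog = {"triangle": {"obj1", "obj3"}, "square": {"obj2"}}
    return catalog.get(nl_expr, set())

def gesture_to_mediator(pointed_zone):
    # Hypothetical mapping: a pointing gesture selects objects in a zone.
    zones = {"zone_a": {"obj1", "obj2"}, "zone_b": {"obj3"}}
    return zones.get(pointed_zone, set())

def identify_referent(nl_expr, pointed_zone):
    # Reasoning step: intersect candidates coming from both modalities
    # once they are expressed in the common mediator language.
    return nl_to_mediator(nl_expr) & gesture_to_mediator(pointed_zone)

print(identify_referent("triangle", "zone_a"))  # → {'obj1'}
```

In this toy setting, saying "the triangle" while pointing at zone A narrows two candidate sets to the single object satisfying both constraints; the model described above generalizes this with richer semantic features per modality.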