Abstract. Multimodal dialog systems can be defined as computer systems that process two or more user input modes and combine them with multimedia system output. This paper focuses on the multimodal input, proposing a technique to process and fuse the multiple input modalities in the dialog manager of the system, so that a single combined input is used to select the next system action. We describe an application of our technique to build multimodal systems that process users' spoken utterances, tactile and keyboard inputs, and information related to the context of the interaction. In our proposal, this context information is divided into external and internal context; the internal context is represented in our contribution by the detection of the users' intention during the dialog and their emotional state.