Finding Regions of Interest from Multimodal Human-Robot Interactions

Azagra, Pablo; Civera, Javier; Murillo, Ana C.

doi:10.21437/glu.2017-15

Cited by 2 publications

(3 citation statements)

References 10 publications

“…• Speak: the user describes where a certain object is in relation to other objects. This paper builds on our previous works in [1] and [2]. The new contributions here are:…”

Section: Introduction (mentioning)

confidence: 90%

Incremental Learning of Object Models From Natural Human–Robot Interactions

Azagra; Civera; Murillo

2020

IEEE Trans. Automat. Sci. Eng.

Self Cite

In order to perform complex tasks in realistic human environments, robots need to be able to learn new concepts in the wild, incrementally, and through their interactions with humans. This paper presents an end-to-end pipeline to learn object models incrementally during human-robot interaction. The pipeline we propose consists of three parts: (a) recognizing the interaction type, (b) detecting the object that the interaction is targeting, and (c) incrementally learning the models from data recorded by the robot sensors. Our main contributions lie in the target object detection, guided by the recognized interaction, and in the incremental object learning. The novelty of our approach is the focus on natural, heterogeneous, and multimodal human-robot interactions to incrementally learn new object models. Throughout the paper we highlight the main challenges associated with this problem, such as a high degree of occlusion and clutter, domain change, low-resolution data, and interaction ambiguity. Our work shows the benefits of using multi-view approaches and combining visual and language features, and our experimental results outperform standard baselines.

Note to Practitioners: This work was motivated by challenges in recognition tasks for dynamic and varying scenarios. Our approach learns to recognize new user interactions and objects. To do so, we use multimodal data from the user-robot interaction: visual data is used to learn the objects, and speech is used to learn the label and to help with the interaction type recognition. We use state-of-the-art deep learning models to segment the user and the objects in the scene. Our algorithm for incremental learning is based on a classic incremental clustering approach. The pipeline we propose works with all sensors mounted on the robot, so the system can be mobile. Our work uses data recorded from a Baxter robot, which enables the use of the manipulation arms in future steps, but it would work with any robot on which the same sensors can be mounted. The sensors used are two RGB-D cameras and a microphone. The pipeline currently has high computational requirements to run the two deep-learning-based steps. We have tested it with a desktop computer including a GTX 1060 and 32GB of RAM.
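The abstract says only that the incremental learning step is "based on a classic incremental clustering approach" and does not name the algorithm. As a rough illustration of what such an approach can look like, the sketch below uses sequential (leader-style) clustering with running-mean centroids. This is an assumed stand-in, not the authors' method: the class name `IncrementalClusterer`, the `threshold` parameter, and the 2-D toy feature vectors are all hypothetical.

```python
import math


class IncrementalClusterer:
    """Hypothetical sequential clustering sketch (not the paper's algorithm).

    Each new sample joins the nearest existing cluster if it lies within
    `threshold` (Euclidean distance); otherwise it opens a new cluster.
    """

    def __init__(self, threshold):
        self.threshold = threshold  # assumed distance cut-off
        self.centroids = []         # one running-mean vector per cluster
        self.counts = []            # number of samples seen per cluster

    def add(self, x):
        """Process one sample and return the index of its cluster."""
        best, best_d = None, float("inf")
        for i, c in enumerate(self.centroids):
            d = math.dist(x, c)
            if d < best_d:
                best, best_d = i, d

        # No cluster yet, or the nearest one is too far: start a new cluster.
        if best is None or best_d > self.threshold:
            self.centroids.append(list(x))
            self.counts.append(1)
            return len(self.centroids) - 1

        # Otherwise update the centroid as a running mean, so earlier
        # samples never need to be stored or revisited.
        n = self.counts[best] + 1
        c = self.centroids[best]
        self.centroids[best] = [ci + (xi - ci) / n for ci, xi in zip(c, x)]
        self.counts[best] = n
        return best


# Toy usage: samples arrive one at a time, as they would during interaction.
clusterer = IncrementalClusterer(threshold=1.0)
samples = [(0.0, 0.0), (0.2, 0.0), (5.0, 5.0), (5.0, 5.4)]
labels = [clusterer.add(p) for p in samples]
print(labels)
```

The point of the running-mean update is that the model grows one sample at a time without retraining on past data, which is the property an incremental pipeline needs. In a real system the toy vectors would be replaced by visual descriptors of the segmented object, and the cut-off would have to be tuned.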

“…[59], show that the combination of language and vision leads to substantial improvements. In our previous work [60] we demonstrated that including Speak interactions to train the models obtains better accuracy than using only Point and Show ones.…”

Section: Multimodal Incremental Interaction Recognition Module (mentioning)

confidence: 99%

“…Therefore, in this study we design a virtual interface for the digital cash register at Café Lentera Coffee & Eatery using Computer Vision and Convolutional Neural Network (CNN) technology, with the aim of improving customer service through artificial intelligence [6]. In addition, the main goal of this research is to develop artificial intelligence technology, to be implemented in the virtual interface, that recognizes customers' faces, recognizes the items ordered, and processes customer requests faster and more efficiently [7]. The system will act as a virtual interface between the customer and the cash register, allowing customers to order in real time and also to pay without having to interact directly with waiters and crew [8].…”

Section: Introduction (unclassified)

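Implementasi Sistem Kasir Digital Berbasis Teknologi Deteksi Tangan Computer Vision dan OpenCV [Implementation of a Digital Cashier System Based on Computer Vision Hand Detection Technology and OpenCV]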

Surya Jaya; Iskandar Mulyana

2023

smartcomp

We develop artificial intelligence technology for a digital cash register application based on computer-vision hand detection, designed for digital menu ordering with QR code payment, to serve as an automated cashier system at Cafe Lentera Coffee & Eatery. This research aims to optimize the menu ordering process, which previously relied on physical tools such as paper and a traditional cash register, in order to prevent order errors and inefficient waiting times. The system design uses a CNN and OpenCV to improve detection accuracy, allowing customers to order from the menu interactively through finger gestures. Through a literature study and experimental research, the development aims to deliver efficiency and customer satisfaction. An engaging ordering process, together with digital payment that is faster and more accurate than the previous traditional system, can reduce the cost of purchasing physical materials and improve order time, which is one of the contributions of this development.
