First-person reading activity recognition by deep learning with synthetically generated images

Segawa, Yuta; Kawamoto, Kazuhiko; Okamoto, Koji

doi:10.1186/s13640-018-0272-z

Cited by 3 publications

(3 citation statements)

References 40 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Moreover, several authors used synthesized images to train deep learning model successfully [24][25][26][27][28]. This fact demonstrates that image synthetization is a technique that reduces the effort of manual annotations.…”

Section: Real or Virtual Imagesmentioning

confidence: 90%

See 1 more Smart Citation

Improving Robot Perception Skills Using a Fast Image-Labelling Method with Minimal Human Intervention

Ricolfe-Viala

Blanes

2022

Applied Sciences

View full text Add to dashboard Cite

Robot perception skills contribute to natural interfaces that enhance human–robot interactions. This can be notably improved by using convolutional neural networks. To train a convolutional neural network, the labelling process is the crucial first stage, in which image objects are marked with rectangles or masks. There are many image-labelling tools, but all require human interaction to achieve good results. Manual image labelling with rectangles or masks is labor-intensive and unappealing work, which can take months to complete, making the labelling task tedious and lengthy. This paper proposes a fast method to create labelled images with minimal human intervention, which is tested with a robot perception task. Images of objects taken with specific backgrounds are quickly and accurately labelled with rectangles or masks. In a second step, detected objects can be synthesized with different backgrounds to improve the training capabilities of the image set. Experimental results show the effectiveness of this method with an example of human–robot interaction using hand fingers. This labelling method generates a database to train convolutional networks to detect hand fingers easily with minimal labelling work. This labelling method can be applied to new image sets or used to add new samples to existing labelled image sets of any application. This proposed method improves the labelling process noticeably and reduces the time required to start the training process of a convolutional neural network model.

show abstract

Section: Real or Virtual Imagesmentioning

confidence: 90%

“…Castro et al [27] generated synthetic structural magnetic resonance images for learning schizophrenia. Segawa et al [28] recognized first-person reading activity by synthetizing computer-generated images and real background images.…”

Section: Introductionmentioning

confidence: 99%

Improving Robot Perception Skills Using a Fast Image-Labelling Method with Minimal Human Intervention

Ricolfe-Viala

Blanes

2022

Applied Sciences

View full text Add to dashboard Cite

show abstract

“…Recent advancement in automatic reviewing using machine vision such as deep learning and convolutional neural networks (CNN) have allowed the processing of large data sets to identify various features in images (Nweke, Teh, Al-Garadi, & Alo, 2018). However, processing of large dataset of egocentric/first-person images are still at their early stage of development (Segawa, Kawamoto, & Okamoto, 2018). Castro et al (2015) showed the prediction of daily activities from egocentric images using CNN.…”

Section: Data Processingmentioning

confidence: 99%

Exploring built environment correlates of older adults’ walking travel from lifelogging images

Hou

Moogoor

Dieterich

et al. 2021

Transportation Research Part D: Transport and Environment

View full text Add to dashboard Cite

Pedestrian Motion Trajectory Prediction in Intelligent Driving from Far Shot First-Person Perspective Video

Cai

Dai

Wang

et al. 2022

IEEE Trans. Intell. Transport. Syst.

View full text Add to dashboard Cite

First-person reading activity recognition by deep learning with synthetically generated images

Cited by 3 publications

References 40 publications

Improving Robot Perception Skills Using a Fast Image-Labelling Method with Minimal Human Intervention

Improving Robot Perception Skills Using a Fast Image-Labelling Method with Minimal Human Intervention

Exploring built environment correlates of older adults’ walking travel from lifelogging images

Pedestrian Motion Trajectory Prediction in Intelligent Driving from Far Shot First-Person Perspective Video

Contact Info

Product

Resources

About