2020
DOI: 10.11591/ijece.v10i1.pp1017-1026
ResSeg: Residual encoder-decoder convolutional neural network for food segmentation

Abstract: This paper presents the implementation and evaluation of several convolutional neural network architectures for food segmentation. The task is framed as the recognition of six categories: the four main food groups (protein, grains, fruit, and vegetables) plus two additional categories, rice and drink/juice. To make recognition more challenging, the networks are also tested on dishes that have already been started, i.e. at different moments, from serving to its …

Cited by 8 publications (6 citation statements)
References 21 publications
“…We employed SegNet [38], which was applied to food extraction in Refs. [14,15], as the low-training-cost DNN for food extraction. SegNet is trained on UNIMIB2016 [39], which includes 1017 annotated food images available on the Internet.…”
Section: Precision = TP/(TP + FP)
confidence: 99%
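The precision metric named in the section heading above, Precision = TP/(TP + FP), can be computed per class directly from a predicted and a ground-truth segmentation mask. A minimal sketch; the 2×2 example masks are hypothetical, for illustration only.

```python
# Per-class precision for a segmentation mask: of all pixels predicted as a
# class, the fraction that truly belong to it. Example masks are hypothetical.
import numpy as np

def precision(pred: np.ndarray, truth: np.ndarray, cls: int) -> float:
    """Precision = TP / (TP + FP) for class `cls`, computed pixel-wise."""
    tp = np.sum((pred == cls) & (truth == cls))  # true positives
    fp = np.sum((pred == cls) & (truth != cls))  # false positives
    return tp / (tp + fp) if (tp + fp) > 0 else 0.0

pred  = np.array([[1, 1], [0, 2]])   # predicted class label per pixel
truth = np.array([[1, 0], [0, 2]])   # ground-truth label per pixel
print(precision(pred, truth, 1))     # one of two class-1 predictions is correct -> 0.5
```

Recall follows the same pattern with false negatives (`truth == cls` but `pred != cls`) in the denominator.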
“…Refs. [12–15] proposed food extraction using DNNs, requiring pixel-wise annotated food images as a training dataset. Food regions include various shapes, colors, and textures [12,16].…”
Section: Introduction
confidence: 99%
“…Image description is a process to extract the visual content of foods. Local features are particularly suitable for representing food: they capture minuscule parts of the dish and are robust to illumination, scale, rotation, and orientation, which lets them cope with the cluttered appearance of food [1,5,6]. The interest points that are detected and described produce a high volume of diverse features, which must then be transformed into a more compact representation using a feature encoding technique.…”
Section: Introduction
confidence: 99%
“…Besides CNNs, other artificial-intelligence methods have been developed for pattern classification, such as Fast R-CNN [15] and the DAG-CNN [16,17]. In the first case, a preliminary stage extracts Regions of Interest (ROIs), detecting the desired elements in the input image and feeding them into a CNN for classification, as explained in [18]. The latter consists of a branched structure in which each branch contains a sequence of convolutional layers whose filters vary in dimension, extracting features of larger and smaller size from the input image, with the results finally unified to produce a classification, as indicated in [19].…”
Section: Introduction
confidence: 99%
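The branched idea quoted above — parallel convolutional branches with different filter sizes whose outputs are unified for classification — can be sketched as a naive NumPy forward pass. The kernel sizes, pooling choice, and single-channel input are illustrative assumptions, not the actual architecture from [16,17].

```python
# Sketch of a branched (DAG-style) forward pass: two convolutional branches with
# different kernel sizes process the same image, and their pooled outputs are
# concatenated into one feature vector for a downstream classifier head.
# Kernels and input are random placeholders, for illustration only.
import numpy as np

def conv2d_valid(img: np.ndarray, kernel: np.ndarray) -> np.ndarray:
    """Naive 'valid'-mode 2D cross-correlation of a single-channel image."""
    kh, kw = kernel.shape
    oh, ow = img.shape[0] - kh + 1, img.shape[1] - kw + 1
    out = np.empty((oh, ow))
    for i in range(oh):
        for j in range(ow):
            out[i, j] = np.sum(img[i:i + kh, j:j + kw] * kernel)
    return out

def branch(img: np.ndarray, kernel: np.ndarray) -> np.ndarray:
    """One branch: convolution -> ReLU -> global average pooling (one scalar)."""
    fmap = np.maximum(conv2d_valid(img, kernel), 0.0)
    return np.array([fmap.mean()])

rng = np.random.default_rng(0)
img = rng.normal(size=(16, 16))
fine   = branch(img, rng.normal(size=(3, 3)))   # small filters: fine-scale features
coarse = branch(img, rng.normal(size=(7, 7)))   # large filters: coarse-scale features
features = np.concatenate([fine, coarse])        # unified multi-scale representation
print(features.shape)                            # (2,)
```

Real DAG-CNNs stack many layers per branch and unify far richer feature maps, but the control flow — fan out to differently-sized filters, fan back in before classification — is the same.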