2018 IEEE Winter Conference on Applications of Computer Vision (WACV) 2018
DOI: 10.1109/wacv.2018.00068
|View full text |Cite
|
Sign up to set email alerts
|

Wide-Slice Residual Networks for Food Recognition

Abstract: Food diary applications represent a tantalizing market. Such applications, based on image food recognition, opened to new challenges for computer vision and pattern recognition algorithms. Recent works in the field are focusing either on hand-crafted representations or on learning these by exploiting deep neural networks. Despite the success of such a last family of works, these generally exploit off-the shelf deep architectures to classify food dishes. Thus, the architectures are not cast to the specific prob… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
113
0
1

Year Published

2018
2018
2024
2024

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 170 publications
(114 citation statements)
references
References 42 publications
0
113
0
1
Order By: Relevance
“…There are various popular CNN architectures for image processing including AlexNet (Krizhevsky et al, 2012), a network using repetitive units called visual geometry group network (VGG) (Simonyan & Zisserman, 2014), GoogLeNet (Szegedy et al, 2015) that includes parallel data channels, and residual neural network (ResNet) (He, Zhang, Ren, & Sun, 2016) constructed by residual (Heravi, Aghdam, & Puig, 2018) 65.40 87.00 CNN-FOOD 70.41 / Multitask (H. Wu, Merler, Uceda-Sosa, & Smith, 2016) 72.11 / FoodNet (Pandey, Deepthi, Mandal, & Puhan, 2017) 72.12 91.61 DeepFood (Liu et al, 2016a) 77.40 93.70 Inception Module (C. 77.00 94.00 ResNet (Fu et al, 2017) 78.50 94.10 ResNet-50 (Ciocca, Napoletano, & Schettini, 2018) 82.54 95.79 Inception-v3+FPCNN (Zheng, Zou, & Wang, 2018) 87.96 / Inception V3 (Hassannejad et al, 2016) 88.28 96.88 wide-slice residual networks (WISeR) (Martinel, Foresti, & Micheloni, 2018) 90.27 98.71…”
Section: Deep Learning Applications In Food Food Recognition and Clasmentioning
confidence: 99%
See 3 more Smart Citations
“…There are various popular CNN architectures for image processing including AlexNet (Krizhevsky et al, 2012), a network using repetitive units called visual geometry group network (VGG) (Simonyan & Zisserman, 2014), GoogLeNet (Szegedy et al, 2015) that includes parallel data channels, and residual neural network (ResNet) (He, Zhang, Ren, & Sun, 2016) constructed by residual (Heravi, Aghdam, & Puig, 2018) 65.40 87.00 CNN-FOOD 70.41 / Multitask (H. Wu, Merler, Uceda-Sosa, & Smith, 2016) 72.11 / FoodNet (Pandey, Deepthi, Mandal, & Puhan, 2017) 72.12 91.61 DeepFood (Liu et al, 2016a) 77.40 93.70 Inception Module (C. 77.00 94.00 ResNet (Fu et al, 2017) 78.50 94.10 ResNet-50 (Ciocca, Napoletano, & Schettini, 2018) 82.54 95.79 Inception-v3+FPCNN (Zheng, Zou, & Wang, 2018) 87.96 / Inception V3 (Hassannejad et al, 2016) 88.28 96.88 wide-slice residual networks (WISeR) (Martinel, Foresti, & Micheloni, 2018) 90.27 98.71…”
Section: Deep Learning Applications In Food Food Recognition and Clasmentioning
confidence: 99%
“…DeepFood (Liu et al, 2016a) 54.70 81.50 Author defined (Heravi et al, 2018) 60.00 85.00 Inception Module (C. 63.60 87.00 DeepFoodCam (Kawano & Yanai, 2014) 63.77 85.82 CNN-FOOD 67.57 88.97 ResNet (Fu et al, 2017) 71.20 91.10 ResNet-50 71.70 91.33 Inception V3 (Hassannejad et al, 2016) 76.17 92.58 Inception-v3+FP-CNN 78.60 / WISeR (Martinel et al, 2018) 83.15 95.45 UECFood-100: Images of 100 kinds of Japanese food with at least 100 images per category…”
Section: Deep Learning Applications In Food Food Recognition and Clasmentioning
confidence: 99%
See 2 more Smart Citations
“…Hokuto Kagaya, Kiyoharu Aizawa and Makoto Ogawa in [16], proposed the tasks of food detection and recognition through parameter optimization and how to construct a dataset of the most frequent food items in a publicly available food-logging system, and used it to evaluate recognition performance. Niki Martinel, Gian Luca Foresti, and Christian Michelon in [17] this paper introduces a new deep scheme that is designed to handle the food structure and also explains about the recent success of residual deep network, introduce a slice convolution block to capture the vertical food layers. Outputs of the deep residual blocks are combined with the sliced convolution to produce the classification score for specific food categories.…”
Section: Literature Surveymentioning
confidence: 99%