Proceedings of the 2020 the 3rd International Conference on Information Science and System 2020
DOI: 10.1145/3388176.3388179
|View full text |Cite
|
Sign up to set email alerts
|

Food Image Classification with Improved MobileNet Architecture and Data Augmentation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
12
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
4
1

Relationship

0
10

Authors

Journals

citations
Cited by 44 publications
(13 citation statements)
references
References 13 publications
0
12
0
Order By: Relevance
“…The first successful application of CNNs as a deep learning application to images was shaped by LeCun, Bengio and Haffner in 1998 [ 41 ], but only in 2012 with the new architecture of CNN, AlexNet, did the statistical results for the image classification task really advance the state of the art [ 42 ]. Thereafter, progress on exploring computer vision has driven advances in the analysis of food images [ 43 ]. In this context, deep learning architectures such as MobileNetV2 combined with X-ray images can provide rapid predictions and recommendations for the next steps of the maize grain quality evaluation.…”
Section: Discussionmentioning
confidence: 99%
“…The first successful application of CNNs as a deep learning application to images was shaped by LeCun, Bengio and Haffner in 1998 [ 41 ], but only in 2012 with the new architecture of CNN, AlexNet, did the statistical results for the image classification task really advance the state of the art [ 42 ]. Thereafter, progress on exploring computer vision has driven advances in the analysis of food images [ 43 ]. In this context, deep learning architectures such as MobileNetV2 combined with X-ray images can provide rapid predictions and recommendations for the next steps of the maize grain quality evaluation.…”
Section: Discussionmentioning
confidence: 99%
“…In other words, first a spatial convolution is performed independently for each channel, then by a 1x1 convolution across all channels. This approach was found to be easier than the normal 3D convolution [52].…”
Section: ) Squeezenetmentioning
confidence: 98%
“…In the case of image recognition, the recognition model always benefits from a large amount of image data [5]. Data augmentation procedures used for expanding data set, such as translation in [6], [7] while histogram equalization is used to enhance the x-ray image [8]. Mikołajczyk and Grochowski [9] studied and compared numerous approaches of data augmentation for image classification in order to progress the training process proficiency.…”
Section: Introductionmentioning
confidence: 99%