2018
DOI: 10.1038/sdata.2018.172
|View full text |Cite|
|
Sign up to set email alerts
|

The first annotated set of scanning electron microscopy images for nanoscience

Abstract: In this paper, we present the first publicly available human-annotated dataset of images obtained by the Scanning Electron Microscopy (SEM). A total of roughly 26,000 SEM images at the nanoscale are classified into 10 categories to form 4 labeled training sets, suited for image recognition tasks. The selected categories span the range of 0D objects such as particles, 1D nanowires and fibres, 2D films and coated surfaces as well as patterned surfaces, and 3D structures such as microelectromechanical system (MEM… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
41
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 73 publications
(41 citation statements)
references
References 8 publications
0
41
0
Order By: Relevance
“…While ImageNet encompasses a wide range of classes, the noise characteristics in microscopic images are different. Potentially, pre-training with other data sets exhibiting a smaller domain gap such as miscellaneous nanoscientific objects in SEM 47 or ultra-high carbon steel phases SEM 48 can be advantageous. Variances and generalization.…”
Section: Discussionmentioning
confidence: 99%
“…While ImageNet encompasses a wide range of classes, the noise characteristics in microscopic images are different. Potentially, pre-training with other data sets exhibiting a smaller domain gap such as miscellaneous nanoscientific objects in SEM 47 or ultra-high carbon steel phases SEM 48 can be advantageous. Variances and generalization.…”
Section: Discussionmentioning
confidence: 99%
“…Then, retrain the network allowing back-propagation through all the layers. We adopted checkpoints pre-trained on two data sets: ImageNet [17], a large visual database designed for object recognition, in the version of the ImageNet Large Scale Visual Recognition Challenge (ILSVRC) 2012 [18], and our own SEM data set [11]. We note that the second case cannot be formally defined as a transfer learning technique, since the fine tuning of the CNN is performed on the same data set of the checkpoint; nevertheless, this is a commonly adopted way to efficiently refine the parameters of the network.…”
Section: Supervised Learningmentioning
confidence: 99%
“…Table 1. Number of images for each label in the 1μ-2μ data set, adopting the same labelling used in [11,12,13], reported here for completeness: 0 = Porous sponges, 1 = Patterned surfaces, 2 = Particles, 3 = Films and coated surfaces, 4 = Powders, 5 = Tips, 6 = Nanowires, 7 = Biological, 8 = MEMS devices and electrodes, 9 = Fibres.…”
Section: The 1μ-2μ Data Set Selectionmentioning
confidence: 99%
See 1 more Smart Citation
“…A typical dataset involves 10 7 –10 10 detected events with a total size of up to a few hundred gigabytes (GBs), depending on the number of coordinates measured (3D or 4D) and the required SNR. Unlike the large 2D or 3D image-based datasets, such as those obtained in various forms of optical 26 , 27 and electron microscopy techniques 28 , 29 , processing and conversion of tabulated single-event data requires additional steps of statistical computing for conversion into standard images. This motivates the current workflow development for efficient data processing and analysis.…”
Section: Introductionmentioning
confidence: 99%