2022
DOI: 10.1016/j.fsidi.2021.301330

NapierOne: A modern mixed file data set alternative to Govdocs1

Cited by 10 publications (21 citation statements)
References 27 publications
“…In this paper, we introduce a complementary data set for the Govdocs1 corpus, known as NapierOne [19], which may be used to address the points made above. The Govdocs1 data set and the techniques used to create it provided an excellent template on how to create and curate the NapierOne data set.…”
Section: Paper Contribution (mentioning)
confidence: 99%
“…In most case, there exist 5,000 example files for each data set and subset. The resulting NapierOne [19] data set contains nearly 500,000 unique files distributed between 100 separate data sets and subsets. A detailed overview of the structure of the data set is shown in Figure 5.…”
Section: Data Set Creation (mentioning)
confidence: 99%