2019
DOI: 10.1007/s00778-019-00564-x
|View full text |Cite
|
Sign up to set email alerts
|

Dataset search: a survey

Abstract: Generating value from data requires the ability to find, access and make sense of datasets. There are many efforts underway to encourage data sharing and reuse, from scientific publishers asking authors to submit data alongside manuscripts to data marketplaces, open data portals and data communities. Google recently beta released a search service for datasets, which allows users to discover data stored in various online repositories via keyword queries. These developments foreshadow an emerging research field … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

1
110
0
1

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 128 publications
(114 citation statements)
references
References 163 publications
1
110
0
1
Order By: Relevance
“…Dataset search has become a new research field with new challenges. Chapman et al [3] classify dataset search into basic and constructive dataset search. Basic dataset search returns a list of existing datasets based on a user's query, while constructive dataset search [5] generates datasets on-the-fly based on a user's needs and query.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Dataset search has become a new research field with new challenges. Chapman et al [3] classify dataset search into basic and constructive dataset search. Basic dataset search returns a list of existing datasets based on a user's query, while constructive dataset search [5] generates datasets on-the-fly based on a user's needs and query.…”
Section: Related Workmentioning
confidence: 99%
“…Dataset retrieval is receiving more attention as people from different fields and domains start to rely on datasets for their work. There are many data portals with the purpose of effective and efficient data management and data sharing, such as data.gov 1 , datahub 2 and data.world 3 . Most of those data portals use CKAN 4 as their backend.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Our system, Visus, aims to fill this gap in the current HGML systems. In the next session, we describe how we integrated data search and automatic joins in our framework [2], as well as model interpretation based on rule visualization [15]. Figure 1 shows the main components of our visual analytics framework.…”
Section: Related Workmentioning
confidence: 99%
“…However, the returned results highly depend on the specified query words. It is often necessary to go several rounds by verifying the search results and revising query words [1]. Motivated by the difficulties of finding usable and relevant datasets, we aim at developing a dataset recommendation model, which assists users to efficiently find more truly relevant and usable datasets than current time demanding ways.…”
Section: Introductionmentioning
confidence: 99%