2017
DOI: 10.1016/j.envsoft.2017.03.032
|View full text |Cite
|
Sign up to set email alerts
|

Identification and characterization of information-networks in long-tail data collections

Abstract: Scientists' ability to synthesize and reuse long-tail scientific data lags far behind their ability to collect and produce these data. Many Earth Science Cyberinfrastructures enable sharing and publishing their data over the web using metadata standards. While profiling data attributes advances the Linked Data approach, it has become clear that building informationnetworks among distributed data silos is essential to increase their integration and reusability. In this research, we developed a Long-Tail Informa… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2018
2018
2020
2020

Publication Types

Select...
3
2

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(4 citation statements)
references
References 27 publications
0
4
0
Order By: Relevance
“…The framework attempts to fulfill these requirements by leveraging stable open source software, supporting flexible metadata definitions and offloading expensive operations to a cloud based extraction bus. Clowder has been developed and leveraged in a wide variety of use cases, including material science research [24], cultural heritage [31], digital curation [23,26], clinical research informatics [36], geosciences [11], environmental monitoring [2,15], digital humanities [18,29], social science [28], as well as industry collaborations.…”
Section: Introductionmentioning
confidence: 99%
“…The framework attempts to fulfill these requirements by leveraging stable open source software, supporting flexible metadata definitions and offloading expensive operations to a cloud based extraction bus. Clowder has been developed and leveraged in a wide variety of use cases, including material science research [24], cultural heritage [31], digital curation [23,26], clinical research informatics [36], geosciences [11], environmental monitoring [2,15], digital humanities [18,29], social science [28], as well as industry collaborations.…”
Section: Introductionmentioning
confidence: 99%
“…One key objective of the environmental data science is to narrow the data-to-knowledge latency (Elag et al, 2017). According to Foster et al (2012), scientific data doubles every twelve months, which is often faster than it can be converted into useful knowledge.…”
Section: Problem Statementmentioning
confidence: 99%
“…Syntactic heterogeneity is a factor which hinders the adoption of a universal strategy to acquire data originating from disparate information sources. It also obstructs the environmental data science core objective: to narrow the data-to-knowledge latency (Elag et al, 2017) by supporting environmental data discovery and access; and by enabling re-usability (Horsburgh et al, 2009;Ames et al, 2012;Athanasiadis, 2015;Holzworth et al, 2015;Granell et al, 2010). FAIR (Findable, Accessible, Interoperable, Reusable) guiding principles for scientific data management and stewardship highlight the importance of scientific data reusability and reproducibility (Wilkinson et al, 2016).…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation