A
data ecosystem
(DE) offers a keystone-player or alliance-driven infrastructure that enables the interaction of different stakeholders and the resolution of interoperability issues among shared data. However, despite years of research in data governance and management, trustability is still affected by the absence of transparent and traceable data-driven pipelines. In this work, we focus on requirements and challenges that DEs face when ensuring data transparency. Requirements are derived from the data and organizational management, as well as from broader legal and ethical considerations. We propose a novel knowledge-driven DE architecture, providing the pillars for satisfying the analyzed requirements. We illustrate the potential of our proposal in a real-world scenario. Last, we discuss and rate the potential of the proposed architecture in the fulfillmentof these requirements.
The increasing interest in using the Web as a platform for data sharing has motivated research about publishing and consuming data on the Web. While this subject is gaining importance, up until now, there are not many academic papers reviewing the approaches for publishing and consuming data on the Web. Furthermore, to the best of our knowledge, there is no systematic review of the literature that analyzes this subject. In this article, we conduct a systematic mapping study that aims to provide an overview of the current literature on publishing and consuming data on the Web by conducting a systematic mapping study. This study seeks to function as a snapshot of this subject by (i) identifying and analyzing how data have been published and consumed on the Web, (ii) discovering the benefits and limitations of publishing and consuming data on the Web (iii) analyzing the evolution of research on publishing and consuming data on the Web, and (iv) classifying the studies into categories related to their contribution. Finally, we discuss the results of this study and their implications for research on data on the Web-related subjects.
Governments and public agencies are publishing data on the Web, called Open Government Data (OGD), without restrictions for its sharing and reuse. This growing interest has motivated Brazilian Government Administrations to launch Open Data initiatives. As a consequence, in the last years, several Brazilian Open Data portals have been launched. Nowadays, there is a huge number of OGD portals from different countries and a lot of effort has been made to evaluate them. Although Brazil is one of the leaders of the Open Data initiative in South American and one of the founders of the Open Government Partnership, there are no comprehensive reports about the current Brazilian OGD scenario. In this context, we evaluated 13 Brazilian OGD portals according to several criteria ranging from the volume of data available to the quality of the data. As part of the results, we found that quantity and size of datasets are very diverse, CSV is the most used data format, at least 47 different data formats have been used to publish Open Data and most of the Brazilian Open Data portals use the CKAN platform. In this paper, we present and discuss the research method and the results of our work.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.