2021 IEEE International Conference on Smart Data Services (SMDS) 2021
DOI: 10.1109/smds53860.2021.00016
|View full text |Cite
|
Sign up to set email alerts
|

Data Readiness Report

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
2
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 13 publications
(5 citation statements)
references
References 10 publications
0
4
0
Order By: Relevance
“…Further, Gebru et al (2021) advocate that documentation promotes the communication between "dataset consumers and producers". Existing frameworks for the elaboration of documentation include: Datasheets for Datasets (Gebru et al, 2021), Dataset Nutrition Labels (Chmielinski et al, 2022), Data Statements (Bender & Friedman, 2018), Data Readiness Report (Afzal et al, 2021), and Model Cards for Models (Mitchell et al, 2019). Formal data models, like ontologies and controlled vocabularies, can also support AIrelated documentation needs.…”
Section: Living With Bias By Documenting Itmentioning
confidence: 99%
“…Further, Gebru et al (2021) advocate that documentation promotes the communication between "dataset consumers and producers". Existing frameworks for the elaboration of documentation include: Datasheets for Datasets (Gebru et al, 2021), Dataset Nutrition Labels (Chmielinski et al, 2022), Data Statements (Bender & Friedman, 2018), Data Readiness Report (Afzal et al, 2021), and Model Cards for Models (Mitchell et al, 2019). Formal data models, like ontologies and controlled vocabularies, can also support AIrelated documentation needs.…”
Section: Living With Bias By Documenting Itmentioning
confidence: 99%
“…This information facilitates the evaluation of the suitability of a dataset by data scientists for specific tasks. The Data Readiness Report [1] present a similar proposal, deriving its design from the data readiness framework [6]. On top of the statistical analysis it also defines a set of quality metrics for evaluating datasets' composition.…”
Section: Data Documentation Proposals From the ML Communitymentioning
confidence: 99%
“…This situation has brought recent interest inside the research community about a data-centric cultural shift in the machine learning field 1 . The standardization of data creation processes, the need for formal documentation, and the need for mature tools to adopt best practices are common demands inside the research community.…”
mentioning
confidence: 99%
“…Challenges include poor and undocumented data collection practices, missing values, inconvenient storage mechanisms, intellectual property, security and privacy -notice the overlap with the interoperability issues above. To help overcome these challenges, several frameworks for data readiness have been developed [122,580,581]. These provide data handling structure and communication tools for ML practitioners, including required implementations such as data profiling and quality analyses as a key step before data enters a ML pipeline, and continuous testing for shifts in the data and environments of deployed models.…”
Section: Data-intensive Science and Computingmentioning
confidence: 99%