2021
DOI: 10.32473/flairs.v34i1.128560
|View full text |Cite
|
Sign up to set email alerts
|

Automated Assessment of Quality of Jupyter Notebooks Using Artificial Intelligence and Big Code

Abstract: We present in this paper an automated method to assess the quality of Jupyter notebooks. The quality of notebooks is assessed in terms of reproducibility and executability. Specifically, we automatically extract a number of expert-defined features for each notebook, perform a feature selection step, and then trained supervised binary classifiers to predict whether a notebook is reproducible and executable, respectively. We also experimented with semantic code embeddings to capture the notebooks' semantics. We … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 13 publications
0
2
0
Order By: Relevance
“…Various analytical techniques are used to understand the collected data. Descriptive analysis is used to understand the distribution of data, calculate summary statistics such as mean, median, mode, and standard deviation, and identify patterns and trends in data [42]. Meanwhile, inferential analysis tests hypotheses about relationships between variables, builds statistical models to predict purchasing behavior, and groups customers based on their profiles and preferences [38].…”
Section: Figure 1 Research Methodology Stagesmentioning
confidence: 99%
“…Various analytical techniques are used to understand the collected data. Descriptive analysis is used to understand the distribution of data, calculate summary statistics such as mean, median, mode, and standard deviation, and identify patterns and trends in data [42]. Meanwhile, inferential analysis tests hypotheses about relationships between variables, builds statistical models to predict purchasing behavior, and groups customers based on their profiles and preferences [38].…”
Section: Figure 1 Research Methodology Stagesmentioning
confidence: 99%
“…The quality of notebooks has been addressed in previous works that identify common issues (Oli et al, 2021;Pimentel et al, 2019Pimentel et al, , 2021. Examples of these include: the execution order of cells, and problems such as unnamed notebooks (e.g., Untitled1.ipynb) or the reproducibility of results.…”
Section: Related Workmentioning
confidence: 99%