2021
DOI: 10.1007/s10270-021-00929-3
|View full text |Cite
|
Sign up to set email alerts
|

ModelSet: a dataset for machine learning in model-driven engineering

Abstract: The application of machine learning (ML) algorithms to address problems related to model-driven engineering (MDE) is currently hindered by the lack of curated datasets of software models. There are several reasons for this, including the lack of large collections of good quality models, the difficulty to label models due to the required domain expertise, and the relative immaturity of the application of ML to MDE. In this work, we present ModelSet, a labelled dataset of software models intended to enable the a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 18 publications
(5 citation statements)
references
References 47 publications
0
5
0
Order By: Relevance
“…The ModelSet dataset. 8 López et al [29] considered a subset of the Ecore and UML models collected by the MAR search engine [30,31] and labeled them. As a result, ModelSet was released in 2021.…”
Section: Datasetsmentioning
confidence: 99%
See 2 more Smart Citations
“…The ModelSet dataset. 8 López et al [29] considered a subset of the Ecore and UML models collected by the MAR search engine [30,31] and labeled them. As a result, ModelSet was released in 2021.…”
Section: Datasetsmentioning
confidence: 99%
“…This task has proved useful to facilitate the navigation of large model repositories by users. In particular, it has been used to implement faceted search in the MAR search engine [29].…”
Section: Model Classificationmentioning
confidence: 99%
See 1 more Smart Citation
“…We followed a manual data sampling methodology, which consisted of selecting 30 domain models from a larger dataset called ModelSet [10]. The goal was to have a dataset with samples of different sizes and containing domain models covering multi-disciplinary domains such as education, finance, entertainment, etc.…”
Section: A Setupmentioning
confidence: 99%
“…A dataset is basically a set of information that is provided to train the ML tool for predicting future events [ 52 ]. It forms the foundation for training and analyzing ML models.…”
Section: Introductionmentioning
confidence: 99%