Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery &Amp; Data Mining 2020
DOI: 10.1145/3394486.3403290
|View full text |Cite
|
Sign up to set email alerts
|

Building Continuous Integration Services for Machine Learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
9
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
4
1

Relationship

1
8

Authors

Journals

citations
Cited by 19 publications
(9 citation statements)
references
References 12 publications
0
9
0
Order By: Relevance
“…The key ingredients of our system lie in (a) the syntax and semantics of the test conditions and how to accurately evaluate them, and (b) an optimized sample-size estimator that yields a budget of test set re-uses before it needs to be refreshed. For a full description of the workflow as well as advanced system optimizations deployed in our engine, we refer the reader to our initial paper [31] and the followup work [22], which further discusses the integration into existing software development ecosystems.…”
Section: Mlops Challengementioning
confidence: 99%

A Data Quality-Driven View of MLOps

Renggli,
Rimanic,
Gürel
et al. 2021
Preprint
Self Cite
“…The key ingredients of our system lie in (a) the syntax and semantics of the test conditions and how to accurately evaluate them, and (b) an optimized sample-size estimator that yields a budget of test set re-uses before it needs to be refreshed. For a full description of the workflow as well as advanced system optimizations deployed in our engine, we refer the reader to our initial paper [31] and the followup work [22], which further discusses the integration into existing software development ecosystems.…”
Section: Mlops Challengementioning
confidence: 99%

A Data Quality-Driven View of MLOps

Renggli,
Rimanic,
Gürel
et al. 2021
Preprint
Self Cite
“…Prospective collaborators may submit code to a shared codebase. Some code may introduce errors or decrease the performance of the ML model [46,48,69,81]. How can code contributions be evaluated?…”
Section: Challengesmentioning
confidence: 99%
“…Although all the project under this study are written using Python, no project is using any common tool that integrates the DS modules and provides interface to the pipeline. Today, continuous integration and deployment (CI/CD) tools are widely used as a common practice in traditional software lifecycle to automate compilation, building, and testing [26,37]. Additionally, from our subject studies of pipelines in theory, we found some CI/CD tools designed for ML pipelines are available [27,47,49].…”
Section: Characteristics Of Pipelines In-the-largementioning
confidence: 99%
“…Although CI/CD frameworks e.g., TravisCI, GitHub Actions, Microsoft Azure DevOps are well established for traditional software such as web applications, several challenges remain for DS pipelines. Karlaš et al outlined a major CI/CD challenge for DS pipeline i.e., unlike traditional software, ML testing is probabilistic [37]. The authors also pointed out to the gap between recent theoretical development of CI/CD in DS and their usage in practice.…”
Section: Characteristics Of Pipelines In-the-largementioning
confidence: 99%