2012
DOI: 10.1002/cpe.2936

Programming knowledge discovery workflows in service‐oriented distributed systems

Abstract: In several scientific and business domains, very large data repositories are generated. To find interesting and useful information in those repositories, efficient data mining techniques and knowledge discovery processes must be used. The exploitation of data mining techniques in science helps scientists in hypothesis formation and supports their scientific practices, whereas in industrial processes, data mining can exploit existing data sources as a real value for companies that can take advant…

citations
Cited by 13 publications
(9 citation statements)
references
References 28 publications
“…DMCF supports a large variety of data mining processes, including single-task applications, parameter-sweeping applications, and workflow-based applications. Following the approach proposed in [2], DMCF represents knowledge discovery workflows as graphs whose nodes denote resources (datasets, data mining tools, data mining models) and whose edges denote dependencies among resources. A Web-based user interface allows users to compose workflows and to submit them for execution to Cloud platforms, following a Software-as-a-Service approach.…”
Section: Data Mining Cloud Framework
mentioning
confidence: 99%
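The graph model described in this statement can be illustrated with a minimal Python sketch. It assumes a workflow is stored as a mapping from each resource to the resources it depends on; the resource names and the `execution_order` helper are hypothetical and are not taken from DMCF. The standard-library `graphlib.TopologicalSorter` (Python 3.9+) then yields one order in which the resources could be produced.

```python
from graphlib import TopologicalSorter

# Hypothetical workflow: each key is a resource (dataset, tool, or model),
# each value is the set of resources it depends on.
# These names are illustrative only and do not come from DMCF.
dependencies = {
    "train_dataset": set(),
    "test_dataset": set(),
    "classifier_tool": set(),
    "model": {"train_dataset", "classifier_tool"},
    "predictions": {"model", "test_dataset"},
}


def execution_order(deps):
    """Return one valid order in which the resources can be produced."""
    # static_order() raises graphlib.CycleError if the graph has a cycle.
    return list(TopologicalSorter(deps).static_order())


order = execution_order(dependencies)
print(order)
```

Resources with no unmet dependencies at the same point in the order are independent of each other, which is what would let a scheduler run them concurrently.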
“…DMCF supports a large variety of processing patterns that can be used in data mining including single-task applications, parameter-sweeping applications, and workflow-based applications. Following the approach proposed in [18], DMCF represents knowledge discovery workflows as graphs whose nodes denote resources (datasets, data analysis tools, and mining models) and whose edges denote dependencies among resources. A web-based user interface allows users to compose their applications and to submit them for execution to the Cloud platform, following a SaaS approach.…”
Section: Data Mining Cloud Framework
mentioning
confidence: 99%
“…Then, k unlabeled datasets are specified as input, with k = 4 (line 17). Each of the k input datasets is classified by n predictors using the n models generated by J48 and by m predictors using the m models generated by JRip; therefore, for each of the k input datasets, n + m classified datasets are generated (lines 18-22). As a final step, k weighted voters are executed; the i-th voter receives the n + m classified datasets generated from the i-th input and the n + m models and returns the final classified dataset for the i-th input (lines 25-26).…”
Section: Ensemble Learning Workflow
mentioning
confidence: 99%
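The voting step in this statement can be sketched in Python as follows. This is a toy illustration under stated assumptions, not the cited workflow: the threshold functions stand in for the J48 and JRip models, and the per-model weights, the datasets, and the `weighted_vote` and `threshold_model` helpers are all hypothetical.

```python
from collections import defaultdict


def weighted_vote(predictions, weights):
    """Combine per-model label lists into one label list by weighted vote.

    predictions: one label list per model (n + m lists of equal length).
    weights: one non-negative weight per model (hypothetical values).
    """
    if len(predictions) != len(weights):
        raise ValueError("one weight per model is required")
    final = []
    for labels in zip(*predictions):
        score = defaultdict(float)
        for label, weight in zip(labels, weights):
            score[label] += weight
        # The label with the highest total weight wins.
        final.append(max(score, key=score.get))
    return final


def threshold_model(threshold):
    """Toy stand-in for a trained classifier (assumption, not J48 or JRip)."""
    return lambda x: "pos" if x > threshold else "neg"


n_models = [threshold_model(0), threshold_model(1)]   # n = 2 stand-ins
m_models = [threshold_model(-1)]                      # m = 1 stand-in
models = n_models + m_models
weights = [0.9, 0.6, 0.5]                             # hypothetical weights

datasets = [[-2, 0.5, 3], [2, -0.5], [0, 0], [5]]     # k = 4 unlabeled inputs

results = []
for data in datasets:
    # n + m classified datasets for this input, one per model.
    classified = [[model(x) for x in data] for model in models]
    # The voter for this input merges them into one final labeling.
    results.append(weighted_vote(classified, weights))

print(results)
```

Each iteration of the loop is independent of the others, so the k classification-and-voting chains could run in parallel.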
“…In the following we outline the Data Mining Cloud Framework and explain how Cloud4SNP has been implemented on it. Following the approach proposed in [19] and [4], the Data Mining Cloud Framework models knowledge discovery workflows as graphs whose nodes represent resources (datasets, data mining tools, data mining models) and whose edges represent dependencies between resources. The framework includes a Website to compose workflows and to submit their execution to the Cloud, following a Software-as-a-Service approach.…”
Section: Cloud4SNP Implementation
mentioning
confidence: 99%
“…www.affymetrix.com. 4. In the rest of the paper the terms SNPs and alleles are used interchangeably. 5. DMET-Analyzer is available at: http://sourceforge.net/projects/dmetanalyzer/files.…”
mentioning
confidence: 99%