Abstract. This paper proposes a novel discovery framework for smart objects in the Internet of Things (IoT). The discovery service is a fundamental block of the IoT as it allows smart objects and their users to dynamically discover distributed smart objects and, specifically, the services and operations they provide. The proposed framework defines a new metadata model to describe features, services, and operations of network-enabled smart objects, and implements a service-oriented service, accessible through a REST interface, for registering, indexing, and searching smart objects. Thanks to its generic architecture and the use of open Web standards, the proposed discovery service can be easily integrated into any smart object middleware. In particular, we show its integration into an agent-based middleware supported by the JADE platform.
In several scientific and business domains, very large data repositories are generated. To find interesting and useful information in those repositories, efficient data mining techniques and knowledge discovery processes must be used. The exploitation of data mining techniques in science helps scientists in hypothesis formation and gives them a support on their scientific practices, whereas in industrial processes, data mining can exploit existing data sources as a real value for companies that can take advantage from the knowledge that can be extracted from their large data sources. Data mining tasks are often composed by multiple stages that may be linked to each other to form various execution flows. Moreover, data mining tasks are often distributed because they involve data and tools located over geographically distributed environments. Therefore, it is fundamental to exploit effective paradigms, such as services and workflows, to model data mining tasks that are both multi-staged and distributed. This paper discusses data mining services and workflows for analyzing scientific data in high-performance distributed environments such as Grids and Clouds. We discuss how it is possible to define basic and complex services for supporting distributed data mining tasks in Grids. We also present a workflow formalism and a service-oriented programming framework, named DIS3GNO, for designing and running distributed knowledge discovery processes in the Knowledge Grid system. DIS3GNO supports all the phases of a knowledge discovery process, including composition, execution, and results visualization. After introducing DIS3GNO, some relevant use cases implemented by it and a performance evaluation of the system are discussed. Abstract resource Concrete resource Location set KNOWLEDGE DISCOVERY WORKFLOWS IN DISTRIBUTED SYSTEMS 1499 Figure 17. Parameter sweeping workflow.581,012 instances and is stored in a file having a size of 72 MB. From this dataset, we extracted three datasets with 72,500, 145,000, and 290,000 instances and a file size of 9 MB, 18 MB, and 36 MB, respectively. Then, we used DIS3GNO to perform a classification analysis on each of those datasets.DIS3GNO has been used to run an application in which eight independent instances of the J48 algorithm perform a different classification task on the covertype dataset. In particular, each J48 instance has been asked to classify data using a different value of confidence, ranging from 0.15 to 0.50. The same application has been executed using a number of computing nodes ranging from one to eight to evaluate the system speedup.The workflow corresponding to the application is shown in Figure 17. It includes a dataset node (representing the covertype dataset) connected to eight tool nodes, each one associated with an instance of the J48 classification algorithm with a different value of confidence (ranging from 0.15 to 0.50). These nodes are in turn connected to another tool node, associated with a model chooser that selects the best classification model among those le...
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.