Abstract:Privacy concerns are considered one of the main challenges in smart cities as sharing sensitive data induces threatening problems in people's lives. Federated learning has emerged as an effective technique to avoid privacy infringement as well as increase the utilization of the data. However, there is a scarcity in the amount of labeled data and an abundance of unlabeled data collected in smart cities; hence there is a necessity to utilize semi-supervised learning. In this paper, we present the primary design … Show more
“…The use of massive data from multiple vehicles and the large computational capability of all participant help provide better traffic prediction outcomes, which cannot be achieved by using centralized ML techniques with less dataset and limited computation. FL has been exploited to provide distributed AI functions for decentralized smart city applications such as intelligent smart city data management [48]. In this context, FL is helpful to structure data streams from ubiquitous IoT devices that work as FL clients for performing local learning without sharing their data to external third-parties.…”
Section: Visions Of the Use Of Fl In Iotmentioning
confidence: 99%
“…1) FL for Data Management: With its decentralized and privacy-preserved nature, FL has been exploited to provide distributed AI functions for large-scale intelligent data management systems in smart cities. For example, a semi-supervised FL method called FedSem is introduced in [48] to provide distributed processing for unlabeled data in smart cities. To evaluate the usefulness of FL in a smart city, a prototype with smart vehicles is considered where each vehicle learns a DNN model based on traffic sign image datasets.…”
Section: Fl For Smart Citymentioning
confidence: 99%
“…However, most proposed AI-based smart city solutions rely on a centralized learning architecture on a data center, such as a cloud server, which is obviously not scalable to the rapid expansion of smart devices in smart cities. FL offers more attractive features for enabling decentralized smart city applications with high privacy levels and low communication delays [48]. FL is also important for structuring data streams from ubiquitous IoT devices that act as FL clients for performing local learning without sharing their data with external third-parties [153].…”
Section: B Lessons Learned From Fl-iot Applicationsmentioning
“…The use of massive data from multiple vehicles and the large computational capability of all participant help provide better traffic prediction outcomes, which cannot be achieved by using centralized ML techniques with less dataset and limited computation. FL has been exploited to provide distributed AI functions for decentralized smart city applications such as intelligent smart city data management [48]. In this context, FL is helpful to structure data streams from ubiquitous IoT devices that work as FL clients for performing local learning without sharing their data to external third-parties.…”
Section: Visions Of the Use Of Fl In Iotmentioning
confidence: 99%
“…1) FL for Data Management: With its decentralized and privacy-preserved nature, FL has been exploited to provide distributed AI functions for large-scale intelligent data management systems in smart cities. For example, a semi-supervised FL method called FedSem is introduced in [48] to provide distributed processing for unlabeled data in smart cities. To evaluate the usefulness of FL in a smart city, a prototype with smart vehicles is considered where each vehicle learns a DNN model based on traffic sign image datasets.…”
Section: Fl For Smart Citymentioning
confidence: 99%
“…However, most proposed AI-based smart city solutions rely on a centralized learning architecture on a data center, such as a cloud server, which is obviously not scalable to the rapid expansion of smart devices in smart cities. FL offers more attractive features for enabling decentralized smart city applications with high privacy levels and low communication delays [48]. FL is also important for structuring data streams from ubiquitous IoT devices that act as FL clients for performing local learning without sharing their data with external third-parties [153].…”
Section: B Lessons Learned From Fl-iot Applicationsmentioning
“…Semi-supervised FL has already been proposed by [2] to take advantage of the abundant unlabeled data in the smart city context. Contrary to LiM, they assume a subset of clients have labeled samples, and use them to train a classifier that will provide the missing labels to retrain another local model.…”
In this paper we present LiM (‘Less is More’), a malware classification framework that leverages Federated Learning to detect and classify malicious apps in a privacy-respecting manner. Information about newly installed apps is kept locally on users’ devices, so that the provider cannot infer which apps were installed by users. At the same time, input from all users is taken into account in the federated learning process and they all benefit from better classification performance. A key challenge of this setting is that users do not have access to the ground truth (i.e. they cannot correctly identify whether an app is malicious). To tackle this, LiM uses a safe semi-supervised ensemble that maximizes classification accuracy with respect to a baseline classifier trained by the service provider (i.e. the cloud). We implement LiM and show that the cloud server has F1 score of 95%, while clients have perfect recall with only 1 false positive in > 100 apps, using a dataset of 25K clean apps and 25K malicious apps, 200 users and 50 rounds of federation. Furthermore, we conduct a security analysis and demonstrate that LiM is robust against both poisoning attacks by adversaries who control half of the clients, and inference attacks performed by an honest-but-curious cloud server. Further experiments with Ma-MaDroid’s dataset confirm resistance against poisoning attacks and a performance improvement due to the federation.
“…Smart cities aim at providing robust solutions to crucial societal challenges related to transportation, health, environment, education, and security [ 1 ]. Smart cities are expected to deploy massive Internet-of-Things (IoT) related devices and applications.…”
The envisioned smart city domains are expected to rely heavily on artificial intelligence and machine learning (ML) approaches for their operations, where the basic ingredient is data. Privacy of the data and training time have been major roadblocks to achieving the specific goals of each application domain. Policy makers, the research community, and the industrial sector have been putting their efforts into addressing these issues. Federated learning, with its distributed and local training approach, stands out as a potential solution to these challenges. In this article, we discuss the potential interplay of different technologies and AI for achieving the required features of future smart city services. Having discussed a few use-cases for future eHealth, we list design goals and technical requirements of the enabling technologies. The paper confines its focus on federated learning. After providing the tutorial on federated learning, we analyze the Federated Learning research literature. We also highlight the challenges. A solution sketch and high-level research directions may be instrumental in addressing the challenges.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.