<span lang="EN-US">Intelligent transportation system (ITS) is currently one of the most discussed topics in scientific research. Actually, ITS offers advanced monitoring systems that include vehicle counting, pedestrian detection. Lately, convolutional neural networks (CNNs) are extensively used in computer vision tasks, including segmentation, classification, and detection. In fact, image semantic segmentation is a critical issue in computer vision applications. For example, self-driving vehicles require high accuracy with lower parameter requirements to segment the road scene objects in real-time. However, most related work focus on one side, accuracy or parameter requirements, which make CNN models difficult to use in real-time applications. In order to resolve this issue, we propose the efficient lightweight residual network (ELRNet), a novel and ELRNet, which is an asymmetrical encoder-decoder architecture. Indeed, in this network, we compare four varieties of the proposed factorized block, and three loss functions to get the best combination. In addition, the proposed model is trained from scratch using only 0.61M parameters. All experiments are evaluated on the popular public the cambridge-driving labeled video database (CamVid) road scene dataset and reached results show that ELRNet can achieve better performance in terms of parameters requirements and precision compared to related work.</span>
<span lang="EN-US">Intelligent transportation systems (ITS) are among the most focused research in this century. Actually, autonomous driving provides very advanced tasks in terms of road safety monitoring which include identifying dangers on the road and protecting pedestrians. In the last few years, deep learning (DL) approaches and especially convolutional neural networks (CNNs) have been extensively used to solve ITS problems such as traffic scene semantic segmentation and traffic signs classification. Semantic segmentation is an important task that has been addressed in computer vision (CV). Indeed, traffic scene semantic segmentation using CNNs requires high precision with few computational resources to perceive and segment the scene in real-time. However, we often find related work focusing only on one aspect, the precision, or the number of computational parameters. In this regard, we propose RBANet, a robust and lightweight CNN which uses a new proposed balanced attention module, and a new proposed residual module. Afterward, we have simulated our proposed RBANet using three loss functions to get the best combination using only 0.74M parameters. The RBANet has been evaluated on CamVid, the most used dataset in semantic segmentation, and it has performed well in terms of parameters’ requirements and precision compared to related work.</span>
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.