A Comparison Between Adaptive Neural Networks Algorithms for Estimating Vehicle Travel Time

Hanafy, Yasmin Adel; Gazya, Mohamed; Mashaly, Maggie; Ghany, Mohamed A. Abd El

doi:10.1109/icces51560.2020.9334615

Cited by 2 publications

(1 citation statement)

References 13 publications

“…Numerous attempts have been made in the literature to build online neural network models [14][15][16][17][18]. All of the work proposed is software-optimized and lacks the benefits of parallel processing hardware solutions found in FPGAs.…”

Section: Introduction (mentioning)

confidence: 99%

An Efficient Hardware Design for a Low-Latency Traffic Flow Prediction System Using an Online Neural Network

2021

Self Cite

Neural networks are computing systems inspired by the biological neural networks in human brains. They are trained in a batch learning mode; hence, the whole training data set must be available before training begins. However, this is not feasible for many real-time applications where data arrive sequentially, such as online topic detection in social communities and traffic flow prediction. In this paper, an efficient hardware implementation of a low-latency online neural network system is proposed for a traffic flow prediction application. The proposed model is implemented with different Machine Learning (ML) algorithms to predict the traffic flow with high accuracy, where the Hedge Backpropagation (HBP) model achieves the lowest mean absolute error (MAE) of 0.001. The proposed system is implemented using floating-point and fixed-point arithmetic on the Field Programmable Gate Array (FPGA) part of the ZedBoard. The implementation is provided using BRAM architecture and distributed memory in the FPGA in order to achieve the best trade-off between latency, area consumption, and power. Using the fixed-point approach, the prediction times using the distributed memory and BRAM architectures are 150 ns and 420 ns, respectively. The area delay product (ADP) of the proposed system is reduced by 17× compared with the hardware implementation of the latest proposed system in the literature. The execution time of the proposed hardware system is improved by 200× compared with the software implemented on a dual-core Intel i7-7500U CPU at 2.9 GHz. Consequently, the proposed hardware model is faster than the software model and more suitable for time-critical online machine learning models.
