Xinyu Zhang scite author profile

This research focuses on the adaptive navigation of maritime autonomous surface ships (MASSs) in an uncertain environment. To achieve intelligent obstacle avoidance of MASSs in a port, an autonomous navigation decision-making model based on hierarchical deep reinforcement learning is proposed. The model is mainly composed of two layers: the scene division layer and an autonomous navigation decision-making layer. The scene division layer mainly quantifies the sub-scenarios according to the International Regulations for Preventing Collisions at Sea (COLREG). This research divides the navigational situation of a ship into entities and attributes based on the ontology model and Protégé language. In the decision-making layer, we designed a deep Q-learning algorithm utilizing the environmental model, ship motion space, reward function, and search strategy to learn the environmental state in a quantized sub-scenario to train the navigation strategy. Finally, two sets of verification experiments of the deep reinforcement learning (DRL) and improved DRL algorithms were designed with Rizhao port as a study case. Moreover, the experimental data were analyzed in terms of the convergence trend, iterative path, and collision avoidance effect. The results indicate that the improved DRL algorithm could effectively improve the navigation safety and collision avoidance.

show abstract

Active Object Detection With Multistep Action Prediction Using Deep Q-Network

Han

Liu²,

Sun³

et al. 2019

IEEE Trans. Ind. Inf.

View full text Add to dashboard Cite

In recent years, great success has been achieved in visual object detection, which is one of the fundamental tasks in the field of industrial intelligence. Most of existing methods have been proposed to deal with single well-captured still images, while in practical robotic applications, due to nuisances, such as tiny scale, partial view, or occlusion, one still image may not contain enough information for object detection. However, an intelligent robot has the capability to adjust its viewpoint to get better images for detection. Therefore, active object detection becomes a very important perception strategy for intelligent robots. In this paper, by formulating active object detection as a sequential action decision process, a deep reinforcement learning framework is established to resolve it. Furthermore, a novel deep Q-learning network (DQN) with a dueling architecture is proposed, the network has two separate output channels, one predicts action type and the other predicts action range. By combining the two output channels, the action space is explored more efficiently. Several methods are extensively validated and the results show that the proposed one obtains the best results and predicts action in real time.

show abstract

A Design Methodology for Efficient Implementation of Deconvolutional Neural Networks on an FPGA

Zhang¹,

Das²,

Neopane³

et al. 2017

Preprint

View full text Add to dashboard Cite

In recent years deep learning algorithms have shown extremely high performance on machine learning tasks such as image classification and speech recognition.In support of such applications, various FPGA accelerator architectures have been proposed for convolutional neural networks (CNNs) that enable high performance for classification tasks at lower power than CPU and GPU processors. However, to date, there has been little research on the use of FPGA implementations of deconvolutional neural networks (DCNNs). DCNNs, also known as generative CNNs, encode high-dimensional probability distributions and have been widely used for computer vision applications such as scene completion, scene segmentation, image creation, image denoising, and super-resolution imaging. We propose an FPGA architecture for deconvolutional networks built around an accelerator which effectively handles the complex memory access patterns needed to perform strided deconvolutions, and that supports convolution as well. We also develop a three-step design optimization method that systematically exploits statistical analysis, design space exploration and VLSI optimization. To verify our FPGA deconvolutional accelerator design methodology we train DCNNs offline on two representative datasets using the generative adversarial network method (GAN) run on Tensorflow, and then map these DCNNs to an FPGA DCNN-plus-accelerator implementation to perform generative inference on a Xilinx Zynq-7000 FPGA. Our DCNN implementation achieves a peak performance density of 0.012 GOPs/DSP.

show abstract

Effects of low doping on the improvement of cathode materials Na_3+xV_2−xM_x(PO₄)₃ (M = Co²⁺, Cu²⁺; x = 0.01–0.05) for SIBs

Chen

Butenko

et al. 2021

J. Mater. Chem. A

View full text Add to dashboard Cite

show abstract

Learning-based Practical Smartphone Eavesdropping with Built-in Accelerometer

Ba¹,

Zheng²,

Zhang³

et al. 2020

View full text Add to dashboard Cite

Motion sensors on current smartphones have been exploited for audio eavesdropping due to their sensitivity to vibrations. However, this threat is considered low-risk because of two widely acknowledged limitations: First, unlike microphones, motion sensors can only pick up speech signals traveling through a solid medium. Thus, the only feasible setup reported previously is to use a smartphone gyroscope to eavesdrop on a loudspeaker placed on the same table. The second limitation comes from a common sense that these sensors can only pick up a narrow band (85-100Hz) of speech signals due to a sampling ceiling of 200Hz. In this paper, we revisit the threat of motion sensors to speech privacy and propose AccelEve, a new side-channel attack that employs a smartphone's accelerometer to eavesdrop on the speaker in the same smartphone. Specifically, it utilizes the accelerometer measurements to recognize the speech emitted by the speaker and to reconstruct the corresponding audio signals. In contrast to previous works, our setup allows the speech signals to always produce strong responses in accelerometer measurements through the shared motherboard, which successfully addresses the first limitation and allows this kind of attacks to penetrate into real-life scenarios. Regarding the sampling rate limitation, contrary to the widely-held belief, we observe up to 500Hz sampling rates in recent smartphones, which almost covers the entire fundamental frequency band (85-255Hz) of adult speech. On top of these pivotal observations, we propose a novel deep learning based system that learns to recognize and reconstruct speech information from the spectrogram representation of acceleration signals. This system employs adaptive optimization on deep neural networks with skip connections using robust and generalizable losses to achieve robust recognition and reconstruction performance. Extensive evaluations demonstrate the effectiveness and high accuracy of our attack under various settings.

show abstract

Direct-gap semiconducting tri-layer silicene with 29% photovoltaic efficiency

Lin

et al. 2018

Nano Energy

View full text Add to dashboard Cite

Traffic light detection and recognition for autonomous vehicles

Guo

Zhang

et al. 2015

The Journal of China Universities of Posts and Telecommunicatio

View full text Add to dashboard Cite

Determining the stereo configuration of carbonyl sulfide dimers using Coulomb-explosion imaging

Zhao

Wang

et al. 2021

Phys. Rev. A

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Xinyu Zhang

Decision-Making for the Autonomous Navigation of Maritime Autonomous Surface Ships Based on Scene Division and Deep Reinforcement Learning

Active Object Detection With Multistep Action Prediction Using Deep Q-Network

A Design Methodology for Efficient Implementation of Deconvolutional Neural Networks on an FPGA

Effects of low doping on the improvement of cathode materials Na_3+xV_2−xM_x(PO₄)₃ (M = Co²⁺, Cu²⁺; x = 0.01–0.05) for SIBs

Learning-based Practical Smartphone Eavesdropping with Built-in Accelerometer

Direct-gap semiconducting tri-layer silicene with 29% photovoltaic efficiency

Traffic light detection and recognition for autonomous vehicles

Determining the stereo configuration of carbonyl sulfide dimers using Coulomb-explosion imaging

Contact Info

Product

Resources

About

Xinyu Zhang

Decision-Making for the Autonomous Navigation of Maritime Autonomous Surface Ships Based on Scene Division and Deep Reinforcement Learning

Active Object Detection With Multistep Action Prediction Using Deep Q-Network

A Design Methodology for Efficient Implementation of Deconvolutional Neural Networks on an FPGA

Effects of low doping on the improvement of cathode materials Na3+xV2−xMx(PO4)3 (M = Co2+, Cu2+; x = 0.01–0.05) for SIBs

Learning-based Practical Smartphone Eavesdropping with Built-in Accelerometer

Direct-gap semiconducting tri-layer silicene with 29% photovoltaic efficiency

Traffic light detection and recognition for autonomous vehicles

Determining the stereo configuration of carbonyl sulfide dimers using Coulomb-explosion imaging

Contact Info

Product

Resources

About

Effects of low doping on the improvement of cathode materials Na_3+xV_2−xM_x(PO₄)₃ (M = Co²⁺, Cu²⁺; x = 0.01–0.05) for SIBs