Han Shen scite author profile

Han Shen

9 Publications

80 Citation Statements Received

62 Citation Statements Given


Affiliations

Rensselaer Polytechnic Institute, Horizon Research (United States), Engineering Conferences International

Publications


Learned Video Compression via Joint Spatial-Temporal Correlation Exploration

Liu, Shen, Huang, et al. 2020

AAAI


Traditional video compression technologies have been developed over decades in pursuit of higher coding efficiency. Efficient temporal information representation plays a key role in video coding. Thus, in this paper, we propose to exploit the temporal correlation using both first-order optical flow and second-order flow prediction. We suggest a one-stage learning approach that encapsulates flow as quantized features from consecutive frames, which are then entropy coded with adaptive contexts conditioned on joint spatial-temporal priors to exploit second-order correlations. Joint priors are embedded in autoregressive spatial neighbors, co-located hyper elements, and temporal neighbors using ConvLSTM recurrently. We evaluate our approach for the low-delay scenario against High-Efficiency Video Coding (H.265/HEVC), H.264/AVC, and another learned video compression method, following the common test settings. Our work offers state-of-the-art performance, with consistent gains across all popular test sequences.


Adaptive Temporal Difference Learning with Linear Function Approximation

Sun, Shen, Chen, et al. 2020

Preprint


This paper revisits the celebrated temporal difference (TD) learning algorithm for policy evaluation in reinforcement learning. Typically, the performance of the plain-vanilla TD algorithm is sensitive to the choice of stepsizes. Oftentimes, TD suffers from slow convergence. Motivated by the tight connection between the TD learning algorithm and stochastic gradient methods, we develop the first adaptive variant of the TD learning algorithm with linear function approximation, which we term AdaTD. In contrast to the original TD, AdaTD is robust or less sensitive to the choice of stepsizes. Analytically, we establish that to reach an ε accuracy, the number of iterations needed is Õ(ε⁻² ln⁴(1/ε) / ln⁴(1/ρ)), where ρ represents the speed at which the underlying Markov chain converges to the stationary distribution. This implies that the iteration complexity of AdaTD is no worse than that of TD in the worst case. Going beyond TD, we further develop an adaptive variant of TD(λ), which is referred to as AdaTD(λ). We evaluate the empirical performance of AdaTD and AdaTD(λ) on several standard reinforcement learning tasks in OpenAI Gym with both linear and nonlinear function approximation; the results demonstrate the effectiveness of our new approaches over existing ones.
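For orientation, the baseline that the abstract starts from, plain TD(0) with linear function approximation and a fixed stepsize, can be sketched as follows. This is not AdaTD: the adaptive stepsize rule is not given in the abstract and is not reproduced here. The function name `td0_linear`, the toy three-state chain, the one-hot features, and all parameter values are assumptions made purely for illustration.

```python
import numpy as np

def td0_linear(features, transitions, rewards, gamma=0.9, alpha=0.005,
               steps=60000, seed=0):
    """Plain TD(0) with a linear value estimate V(s) = features[s] @ theta.

    features:    (n_states, d) feature matrix (hypothetical layout).
    transitions: (n_states, n_states) row-stochastic matrix of the fixed policy.
    rewards:     (n_states,) reward received on leaving each state.
    alpha:       fixed stepsize; the quantity plain TD is sensitive to.
    """
    rng = np.random.default_rng(seed)
    n_states, d = features.shape
    theta = np.zeros(d)
    s = 0
    for _ in range(steps):
        # Sample the next state of the Markov chain induced by the policy.
        s_next = rng.choice(n_states, p=transitions[s])
        # TD error: bootstrapped target minus current estimate.
        td_error = (rewards[s] + gamma * features[s_next] @ theta
                    - features[s] @ theta)
        # Semi-gradient update along the feature vector of the current state.
        theta += alpha * td_error * features[s]
        s = s_next
    return theta

# Toy 3-state chain (assumed example). With one-hot features the linear
# estimate can represent the exact value function, so it can be checked
# against the closed-form solution of the Bellman equation.
P = np.array([[0.50, 0.50, 0.00],
              [0.25, 0.50, 0.25],
              [0.00, 0.50, 0.50]])
r = np.array([0.0, 0.0, 1.0])
theta = td0_linear(np.eye(3), P, r)
v_true = np.linalg.solve(np.eye(3) - 0.9 * P, r)
print("TD(0) estimate:", theta)
print("exact values:  ", v_true)
```

With a fixed stepsize the estimate keeps fluctuating around the exact values rather than settling on them, which is one way to see why the choice of `alpha` matters for plain TD.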


Adaptive Temporal Difference Learning With Linear Function Approximation

Sun, Shen, Chen, et al. 2022

IEEE Trans. Pattern Anal. Mach. Intell.


Byzantine-Resilient Decentralized Policy Evaluation With Linear Function Approximation

Shen, Chen, et al. 2021

IEEE Trans. Signal Process.


Adaptive decision tree-based phone cluster models for speaker clustering

Hsieh, Wu, Shen 2008


Spatial Environment Adaptability and Planning of Leisure Industry Cluster

Shen 2021


Towards Understanding Asynchronous Advantage Actor-Critic: Convergence and Linear Speedup

Shen, Zhang, Mei, et al. 2023

IEEE Trans. Signal Process.


Byzantine-Resilient Decentralized TD Learning with Linear Function Approximation

Shen, Chen, et al. 2021


This paper considers the policy evaluation problem in reinforcement learning with agents on a decentralized and directed network. The focus is on decentralized temporal-difference (TD) learning with linear function approximation in the presence of unreliable or even malicious agents, termed Byzantine agents. In order to evaluate the quality of a fixed policy in a common environment, agents usually run decentralized TD(λ) collaboratively. However, when some Byzantine agents behave adversarially, decentralized TD(λ) is unable to learn an accurate linear approximation of the true value function. We propose a trimmed-mean-based decentralized TD(λ) algorithm to perform policy evaluation in this setting. We establish the finite-time convergence rate, as well as the asymptotic learning error, which depends on the number of Byzantine agents. Numerical experiments corroborate the robustness of the proposed algorithm.
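The aggregation primitive named in the abstract, the trimmed mean, can be illustrated on its own. The sketch below shows only a generic coordinate-wise trimmed mean, not the paper's decentralized TD(λ) algorithm; the function name `trimmed_mean`, the `num_trim` parameter, and the example vectors are assumptions chosen for illustration.

```python
import numpy as np

def trimmed_mean(values, num_trim):
    """Coordinate-wise trimmed mean.

    values:   (n_agents, d) array, one parameter vector per neighbor.
    num_trim: number of largest and number of smallest entries dropped in
              each coordinate (an assumed bound on Byzantine neighbors).
    """
    values = np.asarray(values, dtype=float)
    n_agents = values.shape[0]
    if 2 * num_trim >= n_agents:
        raise ValueError("need more than 2 * num_trim agents")
    # Sort each coordinate independently across agents.
    ordered = np.sort(values, axis=0)
    # Drop the num_trim smallest and num_trim largest values per coordinate.
    kept = ordered[num_trim:n_agents - num_trim]
    return kept.mean(axis=0)

# Four honest parameter vectors plus one adversarial outlier (made-up data).
honest = np.array([[1.0, 2.0], [1.1, 1.9], [0.9, 2.1], [1.0, 2.0]])
byzantine = np.array([[1e6, -1e6]])
received = np.vstack([honest, byzantine])

plain = received.mean(axis=0)                # dominated by the outlier
robust = trimmed_mean(received, num_trim=1)  # stays near the honest values
print("plain mean:  ", plain)
print("trimmed mean:", robust)
```

Trimming also discards one honest value at each end of every coordinate, so the result is a slightly biased average of the honest vectors; that trade-off is consistent with the abstract's statement that the asymptotic learning error depends on the number of Byzantine agents.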
