Guiliang Liu scite author profile

Deep Reinforcement Learning (DRL) has achieved impressive success in many applications. A key component of many DRL models is a neural network representing a Q function, to estimate the expected cumulative reward following a state-action pair. The Q function neural network contains a lot of implicit knowledge about the RL problems, but often remains unexamined and uninterpreted. To our knowledge, this work develops the first mimic learning framework for Q functions in DRL. We introduce Linear Model U-trees (LMUTs) to approximate neural network predictions. An LMUT is learned using a novel on-line algorithm that is well-suited for an active play setting, where the mimic learner observes an ongoing interaction between the neural net and the environment. Empirical evaluation shows that an LMUT mimics a Q function substantially better than five baseline methods. The transparent tree structure of an LMUT facilitates understanding the network's learned knowledge by analyzing feature influence, extracting rules, and highlighting the super-pixels in image inputs.

show abstract

Deep Reinforcement Learning in Ice Hockey for Context-Aware Player Evaluation

Liu

Schulte

2018

View full text Add to dashboard Cite

A variety of machine learning models have been proposed to assess the performance of players in professional sports. However, they have only a limited ability to model how player performance depends on the game context. This paper proposes a new approach to capturing game context: we apply Deep Reinforcement Learning (DRL) to learn an action-value Q function from 3M play-by-play events in the National Hockey League (NHL). The neural network representation integrates both continuous context signals and game history, using a possession-based LSTM. The learned Q-function is used to value players' actions under different game contexts. To assess a player's overall performance, we introduce a novel Game Impact Metric (GIM) that aggregates the values of the player's actions. Empirical Evaluation shows GIM is consistent throughout a play season, and correlates highly with standard success measures and future salary.

show abstract

Deep soccer analytics: learning an action-value function for evaluating soccer players

Liu

Luo

Kharrat

2020

Data Min Knowl Disc

View full text Add to dashboard Cite

Effect of free carbon on micro-mechanical properties of a chemically vapor deposited SiC coating

Wang

Chen

Wang

et al. 2018

Ceramics International

View full text Add to dashboard Cite

Extracting Knowledge from Web Text with Monte Carlo Tree Search

Liu

Jiakang

et al. 2020

View full text Add to dashboard Cite

The long noncoding RNA MIR122HG is a precursor for miR-122-5p and negatively regulates the TAK1-induced innate immune response in teleost fish

Zheng

Chang

Luo

et al. 2022

Journal of Biological Chemistry

View full text Add to dashboard Cite

Enhanced grain refinement and mechanical properties of a high–strength Al–Zn–Mg–Cu–Zr alloy induced by TiC nano–particles

Zhao

Gao

Yang

et al. 2021

Materials Science and Engineering: A

View full text Add to dashboard Cite

Hollow double-shell structured Void@SiO2@Co-C composite for broadband electromagnetic wave absorption

Liu

Song

et al. 2021

Chemical Engineering Journal

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Guiliang Liu

Toward Interpretable Deep Reinforcement Learning with Linear Model U-Trees

Deep Reinforcement Learning in Ice Hockey for Context-Aware Player Evaluation

Deep soccer analytics: learning an action-value function for evaluating soccer players

Effect of free carbon on micro-mechanical properties of a chemically vapor deposited SiC coating

Extracting Knowledge from Web Text with Monte Carlo Tree Search

The long noncoding RNA MIR122HG is a precursor for miR-122-5p and negatively regulates the TAK1-induced innate immune response in teleost fish

Enhanced grain refinement and mechanical properties of a high–strength Al–Zn–Mg–Cu–Zr alloy induced by TiC nano–particles

Hollow double-shell structured Void@SiO2@Co-C composite for broadband electromagnetic wave absorption

Contact Info

Product

Resources

About