Recommender systems can mitigate the information overload problem by suggesting personalized items to users. In real-world recommendation settings such as e-commerce, a typical interaction between the system and its users is: users are recommended a page of items and provide feedback, and the system then recommends a new page of items. To effectively capture such interactions for recommendation, we need to solve two key problems: (1) how to update the recommendation strategy according to a user's real-time feedback, and (2) how to generate a page of items with a proper display, both of which pose tremendous challenges to traditional recommender systems. In this paper, we study the problem of page-wise recommendation, aiming to address the two aforementioned challenges simultaneously. In particular, we propose a principled approach to jointly generate a set of complementary items and the corresponding strategy to display them on a 2-D page, and propose a novel page-wise recommendation framework based on deep reinforcement learning, DeepPage, which can optimize a page of items with a proper display based on real-time feedback from users. Experimental results based on a real-world e-commerce dataset demonstrate the effectiveness of the proposed framework.
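The page-wise interaction loop described above can be sketched in a few lines. This is a toy illustration only, not DeepPage's actual model: the grid size, the dot-product scoring, and the exponential state update are all assumptions made for the example.

```python
# Toy sketch of the page-wise interaction loop: the system keeps a user state,
# generates a 2-D page of items from it, observes feedback, and updates the state.
import random

PAGE_ROWS, PAGE_COLS = 2, 5  # a 2-D page with 10 item slots (illustrative sizes)

def update_state(state, feedback):
    """Fold page-level feedback into the user state (simple exponential average)."""
    return [0.9 * s + 0.1 * f for s, f in zip(state, feedback)]

def generate_page(state, catalog):
    """Score items against the state and lay the top-k out on a 2-D grid."""
    scored = sorted(catalog,
                    key=lambda item: -sum(a * b for a, b in zip(state, item["emb"])))
    top = scored[:PAGE_ROWS * PAGE_COLS]
    return [top[r * PAGE_COLS:(r + 1) * PAGE_COLS] for r in range(PAGE_ROWS)]

random.seed(0)
catalog = [{"id": i, "emb": [random.random() for _ in range(4)]} for i in range(50)]
state = [0.5] * 4
for _ in range(2):  # two rounds of the recommend-then-observe loop
    page = generate_page(state, catalog)
    feedback = [random.random() for _ in range(4)]  # stand-in for real user feedback
    state = update_state(state, feedback)
```

In the paper's actual framework, both the state update and the page generation would be learned by the deep RL agent rather than hand-coded as here.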
With the recent prevalence of reinforcement learning (RL), there has been tremendous interest in developing RL-based recommender systems. In practical recommendation sessions, users sequentially access multiple scenarios, such as entrance pages and item detail pages, and each scenario has its own recommendation strategy. However, the majority of existing RL-based recommender systems focus on optimizing one strategy for all scenarios or on optimizing each strategy separately, which can lead to sub-optimal overall performance. In this paper, we study the recommendation problem with multiple (consecutive) scenarios, i.e., whole-chain recommendations. We propose a multi-agent reinforcement learning based approach (DeepChain), which can capture the sequential correlation among different scenarios and jointly optimize multiple recommendation strategies. To be specific, all recommender agents share the same memory of users' historical behaviors, and they work collaboratively to maximize the overall reward of a session. Note that jointly optimizing multiple recommendation strategies faces two challenges in existing model-free RL models [10]: (i) it requires huge amounts of user behavior data, and (ii) the distribution of rewards (users' feedback) is extremely unbalanced. In this paper, we introduce model-based reinforcement learning techniques to reduce the training data requirement and perform more accurate strategy updates. Experimental results based on a real e-commerce platform demonstrate the effectiveness of the proposed framework.
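The shared-memory, whole-chain setup can be illustrated with a minimal sketch. This is not DeepChain's architecture; the class names, the event-counting "state", and the per-scenario reward are placeholders standing in for the learned agents the abstract describes.

```python
# Illustrative sketch: one agent per scenario, all reading from a single shared
# memory of user behaviors, with the session reward summed across scenarios so
# the agents are pushed toward a joint (whole-chain) objective.
class SharedMemory:
    """Shared record of user behaviors, visible to every scenario agent."""
    def __init__(self):
        self.history = []

    def append(self, event):
        self.history.append(event)

class ScenarioAgent:
    def __init__(self, name, memory):
        self.name, self.memory = name, memory

    def act(self, user_id):
        # A real agent would compute a policy over items from the shared history;
        # here we just count the user's prior events as a stand-in "state".
        state = sum(1 for e in self.memory.history if e["user"] == user_id)
        return {"scenario": self.name, "state": state}

memory = SharedMemory()
agents = [ScenarioAgent("entrance", memory), ScenarioAgent("item_detail", memory)]
session_reward = 0
for agent in agents:  # the user traverses scenarios in sequence
    action = agent.act(user_id=7)
    memory.append({"user": 7, "scenario": action["scenario"]})
    session_reward += 1  # stand-in for the user's feedback in this scenario
```

The key design point the sketch mirrors is that each agent sees behaviors recorded by the agents before it in the chain, which is what lets the framework capture sequential correlation among scenarios.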
Recommender systems play a crucial role in our daily lives. The feed streaming mechanism has been widely adopted in recommender systems, especially in mobile apps. The feed streaming setting provides users with an interactive manner of recommendation through never-ending feeds. In such a setting, a good recommender system should pay more attention to user stickiness, which goes far beyond classical instant metrics and is typically measured by long-term user engagement. Directly optimizing long-term user engagement is a non-trivial problem, as the learning target is usually not available to conventional supervised learning methods. Although reinforcement learning (RL) naturally fits the problem of maximizing long-term rewards, applying RL to optimize long-term user engagement still faces challenges: user behaviors are versatile and difficult to model, typically consisting of both instant feedback (e.g., clicks) and delayed feedback (e.g., dwell time, revisits); in addition, performing effective off-policy learning is still immature, especially when combining bootstrapping and function approximation. To address these issues, in this work we introduce an RL framework, FeedRec, to optimize long-term user engagement. FeedRec includes two components: 1) a Q-Network, designed with a hierarchical LSTM, which takes charge of modeling complex user behaviors, and 2) an S-Network, which simulates the environment, assists the Q-Network, and avoids the instability of convergence in policy learning. Extensive experiments on synthetic data and a real-world large-scale dataset show that FeedRec effectively optimizes long-term user engagement and outperforms the state of the art.
CCS CONCEPTS: • Information systems → Recommender systems; Personalization; • Theory of computation → Sequential decision making.
* Work performed during an internship at JD.com.
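One concrete way to blend instant and delayed feedback into a single per-step reward, as optimizing long-term engagement requires, can be sketched as follows. The signal names and weights here are assumptions for illustration, not the reward design used by FeedRec.

```python
# Hypothetical sketch: combine instant feedback (clicks) with delayed feedback
# (dwell time, revisits) into one scalar reward for an RL recommender.
def engagement_reward(clicks, dwell_time_s, revisited,
                      w_click=1.0, w_dwell=0.01, w_revisit=2.0):
    """Weighted blend of instant and delayed engagement signals.

    clicks:       number of clicks on the recommended feed page (instant)
    dwell_time_s: seconds the user spent on the page (delayed)
    revisited:    whether the user came back to the app later (delayed)
    """
    return (w_click * clicks
            + w_dwell * dwell_time_s
            + w_revisit * (1.0 if revisited else 0.0))

r = engagement_reward(clicks=3, dwell_time_s=120, revisited=True)
```

A hand-tuned blend like this is exactly what is hard to get right in practice, which motivates learning the behavior model (the hierarchical-LSTM Q-Network) and simulating the environment (the S-Network) instead of relying on fixed weights.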
Doxorubicin is an effective and widely used cancer chemotherapeutic agent, but its application is greatly compromised by its cumulative, dose-dependent side effect of cardiotoxicity. A gold nanoparticle-based drug delivery system has been designed to overcome this limitation. Five novel thiolated doxorubicin analogs were synthesized and their biological activities evaluated. Two of these analogs and PEG stabilizing ligands were then conjugated to gold nanoparticles, and the resulting Au-Dox constructs were evaluated. The results show that release of the native drug can be achieved by the action of reducing agents such as glutathione or under acidic conditions, with reductive conditions giving the cleanest drug release. Gold nanoparticles (Au-Dox) were prepared with different loadings of PEG and doxorubicin, and one formulation was evaluated for mammalian stability and toxicity. Plasma levels of doxorubicin in mice treated with Au-Dox were significantly lower than in mice treated with the same amount of doxorubicin, indicating that the construct is stable under physiological conditions. Treatment of mice with Au-Dox gave no histopathologically observable differences from mice treated with saline, while mice treated with an equivalent dose of doxorubicin showed significant histopathologically observable lesions.