Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery &Amp; Data Mining 2018
DOI: 10.1145/3219819.3219918
|View full text |Cite
|
Sign up to set email alerts
|

Deep Reinforcement Learning for Sponsored Search Real-time Bidding

Abstract: Bidding optimization is one of the most critical problems in online advertising. Sponsored search (SS) auction, due to the randomness of user query behavior and platform nature, usually adopts keyword-level bidding strategies. In contrast, the display advertising (DA), as a relatively simpler scenario for auction, has taken advantage of real-time bidding (RTB) to boost the performance for advertisers. In this paper, we consider the RTB problem in sponsored search auction, named SS-RTB. SS-RTB has a much more c… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
51
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 64 publications
(51 citation statements)
references
References 28 publications
(56 reference statements)
0
51
0
Order By: Relevance
“…One of the key advantages of DRL is that it enables RL to scale to problems with high-dimensional state and action spaces. However, most existing successful DRL applications so far have been on visual domains (e.g., Atari games), and there is still a lot of work to be done for more realistic applications [25,26] with complex dynamics, which are not necessarily vision-based.DRL has been regarded as an important component in constructing general AI systems [27] and has been successfully integrated with other techniques, e.g., search [14], planning [28], and more recently with multiagent systems, with an emerging area of multiagent deep reinforcement learning (MDRL) [29,30]. 1 Learning in multiagent settings is fundamentally more difficult than the single-agent case due to the presence of multiagent pathologies, e.g., the moving target problem (non-stationarity) [2, 5, 10], curse of dimensionality [2,5], multiagent credit assignment [31,32], global exploration [8], and relative overgeneralization [33,34,35].…”
mentioning
confidence: 99%
See 1 more Smart Citation
“…One of the key advantages of DRL is that it enables RL to scale to problems with high-dimensional state and action spaces. However, most existing successful DRL applications so far have been on visual domains (e.g., Atari games), and there is still a lot of work to be done for more realistic applications [25,26] with complex dynamics, which are not necessarily vision-based.DRL has been regarded as an important component in constructing general AI systems [27] and has been successfully integrated with other techniques, e.g., search [14], planning [28], and more recently with multiagent systems, with an emerging area of multiagent deep reinforcement learning (MDRL) [29,30]. 1 Learning in multiagent settings is fundamentally more difficult than the single-agent case due to the presence of multiagent pathologies, e.g., the moving target problem (non-stationarity) [2, 5, 10], curse of dimensionality [2,5], multiagent credit assignment [31,32], global exploration [8], and relative overgeneralization [33,34,35].…”
mentioning
confidence: 99%
“…18 https://github.com/gjp1203/nui_in_madrl 19 https://github.com/gjp1203/nui_in_madrl 20 https://www.pommerman.com/ 21 https://github.com/oxwhirl/smac 22 https://github.com/oxwhirl/pymarl MuJoCo Multiagent Soccer[313] uses the MuJoCo physics engine[202]. The environment simulates a 2 vs. 2 soccer game with agents having a 3-dimensional action space 26. • Neural MMO[314] is a research platform 27 inspired by the human game genre of Massively Multiplayer Online (MMO) Role-Playing Games.…”
mentioning
confidence: 99%
“…Such a scenario is typical in nextgeneration wireless networks such as 5G highly-dense heterogeneous networks. DRL appears to be an efficient approach for solving different types of auctions such as in [174].…”
Section: ) Drl For Channel Estimation In Wireless Systemsmentioning
confidence: 99%
“…Similarly, a bid optimizing strategy called optimized cost per click was proposed in [49]. A reinforcement learning based real-time bidding strategy for sponsored search was proposed in [48]. In the industry of sponsored search, there are also some bidding strategies tools [6,18], such as Enhanced CPC, Target CPA, Maximize Conversions, and so on.…”
Section: Related Workmentioning
confidence: 99%