2019
DOI: 10.1186/s13673-019-0187-4
|View full text |Cite
|
Sign up to set email alerts
|

A trust-aware task allocation method using deep q-learning for uncertain mobile crowdsourcing

Abstract: IntroductionWith the advancing technology of mobile devices with numerous built-in sensors, mobile crowdsourcing has recently emerged as a new collaboration paradigm in numerous intelligent mobile information systems [1]. The existing mobile crowdsourcing has applications in numerous domains including urban planning, traffic monitoring, ride sharing, environmental monitoring and intelligent disaster response [2]. Mobile crowdsourcing is a combination of spatial crowdsourcing and smart phone technology that emp… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
16
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 14 publications
(16 citation statements)
references
References 30 publications
(73 reference statements)
0
16
0
Order By: Relevance
“…Therefore, our proposed method can advance the exploration performance of εgreedy, because multiple mini-batches do not follow a local minimum [9,10]. deep RL, as in a recent study [3][4][5][6][7], we acquire better results. Moreover, we can attempt a different deep neural network and hyper-parameter settings to demonstrate that our proposed model can enhance the exploration without specific domain information.…”
Section: Cartpole-v0mentioning
confidence: 94%
See 2 more Smart Citations
“…Therefore, our proposed method can advance the exploration performance of εgreedy, because multiple mini-batches do not follow a local minimum [9,10]. deep RL, as in a recent study [3][4][5][6][7], we acquire better results. Moreover, we can attempt a different deep neural network and hyper-parameter settings to demonstrate that our proposed model can enhance the exploration without specific domain information.…”
Section: Cartpole-v0mentioning
confidence: 94%
“…A combination of RL and nonlinear function approximates, such as deep neural networks, helps in automating decision making and control issues. However, it can also be challenging in terms of stability and convergence [2] in real-time online learning, such as in the detection of network security through the verification of whether the network is "normal or anomalous" [3][4][5][6][7]. Moreover, its widespread adaptation in the real world, such as with robotic arms, is difficult because of sample complexity and heavy dependence on certain hyper-parameters, such as exploration constants [3][4][5][6][7].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Value-iteration methods are often carried out off-policy, meaning that the policy used to generate behavior for training data can be unrelated to the policy being evaluated and improved, called the estimation policy [11,12]. Popular value-iteration methods used in dynamic task scheduling are Q-Learning [7,9,10,[15][16][17] and Deep Q-Network (DQN) [3,8,[18][19][20]. Apart from these two, Greedy methods [19], Monte Carlo Methods [21] and Temporal Difference (TD) Learning [22,23] also have been used.…”
Section: Value-iteration Methodsmentioning
confidence: 99%
“…The research done by Sun and Tan [18] focuses on the trust-aware task allocation (TTA) optimization problem / dynamic trust-aware task allocation problem of mobile crowdsourcing systems. They have done the experiment in an uncertain crowdsourcing environment and have researched on how to maximize the trust score and minimize the travel distance cost [18]. Here, the TTA optimization problem aims at maximizing trust score and reducing the travel distance cost.…”
Section: Deep Q-network (Dqn) / Deep Q-learning (Dql)mentioning
confidence: 99%