影响因子:10.6
DOI码:10.1109/JIOT.2022.3166110
发表刊物:IEEE Internet of Things Journal
摘要:Mobile-edge computing (MEC) has been regarded as a promising paradigm to reduce service latency for data processing in the Internet of Things (IoT) by provisioning computing resources at the network edges. In this work, we jointly optimize the task partitioning and computational power allocation for computation offloading in a dynamic environment with multiple IoT devices and multiple edge servers. We formulate the problem as a Markov decision process with constrained hybrid action space, which cannot be well handled by existing deep reinforcement learning (DRL) algorithms. Therefore, we develop a novel DRL called Dirichlet deep deterministic policy gradient (D3PG), which is built on deep deterministic policy gradient (DDPG) to solve the problem. The developed model can learn to solve multiobjective optimization, including maximizing the number of tasks processed before deadlines and minimizing the energy cost and service latency. More importantly, D3PG can effectively deal with a constrained distribution-continuous hybrid action spaces, where the distribution variables are for the task partitioning and offloading, while the continuous variables are for computational frequency control. Moreover, the D3PG can address many similar issues in MEC and general reinforcement learning problems. Extensive simulation results show that the proposed D3PG outperforms the state-of-the-art methods.
合写作者:Ning Zhang,Abdul Rahman Sattar,Janahan Skandaraniyam
第一作者:Laha Ale
论文类型:SCI
通讯作者:Scott A. King
卷号:9
期号:19
页面范围:19260 - 19272
是否译文:否
发表时间:2022-10-01
收录刊物:SCI