基于深度强化学习的网络切片资源管理算法

doi:10.11805/TKYDA2022154

首页 > 按期查看>2024年第7期 >792-799. DOI:10.11805/TKYDA2022154

基于深度强化学习的网络切片资源管理算法
DOI:
                        10.11805/TKYDA2022154
                    
作者:
                        
                        
                    
作者单位:1.深圳大学 电子与信息工程学院，广东 深圳 518060;2.深圳清华大学研究院，广东 深圳 518057;3.清华大学 深圳国际;研究生院，广东 深圳 528055;4.中山大学 电子与信息工程学院，广东 广州 510006
作者简介:王菲菲(1997-)，女，在读硕士研究生，主要研究方向为无线通信.email:wffarn@163.com.
郑斯辉(1997-)，男，在读博士研究生，主要研究方向为无线通信、联邦学习.
王兰(1979-)，女，博士，讲师，主要研究方向为移动通信系统、无线资源管理.
陈翔(1980-)，男，博士，教授，主要研究方向为无线与移动通信、卫星通信、物联网、软件无线电.
通讯作者:
基金项目:深圳市基础研究重点资助项目(JCYJ20200109143016563)
伦理声明:

Resource management algorithm for network slicing based on deep reinforcement learning

Author:

Ethical statement:

Affiliation:

1.College of Electronic and Information Engineering，Shenzhen University，Shenzhen Guangdong 518060，China;2.Research Institute of Tsinghua University in Shenzhen，Shenzhen Guangdong 518057，China;3.Shenzhen International Graduate School，Tsinghua University，Shenzhen Guangdong 528055，China;4.School of Electronics and Information Technology，Sun Yat-sen University，Guangzhou Guangdong 510006，China

Funding:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

摘要:

随着第五代通信技术(5G)的发展，各种应用场景不断涌现，而网络切片可以在通用的物理网络上构建多个逻辑独立的虚拟网络来满足移动通信网络多样化的业务需求。为了提高移动通信网络根据各切片业务量实现资源按需分配的能力，本文提出了一种基于深度强化学习的网络切片资源管理算法，该算法使用两个长短期记忆网络对无法实时到达的统计数据进行预测，并提取用户移动性导致的业务数据量动态特征，进而结合优势动作评论算法做出与切片业务需求相匹配的带宽分配决策。实验结果表明，相较于现有方法，该算法可以在保证用户时延和速率要求的同时，将频谱效率提高约7.7%。

Abstract:

With the development of the 5th Generation Mobile Communication Technology(5G), various application scenarios continue to emerge. Network slicing can construct multiple logically independent virtual networks on a common physical network to meet the diverse service requirements of mobile communication networks. In order to enhance the ability of mobile communication networks to allocate resources on demand according to the traffic of each slice, this paper proposes a network slicing resource management algorithm based on deep reinforcement learning. The algorithm uses two Long Short-Term Memory(LSTM) networks to predict statistical data that cannot be reached in real time, and extracts dynamic characteristics of business data volume caused by user mobility, and then makes bandwidth allocation decisions that match the needs of slice services in combination with the Advantage Actor-Critic(A2C) algorithm. Experimental results show that compared with existing methods, this algorithm can improve the spectral efficiency by about 7.7% while ensuring the user's delay and rate requirements.

参考文献

相似文献

引证文献

引用本文

王菲菲,王兰,郑斯辉,陈翔.基于深度强化学习的网络切片资源管理算法[J].太赫兹科学与电子信息学报,2024,22(7):792~799

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:

历史

收稿日期:2022-08-22
最后修改日期:2022-10-10
录用日期:
在线发布日期: 2024-07-24
出版日期:

首页

期刊简介

投稿必读

征订启事

编委会

联系我们

ENGLISH

编委风采

出版道德

太赫兹专委会

引用本文

分享

文章指标

历史