TY - GEN
T1 - An efficient Actor Critic DRL Framework for Resource Allocation in Multi-cell Downlink NOMA
AU - Alajmi, Abdullah
AU - Ahsan, Waleed
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - In this paper, a tractable framework for downlink non-orthogonal multiple access (NOMA) is proposed based on a model-free reinforcement learning (RL) approach for dynamic resource allocation in a multi-cell network structure. With the aid of actor critic deep reinforcement learning (ACDRL), we optimize the active power allocation for multi-cell NOMA systems under an online environment to maximize the long-term sum rate. To exploit the dynamic nature of NOMA, this work utilizes the instantaneous data rate for designing the dynamic reward. The state space in ACDRL contains all possible resource allocation realizations depending on a three-dimensional association among users, base stations, and sub-channels. We propose an ACDRL algorithm with this transformed state space which is scalable to handle different network loads by utilizing multiple deep neural networks. Lastly, the simulation results validate that the proposed solution for multi-cell NOMA outperforms the conventional RL, DRL algorithms, and orthogonal multiple access (OMA) schemes in terms of the evaluated long-term sum rate.
AB - In this paper, a tractable framework for downlink non-orthogonal multiple access (NOMA) is proposed based on a model-free reinforcement learning (RL) approach for dynamic resource allocation in a multi-cell network structure. With the aid of actor critic deep reinforcement learning (ACDRL), we optimize the active power allocation for multi-cell NOMA systems under an online environment to maximize the long-term sum rate. To exploit the dynamic nature of NOMA, this work utilizes the instantaneous data rate for designing the dynamic reward. The state space in ACDRL contains all possible resource allocation realizations depending on a three-dimensional association among users, base stations, and sub-channels. We propose an ACDRL algorithm with this transformed state space which is scalable to handle different network loads by utilizing multiple deep neural networks. Lastly, the simulation results validate that the proposed solution for multi-cell NOMA outperforms the conventional RL, DRL algorithms, and orthogonal multiple access (OMA) schemes in terms of the evaluated long-term sum rate.
KW - Actor critic deep reinforcement learning
KW - non-orthogonal multiple access
KW - resource allocation
UR - http://www.scopus.com/inward/record.url?scp=85134652165&partnerID=8YFLogxK
U2 - 10.1109/EuCNC/6GSummit54941.2022.9815638
DO - 10.1109/EuCNC/6GSummit54941.2022.9815638
M3 - Conference contribution
AN - SCOPUS:85134652165
T3 - 2022 Joint European Conference on Networks and Communications and 6G Summit, EuCNC/6G Summit 2022
SP - 77
EP - 82
BT - 2022 Joint European Conference on Networks and Communications and 6G Summit, EuCNC/6G Summit 2022
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2022 Joint European Conference on Networks and Communications and 6G Summit, EuCNC/6G Summit 2022
Y2 - 7 June 2022 through 10 June 2022
ER -