TY - GEN
T1 - Soft Actor Critic Framework for Resource Allocation in Backscatter-NOMA Networks
AU - Alajmi, Abdullah
AU - Fayaz, Muhammad
AU - Ahsan, Waleed
AU - Nallanathan, Arumugam
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - With the use of power domain non-orthogonal multiple access (NOMA) and backscatter communication (BAC), future sixth-generation ultra massive machine type communications networks are expected to connect large-scale Internet of things (IoT) devices. However, due to NOMA co-channel interference, the power allocation to large-scale IoT devices becomes critical. The existing convex optimization-based solutions are highly complex hence, it is difficult to find the optimal solution to the resource allocation problem in a highly dynamic environment. Therefore, this work develops an efficient model-free BACNOMA system to assist the base station for complex resource scheduling tasks in a dynamic BAC-NOMA IoT network. More specifically, we jointly optimize the transmit power of downlink IoT users and the reflection coefficient of uplink backscatter devices using a reinforcement learning algorithm, namely, softactor critic. Numerical results show that the proposed algorithm obtained a higher reward and converges to an optimal solution with respect to a large number of iterations. The proposed algorithm increases the sum rate by 57.6% as compared to the conventional optimization (benchmark) approach. Moreover, we show that the proposed algorithm outperforms the conventional BAC-NOMA scheme and BAC with orthogonal multiple access in terms of average sum rate with the increasing number of backscatter devices.
AB - With the use of power domain non-orthogonal multiple access (NOMA) and backscatter communication (BAC), future sixth-generation ultra massive machine type communications networks are expected to connect large-scale Internet of things (IoT) devices. However, due to NOMA co-channel interference, the power allocation to large-scale IoT devices becomes critical. The existing convex optimization-based solutions are highly complex hence, it is difficult to find the optimal solution to the resource allocation problem in a highly dynamic environment. Therefore, this work develops an efficient model-free BACNOMA system to assist the base station for complex resource scheduling tasks in a dynamic BAC-NOMA IoT network. More specifically, we jointly optimize the transmit power of downlink IoT users and the reflection coefficient of uplink backscatter devices using a reinforcement learning algorithm, namely, softactor critic. Numerical results show that the proposed algorithm obtained a higher reward and converges to an optimal solution with respect to a large number of iterations. The proposed algorithm increases the sum rate by 57.6% as compared to the conventional optimization (benchmark) approach. Moreover, we show that the proposed algorithm outperforms the conventional BAC-NOMA scheme and BAC with orthogonal multiple access in terms of average sum rate with the increasing number of backscatter devices.
KW - Backscatter communications
KW - non-orthogonal multiple access
KW - reinforcement learning
KW - resource allocation
UR - http://www.scopus.com/inward/record.url?scp=85146699409&partnerID=8YFLogxK
U2 - 10.1109/LATINCOM56090.2022.10000455
DO - 10.1109/LATINCOM56090.2022.10000455
M3 - Conference contribution
AN - SCOPUS:85146699409
T3 - 2022 IEEE Latin-American Conference on Communications, LATINCOM 2022
BT - 2022 IEEE Latin-American Conference on Communications, LATINCOM 2022
A2 - Moraes, Igor M.
A2 - Campista, Miguel Elias M.
A2 - Ghamri-Doudane, Yacine
A2 - Luis Henrique M. K. Costa, Costa
A2 - Rubinstein, Marcelo G.
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 14th IEEE Latin-American Conference on Communications, LATINCOM 2022
Y2 - 30 November 2022 through 2 December 2022
ER -