TY - JOUR
T1 - Millimeter wave beamforming training
T2 - A reinforcement learning approach
AU - Mohamed, Ehab Mahmoud
N1 - Publisher Copyright:
© The Author(s).
PY - 2021
Y1 - 2021
N2 - Beamforming training (BT) is considered as an essential process to accomplish the communications in the millimeter wave (mmWave) band, i.e., 30 ~ 300 GHz. This process aims to find out the best transmit/receive antenna beams to compensate the impairments of the mmWave channel and successfully establish the mmWave link. Typically, the mmWave BT process is highly-time consuming affecting the overall throughput and energy consumption of the mmWave link establishment. In this paper, a machine learning (ML) approach, specifically reinforcement learning (RL), is utilized for enabling the mmWave BT process by modeling it as a multi-armed bandit (MAB) problem with the aim of maximizing the long-term throughput of the constructed mmWave link. Based on this formulation, MAB algorithms such as upper confidence bound (UCB), Thompson sampling (TS), epsilon-greedy (e-greedy), are utilized to address the problem and accomplish the mmWave BT process. Numerical simulations confirm the superior performance of the proposed MAB approach over the existing mmWave BT techniques.
AB - Beamforming training (BT) is considered as an essential process to accomplish the communications in the millimeter wave (mmWave) band, i.e., 30 ~ 300 GHz. This process aims to find out the best transmit/receive antenna beams to compensate the impairments of the mmWave channel and successfully establish the mmWave link. Typically, the mmWave BT process is highly-time consuming affecting the overall throughput and energy consumption of the mmWave link establishment. In this paper, a machine learning (ML) approach, specifically reinforcement learning (RL), is utilized for enabling the mmWave BT process by modeling it as a multi-armed bandit (MAB) problem with the aim of maximizing the long-term throughput of the constructed mmWave link. Based on this formulation, MAB algorithms such as upper confidence bound (UCB), Thompson sampling (TS), epsilon-greedy (e-greedy), are utilized to address the problem and accomplish the mmWave BT process. Numerical simulations confirm the superior performance of the proposed MAB approach over the existing mmWave BT techniques.
KW - Beamforming training
KW - Millimeter wave
KW - Multi-armed bandit
KW - Reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=85102644304&partnerID=8YFLogxK
U2 - 10.24425/ijet.2021.135949
DO - 10.24425/ijet.2021.135949
M3 - Article
AN - SCOPUS:85102644304
SN - 2081-8491
VL - 67
SP - 95
EP - 102
JO - International Journal of Electronics and Telecommunications
JF - International Journal of Electronics and Telecommunications
IS - 1
ER -