Millimeter wave beamforming training: A reinforcement learning approach

Ehab Mahmoud Mohamed

doi:10.24425/ijet.2021.135949

Millimeter wave beamforming training: A reinforcement learning approach

Ehab Mahmoud Mohamed

Electrical Engineering

Research output: Contribution to journal › Article › peer-review

3 Scopus citations

Abstract

Beamforming training (BT) is considered as an essential process to accomplish the communications in the millimeter wave (mmWave) band, i.e., 30 ~ 300 GHz. This process aims to find out the best transmit/receive antenna beams to compensate the impairments of the mmWave channel and successfully establish the mmWave link. Typically, the mmWave BT process is highly-time consuming affecting the overall throughput and energy consumption of the mmWave link establishment. In this paper, a machine learning (ML) approach, specifically reinforcement learning (RL), is utilized for enabling the mmWave BT process by modeling it as a multi-armed bandit (MAB) problem with the aim of maximizing the long-term throughput of the constructed mmWave link. Based on this formulation, MAB algorithms such as upper confidence bound (UCB), Thompson sampling (TS), epsilon-greedy (e-greedy), are utilized to address the problem and accomplish the mmWave BT process. Numerical simulations confirm the superior performance of the proposed MAB approach over the existing mmWave BT techniques.

Original language	English
Pages (from-to)	95-102
Number of pages	8
Journal	International Journal of Electronics and Telecommunications
Volume	67
Issue number	1
DOIs	https://doi.org/10.24425/ijet.2021.135949
State	Published - 2021

Keywords

Beamforming training
Millimeter wave
Multi-armed bandit
Reinforcement learning

Access to Document

10.24425/ijet.2021.135949

Cite this

@article{8387515475824ae4a9ec705a9b0e4fdb,

title = "Millimeter wave beamforming training: A reinforcement learning approach",

abstract = "Beamforming training (BT) is considered as an essential process to accomplish the communications in the millimeter wave (mmWave) band, i.e., 30 \textasciitilde{} 300 GHz. This process aims to find out the best transmit/receive antenna beams to compensate the impairments of the mmWave channel and successfully establish the mmWave link. Typically, the mmWave BT process is highly-time consuming affecting the overall throughput and energy consumption of the mmWave link establishment. In this paper, a machine learning (ML) approach, specifically reinforcement learning (RL), is utilized for enabling the mmWave BT process by modeling it as a multi-armed bandit (MAB) problem with the aim of maximizing the long-term throughput of the constructed mmWave link. Based on this formulation, MAB algorithms such as upper confidence bound (UCB), Thompson sampling (TS), epsilon-greedy (e-greedy), are utilized to address the problem and accomplish the mmWave BT process. Numerical simulations confirm the superior performance of the proposed MAB approach over the existing mmWave BT techniques.",

keywords = "Beamforming training, Millimeter wave, Multi-armed bandit, Reinforcement learning",

author = "Mohamed, \{Ehab Mahmoud\}",

note = "Publisher Copyright: {\textcopyright} The Author(s).",

year = "2021",

doi = "10.24425/ijet.2021.135949",

language = "English",

volume = "67",

pages = "95--102",

journal = "International Journal of Electronics and Telecommunications",

issn = "2081-8491",

publisher = "Polska Akademia Nauk",

number = "1",

}

TY - JOUR

T1 - Millimeter wave beamforming training

T2 - A reinforcement learning approach

AU - Mohamed, Ehab Mahmoud

N1 - Publisher Copyright: © The Author(s).

PY - 2021

Y1 - 2021

N2 - Beamforming training (BT) is considered as an essential process to accomplish the communications in the millimeter wave (mmWave) band, i.e., 30 ~ 300 GHz. This process aims to find out the best transmit/receive antenna beams to compensate the impairments of the mmWave channel and successfully establish the mmWave link. Typically, the mmWave BT process is highly-time consuming affecting the overall throughput and energy consumption of the mmWave link establishment. In this paper, a machine learning (ML) approach, specifically reinforcement learning (RL), is utilized for enabling the mmWave BT process by modeling it as a multi-armed bandit (MAB) problem with the aim of maximizing the long-term throughput of the constructed mmWave link. Based on this formulation, MAB algorithms such as upper confidence bound (UCB), Thompson sampling (TS), epsilon-greedy (e-greedy), are utilized to address the problem and accomplish the mmWave BT process. Numerical simulations confirm the superior performance of the proposed MAB approach over the existing mmWave BT techniques.

AB - Beamforming training (BT) is considered as an essential process to accomplish the communications in the millimeter wave (mmWave) band, i.e., 30 ~ 300 GHz. This process aims to find out the best transmit/receive antenna beams to compensate the impairments of the mmWave channel and successfully establish the mmWave link. Typically, the mmWave BT process is highly-time consuming affecting the overall throughput and energy consumption of the mmWave link establishment. In this paper, a machine learning (ML) approach, specifically reinforcement learning (RL), is utilized for enabling the mmWave BT process by modeling it as a multi-armed bandit (MAB) problem with the aim of maximizing the long-term throughput of the constructed mmWave link. Based on this formulation, MAB algorithms such as upper confidence bound (UCB), Thompson sampling (TS), epsilon-greedy (e-greedy), are utilized to address the problem and accomplish the mmWave BT process. Numerical simulations confirm the superior performance of the proposed MAB approach over the existing mmWave BT techniques.

KW - Beamforming training

KW - Millimeter wave

KW - Multi-armed bandit

KW - Reinforcement learning

UR - http://www.scopus.com/inward/record.url?scp=85102644304&partnerID=8YFLogxK

U2 - 10.24425/ijet.2021.135949

DO - 10.24425/ijet.2021.135949

M3 - Article

AN - SCOPUS:85102644304

SN - 2081-8491

VL - 67

SP - 95

EP - 102

JO - International Journal of Electronics and Telecommunications

JF - International Journal of Electronics and Telecommunications

IS - 1

ER -

Millimeter wave beamforming training: A reinforcement learning approach

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this