Abstract
Beamforming training (BT) is considered as an essential process to accomplish the communications in the millimeter wave (mmWave) band, i.e., 30 ~ 300 GHz. This process aims to find out the best transmit/receive antenna beams to compensate the impairments of the mmWave channel and successfully establish the mmWave link. Typically, the mmWave BT process is highly-time consuming affecting the overall throughput and energy consumption of the mmWave link establishment. In this paper, a machine learning (ML) approach, specifically reinforcement learning (RL), is utilized for enabling the mmWave BT process by modeling it as a multi-armed bandit (MAB) problem with the aim of maximizing the long-term throughput of the constructed mmWave link. Based on this formulation, MAB algorithms such as upper confidence bound (UCB), Thompson sampling (TS), epsilon-greedy (e-greedy), are utilized to address the problem and accomplish the mmWave BT process. Numerical simulations confirm the superior performance of the proposed MAB approach over the existing mmWave BT techniques.
| Original language | English |
|---|---|
| Pages (from-to) | 95-102 |
| Number of pages | 8 |
| Journal | International Journal of Electronics and Telecommunications |
| Volume | 67 |
| Issue number | 1 |
| DOIs | |
| State | Published - 2021 |
Keywords
- Beamforming training
- Millimeter wave
- Multi-armed bandit
- Reinforcement learning
Fingerprint
Dive into the research topics of 'Millimeter wave beamforming training: A reinforcement learning approach'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver