TY - JOUR
T1 - Contextual Neural Bandit Approach for UAV Selection in Delay-Doppler-Based Vehicular Networks
AU - Mohamed, Ehab Mahmoud
AU - Alnakhli, Mohammad Ahmed
AU - Hussein, Hany S.
N1 - Publisher Copyright:
© 2012 IEEE.
PY - 2025
Y1 - 2025
N2 - This letter investigates the problem of optimal uncrewed aerial vehicle (UAV) selection in UAV-assisted vehicular communications, aiming to maximize ground vehicle (GV) data transmission by jointly optimizing its achievable data rate and coverage duration. However, this high-mobility environment introduces complex and nonlinear influences on the transmitted data coming from the highly dynamic factors, e.g., relative UAV-GV velocity, Doppler shifts, UAVs’ altitude, residual UAVs’ energy, etc. To address this, a contextual neural upper confidence bound (NeuralUCB) bandit algorithm is proposed to learn the intricate nonlinear relationship between these dynamic parameters and the achievable transmitted data. The framework incorporates delay-Doppler domain to estimate UAVs’ context features related to Doppler shifts and line-of-sight channel gains. This enables real-time selection of the optimal UAV based solely on the current contextual information and prior transmission history, facilitated by deep online learning. Simulation results confirm that the proposed approach significantly outperforms candidate benchmarks in terms of transmission efficiency.
AB - This letter investigates the problem of optimal uncrewed aerial vehicle (UAV) selection in UAV-assisted vehicular communications, aiming to maximize ground vehicle (GV) data transmission by jointly optimizing its achievable data rate and coverage duration. However, this high-mobility environment introduces complex and nonlinear influences on the transmitted data coming from the highly dynamic factors, e.g., relative UAV-GV velocity, Doppler shifts, UAVs’ altitude, residual UAVs’ energy, etc. To address this, a contextual neural upper confidence bound (NeuralUCB) bandit algorithm is proposed to learn the intricate nonlinear relationship between these dynamic parameters and the achievable transmitted data. The framework incorporates delay-Doppler domain to estimate UAVs’ context features related to Doppler shifts and line-of-sight channel gains. This enables real-time selection of the optimal UAV based solely on the current contextual information and prior transmission history, facilitated by deep online learning. Simulation results confirm that the proposed approach significantly outperforms candidate benchmarks in terms of transmission efficiency.
KW - Contextual neural UCB
KW - UAV
KW - delay-doppler
KW - vehicular communication
UR - https://www.scopus.com/pages/publications/105013781604
U2 - 10.1109/LWC.2025.3600752
DO - 10.1109/LWC.2025.3600752
M3 - Article
AN - SCOPUS:105013781604
SN - 2162-2337
VL - 14
SP - 3690
EP - 3694
JO - IEEE Wireless Communications Letters
JF - IEEE Wireless Communications Letters
IS - 11
ER -