Abstract
This work introduces a new Reinforcement Learning (RL) framework for optimizing Free-Space Optical (FSO) communication systems in drone-assisted smart city networks by combining Deep Q-Network (DQN) and Advantage Actor-Critic (A2C) techniques. Using real-time environmental feedback, the proposed hybrid RL-DQN-A2C model dynamically adjusts three key transmission parameters: beam divergence, transmit power, and modulation order. This enables robust adaptation to highly dynamic and unpredictable atmospheric conditions without the need for pre-collected datasets or preset statistical models. To ensure accurate modeling, the FSO channel accounts for both atmospheric turbulence and pointing errors, modeled by Gamma-Gamma and Rayleigh distributions, respectively. The efficacy of the proposed approach is compared against four traditional adaptation schemes: Optimal Rate Adaptation (ORA), Channel Inversion with Fixed Rate (CIFR), Truncated Inversion with Fixed Rate (TIFR), and Optimal Power and Rate Adaptation (OPRA). Performance is evaluated both with and without the RL-based optimization technique integrated. To address connectivity issues in urban settings, the framework also investigates deploying Autonomous Aerial Vehicles (AAVs) as agile, intelligent substitutes for conventional mobile base stations. The study thoroughly examines how system and environmental factors, including beam divergence, AAV position and orientation variations, link range, and coverage distance, affect system performance. The IM/DD nonnegativity constraint is analytically integrated and evaluated to verify physical correctness, showing less than 1.5 % deviation in ASE at high SNR, while the classical Shannon capacity expression is used for capacity normalization to provide a consistent and computationally efficient benchmark.
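As a minimal sketch of the composite channel model described above, the code below draws Monte-Carlo samples of a Gamma-Gamma turbulence fade multiplied by a pointing-error loss driven by a Rayleigh-distributed radial displacement. All parameter values (`alpha`, `beta`, jitter `sigma_s`, equivalent beamwidth `w_zeq`, peak fraction `a0`) are illustrative assumptions, not values taken from the paper.

```python
import math
import random

def gamma_gamma_sample(alpha, beta, rng):
    """One turbulence fade: product of two unit-mean Gamma variates
    (the standard construction of the Gamma-Gamma distribution)."""
    x = rng.gammavariate(alpha, 1.0 / alpha)
    y = rng.gammavariate(beta, 1.0 / beta)
    return x * y

def pointing_error_sample(sigma_s, w_zeq, a0, rng):
    """Pointing-error attenuation: Rayleigh radial displacement r,
    Gaussian-beam collection loss a0 * exp(-2 r^2 / w_zeq^2)."""
    u = 1.0 - rng.random()  # uniform in (0, 1], avoids log(0)
    r = sigma_s * math.sqrt(-2.0 * math.log(u))
    return a0 * math.exp(-2.0 * r * r / (w_zeq * w_zeq))

def channel_gains(n, alpha=4.2, beta=1.4, sigma_s=0.3, w_zeq=2.5, a0=0.8, seed=1):
    """n samples of the combined gain h = h_turbulence * h_pointing."""
    rng = random.Random(seed)
    return [gamma_gamma_sample(alpha, beta, rng) *
            pointing_error_sample(sigma_s, w_zeq, a0, rng)
            for _ in range(n)]
```

With a unit-mean turbulence term, the sample mean of the combined gain sits near `a0` times the average pointing loss, which gives a quick sanity check on the simulation.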
With an average inference latency of less than 8 ms and a power consumption of 12.6 W, the proposed model is also evaluated for real-time feasibility using software-in-the-loop (SIL) simulations, demonstrating its suitability for embedded UAV deployment. The RL-DQN-A2C framework is further benchmarked against several advanced RL models, including Deep Meta-RL, Proximal Policy Optimization (PPO), Soft Actor-Critic (SAC), and Rainbow DQN. Experimental findings verify that RL-DQN-A2C performs better in terms of achievable spectral efficiency (ASE), convergence stability, and adaptability to dynamic FSO channel impairments. Its notable achievement of the greatest ASE of 128 bits/s/Hz, together with faster convergence and improved robustness, highlights the efficiency of RL-DQN-A2C in optimizing adaptive transmission for drone-assisted FSO communication in urban smart city contexts. According to simulation data, the RL-DQN-A2C framework considerably improves the ASE over all benchmark schemes. In particular, the OPRA method outperformed its baseline by 34.74 %, achieving the greatest ASE of 129 bits/s/Hz. Significant gains were also observed for TIFR (31.11 %), CIFR (16 %, from 50 to 59 bits/s/Hz), and ORA (32.76 %). A one-way repeated-measures ANOVA confirms that these improvements are statistically significant (p < 0.01), and post hoc tests show that OPRA is superior (p < 0.001). These findings highlight how the proposed RL-based method can significantly increase the spectral efficiency and robustness of drone-assisted FSO communication systems in smart city settings.
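For the Shannon-capacity normalization the abstract uses as a benchmark, ASE can be estimated as a Monte-Carlo average of log2(1 + SNR) over channel fades. The sketch below is a hedged illustration: the h² electrical-SNR scaling reflects the usual IM/DD assumption, and the reference SNR and stand-in fade model are placeholders, not the paper's settings.

```python
import math
import random

def ase_estimate(gains, snr0_db=20.0):
    """Average spectral efficiency E[log2(1 + snr0 * h^2)] over gain samples.
    The h^2 term reflects IM/DD detection, where electrical SNR scales with
    the square of the optical channel gain (an assumption, not the paper's
    exact normalization)."""
    snr0 = 10.0 ** (snr0_db / 10.0)
    return sum(math.log2(1.0 + snr0 * h * h) for h in gains) / len(gains)

# Illustrative fades: unit-mean Gamma variates as a simple stand-in for the
# full Gamma-Gamma-plus-pointing-error channel of the paper.
rng = random.Random(0)
fades = [rng.gammavariate(2.0, 0.5) for _ in range(5000)]
print(round(ase_estimate(fades), 2))
```

For a constant gain h = 1 at 20 dB reference SNR this reduces to log2(101) ≈ 6.66 bits/s/Hz, which the function reproduces exactly.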
| Original language | English |
|---|---|
| Article number | 111973 |
| Journal | Computer Networks |
| Volume | 276 |
| DOIs | |
| State | Published - Feb 2026 |
Keywords
- Average spectral efficiency (ASE)
- Channel inversion with fixed rate (CIFR)
- Deep reinforcement learning
- Free-space optical (FSO)
- Optimal power and rate adaptation (OPRA)
Title: Statistical analysis of adaptive rate and power control in urban FSO drone networks using RL-DQN-A2C