Deep clustering of reinforcement learning based on the bang-bang principle to optimize the energy in multi-boiler for intelligent buildings

Raad Z. Homod; Basil Sh Munahi; Hayder Ibrahim Mohammed; Musatafa Abbas Abbood Albadr; AISSA Abderrahmane; Jasim M. Mahdi; Mohamed Bechir Ben Hamida; Bilal Naji Alhasnawi; A. S. Albahri; Hussein Togun; Umar F. Alqsair; Zaher Mundher Yaseen

doi:10.1016/j.apenergy.2023.122357

Deep clustering of reinforcement learning based on the bang-bang principle to optimize the energy in multi-boiler for intelligent buildings

Raad Z. Homod, Basil Sh Munahi, Hayder Ibrahim Mohammed, Musatafa Abbas Abbood Albadr, AISSA Abderrahmane, Jasim M. Mahdi, Mohamed Bechir Ben Hamida, Bilal Naji Alhasnawi, A. S. Albahri, Hussein Togun, Umar F. Alqsair, Zaher Mundher Yaseen

Mechanical Engineering

Research output: Contribution to journal › Article › peer-review

12 Scopus citations

Abstract

The bang-bang relays of the multiple-boiler system (MBS) control, are characterized by complex limiter saturation functions and classified as fixed parameters. Their action signals cannot precisely control the nonlinear dynamic building heating demand over their entire range of operation. Moreover, in a mono-boiler system, the bang-bang controller endures increasing short cycling over partial load time due to the heating system being considered to have an oversized boiler at most times of running, thus promoting high energy consumption and fluctuating indoor thermal comfort. So, it is difficult to cope with uncertainties in outdoor environments and indoor heating load. Hence, this study formulates the MBS control problem as a dynamic Markov decision process and applies a deep clustering of reinforcement learning approach to obtain the optimal control policy through interaction with the environment based on multi-agent learning according to bang-bang action. With such an approach, adopting a new boiler sequencing control (BSC) strategy using deep clustering of reinforcement learning based on a bang-bang (DCRLBB) manner. The deep clustering is configured to break Lagrangian trajectory curves into piecewise segments to represent the RL agent's action policy. The agent's action policy signals are configured from the bang-bang reward formula based on trade-off implications to be more adjustable than traditional fixed parameters such as fuzzy bang-bang controller (FBBC). The agent of BSC significantly affects the energy performance of the MBS, whereas the other agent resizes boiler capacity by acting to adjust the boiler solenoid fuel valve. The comparison of results between the proposed strategy and conventional FBBC shows distinct differences in the superior response of DCRLBB under dynamic indoor/outdoor actual conditions and energy saving by more than 32% while maintaining the indoor thermal in the comfortable range.

Original language	English
Article number	122357
Journal	Applied Energy
Volume	356
DOIs	https://doi.org/10.1016/j.apenergy.2023.122357
State	Published - 15 Feb 2024

Keywords

Control boiler systems
Deep clustering
Energy management
Lagrangian interpolation formula
Reinforcement learning agents
Smart buildings

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

Access to Document

10.1016/j.apenergy.2023.122357

Cite this

Homod, R. Z., Munahi, B. S., Mohammed, H. I., Albadr, M. A. A., Abderrahmane, AISSA., Mahdi, J. M., Ben Hamida, M. B., Alhasnawi, B. N., Albahri, A. S., Togun, H., Alqsair, U. F., & Yaseen, Z. M. (2024). Deep clustering of reinforcement learning based on the bang-bang principle to optimize the energy in multi-boiler for intelligent buildings. Applied Energy, 356, Article 122357. https://doi.org/10.1016/j.apenergy.2023.122357

@article{9cfed05317694493844b4587ad5663b8,

title = "Deep clustering of reinforcement learning based on the bang-bang principle to optimize the energy in multi-boiler for intelligent buildings",

abstract = "The bang-bang relays of the multiple-boiler system (MBS) control, are characterized by complex limiter saturation functions and classified as fixed parameters. Their action signals cannot precisely control the nonlinear dynamic building heating demand over their entire range of operation. Moreover, in a mono-boiler system, the bang-bang controller endures increasing short cycling over partial load time due to the heating system being considered to have an oversized boiler at most times of running, thus promoting high energy consumption and fluctuating indoor thermal comfort. So, it is difficult to cope with uncertainties in outdoor environments and indoor heating load. Hence, this study formulates the MBS control problem as a dynamic Markov decision process and applies a deep clustering of reinforcement learning approach to obtain the optimal control policy through interaction with the environment based on multi-agent learning according to bang-bang action. With such an approach, adopting a new boiler sequencing control (BSC) strategy using deep clustering of reinforcement learning based on a bang-bang (DCRLBB) manner. The deep clustering is configured to break Lagrangian trajectory curves into piecewise segments to represent the RL agent's action policy. The agent's action policy signals are configured from the bang-bang reward formula based on trade-off implications to be more adjustable than traditional fixed parameters such as fuzzy bang-bang controller (FBBC). The agent of BSC significantly affects the energy performance of the MBS, whereas the other agent resizes boiler capacity by acting to adjust the boiler solenoid fuel valve. The comparison of results between the proposed strategy and conventional FBBC shows distinct differences in the superior response of DCRLBB under dynamic indoor/outdoor actual conditions and energy saving by more than 32\% while maintaining the indoor thermal in the comfortable range.",

keywords = "Control boiler systems, Deep clustering, Energy management, Lagrangian interpolation formula, Reinforcement learning agents, Smart buildings",

author = "Homod, \{Raad Z.\} and Munahi, \{Basil Sh\} and Mohammed, \{Hayder Ibrahim\} and Albadr, \{Musatafa Abbas Abbood\} and AISSA Abderrahmane and Mahdi, \{Jasim M.\} and \{Ben Hamida\}, \{Mohamed Bechir\} and Alhasnawi, \{Bilal Naji\} and Albahri, \{A. S.\} and Hussein Togun and Alqsair, \{Umar F.\} and Yaseen, \{Zaher Mundher\}",

note = "Publisher Copyright: {\textcopyright} 2023 Elsevier Ltd",

year = "2024",

month = feb,

day = "15",

doi = "10.1016/j.apenergy.2023.122357",

language = "English",

volume = "356",

journal = "Applied Energy",

issn = "0306-2619",

publisher = "Elsevier B.V.",

}

Homod, RZ, Munahi, BS, Mohammed, HI, Albadr, MAA, Abderrahmane, AISSA, Mahdi, JM, Ben Hamida, MB, Alhasnawi, BN, Albahri, AS, Togun, H, Alqsair, UF & Yaseen, ZM 2024, 'Deep clustering of reinforcement learning based on the bang-bang principle to optimize the energy in multi-boiler for intelligent buildings', Applied Energy, vol. 356, 122357. https://doi.org/10.1016/j.apenergy.2023.122357

TY - JOUR

T1 - Deep clustering of reinforcement learning based on the bang-bang principle to optimize the energy in multi-boiler for intelligent buildings

AU - Homod, Raad Z.

AU - Munahi, Basil Sh

AU - Mohammed, Hayder Ibrahim

AU - Albadr, Musatafa Abbas Abbood

AU - Abderrahmane, AISSA

AU - Mahdi, Jasim M.

AU - Ben Hamida, Mohamed Bechir

AU - Alhasnawi, Bilal Naji

AU - Albahri, A. S.

AU - Togun, Hussein

AU - Alqsair, Umar F.

AU - Yaseen, Zaher Mundher

PY - 2024/2/15

Y1 - 2024/2/15

N2 - The bang-bang relays of the multiple-boiler system (MBS) control, are characterized by complex limiter saturation functions and classified as fixed parameters. Their action signals cannot precisely control the nonlinear dynamic building heating demand over their entire range of operation. Moreover, in a mono-boiler system, the bang-bang controller endures increasing short cycling over partial load time due to the heating system being considered to have an oversized boiler at most times of running, thus promoting high energy consumption and fluctuating indoor thermal comfort. So, it is difficult to cope with uncertainties in outdoor environments and indoor heating load. Hence, this study formulates the MBS control problem as a dynamic Markov decision process and applies a deep clustering of reinforcement learning approach to obtain the optimal control policy through interaction with the environment based on multi-agent learning according to bang-bang action. With such an approach, adopting a new boiler sequencing control (BSC) strategy using deep clustering of reinforcement learning based on a bang-bang (DCRLBB) manner. The deep clustering is configured to break Lagrangian trajectory curves into piecewise segments to represent the RL agent's action policy. The agent's action policy signals are configured from the bang-bang reward formula based on trade-off implications to be more adjustable than traditional fixed parameters such as fuzzy bang-bang controller (FBBC). The agent of BSC significantly affects the energy performance of the MBS, whereas the other agent resizes boiler capacity by acting to adjust the boiler solenoid fuel valve. The comparison of results between the proposed strategy and conventional FBBC shows distinct differences in the superior response of DCRLBB under dynamic indoor/outdoor actual conditions and energy saving by more than 32% while maintaining the indoor thermal in the comfortable range.

AB - The bang-bang relays of the multiple-boiler system (MBS) control, are characterized by complex limiter saturation functions and classified as fixed parameters. Their action signals cannot precisely control the nonlinear dynamic building heating demand over their entire range of operation. Moreover, in a mono-boiler system, the bang-bang controller endures increasing short cycling over partial load time due to the heating system being considered to have an oversized boiler at most times of running, thus promoting high energy consumption and fluctuating indoor thermal comfort. So, it is difficult to cope with uncertainties in outdoor environments and indoor heating load. Hence, this study formulates the MBS control problem as a dynamic Markov decision process and applies a deep clustering of reinforcement learning approach to obtain the optimal control policy through interaction with the environment based on multi-agent learning according to bang-bang action. With such an approach, adopting a new boiler sequencing control (BSC) strategy using deep clustering of reinforcement learning based on a bang-bang (DCRLBB) manner. The deep clustering is configured to break Lagrangian trajectory curves into piecewise segments to represent the RL agent's action policy. The agent's action policy signals are configured from the bang-bang reward formula based on trade-off implications to be more adjustable than traditional fixed parameters such as fuzzy bang-bang controller (FBBC). The agent of BSC significantly affects the energy performance of the MBS, whereas the other agent resizes boiler capacity by acting to adjust the boiler solenoid fuel valve. The comparison of results between the proposed strategy and conventional FBBC shows distinct differences in the superior response of DCRLBB under dynamic indoor/outdoor actual conditions and energy saving by more than 32% while maintaining the indoor thermal in the comfortable range.

KW - Control boiler systems

KW - Deep clustering

KW - Energy management

KW - Lagrangian interpolation formula

KW - Reinforcement learning agents

KW - Smart buildings

UR - http://www.scopus.com/inward/record.url?scp=85178336543&partnerID=8YFLogxK

U2 - 10.1016/j.apenergy.2023.122357

DO - 10.1016/j.apenergy.2023.122357

M3 - Article

AN - SCOPUS:85178336543

SN - 0306-2619

VL - 356

JO - Applied Energy

JF - Applied Energy

M1 - 122357

ER -

Deep clustering of reinforcement learning based on the bang-bang principle to optimize the energy in multi-boiler for intelligent buildings

Abstract

Keywords

UN SDGs

Access to Document

Other files and links

Fingerprint

Cite this