TY - JOUR
T1 - Edge AI-Based Dynamic Treatment Strategy Generation Model for Consumer Healthcare Technology
AU - Byeon, Haewon
AU - Alsaadi, Mahmood
AU - Quraishi, Aadam
AU - Shabaz, Mohammad
AU - Ahanger, Tariq Ahamed
AU - Keshta, Ismail
AU - Saidova, Feruza
AU - Soni, Mukesh
AU - Karumuri, Srinivasa Rao
N1 - Publisher Copyright:
© 1975-2011 IEEE.
PY - 2025
Y1 - 2025
N2 - Edge AI-based reinforcement learning is less reliant on mathematical models and relies on experience to help with the design and optimization of Consumer Technology models, making it ideal for learning dynamic treatment techniques. However, the current research still has the following problems: 1. The optimality of the learned strategy is considered without taking risks into account, leading to strategies with certain risks. 2. The issue of distribution shift is ignored, resulting in learned strategies that are completely different from those of doctors. 3. The patient’s historical observational data and treatments are ignored, making it difficult to accurately assess the patient’s state, which in turn leads to the inability to learn the optimal strategy. Based on this, we propose the DOSAC-DTR dynamic treatment strategy model, which integrates Dead-ends and offline monitoring into the Actor-Critic framework. First, we consider the risk of treatment actions recommended by the learned strategy and incorporate the concept of Dead-ends into the Actor-Critic framework. Second, to mitigate the distribution shift problem, we integrate physician monitoring into the Actor-Critic framework. This approach achieves better performance, resulting in lower estimated mortality rates and higher Jaccard index scores.
AB - Edge AI-based reinforcement learning is less reliant on mathematical models and relies on experience to help with the design and optimization of Consumer Technology models, making it ideal for learning dynamic treatment techniques. However, the current research still has the following problems: 1. The optimality of the learned strategy is considered without taking risks into account, leading to strategies with certain risks. 2. The issue of distribution shift is ignored, resulting in learned strategies that are completely different from those of doctors. 3. The patient’s historical observational data and treatments are ignored, making it difficult to accurately assess the patient’s state, which in turn leads to the inability to learn the optimal strategy. Based on this, we propose the DOSAC-DTR dynamic treatment strategy model, which integrates Dead-ends and offline monitoring into the Actor-Critic framework. First, we consider the risk of treatment actions recommended by the learned strategy and incorporate the concept of Dead-ends into the Actor-Critic framework. Second, to mitigate the distribution shift problem, we integrate physician monitoring into the Actor-Critic framework. This approach achieves better performance, resulting in lower estimated mortality rates and higher Jaccard index scores.
KW - DTR
KW - Reinforcement learning
KW - actor-critic framework
KW - consumer technology
KW - edge AI
KW - healthcare
UR - http://www.scopus.com/inward/record.url?scp=85211239896&partnerID=8YFLogxK
U2 - 10.1109/TCE.2024.3510616
DO - 10.1109/TCE.2024.3510616
M3 - Article
AN - SCOPUS:85211239896
SN - 0098-3063
VL - 71
SP - 4565
EP - 4573
JO - IEEE Transactions on Consumer Electronics
JF - IEEE Transactions on Consumer Electronics
IS - 2
ER -