TY - JOUR
T1 - Prediction of busulfan solubility in supercritical CO2 using tree-based and neural network-based methods
AU - Tianhao, Zhou
AU - Sh. Majdi, Hasan
AU - Dmitry Olegovich, Bokov
AU - Abdelbasset, Walid Kamal
AU - Thangavelu, Lakshmi
AU - Su, Chia Hung
AU - Chinh Nguyen, Hoang
AU - Alashwal, May
AU - Ghazali, Sami
N1 - Publisher Copyright:
© 2022 Elsevier B.V.
PY - 2022/4/1
Y1 - 2022/4/1
N2 - Drug solubility is a critical parameter in the pharmaceutical industry for developing efficient processes for production of nanomedicine at industrial scale. Several attempts have been made in recent years to investigate and obtain this parameter using various data mining methods, including neural networks. In this study, to reduce the error rate in predicting solubility, three methods including Multi-layer Perceptron (MLP), decision tree, and random forest have been applied to 32 rows of experimental data collected from literature for solubility of a model drug in supercritical CO2. Afterwards, the results of these models are examined and compared with measured data to calibrate and validate the developed models. Finally, the mean squared error improved to 1.77 e −5 in Random Forest Model. MLP and decision tree models mean squared errors are equal to 6.72 e −5 and 3.28 e −5, respectively which is a good result, especially when we can guarantee that the model did not have more problems in predicting the drug solubility and can be used as reliable methods in the pharmaceutical area.
AB - Drug solubility is a critical parameter in the pharmaceutical industry for developing efficient processes for production of nanomedicine at industrial scale. Several attempts have been made in recent years to investigate and obtain this parameter using various data mining methods, including neural networks. In this study, to reduce the error rate in predicting solubility, three methods including Multi-layer Perceptron (MLP), decision tree, and random forest have been applied to 32 rows of experimental data collected from literature for solubility of a model drug in supercritical CO2. Afterwards, the results of these models are examined and compared with measured data to calibrate and validate the developed models. Finally, the mean squared error improved to 1.77 e −5 in Random Forest Model. MLP and decision tree models mean squared errors are equal to 6.72 e −5 and 3.28 e −5, respectively which is a good result, especially when we can guarantee that the model did not have more problems in predicting the drug solubility and can be used as reliable methods in the pharmaceutical area.
KW - Decision tree
KW - Drug
KW - Machine learning
KW - Solubility
KW - Supercritical processing
UR - http://www.scopus.com/inward/record.url?scp=85123924716&partnerID=8YFLogxK
U2 - 10.1016/j.molliq.2022.118630
DO - 10.1016/j.molliq.2022.118630
M3 - Article
AN - SCOPUS:85123924716
SN - 0167-7322
VL - 351
JO - Journal of Molecular Liquids
JF - Journal of Molecular Liquids
M1 - 118630
ER -