TY - JOUR
T1 - Chemometric and computational modeling of polysaccharide coated drugs for colonic drug delivery
AU - AlOmari, Ahmad Khaleel
AU - Almansour, Khaled
N1 - Publisher Copyright:
© The Author(s) 2025.
PY - 2025/12
Y1 - 2025/12
N2 - A methodology based on Principal Component Analysis (PCA) and machine learning (ML) regression was developed in this study for predicting 5-aminosalicylic acid drug release from polysaccharide-coated formulation. The Raman method was used for collection of spectral data which were then used as inputs to the ML models for estimation of drug release. For ML modeling, we examined the predictive accuracy of three machine learning models—Elastic Net (EN), Group Ridge Regression (GRR), and Multilayer Perceptron (MLP)—for forecasting the release behavior of samples. The dataset, consisting of 155 data points with over 1500 spectral features, underwent preprocessing involving normalization, Principal Component Analysis (PCA) for dimensionality reduction, and outlier detection using Cook’s Distance. Model hyperparameters were tuned using the Slime Mould Algorithm (SMA), and each model’s performance was evaluated through k-fold cross-validation (k = 3). Assessment metrics, such as the coefficient of determination (R²), root mean square error (RMSE), and mean absolute error (MAE), emphasize the MLP model’s exceptional performance. On the test set, MLP achieved an R² of 0.9989, notably higher than EN’s R² of 0.9760 and GRR’s R² of 0.7137. Additionally, MLP exhibited remarkably low test RMSE and MAE values at 0.0084 and 0.0067, respectively, in comparison to EN’s RMSE of 0.0342 and MAE of 0.0267, as well as GRR’s RMSE of 0.0907 and MAE of 0.0744. Parity plots and learning curves further validate MLP’s predictive reliability, demonstrating close alignment between actual and predicted values and efficient learning with minimal overfitting. Consequently, the MLP model emerges as the most effective approach for this predictive task, offering a robust tool for accurately modeling complex spectral data. These findings underscore the robustness of the MLP model, providing a reliable and efficient approach for predicting drug release in polysaccharide-coated formulations, with implications for advancing colonic drug delivery systems.
AB - A methodology based on Principal Component Analysis (PCA) and machine learning (ML) regression was developed in this study for predicting 5-aminosalicylic acid drug release from polysaccharide-coated formulation. The Raman method was used for collection of spectral data which were then used as inputs to the ML models for estimation of drug release. For ML modeling, we examined the predictive accuracy of three machine learning models—Elastic Net (EN), Group Ridge Regression (GRR), and Multilayer Perceptron (MLP)—for forecasting the release behavior of samples. The dataset, consisting of 155 data points with over 1500 spectral features, underwent preprocessing involving normalization, Principal Component Analysis (PCA) for dimensionality reduction, and outlier detection using Cook’s Distance. Model hyperparameters were tuned using the Slime Mould Algorithm (SMA), and each model’s performance was evaluated through k-fold cross-validation (k = 3). Assessment metrics, such as the coefficient of determination (R²), root mean square error (RMSE), and mean absolute error (MAE), emphasize the MLP model’s exceptional performance. On the test set, MLP achieved an R² of 0.9989, notably higher than EN’s R² of 0.9760 and GRR’s R² of 0.7137. Additionally, MLP exhibited remarkably low test RMSE and MAE values at 0.0084 and 0.0067, respectively, in comparison to EN’s RMSE of 0.0342 and MAE of 0.0267, as well as GRR’s RMSE of 0.0907 and MAE of 0.0744. Parity plots and learning curves further validate MLP’s predictive reliability, demonstrating close alignment between actual and predicted values and efficient learning with minimal overfitting. Consequently, the MLP model emerges as the most effective approach for this predictive task, offering a robust tool for accurately modeling complex spectral data. These findings underscore the robustness of the MLP model, providing a reliable and efficient approach for predicting drug release in polysaccharide-coated formulations, with implications for advancing colonic drug delivery systems.
KW - Colonic drug delivery
KW - Drug release
KW - Machine learning
KW - Raman spectra
UR - http://www.scopus.com/inward/record.url?scp=105003647247&partnerID=8YFLogxK
U2 - 10.1038/s41598-025-99823-x
DO - 10.1038/s41598-025-99823-x
M3 - Article
C2 - 40287592
AN - SCOPUS:105003647247
SN - 2045-2322
VL - 15
JO - Scientific Reports
JF - Scientific Reports
IS - 1
M1 - 14694
ER -