Optimization and comparison of machine learning methods in estimation of carbon dioxide loading in chemical solvents for environmental applications

Liang Chen; Huan Huang; Lakshmi Thangavelu; Walid Kamal Abdelbasset; Dmitry Olegovich Bokov; Mohammed Algarni; Sami Ghazali; May Alashwal

doi:10.1016/j.molliq.2022.118513

Optimization and comparison of machine learning methods in estimation of carbon dioxide loading in chemical solvents for environmental applications

Liang Chen
, Huan Huang
, Lakshmi Thangavelu
, Walid Kamal Abdelbasset
, Dmitry Olegovich Bokov
, Mohammed Algarni
, Sami Ghazali
, May Alashwal

Physical Therapy and Health Rehabilitation

Research output: Contribution to journal › Article › peer-review

12 Scopus citations

Abstract

In this study, we developed a variety of machine learning ensemble models for predicting and correlating CO₂ solubility in amino acid salt solutions containing different concentrations. The models were utilized to establish a relationship between process parameters and CO₂ loading in the solvent. Indeed, the solitary model output was the amount of CO₂ that was loaded into and dissolved in the chemical solvent. When it came to selecting estimators, we tried three different approaches to correlate the CO₂ loading. Bagging and boosting models, both of which are subclasses of ensemble techniques are used in these models. When using ensemble techniques, a number of weak models are combined to build a strong and robust model for prediction of solubility values. There are a variety of models that are utilized including random forests (RF), extreme randomized trees (ERT), and boosted K-NN (with Adaboost). We repeated the procedure multiple times in order to obtain the best model, from which we could then establish the right hyper-parameters for each one of the models. Following optimization, the R² scores for all three models above 0.9, suggesting that the models had high predictive performance. ERT had the highest R² score, which was 0.999, among all companies. R² of 0.992 was achieved by Random Forest, also we have Boosted KNN, which achieved an R² of 0.998.

Original language	English
Article number	118513
Journal	Journal of Molecular Liquids
Volume	349
DOIs	https://doi.org/10.1016/j.molliq.2022.118513
State	Published - 1 Mar 2022

Keywords

Absorption
CO solubility
Machine learning
Modeling
Purification

Access to Document

10.1016/j.molliq.2022.118513

Cite this

@article{16207e12b1e846c498ccb366eccd4166,

title = "Optimization and comparison of machine learning methods in estimation of carbon dioxide loading in chemical solvents for environmental applications",

abstract = "In this study, we developed a variety of machine learning ensemble models for predicting and correlating CO2 solubility in amino acid salt solutions containing different concentrations. The models were utilized to establish a relationship between process parameters and CO2 loading in the solvent. Indeed, the solitary model output was the amount of CO2 that was loaded into and dissolved in the chemical solvent. When it came to selecting estimators, we tried three different approaches to correlate the CO2 loading. Bagging and boosting models, both of which are subclasses of ensemble techniques are used in these models. When using ensemble techniques, a number of weak models are combined to build a strong and robust model for prediction of solubility values. There are a variety of models that are utilized including random forests (RF), extreme randomized trees (ERT), and boosted K-NN (with Adaboost). We repeated the procedure multiple times in order to obtain the best model, from which we could then establish the right hyper-parameters for each one of the models. Following optimization, the R2 scores for all three models above 0.9, suggesting that the models had high predictive performance. ERT had the highest R2 score, which was 0.999, among all companies. R2 of 0.992 was achieved by Random Forest, also we have Boosted KNN, which achieved an R2 of 0.998.",

keywords = "Absorption, CO solubility, Machine learning, Modeling, Purification",

author = "Liang Chen and Huan Huang and Lakshmi Thangavelu and Abdelbasset, \{Walid Kamal\} and Bokov, \{Dmitry Olegovich\} and Mohammed Algarni and Sami Ghazali and May Alashwal",

note = "Publisher Copyright: {\textcopyright} 2022 Elsevier B.V.",

year = "2022",

month = mar,

day = "1",

doi = "10.1016/j.molliq.2022.118513",

language = "English",

volume = "349",

journal = "Journal of Molecular Liquids",

issn = "0167-7322",

publisher = "Elsevier B.V.",

}

TY - JOUR

T1 - Optimization and comparison of machine learning methods in estimation of carbon dioxide loading in chemical solvents for environmental applications

AU - Chen, Liang

AU - Huang, Huan

AU - Thangavelu, Lakshmi

AU - Abdelbasset, Walid Kamal

AU - Bokov, Dmitry Olegovich

AU - Algarni, Mohammed

AU - Ghazali, Sami

AU - Alashwal, May

PY - 2022/3/1

Y1 - 2022/3/1

N2 - In this study, we developed a variety of machine learning ensemble models for predicting and correlating CO2 solubility in amino acid salt solutions containing different concentrations. The models were utilized to establish a relationship between process parameters and CO2 loading in the solvent. Indeed, the solitary model output was the amount of CO2 that was loaded into and dissolved in the chemical solvent. When it came to selecting estimators, we tried three different approaches to correlate the CO2 loading. Bagging and boosting models, both of which are subclasses of ensemble techniques are used in these models. When using ensemble techniques, a number of weak models are combined to build a strong and robust model for prediction of solubility values. There are a variety of models that are utilized including random forests (RF), extreme randomized trees (ERT), and boosted K-NN (with Adaboost). We repeated the procedure multiple times in order to obtain the best model, from which we could then establish the right hyper-parameters for each one of the models. Following optimization, the R2 scores for all three models above 0.9, suggesting that the models had high predictive performance. ERT had the highest R2 score, which was 0.999, among all companies. R2 of 0.992 was achieved by Random Forest, also we have Boosted KNN, which achieved an R2 of 0.998.

AB - In this study, we developed a variety of machine learning ensemble models for predicting and correlating CO2 solubility in amino acid salt solutions containing different concentrations. The models were utilized to establish a relationship between process parameters and CO2 loading in the solvent. Indeed, the solitary model output was the amount of CO2 that was loaded into and dissolved in the chemical solvent. When it came to selecting estimators, we tried three different approaches to correlate the CO2 loading. Bagging and boosting models, both of which are subclasses of ensemble techniques are used in these models. When using ensemble techniques, a number of weak models are combined to build a strong and robust model for prediction of solubility values. There are a variety of models that are utilized including random forests (RF), extreme randomized trees (ERT), and boosted K-NN (with Adaboost). We repeated the procedure multiple times in order to obtain the best model, from which we could then establish the right hyper-parameters for each one of the models. Following optimization, the R2 scores for all three models above 0.9, suggesting that the models had high predictive performance. ERT had the highest R2 score, which was 0.999, among all companies. R2 of 0.992 was achieved by Random Forest, also we have Boosted KNN, which achieved an R2 of 0.998.

KW - Absorption

KW - CO solubility

KW - Machine learning

KW - Modeling

KW - Purification

UR - https://www.scopus.com/pages/publications/85122915713

U2 - 10.1016/j.molliq.2022.118513

DO - 10.1016/j.molliq.2022.118513

M3 - Article

AN - SCOPUS:85122915713

SN - 0167-7322

VL - 349

JO - Journal of Molecular Liquids

JF - Journal of Molecular Liquids

M1 - 118513

ER -

Optimization and comparison of machine learning methods in estimation of carbon dioxide loading in chemical solvents for environmental applications

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this