TY - JOUR
T1 - Gaining insights into the physicochemical properties and sequence space of blood–brain barrier penetrating peptides
AU - Nath, Abhigyan
AU - Pandey, Sneha
AU - Nisar, Kottakkaran Sooppy
AU - Tiwari, Anoop Kumar
N1 - Publisher Copyright:
© 2024
PY - 2024/12
Y1 - 2024/12
N2 - The blood–brain barrier (BBB) poses a significant obstacle to the administration of drugs to the brain for the development of therapies of central nervous system (CNS) disorders. Blood-brain barrier penetrating peptides (BBBPps) are a group of peptides that can traverse the BBB by different processes without causing harm to the BBB. These peptides show promise as potential drugs for CNS ailments. Nevertheless, the process of identifying BBBPps using experimental approaches is both time-consuming and labour-intensive. To discover additional BBBPps as potential treatments for CNS diseases, it is critical to develop insilico methods that can distinguish BBBPps from non- BBBPps rapidly and precisely. In the current work, machine learning aided models are developed for accurate prediction of BBBPps using physicochemical and evolutionary information. The best model achieved an accuracy of 84.8 % by using 5-fold cross-validation and 79.8 % based on the holdout testing set on a reduced set of features. Further an extensive analysis is carried out for the black box models using model agnostic interpretation approaches to infer the physicochemical and sequence space of BBBPps. Basic amino acids and conservation of Arginine and Lysine are found to be more favoured in BBBPps. Further, basic amino acid property group and its interactions with other features are found to be prominent important interactions.
AB - The blood–brain barrier (BBB) poses a significant obstacle to the administration of drugs to the brain for the development of therapies of central nervous system (CNS) disorders. Blood-brain barrier penetrating peptides (BBBPps) are a group of peptides that can traverse the BBB by different processes without causing harm to the BBB. These peptides show promise as potential drugs for CNS ailments. Nevertheless, the process of identifying BBBPps using experimental approaches is both time-consuming and labour-intensive. To discover additional BBBPps as potential treatments for CNS diseases, it is critical to develop insilico methods that can distinguish BBBPps from non- BBBPps rapidly and precisely. In the current work, machine learning aided models are developed for accurate prediction of BBBPps using physicochemical and evolutionary information. The best model achieved an accuracy of 84.8 % by using 5-fold cross-validation and 79.8 % based on the holdout testing set on a reduced set of features. Further an extensive analysis is carried out for the black box models using model agnostic interpretation approaches to infer the physicochemical and sequence space of BBBPps. Basic amino acids and conservation of Arginine and Lysine are found to be more favoured in BBBPps. Further, basic amino acid property group and its interactions with other features are found to be prominent important interactions.
KW - Accumulated local effects
KW - Blood brain barrier penetrating peptides
KW - Gradient boosting machines
KW - Model agnostic interpretation methods
KW - Physicochemical space
KW - Sequence properties
KW - XGboost
UR - https://www.scopus.com/pages/publications/85205688774
U2 - 10.1016/j.eij.2024.100557
DO - 10.1016/j.eij.2024.100557
M3 - Article
AN - SCOPUS:85205688774
SN - 1110-8665
VL - 28
JO - Egyptian Informatics Journal
JF - Egyptian Informatics Journal
M1 - 100557
ER -