Machine Learning-Based Virtual Screening, Molecular Docking and Drug-Likeness to Discover New Inhibitors of the Glycoprotein Spike (S1) of SARS-CoV-2
Author(s)
Cobre AF1, Junkert A1, Fachi MM1, Böger B2, Surek M1, Wiens A3, Tonin F1, Pontarolo R3
1Pharmaceutical Sciences Postgraduate Program, Federal University of Paraná, Curitiba, Brazil, 2Unimed Curitiba, Curitiba, Brazil, 3Federal University of Parana, Curitiba, Brazil
Presentation Documents
OBJECTIVES: Despite great advancements in COVID-19 immunization, the development of therapeutic interventions is urgent to control the ongoing pandemic, especially infected patients. The spike protein (S1) of SARS-Cov-2 virus plays a major role in attachment to the host and further series of events. We aimed to identify natural bioactive compounds (NBC) that act as potential inhibitors of S1 by means of in silico assays. METHODS: NBCs with proved biological in vitro activities were obtained from the ZINC database (https://zinc.docking.org) and analyzed through virtual screening and molecular docking to identify those with higher affinity to the S1. Machine learning models of principal component analysis (PCA), artificial neural networks (ANN), discriminant analysis by partial least squares (PLS-DA) and decision tree (DT) were used to validate the results. Selected NBCs were submitted to drug-likeness analysis using the Lipinsk and Vebber's five rule. The prediction of pharmacokinetic parameters (i.e. absorption, metabolism, distribution, elimination) and toxicity (e.g. hepatotoxicity, cardiotoxicity, carcinogenicity, immunotoxicity) were performed (ADMET). The influence of the NBC’s stereoisomeric, tautomeric and protonation states at physiological pH on the pharmacodynamics, pharmacokinetics and toxicity analyses were also evaluated. RESULTS: A total of 170,906 compounds were analyzed. Of these, only 36 showed greater affinity with the S1 (affinity energy <0.8 kcal/mol). The PCA and PLS-DA models were able to reproduce the results of the virtual screening and docking analyzes with an accuracy of 97.5%. Of these 36 CNBs, only 12 (33.33%) were drug-likeness. The ADMET analysis showed that the natural compound phaselol (7-[[(1R,4aS,6R,8aR)-6-hydroxy-2,5,5,8a-tetramethyl-1,4,4a,6,7,8-hexahydronaphthalen-1-yl]methoxy]chromen-2-one) was the most promising in inhibiting the SARS-COV-2 spike. CONCLUSIONS: Machine learning-based research is efficient for retrieving novel approaches to diseases’ treatment. We identified 12 compounds as possible inhibitors of S1; phaselol was the most promising candidate for treating COVID-19. In vitro, preclinical studies and clinical trials are now needed to confirm these findings.
Conference/Value in Health Info
2021-11, ISPOR Europe 2021, Copenhagen, Denmark
Value in Health, Volume 24, Issue 12, S2 (December 2021)
Code
POSB422
Topic
Methodological & Statistical Research
Topic Subcategory
Artificial Intelligence, Machine Learning, Predictive Analytics
Disease
Multiple Diseases