Show simple item record

dc.contributor.authorPeriwal, Vinita
dc.contributor.authorBassler, Stefan
dc.contributor.authorAndrejev, Sergej
dc.contributor.authorGabrielli, Natalia
dc.contributor.authorPatil, Kaustubh Raosaheb
dc.contributor.authorTypas, Athanasios
dc.contributor.authorPatil, Kiran
dc.date.accessioned2022-05-09T11:03:36Z
dc.date.available2022-05-09T11:03:36Z
dc.date.issued2022-04
dc.date.submitted2021-09-06
dc.identifier.issn1553-734X
dc.identifier.otherpcompbiol-d-21-01624
dc.identifier.urihttps://www.repository.cam.ac.uk/handle/1810/336887
dc.description.abstractNatural compounds constitute a rich resource of potential small molecule therapeutics. While experimental access to this resource is limited due to its vast diversity and difficulties in systematic purification, computational assessment of structural similarity with known therapeutic molecules offers a scalable approach. Here, we assessed functional similarity between natural compounds and approved drugs by combining multiple chemical similarity metrics and physicochemical properties using a machine-learning approach. We computed pairwise similarities between 1410 drugs for training classification models and used the drugs shared protein targets as class labels. The best performing models were random forest which gave an average area under the ROC of 0.9, Matthews correlation coefficient of 0.35, and F1 score of 0.33, suggesting that it captured the structure-activity relation well. The models were then used to predict protein targets of circa 11k natural compounds by comparing them with the drugs. This revealed therapeutic potential of several natural compounds, including those with support from previously published sources as well as those hitherto unexplored. We experimentally validated one of the predicted pair's activities, viz., Cox-1 inhibition by 5-methoxysalicylic acid, a molecule commonly found in tea, herbs and spices. In contrast, another natural compound, 4-isopropylbenzoic acid, with the highest similarity score when considering most weighted similarity metric but not picked by our models, did not inhibit Cox-1. Our results demonstrate the utility of a machine-learning approach combining multiple chemical features for uncovering protein binding potential of natural compounds.
dc.languageen
dc.publisherPublic Library of Science (PLoS)
dc.subjectResearch Article
dc.subjectComputer and information sciences
dc.subjectMedicine and health sciences
dc.subjectBiology and life sciences
dc.subjectPhysical sciences
dc.titleBioactivity assessment of natural compounds using machine learning models trained on target similarity between drugs.
dc.typeArticle
dc.date.updated2022-05-09T11:03:36Z
prism.issueIdentifier4
prism.publicationNamePLoS Comput Biol
prism.volume18
dc.identifier.doi10.17863/CAM.84306
dcterms.dateAccepted2022-03-17
rioxxterms.versionofrecord10.1371/journal.pcbi.1010029
rioxxterms.versionVoR
rioxxterms.licenseref.urihttp://creativecommons.org/licenses/by/4.0/
datacite.contributor.supervisoreditor: Maranas, Costas D.
dc.contributor.orcidPeriwal, Vinita [0000-0002-3583-0413]
dc.contributor.orcidBassler, Stefan [0000-0002-0652-9242]
dc.contributor.orcidAndrejev, Sergej [0000-0002-7875-0261]
dc.contributor.orcidGabrielli, Natalia [0000-0001-5222-3925]
dc.contributor.orcidTypas, Athanasios [0000-0002-0797-9018]
dc.contributor.orcidPatil, Kiran [0000-0002-6166-8640]
dc.identifier.eissn1553-7358
pubs.funder-project-idEMBL Interdisciplinary Postdoc (EI3POD) program under Marie Skłodowska-Curie actions (664726)
pubs.funder-project-idJoachim Herz Stiftung (Fellowship for Interdisciplinary Science)
pubs.funder-project-idMedical Research Council (MC_UU_00025/11)
cam.issuedOnline2022-04-25


Files in this item

Thumbnail
Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record