Repository logo
 

Modular decomposition of protein structure using community detection

cam.issuedOnline2018-08-08
dc.contributor.authorGrant, WP
dc.contributor.authorAhnert, SE
dc.contributor.authorEstrada, E
dc.contributor.orcidGrant, William [0000-0001-9309-7937]
dc.date.accessioned2019-05-08T14:20:55Z
dc.date.available2019-05-08T14:20:55Z
dc.date.issued2019
dc.description.abstractAs the number of solved protein structures increases, the opportunities for meta-analysis of this dataset increase too. Protein structures are known to be formed of domains; structural and functional subunits that are often repeated across sets of proteins. These domains generally form compact, globular regions, and are therefore often easily identifiable by inspection, yet the problem of automatically fragmenting the protein into these compact substructures remains computationally challenging. Existing domain classification methods focus on finding subregions of protein structure that are conserved, rather than finding a decomposition which spans the full protein structure. However, such a decomposition would find ready application in coarse-graining molecular dynamics, analysing the protein's topology, in de novo protein design and in fitting electron microscopy maps. Here, we present a tool for performing this modular decomposition using the Infomap community detection algorithm. The protein structure is abstracted into a network in which its amino acids are the nodes, and where the edges are generated using a simple proximity test. Infomap can then be used to identify highly intra-connected regions of the protein. We perform this decomposition systematically across 4000 distinct protein structures, taken from the Protein Data Bank. The decomposition obtained correlates well with existing PFAM sequence classifications, but has the advantage of spanning the full protein, with the potential for novel domains. The coarse-grained network formed by the communities can also be used as a proxy for protein topology at the single-chain level; we demonstrate that grouping these proteins by their coarse-grained network results in a functionally significant classification.
dc.identifier.doi10.17863/CAM.39701
dc.identifier.eissn2051-1329
dc.identifier.issn2051-1310
dc.identifier.urihttps://www.repository.cam.ac.uk/handle/1810/292541
dc.language.isoeng
dc.publisherOxford University Press (OUP)
dc.publisher.urlhttp://dx.doi.org/10.1093/comnet/cny014
dc.rightsAttribution 4.0 International
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subjectcommunity detection
dc.subjectprotein structure
dc.subjectbiological networks
dc.subjectspatial networks
dc.titleModular decomposition of protein structure using community detection
dc.typeArticle
prism.endingPage113
prism.issueIdentifier1
prism.publicationDate2018
prism.publicationNameJournal of Complex Networks
prism.startingPage101
prism.volume7
pubs.funder-project-idEPSRC (1644501)
pubs.funder-project-idThe Royal Society (uf080037)
pubs.funder-project-idGatsby Charitable Foundation (GAT3395/CCD)
pubs.funder-project-idEngineering and Physical Sciences Research Council (EP/L015552/1)
rioxxterms.licenseref.startdate2018-08-08
rioxxterms.licenseref.urihttp://www.rioxx.net/licenses/all-rights-reserved
rioxxterms.typeJournal Article/Review
rioxxterms.versionVoR
rioxxterms.versionofrecord10.1093/comnet/cny014

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
cny014.pdf
Size:
1.76 MB
Format:
Adobe Portable Document Format
Description:
Published version
Licence
http://creativecommons.org/licenses/by/4.0/
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
DepositLicenceAgreement.pdf
Size:
417.78 KB
Format:
Adobe Portable Document Format