Research data supporting the study titled 'Large Scale Discovery of Microbial Fibrillar Adhesins and Identification of Novel Members of Adhesive Domain Families'.
Repository URI
Repository DOI
Type
Dataset
Change log
Authors
Monzon, Vivian
Bateman, Alex
Description
This repository contains the potential fibrillar adhesin-like (FA-like) proteins in Firmicutes and Actinobacteria that were predicted in this study using a Random Forest classification approach. For each protein, the UniProt identifier is given together with the calculated feature properties. The repository also contains the PDB files of the N-terminal clusters in Firmicutes that were found in predicted FA-like proteins with at least four known stalk domains and no known adhesive domain. The structures were predicted using AlphaFold2.
Version
Software / Usage instructions
The FA-like proteins were predicted with a Random Forest classification approach (https://github.com/VivianMonzon/FAL_prediction). The structures were predicted using AlphaFold2 (https://www.nature.com/articles/s41586-021-03819-2).
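The selection criterion given in the description (at least four known stalk domains and no known adhesive domain) can be illustrated with a minimal sketch. It is not the authors' pipeline: the column names (uniprot_id, n_stalk_domains, n_adhesive_domains), the tab-separated layout, and the protein identifiers are assumptions made for illustration only, and the actual files in this repository may be organised differently.

```python
import csv
import io

# Hypothetical column names; the real headers in the repository files may differ.
ID_COL = "uniprot_id"
STALK_COL = "n_stalk_domains"
ADHESIVE_COL = "n_adhesive_domains"


def select_stalk_only(rows, min_stalk=4):
    """Return IDs of proteins with at least `min_stalk` stalk domains
    and no adhesive domain."""
    selected = []
    for row in rows:
        if int(row[STALK_COL]) >= min_stalk and int(row[ADHESIVE_COL]) == 0:
            selected.append(row[ID_COL])
    return selected


# Small made-up table standing in for the real data (placeholder identifiers).
example_tsv = (
    "uniprot_id\tn_stalk_domains\tn_adhesive_domains\n"
    "PROT_A\t6\t0\n"
    "PROT_B\t3\t0\n"
    "PROT_C\t5\t1\n"
    "PROT_D\t4\t0\n"
)

rows = list(csv.DictReader(io.StringIO(example_tsv), delimiter="\t"))
selected_ids = select_stalk_only(rows)
print(selected_ids)
```

To apply the same filter to a downloaded table, replace the in-memory example with an opened file handle and adjust the three column names to match the actual header.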
Keywords
Adhesive domains, AlphaFold2, Fibrillar adhesins, Host-pathogen interaction, Protein domain families, RandomForest classification, Structure prediction methods