Repository logo

Sequence-based prediction of the intrinsic solubility of peptides containing non-natural amino acids.

Published version

Repository DOI

Change log


Non-natural amino acids are increasingly used as building blocks in the development of peptide-based drugs as they expand the available chemical space to tailor function, half-life and other key properties. However, while the chemical space of modified amino acids (mAAs) such as residues containing post-translational modifications (PTMs) is potentially vast, experimental methods for measuring the developability properties of mAA-containing peptides are expensive and time consuming. To facilitate developability programs through computational methods, we present CamSol-PTM, a method that enables the fast and reliable sequence-based prediction of the intrinsic solubility of mAA-containing peptides in aqueous solution at room temperature. From a computational screening of 50,000 mAA-containing variants of three peptides, we selected five different small-size mAAs for a total number of 37 peptide variants for experimental validation. We demonstrate the accuracy of the predictions by comparing the calculated and experimental solubility values. Our results indicate that the computational screening of mAA-containing peptides can extend by over four orders of magnitude the ability to explore the solubility chemical space of peptides and confirm that our method can accurately assess the solubility of peptides containing mAAs. This method is available as a web server at .


Acknowledgements: M.O. is a PhD student funded by AstraZeneca. P.S. is a Royal Society University Research Fellow (URF\R1\201461). The project was supported by the Wellcome Trust (203249/Z/16/Z).

Funder: UKRI 10061100 and 10059436


Solubility, Amino Acids, Peptides

Journal Title

Nat Commun

Conference Name

Journal ISSN


Volume Title



Springer Science and Business Media LLC
Wellcome Trust (203249/Z/16/Z)
The project was supported by the Wellcome Trust (203249/Z/16/Z).