
Data publication with the structural biology data grid supports live analysis.


Authors

Meyer, Peter A 
Socias, Stephanie 
Key, Jason 
Ransey, Elizabeth 
Tjon, Emily C 

Abstract

Access to experimental X-ray diffraction image data is fundamental for validation and reproduction of macromolecular models and indispensable for development of structural biology processing methods. Here, we established a diffraction data publication and dissemination system, Structural Biology Data Grid (SBDG; data.sbgrid.org), to preserve primary experimental data sets that support scientific publications. Data sets are accessible to researchers through a community-driven data grid, which facilitates global data access. Our analysis of a pilot collection of crystallographic data sets demonstrates that the information archived by SBDG is sufficient to reprocess data to statistics that meet or exceed the quality of the original published structures. SBDG has extended its services to the entire community and is used to develop support for other types of biomedical data sets. It is anticipated that access to the experimental data sets will enhance the paradigm shift in the community towards a much more dynamic body of continuously improving data analysis.

Keywords

Crystallography, X-Ray, Databases, Genetic, Internet, Macromolecular Substances, Publications, Software

Journal Title

Nat Commun

Journal ISSN

2041-1723

Volume Title

7

Publisher

Springer Science and Business Media LLC

Sponsorship

Wellcome Trust (101908/Z/13/Z)
Development of the Structural Biology Data Grid is funded by The Leona M. and Harry B. Helmsley Charitable Trust 2016PG-BRI002 (to P.S. and M.C.). Development of citation workflows is supported by NSF 1448069 (to P.S.). DAA is being developed as a pilot project of the National Data Service, with additional funds to support storage and technology development, including NIH P41 GM103403 (NE-CAT) and 1S10RR028832 (HMS) and DOE DE-AC02-06CH11357; NIH 1U54EB020406-01, Big Data for Discovery Science Center; and NIST 60NANB15D077 (Globus Project). Collections of pilot datasets were supported by various grants (see Table SI).