Repository logo

Development of a Cambridge Structural Database Subset: A Collection of Metal-Organic Frameworks for Past, Present, and Future

Accepted version



Change log


Zoroufchian Moghadam, P 
Wiggin, SB 
Tao, A 
Maloney, AGP 


We report the generation and characterization of the most complete collection of metal-organic frameworks (MOFs) maintained and updated, for the first time, by the Cambridge Crystallographic Data Centre (CCDC). To set up this subset, we asked the question “what is a MOF?” and implemented a number of “look-for-MOF” criteria embedded within a bespoke Cambridge Structural Database (CSD) Python API workflow to identify and extract information on 69 666 MOF materials. The CSD MOF subset is updated regularly with subsequent MOF additions to the CSD, bringing a unique record for all researchers working in the area of porous materials around the world, whether to perform high-throughput computational screening for materials discovery or to have a global view over the existing structures in a single resource. Using this resource, we then developed and used an array of computational tools to remove residual solvent molecules from the framework pores of all the MOFs identified and went on to analyze geometrical and physical properties of nondisordered structures.



3402 Inorganic Chemistry, 34 Chemical Sciences

Journal Title

Chemistry of Materials

Conference Name

Journal ISSN


Volume Title



American Chemical Society
D.F.-J. thanks the Royal Society for funding through a University Research Fellowship. Computational work was supported by the Cambridge High Performance Computing Cluster, Darwin.