Finding published research datasets for chemistry research
Most data archives for chemistry datasets are dedicated to structural data, whereas datasets underpinning publications currently tend to be published either in interdisciplinary data archives (particularly Dryad or figshare) or as supplementary material to journal articles.
The structural databases listed in this guide are those recommended by major UK funding bodies or by journal publishers. You can search re3data or FAIRsharing for a full list of research data archives that publish chemistry datasets.
University of Bath Library Subject Homepages
Cambridge Structural Database
Web access to the Cambridge Structural Database, which holds over 900,000 organic and organometallic crystal structures.
Inorganic Crystal Structure Database (ICSD)
Crystal structures of approximately 160,000 inorganic compounds.
Crystallography Open Database (COD)
Open-access collection of crystal structures of organic, inorganic and metal-organic compounds and minerals, excluding biopolymers.
Archive for crystal structures generated by the Southampton Chemical Crystallography Group and the EPSRC UK National Crystallography Service.
Electron Microscopy Data Bank (EMDB)
Electron microscopy density maps of macromolecular complexes and subcellular structures, covering single-particle analysis, electron tomography and electron crystallography.
Royal Society of Chemistry data publication guidance
The Royal Society of Chemistry has published recommendations for the use of data archives for data underpinning publications within their journals. Their recommended archives are listed below.
Access to three databases: PubChem BioAssay, PubChem Compound and PubChem Substance.
Chemical structure database with access to over 67 million structures from over 400 data sources.
University of Bath Research Data Archive
Institutional data archive for the University of Bath. Free to use for all University of Bath researchers.
General data repository used mainly for life sciences data.
General data repository for a wide range of research outputs including datasets.
Crystallography, structural and sequence data