LibGuides: Open Access: Finding and Accessing Open Access Resources: Open Data

Where Can I Find Open Data?

As previously noted, the web can be regarded as an endless source of open data. Some of it is structured and documented, but most of it is unregulated and uploaded to various platforms with little thought to how it might be usefully re-used by the global research community. Finding open data is easy; finding open data that suits your research needs, or that can be used to verify and reproduce the findings of a research publication, will probably require additional input on your part.

Given the daunting amount of data available, the best place to start looking for suitable data is among peers, your subject area community and social media hubs. In many cases, a simple Google search will identify many of the open data resources you are looking for. Reliable sources of data are widely reported and shared. Trustworthy data will usually be hosted by a reputable institution, have sufficient documentation to understand the data, have a licence attached and provide a Digital Object Identifier (DOI) for citation and attribution.
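
If you are comparing several candidate datasets, the indicators of trustworthy data listed above (a reputable host, documentation, a licence and a DOI) can be recorded as a simple checklist. The following is a minimal Python sketch only: the DatasetRecord class, its field names and the two helper functions are hypothetical and are not part of any repository's software, and the DOI pattern is a loose check of the usual "10.xxxx/suffix" shape rather than a full validator.

```python
import re
from dataclasses import dataclass

# Loose check for the common "10.<registrant>/<suffix>" DOI shape.
# This is an assumption for illustration, not a complete DOI validator.
DOI_PATTERN = re.compile(r"^10\.\d{4,9}/\S+$")


@dataclass
class DatasetRecord:
    """Hypothetical notes about a dataset you are evaluating."""
    title: str
    host: str = ""           # institution or repository hosting the data
    documentation: str = ""  # README, codebook or other descriptive notes
    licence: str = ""        # for example "CC BY 4.0"
    doi: str = ""            # for example "10.1234/example"


def missing_trust_indicators(record: DatasetRecord) -> list[str]:
    """Return the names of the indicators that the record lacks."""
    missing = []
    if not record.host.strip():
        missing.append("host")
    if not record.documentation.strip():
        missing.append("documentation")
    if not record.licence.strip():
        missing.append("licence")
    if not DOI_PATTERN.match(record.doi.strip()):
        missing.append("doi")
    return missing


def doi_link(doi: str) -> str:
    """Build the https://doi.org/ link used to cite and resolve a DOI."""
    return f"https://doi.org/{doi.strip()}"
```

An empty list from missing_trust_indicators means that all four indicators are present in your notes. It does not by itself establish that the data are reliable, so you would still need to read the documentation and check the licence terms.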

You may find the following sources helpful as a starting point:

KnowledgeBase Research Information Portal
KnowledgeBase is the University of Strathclyde’s research information portal and a good place to familiarise yourself with the kind of data, documentation and access provision offered by other universities via their institutional repositories. There is no unified searchable catalogue of all datasets held in UK universities, but most university home pages will have links to their research portal, similar to Strathclyde's KnowledgeBase.
data.gov.uk
data.gov.uk is a searchable source of data published by central government, local authorities and public bodies. National archives, libraries, museums and heritage institutions are increasingly digitising their collections and making them freely available online.
re3data
re3data is a global registry of over two thousand research data repositories from a diverse range of academic disciplines, managed by DataCite since 2016. It provides information on repositories for the permanent storage and access of data sets to researchers and can be browsed by multiple categories.
Zenodo
Zenodo is "a catch-all repository for European Commission funded research" hosted by CERN.
Figshare
Figshare is a repository "where users can make all of their research outputs available in a citable, shareable and discoverable manner". Figshare is also used to host some universities' data collections, for example that of the University of Leicester.
Dryad
Dryad is "a curated resource that makes research data discoverable, freely reusable, and citable. Dryad provides a general-purpose home for a wide diversity of data types".
GitHub
GitHub is a vast, free repository of developmental software and code.
Open Science Framework (OSF)
OSF is an increasingly popular, free and open platform for depositing academic research. You can search public projects to build on the work of others and find new collaborators.
Mendeley Data
You can use Mendeley Data to create and deposit datasets or search the harvested catalogue of 27 million datasets from domain-specific and cross-domain repositories. Provided by Elsevier and integrated with Pure.
Data Foundry
Data Foundry, data collections from the National Library of Scotland, is an excellent example of open data made available in a variety of re-useable formats.

Research Funder and Discipline Specific Repositories

Some research funding bodies require data to be deposited in specific discipline-based repositories. Examples include:

NERC Data Centres
The Natural Environment Research Council (NERC) has an Environmental Data Service (EDS) that provides a focal point for NERC's scientific data and information.
Subject specific data repositories include:
• UK Polar Data Centre, based at the British Antarctic Survey
• British Oceanographic Data Centre (BODC)
• Environmental Information Data Centre (EIDC)
• National Geoscience Data Centre (NGDC)
UK Data Service
For Economic and Social Research Council (ESRC) social science qualitative and quantitative data.
BBSRC data resources
The Biotechnology and Biological Sciences Research Council (BBSRC) contributes funding to a number of international bioscience data repositories and resources.
Archaeology Data Service
A long-established centre of excellence for Arts & Humanities Research Council (AHRC) and NERC resources.

Can I Trust Open Data?

It is highly unlikely that an open dataset exists which exactly matches your research requirements. In practice, open data generally serves two main functions, and both require additional input in order to be usable:

• As a source of raw data which can be collected and repurposed, or used to test experimental software or code.
• As a means to verify or reproduce the research of others, usually found via a Data Availability Statement in a published academic paper.

Trust is mainly based on the provenance of data, so the quality of the documentation or metadata which accompanies open data is of paramount importance. Initiatives such as FAIR data and the CoreTrustSeal certification of repositories, together with the policies of some publishers, are gradually building a framework in which open data can be trusted, verified and reused in academic research.
