Publish data

Research data made openly available through data repositories becomes searchable, citable and gives the authors more opportunities for impact, merit and future collaborations. Publishing research data also increases transparency and reliability of presented results. It also gives others the opportunity to find and reuse existing data in new research.

More detailed descriptions of datasets can be published in a journal as a "data paper".

Some data should not be shared openly

There may be legal, ethical, or other reasons for not sharing or publishing research data. This may apply to data that contains personal information, trade secrets, copyrighted material, or other confidential information. In these cases, the dataset can still be described and possibly deposited in a repository. The description of the data should then include contact information and conditions for accessing the data.

If you will handle data with sensitive personal information and plan to deposit the data in repositories with controlled release procedures, you should include that information in the application for ethics approval.

See also: As open as possible, as restricted as necessary (Researchdata.se)

Data repositories and licenses

If possible choose an established subject-based data repository when publishing data. In the re3data.org register (Registry of research data repositories), you can search for data repositories in various subject areas and countries.

When publishing data, specified creators and roles may be different from those specified in publications whose analysis and results are based on this data. For examples of roles see CRediT – Contributor Roles Taxonomy.

Examples of subject specific repositories

Examples of interdisciplinary repositories

  • Swedish National Data Service (SND) - a certified repository
    Data sets published in the SND Repository are reviewed by staff at the Uppsala University Research Data Support in dialogue with responsible researchers. The aim is to make published data more FAIR.
  • Zenodo - a repository operated by OpenAIRE and CERN, funded by the European Commission. Zenodo can be used for storing and publishing data sets, code and publications. Code created in Github can be published via Zenodo and be assigned a DOI-number.
  • Figshare - a data repository run by the company Digital Science.
  • Dryad - repository run by a network of universities, scholarly societies and publishers.
  • Dataverse - a data repository at Harvard University.

Licenses for data and code

To facilitate the reuse of published data, one can choose a license and specify conditions for reuse, for example, using Creative Commons licenses or Open Data Commons. The chosen license can inform about wethether modifications and commercial use are allowed, and how the data can be shared further.

DIGG, the Agency for Digital Government, has developed guidelines for open licenses and intellectual property (in Swedish). They recommend that data not subject to copyright or other intellectual property protection should be marked with PDM (Public Domain Mark) or CC0.

For data developed within research, however, a license that allows recognition to those who created the dataset is preferable, such as CC-BY.

More about licensing data at Researchdata.se

Some links about licenses for code, computer programs, and databases:

FOLLOW UPPSALA UNIVERSITY ON

Uppsala University on Facebook
Uppsala University on Instagram
Uppsala University on Youtube
Uppsala University on Linkedin