Usability of the dataset

To determine if the dataset is usable for your paper, you can assess the dataset on different aspects:

  • Format
  • Source
  • Availability
  • License

 

Format of the dataset

The format of the dataset:

O In which format is the dataset? (data: excel, spss, docx, pdf, etc.)

O Is the dataset downloadable on your device?

O Can you extract graphics, tables etc. from the dataset to incorporate in your research paper?

Source of the dataset(s)

The source of the dataset(s):

O Is the source or repository non commercial?

O Is the source or repository of the dataset trustworthy?

Availabilty of the dataset(s)

The availability of the dataset(s):

O closed = the dataset is not accessible (GDPR, Intellectual Property, Trade Secrets etc)

O embargoed = the dataset will be available after some time (mostly 6 -12 months)

O open = the dataset is free to use in another research paper (citation is mandatory)

O restricted = the dataset is available under conditions (e.g. non-commercial re-use and/or on request)

License of the dataset(s)

The license of the dataset(s):

Research data which is intended for sharing and re-use should have an assigned license. When you use datasets that have been licensed, ensure you use it in a the way that is permitted by the license. If the data has not been licensed, contact the data owner (rights holder) to obtain permission to re-use the data.

Data licenses

If you want to know more about licensing dataset(s), you can visit the licenses page at the Open Data Handbook.

Creative Common-licenses

Creative Commons tools give everyone from individual creators to large companies and institutions a simple, standardized way to grant copyright permissions to their creative work within the boundaries of copyright law.

Adapted from source:  https://creativecommons.org/licenses/

Below are some common licenses used with research datasets.

Public DomainPublic Domain Mark

Technically not a license, the public domain mark relinquishes all rights to a dataset and dedicates the dataset to the public domain.

CC-0Creative Commons Public Domain Dedication

A  Creative Commons license and is like a public domain dedication. The copyright holder surrenders rights in a dataset using this license.

ODC-PDDLOpen Data Commons Public Domain Dedication and License

This license is one of the Open Data Commons licenses and is like a public domain dedication. The copyright holder surrenders rights in a dataset using this license.

CC-BYCreative Commons Attribution 4.0 International

This license is one of the open Creative Commons licenses and allows users to share and adapt the dataset so long as they give credit to the copyright holder.

ODC-BYOpen Data Commons Attribution License

This license is one of the Open Data Commons licenses and allows users to share and adapt the dataset as long as they give credit to the copyright holder.

CC-BY-SACreative Commons Attribution-ShareAlike 4.0 International

This license is one of the open Creative Commons licenses and allows users to share and adapt the dataset as long as they give credit to the copyright holder and distribute any additions, transformations or changes to the dataset under this same license

ODC-ODbLOpen Data Commons Open Database License

This license is one of the Open Data Commons licenses and allows users to share and adapt the dataset as long as they give credit to the copyright holder and distribute any additions, transformation or changes to the dataset.

CC BY-NCCreative Commons Attribution-NonCommercial 4.0 International

This license is one of the Creative Commons licenses and allows users to share and adapt the dataset if they give credit to the copyright holder and do not use the dataset for any commercial purposes.

CC BY-NDCreative Commons Attribution-NoDerivatives 4.0 International

This license is one of the Creative Commons licenses and allows users to share the dataset if they give credit to copyright holder, but they cannot make any additions, transformations or changes to the dataset under this license.

CC BY-NC-SACreative Commons Attribution-NonCommercial-ShareAlike 4.0 International

This license is one of the Creative Commons licenses and allows users to share the dataset only if they (1) give credit to the copyright holder, (2) do not use the dataset for any commercial purposes, and (3) distribute any additions, transformations or changes to the dataset under this same license.

CC BY-NC-NDCreative Commons Attribution-NonCommercial-NoDerivatives 4.0 International

This license is one of the Creative Commons licenses and allows users to use only your unmodified dataset if they give credit to the copyright holder and do not share it for commercial purposes. Users cannot make any additions, transformations or changes to the dataset under this license.

Source: https://libanswers.ucalgary.ca/faq/200582

Evaluation of the dataset

To evaluate the dataset(s) are appropriate for your paper, you can assess the dataset on different aspects:

  • Quality 
    • How is the data collected?
    • For what purpose has the research been done?
    • Which research method is used?
    • Where is the dataset coming from? Is the data collector reliable?
    • What quality assurance procedures were used?
  • Findability
    • Has the dataset a complete description?
    • Are the results and variables described? 
    • Are articles or policies based on the results of the dataset(s)?
    • Is a Digital Object Identifier (DOI) or handle available to identify the dataset?
  • Accessibilty/security
    • Is the dataset freely accessible? 
    • Is data being reused and has the dataset a license? (CC-license)
    • Is the dataset securily accessible, because of privacy, public-private cooperation (agreements of Intellectual Property), further research?
  • Interoperability/reusability
    • How can the dataset be reused by third parties?
    • Can the dataset be reused without special soft- or hardware?
    • Has the dataset a preferred data citation format?

Data Citation

Examples of data citation:

APA

Hurk, T. Van Der, Gemeente Delft * Delft, Afd. Onderzoek En Statistiek (Primary Investigator). (2007). Stadspanel Delft 2000 - VSO [Data set]. Data Archiving and Networked Services (DANS). https://doi.org/10.17026/DANS-27A-TF83

Harvard

Hurk, T. Van Der, Gemeente Delft * Delft, Afd. Onderzoek En Statistiek (Primary Investigator) (2007) “Stadspanel Delft 2000 - VSO.” Data Archiving and Networked Services (DANS). doi: 10.17026/DANS-27A-TF83

[anchornavigation]