Skip Ribbon Commands
Skip to main content

Software & Data Citation Workshop

:

Use Case Summary: Citing very large volume datasets that are too large for current repositories

Version HistoryVersion History

Number

17

Title

Citing very large volume datasets that are too large for current repositories

Summary

Investigator runs experiments where the main raw data type is high resolution images and videos. Raw data is about 4 TB per experiment. Processed data is still 1 TB in order to provide a dataset that would allow reproducing the results. Current data repositories usually do not offer this much storage, so it is very hard to obtain a citable DOI for such a large dataset. Usually, dataset DOIs are not assigned unless the "trusted" allocating agent has possession of the data resource (so that the DOI will not point to a resource that moves or is changed).​

URL

https://github.com/ResearchSoftwareInstitute/software-data-citation-ws/issues/17

Tag

large & complex data

Breakout

round-2

Attachments

Version: 2.0
Created at 1/25/2015 11:41 AM by Ray Idaszak
Last modified at 1/28/2015 9:49 PM by Ray Idaszak