It is often useful to store raw data as well as the cleaned, processed data, as it can help your work to be more easily reproduced. If another researcher has your raw data and the steps you used to process and analyse, they can recreate your results. This has to be balanced with the cost of storing raw data, and the likelihood of the raw data being useful compared to data that has undergone an initial process of data cleaning.
Reference research data
A static or organic conglomeration or collection of smaller (peer reviewed) datasets, most probably published and curated, e.g. UK Tide Gauge Network, IUCN Red List of Endangered Species
The evidence that underpins the answer to the research question
Recorded factual material commonly retained by and accepted in the scientific community as necessary to validate research findings.
Data that are used as primary sources to support technical or scientific enquiry, research, scholarship, or artistic activity, and that are used as evidence in the research process and/or are commonly accepted in the research community as necessary to validate research findings and results. All other digital and non-digital content have the potential of becoming research data. Research data may be experimental data, observational data, operational data, third party data, public sector data, monitoring data, processed data, or repurposed data.
- CASRAI Dictionary
Research Data Lifecycle
A model to conceptualise the different stages through which data pass during the research process, and the data management activities that relate to those stages.
The model used throughout Data Tree has six stages, corresponding to different activities during the life of a research project. Other institutions or paradigms have slight variations on these stages, but the broad concepts are applicable no matter how you choose to categorise your research activities.