When to use a dataset and when to use a project
Generally, if you are uploading data to share, or private data that you might want to reuse in other projects, it is better to add the data to a dataset. When a project uses a dataset, all of the dataset's metadata automatically shows up in the project because the dataset is linked rather than copied. All changes to the original dataset, including automatic updates from the source and manual metadata updates by the dataset owner, are also carried over to the project.
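The difference between linking and copying can be illustrated with a small Python sketch. The `Dataset` and `Project` classes below are hypothetical and are not part of any data.world API; they only model the idea that a project holds a reference to a dataset, so edits to the original are visible from the project.

```python
from dataclasses import dataclass, field


@dataclass
class Dataset:
    """Hypothetical stand-in for a dataset and its metadata."""
    name: str
    metadata: dict = field(default_factory=dict)


@dataclass
class Project:
    """Hypothetical stand-in for a project that links datasets."""
    name: str
    linked: list = field(default_factory=list)

    def link(self, dataset: Dataset) -> None:
        # Store a reference to the dataset, not a copy of it.
        self.linked.append(dataset)


sales = Dataset("sales", {"description": "Q1 sales"})
report = Project("quarterly-report")
report.link(sales)

# The dataset owner edits the metadata on the original dataset.
sales.metadata["description"] = "Q1 and Q2 sales"

# The project sees the change because it holds a link, not a copy.
print(report.linked[0].metadata["description"])
```

Had the project stored a copy of the dataset instead, the last line would still show the old description, and the metadata would need to be updated by hand in every project that used the data.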
The following table summarizes the differences between adding data files to a dataset vs. to a project:
Capability | Dataset | Project |
---|---|---|
Can run and save queries against | X | X |
Can have charts/visualizations | X | |
Can incorporate different file types | X | X |
Can contain multiple files | X | X |
Can be shared and have contributors | X | X |
Can have a discussion thread | X | X |
Can include insights | X | |
Can use existing data.world datasets without having to download and reimport them or recreate the associated metadata | | X |
Can be included in a project | X | |