Starting a dataset
To create a dataset:
Click the + New button in the Main header toolbar of the application. If you are on the Organization profile page, on the Overview or Resources tab, select the Dataset option from the New button.
In the Create a new window, click Create new dataset.
On the Create a new dataset page, set the following:
Dataset name: Provide a name for the dataset.
Owner: Select the owner of the dataset. By default, if you are in an organization, your organization is set as the owner of the dataset. You can choose another organization from the dropdown or you can switch to personal account if you wish to be the owner as well as the creator of the dataset.
Share with: Specify who can see the dataset. Permission options to the dataset are either No one, All of Organization name (if an organization is the owner of the dataset, or Make public to anyone at your company (Make public to data.world community).
Click Create dataset.
Once the dataset is created, the main page is refreshed to show the Datasets details page.
On the Overview tab, in the Overview section, do the following:
Click the Edit button to add a description for the dataset and to add Tags to the dataset. Tagging datasets help search datasets later. In the Tags field, start typing the tag. If it already exists, it will show in as a suggestion with the number of times it is used. Click on the tag to add it or press Enter to create a new tag and add to the dataset.
Click the Add a summary with images, links, and custom formatting link in the Summary section to add a summary for the dataset.
Next, go to the Settings tab and set the following options for the dataset in the General section.
Automatic sync options: Set sync options for files and streams. The options available are: Never, Every hour, Every day, Every week
Caching live tables and live views: Select the Allow previews and data to be cached for one day option to enable caching of live tables and live views to make browsing of data more efficient. By default caching is disabled.
Additional notification recipient: Various notifications for datasets are sent to users contributing to datasets. If you want an additional email address to get notifications for the activities on a dataset, provide those email addresses in this field. Only one email address can be provided here.
Archive options: Select the Expand archives by default option to automatically expand all files in an archive file (for example, zip files) added to the project. Select the Prefix file name with archive name option to add the name of the archive file to the file names extracted from the archive file. This helps easily identify the source of the file.
For example, when these settings are enabled and a user adds a file latest_orders.zip with two files in it (california_orders.xlsx and austin_orders.xlsx), the system automatically extracts them and add the files as - latest_orders/austin_orders.xlsx and latest_orders/california_orders.xlsx to the dataset.
click Save.