Enterprise docs

Datasets

The datasets tile on your organization's home page is your portal to all the datasets owned by your organization and the metadata about them. If your organization uses our metadata catalog, all the metadata captured for your organization's data sources can be found here. This is also where all of the datasets containing your data--both stored on data.world and from virtual connections--are located:

Org_home_page.png

Under the Data tile is the count of the datasets owned by the organization. Clicking on the Data tile or the link to the number of datasets right below it will open a list of all your organization's datasets. There you can see:

  • The names of the datasets

  • Who created them

  • When they were last updated

  • The tags used in them

Organization_datasets.png

Notice in the upper right corner that there is a search bar which contains a filter to only return the datasets owned by the organization. To the right of the search bar is a button to create a new dataset or project. The three-dot menu on the very right has an option to edit multiple datasets at once (assuming you have edit permissions to them).

When you click on one of the datasets and open it, you'll see a header at the top of the page that has several valuable pieces of information and actions that you can take in regards to the dataset:

Dataset_header.png

On the left of the header are:

  • The icon indicating that the resource is a dataset

  • The name of the dataset

  • Its status (this is an available configuration)

  • The name of the owning organization and a link to its profile page

On the right of the header are options to:

  • Share the dataset

  • Bookmark it (and see how many others have bookmarked it

  • Access the Explore this dataset menu

Dataset_overview_right_top.png

On the Explore menu are options to:

  • See all the projects that use the dataset

  • Connect the dataset to an existing project

  • Open the dataset with a third-party application (integrated with data.world)

  • Create a new project using the dataset

Overview

When you open a dataset, you land on the Overview tab:

Dataset_overview_top.png

In the pane directly below the header is the short description of the dataset, and the longer summary information.

Much of the information on a dataset's overview tab is configurable by your organization. Shown below is a More Information section containing an example of completely configured metadata :

Dataset_overview_middle.png

Directly below the information pane is a section containing:

  • A count of the number of files in the dataset

  • The autosync settings for the dataset

  • Sorting options for the dataset files below

  • The option to view them in a preview or in a list

  • A link to add new files (depending on your permissions to the dataset)

All of the files in the dataset are listed next. Tabular files are also previewed. Beneath all of the files at the bottom of the page are references for related datasets that might interest you. Notice the rich metadata that can be included in the preview:

Dataset_overview_bottom.png
Right sidebar on dataset

There is a right sidebar on the Overview, Access, and Activity tabs of a dataset page that holds more default and custom metadata about the dataset:

Dataset_overview_right_middle.png

In our example this metadata includes:

  • The sharing settings

  • Who created it and when

  • The various people who manage it (Steward and Tech Owner)

  • Who verified it and when

  • The tags used on it

  • It's size

  • The data dictionary information about it with a link to the dictionary

  • A list of the queries that live in the dataset

Below the queries is a list of related projects, and the recent updates to the dataset including what they were, who made them, and when:

Dataset_overview_right_bottom_.png
Access

The Access tab on a dataset lists everyone who has access to the dataset as well as what their access permissions are. The names are links that will take to their user profiles:

Data_Access_tab.png

If you are an administrator of the dataset you can also invite new contributors from by selecting Grant Access. and completing the Invite contributors dialog:

Invite_contributors.png
Discussion

The Discussion tab of a dataset is a place to capture all the communication between team members about the dataset:

Data_discussion.png

The content on the page is grouped into topics to keep it orderly. The information on the Topic tag includes:

  • A description of the topic

  • How many comments are in the topic

  • The category of the topic

When you have something to post you can either put it in an existing topic or create a new topic:

Discussion_add_topic.png

You can also flag or like comments. Flagging a post notifies a moderator of potentially inappropriate content in the post:

Discussion_flag.png

Comments in topics use Markdown and can contain images and links.

Activity

The Activity tab of a dataset is packed with information about everything that has happened to the dataset, and it also provides access to previous versions of the dataset. The Activitytab has two different screens, the first of which shows you all the activity that's taken place:

Data_activity.png
Activity_detail.png

The second screen on the Activity tab holds the version information for the dataset and allows you to download previous versions as necessary:

Data_Activity_versions.png
Settings

The Settings tab of a dataset is for administrators to manage the dataset. It has two screens, the first of which is for administrative settings which include:

  • The dataset visibility

  • Automatic syncing options

  • Additional notification recipients

  • Archive options

  • A delete button

Data_settings_top.png
Data_settings_bottom.png

The second screen on the Settings tab is for configuring and managing webhooks to the dataset:

Data_settings_webhook.png