Enterprise docs

Datasets

The Datasets tile on the Organization Profile Page is your portal to all the datasets owned by your organization and the metadata about them. If your organization uses our metadata catalog, all the metadata captured for your organization's data sources can be found here. This is also where all of the datasets containing your data--both stored on data.world and from virtual connections--are located.

org_profile_resources_datasets.png

Clicking the Datasets tile opens the Resources tab that shows a filtered list of all your organization's datasets. There you can see:

  • Name of the dataset

  • Who created it

  • When it was last updated

  • Other details, such as a brief description of the dataset, the number of files and tables in the dataset, and the number of projects that use the dataset.

    You can use this information to then decide if you need to navigate to the dataset to explore it further.

org_profile_resources_datasets_details.png

Some of the options available to you on this page are:

  • Powerful filter options that let you narrow down the list of datasets available to you.

  • A search box to search for specific datasets within the organization.

  • A New Resources button to create new datasets, projects, or Analysis.

  • A Three dot menu to access the option to Edit multiple datasets.

When you click the dataset and open it, you see the Dataset Details page. The header at the top of the page has several valuable pieces of information and actions that you can take in regards to the dataset.

dataset_details_page_toolbar.png

On the left of the header are:

  • An icon for the dataset that includes on indicator for the status of the dataset. For example, Approved, Deprecated. Needs Review.

  • The name of the person or organization that owns the dataset. Click the owner link to get to their profile page.

  • The name of the dataset.

On the right of the header are options to:

  • Share the dataset

  • Bookmark it (and see how many others have bookmarked it)

  • Request access button, if you don't have access to the dataset.

  • Three dot menu to access the option to subscribe or unsubscribe from email notifications for the dataset (available if you don't have access to the dataset.)

  • Access the Explore this dataset menu (available if you have access to the dataset.). It includes options to:

    • See all the projects that use the dataset

    • Connect the dataset to an existing project

    • Open the dataset with a third-party application (integrated with data.world)

    • Create a new project using the dataset

dataset_details_page_toolbar_explore_option.png
Overview tab

When you open a dataset, you land on the Overview tab. The Overview section includes the Status of the dataset, if it is set, a short Description of the dataset, and the longer Summary information.

dataset_details_page_overviewtab_overviewsection.png

Much of the information on the Overview tab is configurable by your organization. In the following example, the More Information section contains an example of a completely configured metadata.

dataset_details_page_overviewtab_moreinfosection.png

The next section includes a list of related resources(tables, business term, and Analysis) for the dataset. You can add other related resources by using the Add a related resource button.

The Files section lists the number of files in a dataset and includes a preview of each file with options to manage the files.

The Related datasets section includes datasets that might be of interest to you.

dataset_details_page_overviewtab_related_section.png
Right sidebar on dataset tabs

An About this dataset section appears on the right side of the Overview and Activity tabs that contains both default and custom metadata about the dataset. In the following example, this metadata includes:

  • The sharing settings.

  • Who created the dataset and when.

  • The various people who manage the dataset (Steward and Tech Owner).

  • Who verified the dataset and when.

  • The tags used on the dataset.

  • The size of the dataset.

  • The data dictionary information about the dataset and a link to the dictionary.

  • A list of the queries that are part of the dataset.

Dataset_overview_right_middle.png

There are Related projects and Recent updates sections which list projects that use the dataset and recent updates to the dataset, respectively.

Dataset_overview_right_bottom_.png
Discussion

The Discussion tab captures all the communication between team members about the dataset. You get an option to Flag or Like comments in a discussion. Flagging notifies a moderator of potentially inappropriate content in the post.

dataset_details_page_discussion_tab.png

The contents on the Discussion tab are grouped into topics. Use the default topic or start a new one. Define the discussion topic, the category (General or To dos), and the first comment for the topic. Comments in topics are written in Markdown and can contain images and links.

dataset_details_page_discussion_addtopic.png
Activity

The Activity tab contains information about all events that occur with the dataset, and it also provides access to previous versions of the dataset. There are two sub-tabs, All Activity and Versions.

The All Activity sub-tab is the default, and list all the activity for the dataset.

dataset_details_page_activitytab_allactities.png

The Version sub-tab contains a list of previous versions. Click the Three dot menu to download a specific version.

dataset_details_page_activitytab_versions.png
Settings

The Settings tab is for administrators to manage the dataset. There are three sub-tabs, General, Access and ownership, and Webhooks.

The General sub-tab contains administrative settings, which include:

  • Automatic syncing options.

  • Additional notification recipients.

  • Archive options.

  • A Delete dataset button.

dataset_details_page_settingstab_general.png

The Access and ownership sub-tab allows you to manage access to the dataset and make datasets visible to community members.

dataset_details_page_settingstab_ownership.png

The Webhooks sub-tab is for configuring and managing webhooks to the dataset.