Skip to main content

Updating descriptions and metadata fields for Databricks tables and columns

Note

Users with Edit access to the resources will be able to do this task.

  1. Browse to a Databricks column or table harvested from your Databricks instance by the collector.

  2. On the Overview tab, in the About section, click the Edit button. If the column or table already has a description (comment), it will be displayed in the Description field.

  3. Make the required edits to the Description and Metadata fields and click Save. One of the following will happen:

    Important

    Only the metadata fields that are defined as part of the Databricks automation configuration are published to Databricks.

    1. If the Enable Automatic Updates setting is enabled for the automation, the changes are automatically published to Databricks when you click the Save button. However, if the setting is not enabled, users have to sync the changes manually.

    2. If the Enable Automatic Updates setting is not enabled for the automation, you will see a Publish Metadata to Databricks button on the resource page. Click the button to publish the changes with your Databricks instances.

      If the system encounters any errors while publishing changes to Databricks, you will receive notification emails informing you about the issue.

  4. Note that if any changes are made to the comments in Databricks they will not be synced back to data.world even when the collector re-runs. For this reason, we highly recommend that once you start updating comments (descriptions) in data.world, data.world becomes the source of truth for comments (descriptions) and this action is not performed in both systems.

  5. Note that if any changes are made to the tags in Databricks they will not be synced back to data.world even when the collector re-runs. For this reason, we highly recommend that once you start updating tags (metadata fields) in data.world, data.world becomes the source of truth for tags (metadata fields) and this action is not performed in both systems.

    Note

    You can create and manage your individual tags directly in Databricks. However, it is important to avoid creating tags with the same key names as those published from data.world metadata fields, as these will be overwritten by the automation.