Skip to main content

Platform records

Platform records contains records of user activities and interactions across various platforms, including transaction history, preferences, and engagement metrics. It also stores information on user profiles, such as account details and demographic information. This provides a comprehensive view of user behavior and preferences.

agents table

Contains details about the agents/users of data.world, including when they were created and last updated and other user profile information. Useful for managing and analyzing user and system agent profiles.

Table 1.

Column

Description

ACCOUNTSTANDING

Indicates the current status of an agent/user account on data.world.

ACTIVESUBSCRIPTIONID

Unique identifier for the active subscription of a specific agent/user.

AGENTID

The identifer for the individual user or org account; unique per siteid and environment.

ALLOWEDROLES

Specifies the roles that are allowed to the agent account.

BIO

Contains biological information of agents/users on data.world.

COMPANY

Represents the company name to which the org/user belongs.

CREATED

Timestamp marking when the org/user profile was created.

CREATEDBY

Identifier for the agent or system that created the org/user profile.

CUSTOMERAGENTID

Unique identifier for each customer.

DISPLAYNAME

The name displayed for the orgs/users in the system.

EMAIL

This is the email address associated with the user/org profile.

EMAILVERIFIED

Indicates if the agent's email address has been verified or not.

ENV

Identifies the data.world environment.

MANAGED

Indicates if the org account is managed or not.

NOTIFICATIONSEMAIL

Email address where the user/org receives system notifications.

ORGDETAILS

Contains information and settings associated with an org.

ORGMODE

Specifies whether the agent is an org. Values include True or (False/NULL).

SITEID

Unique 5-digit identifier for the data.world Private Instance.

TAGS

Category labels associated with agents - eg, science, entertainment, education.

UPDATED

Timestamp of the last update made to the agent/user profile.

UPDATEDBY

Records the user/agent who made the latest update to a profile.

VISIBILITY

Indicates the visibility status of the agent/user profile.

WEBSITE

The website associated with the user/agent profile.



agentshydrated table

Contains detailed information about agents, similar to the AGENTS table but with additional metrics. Tracks agent activity and attributes such as organization memberships, dataset and query counts, subscription details, and geographic data.

Table 2.

Column

Description

ACCOUNTSTANDING

Indicates the current status of an agent/user account on data.world.

ACTIVESUBSCRIPTIONID

Unique identifier for the active subscription of a specific agent/user.

AGENTID

The identifer for the individual user or org account; unique per siteid and environment.

ALLOWEDROLES

Specifies the roles that are allowed to the agent account.

BILLINGPLAN

Specifies the billing plan associated with each agent.

BIO

Contains biological information of agents/users on data.world.

CITY

Indicates the city associated with the agent, if available.

COMPANY

Represents the company name to which the org/user belongs.

COUNTRYCODE

Code identifying the country associated with the agent.

CREATED

Timestamp marking when the org/user profile was created.

CREATEDBY

Identifier for the agent or system that created the org/user profile.

CUSTOMERAGENTID

Unique identifier for each customer.

DISPLAYNAME

The name displayed for the orgs/users in the system.

EMAIL

This is the email address associated with the user/org profile.

EMAILVERIFIED

Indicates if the agent's email address has been verified or not.

ENV

Identifies the data.world environment.

MANAGED

Indicates if the org account is managed or not.

MAXFREEORGANIZATIONMEMBERS

Maximum number of free memberships in an organization.

MAXORGANIZATIONMEMBERS

Maximum number of memberships in an organization.

MUTED

Indicates if the specific agent in the AGENTSHYDRATED table is muted or not.

MUTED_BY_SPAMBOT

Indicates if the agent was muted by the spam bot detection.

NOTIFICATIONSEMAIL

Email address where the user/org receives system notifications.

NUMOPENQUERIES

Number of open queries associated with an agent.

NUMORGMEMBERSHIPS

Number of organization memberships held by each agent.

NUMPRIVATEDATASETS

Number of private datasets owned by the agent.

NUMPRIVATEDATASETSCONTRIBUTOROF

Count of private datasets to which the agent is a contributor.

NUMPRIVATEMEMBERS

Number of private members for an org.

NUMTOTALDATASETS

Total number of datasets owned by an agent.

NUMTOTALQUERIES

The total number of queries owned by an agent.

ORGADMINS

Stores identifiers for agents who are organization administrators.

ORGCREATOR

Identifies the creator of the org in the AGENTSHYDRATED table.

ORGDETAILS

Contains information and settings associated with an org.

ORGMEMBERS

Stores a list of agents that have membership to the org.

ORGMODE

Specifies whether the agent is an org. Values include True or (False/NULL).

ORGSADMINOF

Specifies the organizations that the agent is an administrator of.

ORGSMEMBEROF

Stores a list of orgs that the agent is a member of.

REACTIVATED

Indicates if the agent's profile was reactivated.

REGION

Specifies the geographical region where the agent operates.

SCIMDETAILS

SCIMMANAGED

SEATSUSED

Specifies the number of seats used.

SITEID

Unique 5-digit identifier for the data.world Private Instance.

TAGS

Category labels associated with agents - eg, science, entertainment, education.

UPDATED

Timestamp of the last update made to the agent/user profile.

UPDATEDBY

Records the user/agent who made the latest update to a profile.

VISIBILITY

Indicates the visibility status of the agent/user profile.

WEBSITE

The website associated with the user/agent profile.



authorizationrequests table

Captures authorization requests within the platform, detailing the resource, requester, grantee, and approver parties, along with the access level requested. Tracks the creation and update timestamps, visibility settings, and resource type and ID. Useful for auditing and managing access control across different environments and resources.

Table 3.

Column

Description

APPROVERPARTY

Identifies the party responsible for approving the authorization request.

CREATED

Timestamp indicating when the authorization request was created.

ENV

Specifies the data.world environment where the authorization request was made.

GRANTEEPARTY

Party to whom access rights are being granted in the authorization request.

LEVEL

Specifies the access level requested in the authorization request.

REQUESTERPARTY

Specifies the party that made the authorization request.

REQUESTID

Unique identifier for each authorization request in the platform.

RESOURCE

Specifies the resource involved in the authorization request.

RESOURCEID

Identfier for the resource involved in the request; composed of "owner/resource"; unique at the level of siteid, environment.

RESOURCETYPE

Specifies the type of resource involved in the authorization request.

SITEID

Unique 5-digit data.world site ID associated with this request.

UPDATED

Timestamp for when the authorization request was last updated.

VISIBILITY

Defines the accessibility and visibility level of an authorization request.



authorizations table

Tracks authorization details, including the parties involved, resources, access levels, and visibility settings. Captures the creation and update timestamps, along with the requester, approver, and any grant exclusions. Useful for managing and auditing access permissions across different resources and environments.

Table 4.

Column

Description

APPROVERPARTY

Specifies the party who approved any given authorization.

CREATED

Timestamp for when the authorization record was created.

CREATEDBY

The client and agent that created the authorization.

ENV

The data.world environment in which the authorization exists.

GRANTEXCLUSION

Indicates any exceptions on granted access permissions in authorizations.

LEVEL

Specifies the access level granted in the authorization, eg, READ, ADMIN, etc.

PARTY

The type and identifier for the party to which the authorization is granted.

PARTYID

The identifier for the party to which the authorization is granted.

PARTYTYPE

The type for the party to which the authorization is granted.

REQUESTERPARTY

The type and identifier for the party which requested the authorization.

RESOURCE

Specifies the type and ID for resource involved in the authorization.

RESOURCEID

Specifies the ID for resource involved in the authorization.

RESOURCETYPE

Specifies the type for resource involved in the authorization.

SITEID

Unique 5-digit data.world site ID associated with this request.

UPDATED

Timestamp marking the last update to an authorization record.

UPDATEDBY

Identifier of the user who last updated the authorization details.

VISIBILITY

Specifies the visibility settings for the authorization.



authorizations_retention table

Builds on the AUTHORIZATIONS view by maintaining a historical record of authorization details, including the involved parties, resources, access levels, and visibility settings. It captures and preserves creation and update timestamps, requester and approver information, and any grant exclusions over time.

Table 5.

Column

Description

APPROVERPARTY

Specifies the party who approved any given authorization.

CREATED

Timestamp for when the authorization record was created.

CREATEDBY

The client and agent that created the authorization.

ENV

The data.world environment in which the authorization exists.

GRANTEXCLUSION

Indicates any exceptions on granted access permissions in authorizations.

LEVEL

Specifies the access level granted in the authorization, eg, READ, ADMIN, etc.

PARTY

The type and identifier for the party to which the authorization is granted.

PARTYID

The identifier for the party to which the authorization is granted.

PARTYTYPE

The type for the party to which the authorization is granted.

REQUESTERPARTY

The type and identifier for the party which requested the authorization.

RESOURCE

Specifies the type and ID for resource involved in the authorization.

RESOURCEID

Specifies the ID for resource involved in the authorization.

RESOURCETYPE

Specifies the type for resource involved in the authorization.

SITEID

Unique 5-digit data.world site ID associated with this request.

UPDATED

Timestamp marking the last update to an authorization record.

UPDATEDBY

Identifier of the user who last updated the authorization details.

VISIBILITY

Specifies the visibility settings for the authorization.



clientauthorizations table

Tracks client authorizations, linking agents to specific clients. Records the creation and update timestamps, along with the responsible agents and clients. Useful for managing and auditing the authorization relationships between agents and clients.

Table 6.

Column

Description

AGENTID

Identifier for the agent involved in the client authorization. Unique at the level of environment and siteid.

CLIENTID

Unique identifier for a client in the CLIENTAUTHORIZATIONS table.

CREATED

Timestamp of when the client authorization was originally created.

CREATEDBY

Identifies the agentid for the agent who created the client authorization record.

ENV

The data.world environment.

SITEID

The unique 5-digit identifier of the data.world Private Instance.

UPDATED

Timestamp for the latest update made to a client authorization.

UPDATEDBY

Identifier for the agent who last updated the client authorization record.



comments table

Captures comments made on various catalog resources, including details about the topic, resource, comment ID, and associated metadata. Tracks the creation and update timestamps, visibility settings, and whether the comment was edited or migrated. Useful for monitoring discussions and feedback related to specific catalog resources.

Table 7.

Column

Description

COMMENTID

Unique identifier associated with each comment in the COMMENTS table.

CREATED

Timestamp indicating when the comment was created.

CREATEDBY

Identifier of the user who created the comment.

CREATEDWITHTOPIC

Boolean (1 or NULL). Indicates if the comment was created with a specific topic.

EDITED

Boolen (1 or NULL). Indicates if a comment was edited post-creation.

ENV

The data.world environment.

MIGRATED

Boolen (1 or NULL). Indicates if a comment has been migrated.

PUBLISHED

Boolen (1 or NULL). Indicates if the comment is published or not.

RESOURCE

Type and identifier for the resource that the comment is associate with ([resourcetype]:[resourceid]).

RESOURCEID

Identifier for the resource the comment is associated with ([owner]/[resourcename]).

RESOURCETYPE

Specifies the type of catalog resource that the comment is associated with.

RESOURCEVERSION

Indicates the version of the resource being commented on.

SITEID

The unique 5-digit identifier of the data.world Private Instance.

TOPIC

Indicates the topic related to a specific comment in the catalog resources.

TOPICID

Unique identifier for the topic correlated with a comment.

TOPICUUID

Universally unique identifier for the topic correlated with a comment.

UPDATED

Timestamp of the most recent update made to the comment.

UPDATEDBY

Identifier for the user who last updated the comment.



databasesources table

Tracks database sources within the platform, including details such as the source ID, creation and update timestamps, label, type (e.g., SNOWFLAKE), and SSL requirement status. Each entry is associated with an agent and environment, providing a record of when and by whom the database source was created and last modified. Useful for managing and auditing connections to external data sources.

Table 8.

Column

Description

AGENTID

Identifier for the user/org that who owns the database source.

CREATED

Timestamp of when the database source was created.

CREATEDBY

Indentifier for the client and agent that created the database source.

ENV

The data.world environment, eg, Prod.

LABEL

The user-defined identifier for the specific database source.

SITEID

Unique 5-digit identifier for the specific data.world Private Instance where the database connection resides.

SOURCEID

Unique identifier for each database source within the platform.

SSLREQUIRED

Indicates if Secure Sockets Layer (SSL) is required for the database source.

TYPE

Specifies the type of the database source (e.g., SNOWFLAKE).

UPDATED

Timestamp for the last update made to the database source entry.

UPDATEDBY

Identifier of the user who last updated the database source entry.



databasesources_retention table

This table extends the DATABASESOURCES table by recording the historical changes to database source details over time within the platform. It tracks essential information about database sources, including the source ID, label, type, SSL requirement status, and timestamps for creation and updates.

Table 9.

Column

Description

AGENTID

Unique identifier for the agent linked to a database source.

CREATED

Timestamp of when the database source record was initially created.

CREATEDBY

Identifier of the user that created the database source record.

ENV

The data.world environment.

LABEL

The user-defined identifier for the specific database source.

SITEID

Unique 5-digit identifier for the specific data.world Private Instance where the database connection resides.

SOURCEID

Unique identifier for each database source within the platform.

SSLREQUIRED

Indicates if Secure Sockets Layer (SSL) is required for the database source.

TYPE

Specifies the type of the database source (e.g., SNOWFLAKE).

UPDATED

Timestamp for when database source details were last updated.

UPDATEDBY

Identifier of the user who last updated the database source details.



datasetlinks table

Tracks links between datasets within the platform. Each entry records the connection between a dataset and its linked dataset, along with the creation and update timestamps. Useful for managing and auditing relationships between datasets across the platform.

Table 10.

Column

Description

CREATED

Timestamp marking when a dataset link was initially created.

DATASET

Identifier of the primary dataset in a tracked link between datasets.

ENV

The data.world environment.

LINKEDDATASET

Identifies the dataset linked to the primary dataset.

SITEID

Unique 5-digit identifier for the specific data.world Private Instance.

UPDATED

Timestamp indicating the last time the dataset link was updated.



datasetlinks_retention table

Extends the functionality of the DATASETLINKS view by preserving a historical record of links between datasets within the platform. Each entry documents the connections between datasets and their linked counterparts, along with the creation and update timestamps.

Table 11.

Column

Description

CREATED

Timestamp marking when the dataset link was initially created.

DATASET

Identifier of the primary dataset in a tracked link between datasets.

ENV

The data.world environment.

LINKEDDATASET

Identifies the dataset linked to the primary dataset.

SITEID

Unique 5-digit identifier for the specific data.world Private Instance.

UPDATED

Timestamp of when the dataset link record was last updated.



datasets table

Stores details about datasets within the platform, including the dataset ID, name, description (headline and abstract), creation and update timestamps, visibility status, and the number of data points. Tracks additional attributes such as licensing, tags, project status, and ingestion status. Useful for managing and auditing datasets, their versions, and their accessibility within the platform.

Table 12.

Column

Description

ABSTRACT

The user-generated summary detail of the dataset. May be text or markdown format.

AGENTID

Unique identifier for the agent (user or org) who owns the dataset.

CREATED

Timestamp indicating when the dataset was initially created.

CREATEDBY

Identifier for the user who created the dataset.

DATASETID

The formatted dataset name as represented in the DATASETS table. Along with the agentid (owner), composes the dataset identifier.

ENV

The data.world environment.

HEADLINE

The user-generated description of the dataset.

INGESTSTATUS

Indicates the status of dataset ingestion in the platform, eg "LOADED", "NEW", "SYSTEMERROR".

LATESTVERSIONID

Identifier for the most recent version of a dataset.

LICENSE

Indicates the type of license associated with the dataset.

NAME

The user-generated dataset name, which when formatted by the system, becomes the datasetid in the DATASETS table.

NOTIFICATIONSEMAIL

Email address to which dataset notifications are sent.

NUMDATAPOINTS

The quantitative size of a dataset, that is the count of individual data points it contains.

OFFLINE

Boolean (1 or NULL). Indicates whether the dataset is available offline or not.

PROJECT

Boolean (TRUE, FALSE, or NULL). Indicates whether the record is a project (TRUE) or a dataset (FALSE or NULL).

SITEID

Unique 5-digit identifier for the specific data.world Private Instance.

TAGS

Identifies categorical attributes given to datasets.

UPDATED

Timestamp of the latest update made to the dataset.

VISIBILITY

Indicates the visibility status of datasets within the platform (PRIVATE, OPEN, or DISCOVERABLE).



datasets_retention table

Extends the DATASETS view by preserving a historical record of changes to datasets over time within the platform. It stores comprehensive details about datasets, including the dataset ID, name, descriptions, visibility status, and various other attributes like licensing, tags, and ingestion status.

Table 13.

Column

Description

ABSTRACT

The user-generated summary detail of the dataset. May be text or markdown format.

AGENTID

Unique identifier for the agent (user or org) who owns the dataset.

CREATED

Timestamp indicating when the dataset was initially created.

CREATEDBY

Identifier for the user who created the dataset.

DATASETID

The formatted dataset name as represented in the DATASETS table. Along with the agentid (owner), composes the dataset identifier.

ENV

The data.world environment.

HEADLINE

The user-generated description of the dataset.

INGESTSTATUS

Indicates the status of dataset ingestion in the platform, eg "LOADED", "NEW", "SYSTEMERROR".

LATESTVERSIONID

Identifier for the most recent version of a dataset.

LICENSE

Indicates the type of license associated with the dataset.

NAME

The user-generated dataset name, which when formatted by the system, becomes the datasetid in the DATASETS table.

NOTIFICATIONSEMAIL

Email address to which dataset notifications are sent.

NUMDATAPOINTS

The quantitative size of a dataset, that is the count of individual data points it contains.

OFFLINE

Boolean (1 or NULL). Indicates whether the dataset is available offline or not.

PROJECT

Boolean (TRUE, FALSE, or NULL). Indicates whether the record is a project (TRUE) or a dataset (FALSE or NULL).

SITEID

Unique 5-digit identifier for the specific data.world Private Instance.

TAGS

Identifies categorical attributes given to datasets.

UPDATED

Timestamp of the latest update made to the dataset.

VISIBILITY

Indicates the visibility status of datasets within the platform (PRIVATE, OPEN, or DISCOVERABLE).



datasetsources table

Tracks synchronization details for datasets within the platform, including the dataset ID, sync status, timestamps for the last and next scheduled syncs, and synchronization intervals. Records versioning information and details about the synchronization type (e.g., auto, stream). Useful for managing and auditing the synchronization processes for datasets.

Table 14.

Column

Description

AGENTID

The identifier of the owner of the dataset set to autosync.

AUTOSYNCINTERVAL

Specifies the auto-synchronization interval for the datasets.

CREATED

Timestamp indicating when the dataset sync record was created.

DATASETID

The formatted dataset name - along with the agentid (owner) composes the dataset identifier in the DATASETSOURCES table.

DESIREDAUTOSYNC

The DESIREDAUTOSYNC column is not supported and does not provide any meaningful information.

ENV

The data.world environment.

GENERATION

Boolean (1 or NULL)

LASTSYNCFINISH

Timestamp indicating when the last dataset synchronization was completed.

LASTSYNCSTART

Timestamp indicating when the last dataset synchronization process started.

NEXTSCHEDULEDSYNC

Timestamp for the next scheduled dataset synchronization.

NEXTSCHEDULEDSYNCTYPE

Specifies the type of synchronization for the next scheduled dataset sync.

SITEID

Unique 5-digit identifier for the specific data.world Private Instance.

STREAMSYNCINTERVAL

Specifies the interval for stream synchronization of the dataset.

SYNCSTATUS

Indicates the current status of the dataset synchronization process, eg( "OK", "SYSTEMERROR").

UPDATED

Timestamp of the most recent update to the dataset autosync settings.

VERSION

Indicates the version of synchronized dataset within the platform.



datasetsources_retention table

Extends the functionality of the DATASETSOURCES view by preserving a historical record of synchronization details for datasets within the platform. This includes information such as dataset ID, sync status, last and next scheduled sync times, and synchronization intervals.

Table 15.

Column

Description

AGENTID

The identifier of the owner of the dataset set to autosync.

AUTOSYNCINTERVAL

Specifies the auto-synchronization interval for the datasets.

CREATED

Timestamp indicating when the dataset sync record was created.

DATASETID

The formatted dataset name - along with the agentid (owner) composes the dataset identifier in the DATASETSOURCES table.

DESIREDAUTOSYNC

The DESIREDAUTOSYNC column is not supported and does not provide any meaningful information.

ENV

The data.world environment.

GENERATION

Boolean (1 or NULL)

LASTSYNCFINISH

Timestamp indicating when the last dataset synchronization was completed.

LASTSYNCSTART

Timestamp indicating when the last dataset synchronization process started.

NEXTSCHEDULEDSYNC

Timestamp for the next scheduled dataset synchronization.

NEXTSCHEDULEDSYNCTYPE

Specifies the type of synchronization for the next scheduled dataset sync.

SITEID

Unique 5-digit identifier for the specific data.world Private Instance.

STREAMSYNCINTERVAL

Specifies the interval for stream synchronization of the dataset.

SYNCSTATUS

Indicates the current status of the dataset synchronization process, eg( "OK", "SYSTEMERROR").

UPDATED

Timestamp of the most recent update to the dataset autosync settings.

VERSION

Indicates the version of synchronized dataset within the platform.



datasetversions table

Tracks versions of datasets within the platform, capturing differences between each version, such as file and table counts, ingestion status, and any changes in the dataset’s content. Each version is linked to its previous version, allowing for detailed tracking of updates, modifications, or corrections made over time. This includes whether files were added, removed, or altered, and any changes to the structure or metadata of the dataset. The table also notes whether a version is marked for reaping or has been reaped, helping to manage the lifecycle of data and maintain version control.

Table 16.

Column

Description

AGENTDATASETID

The dataset identifier. Composed of the agentid (owner) and dataset name.

AGENTID

The identifier for the agentid (user or org) that owns that dataset.

CONTENTS2

Contents of the dataset version; an array of objects.

CREATED

Timestamp marking when a particular dataset version was created.

CREATEDBY

Identifier of the user who created the dataset.

DATASETID

The formatted dataset name - along with the agentid (owner) composes the dataset identifier in the DATASETVERSIONS table.

ENV

The data.world environment.

FILECOUNT

Indicates the count of files in each dataset version.

INGESTSTATUS

Indicates the status of dataset ingestion for the dataset version, eg "LOADED", "NEW", "SYSTEMERROR".

MARKEDFORREAPING

Boolean (true, false, NULL). Indicates if a dataset version is marked for removal or cleanup process.

PREVIOUSVERSIONID

ID corresponding to the previous version of the dataset.

REAPED

Boolean (true or NULL). Indicates if a dataset version has been reaped.

REAPEDPREVIOUSVERSIONID

ID of the previously reaped version of the dataset.

SITEID

Unique 5-digit identifier for the specific data.world Private Instance.

TABLECOUNT

Number of tables in each version of the dataset.

UPDATED

Indicates the timestamp when the dataset version was last updated.

UPDATEDBY

Identifier for the user who updated the dataset version.

VERSIONID

Unique identifier for each dataset version in the platform.



entitlements table

In this context, an entitlement refers to the specific access rights or privileges granted to an agent.

Tracks entitlements granted to agents within the platform, including details such as entitlement ID, associated products, and the start and end dates of the entitlement. Each entry records the creation and update timestamps, the order of the entitlement, and the source of the entitlement (e.g., offering subscription). Useful for managing and auditing the entitlements and access rights granted to users across the platform.

Table 17.

Column

Description

AGENTID

Unique identifier for agents granted with specific access rights.

CREATED

Timestamp of when the entitlement was created.

CREATEDBY

Identifier of the agent who created the entitlement.

ENDDATE

The date the access right or privilege granted to an agent ends.

ENTITLEMENTID

Unique identifier for each entitlement granted to agents.

ENTITLEMENTITEMS

An array of items related to the entitlement including ID and quantity.

ENV

The data.world environment

ORDER

An integer (1, 2, or 3) that pertains to order.

SITEID

Unique 5-digit identifier for the specific data.world Private Instance.

SOURCE

References associated entry in the OFFERINGSUBCRIPTIONS table.

STARTDATE

The date the agent's access rights or privileges begin.

UPDATED

Timestamp indicating the most recent update to an entitlement.

UPDATEDBY

Identifier for the agent who last updated the entitlement record.



entitlements_retention table

This table extends the functionality of the ENTITLEMENTS table by providing a historical view of entitlements as they change over time. Records each modification of an entitlement, capturing details such as the entitlement ID, associated products, and timestamps for creation, updates, and the validity period. This data is helpful for auditing and managing the lifecycle of entitlements.

Table 18.

Column

Description

AGENTID

Unique identifier for agents granted with specific access rights.

CREATED

Timestamp of when the entitlement was created.

CREATEDBY

Timestamp of when the entitlement was created.

ENDDATE

The date the access right or privilege granted to an agent ends.

ENTITLEMENTID

Unique identifier for each entitlement granted to agents.

ENTITLEMENTITEMS

An array of items related to the entitlement including ID and quantity.

ENV

The data.world environment.

ORDER

An integer (1, 2, or 3) that pertains to order.

SITEID

Unique 5-digit identifier for the specific data.world Private Instance.

SOURCE

References associated entry in the OFFERINGSUBCRIPTIONS table.

STARTDATE

The date the agent's access rights or privileges begin.

UPDATED

Timestamp indicating the most recent update to an entitlement.

UPDATEDBY

Identifier for the agent who last updated the entitlement record.



follows table

Tracks instances where agents “follow” specific resources within the platform so they can be aware of changes to that resource. Each entry includes the agent ID, the resource being followed, and associated timestamps. Useful for monitoring user engagement and tracking which resources are being followed by different agents.

Table 19.

Column

Description

AGENTID

Unique identifier of the follower agent (user).

CREATED

Timestamp indicating when a follow was created by an agent.

ENV

The data.world environment.

FOLLOWING

Unique identifier of the followed agent (user).

SITEID

Unique 5-digit identifier for the data.world Private Instance.

UPDATED

Timestamp indicating when the follow action was last updated.



insights table

Stores details about curated interpretations or analyses of data, known as “Insights,” which are linked to specific datasets or projects. Each entry tracks the creation, updates, and associations of these insights within the platform. Insights provide users with a quick understanding of key data patterns or findings, making them essential for collaboration and decision-making.

Table 20.

Column

Description

CREATED

Timestamp of when the Insight record was initially created.

CREATEDBY

Identifier (agentid) of the user who created the insight.

ENV

The data.world environment.

INSIGHTID

Unique identifier for each insight within the INSIGHTS table.

LATESTVERSIONID

ID of the latest version of the insight in the platform.

PARTY

Identifies the user who created the insight.

PARTYID

The identifier (agentid) of the user who created the insight.

PARTYTYPE

The value for this column is "agent" for entries in the INSIGHTS table.

RESOURCE

Identifies the resourcetype and resourceId for the resources associated with insights in the platform.

RESOURCEID

The identifier for the resources associated with insights in the platform. Composed of the owner and resource name.

RESOURCETYPE

The value for this column is "dataset" for all entries in the INSIGHTS table.

SITEID

Unique 5-digit identifier for the data.world Private Instance where the insight is located.

UPDATED

Timestamp of the most recent update applied to the Insight record.

UPDATEDBY

Identifier of the user who last updated the insight.



insightversions table

Tracks different versions of “insights,” which are interpretations or analyses linked to specific datasets or projects. Each record captures details such as the Insight ID, version ID, associated resource, and timestamps for creation and updates. This table is essential for understanding the evolution of insights over time, allowing users to track changes and modifications to key analyses or interpretations within the platform.

Table 21.

Column

Description

CREATED

Timestamp when a specific insight version was created.

CREATEDBY

Identifier (agentid) for the user who created the specific Insight version.

ENV

The data.world environment.

INSIGHTID

Unique identifier for each insight in the INSIGHTVERSIONS table.

MODIFYINGAGENTID

Identifier of the agent responsible for modifying a particular insight version

PREVIOUSVERSIONID

ID of the immediate prior version of the current insight.

RESOURCE

Identifies the resourcetype and resourceId for the resources associated with insights in the platform.

RESOURCEID

The identifier for the resources associated with insights in the platform. Composed of the owner and resource name.

RESOURCETYPE

The value for this column is "dataset" for all entries in the INSIGHTVERSIONS table.

SITEID

Unique 5-digit identifier for the data.world Private Instance where the file is located.

UPDATED

Timestamp of the most recent update applied to the INSIGHTVERSION record.

UPDATEDBY

Identifier of the user who last updated the insight version.

VERSIONID

Unique ID for each version of insight in the INSIGHTVERSIONS table.



invites table

Tracks invitations sent within the platform, including the invite ID, email, and the invited agent’s ID. Each entry records the status of the invite, along with the creation and update timestamps. This data is useful for managing and auditing invitation activities, providing insights into collaboration as it shows users inviting others to various resources in the catalog.

Table 22.

Column

Description

CREATED

Timestamp showing when the invite was created.

CREATEDBY

Identifier (agentid) of the user who created the invite in the platform.

EMAIL

The email address to which an invite on the platform is sent.

ENV

The data.world environment.

INVITEDAGENTID

The identifier (agentid) of the agent who has been invited on the platform.

INVITEID

Unique identifier corresponding to each invitation sent within the platform.

SITEID

Unique 5-digit identifier for the data.world Private Instance.

STATUS

Represents the current status of a specific invitation, eg, SENT, USED.

UPDATED

Timestamp of the last update made to the invite's status.

UPDATEDBY

Identifier (agentid) of the user who last updated the invite record.



likes table

Tracks “likes” made by agents on various resources within the platform. Includes details such as the agent (party) who liked the resource, the resource itself, creation and update timestamps, and the type of resource (e.g., dataset). Useful for monitoring user engagement and interactions with platform content.

Table 23.

Column

Description

CREATED

Timestamp indicating when the 'like'(bookmark) was recorded on the platform.

CREATEDBY

Identifier (agentid) of the user who bookmarked the resource.

ENV

The data.world environment.

PARTY

Identifies the user who bookmarked a resource within the platform.

PARTYID

Agentid of the user who bookmarked a resource within the platform.

PARTYTYPE

The value for this column is "agent" for all entries in the LIKES table.

RESOURCE

Identifies the resourcetype and resourceId for the resources associated with bookmarks in the platform.

RESOURCEID

The identifier for the resources associated with bookmarks in the platform. Composed of the owner and resource name.

RESOURCETYPE

Identifies the type of resource that received the bookmark eg, datasetinsight, dataset, comment, catalogResource.

SITEID

Unique 5-digit identifier for the data.world Private Instance.

UPDATED

Timestamp noting the most recent update to the 'like' record.

UPDATEDBY

Identifier for the agent who last updated the 'like' record.



offerings table

Tracks various offerings within the platform, including details such as offering ID, name, description, and associated products. Records information about whether the offering requires payment, the associated Stripe plan, and the creation and update timestamps.

Table 24.

Column

Description

AGENTTYPE

Specifies if the offering is applicable at the org-level or the individual user-level.

CREATED

Timestamp indicating when the offering record was created in the system.

CREATEDBY

Identifier for the user who created the offering record.

DEFAULTOFFERING

Boolean. Indicates if the offering is the default option.

DESCRIPTION

Textual details about the specific offering.

ENV

Identifies the data.world environment to which the offering applies.

NAME

The name given to the the specific offering.

OFFERINGID

Unique identifier for the offerings within the OFFERINGS table.

OFFERINGSLUG

Identifier for each offering within the system. Unique at the environment level.

PRODUCTIDS

Reference to unique identifiers of products associated with each offering.

REQUIRESPAYMENT

Indicates if the offering requires a payment or not.

SITEID

Unique 5-digit identifier for the data.world Private Instance.

STARTDATE

The date when the offering was made available in the system.

STRIPEPLANID

Identifier for the associated Stripe plan for the particular offering.

UPDATED

TimeStamp of when the offering's record was last updated.

UPDATEDBY

Identifier for the user who last updated the offering record.



offerings_retention table

An extension of the OFFERINGS table that provides a historical view of the various offerings within the platform. This table captures the evolution of each offering over time, including details such as the offering ID, name, description, and associated products. It also tracks information about payment requirements, the associated Stripe plan, and timestamps for creation and updates.

Table 25.

Column

Description

AGENTTYPE

Specifies if the offering is applicable at the org-level or the individual user-level.

CREATED

Timestamp indicating when the offering record was created in the system.

CREATEDBY

Identifier for the user who created the offering record.

DEFAULTOFFERING

Boolean. Indicates if the offering is the default option.

DESCRIPTION

Textual details about the specific offering.

ENV

Identifies the data.world environment to which the offering applies.

NAME

The name given to the the specific offering.

OFFERINGID

Unique identifier for the offerings within the OFFERINGS table.

OFFERINGSLUG

Identifier for each offering within the system. Unique at the environment level.

PRODUCTIDS

Reference to unique identifiers of products associated with each offering.

REQUIRESPAYMENT

Indicates if the offering requires a payment or not.

SITEID

Unique 5-digit identifier for the data.world Private Instance.

STARTDATE

The date when the offering was made available in the system.

STRIPEPLANID

Identifier for the associated Stripe plan for the particular offering.

UPDATED

TimeStamp of when the offering's record was last updated.

UPDATEDBY

Identifier for the user who last updated the offering record.



offeringsubscriptions table

Tracks subscriptions to specific offerings within the platform, including details such as the subscription ID, offering ID, start and end dates, and the associated agent. Records creation and update timestamps, along with the responsible agents. Useful for managing and auditing the lifecycle of offering subscriptions across the platform.

Table 26.

Column

Description

AGENTID

Identifier of the agent associated with the offering subscription. Unique at the environment-level and siteid-level.

CREATED

date string 'yyyy-mm-dd' If org has preexisting entry in offeringsubscriptions table (created via manual configuration in admin tool), then select value from offeringsubscriptions; else, select value from siteid_subscriptionplan_map.csv seed.

CREATEDBY

If org has preexisting entry in offeringsubscriptions table (created via manual configuration in admin tool), then select value from offeringsubscriptions; else, select value from siteid_subscriptionplan_map.csv seed.

ENDDATE

date string 'yyyy-mm-dd'; null if plan is currently active; If org has preexisting entry in offeringsubscriptions table (created via manual configuration in admin tool), then select value from offeringsubscriptions; else, select value from siteid_subscriptionplan_map.csv seed.

ENV

Identifies the data.world environment in which the offering subscription operates.

OFFERINGID

If org has preexisting entry in offeringsubscriptions table (created via manual configuration in admin tool), then select value from offeringsubscriptions; else, select value from siteid_subscriptionplan_map.csv seed.

SITEID

Unique 5-digit identifier for the data.world Private Instance.

STARTDATE

date string 'yyyy-mm-dd'. If org has preexisting entry in offeringsubscriptions table (created via manual configuration in admin tool), then select value from offeringsubscriptions; else, select value from siteid_subscriptionplan_map.csv seed.

SUBSCRIPTIONID

1) If the entry (identified by composite key formed by agentid, sited), has a subscriptionid uuid code already present in offeringsubscriptions table, we use the subscriptionid code present in the offeringsubscriptions table (see ifnull statement in offeringsubscriptions_plan_automation macro). In the case that the subscriptionid code already exists. we can assume that it was created automatically when someone manually gave the org a plan via the admin tool. 2) If the org does not have an entry in offeringsubscriptions, then we derive the code dynamically using a deterministic application of the uuid_string SnowFlake function.

UPDATED

date string 'yyyy-mm-dd' If org has preexisting entry in offeringsubscriptions table (created via manual configuration in admin tool), then select value from offeringsubscriptions; else, select value from siteid_subscriptionplan_map.csv seed.

UPDATEDBY

If org has preexisting entry in offeringsubscriptions table (created via manual configuration in admin tool), then select value from offeringsubscriptions; else, select value from siteid_subscriptionplan_map.csv seed.



products table

Stores details about products available within the platform, including product ID, name, category, and associated grants or permissions.

Table 27.

Column

Description

CATEGORY

Specifies the category to which each product belongs.

CREATED

Date and time when the product record was created.

CREATEDBY

Identifier for the user who created the product entry.

ENV

The data.world environment to which the product record applies.

GRANTS

Specifies grants or permissions associated with each product.

NAME

The name of the product available within the platform.

PRODUCTID

Unique identifier for each product within the platform.

SITEID

Unique 5-digit identifier for the data.world Private Instance to which the product applies.

UPDATED

The date/time when product information was last updated.

UPDATEDBY

Identifier (agentid) of the individual who last updated the product detail.



products_retention table

Extends the PRODUCTS view by maintaining a historical record of changes to products over time within the platform. It stores detailed information about products, including product ID, name, category, and any associated grants or permissions.

Table 28.

Column

Description

CATEGORY

Specifies the category to which each product belongs.

CREATED

Date and time when the product record was created.

CREATEDBY

Identifier for the user who created the product entry.

ENV

The data.world environment to which the product record applies.

GRANTS

Specifies grants or permissions associated with each product.

NAME

The name of the product available within the platform.

PRODUCTID

Unique identifier for each product within the platform.

SITEID

Unique 5-digit identifier for the data.world Private Instance to which the product applies.

UPDATED

The date/time when product information was last updated.

UPDATEDBY

Identifier (agentid) of the individual who last updated the product detail.



queries table

Stores details about queries run within the platform, including the query ID, associated resource, name, and latest version ID. Tracks creation and update timestamps, along with publication status and the related resource type. Each query is linked to a specific agent/user and can be categorized by type (e.g., SQL, SPARQL). Useful for managing and auditing queries associated with various datasets and resources.

Table 29.

Column

Description

CREATED

Timestamp indicating when the query was created in the platform.

CREATEDBY

Indicates the agentid of the user who created the query.

ENV

The data.world environment to which the query record applies.

LATESTVERSIONID

Represents the ID of the latest version of a specific query.

NAME

The name given to each query saved within the platform.

PARTY

Identifies the user who saved a specific query.

PARTYID

The agentid of the user who saved a specific query.

PARTYTYPE

The type of the agent that saved the query. The value for this column is "agent" for all entries in the QUERIES table.

PUBLISHED

Boolean. Indicates if the query has been made viewable to anyone who can access the project.

PUBLISHEDRESOURCE

Specifies the identifier of the resource to which the query was saved and published. NULL if unpublished.

QUERYID

Unique identifier for each query saved within the platform.

RESOURCE

Specifies the type and identifier of the resource where the query is saved.

RESOURCEID

Specifies the identifier of the resource where the query is saved.

RESOURCETYPE

Specifies the type of the resource where the query is saved.

SITEID

Unique 5-digit identifier for the data.world Private Instance.

TYPE

Identifies the type of query (e.g., SQL, SPARQL).

UPDATED

Timestamp marking the most recent update to the specific query.

UPDATEDBY

Identifier (agentid) of the user who last updated the query.



queries_retention table

Extends the QUERIES table by capturing the historical changes to query details over time within the platform. It stores comprehensive information about queries, including the query ID, associated resource, name, and version history. The historical aspect of this table is helpful for auditing and managing queries, providing a complete record of how queries evolve and are utilized within the platform.

Table 30.

Column

Description

CREATED

Timestamp indicating when the query was created in the platform.

CREATEDBY

Indicates the agentid of the user who created the query.

ENV

The data.world environment to which the query record applies.

LATESTVERSIONID

Represents the ID of the latest version of a specific query.

NAME

The name given to each query saved within the platform.

PARTY

Identifies the user who saved a specific query.

PARTYID

The agentid of the user who saved a specific query.

PARTYTYPE

The type of the agent that saved the query. The value for this column is "agent" for all entries in the QUERIES table.

PUBLISHED

Boolean. Indicates if the query has been made viewable to anyone who can access the project.

PUBLISHEDRESOURCE

Specifies the identifier of the resource to which the query was saved and published. NULL if unpublished.

QUERYID

Unique identifier for each query saved within the platform.

RESOURCE

Specifies the type and identifier of the resource where the query is saved.

RESOURCEID

Specifies the identifier of the resource where the query is saved.

RESOURCETYPE

Specifies the type of the resource where the query is saved.

SITEID

Unique 5-digit identifier for the data.world Private Instance.

TYPE

Identifies the type of query (e.g., SQL, SPARQL).

UPDATED

Timestamp marking the most recent update to the specific query.

UPDATEDBY

Identifier (agentid) of the user who last updated the query.



queryversions table

Tracks different versions of queries within the platform, including details such as the query ID, version ID, modifying agent, name, and type. Records creation and update timestamps, along with information about the previous version, publication status, and visibility. Useful for managing and auditing the evolution of queries associated with specific resources.

Table 31.

Column

Description

CREATED

Timestamp of when the query version was created.

CREATEDBY

Identifier (agentid) for the user who created the query version.

ENV

The data.world environment.

MODIFYINGAGENTID

ID of the agent that modified the query version.

NAME

Name or title assigned to the specific query version.

PREVIOUSVERSIONID

ID of the previous version of the query in the platform.

PUBLISHED

Boolean. Indicates if the query version has been made viewable to anyone who can access the project.

QUERYID

Unique identifier for each query within the QUERYVERSIONS table.

RESOURCE

Specifies the type and identifier of the resource where the query is saved.

RESOURCEID

Specifies the identifier of the resource where the query is saved.

RESOURCETYPE

Specifies the type of the resource where the query is saved.

SITEID

Unique 5-digit identifier for the data.world Private Instance.

TYPE

Identifies the type of query (for example, SQL, SPARQL).

UPDATED

Timestamp marking the most recent update to the specific query.

UPDATEDBY

Identifier (agentid) of the user who last updated the query.

VERSIONID

Unique identifier for the version of a specific query.

VISIBILITY

Indicates the accessibility level of a specific query version.



subscriptions table

Tracks subscriptions created by agents within the platform, including details such as the subscriber ID, entity being subscribed to, subscription type (e.g., EMAIL), and the reason for the subscription (e.g., LIKE, FOLLOW). Records the creation and update timestamps, as well as the observable level of the subscription. Useful for managing and auditing user subscriptions to various platform entities.

Table 32.

Column

Description

CREATED

Timestamp indicating when the subscription record was created.

CREATEDBY

Identifier (agentid) of the user who created the subscription record.

ENTITYID

Unique identifier for the entity being subscribed to.

ENV

The data.world environment.

LEVEL

Specifies the observable level of a subscription within the platform.

OBSERVABLE

Specifies the resource or user account to which is subscribed to.

REASON

Specifies the reason for the user's subscription (e.g., LIKE, FOLLOW).

SITEID

Unique 5-digit identifier for the data.world Private Instance.

SUBSCRIBERID

Type ("agent") and Identifier (agentid) for the subscriber in the SUBSCRIPTIONS table.

SUBSCRIPTIONID

Unique identifier for each user's subscription within the platform.

TYPE

Specifies the type of subscription (e.g., EMAIL) for each record.

UPDATED

Timestamp indicating when the subscription was last updated.

UPDATEDBY

Identifier (agentid) of the user who last updated the subscription.



subscriptions_retention table

Extends the SUBSCRIPTIONS view by maintaining a historical record of subscription activities created by agents within the platform. This includes details such as the subscriber ID, entity being subscribed to, subscription type (e.g., EMAIL), and the reason for the subscription (e.g., LIKE, FOLLOW).

Table 33.

Column

Description

CREATED

Timestamp when the subscription activity was created.

CREATEDBY

Identifier (agentid) of the user who created the subscription.

ENTITYID

Unique identifier for the entity being subscribed to.

ENV

The data.world environment.

LEVEL

Specifies the observable level of a subscription within the platform.

OBSERVABLE

Specifies the resource or user account to which is subscribed to.

REASON

Specifies the reason for the user's subscription (e.g., LIKE, FOLLOW).

SITEID

Unique 5-digit identifier for the data.world Private Instance.

SUBSCRIBERID

Type ("agent") and Identifier (agentid) for the subscriber in the SUBSCRIPTIONS table.

SUBSCRIPTIONID

Unique identifier for each user's subscription within the platform.

TYPE

Specifies the type of subscription (e.g., EMAIL) for each record.

UPDATED

Timestamp indicating when the subscription was last updated.

UPDATEDBY

Identifier (agentid) of the user who last updated the subscription.