Platform records
Platform records contains records of user activities and interactions across various platforms, including transaction history, preferences, and engagement metrics. It also stores information on user profiles, such as account details and demographic information. This provides a comprehensive view of user behavior and preferences.
agents table
Contains details about the agents/users of data.world, including when they were created and last updated and other user profile information. Useful for managing and analyzing user and system agent profiles.
Column | Description |
---|---|
ACCOUNTSTANDING | Indicates the current status of an agent/user account on data.world. |
ACTIVESUBSCRIPTIONID | Unique identifier for the active subscription of a specific agent/user. |
AGENTID | The identifer for the individual user or org account; unique per siteid and environment. |
ALLOWEDROLES | Specifies the roles that are allowed to the agent account. |
BIO | Contains biological information of agents/users on data.world. |
COMPANY | Represents the company name to which the org/user belongs. |
CREATED | Timestamp marking when the org/user profile was created. |
CREATEDBY | Identifier for the agent or system that created the org/user profile. |
CUSTOMERAGENTID | Unique identifier for each customer. |
DISPLAYNAME | The name displayed for the orgs/users in the system. |
This is the email address associated with the user/org profile. | |
EMAILVERIFIED | Indicates if the agent's email address has been verified or not. |
ENV | Identifies the data.world environment. |
MANAGED | Indicates if the org account is managed or not. |
NOTIFICATIONSEMAIL | Email address where the user/org receives system notifications. |
ORGDETAILS | Contains information and settings associated with an org. |
ORGMODE | Specifies whether the agent is an org. Values include True or (False/NULL). |
SITEID | Unique 5-digit identifier for the data.world Private Instance. |
TAGS | Category labels associated with agents - eg, science, entertainment, education. |
UPDATED | Timestamp of the last update made to the agent/user profile. |
UPDATEDBY | Records the user/agent who made the latest update to a profile. |
VISIBILITY | Indicates the visibility status of the agent/user profile. |
WEBSITE | The website associated with the user/agent profile. |
agentshydrated table
Contains detailed information about agents, similar to the AGENTS table but with additional metrics. Tracks agent activity and attributes such as organization memberships, dataset and query counts, subscription details, and geographic data.
Column | Description |
---|---|
ACCOUNTSTANDING | Indicates the current status of an agent/user account on data.world. |
ACTIVESUBSCRIPTIONID | Unique identifier for the active subscription of a specific agent/user. |
AGENTID | The identifer for the individual user or org account; unique per siteid and environment. |
ALLOWEDROLES | Specifies the roles that are allowed to the agent account. |
BILLINGPLAN | Specifies the billing plan associated with each agent. |
BIO | Contains biological information of agents/users on data.world. |
CITY | Indicates the city associated with the agent, if available. |
COMPANY | Represents the company name to which the org/user belongs. |
COUNTRYCODE | Code identifying the country associated with the agent. |
CREATED | Timestamp marking when the org/user profile was created. |
CREATEDBY | Identifier for the agent or system that created the org/user profile. |
CUSTOMERAGENTID | Unique identifier for each customer. |
DISPLAYNAME | The name displayed for the orgs/users in the system. |
This is the email address associated with the user/org profile. | |
EMAILVERIFIED | Indicates if the agent's email address has been verified or not. |
ENV | Identifies the data.world environment. |
MANAGED | Indicates if the org account is managed or not. |
MAXFREEORGANIZATIONMEMBERS | Maximum number of free memberships in an organization. |
MAXORGANIZATIONMEMBERS | Maximum number of memberships in an organization. |
MUTED | Indicates if the specific agent in the AGENTSHYDRATED table is muted or not. |
MUTED_BY_SPAMBOT | Indicates if the agent was muted by the spam bot detection. |
NOTIFICATIONSEMAIL | Email address where the user/org receives system notifications. |
NUMOPENQUERIES | Number of open queries associated with an agent. |
NUMORGMEMBERSHIPS | Number of organization memberships held by each agent. |
NUMPRIVATEDATASETS | Number of private datasets owned by the agent. |
NUMPRIVATEDATASETSCONTRIBUTOROF | Count of private datasets to which the agent is a contributor. |
NUMPRIVATEMEMBERS | Number of private members for an org. |
NUMTOTALDATASETS | Total number of datasets owned by an agent. |
NUMTOTALQUERIES | The total number of queries owned by an agent. |
ORGADMINS | Stores identifiers for agents who are organization administrators. |
ORGCREATOR | Identifies the creator of the org in the AGENTSHYDRATED table. |
ORGDETAILS | Contains information and settings associated with an org. |
ORGMEMBERS | Stores a list of agents that have membership to the org. |
ORGMODE | Specifies whether the agent is an org. Values include True or (False/NULL). |
ORGSADMINOF | Specifies the organizations that the agent is an administrator of. |
ORGSMEMBEROF | Stores a list of orgs that the agent is a member of. |
REACTIVATED | Indicates if the agent's profile was reactivated. |
REGION | Specifies the geographical region where the agent operates. |
SCIMDETAILS | |
SCIMMANAGED | |
SEATSUSED | Specifies the number of seats used. |
SITEID | Unique 5-digit identifier for the data.world Private Instance. |
TAGS | Category labels associated with agents - eg, science, entertainment, education. |
UPDATED | Timestamp of the last update made to the agent/user profile. |
UPDATEDBY | Records the user/agent who made the latest update to a profile. |
VISIBILITY | Indicates the visibility status of the agent/user profile. |
WEBSITE | The website associated with the user/agent profile. |
authorizationrequests table
Captures authorization requests within the platform, detailing the resource, requester, grantee, and approver parties, along with the access level requested. Tracks the creation and update timestamps, visibility settings, and resource type and ID. Useful for auditing and managing access control across different environments and resources.
Column | Description |
---|---|
APPROVERPARTY | Identifies the party responsible for approving the authorization request. |
CREATED | Timestamp indicating when the authorization request was created. |
ENV | Specifies the data.world environment where the authorization request was made. |
GRANTEEPARTY | Party to whom access rights are being granted in the authorization request. |
LEVEL | Specifies the access level requested in the authorization request. |
REQUESTERPARTY | Specifies the party that made the authorization request. |
REQUESTID | Unique identifier for each authorization request in the platform. |
RESOURCE | Specifies the resource involved in the authorization request. |
RESOURCEID | Identfier for the resource involved in the request; composed of "owner/resource"; unique at the level of siteid, environment. |
RESOURCETYPE | Specifies the type of resource involved in the authorization request. |
SITEID | Unique 5-digit data.world site ID associated with this request. |
UPDATED | Timestamp for when the authorization request was last updated. |
VISIBILITY | Defines the accessibility and visibility level of an authorization request. |
authorizations table
Tracks authorization details, including the parties involved, resources, access levels, and visibility settings. Captures the creation and update timestamps, along with the requester, approver, and any grant exclusions. Useful for managing and auditing access permissions across different resources and environments.
Column | Description |
---|---|
APPROVERPARTY | Specifies the party who approved any given authorization. |
CREATED | Timestamp for when the authorization record was created. |
CREATEDBY | The client and agent that created the authorization. |
ENV | The data.world environment in which the authorization exists. |
GRANTEXCLUSION | Indicates any exceptions on granted access permissions in authorizations. |
LEVEL | Specifies the access level granted in the authorization, eg, READ, ADMIN, etc. |
PARTY | The type and identifier for the party to which the authorization is granted. |
PARTYID | The identifier for the party to which the authorization is granted. |
PARTYTYPE | The type for the party to which the authorization is granted. |
REQUESTERPARTY | The type and identifier for the party which requested the authorization. |
RESOURCE | Specifies the type and ID for resource involved in the authorization. |
RESOURCEID | Specifies the ID for resource involved in the authorization. |
RESOURCETYPE | Specifies the type for resource involved in the authorization. |
SITEID | Unique 5-digit data.world site ID associated with this request. |
UPDATED | Timestamp marking the last update to an authorization record. |
UPDATEDBY | Identifier of the user who last updated the authorization details. |
VISIBILITY | Specifies the visibility settings for the authorization. |
authorizations_retention table
Builds on the AUTHORIZATIONS view by maintaining a historical record of authorization details, including the involved parties, resources, access levels, and visibility settings. It captures and preserves creation and update timestamps, requester and approver information, and any grant exclusions over time.
Column | Description |
---|---|
APPROVERPARTY | Specifies the party who approved any given authorization. |
CREATED | Timestamp for when the authorization record was created. |
CREATEDBY | The client and agent that created the authorization. |
ENV | The data.world environment in which the authorization exists. |
GRANTEXCLUSION | Indicates any exceptions on granted access permissions in authorizations. |
LEVEL | Specifies the access level granted in the authorization, eg, READ, ADMIN, etc. |
PARTY | The type and identifier for the party to which the authorization is granted. |
PARTYID | The identifier for the party to which the authorization is granted. |
PARTYTYPE | The type for the party to which the authorization is granted. |
REQUESTERPARTY | The type and identifier for the party which requested the authorization. |
RESOURCE | Specifies the type and ID for resource involved in the authorization. |
RESOURCEID | Specifies the ID for resource involved in the authorization. |
RESOURCETYPE | Specifies the type for resource involved in the authorization. |
SITEID | Unique 5-digit data.world site ID associated with this request. |
UPDATED | Timestamp marking the last update to an authorization record. |
UPDATEDBY | Identifier of the user who last updated the authorization details. |
VISIBILITY | Specifies the visibility settings for the authorization. |
clientauthorizations table
Tracks client authorizations, linking agents to specific clients. Records the creation and update timestamps, along with the responsible agents and clients. Useful for managing and auditing the authorization relationships between agents and clients.
Column | Description |
---|---|
AGENTID | Identifier for the agent involved in the client authorization. Unique at the level of environment and siteid. |
CLIENTID | Unique identifier for a client in the CLIENTAUTHORIZATIONS table. |
CREATED | Timestamp of when the client authorization was originally created. |
CREATEDBY | Identifies the agentid for the agent who created the client authorization record. |
ENV | The data.world environment. |
SITEID | The unique 5-digit identifier of the data.world Private Instance. |
UPDATED | Timestamp for the latest update made to a client authorization. |
UPDATEDBY | Identifier for the agent who last updated the client authorization record. |
comments table
Captures comments made on various catalog resources, including details about the topic, resource, comment ID, and associated metadata. Tracks the creation and update timestamps, visibility settings, and whether the comment was edited or migrated. Useful for monitoring discussions and feedback related to specific catalog resources.
Column | Description |
---|---|
COMMENTID | Unique identifier associated with each comment in the COMMENTS table. |
CREATED | Timestamp indicating when the comment was created. |
CREATEDBY | Identifier of the user who created the comment. |
CREATEDWITHTOPIC | Boolean (1 or NULL). Indicates if the comment was created with a specific topic. |
EDITED | Boolen (1 or NULL). Indicates if a comment was edited post-creation. |
ENV | The data.world environment. |
MIGRATED | Boolen (1 or NULL). Indicates if a comment has been migrated. |
PUBLISHED | Boolen (1 or NULL). Indicates if the comment is published or not. |
RESOURCE | Type and identifier for the resource that the comment is associate with ([resourcetype]:[resourceid]). |
RESOURCEID | Identifier for the resource the comment is associated with ([owner]/[resourcename]). |
RESOURCETYPE | Specifies the type of catalog resource that the comment is associated with. |
RESOURCEVERSION | Indicates the version of the resource being commented on. |
SITEID | The unique 5-digit identifier of the data.world Private Instance. |
TOPIC | Indicates the topic related to a specific comment in the catalog resources. |
TOPICID | Unique identifier for the topic correlated with a comment. |
TOPICUUID | Universally unique identifier for the topic correlated with a comment. |
UPDATED | Timestamp of the most recent update made to the comment. |
UPDATEDBY | Identifier for the user who last updated the comment. |
databasesources table
Tracks database sources within the platform, including details such as the source ID, creation and update timestamps, label, type (e.g., SNOWFLAKE), and SSL requirement status. Each entry is associated with an agent and environment, providing a record of when and by whom the database source was created and last modified. Useful for managing and auditing connections to external data sources.
Column | Description |
---|---|
AGENTID | Identifier for the user/org that who owns the database source. |
CREATED | Timestamp of when the database source was created. |
CREATEDBY | Indentifier for the client and agent that created the database source. |
ENV | The data.world environment, eg, Prod. |
LABEL | The user-defined identifier for the specific database source. |
SITEID | Unique 5-digit identifier for the specific data.world Private Instance where the database connection resides. |
SOURCEID | Unique identifier for each database source within the platform. |
SSLREQUIRED | Indicates if Secure Sockets Layer (SSL) is required for the database source. |
TYPE | Specifies the type of the database source (e.g., SNOWFLAKE). |
UPDATED | Timestamp for the last update made to the database source entry. |
UPDATEDBY | Identifier of the user who last updated the database source entry. |
databasesources_retention table
This table extends the DATABASESOURCES table by recording the historical changes to database source details over time within the platform. It tracks essential information about database sources, including the source ID, label, type, SSL requirement status, and timestamps for creation and updates.
Column | Description |
---|---|
AGENTID | Unique identifier for the agent linked to a database source. |
CREATED | Timestamp of when the database source record was initially created. |
CREATEDBY | Identifier of the user that created the database source record. |
ENV | The data.world environment. |
LABEL | The user-defined identifier for the specific database source. |
SITEID | Unique 5-digit identifier for the specific data.world Private Instance where the database connection resides. |
SOURCEID | Unique identifier for each database source within the platform. |
SSLREQUIRED | Indicates if Secure Sockets Layer (SSL) is required for the database source. |
TYPE | Specifies the type of the database source (e.g., SNOWFLAKE). |
UPDATED | Timestamp for when database source details were last updated. |
UPDATEDBY | Identifier of the user who last updated the database source details. |
datasetlinks table
Tracks links between datasets within the platform. Each entry records the connection between a dataset and its linked dataset, along with the creation and update timestamps. Useful for managing and auditing relationships between datasets across the platform.
Column | Description |
---|---|
CREATED | Timestamp marking when a dataset link was initially created. |
DATASET | Identifier of the primary dataset in a tracked link between datasets. |
ENV | The data.world environment. |
LINKEDDATASET | Identifies the dataset linked to the primary dataset. |
SITEID | Unique 5-digit identifier for the specific data.world Private Instance. |
UPDATED | Timestamp indicating the last time the dataset link was updated. |
datasetlinks_retention table
Extends the functionality of the DATASETLINKS view by preserving a historical record of links between datasets within the platform. Each entry documents the connections between datasets and their linked counterparts, along with the creation and update timestamps.
Column | Description |
---|---|
CREATED | Timestamp marking when the dataset link was initially created. |
DATASET | Identifier of the primary dataset in a tracked link between datasets. |
ENV | The data.world environment. |
LINKEDDATASET | Identifies the dataset linked to the primary dataset. |
SITEID | Unique 5-digit identifier for the specific data.world Private Instance. |
UPDATED | Timestamp of when the dataset link record was last updated. |
datasets table
Stores details about datasets within the platform, including the dataset ID, name, description (headline and abstract), creation and update timestamps, visibility status, and the number of data points. Tracks additional attributes such as licensing, tags, project status, and ingestion status. Useful for managing and auditing datasets, their versions, and their accessibility within the platform.
Column | Description |
---|---|
ABSTRACT | The user-generated summary detail of the dataset. May be text or markdown format. |
AGENTID | Unique identifier for the agent (user or org) who owns the dataset. |
CREATED | Timestamp indicating when the dataset was initially created. |
CREATEDBY | Identifier for the user who created the dataset. |
DATASETID | The formatted dataset name as represented in the DATASETS table. Along with the agentid (owner), composes the dataset identifier. |
ENV | The data.world environment. |
HEADLINE | The user-generated description of the dataset. |
INGESTSTATUS | Indicates the status of dataset ingestion in the platform, eg "LOADED", "NEW", "SYSTEMERROR". |
LATESTVERSIONID | Identifier for the most recent version of a dataset. |
LICENSE | Indicates the type of license associated with the dataset. |
NAME | The user-generated dataset name, which when formatted by the system, becomes the datasetid in the DATASETS table. |
NOTIFICATIONSEMAIL | Email address to which dataset notifications are sent. |
NUMDATAPOINTS | The quantitative size of a dataset, that is the count of individual data points it contains. |
OFFLINE | Boolean (1 or NULL). Indicates whether the dataset is available offline or not. |
PROJECT | Boolean (TRUE, FALSE, or NULL). Indicates whether the record is a project (TRUE) or a dataset (FALSE or NULL). |
SITEID | Unique 5-digit identifier for the specific data.world Private Instance. |
TAGS | Identifies categorical attributes given to datasets. |
UPDATED | Timestamp of the latest update made to the dataset. |
VISIBILITY | Indicates the visibility status of datasets within the platform (PRIVATE, OPEN, or DISCOVERABLE). |
datasets_retention table
Extends the DATASETS view by preserving a historical record of changes to datasets over time within the platform. It stores comprehensive details about datasets, including the dataset ID, name, descriptions, visibility status, and various other attributes like licensing, tags, and ingestion status.
Column | Description |
---|---|
ABSTRACT | The user-generated summary detail of the dataset. May be text or markdown format. |
AGENTID | Unique identifier for the agent (user or org) who owns the dataset. |
CREATED | Timestamp indicating when the dataset was initially created. |
CREATEDBY | Identifier for the user who created the dataset. |
DATASETID | The formatted dataset name as represented in the DATASETS table. Along with the agentid (owner), composes the dataset identifier. |
ENV | The data.world environment. |
HEADLINE | The user-generated description of the dataset. |
INGESTSTATUS | Indicates the status of dataset ingestion in the platform, eg "LOADED", "NEW", "SYSTEMERROR". |
LATESTVERSIONID | Identifier for the most recent version of a dataset. |
LICENSE | Indicates the type of license associated with the dataset. |
NAME | The user-generated dataset name, which when formatted by the system, becomes the datasetid in the DATASETS table. |
NOTIFICATIONSEMAIL | Email address to which dataset notifications are sent. |
NUMDATAPOINTS | The quantitative size of a dataset, that is the count of individual data points it contains. |
OFFLINE | Boolean (1 or NULL). Indicates whether the dataset is available offline or not. |
PROJECT | Boolean (TRUE, FALSE, or NULL). Indicates whether the record is a project (TRUE) or a dataset (FALSE or NULL). |
SITEID | Unique 5-digit identifier for the specific data.world Private Instance. |
TAGS | Identifies categorical attributes given to datasets. |
UPDATED | Timestamp of the latest update made to the dataset. |
VISIBILITY | Indicates the visibility status of datasets within the platform (PRIVATE, OPEN, or DISCOVERABLE). |
datasetsources table
Tracks synchronization details for datasets within the platform, including the dataset ID, sync status, timestamps for the last and next scheduled syncs, and synchronization intervals. Records versioning information and details about the synchronization type (e.g., auto, stream). Useful for managing and auditing the synchronization processes for datasets.
Column | Description |
---|---|
AGENTID | The identifier of the owner of the dataset set to autosync. |
AUTOSYNCINTERVAL | Specifies the auto-synchronization interval for the datasets. |
CREATED | Timestamp indicating when the dataset sync record was created. |
DATASETID | The formatted dataset name - along with the agentid (owner) composes the dataset identifier in the DATASETSOURCES table. |
DESIREDAUTOSYNC | The DESIREDAUTOSYNC column is not supported and does not provide any meaningful information. |
ENV | The data.world environment. |
GENERATION | Boolean (1 or NULL) |
LASTSYNCFINISH | Timestamp indicating when the last dataset synchronization was completed. |
LASTSYNCSTART | Timestamp indicating when the last dataset synchronization process started. |
NEXTSCHEDULEDSYNC | Timestamp for the next scheduled dataset synchronization. |
NEXTSCHEDULEDSYNCTYPE | Specifies the type of synchronization for the next scheduled dataset sync. |
SITEID | Unique 5-digit identifier for the specific data.world Private Instance. |
STREAMSYNCINTERVAL | Specifies the interval for stream synchronization of the dataset. |
SYNCSTATUS | Indicates the current status of the dataset synchronization process, eg( "OK", "SYSTEMERROR"). |
UPDATED | Timestamp of the most recent update to the dataset autosync settings. |
VERSION | Indicates the version of synchronized dataset within the platform. |
datasetsources_retention table
Extends the functionality of the DATASETSOURCES view by preserving a historical record of synchronization details for datasets within the platform. This includes information such as dataset ID, sync status, last and next scheduled sync times, and synchronization intervals.
Column | Description |
---|---|
AGENTID | The identifier of the owner of the dataset set to autosync. |
AUTOSYNCINTERVAL | Specifies the auto-synchronization interval for the datasets. |
CREATED | Timestamp indicating when the dataset sync record was created. |
DATASETID | The formatted dataset name - along with the agentid (owner) composes the dataset identifier in the DATASETSOURCES table. |
DESIREDAUTOSYNC | The DESIREDAUTOSYNC column is not supported and does not provide any meaningful information. |
ENV | The data.world environment. |
GENERATION | Boolean (1 or NULL) |
LASTSYNCFINISH | Timestamp indicating when the last dataset synchronization was completed. |
LASTSYNCSTART | Timestamp indicating when the last dataset synchronization process started. |
NEXTSCHEDULEDSYNC | Timestamp for the next scheduled dataset synchronization. |
NEXTSCHEDULEDSYNCTYPE | Specifies the type of synchronization for the next scheduled dataset sync. |
SITEID | Unique 5-digit identifier for the specific data.world Private Instance. |
STREAMSYNCINTERVAL | Specifies the interval for stream synchronization of the dataset. |
SYNCSTATUS | Indicates the current status of the dataset synchronization process, eg( "OK", "SYSTEMERROR"). |
UPDATED | Timestamp of the most recent update to the dataset autosync settings. |
VERSION | Indicates the version of synchronized dataset within the platform. |
datasetversions table
Tracks versions of datasets within the platform, capturing differences between each version, such as file and table counts, ingestion status, and any changes in the dataset’s content. Each version is linked to its previous version, allowing for detailed tracking of updates, modifications, or corrections made over time. This includes whether files were added, removed, or altered, and any changes to the structure or metadata of the dataset. The table also notes whether a version is marked for reaping or has been reaped, helping to manage the lifecycle of data and maintain version control.
Column | Description |
---|---|
AGENTDATASETID | The dataset identifier. Composed of the agentid (owner) and dataset name. |
AGENTID | The identifier for the agentid (user or org) that owns that dataset. |
CONTENTS2 | Contents of the dataset version; an array of objects. |
CREATED | Timestamp marking when a particular dataset version was created. |
CREATEDBY | Identifier of the user who created the dataset. |
DATASETID | The formatted dataset name - along with the agentid (owner) composes the dataset identifier in the DATASETVERSIONS table. |
ENV | The data.world environment. |
FILECOUNT | Indicates the count of files in each dataset version. |
INGESTSTATUS | Indicates the status of dataset ingestion for the dataset version, eg "LOADED", "NEW", "SYSTEMERROR". |
MARKEDFORREAPING | Boolean (true, false, NULL). Indicates if a dataset version is marked for removal or cleanup process. |
PREVIOUSVERSIONID | ID corresponding to the previous version of the dataset. |
REAPED | Boolean (true or NULL). Indicates if a dataset version has been reaped. |
REAPEDPREVIOUSVERSIONID | ID of the previously reaped version of the dataset. |
SITEID | Unique 5-digit identifier for the specific data.world Private Instance. |
TABLECOUNT | Number of tables in each version of the dataset. |
UPDATED | Indicates the timestamp when the dataset version was last updated. |
UPDATEDBY | Identifier for the user who updated the dataset version. |
VERSIONID | Unique identifier for each dataset version in the platform. |
entitlements table
In this context, an entitlement refers to the specific access rights or privileges granted to an agent.
Tracks entitlements granted to agents within the platform, including details such as entitlement ID, associated products, and the start and end dates of the entitlement. Each entry records the creation and update timestamps, the order of the entitlement, and the source of the entitlement (e.g., offering subscription). Useful for managing and auditing the entitlements and access rights granted to users across the platform.
Column | Description |
---|---|
AGENTID | Unique identifier for agents granted with specific access rights. |
CREATED | Timestamp of when the entitlement was created. |
CREATEDBY | Identifier of the agent who created the entitlement. |
ENDDATE | The date the access right or privilege granted to an agent ends. |
ENTITLEMENTID | Unique identifier for each entitlement granted to agents. |
ENTITLEMENTITEMS | An array of items related to the entitlement including ID and quantity. |
ENV | The data.world environment |
ORDER | An integer (1, 2, or 3) that pertains to order. |
SITEID | Unique 5-digit identifier for the specific data.world Private Instance. |
SOURCE | References associated entry in the OFFERINGSUBCRIPTIONS table. |
STARTDATE | The date the agent's access rights or privileges begin. |
UPDATED | Timestamp indicating the most recent update to an entitlement. |
UPDATEDBY | Identifier for the agent who last updated the entitlement record. |
entitlements_retention table
This table extends the functionality of the ENTITLEMENTS table by providing a historical view of entitlements as they change over time. Records each modification of an entitlement, capturing details such as the entitlement ID, associated products, and timestamps for creation, updates, and the validity period. This data is helpful for auditing and managing the lifecycle of entitlements.
Column | Description |
---|---|
AGENTID | Unique identifier for agents granted with specific access rights. |
CREATED | Timestamp of when the entitlement was created. |
CREATEDBY | Timestamp of when the entitlement was created. |
ENDDATE | The date the access right or privilege granted to an agent ends. |
ENTITLEMENTID | Unique identifier for each entitlement granted to agents. |
ENTITLEMENTITEMS | An array of items related to the entitlement including ID and quantity. |
ENV | The data.world environment. |
ORDER | An integer (1, 2, or 3) that pertains to order. |
SITEID | Unique 5-digit identifier for the specific data.world Private Instance. |
SOURCE | References associated entry in the OFFERINGSUBCRIPTIONS table. |
STARTDATE | The date the agent's access rights or privileges begin. |
UPDATED | Timestamp indicating the most recent update to an entitlement. |
UPDATEDBY | Identifier for the agent who last updated the entitlement record. |
follows table
Tracks instances where agents “follow” specific resources within the platform so they can be aware of changes to that resource. Each entry includes the agent ID, the resource being followed, and associated timestamps. Useful for monitoring user engagement and tracking which resources are being followed by different agents.
Column | Description |
---|---|
AGENTID | Unique identifier of the follower agent (user). |
CREATED | Timestamp indicating when a follow was created by an agent. |
ENV | The data.world environment. |
FOLLOWING | Unique identifier of the followed agent (user). |
SITEID | Unique 5-digit identifier for the data.world Private Instance. |
UPDATED | Timestamp indicating when the follow action was last updated. |
insights table
Stores details about curated interpretations or analyses of data, known as “Insights,” which are linked to specific datasets or projects. Each entry tracks the creation, updates, and associations of these insights within the platform. Insights provide users with a quick understanding of key data patterns or findings, making them essential for collaboration and decision-making.
Column | Description |
---|---|
CREATED | Timestamp of when the Insight record was initially created. |
CREATEDBY | Identifier (agentid) of the user who created the insight. |
ENV | The data.world environment. |
INSIGHTID | Unique identifier for each insight within the INSIGHTS table. |
LATESTVERSIONID | ID of the latest version of the insight in the platform. |
PARTY | Identifies the user who created the insight. |
PARTYID | The identifier (agentid) of the user who created the insight. |
PARTYTYPE | The value for this column is "agent" for entries in the INSIGHTS table. |
RESOURCE | Identifies the resourcetype and resourceId for the resources associated with insights in the platform. |
RESOURCEID | The identifier for the resources associated with insights in the platform. Composed of the owner and resource name. |
RESOURCETYPE | The value for this column is "dataset" for all entries in the INSIGHTS table. |
SITEID | Unique 5-digit identifier for the data.world Private Instance where the insight is located. |
UPDATED | Timestamp of the most recent update applied to the Insight record. |
UPDATEDBY | Identifier of the user who last updated the insight. |
insightversions table
Tracks different versions of “insights,” which are interpretations or analyses linked to specific datasets or projects. Each record captures details such as the Insight ID, version ID, associated resource, and timestamps for creation and updates. This table is essential for understanding the evolution of insights over time, allowing users to track changes and modifications to key analyses or interpretations within the platform.
Column | Description |
---|---|
CREATED | Timestamp when a specific insight version was created. |
CREATEDBY | Identifier (agentid) for the user who created the specific Insight version. |
ENV | The data.world environment. |
INSIGHTID | Unique identifier for each insight in the INSIGHTVERSIONS table. |
MODIFYINGAGENTID | Identifier of the agent responsible for modifying a particular insight version |
PREVIOUSVERSIONID | ID of the immediate prior version of the current insight. |
RESOURCE | Identifies the resourcetype and resourceId for the resources associated with insights in the platform. |
RESOURCEID | The identifier for the resources associated with insights in the platform. Composed of the owner and resource name. |
RESOURCETYPE | The value for this column is "dataset" for all entries in the INSIGHTVERSIONS table. |
SITEID | Unique 5-digit identifier for the data.world Private Instance where the file is located. |
UPDATED | Timestamp of the most recent update applied to the INSIGHTVERSION record. |
UPDATEDBY | Identifier of the user who last updated the insight version. |
VERSIONID | Unique ID for each version of insight in the INSIGHTVERSIONS table. |
invites table
Tracks invitations sent within the platform, including the invite ID, email, and the invited agent’s ID. Each entry records the status of the invite, along with the creation and update timestamps. This data is useful for managing and auditing invitation activities, providing insights into collaboration as it shows users inviting others to various resources in the catalog.
Column | Description |
---|---|
CREATED | Timestamp showing when the invite was created. |
CREATEDBY | Identifier (agentid) of the user who created the invite in the platform. |
The email address to which an invite on the platform is sent. | |
ENV | The data.world environment. |
INVITEDAGENTID | The identifier (agentid) of the agent who has been invited on the platform. |
INVITEID | Unique identifier corresponding to each invitation sent within the platform. |
SITEID | Unique 5-digit identifier for the data.world Private Instance. |
STATUS | Represents the current status of a specific invitation, eg, SENT, USED. |
UPDATED | Timestamp of the last update made to the invite's status. |
UPDATEDBY | Identifier (agentid) of the user who last updated the invite record. |
likes table
Tracks “likes” made by agents on various resources within the platform. Includes details such as the agent (party) who liked the resource, the resource itself, creation and update timestamps, and the type of resource (e.g., dataset). Useful for monitoring user engagement and interactions with platform content.
Column | Description |
---|---|
CREATED | Timestamp indicating when the 'like'(bookmark) was recorded on the platform. |
CREATEDBY | Identifier (agentid) of the user who bookmarked the resource. |
ENV | The data.world environment. |
PARTY | Identifies the user who bookmarked a resource within the platform. |
PARTYID | Agentid of the user who bookmarked a resource within the platform. |
PARTYTYPE | The value for this column is "agent" for all entries in the LIKES table. |
RESOURCE | Identifies the resourcetype and resourceId for the resources associated with bookmarks in the platform. |
RESOURCEID | The identifier for the resources associated with bookmarks in the platform. Composed of the owner and resource name. |
RESOURCETYPE | Identifies the type of resource that received the bookmark eg, datasetinsight, dataset, comment, catalogResource. |
SITEID | Unique 5-digit identifier for the data.world Private Instance. |
UPDATED | Timestamp noting the most recent update to the 'like' record. |
UPDATEDBY | Identifier for the agent who last updated the 'like' record. |
offerings table
Tracks various offerings within the platform, including details such as offering ID, name, description, and associated products. Records information about whether the offering requires payment, the associated Stripe plan, and the creation and update timestamps.
Column | Description |
---|---|
AGENTTYPE | Specifies if the offering is applicable at the org-level or the individual user-level. |
CREATED | Timestamp indicating when the offering record was created in the system. |
CREATEDBY | Identifier for the user who created the offering record. |
DEFAULTOFFERING | Boolean. Indicates if the offering is the default option. |
DESCRIPTION | Textual details about the specific offering. |
ENV | Identifies the data.world environment to which the offering applies. |
NAME | The name given to the the specific offering. |
OFFERINGID | Unique identifier for the offerings within the OFFERINGS table. |
OFFERINGSLUG | Identifier for each offering within the system. Unique at the environment level. |
PRODUCTIDS | Reference to unique identifiers of products associated with each offering. |
REQUIRESPAYMENT | Indicates if the offering requires a payment or not. |
SITEID | Unique 5-digit identifier for the data.world Private Instance. |
STARTDATE | The date when the offering was made available in the system. |
STRIPEPLANID | Identifier for the associated Stripe plan for the particular offering. |
UPDATED | TimeStamp of when the offering's record was last updated. |
UPDATEDBY | Identifier for the user who last updated the offering record. |
offerings_retention table
An extension of the OFFERINGS table that provides a historical view of the various offerings within the platform. This table captures the evolution of each offering over time, including details such as the offering ID, name, description, and associated products. It also tracks information about payment requirements, the associated Stripe plan, and timestamps for creation and updates.
Column | Description |
---|---|
AGENTTYPE | Specifies if the offering is applicable at the org-level or the individual user-level. |
CREATED | Timestamp indicating when the offering record was created in the system. |
CREATEDBY | Identifier for the user who created the offering record. |
DEFAULTOFFERING | Boolean. Indicates if the offering is the default option. |
DESCRIPTION | Textual details about the specific offering. |
ENV | Identifies the data.world environment to which the offering applies. |
NAME | The name given to the the specific offering. |
OFFERINGID | Unique identifier for the offerings within the OFFERINGS table. |
OFFERINGSLUG | Identifier for each offering within the system. Unique at the environment level. |
PRODUCTIDS | Reference to unique identifiers of products associated with each offering. |
REQUIRESPAYMENT | Indicates if the offering requires a payment or not. |
SITEID | Unique 5-digit identifier for the data.world Private Instance. |
STARTDATE | The date when the offering was made available in the system. |
STRIPEPLANID | Identifier for the associated Stripe plan for the particular offering. |
UPDATED | TimeStamp of when the offering's record was last updated. |
UPDATEDBY | Identifier for the user who last updated the offering record. |
offeringsubscriptions table
Tracks subscriptions to specific offerings within the platform, including details such as the subscription ID, offering ID, start and end dates, and the associated agent. Records creation and update timestamps, along with the responsible agents. Useful for managing and auditing the lifecycle of offering subscriptions across the platform.
Column | Description |
---|---|
AGENTID | Identifier of the agent associated with the offering subscription. Unique at the environment-level and siteid-level. |
CREATED | date string 'yyyy-mm-dd' If org has preexisting entry in offeringsubscriptions table (created via manual configuration in admin tool), then select value from offeringsubscriptions; else, select value from siteid_subscriptionplan_map.csv seed. |
CREATEDBY | If org has preexisting entry in offeringsubscriptions table (created via manual configuration in admin tool), then select value from offeringsubscriptions; else, select value from siteid_subscriptionplan_map.csv seed. |
ENDDATE | date string 'yyyy-mm-dd'; null if plan is currently active; If org has preexisting entry in offeringsubscriptions table (created via manual configuration in admin tool), then select value from offeringsubscriptions; else, select value from siteid_subscriptionplan_map.csv seed. |
ENV | Identifies the data.world environment in which the offering subscription operates. |
OFFERINGID | If org has preexisting entry in offeringsubscriptions table (created via manual configuration in admin tool), then select value from offeringsubscriptions; else, select value from siteid_subscriptionplan_map.csv seed. |
SITEID | Unique 5-digit identifier for the data.world Private Instance. |
STARTDATE | date string 'yyyy-mm-dd'. If org has preexisting entry in offeringsubscriptions table (created via manual configuration in admin tool), then select value from offeringsubscriptions; else, select value from siteid_subscriptionplan_map.csv seed. |
SUBSCRIPTIONID | 1) If the entry (identified by composite key formed by agentid, sited), has a subscriptionid uuid code already present in offeringsubscriptions table, we use the subscriptionid code present in the offeringsubscriptions table (see ifnull statement in offeringsubscriptions_plan_automation macro). In the case that the subscriptionid code already exists. we can assume that it was created automatically when someone manually gave the org a plan via the admin tool. 2) If the org does not have an entry in offeringsubscriptions, then we derive the code dynamically using a deterministic application of the uuid_string SnowFlake function. |
UPDATED | date string 'yyyy-mm-dd' If org has preexisting entry in offeringsubscriptions table (created via manual configuration in admin tool), then select value from offeringsubscriptions; else, select value from siteid_subscriptionplan_map.csv seed. |
UPDATEDBY | If org has preexisting entry in offeringsubscriptions table (created via manual configuration in admin tool), then select value from offeringsubscriptions; else, select value from siteid_subscriptionplan_map.csv seed. |
products table
Stores details about products available within the platform, including product ID, name, category, and associated grants or permissions.
Column | Description |
---|---|
CATEGORY | Specifies the category to which each product belongs. |
CREATED | Date and time when the product record was created. |
CREATEDBY | Identifier for the user who created the product entry. |
ENV | The data.world environment to which the product record applies. |
GRANTS | Specifies grants or permissions associated with each product. |
NAME | The name of the product available within the platform. |
PRODUCTID | Unique identifier for each product within the platform. |
SITEID | Unique 5-digit identifier for the data.world Private Instance to which the product applies. |
UPDATED | The date/time when product information was last updated. |
UPDATEDBY | Identifier (agentid) of the individual who last updated the product detail. |
products_retention table
Extends the PRODUCTS view by maintaining a historical record of changes to products over time within the platform. It stores detailed information about products, including product ID, name, category, and any associated grants or permissions.
Column | Description |
---|---|
CATEGORY | Specifies the category to which each product belongs. |
CREATED | Date and time when the product record was created. |
CREATEDBY | Identifier for the user who created the product entry. |
ENV | The data.world environment to which the product record applies. |
GRANTS | Specifies grants or permissions associated with each product. |
NAME | The name of the product available within the platform. |
PRODUCTID | Unique identifier for each product within the platform. |
SITEID | Unique 5-digit identifier for the data.world Private Instance to which the product applies. |
UPDATED | The date/time when product information was last updated. |
UPDATEDBY | Identifier (agentid) of the individual who last updated the product detail. |
queries table
Stores details about queries run within the platform, including the query ID, associated resource, name, and latest version ID. Tracks creation and update timestamps, along with publication status and the related resource type. Each query is linked to a specific agent/user and can be categorized by type (e.g., SQL, SPARQL). Useful for managing and auditing queries associated with various datasets and resources.
Column | Description |
---|---|
CREATED | Timestamp indicating when the query was created in the platform. |
CREATEDBY | Indicates the agentid of the user who created the query. |
ENV | The data.world environment to which the query record applies. |
LATESTVERSIONID | Represents the ID of the latest version of a specific query. |
NAME | The name given to each query saved within the platform. |
PARTY | Identifies the user who saved a specific query. |
PARTYID | The agentid of the user who saved a specific query. |
PARTYTYPE | The type of the agent that saved the query. The value for this column is "agent" for all entries in the QUERIES table. |
PUBLISHED | Boolean. Indicates if the query has been made viewable to anyone who can access the project. |
PUBLISHEDRESOURCE | Specifies the identifier of the resource to which the query was saved and published. NULL if unpublished. |
QUERYID | Unique identifier for each query saved within the platform. |
RESOURCE | Specifies the type and identifier of the resource where the query is saved. |
RESOURCEID | Specifies the identifier of the resource where the query is saved. |
RESOURCETYPE | Specifies the type of the resource where the query is saved. |
SITEID | Unique 5-digit identifier for the data.world Private Instance. |
TYPE | Identifies the type of query (e.g., SQL, SPARQL). |
UPDATED | Timestamp marking the most recent update to the specific query. |
UPDATEDBY | Identifier (agentid) of the user who last updated the query. |
queries_retention table
Extends the QUERIES table by capturing the historical changes to query details over time within the platform. It stores comprehensive information about queries, including the query ID, associated resource, name, and version history. The historical aspect of this table is helpful for auditing and managing queries, providing a complete record of how queries evolve and are utilized within the platform.
Column | Description |
---|---|
CREATED | Timestamp indicating when the query was created in the platform. |
CREATEDBY | Indicates the agentid of the user who created the query. |
ENV | The data.world environment to which the query record applies. |
LATESTVERSIONID | Represents the ID of the latest version of a specific query. |
NAME | The name given to each query saved within the platform. |
PARTY | Identifies the user who saved a specific query. |
PARTYID | The agentid of the user who saved a specific query. |
PARTYTYPE | The type of the agent that saved the query. The value for this column is "agent" for all entries in the QUERIES table. |
PUBLISHED | Boolean. Indicates if the query has been made viewable to anyone who can access the project. |
PUBLISHEDRESOURCE | Specifies the identifier of the resource to which the query was saved and published. NULL if unpublished. |
QUERYID | Unique identifier for each query saved within the platform. |
RESOURCE | Specifies the type and identifier of the resource where the query is saved. |
RESOURCEID | Specifies the identifier of the resource where the query is saved. |
RESOURCETYPE | Specifies the type of the resource where the query is saved. |
SITEID | Unique 5-digit identifier for the data.world Private Instance. |
TYPE | Identifies the type of query (e.g., SQL, SPARQL). |
UPDATED | Timestamp marking the most recent update to the specific query. |
UPDATEDBY | Identifier (agentid) of the user who last updated the query. |
queryversions table
Tracks different versions of queries within the platform, including details such as the query ID, version ID, modifying agent, name, and type. Records creation and update timestamps, along with information about the previous version, publication status, and visibility. Useful for managing and auditing the evolution of queries associated with specific resources.
Column | Description |
---|---|
CREATED | Timestamp of when the query version was created. |
CREATEDBY | Identifier (agentid) for the user who created the query version. |
ENV | The data.world environment. |
MODIFYINGAGENTID | ID of the agent that modified the query version. |
NAME | Name or title assigned to the specific query version. |
PREVIOUSVERSIONID | ID of the previous version of the query in the platform. |
PUBLISHED | Boolean. Indicates if the query version has been made viewable to anyone who can access the project. |
QUERYID | Unique identifier for each query within the QUERYVERSIONS table. |
RESOURCE | Specifies the type and identifier of the resource where the query is saved. |
RESOURCEID | Specifies the identifier of the resource where the query is saved. |
RESOURCETYPE | Specifies the type of the resource where the query is saved. |
SITEID | Unique 5-digit identifier for the data.world Private Instance. |
TYPE | Identifies the type of query (for example, SQL, SPARQL). |
UPDATED | Timestamp marking the most recent update to the specific query. |
UPDATEDBY | Identifier (agentid) of the user who last updated the query. |
VERSIONID | Unique identifier for the version of a specific query. |
VISIBILITY | Indicates the accessibility level of a specific query version. |
subscriptions table
Tracks subscriptions created by agents within the platform, including details such as the subscriber ID, entity being subscribed to, subscription type (e.g., EMAIL), and the reason for the subscription (e.g., LIKE, FOLLOW). Records the creation and update timestamps, as well as the observable level of the subscription. Useful for managing and auditing user subscriptions to various platform entities.
Column | Description |
---|---|
CREATED | Timestamp indicating when the subscription record was created. |
CREATEDBY | Identifier (agentid) of the user who created the subscription record. |
ENTITYID | Unique identifier for the entity being subscribed to. |
ENV | The data.world environment. |
LEVEL | Specifies the observable level of a subscription within the platform. |
OBSERVABLE | Specifies the resource or user account to which is subscribed to. |
REASON | Specifies the reason for the user's subscription (e.g., LIKE, FOLLOW). |
SITEID | Unique 5-digit identifier for the data.world Private Instance. |
SUBSCRIBERID | Type ("agent") and Identifier (agentid) for the subscriber in the SUBSCRIPTIONS table. |
SUBSCRIPTIONID | Unique identifier for each user's subscription within the platform. |
TYPE | Specifies the type of subscription (e.g., EMAIL) for each record. |
UPDATED | Timestamp indicating when the subscription was last updated. |
UPDATEDBY | Identifier (agentid) of the user who last updated the subscription. |
subscriptions_retention table
Extends the SUBSCRIPTIONS view by maintaining a historical record of subscription activities created by agents within the platform. This includes details such as the subscriber ID, entity being subscribed to, subscription type (e.g., EMAIL), and the reason for the subscription (e.g., LIKE, FOLLOW).
Column | Description |
---|---|
CREATED | Timestamp when the subscription activity was created. |
CREATEDBY | Identifier (agentid) of the user who created the subscription. |
ENTITYID | Unique identifier for the entity being subscribed to. |
ENV | The data.world environment. |
LEVEL | Specifies the observable level of a subscription within the platform. |
OBSERVABLE | Specifies the resource or user account to which is subscribed to. |
REASON | Specifies the reason for the user's subscription (e.g., LIKE, FOLLOW). |
SITEID | Unique 5-digit identifier for the data.world Private Instance. |
SUBSCRIBERID | Type ("agent") and Identifier (agentid) for the subscriber in the SUBSCRIPTIONS table. |
SUBSCRIPTIONID | Unique identifier for each user's subscription within the platform. |
TYPE | Specifies the type of subscription (e.g., EMAIL) for each record. |
UPDATED | Timestamp indicating when the subscription was last updated. |
UPDATEDBY | Identifier (agentid) of the user who last updated the subscription. |