Catalog collector release notes
Important
Published versions of collectors are available as a Docker image and a JAR file.
Release version 2.273
Details about the release
Item | Details |
---|---|
Release version | 2.273 |
Release date | 12 April, 2025 |
Docker image ID | |
Jar file | |
Bug fixes
Monte Carlo collector: Fixed alignment issues in generated table information from Monte Carlo Monitor by properly escaping pipes ('|') in monitor names.
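As a rough illustration of the kind of escaping described in this fix, the sketch below builds one row of a pipe-delimited table and escapes any pipe inside a cell. The function names are hypothetical and this is not the collector's actual code.

```python
def escape_cell(text: str) -> str:
    """Escape pipe characters so they are not read as column separators."""
    return text.replace("|", "\\|")


def table_row(cells: list[str]) -> str:
    """Escape each cell, then join the cells into one pipe-delimited row."""
    return "| " + " | ".join(escape_cell(c) for c in cells) + " |"


# A monitor name that contains a pipe no longer splits into two columns.
print(table_row(["Freshness | orders", "Active"]))
```

Without the escaping step, the monitor name in this example would be read as two separate cells and shift every column after it.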
Release version 2.272
Details about the release
Item | Details |
---|---|
Release version | 2.272 |
Release date | 11 April, 2025 |
Docker image ID | |
Jar file | |
New features and changes
A new collector, the SAP HANA collector, is now available in public preview.
SSIS collector: Now captures the connections between packages through the control flow, improving visibility of package relationships.
Databricks collector: The collector now harvests lineage only within the current schema. A new option, Harvest entire lineage (--harvest-entire-lineage), enables harvesting lineage from external schemas.
Microsoft Fabric collector: Enhanced to support additional syntaxes for Semantic Model table connections to Fabric warehouse/lakehouse resources, broadening compatibility and connectivity.
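To make the Databricks lineage change above more concrete, here is a minimal sketch of schema-scoped lineage filtering. It assumes that "within the current schema" means both ends of a lineage edge belong to that schema; the LineageEdge type and the function names are hypothetical and are not taken from the collector.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LineageEdge:
    source: str  # fully qualified name: "catalog.schema.table"
    target: str


def schema_of(qualified_name: str) -> str:
    """Return the "catalog.schema" part of a three-part table name."""
    catalog, schema, _table = qualified_name.split(".")
    return f"{catalog}.{schema}"


def select_edges(edges, current_schema, harvest_entire_lineage=False):
    """Keep in-schema edges only, unless the entire lineage is requested."""
    if harvest_entire_lineage:
        return list(edges)
    # Assumption: an edge is in scope only when both ends are in the schema.
    return [
        e for e in edges
        if schema_of(e.source) == current_schema
        and schema_of(e.target) == current_schema
    ]
```

Passing harvest_entire_lineage=True mirrors the effect described for --harvest-entire-lineage: edges that touch external schemas are kept as well.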
Bug fixes
Qlik Sense collector: Made performance improvements to increase the efficiency and speed of the collector.
Microsoft Fabric collector: Fixed issues with Lakehouses and SQL endpoints to ensure database resources are associated with the correct resource, enhancing accuracy.
Release version 2.271
Details about the release
Item | Details |
---|---|
Release version | 2.271 |
Release date | 3 April, 2025 |
Docker image ID | |
Jar file | |
Bug fixes
Tableau collector: Fixed an issue to ensure proper handling of sites and projects with large quantities of column and data source fields.
Databricks collector: Resolved a parsing issue in view queries where aliases starting with a number caused failures.
Release version 2.270
Details about the release
Item | Details |
---|---|
Release version | 2.270 |
Release date | 31 March, 2025 |
Docker image ID | |
Jar file | |
New features and changes
The following two new collectors are now available in public preview.
Power BI collector: The collector now harvests column descriptions for Power BI columns.
Databricks collector: The collector now supports OAuth service principal authentication for Databricks. Two new parameters, Service principal client ID (--client-id) and Service principal client secret (--client-secret), are introduced for this.
Bug fixes
Tableau collector: Fixed an issue with the display of lineage between Tableau Fields and Database Columns, ensuring accurate representation of data relationships.
Release version 2.269
Important
This release contains internal improvements only and has no customer-impacting changes.
Details about the release
Item | Details |
---|---|
Release version | 2.269 |
Release date | 25 March, 2025 |
Docker image ID | |
Jar file | |
Release version 2.268
Warning
Collector versions 2.264 through 2.267 have been deprecated. If you are using these versions, please update to version 2.268 as soon as possible.
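If you track collector versions in a script, a check like the following can flag the deprecated range named in this warning. It is a minimal sketch; the function names are hypothetical and not part of any collector tooling.

```python
def parse_version(version: str) -> tuple[int, ...]:
    """Turn "2.265" into (2, 265) so versions compare numerically."""
    return tuple(int(part) for part in version.split("."))


# Deprecated range stated in the warning: 2.264 through 2.267, inclusive.
DEPRECATED_FIRST = (2, 264)
DEPRECATED_LAST = (2, 267)


def is_deprecated(version: str) -> bool:
    """Return True if the version falls inside the deprecated range."""
    return DEPRECATED_FIRST <= parse_version(version) <= DEPRECATED_LAST
```

Comparing integer tuples rather than strings or floats avoids ordering mistakes between versions with a different number of digits.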
Details about the release
Item | Details |
---|---|
Release version | 2.268 |
Release date | 24 March, 2025 |
Docker image ID | Link to download the Docker image: https://hub.docker.com/r/datadotworld/dwcc/tags |
Jar file | Link to download the JAR file: https://releases.data.world/dwcc/2.268/dwcc-2.268.zip |
New features and changes
Snowflake collector: The collector now harvests system tags.
SSIS collector: The collector now supports including or excluding specific databases or servers from harvesting. Four new parameters are introduced for this: --include-database, --exclude-database, --include-server, and --exclude-server.
Bug fixes
All collectors: Resolved an issue where collectors created new collections with a new ID, leading to duplicate collections in the catalog.
Qlik Sense collector: Fixed an issue where missing user information in Qlik Sense resulted in an exception trace in the log file.
Azure Data Factory collector: Added a log message to indicate when the Dataset API response lacks sufficient information, such as schema and table details, to construct lineage.
Release version 2.267 (deprecated)
Warning
This collector version is deprecated. Please use version 2.268 or higher to receive the latest collector updates.
Details about the release
Item | Details |
---|---|
Release version | 2.267 |
Release date | 17 March, 2025 |
Docker image ID | Link to download the Docker image: https://hub.docker.com/r/datadotworld/dwcc/tags |
Jar file | Link to download the JAR file: https://releases.data.world/dwcc/2.267/dwcc-2.267.zip |
New features and changes
Tableau collector: The way we represent ownership information for workbooks, views, and metrics in the Tableau catalog has been updated. The owner is now represented using the kos:hasOwner property.
Important
The former approach using kos:createdBy will continue to be supported during a transition period, but it is deprecated and will be phased out in a future release. This change affects only users who have written SPARQL queries or exported content using RDF properties. If you are one of these users, update your queries to use kos:hasOwner.
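For readers updating SPARQL queries during the transition period, the sketch below shows one possible query shape: it reads the owner from kos:hasOwner and falls back to kos:createdBy only when no kos:hasOwner value exists. The namespace IRI bound to the kos: prefix is not given in these notes, so the helper takes it as an argument. The helper name, variable names, and query shape are assumptions, not a query taken from the product.

```python
# Query body: prefer kos:hasOwner, fall back to the deprecated kos:createdBy.
QUERY_BODY = """
SELECT ?resource ?owner WHERE {
  { ?resource kos:hasOwner ?owner . }
  UNION
  {
    ?resource kos:createdBy ?owner .
    FILTER NOT EXISTS { ?resource kos:hasOwner ?anyOwner . }
  }
}
"""


def owner_query(kos_namespace: str) -> str:
    """Build the query; kos_namespace is the IRI bound to kos: in your catalog."""
    return f"PREFIX kos: <{kos_namespace}>" + QUERY_BODY


# The IRI below is a placeholder, not the real namespace.
print(owner_query("https://example.org/kos#"))
```

Once the deprecated property is phased out, the second branch of the UNION can be dropped.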
Bug fixes
Alteryx collector: Increased API call read timeout and improved error handling by capturing and logging processing exceptions.
Release version 2.266 (deprecated)
Warning
This collector version is deprecated. Please use version 2.268 or higher to receive the latest collector updates.
Details about the release
Item | Details |
---|---|
Release version | 2.266 |
Release date | 10 March, 2025 |
Docker image ID | Link to download the Docker image: https://hub.docker.com/r/datadotworld/dwcc/tags |
Jar file | Link to download the JAR file: https://releases.data.world/dwcc/2.266/dwcc-2.266.zip |
Bug fixes
Confluent collectors: The collector now correctly handles cases where consumer member assignments are missing a topic description.
Release version 2.265 (deprecated)
Warning
This collector version is deprecated. Please use version 2.268 or higher to receive the latest collector updates.
Details about the release
Item | Details |
---|---|
Release version | 2.265 |
Release date | 10 March, 2025 |
Docker image ID | Link to download the Docker image: https://hub.docker.com/r/datadotworld/dwcc/tags |
Jar file | Link to download the JAR file: https://releases.data.world/dwcc/2.265/dwcc-2.265.zip |
Bug fixes
Tableau collector: The collector now accurately harvests hidden dashboards. Previously, the feature was limited to hidden Views, which included both Sheets and Dashboards but classified them all as Sheets. With this update, the collector distinguishes between Sheets and Dashboards, assigning the correct type to each entity. This ensures a more accurate representation of hidden Views in Tableau.
Release version 2.264 (deprecated)
Warning
This collector version is deprecated. Please use version 2.268 or higher to receive the latest collector updates.
Details about the release
Item | Details |
---|---|
Release version | 2.264 |
Release date | 7 March, 2025 |
Docker image ID | Link to download the Docker image: https://hub.docker.com/r/datadotworld/dwcc/tags |
Jar file | Link to download the JAR file: https://releases.data.world/dwcc/2.264/dwcc-2.264.zip |
New features and changes
A new collector, the AWS Database Migration Service (DMS) collector, is now available in public preview.
Tableau collector: Improved command guidance, documentation, and warnings about the required format of the Tableau API URL option.
Alteryx collector: The collector now harvests nested workflow nodes and catalogs their relationship with the workflow.
Azure Data Factory collector: Made improvements to Azure Data Factory lineage by enhancing the harvesting of lineage from parameterized dataset references. The collector now also harvests both downstream and upstream resources.
Oracle collector: Added a new parameter, --autonomous-db-connection-string, for specifying the connection string for an autonomous database.
Release version 2.263
Details about the release
Item | Details |
---|---|
Release version | 2.263 |
Release date | 3 March, 2025 |
Docker image ID | Link to download the Docker image: https://hub.docker.com/r/datadotworld/dwcc/tags |
Jar file | Link to download the JAR file: https://releases.data.world/dwcc/2.263/dwcc-2.263.zip |
New features and changes
Tableau collector: The collector now harvests relationships between SQL tables and workbooks, enhancing data connectivity and visualization.
AWS Glue collector: Enhanced the collector to harvest lineage from Glue Data Catalog tables to their underlying S3 objects and to gather more metadata for tables. The enhanced collector is available with the command catalog-aws-glue, while the legacy collector remains available as catalog-aws-glue-legacy or catalog-awsglue for compatibility. Coordinate with your Customer Success Director to plan the transition to the new collector version.
Note that the AWS Glue collector is available only as an on-premises solution, not as a cloud collector.
Bug fixes
Tableau collector:
Resolved an error in harvesting Custom SQL Tables.
Fixed an issue with filtering projects by name or ID.
Release version 2.262
Details about the release
Item | Details |
---|---|
Release version | 2.262 |
Release date | 26 February, 2025 |
Docker image ID | Link to download the Docker image: https://hub.docker.com/r/datadotworld/dwcc/tags |
Jar file | Link to download the JAR file: https://releases.data.world/dwcc/2.262/dwcc-2.262.zip |
Bug fixes
Tableau collector:
Fixed an issue with URL encoding.
The collector now properly handles server errors in GraphQL pagination.
Databricks collector: Fixed an issue where a null pointer exception occurred while harvesting tags from Databricks.
Release version 2.261
Details about the release
Item | Details |
---|---|
Release version | 2.261 |
Release date | 19 February, 2025 |
Docker image ID | Link to download the Docker image: https://hub.docker.com/r/datadotworld/dwcc/tags |
Jar file | Link to download the JAR file: https://releases.data.world/dwcc/2.261/dwcc-2.261.zip |
Bug fixes
Databricks collector: Fixed an exception that occurred when the table type did not match any known types.
Fivetran collector: Updated to use new APIs for retrieving column lineage due to changes in the Fivetran API.
Important
Update your collector configurations to the latest version to seamlessly view column lineage without disruptions.
Release version 2.260
Details about the release
Item | Details |
---|---|
Release version | 2.260 |
Release date | 13 February, 2025 |
Docker image ID | Link to download the Docker image: https://hub.docker.com/r/datadotworld/dwcc/tags |
Jar file | Link to download the JAR file: https://releases.data.world/dwcc/2.260/dwcc-2.260.zip |
New features and changes
Snowflake collector: Added support for parsing SQL that utilizes IDENTIFIER() function calls when passing exact table names.
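To show the SQL pattern this change refers to, the sketch below rewrites IDENTIFIER('name') calls that pass a literal table name into the plain name. It is a simplified regular-expression illustration with hypothetical names, not the collector's SQL parser, and it deliberately leaves other argument forms, such as session variables, untouched.

```python
import re

# Matches IDENTIFIER('some.table') where the argument is a quoted literal.
IDENTIFIER_CALL = re.compile(r"\bIDENTIFIER\(\s*'([^']+)'\s*\)", re.IGNORECASE)


def resolve_identifier_calls(sql: str) -> str:
    """Replace IDENTIFIER('name') with name; leave other forms unchanged."""
    return IDENTIFIER_CALL.sub(lambda match: match.group(1), sql)


print(resolve_identifier_calls("SELECT * FROM IDENTIFIER('sales.public.orders')"))
```

After this rewrite the statement references the table directly, which is the form a lineage parser needs in order to link the query to its source table.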
Bug fixes
Tableau collector (Preview): Added a null check in custom SQL Table logic to prevent errors.
SQL Server collector: Reduced excessive log and warning messages during dependency collection to streamline output.
Databricks collector: Improved error handling for connection issues with the Databricks host.
Release version 2.259
Details about the release
Item | Details |
---|---|
Release version | 2.259 |
Release date | 6 February, 2025 |
Docker image ID | |
Jar file | Link to download the JAR file: https://releases.data.world/dwcc/2.259/dwcc-2.259.zip |
New features and changes
Databricks collector: Added support for harvesting resources with the browse privilege.
Tableau collector: Now supports harvesting lineage between Custom SQL tables and their upstream tables.
Power BI and Power BI Gov collectors: The collectors now handle calculated tables as a new type and catalog table-level lineage to source tables and columns.
Bug fixes
Monte Carlo collector: Updated the collector to remove reaction type from incidents, as it has been deprecated in Monte Carlo GraphQL responses.
Salesforce collector: Added a null check to prevent exceptions when "last modified by" information is missing.
Release version 2.258
Details about the release
Item | Details |
---|---|
Release version | 2.258 |
Release date | 30 January, 2025 |
Docker image ID | |
Jar file | Link to download the JAR file: https://releases.data.world/dwcc/2.258/dwcc-2.258.zip |
New features and changes
The following two new collectors are now available in public preview.
Bug fixes
Reltio collector: Resolved an issue that caused errors when cataloging containment relationships for certain attribute containers.
Power BI and Power BI Gov collectors: Stopped cataloging JDBC types for columns in Power BI, as this task is best handled by the database collector. Power BI previously attempted to infer JDBC types based on its column types, which is now corrected.
Release version 2.257
Details about the release
Item | Details |
---|---|
Release version | 2.257 |
Release date | 27 January, 2025 |
Docker image ID | Link to download the Docker image: https://hub.docker.com/r/datadotworld/dwcc/tags |
Jar file | Link to download the JAR file: https://releases.data.world/dwcc/2.257/dwcc-2.257.zip |
New features and changes
Power BI and Power BI Gov collectors:
Added the ability to include database names in the datasources mapping file, allowing datasource credentials to be restricted to specific databases.
Introduced a flag for Power BI tables to indicate when data is loaded manually.
Tableau collector: Enhanced the collector to catalog unpublished views, expanding visibility into Tableau assets.
Bug fixes
Snowflake collector: Enhanced the incremental collection process to prevent the deletion of specific database column resources, ensuring data integrity and continuity.
Monte Carlo collector: Updated to catalog Monte Carlo warehouse IDs instead of hostnames due to API changes.
Databricks collector: Fixed an issue where the --include-information-schema option produced an incorrect warning.
Denodo collector: Resolved an issue with the internal cleanup function that was unable to remove PRIMARY KEY statements from SQL.
Release version 2.256
Details about the release
Item | Details |
---|---|
Release version | 2.256 |
Release date | 17 January, 2025 |
Docker image ID | Link to download the Docker image: https://hub.docker.com/r/datadotworld/dwcc/tags |
Jar file | Link to download the JAR file: https://releases.data.world/dwcc/2.256/dwcc-2.256.zip |
Bug fixes
Power BI Service and Power BI Gov collectors: Resolved an issue with handling parameters when they are referenced using the @ symbol.
Release version 2.255
Details about the release
Item | Details |
---|---|
Release version | 2.255 |
Release date | 15 January, 2025 |
Docker image ID | Link to download the Docker image: https://hub.docker.com/r/datadotworld/dwcc/tags |
Jar file | Link to download the JAR file: https://releases.data.world/dwcc/2.255/dwcc-2.255.zip |
New features and changes
Databricks collector: Added support for harvesting SQL queries and their associated lineage. A new parameter, Page size for harvesting queries (--query-pagination-limit), is introduced for this.
Power BI Service and Power BI Gov collectors: Added support for jdbcProperties in the datasources.yaml configuration for connecting to databases when resolving lineage.
Bug fixes
Snowflake, Redshift, Databricks, Denodo, Oracle, PostgreSQL, Teradata, MySQL, Db2, Netezza, SQL Server collectors: Database authentication issues are now reported as errors.
Oracle collector:
Corrected the command description for the autonomous database parameter.
Fixed an issue with removing comments containing special symbols.
Redshift collector: The collector now correctly harvests distinct utility functions and procedures.
Databricks collector: Resolved an issue where column names were not recognized due to case sensitivity mismatches.
Release version 2.254
Details about the release
Item | Details |
---|---|
Release version | 2.254 |
Release date | 7 January, 2025 |
Docker image ID | Link to download the Docker image: https://hub.docker.com/r/datadotworld/dwcc/tags |
Jar file | Link to download the JAR file: https://releases.data.world/dwcc/2.254/dwcc-2.254.zip |
New features and changes
Oracle collector: Now supports harvesting from Oracle Autonomous Database. A new parameter, --autonomous-db, is introduced for this.
Tableau collector: Now supports harvesting of personal space workbooks. A new parameter --tableau-catalog-personal-space-workbooks is introduced for this.
Bug fixes
Power BI collector: Resolved an issue with parameter replacements in table source expressions when a parameter name is the same as the name of the table it defines.
SQL Server collector: Ensured encryption is enabled for connections when the encrypt JDBC property is configured.
Alteryx collector: Fixed an issue encountered while fetching workflow details when the user does not have permission on the workflow.
Release notes for previous versions
Go here to access release notes for previous versions.