Skip to main content

Catalog collector release notes

Important

Published versions of collectors are available as a docker image and a JAR file.

Release version 2.260

Details about the release

Table 1.

Item

Details

Release version

2.260

Release date

13 February, 2025

Docker image ID

Link to download the Docker image: https://hub.docker.com/r/datadotworld/dwcc/tags

  • amd64:  6f10f8b049d5a7888c1f1d1e6b7a657874942560783e2111079d454e9ff0cc6d

  • arm64:  c753be3a616f20f2e13b9138650c5c60b9564089d5713b26717af029a6d15e10

Jar file

Link to download the JAR file: https://releases.data.world/dwcc/2.260/dwcc-2.260.zip

  • Sha256: 2b38736cc0bfb89605de63172ab3f15d34634e9fda1e81950a25f8797b20bd14



New features and changes

  • Snowflake collector: Added support for parsing SQL that utilizes IDENTIFIER() function calls when passing exact table names.

Bug fixes

  • Tableau collector (Preview): Added a null check in custom SQL Table logic to prevent errors.

  • SQL Server collector: Reduced excessive log and warning messages during dependency collection to streamline output.

  • Databricks collector: Improved error handling for connection issues with the Databricks host.

Release version 2.259

Details about the release

Table 2.

Item

Details

Release version

2.259

Release date

6 February, 2025

Docker image ID

Jar file

Link to download the JAR file: https://releases.data.world/dwcc/2.259/dwcc-2.259.zip

  • Sha256: f640358d0fc600177c9ea6ec186701fb95422032bbc0a370c8ad72a6a83c4364



New features and changes

  • Databricks collector: Added support for harvesting resources with the browse privilege.

  • Tableau collector: Now supports harvesting lineage between Custom SQL tables and their upstream tables.

  • Power BI and Power BI Gov collectors: Handling calculated tables as a new type and cataloging table-level lineage to source tables and columns.

Bug fixes

  • Monte Carlo collector: Updated the collector to remove reaction type from incidents, as it has been deprecated in Monte Carlo GraphQL responses.

  • Salesforce collector: Added a null check to prevent exceptions when last modified by information is missing.

Release version 2.258

Details about the release

Table 3.

Item

Details

Release version

2.258

Release date

30 January, 2025

Docker image ID

  • Link to download the Docker image: https://hub.docker.com/r/datadotworld/dwcc/tags

    • amd64: sha256:ad528403bde60ee172d2897a309a3f81d5822ae4703a10038faf0449327387fa

    • arm64: sha256:5ea5eda08a8f9d5aa3b5a0135d483c1e2594cb3befde5235e4aaee59492994ce

Jar file

Link to download the JAR file: https://releases.data.world/dwcc/2.258/dwcc-2.258.zip

  • Sha256: e65b63cebc9de475da0363e7386c767ad1f1b27ef35c827d2874e46c99bf53e7



New features and changes

Bug fixes

  • Reltio collector: Resolved an issue that caused errors when cataloging containment relationships for certain attribute containers

  • Power BI and Power BI Gov collectors: Stopped cataloging JDBC types for columns in Power BI, as this task is best handled by the database collector. Power BI previously attempted to infer JDBC types based on its column types, which is now corrected.

Release version 2.257

Details about the release

Table 4.

Item

Details

Release version

2.257

Release date

27 January, 2025

Docker image ID

Link to download the Docker image: https://hub.docker.com/r/datadotworld/dwcc/tags

  • amd64: b5db2307f0be0c0a4c3502553eba62cbf9e06317c376b219f4abde6a718b2bf4

  • arm64: af7c072e01ea52cfc6084ee65840137c604b21d1f0b6ea779abad8f676a150b1

Jar file

Link to download the JAR file: https://releases.data.world/dwcc/2.257/dwcc-2.257.zip

  • Sha256: 41dcb2fc02ea85427a1af272f354e8f9052ffad1bcf5c93fc06ec81bf080321e



New features and changes

  • Power BI and Power BI Gov collectors:

    • Added the ability to include database names in the datasources mapping file, allowing datasource credentials to be restricted to specific databases.

    • Introduced a flag for Power BI tables to indicate when data is loaded manually.

  • Tableau collector: Enhanced the collector to catalog unpublished views, expanding visibility into Tableau assets.

Bug fixes

  • Snowflake collector: Enhanced the incremental collection process to prevent the deletion of specific database column resources, ensuring data integrity and continuity.

  • Monte Carlo collector: Updated to catalog Monte Carlo warehouse IDs instead of hostnames due to API changes.

  • Databricks collector: Fixed an issue where the --include-information-schema option produced an incorrect warning.

  • Denodo collector: Resolved an issue with the internal cleanup function that was unable to remove PRIMARY KEY statements from SQL.

Release version 2.256

Details about the release

Table 5.

Item

Details

Release version

2.256

Release date

17 January, 2025

Docker image ID

Link to download the Docker image: https://hub.docker.com/r/datadotworld/dwcc/tags

  • amd64: 0606de5390dbaca7c2e9d49182bc3997abac527c1ac8b78cc3c919f5967d409b

  • arm64: 939f1ee03131f2a444e5a1024714fb8c760b974af0170cf40d6e34d326e0c7b6

Jar file

Link to download the JAR file: https://releases.data.world/dwcc/2.256/dwcc-2.256.zip

  • Sha256: d876c31865c2dd85869d72a26672a4c00524442f5705687e896b38186395fb10



Bug fixes

  • Power BI Service and Power BI Gov collectors: Resolved an issue with handling parameters when they are referenced using @ symbol.

Release version 2.255

Details about the release

Table 6.

Item

Details

Release version

2.255

Release date

15 January, 2025

Docker image ID

Link to download the Docker image: https://hub.docker.com/r/datadotworld/dwcc/tags

  • amd64: ce574a7acfa46530255ec8f7d046b66b5bf34c1968e2707b130bdb09fe1404f5

  • arm64: c035cd41f02098abd9e64ffe8306d01baf35e0e80468d3cc15cfc56040d9db48

Jar file

Link to download the JAR file: https://releases.data.world/dwcc/2.255/dwcc-2.255.zip

  • Sha256: a7216794603a0033c91e65399e856d7fab90b207b1892e94018884afead2f444



New features and changes

  • Databricks collector: Added support for harvesting SQL queries and their associated lineage. A new parameter Page size for harvesting queries (--query-pagination-limit) is introduced for this.

  • Power BI Service and Power BI Gov collectors: Added support for jdbcProperties in the datasources.yaml configuration for connecting to databases when resolving lineage.

Bug fixes

  • Snowflake, Redshift, Databricks, Denodo, Oracle, PostgreSQL, Teradata, MySQL, Db2, Netezza, SQL Server collectors: Database authentication issues are now reported as errors.

  • Oracle collector:

    • Corrected the command description for the autonomous database parameter.

    • Fixed an issue with removing comments containing special symbols.

  • Redshift collector: The collector now correctly harvests distinct utility functions and procedures.

  • Databricks collector: Resolved an issue where column names were not recognized due to case sensitivity mismatches.

Release version 2.254

Details about the release

Table 7.

Item

Details

Release version

2.254

Release date

7 January, 2025

Docker image ID

Link to download the Docker image: https://hub.docker.com/r/datadotworld/dwcc/tags

  • amd64: sha256:f52f106cda278cd04c04377070a469f33702a33c2701a3809564f844109a1fb0

  • arm64: sha256:3bb2521a91fdc5a33845fd0056d56756b4c674e2f1b4aca21c93e6db092cc762

Jar file

Link to download the JAR file: https://releases.data.world/dwcc/2.254/dwcc-2.254.zip

  • Sha256: bf09e8abb8d1869d3e3d3640b38273285b6dcc2d64236632aa84fd3e0ec51d2b



New features and changes

  • Oracle Collector: Now supports harvesting from Oracle Autonomous Database. A new parameter --autonomous-db is introduced for this.

  • Tableau collector: Now supports harvesting of personal space workbooks. A new parameter --tableau-catalog-personal-space-workbooks is introduced for this.

Bug fixes

  • Power BI collector: Resolved an issue with parameter replacements in table source expressions when a parameter name is the same as the name of the table it defines.

  • SQL Server collector: Ensured encryption is enabled for connections when the encrypt JDBC property is configured.

  • Alteryx collector: Fixed an issue encountered while fetching workflow details when the user does not have permission on the workflow.

Release version 2.253

Details about the release

Table 8.

Item

Details

Release version

2.253

Release date

December 23, 2024

Docker image ID

Link to download the Docker image: https://hub.docker.com/r/datadotworld/dwcc/tags

  • amd64: sha256:6ed61ec8927953033ea1a71adce4e23f1b8449b69786b0d86e083a55cae9699f

  • arm64: sha256:54ce22fa711a1dfb2a6026e0f5a725efc3942265713d20f23729bb22f811c90c

Jar file

Link to download the JAR file: https://releases.data.world/dwcc/2.253/dwcc-2.253.zip

  • Sha256: 6e8101bc5fb930df698d2cea898ed1bc3816a890b5464daa30c32d355afc6aa4



New features and changes

  • Power BI Service and Power BI gov collectors:

    • Added support for the default behavior in Power BI SQL Server sources when no schema is specified in Custom SQL queries.

    • Added support for cataloging report descriptions.

Bug fixes

  • Snowflake, Redshift, Databricks, Oracle, PostgreSQL, Db2, Netezza, SQL Server collectors: Improved handling of lineage harvesting from SQL statements that contain quoted dashes, which were previously misinterpreted as comments.

  • Oracle collector: Corrected the handling of dependency harvesting to focus only on requested schemas.

Release version 2.252

Details about the release

Table 9.

Item

Details

Release version

2.252

Release date

December 20, 2024

Docker image ID

Link to download the Docker image: https://hub.docker.com/r/datadotworld/dwcc/tags

  • arm64: sha256:cc887b4676307dbf1473d7a2e09470a3ba601d062d24be6543c2e6e31bf78b1c

  • amd64: sha256:059ba225256a2135f56442349f1aec97bad37019edc4add4e1b92ad40031448c

Jar file

Link to download the JAR file: https://releases.data.world/dwcc/2.252/dwcc-2.252.zip

  • Sha256: a8716be2ec234e8699501da2efa3eebf18c20d3bb574f504d9d5f2094cc6143b



New features and changes

  • Snowflake, Oracle, and SQL Server collectors: Modified to query dependencies across the entire database at once, rather than by individual schema, to reduce the number of queries executed.

  • Snowflake, Redshift, Databricks, Denodo, Oracle, PostgreSQL, Teradata, MySQL, Db2, Netezza, SQL Server collectors: Introduced a new parameter, --exclude-schema, to exclude specific schemes from the collection process.

  • Databricks collector: Added support for harvesting notebook content and lineage.

Release version 2.251

Details about the release

Table 10.

Item

Details

Release version

2.251

Release date

December 12, 2024

Docker image ID

Link to download the Docker image: https://hub.docker.com/r/datadotworld/dwcc/tags

  • arm64: sha256:53b6ad3101a4dac888f230813161fad176edc5779ea54535991cf926704c265b

  • amd64: sha256:beffef2f05d41abc0bf60e5db8b5d80973ca144400c2c2bed621b12625b8c870

Jar file

Link to download the JAR file: https://releases.data.world/dwcc/2.251/dwcc-2.251.zip

  • Sha256: d0cdb4eb89bede779fa0c5285b2af620523b8cfe503361cad23027ed6e542ede



New features and changes

  • Snowflake collector: Enhanced support for harvesting lineage from view select statements that include QUALIFY and TRY_CAST constructs.

  • Redshift collector: The collector is now able to harvest definitions (DDL) for stored procedures and functions.

  • All Collectors: Improved the collectors logging for greater consistency in local and uploaded logfile names. Updated log file naming conventions to ensure uniqueness and preserve log files from previous runs when uploading to data.world.

  • Snowflake, Redshift, Databricks, Denodo, Oracle, PostgreSQL, Teradata, MySQL, Db2, Netezza, SQL Server collectors: Enhanced support for harvesting lineage from view select statements containing a mix of qualified and unqualified table names.

  • dbt Core and dbt Cloud collectors: The relationship between DbtSource and its abstracted tables or views is now defined as representsDataSource. This change better reflects the semantics of dbt sources, and improves visualization of lineage relationships in Eureka Explorer.

  • Tableau Collector: Released an updated collector for Tableau, featuring improved detection of lineage relationships to database objects along with stability and performance enhancements. While the new collector will co-exist with the legacy version, we encourage transitioning to the new version to take advantage of these enhancements.

Release version 2.250

Details about the release

Table 11.

Item

Details

Release version

2.250

Release date

December 7, 2024

Docker image ID

Link to download the Docker image: https://hub.docker.com/r/datadotworld/dwcc/tags

  • arm64: ce6b7554c41b9e1a5c3dbaac8670882945a81e6e8770e3d71d730cb607d64baf

  • amd64: 8196a7fcde7d60b9c4970d6662921fc6dd5135a35b4685093f80c2208167b697

Jar file

Link to download the JAR file: https://releases.data.world/dwcc/2.250/dwcc-2.250.zip

  • Sha256: d002e0aed3d57dee0a7ec8b289389eeb67af3ca44214d3f2e1fdd4f2864e6528



New features and changes

  • Snowflake collector: Added support for additional SQL syntaxes when parsing statements that include the QUALIFY keyword.

  • Alteryx collector: Added support for honoring the max job limit option when set to zero.

Bug fixes

  • Snowflake collector: Resolved an issue that caused duplicate schema processing and duplicate CatalogCuration resources during incremental collection.

Release version 2.249

Details about the release

Table 12.

Item

Details

Release version

2.249

Release date

November 27, 2024

Docker image ID

Link to download the Docker image: https://hub.docker.com/r/datadotworld/dwcc/tags

  • arm64: sha256:1291bc01e591438370cc19146d5a43798a831ebc961fa3675dd674efaa8357a7

  • amd64: sha256:9ac5378476c0901b41a55f9d1a2d68e568b2882958191f3961dfe4ee3a93c27b

Jar file

Link to download the JAR file: https://releases.data.world/dwcc/2.249/dwcc-2.249.zip

  • Sha256: 02d3927a6aea2441256dc8bd2f1066dc3770c3fa60ebaaaedb581c917d1a87e8



New features and changes

  • Snowflake collector:

    • The collector now harvests dependencies from tables and views.

    • Made performance enhancements for incremental metadata collections.

  • PostgreSQL collector: Added support for AWS IAM Authentication tokens for databases hosted on AWS.

  • Power BI Service and Power BI Gov collectors: Updated to accommodate Microsoft’s transition from dataset to semantic model in all catalog resources emitted by the collector.

Bug fixes

  • Redshift collector: Corrected handling of lineage harvesting from SQL statements using ARRAY literals.

  • Snowflake, Redshift, Databricks, Denodo, Oracle, PostgreSQL, Teradata, MySQL, Db2, Netezza, SQL Server collectors: Improved handling of lineage harvesting from SQL statements containing KEYS as a column name.

  • Snowflake collector: Fixed issues in harvesting lineage from SQL statements using QUALIFY and COPY GRANTS keywords.

  • Power BI Service and Power BI Gov collectors: Introduced various improvements to the datasource template.

Release version 2.248

Details about the release

Table 13.

Item

Details

Release version

2.248

Release date

November 19, 2024

Docker image ID

Link to download the Docker image: https://hub.docker.com/r/datadotworld/dwcc/tags

  • arm64: sha256:96d5eaa7b377b459ba0f7c017d061c12df565cfa851b5f6b8024e98506d0b1c4

  • amd64: sha256:635a46171728ebc41c2a32a29996191c089aa39baa030efc72cd66bb5fef0bc3

Jar file

Link to download the JAR file: https://releases.data.world/dwcc/2.248/dwcc-2.248.zip

  • Sha256: ad1e2adaced3304b1f1e860dcb8b9414ceea97ed75a43fca682933db9f376880



New features and changes

  • Denodo collector: Improved column fetching by processing one view or table at a time if an issue occurs when retrieving all columns at once.

  • Fivetran collector: Added support for Salesforce as a source.

  • Salesforce collector: Added support for harvesting metadata for summary (roll-up) fields.

Bug fixes

  • Snowflake, Redshift, Databricks, Denodo, Oracle, PostgreSQL, Teradata, MySQL, Db2, Netezza, SQL Server collectors: Resolved an issue by clearing lineage resolution cache between runs to avoid conflicts when multiple commands are configured.

Release version 2.247

Details about the release

Table 14.

Item

Details

Release version

2.247

Release date

November 14, 2024

Docker image ID

Link to download the Docker image: https://hub.docker.com/r/datadotworld/dwcc/tags

  • arm64: 24525b02a7d7eb7230d441affc7cd0b4fb23fe4f6135bf55b86960761949ca60

  • amd64: 621f9d5b2db19b1b03201cb3df75ae49f45797a9dac26d0aa999061d5878b191

Jar file

Link to download the JAR file: https://releases.data.world/dwcc/2.247/dwcc-2.247.zip

  • Sha256: dc0b72f8053022b9973f43ce0c01e35fb0f444e4a2a21d36d4fe1597078a1c98



New features and changes

  • SQL Server collector: Added support for Active Directory Service Principal and Entra ID authentications.

  • Amazon S3 collector: Enhanced object filtering based on configuration options to include or exclude object names before checking the maximum resources limit.

  • AWS Glue Collector: Added functionality to catalog partitioned columns.

Bug fixes

  • Power BI Service and Power BI Gov collectors: Resolved an issue with the CombineColumns transform step that was causing warnings in some cases due to misalignment in lineage to source columns.

  • dbt Core collector: Improved handling of situations where the profiles.yml file is mistakenly specified as a directory.

Release version 2.246

Details about the release

Table 15.

Item

Details

Release version

2.246

Release date

November 7, 2024

Docker image ID

Link to download the Docker image: https://hub.docker.com/r/datadotworld/dwcc/tags

  • arm64: sha256:668965157334c401364646dd200b6c30d8373cd26ca095e046e36421816d9e80

  • amd64: sha256:3ad5c152a6196925ee87450b54f02232797e5e2bbf21d2232067283186c9ad0a

Jar file

Link to download the JAR file: https://releases.data.world/dwcc/2.246/dwcc-2.246.zip

  • Sha256: 642c91d5eab913cdda14350d0a70dc3ae098ea5bfeca9e755ae393c15f251a43



New features and changes

  • Salesforce collector: The security token configuration option is no longer required, as there are scenarios where authentication to the Salesforce API does not require it.

Bug fixes

  • Power BI Service and power BI Gov collectors: Fixed a problem that was occurring while attempting to replace parameters in custom SQL when the parameter name contained a special character.

  • dbt cloud collector: Fixed an issue in which the user-specified Snowflake account override was being ignored by the collector.

Release version 2.245

Details about the release

Table 16.

Item

Details

Release version

2.245

Release date

November 3, 2024

Docker image ID

Link to download the Docker image: https://hub.docker.com/r/datadotworld/dwcc/tags

  • arm64: sha256:79ce38d27226e8a1967c642c16e368ac5e32fa28f7fee9d7720aabe0915e5406

  • amd64: sha256:d7a98dd219fb150fc373132c081d7a7a2adcd3a56a2b291e8e105c9d82ade8ba

Jar file

Link to download the JAR file: https://releases.data.world/dwcc/2.245/dwcc-2.245.zip

  • Sha256: 119aa2eeae911beb952914240bcfbb4c7f71a45d5e45f7fe63fbad87b4a89cc7



New features and changes

  • Reltio collector: The collector now supports client_credentials authentication, in addition to the existing user and password authentication.

Release notes for previous versions