Skip to main content

Supported sources

This section outlines lists of sources that are supported for metadata collection as well as for data virtualization and extraction.

For Metadata Collection

We currently support systems for Data storage, Business intelligence, ETL/ELT, Streaming, Data provenance, Data quality, and APIs.

Data storage and query systems

The following table lists the ways the connection can be configured for each system and the features available.

Table 1.

System

Metadata collection using data.world cloud collectors

Metadata collection using data.world on-premise collectors

Lineage metadata available

Data profiling

Sensitive Data Discovery

(Beta feature)

Amazon S3

Yes

Yes

No

No

No

Amazon DynamoDB

Yes

Yes

No

No

No

Apache Spark

No

Yes

No

No

No

Athena

Yes

Yes

No

No

Yes

AWS Glue

Yes

Yes

Yes

No

No

Azure Data Lake Storage Gen2 (Including Azure Blob Storage)

Yes

Yes

No

No

No

Azure Synapse Analytics

Yes

Yes

Yes

No

No

BigQuery

Yes

Yes

Yes

No

Yes

Databricks

Yes

Yes

Yes

Yes

No

Db2

Yes

Yes

No

Yes

No

Denodo

Yes

Yes

Yes

(cross-system lineage available)

No

No

Dremio

No

Yes

Yes

No

No

Generic JDBC

Yes

Yes

No

No

No

Hive

No

Yes

No

No

No

Hive metastore

No

Yes

No

No

No

InfluxDB

Yes

Yes

No

No

No

Infor Ion

No

Yes

No

No

No

Information Schema

No

Yes

No

No

No

Microsoft SQL Server

Yes

Yes

Yes

Yes

No

Monte Carlo

Yes

Yes

No

No

No

MySQL

Yes

Yes

No

Yes

No

Netezza

Yes

Yes

Yes

No

No

Oracle

Yes

Yes

Yes

No

No

PostgreSQL

Yes

Yes

Yes

Yes

Yes

Presto

No

Yes

No

No

No

Redshift

Yes

Yes

Yes

Yes

Yes

Reltio

Yes

Yes

No

No

No

Salesforce

Yes

Yes

Yes

No

No

SAP HANA

No

Yes

No

No

No

Snowflake

Yes

Yes

Yes

Yes

Yes

SQL Anywhere

No

Yes

No

No

No

Teradata

Yes

Yes

Yes

Yes

No

Vertica

No

Yes

No

No

No

MongoDB

Available in Public preview

Yes

Yes

No

No

No



Business intelligence systems

The following table lists the ways the connection can be configured for each system and the features available.

Table 2.

System

Metadata collection using data.world cloud collectors

Metadata collection using data.world on-premise collectors

Lineage metadata available

Data profiling

Sensitive Data Discovery

(Beta feature)

Amazon QuickSight

No

Yes

Yes

No

No

Domo

No

Yes

No

No

No

Grafana

Yes

Yes

Yes

No

No

Looker

Yes

Yes

Yes

No

No

Power BI Service

Yes

Yes

Yes

(cross-system lineage available)

No

No

Power BI Gov

Yes

Yes

Yes

No

No

Power BI Report Server

Available in Public preview

Yes

Yes

Yes

No

No

Qlik Sense Cloud

Available inPublic preview

Yes

Yes

Yes

No

No

Sigma

Yes

Yes

Yes

No

No

SQL Server Reporting Services (SSRS)

Yes

Yes

No

No

No

Tableau

Available in Public preview

Yes

Yes

Yes

(cross-system lineage available)

No

No

Tableau(legacy version)

Yes

Yes

Yes

(cross-system lineage available)

No

No

ThoughtSpot

Yes

Yes

No

No

No



ETL/ELT systems

The following table lists the features available for the systems.

Table 3.

System

Metadata collection using data.world cloud collectors

Metadata collection using data.world on-premise collectors

Lineage metadata available

Data profiling

Sensitive Data Discovery

(Beta feature)

Amazon Database Migration Service (DMS)

Available in Public preview

No

Yes

Yes

N/A

No

Azure Data factory

Yes

Yes

Yes

N/A

No

dbt Core

No

Yes

(legacy collector docs)

Yes

(cross-system lineage available)

N/A

No

dbt Cloud

Yes

Yes

Yes

(cross-system lineage available)

N/A

No

Fivetran

Yes

Yes

Yes

(cross-system lineage available)

N/A

No

Informatica Cloud Data Integration (CDI)

Available in Public preview

Yes

Yes

Yes

N/A

No

SQL Server Integration Services (SSIS)

Available in Public preview

Yes

Yes

Yes

(cross-system lineage available)

N/A

No

Alteryx

Available in Public preview

Yes

Yes

No

N/A

No

Qlik Talend Data Integration

Available inPublic preview

No

Yes

Yes

N/A

No



Streaming systems

The following table lists the features available for the systems.

Table 4.

System

Metadata collection using data.world cloud collectors

Metadata collection using data.world on-premise collectors

Lineage metadata available within the source

Data profiling

Sensitive Data Discovery

(Beta feature)

Confluent Cloud

Yes

Yes

No

No

No

Confluent Platform

No

Yes

No

No

No

Amazon Managed Streaming for Kafka (MSK)

No

Yes

No

No

No



Data provenance systems

The following table lists the features available for the systems.

Table 5.

System

Metadata collection using data.world cloud collectors

Metadata collection using data.world on-premise collectors

Lineage metadata available within the source

Data profiling

Sensitive Data Discovery

(Beta feature)

Manta

Yes

Yes

Yes

No

No

Marquez

No

Yes

(Collector Wizard not available)

No

No

No



Data quality and observability systems

The following table lists the features available for the system.

Table 6.

System

Metadata collection using data.world cloud collectors

Metadata collection using data.world on-premise collectors

Lineage metadata available within the source

Data profiling

Sensitive Data Discovery

(Beta feature)

Monte Carlo

Yes

Yes

No

No

No

Snowflake

(Data Metric Functions)

Yes

Yes

Yes

Yes

Yes



Data workflow orchestration

The following table lists the features available for the system.

Table 7.

System

Metadata collection using data.world cloud collectors

Metadata collection using on-premise collectors

Lineage metadata available within the source

Data profiling

Sensitive Data Discovery

(Beta feature)

Apache Airflow

Available inPublic preview

Yes

Yes

Yes

N/A

No



API

The following table lists the features available for the system.

Table 8.

Systems

Metadata collection using data.world cloud collectors

Metadata collection using on-premise collectors

Lineage metadata available within the source

Data profiling

Sensitive Data Discovery

(Beta feature)

OpenAPI

No

Yes

No

No

No



Data virtualization and data extraction

Data storage and query systems

The following table lists the features available for the supported systems.

Table 1.

System

Data virtualization using Connection Manager

Data extraction using Connection Manager

Data profiling

Athena

Yes

Yes

Yes

Azure Synapse Analytics

Yes

Yes

Yes

BigQuery

Yes

Yes

Yes

Databricks

Yes

Yes

Yes

Denodo

Yes

Yes

Yes

Infor Ion

Yes

Yes

Yes

Microsoft SQL Server

Yes

Yes

Yes

MySQL

No

Yes

Yes

Oracle

No

Yes

Yes

PostgreSQL

Yes

Yes

Yes

Redshift

Yes

Yes

Yes

Snowflake

Yes

No

Yes