Skip to main content

Supported sources

This section outlines lists of sources that are supported for metadata collection as well as for data virtualization and extraction.

For Metadata Collection

We currently support systems for: Data storage, Business intelligence, ETL/ELT, Streaming, Data provenance, Data quality, and APIs.

Data storage and query systems

The following table lists the ways the connection can be configured for each system and the features available.

Table 1.

System

Metadata collection using data.world cloud collectors

Metadata collection using data.world on-premise collectors

Lineage 

Data profiling 

Sensitive Data Discovery

(Beta feature)

Amazon S3

Yes

Yes 

No

No

No

Amazon DynamoDB

Available in Private preview

No

Yes

(Collector Wizard not available)

No

No

No

Apache Spark

No

Yes 

No

No

No

Athena

Yes

Yes 

No

No

Yes

AWS Glue

Yes

Yes 

Yes 

No

No

Azure Data Lake Storage Gen2

Yes

Yes 

No

No

No

Azure Synapse Analytics

Yes

Yes 

Yes 

No

No

BigQuery

Yes

Yes 

Yes 

No

Yes

Databricks

Yes

Yes 

Yes 

Yes

No

Db2

Yes

Yes 

No

Yes

No

Denodo

Yes

Yes

Yes

No

No

Dremio

No

Yes 

Yes 

No

No

Generic JDBC

Yes

Yes 

No

No

No

Hive

No

Yes 

No

No

No

Hive metastore

No

Yes 

No

No

No

InfluxDB

Yes

Yes 

No

No

No

Infor Ion

No

Yes 

No

No

No

Information Schema

No

Yes 

No

No

No

Microsoft SQL Server

Yes

Yes 

Yes 

Yes

No

Monte Carlo

Yes

Yes 

No

No

No

MySQL

Yes

Yes 

No

Yes

No

Netezza

No

Yes 

(Collector Wizard not available)

No

No

No

Oracle

Yes

Yes 

No

No

No

PostgreSQL

Yes

Yes 

Yes 

Yes

Yes

Presto

No

Yes 

No

No

No

Redshift

Yes

Yes 

Yes 

Yes

Yes

Reltio

Yes

Yes 

No

No

No

Salesforce

Yes

Yes 

No

No

No

SAP HANA

No

Yes 

No

No

No

Snowflake

Yes

Yes 

Yes 

Yes

Yes

SQL Anywhere

No

Yes 

No

No

No

Teradata

No

Yes

(Collector Wizard not available)

Yes

Yes

No

Vertica

No

Yes 

No

No

No



Business intelligence systems

The following table lists the ways the connection can be configured for each system and the features available.

Table 2.

System

Metadata collection using data.world cloud collectors

Metadata collection using data.world on-premise collectors

Lineage 

Data profiling 

Sensitive Data Discovery

(Beta feature)

Domo

No

Yes 

No

No

No

Grafana

Yes

Yes 

Yes 

No

No

Looker

Yes

Yes 

Yes

No

No

Power BI Service

Yes

Yes 

Yes 

No

No

Power BI Gov

Yes

Yes

Yes

No

No

Sigma

No

Yes 

(Collector Wizard not available) 

No

No

No

SQL Server Reporting Services (SSRS)

Yes

Yes 

No

No

No

Tableau Cloud

Yes

Yes 

Yes 

No

No

ThoughtSpot

Yes

Yes 

No

No

No



ETL/ELT systems

The following table lists the features available for the systems.

Table 3.

System

Metadata collection using data.world cloud collectors

Metadata collection using data.world on-premise collectors

Lineage 

Data profiling 

Sensitive Data Discovery

(Beta feature)

Azure Data factory

Available in Private preview

No

Yes

(Collector Wizard not available)

Yes

N/A

No

dbt Core

No

Yes 

(legacy collector docs) 

Yes 

N/A

No

dbt Cloud

Yes

Yes 

Yes 

N/A

No

Fivetran

Yes

Yes 

Yes 

N/A

No



Streaming systems

The following table lists the features available for the systems.

Table 4.

System

Metadata collection using data.world cloud collectors

Metadata collection using data.world on-premise collectors

Lineage

Data profiling 

Sensitive Data Discovery

(Beta feature)

Kafka - Confluent Cloud

Yes

 Yes

No

No

No

Kafka - Confluent Platform

No

 Yes

No

No

No



Data provenance systems

The following table lists the features available for the systems.

Table 5.

System

Metadata collection using data.world cloud collectors

Metadata collection using data.world on-premise collectors

Lineage

Data profiling 

Sensitive Data Discovery

(Beta feature)

Manta

Yes

 Yes

Yes 

No

No

Marquez

No

Yes

No

No

No



Data quality and observability systems

The following table lists the features available for the system.

Table 6.

System

Metadata collection using data.world cloud collectors

Metadata collection using data.world on-premise collectors

Lineage 

Data profiling 

Sensitive Data Discovery

(Beta feature)

Monte Carlo

Yes

 Yes

No

No

No



API

The following table lists the features available for the system.

Table 7.

Systems

Metadata collection using data.world cloud collectors

Metadata collection using on-premise collectors

Lineage 

Data profiling 

Sensitive Data Discovery

(Beta feature)

OpenAPI

No

Yes 

No

No

No



Data virtualization and data extraction

Data storage and query systems

The following table lists the features available for the system.

Table 8.

System

Data virtualization using Connection Manager 

Data extraction using Connection Manager

Data profiling 

 Athena

 Yes

Yes

Yes

Azure Synapse Analytics

Yes

Yes

Yes

BigQuery

Yes

Yes

Yes

Databricks

Yes

Yes

Yes

Denodo

Yes

Yes

Yes

 Infor Ion

Yes

Yes

Yes

Microsoft SQL Server

Yes

Yes

Yes

MySQL

Yes

No

Yes

 Oracle

Yes

No

Yes

 PostgreSQL

Yes

Yes

Yes

Redshift

Yes

Yes

Yes

Snowflake

Yes

No

Yes