Docs portal

Connect to data

With data.world there are several ways you can interact with your data.

  • Connection Manager - Our in-platform data and metadata connection interface. There is a connection manager interface for many of our data sources, and new ones are continually added.

  • Data.world catalog collectors - These collectors run on a command line, in a script, or through a YAML file and can be configured to meet your needs.

  • Virtual/live connections to your data - Allow you to run queries against your data and analyze it without needing to move it into data.world. Live connections can be created with Connection manager, or 3rd-party tools.

Currently supported data sources

We continue to add new data sources to the Connection manager. However there are still some sources that are only available for cataloging metadata with the data.world catalog collector (DWCC). Here is a list of our currently supported data connections:

Table 1. Supported data sources

data source

Connection manager

Metadata collector

Live/virtual data connector

Apache Spark

not yet

yes

not yet

Athena

yes

yes

yes

AWS Glue

not yet

yes

not yet

Azure Synapse

yes

yes

yes

BigQuery

yes

not yet

yes

Databricks

not yet

yes

not yet

DB2

not yet

yes

not yet

dbt

not yet

beta

not yet

Denodo

beta

yes

yes

Domo

not yet

yes

not yet

Dremio

not yet

yes

not yet

Generic JDBC

not yet

yes

not yet

Hive

not yet

yes

not yet

Infor Ion

beta

yes

yes

Looker

not yet

yes

not yet

Manta

not yet

yes

not yet

My SQL

yes

yes

yes

Open API

not yet

yes

not yet

Oracle

yes

yes

not yet

PostgreSQL

yes

yes

yes

PowerBI

not yet

beta

not yet

Presto

not yet

yes

not yet

Redshift

yes

yes

yes

Reltio

not yet

beta

not yet

Salesforce

not yet

yes

not yet

Snowflake

yes

yes

yes

SQL Anywhere

not yet

yes

not yet

SQL Server

yes

yes

yes

Tableau Server

not yet

yes

not yet

Vertica

not yet

yes

not yet