Skip to main content

Preparing to run the Dremio collector

Setting up pre-requisites for running the collector

Make sure that the machine from where you are running the collector meets the following hardware and software requirements.

Table 1.

Item

Requirement

Hardware

RAM

8 GB

CPU

2 Ghz processor

Software

Docker or Java Runtime Environment

Docker

Click here to get Docker.

Java Runtime Environment

OpenJDK 17 is supported and available here.

JDBC Driver

The Dremio collector requires that you download a JDBC driver appropriate for your version of Dremio, at https://download.dremio.com/jdbc-driver/

data.world specific objects

Dataset

You must have a ddw-catalogs (or other) dataset set up to hold your catalog files when you are done running the collector.

Network connection

Allowlist IPs and domains



Setting up permissions

  • Assign SELECT privileges on tables and views to the user account used to run the collector. This ensures the collector can access and harvest the necessary metadata. See the Dremio documentation for more details.