Skip to main content

About the Dremio collector

Use this collector to harvest metadata for Dremio tables and columns across the enterprise systems and make it searchable and discoverable in data.world.

Important

The Dremio collector can be run on-premise using Docker or Jar files.

Note

The latest version of the Collector is 2.200. To view the release notes for this version and all previous versions, please go here.

Dremio version supported

  • The collector supports Dremio version 4.7.2-202008180758160892-1a34c463.

Authentication supported

  • The collector supports username/password authentication to Dremio.

What is cataloged

The collector catalogs the following information.

Table 1.

Object

Information cataloged

Columns

Name, Description, JDBC type, Column Type, Is Nullable, Default Value, Key type (Primary, Foreign), Column size, Column index

Table

Name, Description, Primary key, Schema

Views

Name, description

Schema

Identifier, Name

Database

Type, Name, Identifier, Server, Port, Environment, JDBC URL



Relationships between objects

By default, the harvested metadata includes catalog pages for the following resource types. Each catalog page has a relationship to the other related resource types. If the metadata presentation for this data source has been customized with the help of the data.world Solutions team, you may see other resource pages and relationships.

Table 1.

Resource page

Relationship

Table

Columns

Columns

Table

Schema

Database that contains Schema, Table that is part of Schema

Database

Schema contained in Database



Lineage for Dremio

  • The collector obtains information about inter-table relationships from Dremio’s built-in catalog graph. It writes a lineage relationship for any files or datasets represented as tables that are found in the graph.