About the Information Schema Catalog Collector (ISCC)

Important

The ISCC can be run on-premise using Docker or JAR files.

Note

The latest version of the Collector is 2.247. To view the release notes for this version and all previous versions, please go here.

Occasionally, a database's configuration makes it difficult to connect the data.world Collector directly to the data source. In those cases, the Information Schema Catalog Collector (ISCC) can be used to access the information schema of a database as a source for cataloging the database's metadata.

The information schema is an ANSI-standard set of read-only views describing the tables, views, columns, and procedures in an RDBMS. The ISCC parses CSV files created from the information schema and uses them as the input source for the data.world Collector. You can find more information on the information schema here. Using the data.world Collector directly is the preferred method for cataloging an RDBMS; the following instructions provide a secondary access method for cases where a direct connection is not practical.
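As a rough illustration of what "CSV files created from the information schema" can look like in practice, the sketch below runs a query through any Python DB-API connection and writes the result set to a CSV file with a header row. The function name `export_query_to_csv`, the `COLUMNS_QUERY` constant, and the stand-in demo table are assumptions made for this example. The view and column names in the query are the standard `INFORMATION_SCHEMA.COLUMNS` ones, but the exact files and columns the collector expects are defined in the later parts of this guide, not here.

```python
import csv
import sqlite3  # used only for the stand-in demo; swap in your database's DB-API driver

# Illustrative query against the standard INFORMATION_SCHEMA.COLUMNS view.
# Assumption: this is NOT necessarily the query the collector requires.
COLUMNS_QUERY = """
SELECT TABLE_SCHEMA, TABLE_NAME, COLUMN_NAME, ORDINAL_POSITION, IS_NULLABLE, DATA_TYPE
FROM INFORMATION_SCHEMA.COLUMNS
ORDER BY TABLE_SCHEMA, TABLE_NAME, ORDINAL_POSITION
"""


def export_query_to_csv(conn, sql, path):
    """Run `sql` on a DB-API connection and write the rows to `path` as CSV.

    Returns the list of column names written as the header row.
    """
    cur = conn.cursor()
    try:
        cur.execute(sql)
        # cursor.description holds one entry per result column; item 0 is the name.
        header = [col[0] for col in cur.description]
        rows = cur.fetchall()
    finally:
        cur.close()
    with open(path, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        writer.writerow(header)
        writer.writerows(rows)
    return header


if __name__ == "__main__":
    import os
    import tempfile

    # SQLite has no INFORMATION_SCHEMA, so a hand-made table stands in for it here.
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE demo_columns (TABLE_SCHEMA TEXT, TABLE_NAME TEXT, COLUMN_NAME TEXT)"
    )
    conn.execute("INSERT INTO demo_columns VALUES ('dbo', 'orders', 'order_id')")
    out_path = os.path.join(tempfile.mkdtemp(), "columns.csv")
    print(export_query_to_csv(conn, "SELECT * FROM demo_columns", out_path))
```

Against a real database, the same function would be called with that database's own driver connection and a query such as `COLUMNS_QUERY`.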

Note

We have tested this collector against an MS SQL Server database, but it can be used with any database for which you can generate the four CSV files described in this guide.

There are two parts to cataloging metadata from a database's information schema:

  1. Generate or create the CSV files containing the database's metadata

  2. Run the Collector against the CSV files

In this article we cover both parts in the order they need to be done.

What is cataloged

The collector catalogs the following information.

Table 1.

| Object | Information cataloged |
|---|---|
| Columns | Name, JDBC type, Column Type, Is Nullable, Default Value, column size, column index, Description |
| Table | Name, schema, Description |
| Views | Name, Schema |
| Schema | Identifier |
| Database | Type, name, identifier, server, port, environment, JDBC URL |



Relationships between objects

By default, the harvested metadata includes catalog pages for the following resource types. Each catalog page has a relationship to the other related resource types. If the metadata presentation for this data source has been customized with the help of the data.world Solutions team, you may see other resource pages and relationships.

Table 2.

| Resource page | Relationship |
|---|---|
| Table | Columns |
| Columns | Table |
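To make the Table and Columns relationship concrete, here is a minimal sketch that groups column rows from CSV text under their parent table. It assumes a CSV with the standard information schema column names `TABLE_SCHEMA`, `TABLE_NAME`, and `COLUMN_NAME`. The helper name `columns_by_table` is hypothetical, and this is not how the collector itself builds catalog pages.

```python
import csv
import io
from collections import defaultdict


def columns_by_table(csv_text):
    """Group column names by (schema, table) from CSV text with a header row."""
    reader = csv.DictReader(io.StringIO(csv_text))
    tables = defaultdict(list)
    for row in reader:
        # Each column row points back to exactly one table; each table
        # accumulates the list of columns that belong to it.
        key = (row["TABLE_SCHEMA"], row["TABLE_NAME"])
        tables[key].append(row["COLUMN_NAME"])
    return dict(tables)


if __name__ == "__main__":
    sample = (
        "TABLE_SCHEMA,TABLE_NAME,COLUMN_NAME\n"
        "dbo,orders,order_id\n"
        "dbo,orders,total\n"
        "dbo,customers,customer_id\n"
    )
    for (schema, table), columns in columns_by_table(sample).items():
        print(f"{schema}.{table}: {', '.join(columns)}")
```

The resulting mapping mirrors the two directions in the table above: a table page links to its columns, and each column links back to its table.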