About the Information Schema Catalog Collector (ISCC) Collector
Important
The ISCC collector can be run in on-premise using Docker or Jar files.
Note
The latest version of the Collector is 2.243. To view the release notes for this version and all previous versions, please go here.
There are occasionally times when a database configuration makes it difficult to connect the the data.world Collector directly to the data source. In those cases, the Information Schema Catalog Collector (ISCC) can be used to access the information schema of a database as a source for cataloging the database's metadata.
The information schema is an ANSI-standard set of read-only views of all the tables, views, columns, and procedures in an RDBMS. The ISCC works by parsing CSV files created from the information schema and using them as an input source for the data.world Collector. You can find more information on the information schema here. Using the data.world Collector directly is the preferred method for cataloging an RDMS, but the following instructions provide a secondary access method when needed.
Note
We have tested this collector against a MS SQL Server database, but it can be used on any database for which you can generate the four CSV files described in this guide.
There are two parts to cataloging metadata from a database's information schema:
Generate or create the CSV files containing the database's metadata
Run the the Collector against the CSV files
In this article we cover both parts in the order they need to be done.
What is cataloged
The collector catalogs the following information.
Object | Information cataloged |
---|---|
Columns | Name, JDBC type, Column Type, Is Nullable, Default Value, column size, column index, Description |
Table | Name, schema, Description |
Views | Name, Schema |
Schema | Identifier |
Database | Type, name, identifier, server, port, environment, JDBC URL |
Relationships between objects
By default, the harvested metadata includes catalog pages for the following resource types. Each catalog page has a relationship to the other related resource types. If the metadata presentation for this data source has been customized with the help of the data.world Solutions team, you may see other resource pages and relationships.
Resource page | Relationship |
---|---|
Table | Columns |
Columns | Table |