About the Athena collector

Use this collector to harvest metadata for Athena tables and columns across your enterprise systems and make it searchable and discoverable in data.world.

Tables in an Athena database often participate in AWS Glue ETL jobs. The data.world Collector can catalog lineage information from those jobs. For more information, see the article on AWS Glue and the data.world Collector.

Important

The Athena collector can be run in the cloud or on-premises, using Docker or a JAR file.

Note

The latest version of the Collector is 2.292. To view the release notes for this version and all previous versions, see the Collector release notes.

Authentication supported

The Athena collector supports username/password authentication to Athena.

What is cataloged

The collector catalogs the following information.

Table 1.

Object	Information cataloged
Columns	Name, Description, JDBC type, Column Type, Is Nullable, Default Value, Key type (Primary, Foreign), Column size, Column index
Table	Name, Description, Primary key, Schema, AWS tags
Views	Name, Description, SQL definition
Materialized View	Name, Description
Schema	Identifier, Name
Database	Type, Name, Identifier, Server, Port, Environment, JDBC URL

Relationship between objects

By default, the harvested metadata includes catalog pages for the following resource types. Each catalog page has a relationship to the other related resource types. If the metadata presentation for this data source has been customized with the help of the data.world Solutions team, you may see other resource pages and relationships.

Table 2.

Resource page	Relationship
Table	Columns
View	Schema that contains Views; Columns that are part of Views
Materialized View	Schema that contains Materialized Views; Columns that are part of Materialized Views
Columns	Table
Schema	Database that contains Schema; Table that is part of Schema; View that is part of Schema; Materialized View that is part of Schema
Database	Schema contained in Database
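
The default relationships in Table 2 can be read as a small directed graph between resource types. The sketch below is only an illustration of that table, not the collector's internal model; the names RELATIONSHIPS, related_resource_types, and reachable_from are hypothetical and chosen for this example.

```python
# Default relationships between resource pages, transcribed from Table 2.
# This mapping is an illustrative assumption, not the collector's data model.
RELATIONSHIPS = {
    "Table": ["Columns"],
    "View": ["Schema", "Columns"],
    "Materialized View": ["Schema", "Columns"],
    "Columns": ["Table"],
    "Schema": ["Database", "Table", "View", "Materialized View"],
    "Database": ["Schema"],
}


def related_resource_types(resource_type):
    """Return the resource types linked directly from a catalog page."""
    return RELATIONSHIPS.get(resource_type, [])


def reachable_from(start):
    """Return, sorted, every other resource type reachable by following links."""
    seen = []
    stack = [start]
    while stack:
        current = stack.pop()
        for nxt in related_resource_types(current):
            if nxt != start and nxt not in seen:
                seen.append(nxt)
                stack.append(nxt)
    return sorted(seen)


if __name__ == "__main__":
    print(related_resource_types("Schema"))
    print(reachable_from("Database"))
```

Read this way, a Database page leads to every other resource type through its Schemas, while a Columns page links back only to its Table.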

Lineage for Athena

Table 3.

Object	Lineage available
Table	S3 buckets/objects that the table sources its data from
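
An Athena table's data location is an S3 URI, so the lineage target in Table 3 is a bucket plus an object key or prefix. As a minimal sketch of what that target looks like, the hypothetical helper below splits such a URI into its two parts. It assumes nothing about how the collector itself derives lineage, and the bucket and prefix names are made up for the example.

```python
from urllib.parse import urlparse


def split_s3_uri(uri):
    """Split an S3 URI into (bucket, key_or_prefix).

    Hypothetical helper for illustration only; not part of the collector.
    """
    parsed = urlparse(uri)
    if parsed.scheme != "s3" or not parsed.netloc:
        raise ValueError(f"not an S3 URI: {uri!r}")
    # The bucket is the URI authority; the key or prefix is the remaining path.
    return parsed.netloc, parsed.path.lstrip("/")


if __name__ == "__main__":
    # Example values are placeholders, not real resources.
    bucket, prefix = split_s3_uri("s3://example-bucket/sales/2024/")
    print(bucket, prefix)
```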
