About the Athena collector
Use this collector to harvest metadata for Oracle tables and columns across the enterprise systems and make it searchable and discoverable in data.world.
Often the tables in an Athena database will participate in Glue ETL jobs. The data.world Collector has the ability to catalog lineage information in Glue ETL jobs. For more information see the article on AWS Glue and the data.world Collector.
Important
The Athena collector can be run in the Cloud or on-premise using Docker or Jar file.
Note
The latest version of the Collector is 2.283. To view the release notes for this version and all previous versions, please go here.
Authentication supported
The Athena collector supports username/password authentication to Athena.
What is cataloged
The collector catalogs the following information.
Object | Information cataloged |
---|---|
Columns | Name, Description, JDBC type, Column Type, Is Nullable, Default Value, Key type (Primary, Foreign), Column size, Column index |
Table | Name, Description, Primary key, Schema, AWS tags |
Views | Name, description, SQL definition |
Materialized View | Name, description |
Schema | Identifier, Name |
Database | Type, Name, Identifier, Server, Port, Environment, JDBC URL |
Relationship between objects
By default, the harvested metadata includes catalog pages for the following resource types. Each catalog page has a relationship to the other related resource types. If the metadata presentation for this data source has been customized with the help of the data.world Solutions team, you may see other resource pages and relationships.
Resource page | Relationship |
---|---|
Table |
|
View |
|
Materialized View |
|
Columns | Table |
Schema |
|
Database | Schema contained in Database |
Lineage for Athena
Object | Lineage available |
---|---|
Table |
|