Preparing to run the Reltio collector
Setting up pre-requisites for running the collector
Make sure that the machine from where you are running the collector meets the following hardware and software requirements.
Item | Requirement |
---|---|
Hardware (for on-premise runs only) Note: The following specs are based upon running one collector process at a time. Please adjust the hardware if you are running multiple collectors at the same time. | |
RAM | 8 GB |
CPU | 2 Ghz processor |
Software (for on-premise runs only) | |
Docker | Click here to get Docker. |
data.world specific objects (for both cloud and on-premise runs) | |
Dataset | You must have a ddw-catalogs dataset set up to hold your catalog files when you are done running the collector. If you are using Catalog Toolkit , follow these instructions to prepare the datasets for collectors. |
Network connection | |
Allowlist IPs and domains |
Preparing Reltio for collectors
Creating Credentials
If you are harvesting metadata from Reltio via the API (rather than a local configuration file) you must set up credentials to be used by the collector to authenticate to the API. Two methods of authentication are available.
If you have enabled multi-factor authentication in your tenant you must use client credentials to authenticate. Otherwise, you can use the username and password authentication and API keys.
For client credentials OAuth authentication:
Set up client credentials as described in the Reltio documentation.
Assign the credential the following roles: ROLE_API, ROLE_CONNECT_ESSENTIALS_ADMIN, ROLE_RDM, ROLE_WORKFLOW.
For authentication with username and password and API key:
Set up an API key for the user that will be authenticating to Reltio via the collector.
The API key should be assigned the following roles: ROLE_API, ROLE_CONNECT_ESSENTIALS_ADMIN, ROLE_RDM, ROLE_WORKFLOW.