Skip to main content

Preparing to run the Reltio collector

Setting up pre-requisites for running the collector

Make sure that the machine from where you are running the collector meets the following hardware and software requirements.

Table 1.

Item

Requirement

Hardware (for on-premise runs only)

Note: The following specs are based upon running one collector process at a time. Please adjust the hardware if you are running multiple collectors at the same time.

RAM

8 GB

CPU

2 Ghz processor

Software (for on-premise runs only)

Docker

Click here to get Docker.

data.world specific objects (for both cloud and on-premise runs)

Dataset

You must have a ddw-catalogs dataset set up to hold your catalog files when you are done running the collector.

If you are using Catalog Toolkit , follow these instructions to prepare the datasets for collectors.

Network connection

Allowlist IPs and domains

Follow these instructions to configure your network.



Preparing Reltio for collectors

Creating Credentials

If you are harvesting metadata from Reltio via the API (rather than a local configuration file) you must set up credentials to be used by the collector to authenticate to the API. Two methods of authentication are available.

If you have enabled multi-factor authentication in your tenant you must use client credentials to authenticate. Otherwise, you can use the username and password authentication and API keys.

For client credentials OAuth authentication:

  • Set up client credentials as described in the Reltio documentation.

  • Assign the credential the following roles: ROLE_API, ROLE_CONNECT_ESSENTIALS_ADMIN, ROLE_RDM, ROLE_WORKFLOW.

For authentication with username and password and API key:

  1. Set up an API key for the user that will be authenticating to Reltio via the collector.

  2. The API key should be assigned the following roles: ROLE_API, ROLE_CONNECT_ESSENTIALS_ADMIN, ROLE_RDM, ROLE_WORKFLOW.