Preparing to run the Reltio collector
Setting up pre-requisites for running the collector
Make sure that the machine from where you are running the collector meets the following hardware and software requirements.
Item | Requirement |
---|---|
Hardware (for on-premise runs only) Note: The following specs are based upon running one collector process at a time. Please adjust the hardware if you are running multiple collectors at the same time. | |
RAM | 8 GB |
CPU | 2 Ghz processor |
Software (for on-premise runs only) | |
Docker | Click here to get Docker. |
data.world specific objects (for both cloud and on-premise runs) | |
Dataset | You must have a ddw-catalogs dataset set up to hold your catalog files when you are done running the collector. If you are using Catalog Toolkit , follow these instructions to prepare the datasets for collectors. |
Preparing Reltio for collectors
Creating a user
If you plan to connect to Reltio using the API, create a user with read access to the tenant. This Reltio account is used to obtain the Master Data management tenant API token.
Exporting data file from Reltio
The collector lets users access the Reltio APIs to automatically retrieve the data file from Reltio. If you prefer not to connect to Reltio using the APIs, you can alternatively export the data file from Reltio and supply it to the collector.
To manually export the data file from Reltio:
From the Reltio Console, set up an export job. Make sure to export the file as JSON. See the Reltio Documentation for more details.
Place the exported JSON file in a directory on the machine from where you plan to run the collector.