Skip to main content

Preparing to run the Manta collector

Note

The latest version of the Collector is 2.200. To view the release notes for this version and all previous versions, please go here.

Setting up pre-requisites for running the collector

Make sure that the machine from where you are running the collector meets the following hardware and software requirements.

Table 1.

Item

Requirement

Hardware (for on-premise runs only)

Note: The following specs are based upon running one collector process at a time. Please adjust the hardware if you are running multiple collectors at the same time.

RAM

8 GB

CPU

2 Ghz processor

Software (for on-premise runs only)

Docker

Click here to get Docker.

data.world specific objects (for both cloud and on-premise runs)

Dataset

You must have a ddw-catalogs dataset set up to hold your catalog files when you are done running the collector.

If you are using Catalog Toolkit , follow these instructions to prepare the datasets for collectors.



Setting up permissions for Manta

This section talks about generating the client secret and ID and setting up account roles.

Important

You need to perform these tasks if you are using the token-based authentication.

Generating client ID and client secret

  1. Create a new client to access the Manta APIs. Login to the Manta Knowledge Base, and follow the Manta documentation for complete instructions.

  2. Note the value of the Client ID. You will use this for the --manta-client-id parameter for the collector.

  3. Next, generate the Client Secret. Note the value of the Client Secret. You will use this value for the --manta-client-secret parameter for the collector.

Setting up account roles

  1. Navigate to Clients.

  2. Ensure that the following roles are set:

    • default-roles-manta

    • Configurator_READ

    • PROCESS_MANAGER_EXECUTE

    • PROCESS_MANAGER_READ

    • PROCESS_MANAGER_WRITE

  3. Alternatively, you can set the AdminGuiComposite role.