Skip to main content

Preparing to run the Monte Carlo collector

Setting up pre-requisites for running the collector

Make sure that the machine from where you are running the collector meets the following hardware and software requirements.

Table 1.

Item

Requirement

Hardware (for on-premise runs only)

Note: The following specs are based upon running one collector process at a time. Please adjust the hardware if you are running multiple collectors at the same time.

RAM

8 GB

CPU

2 Ghz processor

Software (for on-premise runs only)

Docker

Click here to get Docker.

data.world specific objects (for both cloud and on-premise runs)

Dataset

You must have a ddw-catalogs dataset set up to hold your catalog files when you are done running the collector.

If you are using Catalog Toolkit , follow these instructions to prepare the datasets for collectors.

Network connection

Allowlist IPs and domains



Generating a Monte Carlo API key

Important

You will need to set up an API key for your user to connect to Monte Carlo. The collector will harvest resources from the domains that your user has access to. We recommend that you use service account keys.

To generate a Monte Carlo API key:

  1. Log in to your Monte Carlo instance.

  2. From the top navigation, click Settings.

  3. From the left navigation, click on API.

  4. Click Create Key.

  5. Assign a description and set an expiration.

  6. Click Create.

    Generate_Monte_Carlo_key.png