Skip to main content

Preparing to run the MongoDB collector

Important

This collector is available in Private Preview. If you would like access to this collector, please contact your Customer Success Director.

Setting up pre-requisites for running the collector

Make sure that the machine from where you are running the collector meets the following hardware and software requirements.

Table 1.

Item

Requirement

Hardware (for on-premise runs only)

Note: The following specs are based upon running one collector process at a time. Please adjust the hardware if you are running multiple collectors at the same time.

RAM

8 GB

CPU

2 Ghz processor

Software (for on-premise runs only)

Docker

Click here to get Docker.

data.world specific objects (for both cloud and on-premise runs)

Dataset

You must have a ddw-catalogs dataset set up to hold your catalog files when you are done running the collector.

If you are using Catalog Toolkit , follow these instructions to prepare the datasets for collectors.

Network connection

Allowlist IPs and domains

Follow these instructions to configure your network.



Setting up MongoDB user and authentication

To set up MongoDB user and authentication:

  1. In the MongoDB Project properties, select the Database Access pane.

  2. Click Add new database user to add the user you want to harvest the MongoDB instance.

  3. Next, choose one of the following authentication methods you prefer:

    • user-password (SCRAM) authentication

    • X.509 authentication

    • AWS Authentication

  4. If the built-in role is used, it should be Only read any database at least. In the case of Specific Privileges, select the Read permission for each database planned to be collected.