Diskover With Wasabi

Prev Next

How do I use Diskover with Wasabi?

Wasabi has been validated for use with Diskover Data. Diskover is a web-based data management solution providing a global view of data fragmented across cloud and on-premise storage, with integrated search, analytics, workflows, and actions. To learn more about the Diskover-Wasabi solution, please refer to our joint solution brief

1. Reference Architecture

Screenshot_2023-03-12_at_8.59.14_PM.png

2. Prerequisites

3. Configuration

The indexing of Wasabi cloud storage can be configured both via a command line terminal and the Diskover web user interface. The command line is typically used for one-time and/or first-time scans, while the Diskover web user interface is the preferred method to set up indexing tasks at scheduled intervals.

Indexing via a terminal session

3.1. In a terminal session, add credentials to the default location for Wasabi S3 credentials:

cd /root/.aws
vi credentials

Example:

Screenshot_2023-03-14_at_11.09.21_AM.png

3.2. To use a Wasabi S3 endpoint, set the AWS_PROFILE and the S3_ENDPOINT_URL
environment variables before running the crawl. Type the following commands to export the variables via a terminal session.

export AWS_PROFILE=wasabi-eu
export S3_ENDPOINT_URL=https://
cd /opt/diskover
python3 diskover.py --altscanner scandir_s3 s3://

Note: For information on the Wasabi service URLs, please refer to our Wasabi Service URLs article .

Indexing via the Diskover web user interface

3.3. Log in to the Diskover web interface. To add a Wasabi S3 endpoint via the Diskover web task panel, select "gear icon > Task Panel".

Screenshot_2023-03-16_at_3.27.40_PM.png

3.4. In the Task List panel, select "New Index Task"  to start a new index. To modify an existing index, click Info in line with an existing index, then "Edit task".

Screenshot_2023-03-16_at_3.30.30_PM.png

3.5. Go down to "Environment Vars" and insert your endpoint in the dialog box for the task, for example:

AWS_PROFILE=wasabi-west,S3_ENDPOINT_URL=https://s3.us-west-1.wasabisys.com
Screenshot_2023-03-16_at_3.31.37_PM.png

Note: This config example discusses the use of Wasabi's us-west-1 storage region. To use other Wasabi storage regions, please refer to our Wasabi Service URLs article .

3.6. To further configure indexing scheduled tasks, please follow these instructions.

Note: For further Diskover installation instructions, click here  or access their complete online resources catalog .

4. Operations Summary

Diskover is a data management platform allowing you to “connect” all your storage repositories, either they are located on-prem or in the cloud. Diskover is non-proprietary; we do not store your files; we simply index their metadata. Therefore, end-users have access to read-only index of files and not the files themselves, hence assuring the safety of your source files and production network. 

Diskover offers integrated:

To learn more, please refer to Diskover user guide.