Gathr Settings

1. Go to Setup from left navigation pane and click on Gathr tab.21

Provide values for following properties:

Field

Description

Gathr Web URL

Configure Gathr Web URL as below.

Zookeeper Gathr Node

Zookeeper Gathr node is where Webstudio specific properties are managed.

Zookeeper Configuration Node

Zookeeper configuration node is where all the YAML properties are managed.

Password Encryption Required

Enable Password Encryption Required, to encrypt all password fields.

Spark Home

Spark Home is the path to Spark Installation on machine where Gathr Studio is installed.

Spark Job Submit Mode

Spark Job Submit Mode is mode in which spark pipeline jobs are submitted. See Appendix-1 on deploying Livy and setting up Spark 2 client.

The options are:

• spark-submit

• livy

• job-server

Hadoop User

Hadoop User is the Gathr user through which pipeline will be uploaded to HDFS.

Note: In case of Kerberos env. make sure the hadoop user mentioned here has valid principal and keytab.