Spark Scala Nodes

We recommend using Sparkling Water 2.1, which runs with Spark 2.1.

Code Tab

Language

Select the coding language here. Spark Scala is selected by default.

Use script file

Browse to select a script file to use.

Options Tab

Input settings

Requires input

Select this check box to require input.

Max inputs

Use the scroll arrows to set the maximum number of inputs allowed.

Allow downstream connections

Select this check box to allow downstream connections.

Spark Options

Downstream Dataset

Retrieve Data

Specify whether to retrieve the data (content) in addition to the Data Frame schema. The schema alone is useful, for example, for variable selection. Consider the dataset size before choosing to retrieve the data.
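To illustrate the difference between retrieving only the schema and retrieving the data, here is a minimal sketch in Spark Scala. It reads column names and types from a Data Frame without collecting any rows. The object name SchemaOnlyExample, the helper describeColumns, and the sample columns are hypothetical and used only for illustration; this is not a description of how the node itself is implemented.

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}

object SchemaOnlyExample {
  // Returns (column name, Spark type) pairs from the schema.
  // No rows are collected, so the cost does not depend on dataset size.
  def describeColumns(df: DataFrame): Seq[(String, String)] =
    df.schema.fields.toSeq.map(f => (f.name, f.dataType.simpleString))

  def main(args: Array[String]): Unit = {
    // Local session for demonstration only (assumption: Spark is on the classpath).
    val spark = SparkSession.builder()
      .master("local[*]")
      .appName("schema-only")
      .getOrCreate()
    import spark.implicits._

    // Hypothetical sample data.
    val df = Seq((1, "a", 2.5), (2, "b", 3.5)).toDF("id", "label", "score")

    describeColumns(df).foreach { case (name, tpe) => println(s"$name: $tpe") }
    spark.stop()
  }
}
```

The column list is enough to drive variable selection downstream; the row content is only needed when the downstream analysis operates on the values themselves.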

Size Limit: Top Rows

Specify how many Data Frame rows to bring into the Statistica environment (0 = all rows).
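The "0 = all rows" convention can be sketched in Spark Scala as follows. This is an illustrative assumption about the semantics, not the node's actual implementation; the names TopRows and applyLimit are hypothetical.

```scala
import org.apache.spark.sql.DataFrame

object TopRows {
  // topRows == 0 means "no limit", mirroring the option's "0 = all rows" convention.
  // Any positive value keeps at most that many rows.
  def applyLimit(df: DataFrame, topRows: Int): DataFrame = {
    require(topRows >= 0, "topRows must be non-negative")
    if (topRows == 0) df else df.limit(topRows)
  }
}
```

For example, TopRows.applyLimit(df, 1000) would return a Data Frame with at most 1000 rows, while TopRows.applyLimit(df, 0) would return df unchanged.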

Livy Server

Polling Interval (sec)

Enter the polling interval in seconds.
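The polling interval is the pause between successive status checks against the Livy server. A minimal sketch of such a loop in Scala is shown here, under stated assumptions: the terminal state names follow the statement states listed in the Livy REST API documentation, and LivyPolling, pollUntilDone, fetchState, and maxPolls are hypothetical names introduced for illustration. The status lookup is passed in as a function so the sketch does not depend on any particular HTTP client, and it does not describe how Statistica performs polling internally.

```scala
import scala.annotation.tailrec

object LivyPolling {
  // States after which no further polling is useful (assumed from the Livy REST API docs).
  val terminalStates: Set[String] = Set("available", "error", "cancelled")

  // Calls fetchState repeatedly, sleeping intervalSec seconds between calls.
  // Returns Some(state) once a terminal state is seen, or None after maxPolls attempts.
  @tailrec
  def pollUntilDone(fetchState: () => String, intervalSec: Int, maxPolls: Int): Option[String] = {
    if (maxPolls <= 0) None
    else {
      val state = fetchState()
      if (terminalStates.contains(state)) Some(state)
      else {
        Thread.sleep(intervalSec * 1000L)
        pollUntilDone(fetchState, intervalSec, maxPolls - 1)
      }
    }
  }
}
```

A shorter interval reports completion sooner at the cost of more requests to the server; a longer interval reduces load but delays the result.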