Spark Scala Nodes
We recommend using Sparkling Water 2.1, which runs with Spark 2.1.
Code Tab
Language
Select the coding language. Spark Scala is selected by default.
Use script file
Browse to select a script file to use.
Options Tab
Input settings
Requires input
Select this check box to require input.
Max inputs
Use the mini scroll to select the maximum number of inputs to allow.
Allow downstream connections
Select this check box to allow downstream connections.
Spark Options
Downstream Dataset
Retrieve Data
Specify whether to retrieve the data (content) in addition to the Data Frame schema. The schema alone is useful, for example, for variable selection. Consider the dataset size before retrieving the data.
Size Limit: Top Rows
Specify how many Data Frame rows to bring into the Statistica environment (0 = all rows).
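The difference between retrieving only the schema and retrieving a capped number of rows can be sketched in Spark Scala as follows. This is an illustrative sketch, not the node's internal implementation: the object name RetrieveDataSketch, the helper topRows, and the sample columns are hypothetical, and the example assumes a local Spark 2.x session.

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}

object RetrieveDataSketch {
  // Hypothetical helper mirroring the "Size Limit: Top Rows" semantics:
  // 0 means all rows, any positive value caps the number of rows.
  def topRows(df: DataFrame, limit: Int): DataFrame = {
    require(limit >= 0, "limit must be non-negative")
    if (limit == 0) df else df.limit(limit)
  }

  def main(args: Array[String]): Unit = {
    // Assumption: a local session is used here for illustration only.
    val spark = SparkSession.builder()
      .appName("retrieve-data-sketch")
      .master("local[*]")
      .getOrCreate()
    import spark.implicits._

    // Hypothetical sample data.
    val df = Seq((1, "a"), (2, "b"), (3, "c")).toDF("id", "label")

    // Schema only: column names and types are available without collecting rows.
    df.printSchema()

    // Schema plus content: collect at most two rows to the driver.
    val rows = topRows(df, 2).collect()
    println(s"Retrieved ${rows.length} rows")

    spark.stop()
  }
}
```

Collecting rows moves them to the driver, so a row cap keeps large datasets from overwhelming the receiving environment.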
Livy Server
Polling Interval (sec)
Enter the polling interval in seconds.
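As a rough illustration of what a polling interval controls, the sketch below repeatedly checks a job's state and waits the given number of seconds between checks. It does not call Livy: the names PollingSketch, JobState, and pollUntilDone are hypothetical, and the state check is passed in as a function so that any status source could be substituted.

```scala
import scala.annotation.tailrec

object PollingSketch {
  // Hypothetical job states; the names are illustrative only.
  sealed trait JobState
  case object Running extends JobState
  case object Finished extends JobState

  // Calls checkState until it reports Finished or maxPolls checks have been made.
  // Returns the number of checks performed, or None if the job never finished.
  @tailrec
  def pollUntilDone(checkState: () => JobState,
                    intervalSec: Double,
                    maxPolls: Int,
                    polls: Int = 1): Option[Int] = {
    if (checkState() == Finished) Some(polls)
    else if (polls >= maxPolls) None
    else {
      // Wait for the polling interval before checking again.
      Thread.sleep((intervalSec * 1000).toLong)
      pollUntilDone(checkState, intervalSec, maxPolls, polls + 1)
    }
  }

  def main(args: Array[String]): Unit = {
    // Stub status source that reports Finished on the third check.
    var calls = 0
    val stub: () => JobState = () => {
      calls += 1
      if (calls >= 3) Finished else Running
    }
    println(pollUntilDone(stub, intervalSec = 0.01, maxPolls = 10)) // prints Some(3)
  }
}
```

A shorter interval reports completion sooner at the cost of more status requests. A longer interval reduces load on the server but delays the result.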