Workflows

Model your analytics process with a directed graph of operators called a workflow. Workflows combine data sources and operators (algorithms and transformations).



Each operator takes input from the preceding operator(s) or data source(s), performs its task, and produces output for the succeeding operators. Operators have parameters that can be configured by the user. These parameters can affect everything from the columns to analyze to the Spark settings to fine-tune your job's performance. The application performs validation checks when it creates connections between operators and configures their parameters.

At runtime, information produced from an operator is passed to its succeeding operator(s). This forms an information flow through the operators that can be inspected and analyzed at any intermediate point.



When you run a workflow, Team Studio checks the inputs and any needed dependencies on each operator as execution reaches that point in the graph. If required parameters are missing, the workflow does not start and errors are reported. Additionally, the title of the offending operator is displayed in red.

When an operator finishes its task, Team Studio gets any output and displays it as a section in the workflow result screen.

Related reference