Pig Execute
Executes a user-defined Pig script (for parsing and sorting Hadoop data sources). The Pig Execute operator can also reference Pig UDFs (user-defined functions) that are supplied to the TIBCO Data Science – Team Studio server.
Information at a Glance
| Parameter | Description |
|---|---|
| Category | Tools |
| Data source type | HD |
| Send output to other operators | Yes |
| Data processing tool | Pig |
The Pig script executed inside this operator is passed the results from its preceding operators, and it is expected to pass along its results to the succeeding operator.
In version 5.7 and later, the file structure of the output is detected automatically. In earlier versions, the user must define the output structure.
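As an illustration of the kind of parse-and-sort script this operator might run, the following is a minimal sketch in plain Pig Latin. The paths, field names, and relation aliases are hypothetical placeholders. This page does not describe how the operator binds the preceding operator's results to the script or how the script hands its results to the succeeding operator, so the explicit `LOAD` and `STORE` statements below are an assumption made only to keep the sketch self-contained.

```pig
-- Hypothetical input path and schema; replace with your own data source.
raw_rows = LOAD '/tmp/example/input.csv' USING PigStorage(',')
           AS (id:int, name:chararray, score:double);

-- Parse step: keep only rows that have a score.
valid_rows = FILTER raw_rows BY score IS NOT NULL;

-- Sort step: highest score first.
ordered_rows = ORDER valid_rows BY score DESC;

-- Hypothetical output path; in the operator, the output location is
-- governed by the Results Location and Results Name parameters.
STORE ordered_rows INTO '/tmp/example/output' USING PigStorage(',');
```

In practice, use the Pig Script editor dialog to confirm which input and output names the operator expects before adapting a script like this.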
Input
The Pig Execute operator can accept one or more inputs; however, input is not required.
Configuration
| Parameter | Description |
|---|---|
| Notes | Notes or helpful information about this operator's parameter settings. When you enter content in the Notes field, a yellow asterisk appears on the operator. |
| Pig Script | The Pig script to execute. Click Edit Pig Script to open the Pig Script editor dialog. For more information, see Define Pig Script dialog. |
| Pass Output File | Specify whether to pass the output to the next operator. |
| Results Location | The HDFS directory where the results of the operator are stored. This is the parent directory; the subdirectory within it is specified in Results Name. Click Choose File to open the Hadoop File Explorer dialog and browse to the storage location. Do not edit the text directly. |
| Results Name | The name of the file in which to store the results. |
| Overwrite | Specifies whether to delete existing data at that path and file name. |