Using Pig User-Defined Functions

The script in the Pig Execute operator can reference a Pig UDF.

The list of acceptable Pig UDFs can be found here: Pig User Defined Functions

The UDF must be defined in a JAR file located in ALPINE_DATA_REPOSITORY/PigUDFJars/. The JAR file is automatically registered when running a workflow.

Within the script, use the UDF as in any Pig script.
  • For example, to use the DataFu Sessionize UDF, include the DataFu JAR in ALPINE_DATA_REPOSITORY/PigUDFJars/. In the script include:

    define Sessionize datafu.pig.sessions.Sessionize('30m');

    Then use Sessionize as desired.

Related reference