Hive

The Hive activity is used to facilitate querying and managing large datasets residing in distributed storage.

Note:
  • If you run this activity on the Redhat platform, you need to upgrade XML User Interface Language (XUL) Runner to version 1.8 or later. After the upgrading, you need to reinstall Mozilla Firefox.
  • This activity do not support to upload Hive data from local clusters.

General

The General tab has the following fields.

Field Module Property? Description
Name No The name of the activity in the process definition.
HCatalog Connection Yes Click to select an HCatalog Connection shared resource.

If no matching HCatalog Connection shared resources are found, click Create Shared Resource to create one.

IsFileBase No Select this check box if Hive scripts are from a file.
Hive Script File Yes Specifies the path of the file containing Hive scripts.
Note: This field is displayed when the IsFileBase check box is selected.
HiveEditor No Specifies Hive scripts. The keywords of the scripts are highlighted automatically.
Note: This field is displayed when the IsFileBase check box is cleared.
Define No Specifies Hive configuration variables. A variable is associated with a name and a value.
Status Directory Yes Specifies the directory where the status of the Hive job is located.
WaitForResult Yes Select this check box if you want the process to wait for the Hive operation to complete.
Note: When this check box is selected, the Hive activity does not support to query more than 2 GB result data at one time due to the limitations of TIBCO ActiveMatrix BusinessWorks.

Description

Provide a short description for the activity.

Input

The values specified in this tab takes precedence over the ones in the corresponding fields in the General tab.

Input Item Data Type Description
HiveFile string Specifies path of the HDFS file that contains commands.
Note: This item is displayed when the IsFileBase check box is selected.
HiveScript string Specifies Hive scripts directly.
Note: This item is displayed when the IsFileBase check box is cleared.
Defines string Specifies Hive configuration variable. Each variable is associated with a name and a value.
Status

Directory

string Specifies the directory where the status of the Hive job is located.

Output

The output of the activity are as follows.

Output Item Data Type Description
jobId string Returns the job ID of the Hive operation.
Note:
  • This item is displayed when the WaitForResult check box is cleared.
  • You can use the WaitForJobCompletion activity to wait for the job to complete. The exitValue in the Output tab of the WaitForJobCompletion activity shows the exit value of Hive SQL execution.
content string Returns the result of the job.
Note: This item is displayed when the WaitForResult check box is selected

Fault

The Fault tab lists the exceptions that can be thrown by this activity.

HDFSException Description
msg The error message description returned by the plug-in.
msgCode The error code returned by the plug-in.