Frequency
Analyzes the values of selected fields in a table, helping to interpret the shape of the data column by column.
Information at a Glance
|
Parameter |
Description |
|---|---|
| Category | Explore |
| Data source type | DB, HD |
| Send output to other operators | No |
| Data processing tool | Pig |
Algorithm
Frequency Analysis provides statistics on the distinct values of the data, including the count and percentage of each value of a column.
Input
A data set from the preceding operator.
Configuration
| Parameter | Description |
|---|---|
| Notes | Notes or helpful information about this operator's parameter settings. When you enter content in the Notes field, a yellow asterisk appears on the operator. |
| Columns | Click
Select Columns to select the available columns from the input data set for analysis. For more information, see
Select Columns dialog.
Note: Typically, it does not make sense to run a Frequency Analysis on unique text (name) or ID columns, because the frequency of each ID value is, by definition, 1.
|
Output
Visual Output
Counts
Displays the shape analysis graph for the count of each distinct value of the selected data column (name) for analysis. View the Frequency results of any analyzed column by selecting the column from the name dropdown list.
Important: To learn more about the visualization available in this operator, see
Explore Visual Results.
Data
Displays the column name, value, count, and frequency percentage for each selected column (name), as shown in the following example:

Data Output
None. This is a terminal operator.
Example
The following example shows a shape analysis graph that displays the frequency count of various educational levels.