Aggregations and statistical measures
Visualizations and various tools perform calculations that use aggregation and statistical measures.
For a description of each measure, see the corresponding section.
Note: For measures where a large statistical selection is needed, the
result from a calculation may vary with the number of available values.
- Sum
- Average (Avg)
- Count
- Unique Count
- Min
- Max
- Median
- Standard Deviation (StdDev)
- Standard Error (StdErr)
- Variance
- Lower endpoint of 95% Confidence Interval
- Upper endpoint of 95% Confidence Interval
- First Quartile (Q1)
- Third Quartile (Q3)
- Lower Adjacent Value (LAV)
- Upper Adjacent Value (UAV)
- CountBig
- Unique Concatenate
- Concatenate
- First
- GeoMetricMean
- Interquartile Range (IQR)
- Last
- Lower Inner Fence (LIF)
- Lower Outer Fence (LOF)
- Mean Deviation
- Median Absolute Deviation
- Most Common
- Outlier Count (Outliers)
- 10th Percentile (P10)
- 90th Percentile (P90)
- Outlier Percentage (PctOutliers)
- Product
- Range
- Upper Inner Fence (UIF)
- Upper Outer Fence (UOF)
The column selectors of some visualizations also offer a number of aggregation measures that are in fact shortcuts to expressions:
- Cumulative Sum
- Moving Average
- Difference
- Difference %
- Difference Year Over Year
- Difference % Year Over Year
- % of Total
- Year to Date Total
- Year to Date Growth
- Change Relative to Start
- Change Relative to Fixed Point
- Compound Annual Growth Rate
See Node navigation and Shortcuts to frequent custom expressions for more general information about writing custom expressions with OVER and THEN.
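To give a feel for what a few of these shortcuts compute, here is a minimal Python sketch. It is only a generic illustration and not the expressions the product actually generates: the function names, the trailing-window behavior of the moving average, and the handling of the first row in the difference are all assumptions made for this example.

```python
from itertools import accumulate


def cumulative_sum(values):
    """Running total: each entry is the sum of all values up to and including it."""
    return list(accumulate(values))


def moving_average(values, window):
    """Trailing average over the last `window` values.

    Assumption: near the start, where fewer than `window` values exist,
    the average is taken over the values that are available.
    """
    result = []
    for i in range(len(values)):
        chunk = values[max(0, i - window + 1): i + 1]
        result.append(sum(chunk) / len(chunk))
    return result


def difference(values):
    """Change from the previous value; the first entry has no predecessor (None)."""
    return [None] + [b - a for a, b in zip(values, values[1:])]


def percent_of_total(values):
    """Each value expressed as a percentage of the sum of all values."""
    total = sum(values)
    return [100 * v / total for v in values]


def compound_annual_growth_rate(first, last, years):
    """Standard CAGR formula: (last / first) ** (1 / years) - 1."""
    return (last / first) ** (1 / years) - 1


sales = [10, 20, 30, 40]  # hypothetical yearly totals
print(cumulative_sum(sales))
print(moving_average(sales, 2))
print(difference(sales))
print(percent_of_total(sales))
print(compound_annual_growth_rate(100, 121, 2))
```

In the product itself these shortcuts are written as custom expressions with OVER and THEN, so the sketch above is only meant to show the arithmetic behind the names.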
- Sum and Product
Sum is the result of adding all values in the subset. Product is the result of multiplying all values in the subset.
- Average
The average (Avg), or mean, is calculated as the sum of all values in the distribution divided by the number of values.
- Count
Count gives the number of values in a column, not counting empty values. In the table below, Column A has a Count of 3, while Column B has a Count of 4.
- Unique Count
Unique Count gives the number of unique (distinct) values in a column. Empty values are not counted.
- Min and Max
Min is the minimum (lowest) value of the selected subset. Max is the maximum (highest) value of the selected subset. Range = Max - Min.
- Median
The median of a distribution is the value which, when the distribution is sorted, appears in the middle of the sorted list. If the number of values is even, the median is computed by taking the average of the two middle values.
- Standard Deviation
The standard deviation (StdDev), s, is an indication of how dispersed the probability distribution is about its center.
- Standard Error
The standard error (StdErr) is the standard deviation of the calculated mean.
- Variance
The sample variance, s², is an indication of how dispersed the probability distribution is about its center.
- Confidence Intervals
A confidence interval is the mean of an estimate plus and minus a variation in that estimate.
- Percentiles and Quartiles
A percentile is the value at or below which the given percentage of all values fall. For example, 90% of the data values lie at or below the 90th percentile, whereas 10% of the data values lie at or below the 10th percentile.
- Adjacent Values and Outliers
Upper and lower adjacent values can be calculated, along with upper and lower outer fences, which define outlier values.
- Concatenate and Unique Concatenate
The Concatenate measure appends all values in the subset into a string, and the Unique Concatenate measure appends all the unique values into a string. That is, when you use Unique Concatenate, each value will only be included in the string once.
- First and Last
The First and Last measures return values based on the physical order of the data rows.
- Geometric Mean
The geometric mean multiplies all n values in the subset and takes the nth root of the result.
- Mean Deviation
The mean deviation (average absolute deviation, AAD) is calculated as the mean of the absolute difference between each value, xi, and the mean value of the group:
AAD = (1/n) Σ |xi − mean(x)|
- Median Absolute Deviation
The median absolute deviation (MAD) is calculated as the median of the absolute value of each value, xi, minus the median of x:
MAD = median(|xi − median(x)|)
- Most Common
The Most Common measure returns the most common value of the selected column.
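As a rough illustration of how several of the measures in this section relate to one another, the following Python sketch computes them for a single column using only the standard library. It is not the product's implementation. The `describe` function, the use of `None` for an empty value, the ", " separator, and the choice to take First and Last from the non-empty values are assumptions made for this example. Percentiles, confidence intervals, and fences are left out because their exact calculation method is not stated here.

```python
import math
import statistics
from collections import Counter


def describe(values):
    """Compute a selection of aggregation measures for one column.

    Assumption: None stands in for an empty value and is skipped.
    """
    data = [v for v in values if v is not None]
    n = len(data)
    mean = sum(data) / n
    median = statistics.median(data)
    stdev = statistics.stdev(data)  # sample standard deviation, s (n - 1 denominator)
    return {
        "Sum": sum(data),
        "Product": math.prod(data),
        "Avg": mean,
        "Count": n,  # empty values are not counted
        "UniqueCount": len(set(data)),
        "Min": min(data),
        "Max": max(data),
        "Range": max(data) - min(data),
        "Median": median,
        "StdDev": stdev,
        "StdErr": stdev / math.sqrt(n),  # standard deviation of the mean
        "Variance": statistics.variance(data),  # sample variance, s squared
        "GeometricMean": statistics.geometric_mean(data),  # needs positive values
        "MeanDeviation": sum(abs(x - mean) for x in data) / n,
        "MedianAbsoluteDeviation": statistics.median([abs(x - median) for x in data]),
        "MostCommon": Counter(data).most_common(1)[0][0],
        "First": data[0],  # by row order
        "Last": data[-1],
        "Concatenate": ", ".join(str(x) for x in data),
        # dict.fromkeys keeps each value once, in order of first appearance
        "UniqueConcatenate": ", ".join(str(x) for x in dict.fromkeys(data)),
    }


column = [2, 4, 4, None, 8]  # hypothetical column with one empty value
for name, value in describe(column).items():
    print(f"{name}: {value}")
```

Note that the empty value does not contribute to Count, so the column above has a Count of 4 and a Unique Count of 3.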