Data in analysis
The Data in analysis flyout shows a structured list of all data columns in a data table, that is, it gives you an overview of your data. It can be the starting point for configuring visualizations, because from the flyout, you can drag columns to drop targets in the middle of the visualizations and to the visualization axes. You can also filter to certain values (because each column in the panel is associated with a filter) and you can get recommendations about suitable visualizations and other operations from the expanded flyout.
![](../../images/data_in_analysis_i.png)
The columns are by default organized into categories to make them easy to locate. For example, loading the Excel file below results in the Data in analysis flyout to the right. Columns containing numerical data suitable to aggregate are gathered in the Numbers section, columns related to time are in the Time section, and columns useful for splitting data are in the Categories section.
![](../../images/data_panel_with_excel_file.png)
The category of a column affects what happens when you drag the column to a column selector or to a drop target in a visualization. If the suggested category for a column is not the desired category, you can change the categorization.
The
Data in analysis flyout can be expanded to get
more details about a column. It expands automatically when you select columns
in the flyout, but you can also expand the flyout yourself by clicking
Expand for details and recommendations
, at the upper-right side of
the flyout.
You can easily change what is shown in a visualization by dragging a column from the flyout and dropping it onto a column selector or on a drop target within the visualizations. Moreover, you can use the flyout to filter to certain values in which you are particularly interested.
The sections in the data in analysis flyout differ slightly depending on the data source and the data content. Some examples of differences are described below.
In-database relational data from joined tables (connections created using the installed client)
If data tables from in-db databases have been joined with relations in the Views in Connection dialog (using the installed client), they can be treated as a single, virtual data table within Spotfire. In this case, there will be an additional field showing the Source structure in the flyout.
If such a source structure is available, you can click on the different parts of the joined table to see only the columns from each source.
If no relations have been defined, each data table in the data connection will be a separate data table within Spotfire.
Cube data (connections created using the installed client)
When you are working with cube data from Microsoft SQL Server Analysis Services, Oracle Essbase or SAP BW, you will see the underlying structure of the cube in the flyout (as long as you keep the sort order as Hierarchical).
Top levels can be expanded to reveal, for example, the different levels in a hierarchy.
It is recommended to use the hierarchical view when working with cube data, to avoid mixing unrelated columns in one visualization.
See the separate document Working with Cubes for more information about different cube systems.
- The expanded data in analysis flyout
If you click on a column in the Data in analysis flyout, the flyout will expand and either show you recommendations or details about the selected column. - Changing the categorization of a column
In the Data in analysis flyout, the columns are by default organized into categories. The category of a column affects what happens when you drag the column and drop it on a column selector or drop target in a visualization. It also affects which kinds of visualizations are advised in recommendations. If the suggested category for a column is not your preferred category, you can change the categorization. - Changing the formatting of a column
The formatting of a column defines how the data in the column will be presented. For example, depending on how you format a date column, the dates therein can be shown as "01/12/2022" or "January 12th, 2022". When you define formatting at the column level (in the Data in analysis flyout), this is the default formatting that will be used at all places where the column content is shown. - Changing the default sort order of values in a column
Text-based columns (strings) can be sorted differently depending on whether you want digits in the texts to be treated as numbers or to be sorted strictly alphabetically. You might also want to define a custom sort order, if the strings in the column have a hierarchical structure that is not alphabetical. - Changing the data type of a column
The data type determines the possible formats for the values in a column and sometimes it also limits which axes a column can be placed on, and the operations that can be done with the column. If the data type is wrong, you may be unable to use your data the way you want. It is easy to change the data type in the expanded Data in analysis flyout. - Changing a column name
The column name is shown in table headers and on the axes of other visualizations. With a proper column name it is easier to understand what a visualization shows. Authors can rename columns in the Data in analysis flyout, and also add or update the column description. - Setting display values based on other columns
Sometimes, you might want to show other values than the ones that actually define an axis as the display values. Display values can easily be configured in the expanded Data in analysis flyout. - Splitting a column
It is easy to split a column containing text (a string column) into two or more columns, using the expanded Data in analysis flyout in Spotfire. You can choose to split on any separating character, like space, comma, @, etc. You can also trim away unwanted characters from the original string at the same time. - Deleting a column
You can remove a column in the expanded Data in analysis flyout of Spotfire clients, if you no longer want to include it in the analysis. - Geographic location and geocoding
To display data on a map, the data must either be geocoded or directly contain coordinates such as longitude or latitude. Geocoding in Spotfire means that some identifiers in a data table are used to match against similar identifiers in another set of data tables (a geocoding hierarchy) which contains latitude/longitude coordinates or geographic features. These coordinates or features are then used for correctly positioning the data in a map context. If your data contains simple geographic elements such as country names, states, or similar, then Spotfire will attempt to automatically geocode your data. If no automatic geocoding can be performed, you can set up the geocoding manually instead. - Unpivoting columns
Data can be organized in different ways, for example, in a short/wide or tall/skinny format, but still contain the same information. Often, it is easier to visualize data organized in a tall/skinny format, that is, when the values are collected in just a few value columns. Unpivoting is one way to transform data from a short/wide to a tall/skinny format, so the data can be presented the way you want it in the visualizations.