Interactive Drill-Down Explorer Overview
Overview
How the Drill-Down Explorer Works
Auto-Updating Graphs and Summary Statistics after Each Drill-Down
Applications of the Interactive Drill-Down Explorer
Interactive Drill-Down Explorer vs. OLAP (On-Line Analytic Processing)
Overview
A first step of many data mining projects is to explore the data interactively to gain a first impression of the types of variables in the analyses, and their possible relationships. Statistica and Statistica Data Miner offer a large selection of methods for exploratory data analysis (EDA), as well as graphical data analysis (graphical or visual data mining). The purpose of the Interactive Drill-Down Explorer is to provide a combined graphical, exploratory data analysis and tabulation tool that will allow you to quickly review the distributions of variables in the analyses and their relationships to other variables, and to identify the actual observations belonging to specific subgroups in the data.
"Are there more educated males or females in my sample?"
or as complex as:
"Is it true that only highly educated females, but those who are in low income brackets, buy product A, rarely B, and never C, and that this consistent pattern holds only for residents of the East Coast?"
How the Drill-Down Explorer Works
The drill-down metaphor within the data mining context summarizes the basic operation of the drill-down operation quite well: the program allows you to select observations from larger data sets by selecting subgroups based on specific values or ranges of values of particular variables of interest; in a sense you can expose the "deeper layers" or "strata" in the data by reviewing smaller and smaller subsets of observations selected by increasingly complex logical selection conditions (not unlike the case selection conditions available in Statistica).

The histogram (bar graph) shows that 39 individuals reported that they Always are interested in watching Football. The frequency table for another popular sport - Baseball - is also shown above.
Now suppose you want to select the 38 individuals who reported strong interest in watching Football (represented by the column labeled as Always), to further "examine" them. The Drill-Down Explorer allows you to highlight that column, drill down, and then review various Statistical and graphical summaries for other variables also recorded in the data set, but only for the selected cases. For example, after drilling down on column Always, the results may look like this:

Note how the frequency table for Baseball is automatically updated to reflect the frequencies for the selected category Football-Always. You could now drill down further by selecting only those respondents who also reported they were Always interested in Baseball, and so on.
Applications of the Interactive Drill-Down Explorer
The example described in the How the Drill-Down Explorer Works section is very simple, exposing only the basic functionality of the program. The real power of the Statistica Interactive Drill-Down Explorer lies in the various auxiliary results that can automatically be updated during the interactive drill-down/up exploration: you can select a list of variables for review and compute for the selected cases:
- Descriptive statistics and frequency tables;
- Box-and-whiskers plots summarizing the distributions of continuous variables;
- Scatterplot matrices summarizing the relationships between continuous variables;
- All of the other Statistical and graphical analyses available in Statistica by extracting the observations belonging to the current subset;
So for example, you could review the types of purchases that customers made with different demographic characteristics; study the effectiveness of certain drugs within different treatment groups, ages, etc.; or extract likely customers for a new product from a database of previous customers based on careful study of apparent (market) segments exposed by the drill-down analysis.
Interactive Drill-Down Explorer vs. OLAP (On-Line Analytic Processing)
On the surface, the operation of the simplest aspect of the Interactive Drill-Down Explorer (exploration of multidimensional tables) is very similar to the functionality offered by designated OLAP tools. OLAP tools allow users to quickly query a database to extract observations and summary information about those observations taking advantage of the optimized OLAP Server facilities offered for a specific database platform (e.g., Oracle, or MS SQL Server), and often providing significant performance advantages over tools based on traditional (non-OLAP driven) query tools. However, the main advantages of Statistica Interactive Drill-Down Explorer over OLAP are:
(a) its tight integration with Statistica's flexible categorization tools and exploratory environment (the analytic capabilities provided in the Statistica Interactive Drill-Down Explorer are much more comprehensive and general than typical OLAP tools, supporting flexible "drill up" operations, and allowing you to quickly review custom, complex summary graphs, detailed descriptive statistics, etc.), and
(b) the fact that the Statistica Interactive Drill-Down Explorer is not limited to any particular database platform and does not require a designated OLAP Server to be present (e.g., it can operate directly on Statistica data files). At the same time, by connecting to the Statistica application a (remote) database for in-place processing (see Streaming Database Connector Technology), you can efficiently perform drill-down operations on any data source, regardless of whether or not designated OLAP tools are available on the server.