Data Preparation Workflow

TIBCO Clarity provides a straightforward data preparation workflow. You can apply the same workflow to a different sample set to further refine and compare transformation rules.

A typical data preparation workflow consists of the following tasks:

  1. Subscribe to and launch TIBCO Clarity.

    Use TIBCO Cloud Marketplace to subscribe to and launch TIBCO Clarity.

    See Subscribing to and Launching TIBCO Clarity for details.

  2. Create a dataset.

    A dataset collects the raw data to be refined from different sources.

    See Creating a Dataset for details.

  3. Create a project.

    A project contains either all of the data in a dataset or a sample of it. You can perform various operations on a project.

    See Creating a Project for details.

  4. Profile data.

    You can profile the rows and columns for completeness and uniqueness, and you can also use the charting function to visualize and analyze data.

    See Profiling Data and Charting Data for details.

  5. Define metadata and validate data.

    You can validate your data against predefined or customized data types.

    See Validating Data for details.

  6. Cleanse and transform data.

    After analyzing and validating your data, you can correct errors in your data by removing blanks and duplicates, filtering and faceting rows, clustering and transforming values, splitting multi-valued cells, merging columns, and so on.

    See Managing Project Data and Manipulating a Column for details.

  7. Export data.

    You can export the cleansed data to various formats, or directly to other TIBCO applications.

    See Exporting Project Data for details.

Note: Data profiling, validating, cleansing, and transforming are iterative processes. Based on the results of the previous iterations, you can create a new project for the same dataset and perform further data analysis. Repeat these processes until you get the results you want.
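
To make the profiling and cleansing steps more concrete, the following is a minimal sketch in plain Python. It is not the TIBCO Clarity API, and TIBCO Clarity performs these tasks through its user interface. The function names (normalize, profile_column, cleanse), the sample rows, and the exact definitions of completeness (share of non-blank values) and uniqueness (share of distinct values among the non-blank ones) are assumptions chosen for illustration only.

```python
# Illustrative sketch only: plain Python, not the TIBCO Clarity API.
# All names and metric definitions below are assumptions for this example.

def normalize(value):
    """Trim whitespace and lowercase a value; return None if it is blank."""
    if value is None:
        return None
    text = str(value).strip().lower()
    return text or None


def profile_column(rows, column):
    """Return completeness and uniqueness ratios for one column."""
    values = [normalize(row.get(column)) for row in rows]
    non_blank = [v for v in values if v is not None]
    completeness = len(non_blank) / len(values) if values else 0.0
    uniqueness = len(set(non_blank)) / len(non_blank) if non_blank else 0.0
    return {"completeness": completeness, "uniqueness": uniqueness}


def cleanse(rows, column):
    """Remove rows with a blank value, then remove duplicates (first row wins)."""
    seen = set()
    cleaned = []
    for row in rows:
        key = normalize(row.get(column))
        if key is None or key in seen:
            continue
        seen.add(key)
        cleaned.append(row)
    return cleaned


# Hypothetical sample data: one blank and one near-duplicate email.
sample = [
    {"id": 1, "email": "a@example.com"},
    {"id": 2, "email": ""},
    {"id": 3, "email": "A@example.com "},
    {"id": 4, "email": "b@example.com"},
]

before = profile_column(sample, "email")   # profile the raw sample
cleaned = cleanse(sample, "email")         # remove blanks and duplicates
after = profile_column(cleaned, "email")   # profile again to compare

print(before)
print(after)
```

Profiling again after cleansing mirrors the iterative process described in the note: the second profile shows whether another round of cleansing is needed.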