Working with the Sample-customers Dataset
Starting from the very beginning, this section uses the Sample-customers dataset to show how you can load, analyze, validate, transform, and cleanse your data using various methods.
The Sample-customers dataset contains a set of artificial customer data. By default, a project named project 1 is created from this dataset.
Suppose you are an administrator at a widget manufacturer named TWIDGCO, Inc. The company has experienced unprecedented growth over the last decade, but a recent report revealed inefficiencies and lost opportunities caused by inconsistent customer data across all brands. Management has decided to roll all brands and their respective customer data into the main Customer Master of TWIDGCO.
You now face the challenge of consolidating a massive amount of customer data from multiple data sources and in a variety of formats. With TIBCO Clarity, you can upload data from various data sources and streamline your data into the best possible shape.
This example dataset is used to show how you can meet the challenge with TIBCO Clarity:
- Creating a Dataset and a Project
Create a dataset to load data from different data sources, and create a project out of the dataset to sample the data.
- Analyzing Data
Profile data, facet data, check data dependency, and chart data.
- Validating Data
Validate data by data types.
- Transforming Data
Transform data into a uniform data format.
- Cleansing Address
Cleanse the address data.
- Deduplicating Data
Check for duplicates.