Checking Duplicates
You can check duplicates against a project or an external table uploaded from TIBCO Patterns.
Note: For the enterprise edition, you must set up a connection to the TIBCO Patterns server before using the Dedup function. See Configuring Patterns Server Settings.
Procedure
- On the project page, click Dedup to go to the Dedup page.
- Optional: To check the duplicates against an external table uploaded from TIBCO Patterns:
  - Select the Validate against external tables check box.
  - Select an external table from the list below the check box. If no table is available, click Manage table list in the list to create one. For more information, see Managing External Tables.
  - Map the external table columns to the project columns automatically or manually. Click Auto map for automatic mapping, or drag columns for manual mapping.
- Move the Matches requested slider to specify the number of duplicates to be returned.
- Move the Score threshold slider to set the accuracy of the query.
- Optional: To group several data columns to detect duplicates, create some switchable groups in the Column configuration area:
  - From the menu next to Column name, click Create a switchable group.
  - Select the check boxes before the column names to be grouped and click elsewhere to exit your selection. A switchable group is added. To create more switchable groups, repeat the operations.
-
From the
- Optional: In the Column configuration area, select the check boxes next to the columns and switchable groups (if any) that you want to check for duplicates.
- Optional: Configure dedup factors for the selected columns and switchable groups (if any). See Dedup Factors.
- Optional: Click Run to start checking the duplicates.
When the duplicate checking is completed, you are directed to the project data page. Dedup results are displayed in the newly added dedup columns, as described in Dedup Results.
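To illustrate how the Matches requested and Score threshold settings from the procedure might interact, here is a minimal conceptual sketch. It is not the TIBCO Patterns algorithm or API. The function name select_matches, the 0-to-1 score scale, and the inclusive threshold are all assumptions made for illustration only.

```python
from typing import List, Tuple

# Hypothetical helper: NOT part of any TIBCO product or API.
# Assumes each candidate duplicate is a (row label, similarity score) pair,
# with scores on a 0-to-1 scale where higher means more similar.
def select_matches(
    candidates: List[Tuple[str, float]],
    matches_requested: int,
    score_threshold: float,
) -> List[Tuple[str, float]]:
    # Drop candidates below the threshold (inclusive comparison is an assumption).
    kept = [c for c in candidates if c[1] >= score_threshold]
    # Order the remaining candidates from best to worst score.
    kept.sort(key=lambda c: c[1], reverse=True)
    # Return no more than the requested number of matches.
    return kept[:matches_requested]


candidates = [("row 7", 0.95), ("row 12", 0.80), ("row 3", 0.60), ("row 21", 0.85)]
result = select_matches(candidates, matches_requested=2, score_threshold=0.75)
print(result)  # [('row 7', 0.95), ('row 21', 0.85)]
```

Under these assumptions, raising the threshold removes weaker candidates before the count limit applies, so a high threshold can return fewer rows than the number requested.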
Copyright © Cloud Software Group, Inc. All Rights Reserved.