A data class represents a real-world entity, for example a credit card, an email address, or a phone number. When a user uploads a data set and requests data profiling, the Profiler uses the definitions of known data classes from the Knowledge Hub to classify the variables in that data set and tag them with their corresponding data classes.
The following table lists and describes the built-in data classes that are recognized by the Profiler.
| Name | Description | Sensitive Flag |
|---|---|---|
| address_city | Address City | FALSE |
| address_country | Address Country | FALSE |
| address_line | Address Line | FALSE |
| address_postal_code | Address Postal Code | FALSE |
| address_state | Address State | FALSE |
| airport_code | Airport Code | FALSE |
| date | Date | FALSE |
| | Email Address | TRUE |
| gender | Person Gender | FALSE |
| iban | International Bank Account Number | TRUE |
| person_first_name | Person First Name | FALSE |
| person_full_name | Person Full Name | FALSE |
| person_last_name | Person Last Name | FALSE |
| person_name_prefix | Person Name Prefix | FALSE |
| person_name_suffix | Person Name Suffix | FALSE |
| phone_number | Phone Number | TRUE |
| time | Time | FALSE |
| us_company_name | United States Company Name | FALSE |
| us_dea | United States Drug Enforcement Agency assigned Prescriber Identifier | TRUE |
| us_npi | United States National Provider Identifier | TRUE |
| us_ssn | United States Social Security Number | TRUE |
| vin | Vehicle Identification Number | TRUE |
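As an illustration of how the Sensitive Flag might be used downstream, the following minimal Python sketch copies a few rows from the table into a lookup and picks out the columns tagged with a sensitive data class. The `sensitive_columns` helper and the `tags` mapping are hypothetical and are not part of the Profiler; only the data class names and flags come from the table.

```python
# A few rows copied from the table: name -> (description, sensitive flag).
BUILT_IN_DATA_CLASSES = {
    "address_city": ("Address City", False),
    "date": ("Date", False),
    "iban": ("International Bank Account Number", True),
    "phone_number": ("Phone Number", True),
    "us_ssn": ("United States Social Security Number", True),
    "vin": ("Vehicle Identification Number", True),
}


def sensitive_columns(column_tags):
    """Return, in sorted order, the columns whose data class is flagged sensitive.

    column_tags maps a column name to a data class name (hypothetical shape).
    Unknown data class names are treated as not sensitive.
    """
    return sorted(
        column
        for column, data_class in column_tags.items()
        if BUILT_IN_DATA_CLASSES.get(data_class, ("", False))[1]
    )


# Hypothetical profiling result: column name -> data class tag.
tags = {
    "city": "address_city",
    "ssn": "us_ssn",
    "mobile": "phone_number",
    "notes": "unknown",
}
print(sensitive_columns(tags))  # ['mobile', 'ssn']
```

Treating unknown tags as not sensitive is a choice made for this sketch; a stricter policy could flag them for review instead.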
The following image shows the built-in data classes exposed through the web UI.
Tip: User-defined data classes are shown in blue.
Users can use the Data Class Editor to add definitions for new data classes.
The Add Data Class dialog opens, as shown in the following image.
Use this option when you have a valid regular expression to identify a new class of data.
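Before entering a regular expression in the editor, it can help to check it against sample values. The following is a minimal sketch in Python, assuming a hypothetical employee ID format (`EMP-` followed by six digits). The pattern, the `match_ratio` helper, and the sample values are illustrative only, and the Profiler's own regular expression dialect and matching rules may differ from Python's `re` module.

```python
import re

# Hypothetical data class: an internal employee ID such as "EMP-004217".
EMPLOYEE_ID = re.compile(r"EMP-\d{6}")


def match_ratio(pattern, values):
    """Return the fraction of non-empty values that the pattern matches in full."""
    candidates = [v.strip() for v in values if v and v.strip()]
    if not candidates:
        return 0.0
    hits = sum(1 for v in candidates if pattern.fullmatch(v))
    return hits / len(candidates)


# Sample column values; the empty string is ignored, "emp-1" does not match.
sample = ["EMP-004217", "EMP-119030", "emp-1", "", "EMP-000001"]
ratio = match_ratio(EMPLOYEE_ID, sample)
print(ratio)  # 0.75
```

Using `fullmatch` rather than `search` means that a value counts only when the whole string fits the pattern, which avoids tagging longer strings that merely contain a matching fragment.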
Use this option when you have generated a data profile and want to use the patterns and masks discovered by the profiler to define a new data class. The following steps apply to patterns and masks:
Use this option if you have an enumerated list of values that represent the new class of data.
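A quick way to sanity-check an enumerated list before defining the data class is to compare it against the values in a real column. The sketch below assumes a hypothetical set of order status values and assumes that matching ignores case and surrounding whitespace; whether the Profiler normalizes values this way is not stated here, so treat that as an assumption.

```python
# Hypothetical enumerated data class: order status codes.
ORDER_STATUS_VALUES = {"pending", "shipped", "delivered", "cancelled"}


def normalize(value):
    """Lowercase and trim a value (an assumption made for this sketch)."""
    return value.strip().lower()


def unmatched_values(values, allowed):
    """Return the distinct normalized values that are not in the allowed set."""
    return sorted({normalize(v) for v in values} - allowed)


# Sample column values; only "refunded" is missing from the list.
column = ["Shipped", "pending ", "DELIVERED", "refunded", "shipped"]
print(unmatched_values(column, ORDER_STATUS_VALUES))  # ['refunded']
```

Any values reported as unmatched are candidates to add to the list, or evidence that the column holds a different class of data.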
To verify new data classes:
Users can edit the data classes they have created.