Custom Character Maps

If the predefined character maps are not satisfactory, you can create your custom character maps. A custom character map is created by defining a set of rules, each of which specifies a mapping a set of one or more characters or for an entire class of characters. Shorthand is provided for invoking Unicode Consortium rules for the folding of cases and diacritics. You can also conveniently reference the “punctuation” and “whitespace” classes as defined in Character Maps. You can also create rules that define mappings explicitly, character by character. Thus, there are four types of mapping rules that are available to define custom character maps:

1. Letter case folding (Unicode rules)
2. Diacritic folding and character normalization (Unicode rules)
3. “Whitespace” and “punctuation” class mapping (as defined in Character Maps)
4. Explicitly defined mappings

These four types of mappings are ordered from lowest to highest precedence. The explicitly defined mappings have the highest precedence. This allows you to override any predefined character mapping rules.

For example, to create a character map that folds all the letters to a common case except for “A” and maps all punctuation to blank except for “&”, add to the standard mappings an explicit mapping of “A” to “A” and “&” to “&”.

For more details about defining the custom character maps, see TIBCO® Patterns Programmer’s Guide.