Workspace Node: Text Mining - Specifications - Delimiters Tab
In the Text Mining node dialog box, under the Specifications heading, select the Delimiters tab to access options to "bracket" the portion of the text in each document that you want to index. This is useful when you are processing structured narratives, e.g., standard accident reports, where the narrative of what happened is always contained between the headers "Description of accident/incident" and "Number of persons injured"; in this case, you could specify the former as the Starting phrase and the latter as the Ending phrase, and only parse (process and index) the text that is found between those headers in each accident report.
See also the Introductory Overview.
Element Name | Description |
---|---|
Index words only between starting and ending phrases | Select this check box to activate the conditional processing of specific portions of the text in each document. After selecting this check box, the Starting phrase and Ending phrase options (see below) will be enabled. |
Starting phrase | This option is only available after the Index words only between starting and ending phrases check box (see above) has been selected. Specify the starting phrase, i.e., the processing of the text in each document begins after the place in the text where this phrase appears. |
Ending phrase | This option is only available after the
Index words only between starting and ending phrases check box has been selected. Specify the ending phrase, i.e., the processing of the text in each document will terminate at the text immediately preceding the phrase specified in this field.
Options / C. See Common Options. |