Workspace Node: Elasticsearch Text Analysis Specifications-Quick Tab

In the Elasticsearch Text Analysis node dialog box, under the Specifications heading, select the Quick tab to access the following options.

Option Description
Project Settings
Analyze Select this option to perform a new text analysis using Elasticsearch.
Deploy Select this option to perform deployment of new text documents using Elasticsearch.
Note: A PMML file is required as input for this option. This file can be uploaded from a PMML Container node upstream.
Text source
Files Select this option if the documents you wish to index are on the local file system.
  • Directory: Click this button to specify the directory where the documents reside.
  • File formats: Use this text box to specify the types of files to be indexed, as a semicolon-separated list of patterns. For example, *.xml;*.txt;*.doc indexes all the XML, TXT, and Microsoft Word documents in the directory.
Spreadsheet Select this option to index text in a Statistica spreadsheet.
  • Text variables: Click this button to select variables with unstructured or textual information that needs to be analyzed.
Existing Index Select this option to analyze text that is already indexed in Elasticsearch.
  • Index: Use this text box to specify the name of the Elasticsearch index that contains the text to be analyzed.
    Note: This field is a copy of the Index Name field on the Connection tab. You can use either field to specify the name of the index.
  • Type: Use this text box to specify the name of the Elasticsearch type in the index that contains the text to be analyzed.
  • Document Ids: Use this text box to specify the IDs of the documents in the Elasticsearch type to be analyzed.
    Note: Use * to specify all documents in the type.
  • Fields: Use this text box to specify the fields in the Elasticsearch documents to be analyzed.
For example, you might have an Elasticsearch index named twitter with the type tweet, tweet numbers as document IDs, and the tweet content as the field.
Options This button is present on the Elasticsearch Text Analysis dialog box. It has the following options:
  • Restore to factory default: Select this option to restore the settings stored in the user's profile.
  • Set current as default: Select this option to save the current node settings as the default settings in the user's profile. These defaults are used to prepopulate any Elasticsearch Text Analysis node added in the future.
  • Create Macro: Select this option to serialize the current node settings to a Statistica Visual Basic script.
Note: Passwords are not serialized, for security reasons.
OK Click this button to apply the specifications described above.
Cancel Click this button to cancel the specifications.
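
Two of the Text source settings above can be illustrated with a short Python sketch. It is not part of Statistica and does not reproduce the node's implementation. It only shows how a File formats value such as *.xml;*.txt;*.doc can be read as a list of file patterns, and how the Index, Type, and Document Ids values of an existing index combine into a document path in Elasticsearch versions that support types. The function names, the case-insensitive matching, and the document ID 1 are assumptions made for this illustration.

```python
from fnmatch import fnmatchcase


def parse_file_formats(spec):
    """Split a semicolon-separated pattern list such as '*.xml;*.txt;*.doc'."""
    return [part.strip() for part in spec.split(";") if part.strip()]


def matches_file_formats(filename, spec):
    """Return True if the file name matches any pattern in the list.

    Matching is done on lowercased names, which is an assumption of this
    sketch and not documented behavior of the node.
    """
    name = filename.lower()
    return any(fnmatchcase(name, pattern.lower())
               for pattern in parse_file_formats(spec))


def document_path(index, doc_type, doc_id):
    """Build the /index/type/id path used by Elasticsearch versions with types."""
    return "/{}/{}/{}".format(index, doc_type, doc_id)


if __name__ == "__main__":
    formats = "*.xml;*.txt;*.doc"
    for candidate in ["notes.txt", "Report.DOC", "image.png"]:
        print(candidate, matches_file_formats(candidate, formats))
    # Index 'twitter' and type 'tweet' come from the example above;
    # the document ID '1' is hypothetical.
    print(document_path("twitter", "tweet", "1"))
```

In this sketch, notes.txt and Report.DOC match the pattern list and image.png does not. The last line prints the path for a single tweet document in the twitter index.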