Algorithmia Node

The Algorithmia NGram workspace node in Statistica provides a python interface to access the Algorithmia/WebPredict/GetNGramFrequencies algorithm.

The node  connects to the Algorithmia marketplace, using the Algorithmia python client. It then calls the NGrams algorithm with your input and parameters.

The node imports the result from Algorithmia server into a Statistica spreadsheet and makes it available as an output document and a downstream document.

Navigation:

To access this node, with a Workspace open, click on the Big Datat Analytics tab, and the App Market group.

You must create an account with Algorithmia to get the information to fill out the parameters.

Algorithmia can be accessed at https://algorithmia.com/, and more detailed Help is available there.

Data Input

Files containing text data(.txt, .doc, .csv) may be used as input.

Credentials Group

API key  

Enter the API key associated with your Algorithmia account here. The text box is encrypted to protect the display text.

Data Group

Data directory

Enter the directory where the data file is located. If the data is on a local disk.

Specify the full path to the directory. Example: C;/data.

Data file

Enter the name of the file containing input text.  Example: data.txt.

Data Source

Specify where the data resides.  

Available options:

Algorithmia storage

Local disk

Input Parameters Group

Size of NGram macroscroll

Select the size of N-gram (number of words).

Max NGrams returned macroscroll

Select the cutoff size for maximum number of results returned.

Ignore capitalization checkbox

Select to ignore the case of alphabets.

Sort order (descending) checkbox

Select to sort the results from the most frequent to the least.

Output

A NGram Frequencies spreadsheet will be the output.