How to: |
When creating a data flow in the Data tab, you can easily run predictive analytics on your data sets using Machine Learning functions, without prior knowledge of advanced statistics.
Train, run, evaluate, and compare predictive models, then choose the one you want to save. Later you can run your saved model against new data sets.
After you create a Data Flow, you can select from different model algorithms to run against your data set.
Your data source opens in the Data tab.
A file directory dialog box opens.
A configure dialog box displays. You can select a target, predictors, or datetime parameters, depending on your chosen model type. For more information on model types, see Running Predictive Analytics on Your Data.
Your model, in this case a Regression model, is added to the workflow.
Your trained model is visualized in the Compare dialog box, as shown in the following image.
Review the performance of the trained model. If there is more than one visualization, you can compare them and select the best one to save. You can save your model by clicking the Save icon in the dialog box.
Your model data displays in different tabs in the sample data panel, as shown in the following image.
You can select different algorithm display options, according to your model type, from the model drop-down menu. The best algorithm is selected by default.
The Save dialog displays. You can review and change attributes, including the application folder where to save your trained model.
Note: The saved model can be run later against new data, similar to one it has been trained on. To run your saved model in a Data Flow, click Run Models from the left panel, and drag and drop your saved model onto the canvas. For more information, see How to Run Datasets Against Saved Models.