Package com.tibco.patterns.learn.training

Functionality to train, validate, monitor statistics and stop the training of RLink models using datasets of feature vectors. A number of feature vectors can be grouped together in a VectorSet dataset. A Training dataset and a Validation dataset can be grouped together as a VectorExperiment.

An instance of the Trainer class is created for an existing RLink model. It is then used to train that model with an experiment (recommended) or with an individual dataset (not recommended). An experiment can be any subclass of RLinkExperiment, such as VectorExperiment.

The training process uses an implementation of TrainingObserver interface to monitor progress over the training dataset, and an implementation of ConvergenceObserver to monitor progress over the validation dataset. Training is stopped based on criteria defined in the ConvergenceObserver implementation.

This package provides several implementations of ConvergenceObserver. The recommended approach is to use the COErrorRateMin class that finds the training iteration with the minimum validation error rate, then recreate and retrain the model using that exact number of iterations. The COErrorRate class stops the training when a certain error rate over the validation dataset has been reached. The COScoreChange class stops the training when the maximum model score change is below a given threshold. You may derive other classes from ConvergenceObserver or its implementations to use different or additional criteria for monitoring or stopping the training process.