Workspace Node: Lasso Regression - Specifications - Advanced Tab

The Advanced tab within the Specifications group contains the following options.

Option Description
Variables Click the Variables button to display a standard variable selection dialog box. Select one dependent variable and two or more independent variables. The independent variables can be categorical or continuous, or a combination of both.
Estimation Method In this group box, select the estimation method for the algorithm chosen in the Algorithm list on the Specifications - Quick tab. The available choices depend on that Algorithm selection.
Linear Regression These methods fit the partial residuals to the standardized predictors using the simple least-squares approach. The two methods differ in the cost of estimating the loss gradient at each coordinate descent step.
  • Covariance update: The cost of each coordinate descent update is proportional to the number of non-zero terms in the model. This method is more efficient than the Naive method.
  • Naive: The cost of each coordinate descent update is proportional to the number of predictor variables.
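The difference between the two Linear Regression methods can be illustrated with a minimal sketch. This is a generic textbook form of lasso coordinate descent, not Statistica's implementation. The function names, the parameter lam (the lasso penalty), and the fixed number of sweeps are assumptions made for illustration. Both functions assume predictors standardized to mean 0 and variance 1 and a centered response. The naive form takes an inner product with the full residual at each step, while the covariance form builds the same gradient from stored inner products in which only the non-zero coefficients contribute.

```python
import numpy as np

def soft_threshold(z, lam):
    """Lasso shrinkage: move z toward zero by lam, clipping at zero."""
    return np.sign(z) * max(abs(z) - lam, 0.0)

def lasso_naive(X, y, lam, n_sweeps=100):
    """Naive updates: keep the full residual and correlate it with x_j."""
    n, p = X.shape
    beta = np.zeros(p)
    resid = y.copy()  # equals y - X @ beta while beta is all zeros
    for _ in range(n_sweeps):
        for j in range(p):
            z = X[:, j] @ resid / n + beta[j]
            new_bj = soft_threshold(z, lam)
            resid -= X[:, j] * (new_bj - beta[j])  # keep residual current
            beta[j] = new_bj
    return beta

def lasso_covariance(X, y, lam, n_sweeps=100):
    """Covariance updates: build the gradient from stored inner products."""
    n, p = X.shape
    beta = np.zeros(p)
    xty = X.T @ y / n
    gram = X.T @ X / n  # computed in full here for brevity
    for _ in range(n_sweeps):
        for j in range(p):
            active = np.flatnonzero(beta)  # only non-zero terms contribute
            z = xty[j] - gram[j, active] @ beta[active] + beta[j]
            beta[j] = soft_threshold(z, lam)
    return beta

# Hypothetical data: 60 cases, 5 standardized predictors, centered response.
rng = np.random.default_rng(0)
X = rng.normal(size=(60, 5))
X = (X - X.mean(axis=0)) / X.std(axis=0)
y = X @ np.array([2.0, 0.0, -1.0, 0.0, 0.0]) + rng.normal(scale=0.1, size=60)
y = y - y.mean()

b_naive = lasso_naive(X, y, lam=0.1)
b_cov = lasso_covariance(X, y, lam=0.1)
print(np.allclose(b_naive, b_cov))
```

Both functions perform the same sequence of updates, so they return the same coefficients. They differ only in how the gradient is computed.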
Logistic Regression The following methods fit the partial residuals to the standardized predictors using the iteratively reweighted least squares approach. The two methods differ in how the Hessian is computed at each coordinate descent step.
  • Newton: This method uses the exact Hessian.
  • Modified Newton: This method uses a bounded Hessian and can be more efficient.
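The following sketch shows one common way the two Logistic Regression methods can differ. It is an illustration, not Statistica's implementation. The function name irls_quantities and the method labels are hypothetical. The sketch also assumes that the bounded Hessian replaces the exact weights p(1 - p) with their upper bound of 1/4, which is a common choice but is not confirmed by this documentation.

```python
import numpy as np

def irls_quantities(eta, y, method="newton"):
    """Return IRLS weights and working response for logistic regression.

    eta: current linear predictor; y: 0/1 responses.
    """
    p = 1.0 / (1.0 + np.exp(-eta))  # fitted probabilities
    if method == "newton":
        w = p * (1.0 - p)  # exact Hessian diagonal
    elif method == "modified":
        w = np.full_like(p, 0.25)  # assumed bound: p(1 - p) <= 1/4
    else:
        raise ValueError("method must be 'newton' or 'modified'")
    # Working response; real code would guard against w close to zero.
    z = eta + (y - p) / w
    return w, z

eta = np.array([-2.0, 0.0, 2.0])
y = np.array([0.0, 1.0, 1.0])
w_newton, z_newton = irls_quantities(eta, y, method="newton")
w_modified, z_modified = irls_quantities(eta, y, method="modified")
```

With the bound, the weights are constant and do not need to be recomputed from the fitted probabilities at every step. That is one reason such a method can be cheaper per iteration.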
Include intercept Select this check box to include the intercept in the model.
Max. vars in largest model Specify the maximum number of variables to be included in the model.
Max. iterations Specify the maximum number of iterations for the coordinate descent. This is also the maximum number of times the data is accessed.
Convergence threshold Specify the convergence threshold for coordinate descent. This value is used to determine whether the iterative estimation procedure has converged. The integer entered in this field is used as the (negative) exponent of a base 10 constant. For instance, if the default value 7 is used, the constant evaluates to 10^-7 (that is, 1E-7). To check for convergence, this constant is compared with the absolute value of the difference in the deviance function between two successive iterations.
Penalties Click this button to display the Set penalties for predictors dialog box. Here you can specify a separate penalty to be applied to each coefficient. Use a value of 0 to always include a variable in the model.
Options / C / W For more information on Options / C / W, see "Common Options" in the Statistica Electronic Manual.
OK Click the OK button to accept all the specifications made in the dialog box and to close it. The analysis results are placed in the Reporting Documents node after running or updating the project.
Cancel Click the Cancel button to close the Lasso Regression dialog box without making any changes to the current specifications.
Note: Statistica ignores all cases that have missing data for any of the variables selected in the list.
Note: All cases with a weight less than or equal to zero are treated as missing data.
Note: For a categorical variable, the specified penalty is applied to each level.
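The effect of per-predictor penalties, including the value 0 and the per-level rule for categorical variables, can be illustrated with a short sketch. It rests on several assumptions that are not confirmed here: each penalty acts as a multiplier on a global lasso penalty (lam), each level of a categorical variable occupies one column of the design matrix, and the variable names and numbers are made up for the example.

```python
import numpy as np

def expand_penalties(penalties, columns_per_variable):
    """Repeat each variable's penalty once per design-matrix column."""
    return np.repeat(penalties, columns_per_variable)

def penalized_soft_threshold(z, lam, factors):
    """Shrink each coefficient by lam times its own penalty factor."""
    return np.sign(z) * np.maximum(np.abs(z) - lam * factors, 0.0)

# Hypothetical model: "age" (continuous, 1 column, penalty 0) and
# "region" (categorical, 3 levels, penalty 1).
factors = expand_penalties([0.0, 1.0], [1, 3])

# Hypothetical unpenalized coordinate updates for the 4 columns.
z = np.array([0.05, 0.05, -0.30, 0.02])
coefficients = penalized_soft_threshold(z, lam=0.1, factors=factors)
print(coefficients)
```

In this sketch the "age" coefficient is not shrunk at all, so it remains in the model, while each "region" level is shrunk by the same amount and small values are set to zero.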